{ "cells": [ { "cell_type": "markdown", "source": [ "# Neural networks and gradient calculations with PyTorch" ], "metadata": {} }, { "cell_type": "markdown", "source": [ "# Introduction\n", "\n", "The latest Reinforcement Learning assignment involves some basic neural networks and gradient computations. PyTorch, an open-source machine learning framework, provides tools to construct, train, and deploy neural networks and to compute derivatives automatically. This tutorial gives you the background on neural networks and gradient calculations that you need to start the assignment. It has two parts. The first part covers the basic linear regression model and neural networks; you can skip it if you already know this material. The second part explains how to stop gradients. This operation is used when implementing the DQN algorithm, so please make sure you understand it before Exercise 4.\n", "\n", "If you want to dive deeper into PyTorch and deep learning, these materials are excellent starting points.\n", "\n", "1. PyTorch Tutorials: https://pytorch.org/tutorials/\n", "\n", "2. Dive into Deep Learning: https://d2l.ai/\n", "\n", "3. Deep Learning courses: CS-E4890 (Aalto); CS231n (Stanford), http://cs231n.stanford.edu/" ], "metadata": {} }, { "cell_type": "code", "execution_count": 1, "source": [ "import random\n", "\n", "import torch\n", "import torch.nn as nn\n", "import torch.nn.functional as F\n", "import numpy as np\n", "import matplotlib.pyplot as plt" ], "outputs": [], "metadata": {} }, { "cell_type": "markdown", "source": [ "# Part I\n", "## 1. Let's start with linear regression\n", "Suppose we have a set of data points generated from a linear model $y_i = wx_i + b$ with additive noise. Our task is to estimate the model's weight $w$ and bias $b$ from these data."
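, "\n", "\n", "As a minimal sketch, the model can be written with the noise term made explicit. The symbol $\\epsilon_i$ and the Gaussian form are our own notation, an assumption chosen to match the synthetic data generated in this tutorial, where the noise has mean 0 and standard deviation 0.5:\n", "\n", "$$y_i = w x_i + b + \\epsilon_i, \\qquad \\epsilon_i \\sim \\mathcal{N}(0,\\, 0.5^2)$$"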
], "metadata": {} }, { "cell_type": "code", "execution_count": 2, "source": [ "# generate synthetic data\n", "def synthetic_data(w, b, num_examples):\n", "    \"\"\"Generate y = Xw + b + noise.\"\"\"\n", "    X = torch.normal(0, 1, (num_examples, len(w)))\n", "    y = torch.matmul(X, w) + b\n", "    y += torch.normal(0, 0.5, y.shape)  # additive noise\n", "    return X, y.reshape((-1, 1))\n", "\n", "true_w = torch.tensor([-3.4])\n", "true_b = torch.tensor([4.2])\n", "\n", "# generate data\n", "features, labels = synthetic_data(true_w, true_b, 1000)  # generate 1000 data points" ], "outputs": [], "metadata": {} }, { "cell_type": "code", "execution_count": 3, "source": [ "# plot data\n", "plt.scatter(features, labels)" ], "outputs": [ { "output_type": "execute_result", "data": { "text/plain": [ "" ] }, "metadata": {}, "execution_count": 3 }, { "output_type": "display_data", "data": { "image/png": "", "text/plain": [ "