Kurssi: CS-E4890 - Deep Learning D, Lecture, 27.2.2024-30.5.2024 | MyCourses

Etusivu Koulut Kurssipalaute Palvelulinkit Intelliboard

Kurssiasetusten perusteella kurssi on päättynyt 30.05.2024 Etsi kursseja: CS-E4890

Osion kuvaus

Select topic Yleinen

Yleinen
The information in SISU may be outdated. The up-to-date information is on these web pages. Course slack: deeplearn24-aalto.slack.com

Contact information

If you have questions regarding the course, please send an email to cs-e4890@aalto.fi .

Course description

The course introduces the fundamental and current topics of deep learning. After the course, the student understands the general principles of training deep neural networks (backpropagation, stochastic gradient descent, regularization) and knows the most common neural network architectures (convolutional and recurrent neural networks, graph neural networks and transformers); the student also has practical experience in implementing these models from scratch in PyTorch. In every weekly assignment, the students get to train a deep neural network for various tasks including image classification, machine translation, solving reasoning problems, few-shot learning and generative modeling. The course covers the most recent advances (such as unsupervised and self-supervised deep leaning) to give the student a good starting position to do research in this field.

Assessment
Returned assignments (no exam).

Prerequisites

NB: good knowledge of Python and numpy

linear algebra: vectors, matrices, eigenvalues and eigenvectors

basics of probability and statistics: sum rule, product rule, Bayes' rule, expectation, mean, variance, maximum likelihood, Kullback-Leibler divergence

basics of machine learning (recommended): supervised and unsupervised learning, overfitting

Course contents

Introduction to deep learning

Optimization methods

Regularization methods

Convolutional neural networks

Recurrent neural networks

Attention-based models

Graph neural networks

Deep learning with few labeled examples

Deep autoencoders

Flow-based and autoregressive generative models

Generative adversarial networks

Unsupervised learning via denoising
Large language models
- Valitse aktiviteetti Announcements
  
  Announcements Keskustelualue
- Valitse aktiviteetti Announcements
  
  Announcements Keskustelualue