Algorithmic methods of data mining, fall 2018
The course covers a range of topics in data mining, including
distance functions, analysis of high-dimensional data, similarity
search, data-stream computation, data clustering, graph mining,
and spectral methods.
Some of course topics will be covered from the textbook:
Leskovec, Rajaraman, and Ullman: Mining of massive datasets,
available by Cambridge University Press and online:
Additional reading material will be posted in the course webpage.
The syllabus will cover:
- Introduction to data mining.
- Distance functions and embeddings.
- High dimensional data and dimensionality reduction.
- Similarity search and locality-sensitive hashing
- Data-stream computation.
- Approximation algorithms for clustering problems, such as, k-means and k-median.
- Graph mining.
- Graph partitioning and spectral graph analysis.
- Link analysis and methods for ordering data.
Course meetings are Mon, Tue, 4-6pm, at T1.
Exercise sessions are scheduled for Thu, 2-4pm, at T2. Not every week there will be exercise sessions. We will announce in advance when there will be.
Course instructor: Aristides Gionis, aristides.gionis@aalto.fi
Teaching assistants: Suhas Muniyappa (suhas.muniyappa@aalto.fi), Han Xiao (han.xiao@aalto.fi), and Nikita Alexandrov (nikita.alexandrov@aalto.fi)
Office hours, by appointment (email)