Topic: Student Project | CS-EJ3211 - Machine Learning with Python D, 29.03.2021-04.06.2021

Topic outline

Student Project
There are two groups in Student Project: (1) with a project submission deadline at 17.5; peer-grading deadline 31.5 and (2) with a project submission deadline at 31.5; peer-grading deadline 14.6. First group is meant for students learning with a faster pace and wishing to complete course and get grade before summer. Second option is for Python beginners who might need more time to complete project.
In the student project, your task is to utilize machine learning methods to solve a problem of your choice.
In order to participate in the project, you must submit a project report on MyCourses. The report is submitted as a Python notebook (.ipynb format) on MyCourses page, and should follow the required outline presented below. You can fetch template for the project "R7_StudentProject" with instructions and tips in Jupyter Hub (no need to submit project in JHub). Also, there is an example of how student project could look like.
The submitted report should contain the Python code used in the project (early prototyping and "scrapbooking" can be excluded). If your code includes large class or function implementations, these can be written in separate .py files. The notebook should be arranged so that the reader can replicate your workflow by running the cells in the notebook in order (See example). If you need to include data file, please move notebook and data file in one folder and upload it here as one zip file. Note, that there is a uploading file size limitation 400MB.
In addition to submitting the project report, you will be required to grade 3 reports by other students after the deadline for project submission (see criteria below). Final student project grade is an average of points given by peer-reviewers.
Below is the rough outline that is required for the project report. Note that the contents listed under the sections are not a comprehensive list of requirements, but rather a brief description of the purpose of each section.
Required outline of the project report:
Introduction: Explain the application domain which might be a particular research question, a study assignment, a work-related aspect, or just some every-day life aspect (e.g. predict waiting time at the bus stop).
Problem Formulation: Formulate the application as an ML problem by explaining what data points are, what features and labels characterize data points, and what metric is used to assess the performance of the models.
Method: Explain how you applied ML methods to solve the problem. How did you obtain data (from wikidata.org or your own data files?). How did you split data into training and validation? How did you learn the predictor (which Python library?).
Results: Discuss the results obtained from the methods. What is the training/validation error? How do the results depend on the hyper-parameters of the methods?
Conclusion: Summarize the main findings during the project work and outline avenues for future work. Are the results suggesting that the problem is solved satisfactorily or might there be room for improvement?
- Select activity Student project - Group I - submission deadline 17.05.2020
  
  Student project - Group I - submission deadline 17.05.2020 Workshop
  
  Students must
  
  Receive a grade
- Select activity Student Project - Group II - submission deadline 31.05.2020
  
  Student Project - Group II - submission deadline 31.05.2020 Workshop
  
  Students must
  
  Receive a grade
- Select activity Tentative grading criteria
  
  Tentative grading criteria File PDF
  
  Grading criteria for peer-review.
- Select activity Student Project Example
  
  Student Project Example File PDF
- Select activity List of project ideas: Detecting the sleep stage o...
  List of project ideas:
  Detecting the sleep stage of rodents based on EEG signals
  Choose an ML problem from a kaggle data analysis competition.
  Choose an ML problem based on a dataset or task on OpenML.
  Predict the next song that a user would love to listen to.
  Just from the current scene, predict if a goal will be scored.
  Given the location and snapshot of soil, decide if one should grow potatoes or tomatoes.
  Predict available power from a solar cell based on weather forecasts.
  Analyzing the effect of school closures on the dynamics of COVID-19 infections.
  Develop a tool that determines if a hand-drawing depicts an apple or not.
  Fault detection with hydraulic pumps, (dataset)
  Self-defined project based on your research/study/work/everyday-life topics of interest.

CS-EJ3211 - Machine Learning with Python D, 29.03.2021-04.06.2021

Topic outline

Student Project

Students

Teachers

About service