Översikt

  • Course Description

    This is an introductory course on relational databases designed for Data Science BSc students; so no prior knowledge of databases is assumed. The course covers the fundamentals of relational algebra, the design of the relational schema including the Unified Modeling Language (UML), functional dependency and normal forms, the concept of transactions, creating SQL tables (including indexes), and using SQL to query the database. 

    Following the course, the students will have the know-how to design and implement relational databases that meet the normalization rules. Moreover, the student should be able to use SQL to write and run various types of queries so as to extract the desired data from the database, an essential part when analyzing data. In particular, the course will draw on relevant examples to prepare students to apply the principles of relational databases to projects in data science.

    Instructors

     Prof. Nitin Sawhney (email: nitin.sawhney@domain)

    Teaching Assistants

    • Amina Chahla (email: amina.chahla@domain) - Lead TA
    • Duy To (email: duy.to@domain)
    • Fathima Afrooz (email: fathima.abdulmahir@domain)
    • Krzysztof Modrzyński (email: krzysztof.modrzynski@domain)
    • Minh Ha Le (email: minhha.le@domain)
    • Rasmus Innanen (email: rasmus.innanen@domain)



    • Studenten måste skicka in enkäten för att aktiviteten ska vara fullföljd

      The course Databases for Data Science, under the code CS-A1155, is offered by the Department of Computer Science within Aalto University and as such upholds and operates under the ethics and values enforced by the university. Thus, a code of conduct that contains three guidelines has been drawn up to ensure adherence to these values. Any proven or suspected violation of these guidelines will result in consequences decided by the teaching body in accordance with Aalto University’s rules; these could entail but are not limited to grade deduction, automatic failure, and disciplinary action. At their core, the guidelines aim to conserve integrity, ensure fairness, and prevent harassment. By respecting them, students will help create a safe and just learning environment for all.


      1. ALL INDIVIDUAL WORK MUST BE THE STUDENT’S OWN WORK

      Databases for Data Science expects enrolled students to work on the individual exercises by themselves. Students are free and encouraged to discuss these exercises but all answers should be written individually. Sharing or receiving answers with other students, whether currently or previously enrolled is not allowed, except for the case where a student is retaking the course and using their own work. Using AI tools such as ChatGPT to do homework is not allowed. However, using online resources to further understand a topic is allowed as long as they are referenced within the answers. Students are required to use discretion in these situations, but all malicious behaviour, such as copying solutions, is not tolerated.

      2. PARTICIPATION IN THE GROUP WORK SHOULD BE FAIR AND EQUAL

      Databases for Data Science expects enrolled students to partake in the group project in a fair manner. All students must fulfil their responsibilities as agreed upon by all group members. 

      Students may share and distribute tasks as they wish as long as it is done equitably. In the case of major conflict, members of the teaching body should be consulted.

      3. NO FORMS OF HARASSMENT ARE TOLERATED UNDER ANY CIRCUMSTANCES 

      Databases for Data Science expects enrolled students to act civilly and respectfully towards each other and members of the teaching body. Any form of harassment or inappropriate behaviour is unacceptable. This could include, but is not limited to, harassment based on gender, race, beliefs, and sexual orientation. If a student witnesses or experiences harassment or inappropriate behaviour, they must contact members of the teaching body.