Human Connectome Project
- Responsible TA: Onerva
- Data description: The Human Connectome Project (HCP) aims to construct a complete map of the structural and functional networks of the human brain. As part of the project, a large open dataset has been published (available at https://db.humanconnectome.org; free, but requires registration). The dataset includes MRI, task fMRI, and resting-state fMRI data for 897 healthy subjects, diffusion imaging data for 730 healthy subjects, and MEG data for 95 healthy subjects. The fMRI tasks cover working memory, recognition memory, gambling, motor, language processing, social cognition (Theory of Mind), relational processing, and emotion processing. Data are available at different processing levels; we recommend using the minimally preprocessed data.
- Possible research questions: How do brain network modules differ between tasks and rest? How do networks extracted from different imaging data (fMRI, dMRI, and MEG) differ from each other?
Public transport networks
- Responsible TA: Rainer
- Data description: Open GTFS timetable data (see https://developers.google.com/transit/gtfs/reference), available for Helsinki and other cities and preprocessed into SQL databases. Also available: historical GPS locations of Kutsuplus buses (on-demand public transport) and real-time delay data for the Helsinki region.
- Possible research questions: Multiple, from robustness of public transport networks to better routing algorithms. For more inspiration, see our visualizations of public transport network data.
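As a starting point for the questions above, a stop-to-stop network can be built from the timetable by linking consecutive stops of each trip. The following is only a minimal sketch: it assumes rows in the layout of the GTFS stop_times.txt file (trip_id, stop_id, stop_sequence), uses a tiny made-up sample instead of the course's SQL databases, and the function name stop_network is hypothetical.

```python
import csv
import io
from collections import Counter

# Tiny made-up sample in the GTFS stop_times.txt layout (illustrative data only).
SAMPLE_STOP_TIMES = """trip_id,arrival_time,departure_time,stop_id,stop_sequence
t1,08:00:00,08:00:00,A,1
t1,08:05:00,08:05:00,B,2
t1,08:10:00,08:10:00,C,3
t2,09:00:00,09:00:00,A,1
t2,09:05:00,09:05:00,B,2
"""


def stop_network(stop_times_file):
    """Return a Counter mapping (from_stop, to_stop) to the number of trips."""
    trips = {}
    for row in csv.DictReader(stop_times_file):
        trips.setdefault(row["trip_id"], []).append(
            (int(row["stop_sequence"]), row["stop_id"])
        )
    edges = Counter()
    for visits in trips.values():
        visits.sort()  # order the stops of one trip by stop_sequence
        for (_, a), (_, b) in zip(visits, visits[1:]):
            edges[(a, b)] += 1  # one more trip travels directly from a to b
    return edges


edges = stop_network(io.StringIO(SAMPLE_STOP_TIMES))
for (a, b), n_trips in sorted(edges.items()):
    print(a, "->", b, n_trips)
```

The edge weights (number of trips per link) are one possible choice; travel times or headways could be used instead, depending on the research question.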
Panama Papers and Offshore Leaks
- Responsible TA: Mikko
- Data description: This is a network database that is part of the Panama Papers and the Offshore Leaks investigations. The network contains more than a million nodes and around 1.5 million links. There are 4 types of nodes and multiple types of links. The data contain temporal information spanning 40 years and rich metadata, such as the addresses of the various entities. The data can be downloaded from, and are described in detail at, https://www.occrp.org/en/panamapapers/database
- Possible research questions: How connected is this network? Are there clusters, or separated components? Which entities dominate the network?
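The connectivity questions above can be approached by splitting the network into connected components and ranking nodes by degree. The sketch below is only an illustration under stated assumptions: the edge list is a made-up toy example, the node labels are hypothetical and do not come from the actual database, and degree is just one of several possible measures of which entities "dominate".

```python
from collections import defaultdict, deque

# Made-up toy edge list; the labels are hypothetical, not taken from the database.
TOY_EDGES = [
    ("officer_1", "entity_1"),
    ("officer_1", "entity_2"),
    ("officer_2", "entity_2"),
    ("officer_3", "entity_3"),
]


def connected_components(edges):
    """Return (components sorted by size, adjacency dict) for an undirected edge list."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen = set()
    components = []
    for start in adj:
        if start in seen:
            continue
        # Breadth-first search collects every node reachable from `start`.
        seen.add(start)
        queue = deque([start])
        comp = set()
        while queue:
            node = queue.popleft()
            comp.add(node)
            for nb in adj[node]:
                if nb not in seen:
                    seen.add(nb)
                    queue.append(nb)
        components.append(comp)
    components.sort(key=len, reverse=True)
    return components, adj


components, adj = connected_components(TOY_EDGES)
degree = {node: len(nbs) for node, nbs in adj.items()}
print("component sizes:", [len(c) for c in components])
print("highest degree:", max(degree.values()))
```

For a network with more than a million nodes, a dedicated graph library would likely be more practical than this hand-written search, but the logic stays the same.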
SocioPatterns
- Responsible TA: Jari
- Data description: "Since 2008, we have collected longitudinal data on the physical proximity and face-to-face contacts of individuals in numerous real-world environments, covering widely varying contexts across several countries: schools, museums, hospitals, etc. We use the data to study human behaviour and to develop agent-based models for the transmission of infectious diseases" (see www.sociopatterns.org)
- Possible research questions: What types of temporal structures can be discovered in these contact networks? How do they affect processes such as the spreading of contagious diseases?
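To illustrate why the timing of contacts matters for spreading, here is a minimal sketch of a deterministic susceptible-infected (SI) process on a temporal contact list. It assumes the contacts are given as (t, i, j) triples, meaning that i and j were in contact at time t; the toy data, the function name si_spread, and the assumption that every contact transmits the infection are all illustrative choices, not part of the dataset description.

```python
# Made-up toy contacts as (time, person_i, person_j); not real data.
TOY_CONTACTS = [
    (20, "a", "b"),
    (40, "b", "c"),
    (10, "c", "d"),
]


def si_spread(contacts, seed, start_time=0):
    """Return {person: infection time} for a deterministic SI process.

    Every contact transmits, but only if one side was infected strictly
    before the contact time, so infection follows time-respecting paths.
    """
    infected = {seed: start_time}
    for t, i, j in sorted(contacts):  # process contacts in time order
        if t < start_time:
            continue
        for a, b in ((i, j), (j, i)):
            if a in infected and infected[a] < t and b not in infected:
                infected[b] = t
    return infected


from_a = si_spread(TOY_CONTACTS, seed="a")
from_d = si_spread(TOY_CONTACTS, seed="d")
print("seed a reaches:", sorted(from_a))
print("seed d reaches:", sorted(from_d))
```

In the static, aggregated network all four people are connected, yet neither seed reaches everyone: the c-d contact happens before c can be infected from a, and the a-b contact happens before b can be infected from d. This kind of ordering effect is lost when the contacts are aggregated over time.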