MS-E1623 - How to lie with statistics?, Lecture, 24.10.2022-1.12.2022
This course space end date is set to 01.12.2022 Search Courses: MS-E1623
Topic outline
-
How to lie with statistics? (5cr)
This is an advanced course in statistics. The course is aimed at master's students and doctoral students interested in statistics. Maturity in performing statistical analysis is needed and thus students should have taken at least one master's level statistics course before attending this course. There are no other prerequisites.
Note that all the lectures are given on campus. Remote attendance is not possible.
Content
During this course, students talk about typical problems and faults in sample selection, choices of location measure, graphical presentation of data, forming questionnaires, statistical testing, and regression analysis. Students are assumed to be familiar with these methods before attending the course. The focus will be on examples about using these methods wrongly --- either accidentally or on purpose --- and on improving statistical analyses.
Intended learning outcomes
The objectives are to learn to evaluate statistical analyses critically, to learn to avoid typical pitfalls in simple statistical analyses and to learn to improve presentation of the results obtained in statistical analyses. The objective is not to learn to lie with statistics, but to learn to spot if there is something fishy in a statistical analysis. The ultimate goal is to learn to tell the truth with statistics.
Lectures and assignments
The course consists of 12 lectures, lecture assignments, project work and study journal. Lectures are on Mondays and on Wednesdays from 10.15 to 12.00 in Y313. Majority of the lectures, instead of traditional lecturing, consists of discussions. Students will find problematic data examples themselves and their findings and ideas for improving data analyses are discussed during the lectures. Students will also learn to defend their ideas and discoveries by conducting their project works where statistical analyses are used in justifying opinions and claims. Students will also write a study journal. In the study journals students may write down notes about their thoughts and reactions to what has been discussed. Writing and submitting a study journal on time is compulsory for completing the course!
Lecture topics
Lecture 1: Introduction --- We talk about the project works and about all the lecture assignment and about common errors and problems that are related to the lecture assignment topics.
Lecture 2: Getting ready for the project works
Lecture 3: Selecting the sample
Lecture 4: Measures of location
Lecture 5: Graphics
Lecture 6: Questionnaires
Lecture 7: Testing
Lecture 8: Regression analysis
Lecture 9: Statistics related to timely topics (pandemic, climate change, ...)
Lecture 10: Project work presentations
Lecture 11: Project work presentations
Lecture 12: Summary
Lecture assignments
There is an assignment related to almost every lecture. Submit your assignments on time! Late submission is not possible! For lecture 2, every student has to come up with at least two possible project work topics. On lecture 2, we discuss about the topics and every student selects his/her topic or project group. Project work presentations take place on Lecture 10 and 11 so there is plenty of time to prepare for that. For Lecture 3, every student has to find one real data example or invent two examples that illustrate the problems related to biased sample. For Lecture 4, every student has to find one real data example or simulate two examples, where different location measures tell completely different stories. For Lecture 5, every student has to find one real data example or simulate two examples about misleading graphical presentation. For Lecture 6, every student has to find one real data example or write two examples of badly worded questionnaire questions or answer choices. For Lecture 7, every student has to find one real data example or simulate two examples, where results of statistical testing are false or misleading. For Lecture 8, every student has to find one real data example or simulate two examples, where regression analysis gives misleading results. For Lecture 9, every student has to give one example or simulate two examples related to misleading interpretation, analysis or comparison of data that is related to some timely topic.
Examples and ways to improve statistical analyses are discussed during the lectures.
Study journal
In order to complete the course, students have to keep a study journal (approximately 1/2 pages per lecture). Study journal must be submitted on time! Writing and submitting the study journal on time is compulsory for completing the course!
Assessment
The assessment is based on the lecture assignments, compulsory study journal and the project work. Writing and submitting the study journal on time is compulsory for completing the course! Final grade of the course is given by
grade = 5 - 0.5ms - 0.5ma - 1md - 1ij,
where ms is the number of the student's missed lectures, ma is the number of the student's missed lecture assignments, md is 1 if the student does not present his/her project work (and 0 if the student does present his/her project work), and ij is 1 if the student's study journal is incomplete (and 0 if the study journal is complete). The grades are rounded up to the closest integer. For example, grade 5 may be obtained by full attendance, completing all but one lecture assignments, submitting a complete study journal on time and presenting the project work. Grade 3 may be obtained by full attendance, completed lecture assignments, and submitting an incomplete study journal on time. Grade 1 may be obtained by attending all but 2 lectures, completing all but 2 lecture assignments, and submitting an incomplete study journal on time.
Workload
Majority of students' workload will come from independent assignments. Lecture assignments will take on average 7*8 = 56 hours to complete. That includes finding representative data examples and observing problems in them. Writing the study journal takes on average 20-25 h as total. Project work will take on average about 15-20 h. Attending the lectures takes as total 24 h.
Learning materials
Main materials for this course are the examples found by the students. The book "How to lie with Statistics" written by Darrell Huff may also be used as study material, but there is no need for the students to purchase this book for the course.