About this Course

R is free to use for everyone and powerful. It has become one of the most widely-used programming languages for statistical analyses in the social sciences and is, for this reason, a highly-sought skill among employers. R is probably more versatile than you imagine. In fact, I programmed this website and all my lecture slides in R — and so can you!

This course will teach you how to do (social) data science with R. You will learn how to get your data into shape, transform and manipulate it, visualize it, and how to statistically model it. The course will also briefly introduce you to logistic regression and multilevel modelling. Apart from these skills that are necessary for conducting classical statistics, you will also learn how to do reproducible research and report your results using R Markdown. Beware that this class presumes that you have a solid background in basic statistics (i.e., descriptive statistics and multiple OLS regression).

This course is based on the fantastic and openly accessible book Introduction to R for Data Science by Garrett Grolemund and Hadley Wickham.

Find further infos about the class in the course catalogue and about time and place here

Before the first session

Before the first session, I would like you to please download R, download RStudio, and install both on your laptop (see also). Please always make sure to bring your laptop to class.

Course structure

We meet once a week, every Friday with two sessions and a lunch break in between. Every session consists of a 45 minutes lecture, a 15 minutes break, and finally a 45 minutes lab session during which you work on that day's exercise in small groups. In just seven weeks you will be able to read complex code and conduct sophisticated analyses.

SessionTopicSessionTopic
Week 36Week 40
1Intro9Modeling: OLS I
2Vectors10Modeling: OLS II
Week 37Week 41
3Data frames & tibbles11Modeling: Logistic regression
4Visualization I12Modeling: Mixed effects models
Week 38Week 43
5Piping & grouped operations13Dynamic documents with R Markdown
6Visualization II14(Peer-) Feedback
Week 39
7Reshaping & relational data**End November **Peer feedback
8Modeling: PCA07 January ‘22Deadline: Written essay