LOT Winter School 2018

RM1 Statistics for Linguists - COURSE UNTIL 17.15

Martijn Wieling

Contact

University of Groningen




m.b.wieling@rug.nl
http://www.martijnwieling.nl

Course info

Level: RM1

Course description:


This course provides an overview of several statistical concepts and
methods. The software used in this course is R, which is freely
available and provides excellent facilities for sophisticated
statistical analyses. The topics treated in this course are: basic
concepts (p-values, etc.), data exploration (basic visualization),
t-tests, ANOVA, non-parametric tests, correlation, regression
analysis, logistic regression analysis, and mixed-effects regression
analysis. While this course focuses on frequentist approaches to
statistics, some practical information is given on how a Bayesian
approach may augment a frequentist analysis.

The main objectives of this course are to refresh and augment your
statistical knowledge. The course will be relatively hands-on, meaning
that the focus is on determining which test to use, how to use it, and
how to interpret the results. Most examples in the course will focus on
linguistic material. Except on the first day, which has no lab session,
each session starts with a lecture followed by a lab session. You
should therefore bring a laptop (with Internet access) that has the
most recent versions of R (https://cloud.r-project.org/) and RStudio
(https://www.rstudio.com/products/rstudio/download/) installed.

Day-to-day program

Monday: Basic concepts of Statistics
(http://www.let.rug.nl/wieling/Statistics/Basics, no lab session)

Tuesday: Intro R & data exploration
(http://www.let.rug.nl/wieling/Statistics/Intro-R + /lab)

Wednesday: Basic statistical tests
(http://www.let.rug.nl/wieling/Statistics/Basic-Tests + /lab)

Thursday: Correlation and regression
(http://www.let.rug.nl/wieling/Statistics/Regression + /lab)

Friday: Mixed-effects regression
(http://www.let.rug.nl/wieling/Statistics/Mixed-Effects + /lab)

Reading list

Background and preparatory readings (not obligatory, but very useful):

  • Baayen, R. H. (2008). Analyzing linguistic data: A practical introduction to statistics using R. Cambridge University Press. [All chapters except for Chapter 5]. Available at: http://www.sfs.uni-tuebingen.de/~hbaayen/publications/baayenCUPstats.pdf
  • Levshina, N. (2015). How to do linguistics with R: Data exploration and statistical analysis. John Benjamins Publishing Company. [Chapters 1 to 9 (except 8.4) and 12]. Available at: https://benjamins.com/#catalog/books/z.195 (approx. EUR 36)

Course readings (obligatory):

  • Lecture 1: Levshina (2015): Chapter 1
  • Lecture 2: Levshina (2015): Chapters 2, 3, 4 and 9 (chi-square)
  • Lecture 3: Levshina (2015): Chapters 5, 8 (except 8.4) and 9
  • Lecture 4: Levshina (2015): Chapters 6, 7 and 12
  • Lecture 5: Baayen (2008): Chapter 7

Further readings (optional):

  • There are many suitable statistics books; one that is freely available can be found here:

http://www.compcogscisydney.com/learning-statistics-with-r.html

Assignments