Teaching resources
Analysing quantitative data: An introduction for language researchers
These lecture notes as well as the datasets used are available from GitHub.
Contents:
- Software
- Working with datasets
- Fundamentals of probability theory
- Descriptive statistics of a univariate sample
- Random samples
- Estimating estimation uncertainty
- Another look at the mean
- Adding a predictor
- Group differences
- Differences between differences
- Several predictors in one model
- The logic of significance testing
- The t-test
- Analysis of variance
- Calculating statistical power
- Silly tests
- Questionable research practices
- Within-subjects experiments
- Logistic regression
- Recommendations for self-study
Quantitative methodology: An introduction
The script for the course on quantitative methodology I teach is available here.
Contents:
- Association and causality.
- Constructing a control group.
- Alternative explanations.
- Inferential statistics 101. (The course is not a statistics course, but there’s no avoiding talking about p-values given their omnipresence.)
- Increasing precision.
- Pedagogical interventions.
- Within-subjects experiments.
- Quasi-experiments and correlational studies.
- Constructs and indicators.
- Questionable research practices.
I’ve also included two appendices:
- Reading difficult results sections.
- Reporting research transparently.
Introduction to the general linear model
These are the lecture notes for a summer school module I taught. They are available from GitHub.
Contents:
- Nuts and bolts: General linear model equation; optimisation criteria (least absolute deviations, least squares, maximum likelihood); estimating uncertainty (bootstrapping, i.i.d. normality assumption).
- Adding a predictor: Interpretation of parameter estimates and regression lines; confidence bands.
- Group differences: Dummy variables (treatment coding, sum coding); bootstrapping without homoskedasticity.
- Interactions.
- Multiple predictors: Confounding variables; control variables; posttreatment variables.
- The basic of logistic regression: Linear probability model; odds, odds ratios, log-odds.
Working with datasets and visualising data in R: Two primers
I wrote a primer on working with datasets in R and another on visualising data in R. You can find the datasets used in the primers on GitHub.
Visualising statistical uncertainty using model-based graphs
I wrote a tutorial about visualising the statistical uncertainty in statistical models for the BICLCE 2019 conference in Bamberg. You can find the tutorial here: Visualising statistical uncertainty using model-based graphs.
Contents:
- Why plot models, and why visualise uncertainty?
- The principle: An example with simple linear regression
- Step 1: Fit the model
- Step 2: Compute the conditional means and confidence intervals
- Step 3: Plot!
- Predictions about individual cases vs. conditional means
- More examples
- Several continuous predictors
- Dealing with categorical predictors
- t-tests are models, too
- Dealing with interactions
- Ordinary logistic regression
- Mixed-effects models
- Logistic mixed effects models
- Caveats
- Other things may not be equal
- Your model may be misspecified
- Other models may yield different pictures
Miscellaneous tutorials
The blog archive contains a number of tutorials.
cannonball (R package)
The cannonball package bundles a couple of functions that I use when teaching introductory courses in quantitative methodology and statistics. These include
plot_r()for drawing different scatterplots with the same correlation coefficient,walkthrough_p()andwalkthrough_blocking(), which both aim to help students see the connection between an experiment’s design and its analysis,clustered_data()for simulating data from cluster-randomised experiments,parade()and associated functions for helping researchers check the assumptions of their statistical models.
More information is available on GitHub.