# Blog

I blog about statistics and research design, with researchers in bilingualism, multilingualism, and applied linguistics in mind as my audience.

## Latest blog posts

### Confidence interval-based optional stopping

19 September 2017

Stopping data collection early when
the provisional results are significant (“optional stopping”)
inflates your chances of finding
a pattern in the data when
nothing is going on. This can be countered using a technique
known as sequential testing, but that’s not what this post
is about. Instead, I’d like to illustrate that optional stopping
isn’t necessarily a problem if your stopping rule doesn’t
involve p-values. If, instead of on p-values, you base
your decision to collect more data or not on how wide
your current confidence interval (or Bayesian credible interval)
is, peeking at your data can be a reasonable strategy.
Additionally, in this post, I want to share some `R` functions
that you can adapt in order to simulate
the effects of different stopping rules.
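The post’s functions are written in `R`; as a taste of the idea, here is a minimal Python sketch of a precision-based stopping rule (the function and parameter names are my own, for illustration): sampling continues not until the result is significant, but until the 95% confidence interval for the mean is narrower than a target width.

```python
import math
import random
import statistics

def run_until_precise(effect=0.0, min_n=10, max_n=500, target_width=0.5, seed=1):
    """Draw one observation at a time from Normal(effect, 1) and stop as soon
    as the 95% CI for the mean (normal approximation, z = 1.96) is narrower
    than target_width, or when max_n observations have been collected."""
    rng = random.Random(seed)
    xs = []
    while len(xs) < max_n:
        xs.append(rng.gauss(effect, 1.0))
        if len(xs) >= min_n:
            se = statistics.stdev(xs) / math.sqrt(len(xs))
            if 2 * 1.96 * se < target_width:
                break  # precise enough: stop collecting data
    m = statistics.mean(xs)
    se = statistics.stdev(xs) / math.sqrt(len(xs))
    return m, (m - 1.96 * se, m + 1.96 * se), len(xs)
```

Because the rule looks only at the interval’s width, not at whether it excludes zero, stopping early doesn’t systematically favour “significant” samples.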

### Creating comparable sets of stimuli

14 September 2017

When designing a study, you sometimes have a pool of candidate stimuli
(words, sentences, texts, images etc.) that is too large to present to
each participant in its entirety. If you want data for all or at least
most stimuli, a possible solution is to split up the pool of stimuli
into sets of overseeable size and assign each participant to one of the
different sets. Ideally, you’d want the different sets to be as comparable
as possible with respect to a number of relevant characteristics so that
each participant is exposed to about the same diversity of stimuli during
the task. For instance, when presenting individual words to participants,
you may want each participant to be confronted with a similar distribution
of words in terms of their frequency, length, and number of adverbs.
In this blog post I share some `R` code that I used to split up two
types of stimuli into sets that are comparable with respect to one or several
variables, in the hopes that you can easily adapt it for your own needs.
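The post’s code is in `R`; the general recipe can be sketched in Python (this is a guess at one simple approach, not the post’s actual code): shuffle the stimulus pool into equal-sized sets many times and keep the split whose sets have the most similar means on a chosen variable.

```python
import random
import statistics

def comparable_sets(stimuli, key, n_sets=2, n_tries=1000, seed=42):
    """Repeatedly shuffle `stimuli` into n_sets equal-sized sets and keep
    the split whose per-set means of key(stimulus) are most similar."""
    rng = random.Random(seed)
    items = list(stimuli)
    best, best_spread = None, float("inf")
    for _ in range(n_tries):
        rng.shuffle(items)
        sets = [items[i::n_sets] for i in range(n_sets)]
        means = [statistics.mean(key(s) for s in one_set) for one_set in sets]
        spread = max(means) - min(means)
        if spread < best_spread:
            best, best_spread = [list(s) for s in sets], spread
    return best
```

To balance several variables at once, the `spread` score could be replaced with a sum of (suitably scaled) spreads, one per variable.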

### Draft: Replication success as predictive utility

24 August 2017

In recent years, a couple of high-profile, multi-lab attempts to replicate previous findings have been conducted in psychology, but there wasn’t much consensus about when a replication attempt should be considered to confirm the original finding. When I started dabbling in predictive modelling a couple of months ago, I began to think that it could be useful to view replication success in terms of how accurately the original finding predicts the replication: if taking the original finding at face value permits more accurate predictions about the patterns in the replication data than ignoring it does, the original finding can at least be said to contribute to our body of knowledge. This approach, I think, has some key advantages over the methods used to quantify replication success in the multi-lab replication attempts that I mentioned earlier.
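To make the idea concrete, here is a toy Python formalisation of my own (not taken from the draft): score a replication by whether predicting its treatment mean from the original effect estimate beats predicting no difference at all.

```python
import statistics

def predictive_gain(original_effect, replication_control, replication_treatment):
    """Compare how well the original effect estimate predicts the replication's
    group difference versus ignoring it (i.e., predicting no difference).
    Positive return values mean the original finding improved the prediction."""
    ctrl_mean = statistics.mean(replication_control)
    treat_mean = statistics.mean(replication_treatment)
    # Prediction that takes the original finding at face value:
    err_original = (treat_mean - (ctrl_mean + original_effect)) ** 2
    # Prediction that ignores the original finding:
    err_null = (treat_mean - ctrl_mean) ** 2
    return err_null - err_original
```

A fuller treatment would compare out-of-sample predictive accuracy observation by observation, but the contrast between the two predictions is the core of the proposal.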

### Abandoning standardised effect sizes and opening up other roads to power

14 July 2017

Numerical summaries of research findings will typically feature
an indication of the sizes of the effects that were studied.
These indications are often *standardised* effect sizes,
which means that they are expressed relative to the
variance in the data rather than with respect to the units
in which the variables were measured.
Popular standardised effect sizes include Cohen’s *d*, which
expresses the mean difference between two groups as a proportion
of the pooled standard deviation, and Pearson’s *r*,
which expresses the change in one variable, in standard deviation units,
that is associated with a change of one standard deviation in another variable.
There exists a rich literature that discusses which standardised
effect sizes ought to be used depending on the study’s design,
how this or that standardised effect size should be adjusted for this
and that bias,
and how confidence intervals should be constructed around standardised
effect sizes (see the blog post *Confidence intervals for standardised mean differences*).
But most of this literature *should* be of little importance to the practising
scientist, for the simple reason that standardised effect sizes themselves ought to
be of little importance.

In what follows, I will defend this point of view, which I’ve outlined in two previous blog posts (*Why I don’t like standardised effect sizes* and *More on why I don’t like standardised effect sizes*), by sharing some quotes by respected statisticians and methodologists. In particular, I hope to, first, make you think about the detrimental effect that the use of standardised effect sizes has on the interpretability of research findings and the accumulation of knowledge and, second, convince you that using standardised effect sizes to plan studies overly stresses sample size as the main determinant of a study’s power, and that abandoning them opens up other roads to power.
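One way to see the interpretability problem is to compute Cohen’s *d*, as defined above, on two datasets with the same raw difference but different variability; a quick Python illustration (the data are made up):

```python
import math
import statistics

def cohens_d(xs, ys):
    """Cohen's d: the mean difference divided by the pooled standard deviation."""
    nx, ny = len(xs), len(ys)
    pooled_var = ((nx - 1) * statistics.variance(xs)
                  + (ny - 1) * statistics.variance(ys)) / (nx + ny - 2)
    return (statistics.mean(xs) - statistics.mean(ys)) / math.sqrt(pooled_var)
```

The same 4-unit raw difference yields a large *d* in a low-variance sample and a much smaller *d* in a high-variance one, which is exactly why a standardised effect size cannot be interpreted without knowing the variability of the sample it came from.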

### Interactions between continuous variables

26 June 2017

Splitting up continuous variables is generally a bad idea. In terms of statistical efficiency, the popular practice of dichotomising continuous variables at their median is comparable to throwing out a third of the dataset. Moreover, statistical models based on split-up continuous variables are prone to misinterpretation: threshold effects are easily read into the results when, in fact, none exist. Splitting up, or ‘binning’, continuous variables, then, is something to avoid. But what if you’re interested in how the effect of one continuous predictor varies according to the value of another continuous predictor? In other words, what if you’re interested in the interaction between two continuous predictors? Binning one of the predictors seems appealing since it makes the model easier to interpret. However, as I’ll show in this blog post, it’s fairly straightforward to fit and interpret interactions between continuous predictors.
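The post itself works in `R`; as a language-agnostic sketch (all names here are my own), the key point can be shown in Python: with an interaction term, the slope of one predictor is itself a linear function of the other, so after fitting y on x1, x2, and x1·x2, reading off b1 + b3·x2 gives the conditional effect of x1 at any value of x2, no binning required.

```python
def ols(X, y):
    """Least-squares coefficients via the normal equations X'X b = X'y,
    solved by Gaussian elimination with partial pivoting."""
    n, p = len(X), len(X[0])
    A = [[sum(X[i][j] * X[i][k] for i in range(n)) for k in range(p)]
         for j in range(p)]
    v = [sum(X[i][j] * y[i] for i in range(n)) for j in range(p)]
    for col in range(p):
        piv = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[piv] = A[piv], A[col]
        v[col], v[piv] = v[piv], v[col]
        for r in range(col + 1, p):
            f = A[r][col] / A[col][col]
            for c in range(col, p):
                A[r][c] -= f * A[col][c]
            v[r] -= f * v[col]
    beta = [0.0] * p
    for r in range(p - 1, -1, -1):
        beta[r] = (v[r] - sum(A[r][c] * beta[c] for c in range(r + 1, p))) / A[r][r]
    return beta

def conditional_slope(beta, x2):
    """Effect of x1 when the model is b0 + b1*x1 + b2*x2 + b3*x1*x2."""
    return beta[1] + beta[3] * x2
```

In `R` you would of course just use `lm(y ~ x1 * x2)`; the hand-rolled solver is only there to keep the sketch self-contained.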

### Tutorial: Adding confidence bands to effect displays

12 May 2017

In the previous blog post, I demonstrated how you can draw effect displays to render regression models more intelligible to yourself and to your audience. These effect displays did not contain information about the uncertainty inherent to estimating regression models, however. To that end, this blog post demonstrates how you can add confidence bands to effect displays for multiple regression, logistic regression, and logistic mixed-effects models, and explains how these confidence bands are constructed.
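The tutorial builds its bands in `R`; as a minimal illustration of the construction for the simplest case, here is the textbook pointwise band for simple linear regression, sketched in Python (using a critical value of 2 as a rough stand-in for the exact t quantile):

```python
import math
import statistics

def confidence_band(xs, ys, x0, crit=2.0):
    """Pointwise confidence band for the fitted line of a simple linear
    regression at x0: yhat(x0) +/- crit * s * sqrt(1/n + (x0 - xbar)^2 / Sxx)."""
    n = len(xs)
    xbar, ybar = statistics.mean(xs), statistics.mean(ys)
    sxx = sum((x - xbar) ** 2 for x in xs)
    b1 = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) / sxx
    b0 = ybar - b1 * xbar
    residuals = [y - (b0 + b1 * x) for x, y in zip(xs, ys)]
    s = math.sqrt(sum(r * r for r in residuals) / (n - 2))  # residual SD
    yhat = b0 + b1 * x0
    half = crit * s * math.sqrt(1 / n + (x0 - xbar) ** 2 / sxx)
    return yhat - half, yhat, yhat + half
```

The half-width grows with the distance of x0 from the predictor’s mean, which is why such bands are narrowest in the middle of the data. For multiple regression and mixed-effects models the same idea applies, but the standard error of the fitted value comes from the full coefficient covariance matrix.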

### Tutorial: Plotting regression models

23 April 2017

The results of regression models,
particularly fairly complex ones,
can be difficult to appreciate
and hard to communicate to an audience.
One useful technique is to plot
the effect of each predictor variable
on the outcome while holding constant
any other predictor variables.
Fox (2003)
discusses how such **effect displays** are
constructed and provides an implementation
in the `effects` package for `R`.

Since I think it’s both instructive to see how effect displays
are constructed from the ground up
and useful to be able to tweak them yourself in `R`,
this blog post illustrates how to draw such plots
for three increasingly complex statistical models:
ordinary multiple regression,
logistic regression,
and mixed-effects logistic regression.
The goal in each of these three examples
is to visualise the effects of the predictor
variables without factoring in the uncertainty
about these effects;
visualising such uncertainty will be the
topic of a future blog post.
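As a bare-bones illustration of the recipe for a linear model without interactions (names are my own; this is not the `effects` package), an effect display just evaluates the fitted equation over a grid of the focal predictor while every other predictor is held at its sample mean:

```python
import statistics

def effect_display(coefs, data, focal, grid):
    """Predicted outcome for each value in `grid` of the `focal` predictor,
    holding the other predictors in `data` at their sample means.
    `coefs` maps predictor names (plus "intercept") to fitted coefficients;
    `data` maps predictor names to their observed values."""
    means = {p: statistics.mean(values) for p, values in data.items() if p != focal}
    display = []
    for x in grid:
        yhat = coefs["intercept"] + coefs[focal] * x
        yhat += sum(coefs[p] * m for p, m in means.items())
        display.append((x, yhat))
    return display
```

Plotting the resulting (x, yhat) pairs gives the effect display for the focal predictor; for logistic models the linear predictor would additionally be passed through the inverse logit.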

### Confidence intervals for standardised mean differences

22 February 2017

Standardised effect sizes express patterns found in the data in
terms of the variability found in the data. For instance, a mean difference
in body height could be expressed in the metric in which the data were
measured (e.g., a difference of 4 centimetres) or relative to the
variation in the data (e.g., a difference of 0.9 standard deviations).
The latter is a standardised effect size known as Cohen’s *d*.

As I’ve written before,
I don’t particularly like standardised effect sizes.
Nonetheless, I wondered how confidence intervals around standardised
effect sizes (more specifically: standardised mean differences)
are constructed. Until recently, I hadn’t really thought about it
and sort of assumed you would compute them the same way as
confidence intervals around
raw effect sizes. But unlike raw (unstandardised) mean differences,
standardised mean differences are a combination of *two* estimates
subject to sampling error: the mean difference itself
and the sample standard deviation.
Moreover, the sample standard deviation is a biased estimate of
the population standard deviation (it tends to be
too low),
which causes Cohen’s *d* to be an upwardly biased estimate of the
population standardised mean difference.
Surely both of these factors must affect how
the confidence intervals around standardised effect sizes are constructed?

It turns out that indeed they do. When I computed confidence intervals around a standardised effect size using a naive approach, one that assumed the standard deviation was neither subject to sampling error nor biased, I got different results than when I used specialised `R` functions. But these `R` functions all produced different results from one another, too.

Obviously, there may well be more than one way to skin a cat, but this
caused me to wonder if the different procedures for computing confidence
intervals all covered the true population parameter with the nominal
probability (e.g., in 95% of cases for a 95% confidence interval).
I ran a simulation to find out, which I’ll report in the remainder of this post.
**If you spot any mistakes, please let me know.**
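The simulation itself lives in the post and uses `R`; the skeleton of such a coverage check is simple enough to sketch generically in Python (my own version, not the post’s code): simulate many samples from a population with a known parameter, compute the interval each time, and count how often it covers the truth.

```python
import math
import random
import statistics

def coverage(sim_fn, ci_fn, true_value, n_sims=2000, seed=7):
    """Fraction of simulated samples whose interval ci_fn(sample)
    covers true_value; should be near the nominal level (e.g., 0.95)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_sims):
        lo, hi = ci_fn(sim_fn(rng))
        if lo <= true_value <= hi:
            hits += 1
    return hits / n_sims
```

Plugging in a simulator for two-group data and each of the competing confidence-interval procedures for Cohen’s *d* then reveals which procedures attain their nominal coverage.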

### Which predictor is most important? Predictive utility vs. construct importance

15 February 2017

Every so often, I’m asked for my two cents on a correlational study in which the researcher wants to find out which of a set of predictor variables is the most important one. For instance, they may have the results of an intelligence test, of a working memory task and of a questionnaire probing their participants’ motivation for learning French, and they want to find out which of these three is the most important factor in acquiring a nativelike French accent, as measured using a pronunciation task. As I will explain below, research questions such as these can be interpreted in two ways, and whether they can be answered sensibly depends on the interpretation intended.

### Automatise repetitive tasks

31 January 2017

Research often involves many repetitive tasks. For an ongoing project, for instance, we needed to replace all stylised apostrophes (’) with straight apostrophes (') in some 3,000 text files when preparing the texts for the next step. As another example, you may need to split up a bunch of files into different directories depending on, say, the character in the file name just before the extension. When done by hand, such tasks are as mind-numbing and time-consuming as they sound – perhaps you would do them on a Friday afternoon while listening to music, or outsource them to a student assistant. My advice, though, is this: try to automatise repetitive tasks.

Doing repetitive tasks is what computers are for, so rather than spending several hours learning nothing, I suggest you spend that time writing a script or putting together a command-line call that does the task for you. If you have little experience doing this, it will take time at first. In fact, I reckon I often spend roughly the same amount of time trying to automatise menial tasks as it would have taken me to do them by hand. But in the not-so-long run, automatisation is a time-saver: once you have a working script, you can tweak and reuse it. Additionally, while you’re figuring out how to automatise a menial chore, you’re actually learning something useful. The chores become more of a challenge and less mind-numbing. I’m going to present an example or two of what I mean and conclude by giving some general pointers.
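For instance, the apostrophe chore mentioned above takes only a few lines, here sketched in Python (we may well have done it differently at the time; the directory layout and glob pattern are illustrative):

```python
from pathlib import Path

def straighten_apostrophes(directory, pattern="*.txt"):
    """Replace stylised apostrophes (U+2019) with straight ones (')
    in every file under `directory` matching `pattern`.
    Returns the number of files that were changed."""
    changed = 0
    for path in Path(directory).rglob(pattern):
        text = path.read_text(encoding="utf-8")
        fixed = text.replace("\u2019", "'")
        if fixed != text:
            path.write_text(fixed, encoding="utf-8")
            changed += 1
    return changed
```

The same shape – walk the files, transform, write back – covers the file-sorting example too; only the transformation in the middle changes.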