A crash course in Bayesian mixed models with brms (Lesson 3)

Introduction and review

What is this class?

Part 3 of a practical introduction to fitting Bayesian multilevel models in R and Stan
Uses the brms package (Bayesian Regression Models using Stan)

How to follow the course

Slides and text version of lessons are online
Fill in code in the worksheet (replace ... with code)
You can always copy and paste code from text version of lesson if you fall behind

What we’ve learned in Lessons 1 and 2

Relationship between prior, likelihood, and posterior
How to sample from the posterior with Hamiltonian Monte Carlo
How to use posterior distributions from our models to make predictions and test hypotheses
GLMMs with non-normal response distributions and random intercepts and slopes

What will we learn in this lesson?

How to fit a linear mixed model with residuals that are correlated in time (repeated measures in time)
How to fit a linear mixed model to a split plot design, and add a spatial additive component to the model

Space and time

Basic insight: the closer two things are in time or space, the more closely related they are
First Law of Geography

Space and time relationship: Example

We are studying the effect of farm-level management practices on soil carbon
The closer together two farms are, the more likely they are to share environmental conditions
Adjacent farms are not statistically independent even if they make independent management decisions
This is an issue with observational data and with designed experiments

Residuals are independent … right?

The models we’ve seen so far assume the residuals of every data point are independent of all the others
Even the mixed models assume this is true after the random effects are accounted for
- You may know the term “G-side models” from SAS: random effects are in the \(\textbf{G}\) matrix (variances and covariances of the random effects)
Patterns of variation in space and time can cause this assumption to be wrong

One way to model data correlated in space or time

Within the framework of a GLMM, we can model a residual covariance structure
- Data points are modeled as being more closely correlated, the closer they are to each other in space or time
- SAS calls these “R-side models”: random effects are in the \(\textbf{R}\) matrix, which tells us the pattern of how the residuals (errors of each data point) are correlated with each other

Another way to model data correlated in space or time

We can include an additive term, otherwise known as a “smoother” or “spline”
- Models the response variable as depending on a smooth function of either space or time
- Our model is now a GAMM (generalized additive mixed model)

Example 1: a GLMM with residual covariance structure to model autocorrelation in time
Example 2: a GAMM with an additive term to model autocorrelation in space

Both ways work for space or time, and you can even combine them in the same model!

Models with residual covariance for repeated measures in time

Repeated measures within subjects

Many datasets have the same subjects (plots, animals) measured at different time points
We need to account for measurements from the same individual at different times not being independent

Load packages

Same as before except ape for spatial autocorrelation statistics

library(brms)
library(agridat)
library(tidyverse)
library(emmeans)
library(tidybayes)
library(ggdist)
library(bayestestR)
library(gt)
library(ape)

theme_set(theme_bw())
colorblind_friendly <- scale_color_manual(values = unname(palette.colors(5)[-1]), aesthetics = c('colour', 'fill'))

options(mc.cores = 4, brms.file_refit = 'on_change') # If you don't have CmdStan
options(mc.cores = 4, brms.file_refit = 'on_change', brms.backend = 'cmdstanr') # If you do have CmdStan

Meet the example data

Image (c) Te Ara, the Encyclopedia of New Zealand

diggle.cow from agridat package
26 cows split among four treatment combinations (animal is individual ID)
2×2 factorial
- Iron supplementation ('NoIron' or 'Iron')
- Tuberculosis infection ('NonInfected' or 'Infected')
weight measured at 23 time points, roughly evenly spaced

Relevel factors so control groups are first in the factor level ordering

data(diggle.cow)

diggle.cow <- diggle.cow %>%
  mutate(iron = relevel(iron, ref = 'NoIron'),
         infect = relevel(infect, ref = 'NonInfected'))

Plot each individual animal’s weight gain trend as a thin line, colored by treatment combination
Plot median trend for each treatment combination as a thick line (stat_summary())

ggplot(diggle.cow, aes(x = day, y = weight, color = iron, linetype = infect, group = animal)) +
  geom_line() +
  stat_summary(aes(group = interaction(iron, infect)), fun = 'median', geom = 'line', linewidth = 2, show.legend = FALSE) +
  colorblind_friendly +
  labs(y = 'weight (kg)') +
  theme(legend.title = element_blank())

Scaling the variables

Very large or small values cause numerical issues with Monte Carlo sampling
Standardization, or z-transformation, done using scale()
Also helpful for setting priors because the units are always the same (standard deviations)

diggle.cow$dayafterstart <- diggle.cow$day - min(diggle.cow$day)
diggle.cow$weight_scaled <- scale(diggle.cow$weight)

Fitting the model

Fixed part of model formula: weight_scaled ~ dayafterstart * iron * infect
- Continuous time effect, two categorical treatment effects, and interactions
- How is the slope of weight versus time affected by iron and infection?
But each of the 26 cows has her own unique time trend
Our model needs to account for non-independence of measurements of the same cow on different days

Residual covariance matrix

Models we’ve fit so far assumed conditionally independent residuals (independent after accounting for fixed and random effects)
Those models still had a residual variance-covariance matrix \(\textbf{R} = \sigma^2\textbf{I}\)
- \(n \times n\) matrix, \(n\) is number of time points
- \(\sigma^2\) on every element of the diagonal: all residuals have same variance
- 0 everywhere else: zero covariance means no relationship between any two residuals

\[\textbf{R}=\sigma^2\begin{pmatrix}1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}\]

Covariance matrix for correlated residuals

Our big step is to put nonzero values into \(\textbf{R}\) to model residual autocorrelation
There are many ways to model residual autocorrelation, we will cover three of the simplest ones today
- Unstructured covariance
- Compound symmetry
- First-order autoregressive

Unstructured covariance

Every pair of time points is modeled as having its own unique correlation
Every single off-diagonal element of \(\textbf{R}\) (symmetrical) has its own covariance parameter \(\rho_{ij}\) that we have to estimate
Most flexible because it has the most parameters

\[\textbf{R}=\sigma^2\begin{pmatrix}1 & \rho_{12} & \rho_{13} & \rho_{14} \\ \rho_{12} & 1 & \rho_{23} & \rho_{24} \\ \rho_{13} & \rho_{23} & 1 & \rho_{34} \\ \rho_{14} & \rho_{24} & \rho_{34} & 1 \end{pmatrix}\]

Compound symmetry

Simplest model with the fewest parameters, but less flexible
Only a single covariance parameter \(\rho\) is estimated for all pairs of time points
Two residuals from the same individual are correlated the same, no matter how far apart in time they are

\[\textbf{R}=\sigma^2\begin{pmatrix}1 & \rho & \rho & \rho \\ \rho & 1 & \rho & \rho \\ \rho & \rho & 1 & \rho \\ \rho & \rho & \rho & 1 \end{pmatrix}\]

First-order autoregressive, or AR(1)

Correlation between two time points within same individual is higher, the closer in time they are
Like compound symmetry, only a single covariance parameter \(\rho\)
Pairs of times 1 unit apart have correlation \(\rho\), 2 time units apart have \(\rho^2\), then \(\rho^3\) . . .

\[\textbf{R}=\sigma^2\begin{pmatrix}1 & \rho & \rho^2 & \rho^3 \\ \rho & 1 & \rho & \rho^2 \\ \rho^2 & \rho & 1 & \rho \\ \rho^3 & \rho^2 & \rho & 1 \end{pmatrix}\]

Choosing an appropriate covariance structure

We will fit all three as proof of concept
23 time points, so there are 253 pairs
Unstructured covariance model will probably be overparameterized and cause model fitting issues
Autoregressive or compound symmetry?

Model syntax for residual covariance structures

brms syntax: name(time = ..., gr = ...)
- name: name of covariance function: unstr, cosy, ar, etc.
- time: name of time variable column
- gr: name of subject or grouping variable column

Model syntax for residual covariance structures

Unstructured covariance: unstr(time = dayafterstart, gr = animal)
Compound symmetry covariance: cosy(time = dayafterstart, gr = animal)
Autoregressive AR(1) covariance: ar(time = dayafterstart, gr = animal, p = 1)
- Increase p to get more complex autoregressive structures with “deeper” time lags

Fitting the residual covariance models

Unstructured covariance model requires increased warmup iterations, initial values of all parameters set to 0
- In practice these are red flags that the model is overparameterized
normal(0, 5) prior on all fixed effects (standardized response)
Prior on covariance matrix of unstructured covariance model that sets low probability on high correlations
- This is required for the model to converge; frequentist version in glmmTMB fails

R code for residual covariance models

diggle_unstructured <- brm(weight_scaled ~ iron * infect * dayafterstart + unstr(time = dayafterstart, gr = animal), 
                           prior = c(
                             prior(normal(0, 5), class = b),
                             prior(lkj_corr_cholesky(3), class = Lcortime)
                           ),
                           iter = 4000, warmup = 3000, init = 0,
                           data = diggle.cow, 
                           seed = 1001, file = 'fits/diggle_unstructured')
diggle_compoundsymm <- brm(weight_scaled ~ iron * infect * dayafterstart + cosy(time = dayafterstart, gr = animal), 
                           prior = c(
                             prior(normal(0, 5), class = b)
                           ),
                           data = diggle.cow, 
                           seed = 1002, file = 'fits/diggle_compoundsymm')
diggle_ar1 <- brm(weight_scaled ~ iron * infect * dayafterstart + ar(time = dayafterstart, gr = animal, p = 1), 
                  prior = c(
                    prior(normal(0, 5), class = b)
                  ),
                  data = diggle.cow, 
                  seed = 1003, file = 'fits/diggle_ar1')

Comparing the models: posterior predictive check plots

pp_check(diggle_unstructured)
pp_check(diggle_compoundsymm)
pp_check(diggle_ar1)

Cross-validation two different ways

Leave-one-out (LOO) cross-validation does not work well with unstructured covariance model and gives misleading results
K-fold cross-validation works but takes a lot of computation time (each model needs to be refit K times)

kfold(diggle_unstructured, diggle_compoundsymm, diggle_ar1, folds = 'grouped', group = 'animal', K = 5, compare = TRUE)

Model comparisons:
                   elpd_diff se_diff
diggle_ar1             0.0       0.0 
diggle_unstructured  -41.3      19.0 
diggle_compoundsymm -295.1      24.5

Model comparison results

AR(1) model performs best
Compound symmetry performs very poorly
Weight gain of an animal over time is an autoregressive process (cow’s weight at time \(t\) depends on its weight at time \(t-1\))
Compound symmetry might be better for other processes that are subject to random environmental fluctuations at each time (e.g., annual yield trends)

Model comparison results

Unstructured covariance model does much better than it would in a frequentist context
- Bayesian priors “regularize” models (shrink parameters toward zero) so even complex models are not too overfit

Assessing model predictions

Use emmeans package to explore the posterior
emtrends() (estimated marginal trends): posterior estimate of slope for each treatment combo, averaged over random effects
Like emmeans() with one extra argument, var: name of time variable
revpairwise ~ iron | infect: reverse pairwise contrast between each level of iron within each level of infect.
- Reverse pairwise subtracts the control mean from the treatment mean
Slopes are still in standardized units (we will convert to kg/day later)

emt_diggle <- emtrends(diggle_ar1, revpairwise ~ iron | infect, var = 'dayafterstart')

Bayesian p-values

No strong evidence of effect of iron supplementation (but only \(n=7\) per group)
We can also contrast the contrasts if we want

describe_posterior(emt_diggle$contrasts, ci = NULL, test = c('pd', 'p_map'))

Presentation ideas: How to report this analysis in a manuscript?

Description of statistical model for methods section (including model comparison results)
Verbal description of results
Figure
Table

Methods section

We fit a linear mixed model to the trend in standardized body weight over time with residual covariance structure to account for repeated measures within animals. The model had fixed effects for time, iron supplementation, tuberculosis infection, and their interactions. We fit models with unstructured, compound symmetry, and AR(1) residual covariance. A Normal(0, 5) prior distribution was assumed for the fixed effects. An LKJ prior with η = 3 was given to the unstructured residual covariance matrix. Models were fit with Hamiltonian Monte Carlo. The joint posterior distribution was sampled with four Markov chains, run for a variable number of warmup iterations and 1000 sampling iterations. We verified the R-hat statistic was < 1.01 for all parameters, indicating model convergence. In k-fold cross-validation, the first-order autoregressive model performed better than the unstructured (ΔELPD = 41) and compound symmetry models (ΔELPD = 295). Thus we present predictions from the autoregressive model.

Results section

There was little evidence for an effect of iron supplementation on the rate of weight gain. In uninfected animals the posterior estimate of the difference in mean weight gain rate between animals receiving supplemental iron and those that did not was -0.046 kg/day (95% CI [-0.135, 0.046]; p_MAP = 0.64), while in infected animals it was -0.039 kg/day (95% CI [-0.094, 0.015]; p_MAP = 0.40).

Manuscript-quality figure

Raw data and modeled trends superimposed, with 95% credible intervals
Need to calculate expected value of posterior predictive across a grid of time points by treatment combination
day and weight are rescaled to their original scales
incl.autocor = FALSE gives us overall prediction, not individual trends
Fit looks good, possibly a nonlinear trend would slightly improve fit?

cow_weight_preds <- expand.grid(dayafterstart = 0:max(diggle.cow$dayafterstart), iron = c('NoIron', 'Iron'), infect = c('NonInfected', 'Infected')) %>%
  add_epred_draws(diggle_ar1, incl_autocor = FALSE) %>%
  mutate(day = dayafterstart + min(diggle.cow$day),
         weight = .epred * sd(diggle.cow$weight, na.rm = TRUE) + mean(diggle.cow$weight, na.rm = TRUE))

ggplot(diggle.cow, aes(x = day, y = weight, color = iron, fill = iron, linetype = infect)) +
  geom_line(aes(group = animal)) +
  stat_summary(geom = 'ribbon', data = cow_weight_preds, fun.min = ~ quantile(., 0.025), fun.max = ~ quantile(., 0.975), alpha = 0.25, show.legend = FALSE) +
  stat_summary(geom = 'line', data = cow_weight_preds, fun = 'median', linewidth = 2, show.legend = FALSE) +
  colorblind_friendly +
  labs(y = 'weight (kg)') +
  theme(legend.title = element_blank())

Manuscript-quality table

Rescale slopes from standard deviations/day to kg/day units (multiply by sd(weight))
66% and 95% CIs: if the posterior distribution is roughly normal, they are roughly mean ± 1 and 2 SD

weight_slope_summary <- gather_emmeans_draws(emt_diggle$emtrends) %>%
  mutate(.value = .value * sd(diggle.cow$weight, na.rm = TRUE)) %>%
  median_qi(.width = c(0.66, 0.95)) %>%
  pivot_wider(names_from = .width, values_from = c(.lower, .upper)) %>%
  select(-c(.point, .interval))

gt(weight_slope_summary) %>%
  fmt_number(decimals = 3) %>%
  cols_merge(c(.lower_0.66, .upper_0.66), pattern = '[{1}, {2}]') %>% 
  cols_merge(c(.lower_0.95, .upper_0.95), pattern = '[{1}, {2}]') %>%
  cols_label(infect = 'tuberculosis infection', .value = 'weight gain rate (kg/day)', .lower_0.66 = '66% CI', .lower_0.95 = '95% CI') %>%
  tab_options(column_labels.font.weight = 'bold')

Spatial additive models

Explicit space and time terms

Instead of modeling residual covariance we can model space or time directly
It is not just a “nuisance” variable; we can make inference about it
Simple way: just add linear covariate for space and/or time
But the effect is rarely linear, especially with geospatial data

Generalized additive mixed models (GAMMs)

Model the response variable as a function of some combination of unknown smooth functions of the covariates: spline or smoother term
Many different functions can be used
There is a built-in penalty term that prevents overfitting

GAM models fit with different `k`

examples of GAMs fit to motorycle deceleration data

GAMMs

We will only scratch the surface of this complex topic today
See “further reading” section to learn more

Meet the example data

Field trial of barley varieties in Scotland (yay Scotch)
2 fungicide treatments × 70 genotypes, full factorial
Split-plot randomized complete block design
4 blocks, each with 2 main plots (fungicide, fung), each with 70 subplots (genotype, gen)
Response is yield

Visualize layout of experiment

Main plots are laid out adjacent to each other in a long line
Total area of study: 10 rows by 56 beds

data(durban.splitplot)

ggplot(durban.splitplot, aes(x = bed, y = row, fill = block)) +
  geom_tile() +
  theme(aspect.ratio = 10/56, legend.position = 'bottom') +
  ggtitle('Blocks')

ggplot(durban.splitplot, aes(x = bed, y = row, fill = fung)) +
  geom_tile() +
  theme(aspect.ratio = 10/56, legend.position = 'bottom') +
  ggtitle('Main plots (fungicide treatment)')

ggplot(durban.splitplot, aes(x = bed, y = row, fill = gen)) +
  geom_tile() +
  theme(aspect.ratio = 10/56, legend.position = 'none') +
  ggtitle('Subplots (genotypes)')

Examining spatial trends

Can’t we just account for the spatial trends using the block random effect?
Heat map of spatial pattern of yield note single-color palette
Blocks and main plots will only account for environmental factors that vary from east to west

ggplot(durban.splitplot, aes(x = bed, y = row, fill = yield)) +
  geom_tile() +
  scale_fill_distiller(palette = 'Oranges', direction = 1) +
  theme(aspect.ratio = 10/56, legend.position = 'bottom')

Examining spatial trends

For demo purposes, randomly shuffle subplots within each main plot and make another heat map

set.seed(125)

durban.splitplot_shuffle <- durban.splitplot %>%
  group_by(block, fung) %>%
  mutate(yield_shuffled = sample(yield))

ggplot(durban.splitplot_shuffle, aes(x = bed, y = row, fill = yield_shuffled)) +
  geom_tile() +
  scale_fill_distiller(palette = 'Oranges', direction = 1) +
  theme(aspect.ratio = 10/56, legend.position = 'bottom')

Linear mixed model without spatial component

Fixed effects are genotype, fungicide, and interaction: gen + fung + gen:fung
scale(yield) is response variable
- z-transformation makes it easier to define a prior because we know y is in standard deviation units
- normal(0, 3) encompasses large negative and large positive effect sizes
Disregard warnings about divergent transitions for now

durban_lmm <- brm(scale(yield) ~ gen + fung + gen:fung + (1 | block) + (1 | fung:block), 
                  prior = c(
                    prior(normal(0, 3), class = b)
                  ),
                  data = durban.splitplot, 
                  chains = 4, iter = 4000, warmup = 2000,
                  seed = 111, file = 'fits/durban_lmm')

Checking residuals for spatial autocorrelation

If we see spatial pattern in the residuals, our fixed and random effects did not fully account for spatial variation in yield
residuals() gives us mean and 95% quantiles of residuals across all posterior draws
Extract first column (mean), append to the data, and create a heat map

resid_durban_lmm <- residuals(durban_lmm)

durban.splitplot <- mutate(durban.splitplot, resid_nospatial = resid_durban_lmm[,1])

ggplot(durban.splitplot, (aes(x = bed, y = row, fill = resid_nospatial))) +
  geom_tile() +
  scale_fill_distiller(palette = 'Greens', direction = 1) +
  theme(aspect.ratio = 10/56, legend.position = 'bottom')

Moran’s I

Test statistic to measure strength of spatial autocorrelation
Correlation between the difference between any two data points and the spatial distance between them
Uses inverse distance: the higher the correlation, the more likely two close-together data points are to have a similar value
Ranges from -1 to 1
- 0 means no spatial pattern
- 1 means perfect spatial correlation
We don’t want high values of Moran’s I for residuals

Calculation of Moran’s I

Calculate inverse distance matrix of all pairs of spatial points in the dataset
Set diagonal of that matrix (distance between a point and itself) to 0
Moran.I() from ape package gives us the statistic

durban_inv_dist <- 1 / as.matrix(dist(cbind(durban.splitplot$bed, durban.splitplot$row)))
diag(durban_inv_dist) <- 0

Moran.I(durban.splitplot$resid_nospatial, durban_inv_dist)

Additive mixed model

Add spatial smoother with s()
- s(bed, row, k = _, bs = 'gp')
First two arguments are columns of x and y coordinates: bed and row
Next argument is basis dimension k: too low will underfit, too high will be computationally expensive
- Three models: k = 10, k = 25, k = 50
bs = 'gp' is Gaussian process smoother
- Flexible model that does not make strong assumptions about functional form
- May be computationally intensive for big datasets
- Lots of other options but we won’t get into those

Additive mixed model fitting code

Again disregard divergent transition warnings
In practice either set tighter priors, increase warmup iterations, or tweak sampling options

durban_gamm_k10 <- brm(scale(yield) ~ gen + fung + gen:fung + (1 | block) + (1 | fung:block) + s(bed, row, k = 10, bs = 'gp'), 
                       prior = c(
                         prior(normal(0, 3), class = b)
                       ),
                       data = durban.splitplot, chains = 4, iter = 4000, warmup = 2000,
                       seed = 222, file = 'fits/durban_gamm_k10')

durban_gamm_k25 <- brm(scale(yield) ~ gen + fung + gen:fung + (1 | block) + (1 | fung:block) + s(bed, row, k = 25, bs = 'gp'), 
                       prior = c(
                         prior(normal(0, 3), class = b)
                       ),
                       data = durban.splitplot, chains = 4, iter = 4000, warmup = 2000,
                       seed = 222, file = 'fits/durban_gamm_k25')

durban_gamm_k50 <- brm(scale(yield) ~ gen + fung + gen:fung + (1 | block) + (1 | fung:block) + s(bed, row, k = 50, bs = 'gp'), 
                       prior = c(
                         prior(normal(0, 3), class = b)
                       ),
                       data = durban.splitplot, chains = 4, iter = 4000, warmup = 2000,
                       seed = 222, file = 'fits/durban_gamm_k50')

Comparing the models

Some warnings about problematic observations, few enough that we can overlook them

durban_lmm <- add_criterion(durban_lmm, 'loo')
durban_gamm_k10 <- add_criterion(durban_gamm_k10, 'loo')
durban_gamm_k25 <- add_criterion(durban_gamm_k25, 'loo')
durban_gamm_k50 <- add_criterion(durban_gamm_k50, 'loo')
loo_compare(durban_lmm, durban_gamm_k10, durban_gamm_k25, durban_gamm_k50)

Checking residuals of spatial additive model

Heat map of best-performing model (k = 50)

durban.splitplot <- mutate(durban.splitplot, resid_spatial = residuals(durban_gamm_k50)[,1])

ggplot(durban.splitplot, (aes(x = bed, y = row, fill = resid_spatial))) +
  geom_tile() +
  scale_fill_distiller(palette = 'Greens', direction = 1) +
  theme(aspect.ratio = 10/56, legend.position = 'bottom')

Calculate Moran’s I statistic

Moran.I(durban.splitplot$resid_spatial, durban_inv_dist)

Impact of spatial additive term

Check whether inference is different for non-spatial and spatial models
Estimate marginal mean for each combination of genotype and fungicide for each model

emm_durban_nonspatial <- emmeans(durban_lmm, ~ gen + fung)
emm_durban_spatial <- emmeans(durban_gamm_k50, ~ gen + fung)

Combine both estimates into one dataset for plotting
Error bars are 95% highest-density credible intervals
fct_reorder() sorts genotypes in increasing order of mean

means_durban_combined <- bind_rows(
  tibble(model = 'nonspatial', as_tibble(emm_durban_nonspatial)),
  tibble(model = 'spatial', as_tibble(emm_durban_spatial))
)

means_durban_combined %>%
  mutate(gen = fct_reorder(gen, emmean)) %>%
  ggplot(aes(x = gen, y = emmean, ymin = lower.HPD, ymax = upper.HPD, group = interaction(gen, model), color = model)) +
    facet_wrap(~ fung, ncol = 1) +
    geom_errorbar(position = position_dodge(width = .5)) +
    geom_point(position = position_dodge(width = .5), size = 2) +
    colorblind_friendly +
    theme(legend.position = 'bottom', axis.text.x = element_text(angle = 90))

Visualizing spatial additive term

Pull out spatial term from model
- Get expected value of posterior predictive for all combinations of bed, row, gen, and fung
- Only take 10 draws from posterior to speed things up
- Average across the predictor combinations and posterior draws to get one value per coordinate
Make heat map

pred_grid_durban <- expand_grid(
  bed = 1:56,
  row = 1:10,
  gen = unique(durban.splitplot$gen),
  fung = unique(durban.splitplot$fung))

spatial_field_durban <- add_epred_draws(pred_grid_durban, durban_gamm_k50, re_formula = ~ 0, ndraws = 10) %>% 
  group_by(bed, row) %>% 
  summarize(.epred = mean(.epred)) 

ggplot(spatial_field_durban, (aes(x = bed, y = row, fill = .epred))) +
  geom_tile() +
  scale_fill_distiller(palette = 'Reds', direction = 1) +
  theme(aspect.ratio = 10/56, legend.position = 'bottom')

Estimating marginal means

emmeans() to get model predictions averaged over random effects of block and main plot, and over spatial smoother
Results still on z-transformed scale

emms_bygenotype <- emmeans(durban_gamm_k50, pairwise ~ fung | gen)
emms_overall <- emmeans(durban_gamm_k50, pairwise ~ fung)

Presentation ideas: How to report this analysis in a manuscript?

Description of model for methods section
Table
Results write-up and figure left as an exercise!

Methods section

We fit a general additive mixed model to the z-transformed yield variable. The fixed effects were genotype, fungicide, and their interaction, with random intercepts for block and for main plot nested within block. To account for spatial variation, we included a Gaussian process spatial smoothing term. We fit models with basis dimension (k) values 10, 25, and 50. We put a Normal(0, 3) prior on all fixed effects. Models were fit with Hamiltonian Monte Carlo, with 4 Markov chains, each run for 2000 warmup iterations before sampling 2000 draws for each chain. Model convergence was assessed by checking that R-hat < 1.01 for all parameters, and model fit was assessed by examining posterior predictive check plots. We compared models with leave-one-out cross-validation. The model with basis dimension k=50 performed best (ΔELPD 58.8 or greater); thus we present predictions from that model.

Manuscript-quality table

Predictions for fungicides averaged over genotypes
Show a few different credible intervals
Transform from standard deviation units to original units (tonnes/ha)

means_data_wide <- gather_emmeans_draws(emms_overall) %>%
  mutate(.value = .value * sd(durban.splitplot$yield),
         .value = if_else(.grid == 'emmeans', .value + mean(durban.splitplot$yield), .value)) %>%
  median_qi(.width = c(0.66, 0.95, 0.99)) %>%
  mutate(fung = coalesce(fung, contrast)) %>%
  pivot_wider(id_cols = c(fung, .value), names_from = .width, values_from = c(.lower, .upper)) 

gt(means_data_wide) %>%
  fmt_number(decimals = 3) %>%
  cols_merge(c(.lower_0.66, .upper_0.66), pattern = '[{1}, {2}]') %>%
  cols_merge(c(.lower_0.95, .upper_0.95), pattern = '[{1}, {2}]') %>%
  cols_merge(c(.lower_0.99, .upper_0.99), pattern = '[{1}, {2}]') %>%
  cols_label(fung = 'fungicide', .value = 'median', .lower_0.66 = '66% CI', .lower_0.95 = '95% CI', .lower_0.99 = '99% CI') %>%
  tab_options(column_labels.font.weight = 'bold')

Conclusion

Congratulations, you’ve now made it through three lessons of the Bayesian crash course!

We’ve learned . . .

How to fit a model that assumes residuals are correlated in time, and compare different covariance structures
How to diagnose spatial autocorrelation in residuals, and fit a GAMM with a spatial additive term to deal with the problem

You’re really starting to get the hang of this Bayes thing!

See text version of lesson for further reading and useful resources
Please provide feedback to help me improve this course!
- See course homepage for link to feedback form

A crash course in Bayesian mixed models with brms (Lesson 3)

Introduction and review

What is this class?

How to follow the course

What we’ve learned in Lessons 1 and 2

What will we learn in this lesson?

Space and time

Space and time relationship: Example

Residuals are independent … right?

One way to model data correlated in space or time

Another way to model data correlated in space or time

Models with residual covariance for repeated measures in time

Repeated measures within subjects

Load packages

Meet the example data

Scaling the variables

Fitting the model

Residual covariance matrix

Covariance matrix for correlated residuals

Unstructured covariance

Compound symmetry

First-order autoregressive, or AR(1)

Choosing an appropriate covariance structure

Model syntax for residual covariance structures

Model syntax for residual covariance structures

Fitting the residual covariance models

R code for residual covariance models

Comparing the models: posterior predictive check plots

Cross-validation two different ways

Model comparison results

Model comparison results

Assessing model predictions

Bayesian p-values

Presentation ideas: How to report this analysis in a manuscript?

Methods section

Results section

Manuscript-quality figure

Manuscript-quality table

Spatial additive models

Explicit space and time terms

Generalized additive mixed models (GAMMs)

GAM models fit with different k

GAMMs

Meet the example data

Visualize layout of experiment

Examining spatial trends

Examining spatial trends

Linear mixed model without spatial component

Checking residuals for spatial autocorrelation

Moran’s I

Calculation of Moran’s I

Additive mixed model

Additive mixed model fitting code

Comparing the models

Checking residuals of spatial additive model

Impact of spatial additive term

Visualizing spatial additive term

Estimating marginal means

Presentation ideas: How to report this analysis in a manuscript?

Methods section

Manuscript-quality table

Conclusion

We’ve learned . . .

GAM models fit with different `k`