MultiBLUP

Please note that we no longer recommend using MultiBLUP, and therefore this page exists only for completeness. We instead recommend you use the newer Prediction tools, which are generally faster and more accurate.

MultiBLUP is a generalisation of BLUP (Best Linear Unbiased Prediction), used for constructing prediction models. Full details of the method are provided in our Genome Research paper. In brief, MultiBLUP extends the BLUP model to allow for k+r genetic random effect terms:

Y = a + g1 + g2 + ... + gk+r + e    where    gj~N(0,Kjvj2) and e~N(0,Ive2)

where the first k random effects correspond to standard kinship matrices (typically full-rank), while the last r correspond to regional kinship matrices, each constructed from a subset of predictors (typically low-rank).
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

There are two steps when constructing a MultiBLUP prediction model: first you perform REML to estimate the variance terms v12, ..., vk+r2, ve2, then given these you calculate the BLUE estimates of SNP effect sizes.

MultiBLUP improves on BLUP when the kinship matrices correspond to subsets of predictors with distinct effect size variances. Either you can use Pre-specified Kinships (i.e., constructed based on your prior knowledge) or use Adaptive MultiBLUP to automatically decide the kinship matrices.

Note that when the focus is on prediction, rather than on estimating variance explained, we do not generally advise using SNP weightings (see here for our reasoning). Therefore, kinship matrices should be computed with the option --ignore-weights YES.