y=Xb+Zu+e where,
[ue]∼N[0,(Gσ2u00Rσ2e)]
Or, u∼N(0,G∗)e∼N(0,R∗)
Morex | Robust | Stander | Excel | |
---|---|---|---|---|
Morex | 1 | 1/2 | 11/32 | 7/16 |
Robust | 1 | 43/64 | 27/32 | |
Stander | 1 | 91/128 | ||
Excel | 1 |
V(u)=AVA=[2111/167/81243/3227/1611/1643/32291/647/827/1691/642]VA
Where the elements of A, the additive relationship matrix, are equal to twice the fXY among inbreds.
Genetic covariances for general relatives is
Cov(X,Y)=2fXYVA+ΔXYVD
Parental contributions to progeny may differ from those expected from pedigrees.
Parental contributions to progeny may differ from those expected from pedigrees.
The expected marker similarity between X and Y is equal to:
SXY=fXY−(1−fXY)θXY
The expected marker similarity between X and Y is equal to:
SXY=fXY−(1−fXY)θXY
If fXY=0, SXY=−θXY
θXY can be estimated from unrelated individuals as
Rearrange the marker similarity equation and replace θXY with θ:
fXY=SXY−θXY1−θXY=SXY−θ1−θ
Rearrange the marker similarity equation and replace θXY with θ:
fXY=SXY−θXY1−θXY=SXY−θ1−θ
Marker-based fXY: the probability of identity by descent (IBD) between two individuals by accounting for the frequency of marker alleles that are identity by state (IBS).
Rearrange the marker similarity equation and replace θXY with θ:
fXY=SXY−θXY1−θXY=SXY−θ1−θ
Marker-based fXY: the probability of identity by descent (IBD) between two individuals by accounting for the frequency of marker alleles that are identity by state (IBS).
Step1: Coefficients of coancestry calculated from genome-wide markers instead of pedigree records
Step2: Genomic relationship (G) matrix replaces the additive relationship matrix or A matrix
Step3: Fit the linear mixed model or the GBLUP model.
Step1: Coefficients of coancestry calculated from genome-wide markers instead of pedigree records
Step2: Genomic relationship (G) matrix replaces the additive relationship matrix or A matrix
Step3: Fit the linear mixed model or the GBLUP model.
Empirical results in plants have shown that GBLUP is superior to BLUP.
For maize yield, the predictive ability ranged from 0.7 to 0.8 with GBLUP vs. 0.66 to 0.79 with BLUP (Bernardo, 1994)
Recent results with high density markers: 0.10-0.25 higher (Albrecht et al., 2014; Schrag et al., 2019)
Genomic selection is a procedure that utilizes a large set of random markers in marker-based selection.
Obtaining genome-enabled prediction of genotypic value
Selection is conducted on the basis of such predicted values.
Genomic selection is a procedure that utilizes a large set of random markers in marker-based selection.
Obtaining genome-enabled prediction of genotypic value
Selection is conducted on the basis of such predicted values.
Meuwissen et al. (2001) published the landmark paper outlining the application of BLUP of allele effects to breeding, specifically in the context of animal breeding.
Meuwissen, et al.,2001. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819-1829.
Heffner, E.L., M.E. Sorrells, and J.L. Jannink. 2009. Genomic selection for crop improvement. Crop Sci. 49:1-12.
Bernardo 2020. textbook
Suppose N=150 F3 families are developed from the cross between two maize inbreds.
These F3 families are evaluated for their testcross performance and are genotyped with N=384 random SNP markers.
The yield trials are conducted in the same set of environments and the data are assumed balanced so that the only fixed effect is the overall mean.
The performance of the testcrosses on an entry-mean basis can be modeled as:
y=1μ+Zm+e
where,
The performance of the testcrosses on an entry-mean basis can be modeled as:
y=1μ+Zm+e
where,
The performance of the testcrosses on an entry-mean basis can be modeled as:
y=1μ+Zm+e
Note that fitting marker effects as random instead of fixed does not require degrees of freedom
The performance of the testcrosses on an entry-mean basis can be modeled as:
y=1μ+Zm+e
Note that fitting marker effects as random instead of fixed does not require degrees of freedom
The covariance matrix of m can be modeled as:
Meuwiseen et al., 2001.
V(m)=IVMi=I(VG/NM)
y=1μ+Zm+e
V(m)=IVMi=I(VG/NM)
This convenient way to calculated the effects of genome-wide markers by BLUP is called ridge regression BLUP or RR-BLUP.
y=1μ+Zm+e
V(m)=IVMi=I(VG/NM)
This convenient way to calculated the effects of genome-wide markers by BLUP is called ridge regression BLUP or RR-BLUP.
(1) Each random marker is assumed to account for an equal amount of the genetic variance
(2) Epistasis is ignored for convenience in the prediction
Four related barley cultivars:
Four related barley cultivars:
Env | Number | inbred | Grain yield | SNP1 | SNP2 | SNP3 |
---|---|---|---|---|---|---|
Set1 | 18 | M | 4.45 | C | A | C |
Set1 | 18 | R | 4.61 | C | G | C |
Set1 | 18 | S | 5.27 | T | A | A |
Set2 | 9 | R | 5.00 | C | G | C |
Set2 | 9 | E | 5.82 | T | G | C |
Set2 | 9 | S | 5.79 | T | A | A |
y=Xb+Zm+e
y=Xb+Zm+e
e is the vector of residuals.
Z is the incidence matrix for the SNP markers
V(m)=IVMi=I(VG/NM)
y=Xb+Zm+e
e is the vector of residuals.
Z is the incidence matrix for the SNP markers
V(m)=IVMi=I(VG/NM)
We will use biallelic SNPs and consider Morex (coded as 1) as our reference
Env | Number | inbred | Grain yield | SNP1 | SNP2 | SNP3 |
---|---|---|---|---|---|---|
Set1 | 18 | M | 4.45 | C | A | C |
Set1 | 18 | R | 4.61 | C | G | C |
Set1 | 18 | S | 5.27 | T | A | A |
Set2 | 9 | R | 5.00 | C | G | C |
Set2 | 9 | E | 5.82 | T | G | C |
Set2 | 9 | S | 5.79 | T | A | A |
Env | Number | inbred | Grain yield | SNP1 | SNP2 | SNP3 |
---|---|---|---|---|---|---|
Set1 | 18 | M | 4.45 | C | A | C |
Set1 | 18 | R | 4.61 | C | G | C |
Set1 | 18 | S | 5.27 | T | A | A |
Set2 | 9 | R | 5.00 | C | G | C |
Set2 | 9 | E | 5.82 | T | G | C |
Set2 | 9 | S | 5.79 | T | A | A |
# let's input data by columnZ <- matrix(c(1,1,-1,1,-1,-1, 1,-1,1,-1,-1,1, 1,1,-1,1,1,-1), byrow=FALSE, nrow=6)Z
## [,1] [,2] [,3]## [1,] 1 1 1## [2,] 1 -1 1## [3,] -1 1 -1## [4,] 1 -1 1## [5,] -1 -1 1## [6,] -1 1 -1
[ˆbˆu]=[X′R−1XX′R−1ZZ′R−1XZ′R−1Z+G−1Ve/Vu]−1[X′R−1yZ′R−1y]
[ˆbˆu]=[X′R−1XX′R−1ZZ′R−1XZ′R−1Z+G−1Ve/Vu]−1[X′R−1yZ′R−1y]
In our case,
[ˆbˆm]=[X′R−1XX′R−1ZZ′R−1XZ′R−1Z+IVe/VMi]−1[X′R−1yZ′R−1y]
[ˆbˆu]=[X′R−1XX′R−1ZZ′R−1XZ′R−1Z+G−1Ve/Vu]−1[X′R−1yZ′R−1y]
In our case,
[ˆbˆm]=[X′R−1XX′R−1ZZ′R−1XZ′R−1Z+IVe/VMi]−1[X′R−1yZ′R−1y]
Here, we suppose the heritability is H2=0.5.
H2=VGVG+VeVMi=VGNMVeVMi=(1−H2)×NMH2=3
y <- matrix(c(4.45, 4.61, 5.27, 5.00, 5.82, 5.79), byrow=FALSE, nrow=6)X <- matrix(c(1,1,1,0,0,0, 0, 0,0, 1, 1,1), byrow=FALSE, nrow=6)R <- matrix(c(1/18,0,0,0,0,0, 0,1/18,0,0,0,0, 0,0,1/18,0,0,0, 0,0,0,1/9,0,0, 0,0,0,0,1/9,0, 0,0,0,0,0,1/9), nrow=6, byrow=T)
y <- matrix(c(4.45, 4.61, 5.27, 5.00, 5.82, 5.79), byrow=FALSE, nrow=6)X <- matrix(c(1,1,1,0,0,0, 0, 0,0, 1, 1,1), byrow=FALSE, nrow=6)R <- matrix(c(1/18,0,0,0,0,0, 0,1/18,0,0,0,0, 0,0,1/18,0,0,0, 0,0,0,1/9,0,0, 0,0,0,0,1/9,0, 0,0,0,0,0,1/9), nrow=6, byrow=T)
[ˆbˆm]=[X′R−1XX′R−1ZZ′R−1XZ′R−1Z+IVe/VMi]−1[X′R−1yZ′R−1y]
y <- matrix(c(4.45, 4.61, 5.27, 5.00, 5.82, 5.79), byrow=FALSE, nrow=6)X <- matrix(c(1,1,1,0,0,0, 0, 0,0, 1, 1,1), byrow=FALSE, nrow=6)R <- matrix(c(1/18,0,0,0,0,0, 0,1/18,0,0,0,0, 0,0,1/18,0,0,0, 0,0,0,1/9,0,0, 0,0,0,0,1/9,0, 0,0,0,0,0,1/9), nrow=6, byrow=T)
[ˆbˆm]=[X′R−1XX′R−1ZZ′R−1XZ′R−1Z+IVe/VMi]−1[X′R−1yZ′R−1y]
solve_mme <- function(X, y, R, Z, H2, nmarker){ a11 <- t(X) %*% solve(R) %*% X a12 <- t(X) %*% solve(R) %*% Z a21 <- t(Z) %*% solve(R) %*% X a22 <- t(Z) %*% solve(R) %*% Z v = (1-H2)*nmarker/H2 a22_2 <- diag(3)*3 lhs <- rbind(cbind(a11, a12), cbind(a21, a22 + a22_2)) rhs <- rbind(t(X) %*% solve(R) %*% y, t(Z) %*% solve(R) %*% y) eff <- solve(lhs) %*% rhs return(eff)}eff <- solve_mme(X, y, R, Z, H2=0.5, nmarker=3)
Env | Number | inbred | Grain yield | SNP1 | SNP2 | SNP3 |
---|---|---|---|---|---|---|
Set1 | 18 | M | 4.45 | C | A | C |
Set1 | 18 | R | 4.61 | C | G | C |
Set1 | 18 | S | 5.27 | T | A | A |
Set2 | 9 | R | 5.00 | C | G | C |
Set2 | 9 | E | 5.82 | T | G | C |
Set2 | 9 | S | 5.79 | T | A | A |
eff
## [,1]## [1,] 4.93475643## [2,] 5.41898437## [3,] -0.34842360## [4,] -0.06523449## [5,] -0.06061120
The effect of SNP1 ^m1=−0.35 indicates that at SNP1, the allele carried by Morex
leads to a lower trait value.
If the genotype at SNP1 changes from CC to CT, the predicted value for yield would increase by 0.35. CC -> TT, increase by 0.7.
y=Xb+Zu+e where,
[ue]∼N[0,(Gσ2u00Rσ2e)]
Or, u∼N(0,G∗)e∼N(0,R∗)
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |