Consider a QTL flanked by two markers segregating in an F2 population.
Consider a QTL flanked by two markers segregating in an F2 population.
To fully understand interval mapping, first we need to cover some basics:
Consider a QTL flanked by two markers segregating in an F2 population.
The recombination frequency between M1 and Q is 0.05 or 5cM and between Q and M2 is 0.20 or 20cM. Assume no interference.
P(QQ|M1M1M2M2)P(Qq|M1M1M2M2)P(qq|M1M1M2M2)
Consider a QTL flanked by two markers segregating in an F2 population.
P(QQ|M1M1M2M2)=P(M1M1QQM2M2)P(M1M1M2M2)
Consider a QTL flanked by two markers segregating in an F2 population.
P(QQ|M1M1M2M2)=P(M1M1QQM2M2)P(M1M1M2M2)
P(M1M1M2M2)=P(M1M2)2=(1−c122)2
Consider a QTL flanked by two markers segregating in an F2 population.
P(QQ|M1M1M2M2)=P(M1M1QQM2M2)P(M1M1M2M2)
P(M1M1M2M2)=P(M1M2)2=(1−c122)2
P(M1M1QQM2M2)=P(M1QM2)2=((1−c1Q)(1−c2Q)2)2
Consider a QTL flanked by two markers segregating in an F2 population.
P(QQ|M1M1M2M2)=P(M1M1QQM2M2)P(M1M1M2M2)
P(M1M1M2M2)=P(M1M2)2=(1−c122)2
P(M1M1QQM2M2)=P(M1QM2)2=((1−c1Q)(1−c2Q)2)2
P(QQ|M1M1M2M2)=((1−c1Q)(1−c2Q)2)2(1−c122)2=0.973
Consider a QTL flanked by two markers segregating in an F2 population.
P(QQ|M1M1M2M2)=((1−c1Q)(1−c2Q)2)2(1−c122)2=0.973
Consider a QTL flanked by two markers segregating in an F2 population.
P(QQ|M1M1M2M2)=((1−c1Q)(1−c2Q)2)2(1−c122)2=0.973
P(Qq|M1M1M2M2)=P(M1M1QqM2M2)P(M1M1M2M2)P(M1M1QqM2M2)=P(M1QM2)P(M1qM2)+P(M1qM2)P(M1QM2)=2((1−c1Q)(1−c2Q)2)(c1Qc2Q2)=0.0038
Here, c1Q=0.05 and c2Q=0.2
Therefore, P(Qq|M1M1M2M2)=P(M1M1QqM2M2)P(M1M1M2M2)=0.026
Consider a QTL flanked by two markers segregating in an F2 population.
P(QQ|M1M1M2M2)=((1−c1Q)(1−c2Q)2)2(1−c122)2=0.973P(Qq|M1M1M2M2)=2((1−c1Q)(1−c2Q)2)(c1Qc2Q2)(1−c122)2=0.026P(qq|M1M1M2M2)=(c1Qc2Q2)2(1−c122)2=1.7×10−4
If the genotypic values for each of the QTL genotypes were given as below:
Genotype | Value | Probability |
---|---|---|
7 | P(QQ/M1M1M2M2)=0.973 | |
5 | P(Qq/M1M1M2M2)=0.026 | |
0 | P(qq/M1M1M2M2)=1.7×10−4 |
If the genotypic values for each of the QTL genotypes were given as below:
Genotype | Value | Probability |
---|---|---|
7 | P(QQ/M1M1M2M2)=0.973 | |
5 | P(Qq/M1M1M2M2)=0.026 | |
0 | P(qq/M1M1M2M2)=1.7×10−4 |
E(M1M1M2M2)=0.973×7+0.026×5+1.7×10−4×0=6.94
What does it mean to calculate the likelihood of somethings?
The likelihood function is represented as L(θ|s)=fθ(s) .
This function represents the likelihood of a certain parameter value ( θ ) given a data vector ( s ).
What does it mean to calculate the likelihood of somethings?
The likelihood function is represented as L(θ|s)=fθ(s) .
This function represents the likelihood of a certain parameter value ( θ ) given a data vector ( s ).
The fθ(s) represents the probability density function with θ set as the parameter and s set as the observations.
The value of L(θ|s) is called the likelihood of θ.
What does it mean to calculate the likelihood of somethings?
The likelihood function is represented as L(θ|s)=fθ(s) .
This function represents the likelihood of a certain parameter value ( θ ) given a data vector ( s ).
To find the value of θ with the maximum likelihood, a range of theta values is tested against the observed data, and the θ giving the highest likelihood is determined to be the maximum likelihood estimator of θ.
Note: we are fixing the data and varying the parameter.
You tossed a coin ten times and observed four heads.
What is the maximum likelihood estimator of p, the probability of obtaining a head?
You tossed a coin ten times and observed four heads.
What is the maximum likelihood estimator of p, the probability of obtaining a head?
\begin{align*} & L(p | k) = \binom{n}{k}p^kq^{n-k} \\ & L(p | 4) = \binom{10}{4}p^4q^{10-4} \\ \end{align*}
You tossed a coin ten times and observed four heads.
What is the maximum likelihood estimator of p, the probability of obtaining a head?
\begin{align*} & L(p | k) = \binom{n}{k}p^kq^{n-k} \\ & L(p | 4) = \binom{10}{4}p^4q^{10-4} \\ \end{align*}
\begin{align*} & L(0 | 4) = 0; L(0.1 | 4) = 0.01; L(0.2 | 4) = 0.09; ... \\ & L(0.4 | 4) = 0.25; ... \\ & L(0.7 | 4) = 0.04; ... ; L(1.0 | 4) = 0 \\ \end{align*}
p=0.4 is our maximum likelihood (ML) estimator for p.
When a major bi-allelic locus is segregating in a population. The distribution of the entire population can be broken into three underlying distributions:
When a major bi-allelic locus is segregating in a population. The distribution of the entire population can be broken into three underlying distributions:
The likelihood of the genotypic parameters given phenotypic value z is:
\begin{align*} L(z) & = L(\mu_{QQ}, \mu_{Qq}, \mu_{qq}, \sigma^2 | z) \\ & = P(QQ)f(z, \mu_{QQ}, \sigma^2) + P(Qq)f(z, \mu_{Qq}, \sigma^2) + P(qq)f(z, \mu_{qq}, \sigma^2)\\ \end{align*}
Where P(Q_k) equals the probability of a particular genotype
f(z, \mu_k, \sigma^2) is the probability density function for a normally distributed random variable with mean \mu_k and variance \sigma^2.
When a major bi-allelic locus is segregating in a population. The distribution of the entire population can be broken into three underlying distributions:
The likelihood of the genotypic parameters given phenotypic value z is:
\begin{align*} L(z) & = L(\mu_{QQ}, \mu_{Qq}, \mu_{qq}, \sigma^2 | z) \\ & = P(QQ)f(z, \mu_{QQ}, \sigma^2) + P(Qq)f(z, \mu_{Qq}, \sigma^2) + P(qq)f(z, \mu_{qq}, \sigma^2)\\ \end{align*}
For n random (unrelated) individuals, the overall likelihood is the product of the n individual likelihoods
\begin{align*} L(z_1, z_2, .., z_n) & = L(\mathbf{z}) = \prod_{j=1}^{n}{L(z_j)}\\ \end{align*}
Now, let's return to our conditional probabilities, specifically the probability of a QTL genotype given a marker genotype.
The likelihood of an individual with phenotypic value z given a marker genotype M_i is represented as:
\begin{align*} L(z|M_i) & = P(QQ|M_i)f(z, \mu_{QQ}, \sigma^2) + P(Qq|M_i)f(z, \mu_{Qq}, \sigma^2) + P(qq|M_i)f(z, \mu_{qq}, \sigma^2)\\ \end{align*}
Now, let's return to our conditional probabilities, specifically the probability of a QTL genotype given a marker genotype.
The likelihood of an individual with phenotypic value z given a marker genotype M_i is represented as:
\begin{align*} L(z|M_i) & = P(QQ|M_i)f(z, \mu_{QQ}, \sigma^2) + P(Qq|M_i)f(z, \mu_{Qq}, \sigma^2) + P(qq|M_i)f(z, \mu_{qq}, \sigma^2)\\ \end{align*}
For example, the likelihood for genotype MM is: \begin{align*} L(z|MM) & = P(QQ|MM)f(z, \mu_{QQ}, \sigma^2) + P(Qq|MM)f(z, \mu_{Qq}, \sigma^2) + P(qq|MM)f(z, \mu_{qq}, \sigma^2)\\ \end{align*}
\begin{align*} L(z|MM) & = (1-c)^2f(z, \mu_{QQ}, \sigma^2) + 2c(1-c)f(z, \mu_{Qq}, \sigma^2) + c^2f(z, \mu_{qq}, \sigma^2)\\ \end{align*}
To calculate the likelihoods for an interval, we simply insert the probabilities of a QTL genotype given a marker interval genotype.
For example,
\begin{align*} L(z|M_1M_1M_2M_2) = & \frac{(1-c_1)^2(1-c_2)^2}{(1-c_{12})^2}f(z, \mu_{QQ}, \sigma^2) \\ & + \frac{2c_1c_2(1-c_1)(1-c_2)}{(1-c_{12})^2}f(z, \mu_{Qq}, \sigma^2) \\ & + \frac{c_1^2c_2^2}{(1-c_{12})^2}f(z, \mu_{qq}, \sigma^2)\\ \end{align*}
To calculate the likelihoods for an interval, we simply insert the probabilities of a QTL genotype given a marker interval genotype.
For example,
\begin{align*} L(z|M_1M_1M_2M_2) = & \frac{(1-c_1)^2(1-c_2)^2}{(1-c_{12})^2}f(z, \mu_{QQ}, \sigma^2) \\ & + \frac{2c_1c_2(1-c_1)(1-c_2)}{(1-c_{12})^2}f(z, \mu_{Qq}, \sigma^2) \\ & + \frac{c_1^2c_2^2}{(1-c_{12})^2}f(z, \mu_{qq}, \sigma^2)\\ \end{align*}
This likelihood value is calculated for each genetic position in between the two flanking markers by varying value of recombination rate (c).
The span of the entire interval ( c_{12} ) is calculated using a mapping function.
The values of \mu_{QQ}, \mu_{Qq}, \mu_{qq}, \sigma^2 are estimated at each genetic position.
The ratio is converted to the famous logarithm of odds (LOD) score
\begin{align*} LOD = log_{10}(\frac{L_{full}}{L_{reduced}}) \end{align*}
The ratio is converted to the famous logarithm of odds (LOD) score
\begin{align*} LOD = log_{10}(\frac{L_{full}}{L_{reduced}}) \end{align*}
If the LOD score is 3, for example, this means that the likelihood for a model including a QTL at the given genetic position is 1,000 times higher than no QTL at that position!
QTL mapping involves a large number of tests, which requires adjustments for multiple testing to keep the experiment-wise error rate low.
QTL mapping involves a large number of tests, which requires adjustments for multiple testing to keep the experiment-wise error rate low.
A commonly used technique for QTL mapping.
QTL mapping involves a large number of tests, which requires adjustments for multiple testing to keep the experiment-wise error rate low.
A commonly used technique for QTL mapping.
Basically, the phenotypic data is randomized relative to the marker data so that the null hypothesis is established.
Then, the test statistic for each QTL is calculated and the largest test statistic across the genome is tabulated.
library(qtl)set.seed(12347)# Five autosomes of cM length 50, 75, 100, 125, 60L <- c(50, 75, 100, 125, 60)map <- sim.map(L, L/5+1, eq.spacing=FALSE, include.x=FALSE)# Simulate a backcross with two QTLa <- 0.7mymodel <- rbind(c(1, 40, a), c(4, 100, a))pop <- sim.cross(map, type="bc", n.ind=200, model=mymodel)plot.map(pop)
hist(pop$pheno$phenotype, main="simulated phenotype", breaks=50, xlab="Phenotype", col="#cdb79e")
# single-QTL scan by marker regression with the simulated dataout.mr <- scanone(pop, method="mr")# plot of marker regression results for chr 4 and 12plot(out.mr, chr=c(1,2,3,4,5), ylab="LOD Score")
This is a version of interval mapping which is a very good approximation to interval mapping via maximum likelihood.
# single-QTL scan using Haley-knott Regression approachout.hk <- scanone(pop, method="hk")# plot of marker regression results for chr 4 and 12plot(out.hk, chr=c(1,2,3,4,5), ylab="LOD score")
# summary of out.mrsummary(out.mr, threshold=3)
## chr pos lod## D1M4 1 32 3.77
effectplot(pop, mname1="D1M4", main="Chr1")
Consider a QTL flanked by two markers segregating in an F2 population.
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |