Pop-gen
is the study of evolution.
The language of pop-gen
is Mathematics.
Pop-gen
is the study of evolution.
The language of pop-gen
is Mathematics.
Quant-gen
is the study of the complex trait, or phenotype.Quant-gen
is Statistics.In the next couple weeks, we will be deeply involved with the statistical evalution of the basic quantiative genetic models.
Many genetic factors
Many genetic factors
Genetic factors act in pairs (two alleles per locus)
Many genetic factors
Genetic factors act in pairs (two alleles per locus)
Passed on to progeny at random
Many genetic factors
Genetic factors act in pairs (two alleles per locus)
Passed on to progeny at random
Genetic factors sometimes show independent assortment
[Y1Y2⋮Yn]⏟n×1=[X11X12⋯X1mX21X22⋯X2m⋮⋮⋱⋮Xn1Xn2⋯Xnm]⏟n×m[a1a2⋮am]⏟m×1+[ϵ1ϵ2⋮ϵm]⏟n×1
[Y1Y2⋮Yn]⏟n×1=[X11X12⋯X1mX21X22⋯X2m⋮⋮⋱⋮Xn1Xn2⋯Xnm]⏟n×m[a1a2⋮am]⏟m×1+[ϵ1ϵ2⋮ϵm]⏟n×1
Yi=j=m∑j=1Xijαj+ϵi
Yi=j=m∑j=1Xijαj+ϵi
For a given individual ( i=1 ) with a number of loci ( m=1,000 )
Each allele is Xj∈(A,a) , with the probability of p or q=1−p
Yi=j=m∑j=1Xijαj+ϵi
For a given individual ( i=1 ) with a number of loci ( m=1,000 )
Each allele is Xj∈(A,a) , with the probability of p or q=1−p
The effect of jth allele ( αj ) can be samples from any distribution (e.g., uniform distribution)
According to the CLT, if m is sufficiently large, the sum is normally distributed.
Yi=j=m∑j=1Xijαj+ϵi
For a given individual ( i=1 ) with a number of loci ( m=1,000 )
Each allele is Xj∈(A,a) , with the probability of p or q=1−p
The effect of jth allele ( αj ) can be samples from any distribution (e.g., uniform distribution)
According to the CLT, if m is sufficiently large, the sum is normally distributed.
m <- 1000## for each allele, the chance of A or a is equal to 0.5x <- rbinom(n=m, size=1, prob=0.5)## sample effect from a uniform distribution:a <- runif(n=m)y <- sum(x*a) + 0y
## [1] 243.8146
Yi=j=m∑j=1Xijαj+ϵi
set.seed(1234) # seed for random number generatorm <- 1000n = 2000 # simulate a population of 2,000 individualsout <- c() # create an empty vectorfor(i in 1:n){ x <- rbinom(n=m, size=1, prob=0.5) ## for each allele, the chance of A = 0.5 a <- runif(n=m) ## sample effect from a uniform distribution: y <- sum(x*a) out <- c(out, y)} #shapiro.test(out) # W = 0.99928, p-value = 0.6622hist(out, breaks=50, col="#b8860b", main="Phenotype Distribution", xlab="")
Yi=j=m∑j=1Xijαj+ϵi
set.seed(1234) # seed for random number generatorm <- 2n = 2000 # simulate a population of 2,000 individualsout <- c()for(i in 1:n){ x <- rbinom(n=m, size=1, prob=0.5) ## for each allele, the chance of A = 0.5 a <- runif(n=m) ## sample effect from a uniform distribution: y <- sum(x*a) out <- c(out, y)} #shapiro.test(out) # W = 0.91117, p-value < 2.2e-16hist(out, breaks=50, col="#b8860b", main="Phenotype Distribution", xlab="")
For a continuous trait, i.e., kernel number per ear (raning from 0 to 1,000), what is the Pr(Y=100)?
For a continuous trait, i.e., kernel number per ear (raning from 0 to 1,000), what is the Pr(Y=100)?
Assuming wheat plant height in a population is normally distributed, with m = 30 inch, sd=5. Question: Pr(30<Y≤50)?
For a continuous trait, i.e., kernel number per ear (raning from 0 to 1,000), what is the Pr(Y=100)?
Assuming wheat plant height in a population is normally distributed, with m = 30 inch, sd=5. Question: Pr(30<Y≤50)?
Using the probability density function (or pdf) and integration, we can calculate the probability that Y is contained in a certain bracket as:
Pr(30<Y≤50)=∫5030f(y)dy
For a continuous trait, i.e., kernel number per ear (raning from 0 to 1,000), what is the Pr(Y=100)?
Assuming wheat plant height in a population is normally distributed, with m = 30 inch, sd=5. Question: Pr(30<Y≤50)?
Using the probability density function (or pdf) and integration, we can calculate the probability that Y is contained in a certain bracket as:
Pr(30<Y≤50)=∫5030f(y)dy
Define the random variable X (i.e. for genotype) which counts the number of allele A. X={2if AA with frequency p21if Aa with frequency 2p(1−p)0if aa with frequency (1−p)2 where p is the allele frequency of A.
Define the random variable X (i.e. for genotype) which counts the number of allele A. X={2if AA with frequency p21if Aa with frequency 2p(1−p)0if aa with frequency (1−p)2 where p is the allele frequency of A.
Then, according to Formula (1) in the Note:
E(f(X))=k∑i=1f(xi)Pr(X=xi)
Define the random variable X (i.e. for genotype) which counts the number of allele A. X={2if AA with frequency p21if Aa with frequency 2p(1−p)0if aa with frequency (1−p)2 where p is the allele frequency of A.
Then, according to Formula (1) in the Note:
E(f(X))=k∑i=1f(xi)Pr(X=xi)
Expected value of X:
Define the random variable X (i.e. for genotype) which counts the number of allele A. X={2if AA with frequency p21if Aa with frequency 2p(1−p)0if aa with frequency (1−p)2 where p is the allele frequency of A.
Then, according to Formula (1) in the Note:
E(f(X))=k∑i=1f(xi)Pr(X=xi)
Expected value of X:
E[X]=0×(1−p)2+1×[2p(1−p)]+2×p2=2p
Define the random variable X which counts the number of allele A. X={2if AA with frequency p21if Aa with frequency 2p(1−p)0if aa with frequency (1−p)2 where p is the allele frequency of A.
Define the random variable X which counts the number of allele A. X={2if AA with frequency p21if Aa with frequency 2p(1−p)0if aa with frequency (1−p)2 where p is the allele frequency of A.
Expected value of X2:
Define the random variable X which counts the number of allele A. X={2if AA with frequency p21if Aa with frequency 2p(1−p)0if aa with frequency (1−p)2 where p is the allele frequency of A.
Expected value of X2:
E[X2]=02×(1−p)2+12×[2p(1−p)]+22×p2=2p(1−p)+4p2
Define the random variable X which counts the number of allele A. X={2if AA with frequency p21if Aa with frequency 2p(1−p)0if aa with frequency (1−p)2 where p is the allele frequency of A.
Expected value of X2:
E[X2]=02×(1−p)2+12×[2p(1−p)]+22×p2=2p(1−p)+4p2
Thus, the variance of allelic counts is
Define the random variable X which counts the number of allele A. X={2if AA with frequency p21if Aa with frequency 2p(1−p)0if aa with frequency (1−p)2 where p is the allele frequency of A.
Expected value of X2:
E[X2]=02×(1−p)2+12×[2p(1−p)]+22×p2=2p(1−p)+4p2
Thus, the variance of allelic counts is
Var(X)=E[X2]−E[X]2=2p(1−p)+4p2−(2p)2=2p(1−p)
Define the random variable X as below:
X={1if AA with frequency p20if Aa with frequency 2p(1−p)−1if aa with frequency (1−p)2 where p is the allele frequency of A.
Define the random variable X as below:
X={1if AA with frequency p20if Aa with frequency 2p(1−p)−1if aa with frequency (1−p)2 where p is the allele frequency of A.
Then, E[X]=−1×(1−p)2+0×[2p(1−p)]+1×p2=−(1−2p+p2)+p2=2p−1E[X2]=(−1)2×(1−p)2+02×[2p(1−p)]+12×p2=1−2p+p2+p2=2p2−2p+1 Thus, the variance of allelic counts is Var(X)=E[X2]−E[X]2=2p2−2p+1−(4p2−4p+1)=−2p2+2p=2p(1−p)
Two variables: Genotype and Milk Yield (MY)
Genotype (G) | MY≤100 | 100<MY≤300 | MY>300 | Marginal Pr(G) |
---|---|---|---|---|
aa | 0.10 | 0.04 | 0.02 | 0.16 |
Aa | 0.14 | 0.18 | 0.16 | 0.48 |
AA | 0.06 | 0.10 | 0.20 | 0.36 |
Marg. Prob. | 0.30 | 0.32 | 0.38 | 1.00 |
Two variables: Genotype and Milk Yield (MY)
Genotype (G) | MY≤100 | 100<MY≤300 | MY>300 | Marginal Pr(G) |
---|---|---|---|---|
aa | 0.10 | 0.04 | 0.02 | 0.16 |
Aa | 0.14 | 0.18 | 0.16 | 0.48 |
AA | 0.06 | 0.10 | 0.20 | 0.36 |
Marg. Prob. | 0.30 | 0.32 | 0.38 | 1.00 |
Two random variables to occur together.
Two variables: Genotype and Milk Yield (MY)
Genotype (G) | MY≤100 | 100<MY≤300 | MY>300 | Marginal Pr(G) |
---|---|---|---|---|
aa | 0.10 | 0.04 | 0.02 | 0.16 |
Aa | 0.14 | 0.18 | 0.16 | 0.48 |
AA | 0.06 | 0.10 | 0.20 | 0.36 |
Marg. Prob. | 0.30 | 0.32 | 0.38 | 1.00 |
Two random variables to occur together.
Two variables: Genotype and Milk Yield (MY)
Genotype (G) | MY≤100 | 100<MY≤300 | MY>300 | Marginal Pr(G) |
---|---|---|---|---|
aa | 0.10 | 0.04 | 0.02 | 0.16 |
Aa | 0.14 | 0.18 | 0.16 | 0.48 |
AA | 0.06 | 0.10 | 0.20 | 0.36 |
Marg. Prob. | 0.30 | 0.32 | 0.38 | 1.00 |
Two variables: Genotype and Milk Yield (MY)
Genotype (G) | MY≤100 | 100<MY≤300 | MY>300 | Marginal Pr(G) |
---|---|---|---|---|
aa | 0.10 | 0.04 | 0.02 | 0.16 |
Aa | 0.14 | 0.18 | 0.16 | 0.48 |
AA | 0.06 | 0.10 | 0.20 | 0.36 |
Marg. Prob. | 0.30 | 0.32 | 0.38 | 1.00 |
A sum of mutually exclusive and exhaustive set of events.
Two variables: Genotype and Milk Yield (MY)
Genotype (G) | MY≤100 | 100<MY≤300 | MY>300 | Marginal Pr(G) |
---|---|---|---|---|
aa | 0.10 | 0.04 | 0.02 | 0.16 |
Aa | 0.14 | 0.18 | 0.16 | 0.48 |
AA | 0.06 | 0.10 | 0.20 | 0.36 |
Marg. Prob. | 0.30 | 0.32 | 0.38 | 1.00 |
A sum of mutually exclusive and exhaustive set of events.
Two variables: Genotype and Milk Yield (MY)
Genotype (G) | MY≤100 | 100<MY≤300 | MY>300 | Marginal Pr(G) |
---|---|---|---|---|
aa | 0.10 | 0.04 | 0.02 | 0.16 |
Aa | 0.14 | 0.18 | 0.16 | 0.48 |
AA | 0.06 | 0.10 | 0.20 | 0.36 |
Marg. Prob. | 0.30 | 0.32 | 0.38 | 1.00 |
Two variables: Genotype and Milk Yield (MY)
Genotype (G) | MY≤100 | 100<MY≤300 | MY>300 | Marginal Pr(G) |
---|---|---|---|---|
aa | 0.10 | 0.04 | 0.02 | 0.16 |
Aa | 0.14 | 0.18 | 0.16 | 0.48 |
AA | 0.06 | 0.10 | 0.20 | 0.36 |
Marg. Prob. | 0.30 | 0.32 | 0.38 | 1.00 |
Pr(X=x|Y=y)=Pr(X=x,Y=y)Pr(Y=y)
Two variables: Genotype and Milk Yield (MY)
Genotype (G) | MY≤100 | 100<MY≤300 | MY>300 | Marginal Pr(G) |
---|---|---|---|---|
aa | 0.10 | 0.04 | 0.02 | 0.16 |
Aa | 0.14 | 0.18 | 0.16 | 0.48 |
AA | 0.06 | 0.10 | 0.20 | 0.36 |
Marg. Prob. | 0.30 | 0.32 | 0.38 | 1.00 |
Pr(X=x|Y=y)=Pr(X=x,Y=y)Pr(Y=y)
Pop-gen
is the study of evolution.
The language of pop-gen
is Mathematics.
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |