chapter_18
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
chapter_18 [2024/09/08 16:05] – [Genetic drift/founder effect] mike | chapter_18 [2024/12/24 04:51] (current) – [An example of calculating allele frequency in humans: albinism] mike | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | <- chapter_17|Chapter 17^table_of_contents|Table of Contents^chapter_19|Chapter 19 -> | ||
+ | |||
<typo fs: | <typo fs: | ||
- | Until now, we have been carrying out genetic analysis | + | In earlier chapters, we have been carrying out genetic analysis |
+ | |||
+ | In the next several chapters, we will use a substantial amount of math. To avoid confusion with fractions, we will write genotypes as $A/a$ instead of $\frac{A}{a}$. | ||
===== The Hardy Weinberg equilibrium ===== | ===== The Hardy Weinberg equilibrium ===== | ||
- | At the heart of population genetics is the concept of allele frequency. Consider a human gene with two alleles:$A$ and $a$. The frequency of $A$ is $f(A)$ ; the frequency of a is $f(a)$. We define the following symbols: | + | At the heart of population genetics is the concept of allele frequency. Consider a human gene with two alleles:$A$ and $a$. The frequency of $A$ is $f(A)$ ; the frequency of $a$ is $f(a)$. We define the following symbols: |
<figure Fig1> | <figure Fig1> | ||
Line 29: | Line 33: | ||
<figure Fig3> | <figure Fig3> | ||
- | $$f(\frac{A}{A}) + f(\frac{A}{a}) + f(\frac{a}{a}) = 1$$ | + | $$f(A/A) + f(A/a) + f(a/a) = 1$$ |
< | < | ||
- | Three possible genotypes with two alleles, and how we represent their genotype frequencies. All possible genotype frequencies must add up to one. Note that the " | + | Three possible genotypes with two alleles, and how we represent their genotype frequencies. All possible genotype frequencies must add up to one. |
</ | </ | ||
</ | </ | ||
Line 39: | Line 43: | ||
<figure Fig4> | <figure Fig4> | ||
- | $$p = f(\frac{A}{A})+\frac{1}{2} f(\frac{A}{a})\\ | + | $$p = f(A/A)+\frac{1}{2} f(A/a)\\ |
- | q = f(\frac{a}{a})+\frac{1}{2} f(\frac{A}{a})$$ | + | q = f(a/a)+\frac{1}{2} f(A/a)$$ |
< | < | ||
Derivation of allele frequencies based on genotype frequencies. | Derivation of allele frequencies based on genotype frequencies. | ||
Line 46: | Line 50: | ||
</ | </ | ||
- | Here is an example: $M$ and $N$ are alleles of a gene that specifies different blood antigens. The alleles are codominant, so a simple blood test can distinguish the three possible genotypes of $\frac{M}{M}$, $\frac{M}{N}$, and $\frac{N}{N}$. A survey of the population reveals that 83% of the population has only the M antigen, 1% of the population only has the N antigen, and 16% of the population has both the M and N antigens. In other words: | + | Here is an example: $M$ and $N$ are alleles of a gene that specifies different blood antigens. The alleles are codominant, so a simple blood test can distinguish the three possible genotypes of $M/M$, $M/N$, and $N/N$. A survey of the population reveals that 83% of the population has only the $M$ antigen, 1% of the population only has the $N$ antigen, and 16% of the population has both the $M$ and $N$ antigens. In other words: |
<figure Fig5> | <figure Fig5> | ||
- | $$f(\frac{M}{M}) = 0.83, f(\frac{M}{N}) = 0.16, f(\frac{N}{N}) = 0.01$$ | + | $$f(M/M) = 0.83, f(M/N) = 0.16, f(N/N) = 0.01$$ |
< | < | ||
Observed phenotype frequencies (and therefore genotype frequencies) of hypothetical blood antigens $M$ and $N$. | Observed phenotype frequencies (and therefore genotype frequencies) of hypothetical blood antigens $M$ and $N$. | ||
Line 68: | Line 72: | ||
<figure Fig7> | <figure Fig7> | ||
- | $$f(\frac{M}{M}) + f(\frac{M}{N}) + f(\frac{N}{N}) = 1$$ | + | $$f(M/M) + f(M/N) + f(N/N) = 1$$ |
< | < | ||
- | Three possible genotypes with two alleles in a population $M$ and $N$, with their genotype frequencies adding up to one. Note that as before, the " | + | Three possible genotypes with two alleles in a population $M$ and $N$, with their genotype frequencies adding up to one. This figure is essentially the same as Fig. {{ref> |
</ | </ | ||
</ | </ | ||
- | Now let's think about how the inverse calculation would be performed. That is, how would we derive the genotype frequencies for the " | + | Now let's think about how the inverse calculation would be performed. That is, how would we derive the genotype frequencies for the " |
<table Tab1> | <table Tab1> | ||
Line 81: | Line 85: | ||
^ | ^ | ||
^ sperm ^ $M$ ($p$) ^ $N$ ($q$) ^ | ^ sperm ^ $M$ ($p$) ^ $N$ ($q$) ^ | ||
- | ^ $M$ ($p$) | $\frac{M}{M}$ ($p^2$ | + | ^ $M$ ($p$) | $M/M$ ($p^2$) |
- | ^ $N$ ($q$) | $\frac{M}{N}$ ($p \cdot q$ ) | $\frac{M}{M}$ ($q^2$) | + | ^ $N$ ($q$) | $N/M$ ($p \cdot q$ ) | $N/N$ ($q^2$) |
</ | </ | ||
< | < | ||
Line 93: | Line 97: | ||
<figure Fig8> | <figure Fig8> | ||
<div centeralign> | <div centeralign> | ||
- | $f(\frac{M}{M}) = p^2$, $f(\frac{M}{N}) = 2pq$, and $f(\frac{N}{N}) = q^2$ | + | $f(M/M) = p^2$, $f(M/N) = 2pq$, and $f(N/N) = q^2$ |
</ | </ | ||
< | < | ||
- | Calculating genotype frequencies based on allele frequencies, | + | Calculating genotype frequencies based on allele frequencies, |
</ | </ | ||
</ | </ | ||
Line 103: | Line 107: | ||
<figure Fig9> | <figure Fig9> | ||
- | $$\begin{aligned} p_1 &= f(\frac{M}{M}) + 1/2 f(\frac{M}{N})\\ | + | $$\begin{aligned} p_1 &= f(M/M) + 1/2 f(M/N)\\ |
&= p^2 + pq\\ | &= p^2 + pq\\ | ||
&=p (p +q)\\ | &=p (p +q)\\ | ||
Line 121: | Line 125: | ||
<table Tab2> | <table Tab2> | ||
<columns 100% *100%*> | <columns 100% *100%*> | ||
- | ^ ^ $\frac{M}{M}$ ^ $\frac{M}{N}$ ^ $\frac{N}{N}$ ^ $p$ ^ $q$ ^ | + | ^ ^ $M/M$ ^ $M/N$ ^ $N/N$ ^ $p$ ^ $q$ ^ |
^ U.S. Caucasians | ^ U.S. Caucasians | ||
^ American Inuit | 0.84 | 0.16 | 0.008 | 0.92 | 0.08 | | ^ American Inuit | 0.84 | 0.16 | 0.008 | 0.92 | 0.08 | | ||
Line 130: | Line 134: | ||
</ | </ | ||
- | Although the allele frequencies are quite different, both populations have the genotype frequencies and allele frequencies that fit the Hardy-Weinberg equilibrium. For instance, in the U.S. Caucasian population, the frequencies of $\frac{M}{M}$, $\frac{M}{N}$, and $\frac{N}{N}$ are obtained by surveying the population (this is your primary data). From these frequencies, | + | Although the allele frequencies are quite different, both populations have the genotype frequencies and allele frequencies that fit the Hardy-Weinberg equilibrium. For instance, in the U.S. Caucasian population, the frequencies of $M/M$, $M/N$, and $N/N$ are obtained by surveying the population (this is your primary data). From these frequencies, |
Now let's consider two sample populations that have the same allele frequencies but have different genotype frequencies. | Now let's consider two sample populations that have the same allele frequencies but have different genotype frequencies. | ||
Line 136: | Line 140: | ||
<table Tab3> | <table Tab3> | ||
<columns 100% *100%*> | <columns 100% *100%*> | ||
- | ^ ^ $\frac{M}{M}$ ^ $\frac{M}{N}$ ^ $\frac{N}{N}$ ^ $p$ ^ $q$ ^ | + | ^ ^ $M/M$ ^ $M/N$ ^ $N/N$ ^ $p$ ^ $q$ ^ |
^ population 1 | 0.2 | 0.2 | 0.6 | 0.3 | 0.7 | | ^ population 1 | 0.2 | 0.2 | 0.6 | 0.3 | 0.7 | | ||
^ population 2 | 0.09 | 0.42 | 0.49 | 0.3 | 0.7 | | ^ population 2 | 0.09 | 0.42 | 0.49 | 0.3 | 0.7 | | ||
Line 145: | Line 149: | ||
</ | </ | ||
- | Based on the observed genotype frequencies of population 1, we can calculate $p$ and $q$, and we can further derive $p^2=0.09$, $2pq=0.42$, and $q^2=0.49$. This clearly does not match the observed genotype frequencies of $\frac{M}{M}=0.2$, $\frac{M}{N}=0.2$, and $\frac{N}{N}=0.6$, which is what we predict the $\frac{M}{M}$ genotype frequency would be based on allele frequency $p$. $p^2$ is greater than the observed genotype frequency $f(\frac{M}{M})$ — suggesting something is happening to cause the $M$ allele to be underrepresented after a generation of breeding. Population 1 is not in a Hardy-Weinberg equilibrium. | + | Based on the observed genotype frequencies of population 1, we can calculate $p$ and $q$, and we can further derive $p^2=0.09$, $2pq=0.42$, and $q^2=0.49$. This clearly does not match the observed genotype frequencies of $f(M/M)=0.2$, $f(M/N)=0.2$, and $f(N/N)=0.6$. $p^2$ is greater than the observed genotype frequency $f(M/M)$ — suggesting something is happening to cause the $M$ allele to be underrepresented after a generation of breeding. Population 1 is not in Hardy-Weinberg equilibrium. |
- | We can do the same things | + | We can do the same analysis |
- | Here is a graph showing the relationship between allele and genotype frequencies for genes that are in a Hardy-Weinberg equilibrium: | + | Here is a graph showing the relationship between allele and genotype frequencies for genes that are in Hardy-Weinberg equilibrium: |
<figure Fig10> | <figure Fig10> | ||
{{ : | {{ : | ||
< | < | ||
- | Relationship between allele frequency $p$ (and $q$) and genotype frequency for genes in a Hardy-Weinberg equilibrium. | + | Relationship between allele frequency $p$ (and $q$) and genotype frequency for genes in Hardy-Weinberg equilibrium. |
</ | </ | ||
</ | </ | ||
Line 162: | Line 166: | ||
===== Hardy-Weinberg vs. the real (human) world ===== | ===== Hardy-Weinberg vs. the real (human) world ===== | ||
- | Our discussion above of the Hardy-Weinberg equilibrium assumes that there is random mating in human populations. That is to say, in order for a population to be in a Hardy-Weinberg equilibrium, | + | Our discussion above of the Hardy-Weinberg equilibrium assumes that there is random mating in human populations. That is to say, in order for a population to be in a Hardy-Weinberg equilibrium, |
==== Genotypic effects on choice of mating partner ==== | ==== Genotypic effects on choice of mating partner ==== | ||
- | Examination of allele frequencies and genotype frequencies for most genes in the human populations reveals that they closely fit a Hardy-Weinberg equilibrium. The implication is that in general, humans choose their mates at random with respect to individual genes and alleles. This may seem odd given that personal experience says that choosing a mate is anything but random. However the usual criteria for choosing mates such as character, appearance, and social position are largely not determined genetically and, to the extent that they are genetically determined, these are all very complex traits that are influenced by a large number of different genes. The net result is that our decision of with whom we have children does not in general systematically favor some alleles over others. | + | Examination of allele frequencies and genotype frequencies for most genes in the human populations reveals that they closely fit Hardy-Weinberg equilibrium. The implication is that in general, humans choose their mates at random with respect to individual genes and alleles. This may seem odd given that personal experience says that choosing a mate is anything but random. However the usual criteria for choosing mates such as character, appearance, and social position are largely not determined genetically and, to the extent that they are genetically determined, these are all very complex traits that are influenced by a large number of different genes. The net result is that our decision of with whom we have children does not in general systematically favor some alleles over others. |
- | One of the exceptional conditions that produce a population that is not in a Hardy-Weinberg equilibrium is known as assortative mating, which means preferential mating between similar individuals. For example, individuals with inherited deafness have a relatively high probability of having children together. But even this type of assortative mating will only affect the genotype frequencies related to deafness. | + | One of the exceptional conditions that produce a population that is not in Hardy-Weinberg equilibrium is known as assortative mating, which means preferential mating between similar individuals. For example, individuals with inherited deafness have a relatively high probability of having children together. But even this type of assortative mating will only affect the genotype frequencies related to deafness. |
Line 185: | Line 189: | ||
To see how this would happen, consider a gene in a very large population with a single major dominant allele $A$ and 10 minor recessive alleles $a_1$, $a_2$, $a_3$ ... $a_{10}$ with allele frequencies $ƒ(a_1) = ƒ(a_2) = ƒ(a_3) ... = 10^{-4}$ and $f(A)=0.999$. Now imagine that a group of 500 individuals from this population move to an island to start a new population. | To see how this would happen, consider a gene in a very large population with a single major dominant allele $A$ and 10 minor recessive alleles $a_1$, $a_2$, $a_3$ ... $a_{10}$ with allele frequencies $ƒ(a_1) = ƒ(a_2) = ƒ(a_3) ... = 10^{-4}$ and $f(A)=0.999$. Now imagine that a group of 500 individuals from this population move to an island to start a new population. | ||
- | The aggregate frequency of recessive alleles in this population is $a_n = 10\times10^{-4}=10^{-3}$. We can calculate the probability that $n$ number of individuals are carriers (either $\frac{a_n}{a_n}$ or $\frac{A}{a_n}$) as: | + | The aggregate frequency of recessive alleles in this population is $a_n = 10\times10^{-4}=10^{-3}$. We can calculate the probability that $n$ number of individuals are carriers (either $a_n/a_n$ or $A/a_n$) as: |
<figure Fig11> | <figure Fig11> | ||
Line 213: | Line 217: | ||
The way to interpret Table {{ref> | The way to interpret Table {{ref> | ||
- | Let's say allele $a_1$ is brought over by a single individual with genotype $\frac{A}{a_1}$. This means that alleles $a_2$ through $a_{10}$ are now lost in the new population. The new allele frequency for $a_1$ is now 1 in 1000, or $f(a_1)=10^{-3}$; | + | Let's say allele $a_1$ is brought over by a single individual with genotype $A/a_1$. This means that alleles $a_2$ through $a_{10}$ are now lost in the new population. The new allele frequency for $a_1$ is now 1 in 1000, or $f(a_1)=10^{-3}$; |
==== Migration of individuals between different populations ==== | ==== Migration of individuals between different populations ==== | ||
- | When individuals from populations with different allele frequencies mix, the combined population will be in a Hardy-Weinberg equilibrium after one generation of random mating. The combined population will be out of equilibrium to the extent that mating is assortatative. | + | When individuals from populations with different allele frequencies mix, the combined population will be in Hardy-Weinberg equilibrium after one generation of random mating. The combined population will be out of equilibrium to the extent that mating is assortatative. |
===== An example of calculating allele frequency in humans: albinism ===== | ===== An example of calculating allele frequency in humans: albinism ===== | ||
Line 228: | Line 232: | ||
</ | </ | ||
- | Therefore, $f(\frac{A}{A})=p^2 \approx 1$, $f(\frac{A}{a})=2pq \approx 2q$, and $f(\frac{a}{a})=q^2$. Since most genetic diseases are rare, these approximations are valid for many of the population genetics calculations that are of medical importance. | + | Therefore, $f(A/A)=p^2 \approx 1$, $f(A/a)=2pq \approx 2q$, and $f(a/a)=q^2$. Since most genetic diseases are rare, these approximations are valid for many of the population genetics calculations that are of medical importance. |
+ | |||
+ | Let's look at a real-life example. Albinism occurs in approximately 1 in 20,000 individuals in humans. Let's say that this condition is due to a recessive allele $a$ of a single gene that is in Hardy-Weinberg equilibrium. Based on this information, | ||
+ | |||
+ | $$f(a/a) = \frac{1}{20000} = 5\times10^{-5}=q^2\\ | ||
+ | q=\sqrt{5\times10^{-5}}=7\times10^{-3}$$ | ||
+ | |||
+ | And based on this we can also calculate the frequency of heterozygotes in the population: | ||
+ | |||
+ | $$f(A/ | ||
+ | |||
+ | In other words, approximately 1 in 140 humans are carriers for albinism. We can next calculate the fraction of alleles for albinism that are in individuals that are actually albinos (i.e., their genotype is $a/a$). If we let $\text{N}$ = population size, then the number of alleles in homozygotes will be $2\times\text{N}\cdot q^2$. The number of alleles in heterozygotes (carriers) will be $1\times\text{N} \cdot 2pq \approx \text{N}(2q)$. Therefore, the fraction of $a$ alleles in homozygotes is: | ||
+ | |||
+ | $$ \frac{2\times\text{N}\cdot q^2}{2\times\text{N}\cdot q^2+\text{N}(2q)}\\ | ||
+ | =\frac{q}{q+1}$$ | ||
+ | |||
+ | Since $q<< | ||
+ | |||
+ | A basic ethics lesson we can get from this simple example is that eugenics is an exercise in futility. Recessive alleles can easily exist at relatively high frequencies inside human populations. | ||
+ | |||
+ | |||
+ | |||
- | Let's look at a real-life example. Albinism occurs in approximately 1 in 20,000 individuals in humans. Let's say that this condition is due to a recessive allele $a$ of a single gene that is in a Hardy-Weinberg equilibrium. Based on this information, | ||
chapter_18.1725836752.txt.gz · Last modified: 2024/09/08 16:05 by mike