chapter_07
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
chapter_07 [2024/08/31 16:10] – [Identifying a gene based on DNA sequence data] mike | chapter_07 [2025/03/14 07:29] (current) – [Polymerase Chain Reaction (PCR)] mike | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | < | ||
+ | |||
+ | <typo fs: | ||
+ | |||
Although eukaryotic genes may be generally more interesting to most students, it is useful to first consider bacterial genes. Most eukaryotic molecular biologists use bacteria as tools for various things (e.g., molecular cloning; see [[chapter_09|Chapter 09]]), so it’s useful to understand how bacteria work from a practical perspective. Also, although bacterial genes have some pretty important differences compared to eukaryotic genes, many basic principles are the same. | Although eukaryotic genes may be generally more interesting to most students, it is useful to first consider bacterial genes. Most eukaryotic molecular biologists use bacteria as tools for various things (e.g., molecular cloning; see [[chapter_09|Chapter 09]]), so it’s useful to understand how bacteria work from a practical perspective. Also, although bacterial genes have some pretty important differences compared to eukaryotic genes, many basic principles are the same. | ||
Line 46: | Line 50: | ||
</ | </ | ||
- | An experimental way to identify eukaryotic genes physically is by examining mRNA instead of DNA. If an mRNA exists in a cell, this means that it was most likely transcribed from a gene. mRNAs can be purified from cells biochemically, | + | An experimental way to identify eukaryotic genes physically is by examining mRNA instead of DNA. If an mRNA exists in a cell, this means that it was most likely transcribed from a gene. mRNAs can be purified from cells biochemically, |
===== How to sequence DNA: background information ===== | ===== How to sequence DNA: background information ===== | ||
Line 79: | Line 83: | ||
{{ : | {{ : | ||
< | < | ||
- | The substrate for DNA polymerase includes a template strand, a primer with a free 3' hydroxyl group, and dNTPs (not shown). The primer can be made from single-stranded DNA (ssDNA) or RNA, but in this diagram | + | The substrate for DNA polymerase includes a template strand, a primer with a free 3' hydroxyl group, and dNTPs (not shown). The primer can be made from single-stranded DNA (ssDNA) or RNA, but in this diagram |
</ | </ | ||
</ | </ | ||
Line 94: | Line 98: | ||
===== DNA Sequencing: the details ===== | ===== DNA Sequencing: the details ===== | ||
- | We first discuss an older but still relevant type of DNA sequencing technology called Sanger sequencing. The basic method was invented in 1977 and is named after its inventor Frederick Sanger. Consider a segment of dsDNA that is about 1000 base pairs long that we wish to sequence. To sequence this DNA, we first need to have a source of DNA material, which we consider [[chapter_07# | + | We first discuss an older but still relevant type of DNA sequencing technology called Sanger sequencing. The basic method was invented in 1977 and is named after its inventor |
- | Next, a short single-stranded primer((In DNA sequencing and similar applications, | + | Next, a short single-stranded primer (about 18-20 bases long) designed to be complimentary to the end of one of the strands is allowed to anneal to the single stranded DNA. These primers are designed with the help of a computer and synthesized through commercially available services. The primer is added at a huge molar excess compared to the DNA you are trying to sequence – so most ssDNAs will pair with primer DNA rather than their original complementary partner ssDNA. The resulting DNA hybrid looks much like the general DNA polymerase substrate shown in Fig. {{ref> |
DNA polymerase is then added along with the four dNTP nucleotide precursors (dATP, dGTP, dCTP, and dTTP). A small quantity of a slightly different nucleotide precursor called a dideoxyribonucleotide triphosphate is also added. Dideoxy nucleotide precursors are abbreviated ddATP, ddGTP, ddCTP, and ddTTP (or ddNTPs collectively). The ddNTPs have each been chemically labeled with a unique fluorophore that emits a different color of light after stimulation with a laser – for instance, green for ddATP, cyan for ddCTP, yellow for ddGTP, and red for ddTTP. These molecules are identical to the normal dNTPs in all respects except that they lack a hydroxyl group at their 3’ position (3’ OH) and that their nucleotide bases are chemically labeled with a fluorophore (Fig. {{ref> | DNA polymerase is then added along with the four dNTP nucleotide precursors (dATP, dGTP, dCTP, and dTTP). A small quantity of a slightly different nucleotide precursor called a dideoxyribonucleotide triphosphate is also added. Dideoxy nucleotide precursors are abbreviated ddATP, ddGTP, ddCTP, and ddTTP (or ddNTPs collectively). The ddNTPs have each been chemically labeled with a unique fluorophore that emits a different color of light after stimulation with a laser – for instance, green for ddATP, cyan for ddCTP, yellow for ddGTP, and red for ddTTP. These molecules are identical to the normal dNTPs in all respects except that they lack a hydroxyl group at their 3’ position (3’ OH) and that their nucleotide bases are chemically labeled with a fluorophore (Fig. {{ref> | ||
Line 116: | Line 120: | ||
</ | </ | ||
- | Dideoxynuclotides | + | ddNTPs |
===== Polymerase Chain Reaction (PCR) ===== | ===== Polymerase Chain Reaction (PCR) ===== | ||
Line 123: | Line 127: | ||
Now let’s consider how to physically obtain DNA for sequencing. A relatively large amount of DNA (approx. 1 μg worth for a piece of DNA several kbp long) is needed for the Sanger chemistry to work. As a student, you might not have a feel for how much DNA this is, but it's a substantial amount! To give you an idea of the scale of the problem, see [[chapter_07# | Now let’s consider how to physically obtain DNA for sequencing. A relatively large amount of DNA (approx. 1 μg worth for a piece of DNA several kbp long) is needed for the Sanger chemistry to work. As a student, you might not have a feel for how much DNA this is, but it's a substantial amount! To give you an idea of the scale of the problem, see [[chapter_07# | ||
- | If we want to quickly find the sequence of a new mutant allele of a known gene, we need an easy way to obtain a relatively large quantity of this DNA without needing to go through molecular cloning. The easiest and most common way to do this is to use an //in vitro// method known as the polymerase chain reaction (PCR) that was developed by Kary Mullis in the mid-1980s (Fig. {{ref> | + | If we want to quickly find the sequence of a new mutant allele of a known gene, we need an easy way to obtain a relatively large quantity of this DNA without needing to go through molecular cloning. The easiest and most common way to do this is to use an //in vitro// method known as the polymerase chain reaction (PCR) that was developed by [[wp> |
- A crude preparation of chromosomal DNA is extracted from the tissue source of interest (there is usually not enough DNA for sequencing from this step). | - A crude preparation of chromosomal DNA is extracted from the tissue source of interest (there is usually not enough DNA for sequencing from this step). | ||
- Two short primers (each about 18-20 bases long) are added to the DNA at an enormous molar excess. The primers are designed from the known genomic sequence to be complimentary to opposite strands of DNA and to flank the chromosomal segment of interest. | - Two short primers (each about 18-20 bases long) are added to the DNA at an enormous molar excess. The primers are designed from the known genomic sequence to be complimentary to opposite strands of DNA and to flank the chromosomal segment of interest. | ||
- | - The double stranded DNA is melted by heating to around 100 ˚C (in practice we usually use 95 oC) and then the mixture is cooled to allow the primers to anneal to the template DNA. Since there is a huge molar excess of primer vs. template, most of the template will anneal with primer rather than reanneal | + | - The double stranded DNA is melted by heating to around 100 ˚C (in practice we usually use 95 °C) and then the mixture is cooled |
- | - DNA polymerase and the four nucleotide precursors are added, and the reaction is incubated at 37 ˚C for a period of time to allow a copy of the segment to be synthesized. | + | - DNA polymerase and the four nucleotide precursors are added, and the reaction is incubated at around 72 ˚C for a period of time to allow a copy of the segment to be synthesized. The reason we use 72 °C instead of 37 °C like we do for most enzymatic reactions is that we use a special heat stable enzyme called Taq DNA polymerase instead of standard DNA polymerase. |
- | - Repeat steps 3 and 4 multiple times (up to 30-35 cycles). To avoid the inconvenience of having to add new DNA polymerase in each cycle, a special DNA polymerase called Taq polymerase that can withstand heating to 100 ˚C is used. | + | - Repeat steps 3 and 4 multiple times (up to 30-35 cycles). To avoid the inconvenience of having to add new DNA polymerase in each cycle (due to the heating cycle eliminating DNA polymerase activity), a special DNA polymerase called Taq polymerase that can withstand heating to 100 ˚C is used. |
The idea behind PCR is that in each cycle of melting, annealing, and DNA synthesis, the amount of the DNA segment is doubled. This gives an exponential increase((If you think about it, the idea behind PCR is actually quite simple. It is mimicking how DNA replication occurs in dividing cells. Dividing cells with unlimited resources also replicate exponentially.)) in the amount of the specific DNA bounded by the primers on either side as the cycles proceed. After 10 cycles the DNA is amplified 2< | The idea behind PCR is that in each cycle of melting, annealing, and DNA synthesis, the amount of the DNA segment is doubled. This gives an exponential increase((If you think about it, the idea behind PCR is actually quite simple. It is mimicking how DNA replication occurs in dividing cells. Dividing cells with unlimited resources also replicate exponentially.)) in the amount of the specific DNA bounded by the primers on either side as the cycles proceed. After 10 cycles the DNA is amplified 2< | ||
Line 140: | Line 144: | ||
</ | </ | ||
- | Let's do some quick back of the envelope | + | Let's do some quick back of the envelope |
The first thing we need to do is to figure out how many molecules is 1 μg of a 1000 bp fragment of dsDNA. From a quick Google search, we learn that the average molecular weight of a nucleotide is approximately 330 Da (g/mol). Since DNA is double-standed, | The first thing we need to do is to figure out how many molecules is 1 μg of a 1000 bp fragment of dsDNA. From a quick Google search, we learn that the average molecular weight of a nucleotide is approximately 330 Da (g/mol). Since DNA is double-standed, | ||
Line 167: | Line 171: | ||
- | The Sanger sequencing method [[chapter_07# | + | The Sanger sequencing method [[chapter_07# |
==== Illumina sequencing ==== | ==== Illumina sequencing ==== | ||
- | There are several different types of NGS technology. The most common type of NGS is called Illumina sequencing (Figs {{ref> | + | There are several different types of NGS technology. The most common type of NGS is called Illumina sequencing (Figs {{ref> |
- | The flow cell is then exposed to sequencing reagents similar to Sanger sequencing, except that instead of ddNTP chain terminators, | + | The flow cell is then exposed to sequencing reagents similar to Sanger sequencing, except that instead of ddNTP chain terminators, |
<figure Fig10> | <figure Fig10> | ||
Line 197: | Line 201: | ||
Technologies such as Illumina sequencing are now the preferred method for most types of large-scale DNA sequencing, and it has been adapted for related technologies such as RNA sequencing (RNAseq). In RNA sequencing, RNA is first converted to complementary DNA (cDNA) using an enzyme called reverse transcriptase; | Technologies such as Illumina sequencing are now the preferred method for most types of large-scale DNA sequencing, and it has been adapted for related technologies such as RNA sequencing (RNAseq). In RNA sequencing, RNA is first converted to complementary DNA (cDNA) using an enzyme called reverse transcriptase; | ||
- | Illumina sequencing can also be adapted for other applications. For instance, proteins interact with DNA //in vivo// to form a dynamic structure called chromatin. Let's say you are interested in a DNA protein called X. To find out what DNA sequences X binds to, you can extract and purify chromatin from cells, then use enzymes to gently cleave the DNA into small fragments under conditions in which X still binds to DNA. You can then purify X using antibodies and use Illumina sequencing to sequence the DNA fragments that co-purify with X. This procedure is called chromatin immunoprecipitation sequencing, or ChIPseq. There are many other similar applications too many to list and discuss here in detail. | + | Illumina sequencing can also be adapted for other applications. For instance, proteins interact with DNA //in vivo// to form a dynamic structure called chromatin. Let's say you are interested in a DNA-binding |
For small scale DNA sequencing, Sanger sequencing described above is still a commonly used method, although the cost for various NGS technologies have dropped so much that it is also starting to replace Sanger sequencing for small scale sequencing experiments. For instance, Nanopore sequencing (Fig. {{ref> | For small scale DNA sequencing, Sanger sequencing described above is still a commonly used method, although the cost for various NGS technologies have dropped so much that it is also starting to replace Sanger sequencing for small scale sequencing experiments. For instance, Nanopore sequencing (Fig. {{ref> |
chapter_07.1725145824.txt.gz · Last modified: 2024/08/31 16:10 by mike