Use Sophia to knock out your gen-ed requirements quickly and affordably. Learn more
×

Amplifying, Visualizing, and Characterizing DNA, RNA, and Proteins

Author: Sophia

what's covered
In this lesson, you will build on information from other lessons to learn about more ways in which nucleic acids and proteins are studied in the laboratory. These approaches form the basis of many genetic approaches that we now encounter regularly, such as genetic testing of crime scene evidence, clinical testing for the presence of a virus associated with a viral infection such as COVID-19, and genetic testing for the potential of gene variants associated with disease. Specifically, this lesson will cover the following:

Table of Contents

1. Amplification-Based DNA Analysis Methods

In many cases, small amounts of DNA are available, and it is helpful or essential to have much larger amounts for analysis. Prior to using other techniques, DNA samples are often amplified (in other words, many copies of the DNA samples are produced).

The polymerase chain reaction (PCR) is especially important because it revolutionized molecular genetics by making it possible to create huge quantities of DNA for analysis without relying on cells to copy the DNA. PCR makes it possible to rapidly produce large quantities of DNA from relatively small and impure samples.

The image and steps below summarize how PCR works. The process is now carried out by automated thermocyclers. Samples can be added, and the thermocycler is set to cycle through specific temperature changes a certain number of times.

Note that primers are needed, as in standard DNA replication. These primers bind to single-stranded DNA and determine where DNA polymerase will begin to add bases on each strand. Therefore, primers need to be chosen carefully to amplify the correct region of DNA. This differs from natural DNA replication, in which the entire molecule is replicated.

step by step
Step 1: The DNA of interest, primers, heat-stable DNA polymerase, and nucleotides are combined.
Step 2: Double-stranded DNA containing the target sequence is heated to approximately 95 °C to denature (separate) the strands.
Step 3: The temperature is lowered to approximately 50 °C, allowing the primers to bind (anneal) to complementary sites (annealing). One primer anneals to each strand.
Step 4: The temperature is raised to approximately 72 °C, the optimal temperature for the heat-stable DNA polymerase to add nucleotides.
Step 5: The cycle is repeated 25 to 40 times, allowing the amplification of a single target sequence by tens of millions to over a trillion.

The image shows the events during three cycles out of the 25 to 40 total. It is possible to see how the primers anneal, followed by the addition of nucleotides. As the number of cycles increases, the number of copies of the target sequence increases rapidly.

A diagram of PCR. In cycle 1 a double stranded piece of DNA is denatured (split into 2 single strands) at 95C. Primers (short pieces of single stranded DNA) bind (anneal) to the longer strand at ~50C. DNA polymerase then binds to the primers and copies the longer strand; this is extension and occurs at 72C. This produces 2 copies of the original DNA. This repeats in cycle 2 to produce 4 copies. Cycle 3 produces 8 copies; cycle 4 produces 16; and each future cycle continues to double the number of copies.

It is important to note that advances in PCR are continuing to occur and the process may become easier and less reliant (or no longer reliant) on thermocyclers.

Additionally, variations of PCR add to its utility. Reverse transcriptase PCR (RT-PCR) is used to obtain DNA copies of mRNA molecules. Real-time PCR, also known as quantitative PCR (qPCR) uses fluorescence to monitor the increase in a double-stranded template during a PCR as it occurs. This produces kinetics data that can be used to quantify the amount of the original target sequence.

did you know
PCR has many clinical applications. PCR and RT-PCR can be used to detect viral DNA to diagnose diseases. For patients with HIV, it is important to monitor viral load to determine whether treatment is sufficiently effective. This can be accomplished by using qPCR to determine viral load.

2. DNA Sequencing

DNA produced by PCR or obtained from other sources can be sequenced using DNA sequencing techniques. This lesson will first introduce the basic chain termination method (dideoxy method or Sanger sequencing method) developed in 1972 and then discuss some of the newer approaches that are making sequencing more readily available for a wide range of uses.

The chain termination method involves DNA replication of a single-stranded template with the use of DNA primer to initiate synthesis of a complementary strand, DNA polymerase, a mix of the four regular deoxynucleotide (dNTP) monomers, and a small proportion of dideoxynucleotides (ddNTPs). The ddNTPs are labeled with some type of molecular beacon for easy visualization and are monomers that are lacking the hydroxyl group necessary for another nucleotide to be added. Therefore, the addition of a ddNTP terminates chain elongation.

Every time a ddNTP is added during replication, the process stops. This occurs randomly, producing a range of newly replicated DNA strands of different sizes. Each strand has a labeled ddNTP at one end, allowing the identity of the terminal nucleotide to be determined. When the samples are run on a gel, they are separated by length and the nucleotides can be ordered based on their position on the gel.

In the original procedure, four separate reactions were used for each DNA molecule being sequenced. Each reaction had a different ddNTP (one for adenine, one for thymine, one for cytosine, and one for guanine). Each ddNTP was labeled with a radioactive phosphorus molecule. The products of the four reactions were placed in different lanes on a long, narrow, polyacrylamide gel (similar to an agarose gel). Electrophoresis produced bands of varying lengths in each lane that could be detected using autoradiography.

In more recent approaches, ddNTPs are labeled using fluorescent dyes and are placed in a single sequencing reaction. The fluorochromes are detected using fluorescence spectroscopy that detects each color and produces an output showing the sequence.

The image below shows the difference between a ddNTP and a dNTP. The lower left vertex of the ddNTP sugar ring, shown on the left, has an H at the same position where the dNTP on the right has an OH. DNA polymerase requires the OH to add the next nucleotide, so having an H in that position prevents the addition of another nucleotide and chain elongation terminates.

A drawing of dNTPs and ddNTPs. Deoxynucleotide (dNTP) is a nucleotide with an OH at carbon #3. This is drawn as a pentagon with an O at the top. Moving counterclockwise – the next point has the word “base”, the next only has H’s, the next has an OH, and the last has 3 phosphates. Dideeoxynucleotide (ddNTP) is a nucleotide with an H at carbon #3. This is drawn as a pentagon with an O at the top. Moving counterclockwise – the next point has the word “base”, the next only has H’s, the next aso has only H’s, and the last has 3 phosphates.

The image below shows the dideoxy chain termination method using ddNTPs tagged with fluorochromes. The end of each strand shows a different color corresponding to the nucleotide present. This is used to produce a graph that shows the nucleotide sequence as shown.

A diagram showing the Sanger method. A strand of DNA has the sequence GATTCAGC. Dye-labeled dideoxynucleotides are used to generate DNA fragments of different lengths. The shortest fragment ends with a red star to indicate that the ddTTP is what ended the chain. The next shortest fragment has a green star to indicate that a ddATP ended the chain. The next has a black star to indicate that a ddGTP ended the chain. The longest has a blue star to indicate that a ddCTP ended the chain. Not all of the fragments are shown in the diagram. To the right is a computer printout that does show all the fragments that would be seen in a sample. The computer printout shows a colored peak to indicate which fragment moved through the gel at that position. The first (shortest) position shows a black peak indicating a G, next is a green peak indicating an A, next is a red peak indicating a T, next are 3 green peaks indicating A’s, etc.

The image below shows the process in more detail. In step 1, it shows the components added to the PCR tube: the DNA template, primers, DNA polymerase, dNTPs, and fluorescently labeled ddNTPs. In step 2, it shows how complementary strands begin to grow but stop at fluorescently labeled ddNTPs. In step 3, it shows the fragments in a capillary gel. A laser passing through the gel to a detector determines the sequence for identification and visualization on a computer, as shown in step 4.

A diagram summarizing the Sanger method. 1 – The following are added to the PCR tube: DNA template, primers, DNA polymerase, dNTPs, and fluorescently labeled ddNTPs. 2 – At each base in the DNA template, either a dNTP is added and elongation continues or a ddNTP is added and elongation stops. This process results in fragments of all sizes, each with a different fluorescently labeled end nucleotide. 3 – The fragments are run through a capillary gel and detected by a laser. A computer identifies each band as it passes by a laser.

Since 2005, automated sequence techniques used by laboratories fall under the umbrella of next-generation sequencing, which is a group of automated techniques used for rapid DNA sequencing. These methods have revolutionized the field of molecular genetics because the low-cost sequencers can generate huge quantities of short fragments in a day. There are multiple devices available from different manufacturers and they have evolved to become even more effective over time. This is why sequencing is so much more accessible at present than it has ever been in the past.

Additionally, vast amounts of biological data produced by these methods are readily available to researchers. For example, the National Center for Biotechnology Information houses a widely used genetic sequence database called GenBank. Researchers can deposit information in GenBank so that it is available for others to use.

terms to know
Chain Termination Method (Dideoxy Method or Sanger Sequencing Method)
An early method used to determine DNA sequences.
Next-Generation Sequencing
A group of automated techniques used for rapid DNA sequencing.

3. RNA-Seq

As advances in DNA sequencing have progressed, new ways to sequence RNA have followed. An important example is RNA sequencing (RNA-Seq). This technique has progressed over the years, like other types of sequencing, and now can be performed by using RNA to produce cDNA that can be sequenced using next-generation sequencing approaches (Kukurba & Montgomery, 2010).

The image below illustrates some of the approaches used. For the purposes of this lesson, it is not necessary to know all of the techniques in detail. However, the image shows how RNA-Seq can be used in vitro (in the lab), in vivo (in living organisms), and in silico (using computers). For example, RNA in living organisms can be analyzed as pre-mRNA or as mature mRNA. In the lab, fragments of RNA can be analyzed, or reverse transcriptase can be used to produce cDNA for analysis. Computers can perform complex analyses to find matching sequences that indicate areas where fragments overlap, allowing them to assemble a larger molecule from fragments. These and other approaches can be accomplished using variations of RNA-Seq techniques.

term to know
RNA Sequencing (RNA-Seq)
An approach used to sequence RNA. Current approaches use next-generation sequencing approaches performed on cDNA.

watch
Amplification-Based DNA and RNA Analysis


4. Molecular Analysis of Proteins

Although this lesson has focused on nucleic acids, molecular analysis of proteins is also very valuable. Studying proteins can provide information about how they are actually used and distributed within a cell, as well as ways in which they change because of changes in environmental conditions or the presence of a pathogen.

Polyacrylamide gel electrophoresis (PAGE) uses a variation of gel electrophoresis to separate proteins. A polyacrylamide gel has a finer matrix than an agarose gel. Another important difference from the agarose electrophoresis techniques described above is that PAGE typically uses a vertical gel apparatus.

Because of varying charges associated with amino acid side chains, PAGE can be used to separate intact proteins based on their net charges because of the presence of the electrical field.

Alternatively, proteins can be denatured and coated with a negatively charged detergent called sodium dodecyl sulfate (SDS). SDS masks the native charges of the untreated proteins, meaning that size alone determines how far proteins move across the gel. As with nucleic acids, smaller fragments move more rapidly and therefore travel farther on the gel within a certain timeframe.

PAGE can be further modified to separate proteins based on two characteristics. This is called two-dimensional PAGE. For example, proteins may be separated based on charge at varying pH as well as size.

After separation on a gel, proteins are visualized through staining. Coomassie blue and silver stains are commonly used.

The image below illustrates SDS-PAGE. Part (a) shows how a protein with varied charges can be treated with SDS, denaturing it (changing its shape) and coating it with a uniform negative charge to replace the varied charges of the original protein. In part (b), a vertical gel is shown. Protein samples are added to the wells at the top. A molecular weight standard is added to one well for comparison. An electrical current is applied, causing the proteins to move. Because smaller proteins move faster than larger proteins, the proteins separate by size. Each fragment can be compared with the molecular weight standard to estimate its size. Part (c) shows an example of an SDS-PAGE gel stained with Coomassie blue.

(a) A diagram showing a globular protein with positive and negative charges undergoing SDS treatment. SDS denatures the protein (producing a linear product) and makes them uniformly negative in charge. (B) The protein samples are then placed into the wells of an SDS_PAGE gel. One well is loaded with a molecular weight standard. The gel is then exposed to a power source that results in the top of the gel (near the wells) becoming negative charged and the bottom becoming positively charged. Proteins migrate through the gel from the negative to the positive sides. Small proteins travel through the gel faster than large proteins. The molecular weight standard includes fragments of known size and is used to estimate the size of sample proteins. In this example the standard has sizes of 216, 132, 78, 32 and 7. The other lanes have bands of various sizes. (C) A photograph of an SDS-PAGE gel. Purple bands on a clear background.

term to know
Polyacrylamide Gel Electrophoresis (PAGE)
A variation of gel electrophoresis used to separate proteins.

summary
In this lesson, you learned about ways to characterize and visualize DNA, RNA, and proteins. You learned about amplification-based DNA analysis methods such as PCR that allow researchers to make huge quantities of DNA for analysis. You learned some of the many ways in which these techniques can be useful, including their medical applications. Next, you learned about initial techniques used for DNA sequencing followed by information about newer, more efficient technologies that have made sequencing less expensive and more widely available. You learned about ways that RNA can be sequenced using RNA-Seq. Finally, you learned how electrophoresis methods can be used for the molecular analysis of proteins. This information can be helpful in understanding the functioning of a cell under varying conditions. These techniques are widely used in microbiology and have many clinical applications.

Source: THIS TUTORIAL HAS BEEN ADAPTED FROM OPENSTAX “MICROBIOLOGY.” ACCESS FOR FREE AT openstax.org/details/books/microbiology. LICENSE: CC ATTRIBUTION 4.0 INTERNATIONAL

REFERENCES

Kukurba, K. R., & Montgomery, S. B. (2015). RNA Sequencing and Analysis. Cold Spring Harbor protocols, 2015(11), 951–969. doi.org/10.1101/pdb.top084970

Terms to Know
Chain Termination Method (Dideoxy Method or Sanger Sequencing Method)

An early method used to determine DNA sequences.

Next-Generation Sequencing

A group of automated techniques used for rapid DNA sequencing.

Polyacrylamide Gel Electrophoresis (PAGE)

A variation of gel electrophoresis used to separate proteins.

RNA Sequencing (RNA-Seq)

An approach used to sequence RNA. Current approaches use next-generation sequencing approaches performed on cDNA.