DNA sequencing is the process of determining the nucleotide order of a given DNA fragment, called the DNA sequence. Currently, almost all DNA sequencing is performed using the chain termination method *, developed by Frederick Sanger. This technique uses sequence-specific termination of an in vitro DNA synthesis reaction using modified nucleotide substrates.
The sequence of DNA encodes the necessary information for living things to survive and reproduce. Determining the sequence is therefore useful in 'pure' research into why and how organisms live, as well as in applied subjects. Because DNA is key to all living things, knowledge of DNA sequence may be useful in almost any biological subject area. For example, in medicine it can be used to identify, diagnose and potentially develop treatments for genetic diseases. Similarly, research into pathogens may lead to treatments for contagious diseases.
The classical chain termination method or Sanger method first involves preparing the DNA to be sequenced as a single strand. The DNA sample is divided into four separate samples. Each of the four samples has a primer, the four normal deoxynucleotides (dATP, dGTP, dCTP and dTTP), DNA polymerase, and only one of the four dideoxynucleotides (ddATP, ddGTP, ddCTP and ddTTP) added to it. The dideoxynucleotides are added in limited quantities. The primer or the dideoxynucleotides are either radiolabeled or have a fluorescent tag.
As the DNA strand is elongated the DNA polymerase catalyses the joining of deoxynucleotides to the corresponding bases. However, if a dideoxynucleotide is joined to a base, then that fragment of DNA can no longer be elongated since a dideoxynucleotide lacks a crucial 3'-OH group. Fragments of all sizes should be obtained due to the randomness of when a dideoxynucleotide is added. However, to make sure that all different lengths will occur, only short stretches of DNA can be sequenced in one test.
The DNA is then denatured and the resulting fragments are separated (with a resolution of just one nucleotide) by gel electrophoresis, from longest to shortest. Each of the four DNA samples is run on one of four individual lanes (lanes A, T, G, C) depending on which dideoxynucleotide was added. Depending on the whether the primers or dideoxynucleotides were radiolabeled or fluorescently labeled, the DNA bands can be detected by exposure to X-rays or UV-light and the DNA sequence can be directly read off the gel. In the image on the right, X-ray film was exposed to the dried gel, and the dark bands indicate the positions of the DNA molecules of different lengths. A dark band in a lane indicates a chain termination for that particular DNA subunit and the DNA sequence can be read off as indicated.
There can be various problems with sequencing through the Sanger Method. The primer used can also be annealed to a second site. This would cause two sequences to be interpreted at the same time. This can be solved by higher annealing temperatures and higher G and C content in the primer. Another problem can occur when RNA contaminates the reaction, which can act like a primer and leads to bands in all lanes at all positions due to non specific priming. Other contaminants can be from other plasmids, inhibitors of DNA pol, and low concentrations in general. Secondary structure of DNA being read by DNA pol can lead to reading problems and will be visualized on the readout by bands in all lanes of only a few positions.
There are two sub-types of chain-termination sequencing. In the original method, the nucleotide order of a particular DNA template can be inferred by performing four parallel extension reactions using one of the four chain-terminating bases in each reaction. The DNA fragments are detected by labelling the primer with radioactive phosphorous prior to performing the sequencing reaction. The four reactions would then be run out in four adjacent lanes on a slab polyacrylamide gel.
A development of this method used four different fluorescent dye-labelled primers. This has the advantage of avoiding the need for radioactivity; increasing safety and speed, and also that the four reactions can be combined and run in a single gel lane, if they can be distinguished. This approach is known as 'dye primer sequencing'.
This method is now used for the vast majority of sequencing reactions as it is both simpler and cheaper. The major reason for this is that the primers do not have to be separately labelled (which can be a significant expense for a single-use custom primer), although this is less of a concern with frequently used 'universal' primers.
To produce detectable labelled products from the template DNA, 'cycle sequencing' is most commonly performed. This approach uses repeated (25 - 40) rounds of primer annealing, DNA polymerase extension and disassociation (melting) of the template DNA strands. The major advantages of cycle sequencing is the more efficient use of the expensive sequencing reagent (BigDye) and the ability to sequence templates with strong secondary structures such as hairpins or GC-rich regions. The different stages of cycle sequencing are performed by altering the temperature of the reaction using a PCR thermal cycler. This relies on the fact that complementary DNA will anneal at a lower temperatures and disassociate at higher temperatures. An important part of making this possible is the use of DNA polymerase from a thermophillic organism, which is not rapidly denatured at the high (>95C) temperatures involved. In the past, new DNA polymerase had to be added individually every cycle of PCR.
It is often necessary to obtain the sequence of much larger regions. For example, even simple bacterial genomes contain millions of base pairs, and the human genome has more than 3 billion. Several strategies have been devised for large-scale DNA sequencing, including primer walking (see also chromosome walking) and shotgun sequencing. These involve taking many small reads of the DNA through the Sanger method and subsequently assembling them into a contiguous sequence. The different strategies have different tradeoffs in speed and accuracy; for example, the shotgun method is the most practical for sequencing large genomes, but its assembly process is complex and potentially error-prone.
It is easier to obtain high quality sequence data when the desired DNA is purified and amplified from any contaminants that may be in the original sample. This can be achieved through PCR if it is practical to design primers that cover the entire desired region. Alternatively, the sample can be cloned using a bacterial vector, harnessing bacteria to "grow" copies of the desired DNA a few thousand base pairs at a time. Most large-scale sequencing efforts involve the preparation of a large library of such clones. The advantage of sequencing clones over PCR-products is that the possibility of the presence of non-specific PCR products that may cause signal noise is virtually eliminated.
DNA | Molecular biology | Laboratory techniques
DNA-Sequenzierung | DNA-vicrivelado | Séquençage | Sekuensing | ריצוף DNA | Sequenering | シークエンス | Sekwencjonowanie | Sekvensering | Xác định trình tự | 測序
This article is licensed under the GNU Free Documentation License.
It uses material from the
"DNA sequencing".
Home Page • arts • business • computers • games • health • hospitals • home • kids & teens • news • physicians • recreation• reference • regional • science • shopping • society • sports • world