In complex eukaryotic cells such as those from plants, animals, fungi and protists, most of the DNA is located in the cell nucleus. By contrast, in simpler cells called prokaryotes, including the eubacteria and archaea, DNA is not separated from the cytoplasm by a nuclear envelope. The cellular organelles known as chloroplasts and mitochondria also carry DNA.
DNA is often referred to as the molecule of heredity as it is responsible for the genetic propagation of most inherited traits. In humans, these traits can range from hair color to disease susceptibility. During cell division, DNA is replicated and can be transmitted to offspring during reproduction. Lineage studies can be done based on the facts that the mitochondrial DNA only comes from the mother, and the male Y chromosome only comes from the father.
Every person's DNA, their genome, is inherited from both parents. The mother's mitochondrial DNA together with twenty-three chromosomes from each parent combine to form the genome of a zygote, the fertilized egg. As a result, with certain exceptions such as red blood cells, most human cells contain 23 pairs of chromosomes, together with mitochondrial DNA inherited from the mother.
Between the two strands, each base can only "pair up" with one single predetermined other base: A+T, T+A, C+G and G+C are the only possible combinations; that is, an "A" on one strand of double-stranded DNA will "mate" properly only with a "T" on the other, complementary strand; therefore, naming the bases on the conventionally chosen side of the strand is enough to describe the entire double-strand sequence. Two nucleotides paired together are called a base pair. On rare occasions, wrong pairing can happen, when thymine goes into its enol form or cytosine goes into its imino form. The double-stranded structure of DNA provides a simple mechanism for DNA replication: the DNA double strand is first "unzipped" down the middle, and the "other half" of each new single strand is recreated by exposing each half to a mixture of the four bases. An enzyme makes a new strand by finding the correct base in the mixture and pairing it with the original strand. In this way, the base on the old strand dictates which base will be on the new strand, and the cell ends up with an extra copy of its DNA.
DNA contains the genetic information that is inherited by the offspring of an organism; this information is determined by the sequence of base pairs along its length. A strand of DNA contains genes, areas that regulate genes, and areas that either have no function, or a function yet unknown. Genes can be loosely viewed as the organism's "cookbook" or "blueprint".
Other interesting points:
Each vine-like molecule is a strand of DNA: a chemically linked chain of nucleotides, each of which consists of a sugar (deoxyribose), a phosphate and one of five kinds of nucleobases ("bases"). Because DNA strands are composed of these nucleotide subunits, they are polymers.
The diversity of the bases means that there are five kinds of nucleotides, which are commonly referred to by the identity of their bases. These are adenine (A), thymine (T), uracil (U), cytosine (C), and guanine (G). U is rarely found in DNA except as a result of chemical degradation of C, but in some viruses, notably PBS1 phage DNA, U completely replaces the usual T in its DNA. Similarly, RNA usually contains U in place of T, but in certain RNAs such as transfer RNA, T is always found in some positions. Thus, the only true difference between DNA and RNA is the sugar, 2-deoxyribose in DNA and ribose in RNA.
In a DNA double helix, two polynucleotide strands can associate through the hydrophobic effect and pi stacking. Specificity of which strands stay associated is determined by complementary pairing. Each base forms hydrogen bonds readily to only one other, A to T forming two hydrogen bonds and C to G forming three hydrogen bonds. The GC content and length of each DNA molcule dictates the strength of the association; the more complementary bases exist, the stronger and longer-lasting the association characterised by the temperate required to break the hydrogen bonding, its Tm value.
The cell's machinery is capable of melting or disassociating a DNA double helix, and using each DNA strand as a template for synthesizing a new strand which is nearly identical to the previous strand. Errors that occur in the synthesis are known as mutations. The process known as PCR (polymerase chain reaction) mimics this process in vitro in a nonliving system.
Because pairing causes the nucleotide bases to face the helical axis, the sugar and phosphate groups of the nucleotides run along the outside; the two chains they form are sometimes called the "backbones" of the helix. In fact, it is chemical bonds between the phosphates and the sugars that link one nucleotide to the next in the DNA strand.
In many species, only a small fraction of the total sequence of the genome appears to encode protein. For example, only about 1.5% of the human genome consists of protein-coding exons. The function of the rest is a matter of speculation. It is known that certain nucleotide sequences specify affinity for DNA binding proteins, which play a wide variety of vital roles, in particular through control of replication and transcription. These sequences are frequently called regulatory sequences, and researchers assume that so far they have identified only a tiny fraction of the total that exist. "Junk DNA" represents sequences that do not yet appear to contain genes or to have a function. The reasons for the presence of so much non-coding DNA in eukaryotic genomes and the extraordinary differences in genome size ("C-value") among species represent a long-standing puzzle in DNA research known as the "C-value enigma".
Some DNA sequences play structural roles in chromosomes. Telomeres and centromeres typically contain few (if any) protein-coding genes, but are important for the function and stability of chromosomes. Some genes code for "RNA genes" (see tRNA and rRNA). Some RNA genes code for transcripts that function as regulatory RNAs (see siRNA) that influence the function of other RNA molecules. The intron-exon structure of some genes (such as immunoglobin and protocadeherin genes) is important for allowing alternative splicing of pre-mRNA which allows several different proteins to be made from the same gene. Some non-coding DNA represents pseudogenes, which have been hypothesized to serve as raw genetic material for the creation of new genes through the process of gene duplication and divergence. Some non-coding DNA provided hot-spots for duplication of short DNA regions; such sequence duplication has been the major form of genetic change in the human lineage (see evidence from the Chimpanzee Genome Project). Exons interspersed with introns allows for "exon shuffling" and the creation of modified genes that might have new adaptive functions. Large amounts of non-coding DNA is probably adaptive in that it provides chromosomal regions where recombination between homologous portions of chromosomes can take place without disrupting the function of genes. Some biologists such as Stuart Kauffman have speculated that non-coding DNA may modify the rate of evolution of a species.
Sequence also determines a DNA segment's susceptibility to cleavage by restriction enzymes, the quintessential tools of genetic engineering. The position of cleavage sites throughout an individual's genome determines one kind of an individual's "DNA fingerprint".
DNA replication or DNA synthesis is the process of copying the double-stranded DNA prior to cell division. The two resulting double strands are generally almost perfectly identical, but occasionally errors in replication or exposure to chemicals, or radiation can result in a less than perfect copy (see mutation), and each of them consists of one original and one newly synthesized strand. This is called semiconservative replication. The process of replication consists of three steps: initiation, elongation and termination.
Furthermore, DNA undergoes a stretching phase transition at a force of 65 pN; above this force, DNA is thought to take the form that Linus Pauling originally hypothesized, with the phosphates in the middle and bases splayed outward. This proposed structure for overstretched DNA has been called "P-form DNA," in honor of Pauling.
The B form of the DNA helix twists 360° per 10 bp in the absence of strain. But many molecular biological processes can induce strain. A DNA segment with excess or insufficient helical twisting is referred to, respectively, as positively or negatively "supercoiled". DNA in vivo is typically negatively supercoiled, which facilitates the unwinding of the double-helix required for RNA transcription.
| Geometry attribute | A-form | B-form | Z-form | |||||
|---|---|---|---|---|---|---|---|---|
| Helix sense | right-handed | right-handed | left-handed | |||||
| Repeating unit | 1 bp | 1 bp | 2 bp | - | Rotation/bp | 33.6° | 35.9° | 60°/2 |
| Mean bp/turn | 10.7 | 10.4 | 12 | - | Inclination of bp to axis | +19° | -1.2° | -9° |
| Rise/bp along axis | 0.23 nm | 0.332 nm | 0.38 nm | - | Pitch/turn of helix | 2.46 nm | 3.32 nm | 4.56 nm |
| Mean propeller twist | +18° | +16° | 0° | - | Glycosyl angle | anti | anti | C: anti, G: syn |
| Sugar pucker | C3'-endo | C2'-endo | C: C2'-endo, G: C2'-exo | - | Diameter | 2.55 nm | 2.37 nm | 1.84 nm |
http://www.notahelix.com/delmonte/new_struct_mol_biol.pdf
and a recent research paper summarises some key experimental data which are better explained by SBS models than by the double helix:
http://www.ias.ac.in/currsci/dec102003/1564.pdf
with subsequent correspondence:
http://www.ias.ac.in/currsci/may252004/1352.pdf
However, these theories have problems of their own, such as explaining the near-perfect symmetry of DNA in cells and the activity of DNA repair in the absence of a base-paired strand for comparison. Additionally, the activity of topoisomerases would be entirely redundant, and not nearly as important to cellular function as it patently is, if not for the fact that base-paired double-strands are at least the primary form of cellular DNA.
As a result of their antiparallel arrangement and the sequence-reading preferences of enzymes, even if both strands carried identical instead of complementary sequences, cells could properly translate only one of them. The other strand a cell can only read backwards. Molecular biologists call a sequence "sense" if it is translated or translatable, and they call its complement "antisense". It follows then, somewhat paradoxically, that the template for transcription is the antisense strand. The resulting transcript is an RNA replica of the sense strand and is itself sense.
A small proportion of genes in prokaryotes, and more in plasmids and viruses, blur the distinction made above between sense and antisense strands. Certain sequences of their genomes do double duty, encoding one protein when read 5' to 3' along one strand, and a second protein when read in the opposite direction (still 5' to 3') along the other strand. As a result, the genomes of these viruses are unusually compact for the number of genes they contain, which biologists view as an adaptation. This merely confirms that there is no biological distinction between the two strands of the double helix. Typically each strand of a DNA double helix will act as sense and antisense in different regions.
Friedrich Miescher (1844-1895) discovered a substance he called "nuclein" in 1869. Somewhat later, he isolated a pure sample of the material now known as DNA from the sperm of salmon, and in 1889 his pupil, Richard Altmann, named it "nucleic acid". This substance was found to exist only in the chromosomes.
In 1929 Phoebus Levene at the Rockefeller Institute identified the components (the four bases, the sugar and the phosphate chain) and he showed that the components of DNA were linked in the order phosphate-sugar-base. He called each of these units a nucleotide and suggested the DNA molecule consisted of a string of nucleotide units linked together through the phosphate groups, which are the 'backbone' of the molecule. However Levene thought the chain was short and that the bases repeated in the same fixed order. Torbjorn Caspersson and Einar Hammersten showed that DNA was a polymer.
In 1943, Oswald Theodore Avery and a team of scientists discovered that traits proper to the "smooth" form of the Pneumococcus could be transferred to the "rough" form of the same bacteria merely by making the killed "smooth" (S) form available to the live "rough" (R) form. Quite unexpectedly, the living R Pneumococcus bacteria were transformed into a new strain of the S form, and the transferred S characteristics turned out to be heritable. Avery called the medium of transfer of traits the transforming principle; he identified DNA as the transforming principle, and not protein as previously thought. He essentially redid Fredrick Griffith's experiment. In 1953, Alfred Hershey and Martha Chase did an experiment (Hershey-Chase experiment) that showed, in T2 phage, that DNA is the genetic material (Hershey shared the Nobel prize with Luria).
In 1944, the renowned physicist, Erwin Schrödinger, published a brief book entitled What is Life?, where he maintained that chromosomes contained what he called the "hereditary code-script" of life. He added: "But the term code-script is, of course, too narrow. The chromosome structures are at the same time instrumental in bringing about the development they foreshadow. They are law-code and executive power -- or, to use another simile, they are architect's plan and builder's craft -- in one." He conceived of these dual functional elements as being woven into the molecular structure of chromosomes. By understanding the exact molecular structure of the chromosomes one could hope to understand both the "architect's plan" and also how that plan was carried out through the "builder's craft." Three groups took up Schrödinger's challenge to work out the structure of the chromosomes and the question of how the segments of the chromosomes that were conceived to relate to specific traits could possibly do their jobs.
Just how the presence of specific features in the molecular structure of chromosomes could produce traits and behaviors in living organisms was unimaginable at the time. Because chemical dissection of DNA samples always yielded the same four nucleotides, the chemical composition of DNA appeared simple, perhaps even uniform. Organisms, on the other hand, are fantastically complex individually and widely diverse collectively. Geneticists did not speak of genes as conveyors of "information" in such words, but if they had, they would not have hesitated to quantify the amount of information that genes need to convey as vast. The idea that information might reside in a chemical in the same way that it exists in text--as a finite alphabet of letters arranged in a sequence of unlimited length--had not yet been conceived. It would emerge upon the discovery of DNA's structure, but few researchers imagined that DNA's structure had much to say about genetics.
Watson and Crick had begun to contemplate double helical arrangements, but they lacked information about the amount of twist (pitch) and the distance between the two strands. Rosalind Franklin had to disclose some of her findings for the Medical Research Council and Crick saw this material through Max Perutz's links to the MRC. Franklin's work confirmed that the phosphate "backbone" was on the outside of the molecule and also gave an insight into its symmetry, in particular that the two helical strands ran in opposite directions.
Watson and Crick were again greatly assisted by more of Franklin's data. This is controversial because Franklin's critical X-ray pattern was shown to Watson and Crick without Franklin's knowledge or permission. Wilkins showed the famous Photo 51 of the much simpler B type of DNA to Watson at his lab immediately after Watson had been unsuccessful in asking Franklin to collaborate to beat Pauling in finding the structure.
From the data in photograph 51 Watson and Crick were able to discern that not only was the distance between the two strands constant, but also to measure its exact value of 2 nanometres. The same photograph also gave them the 3.4 nanometre-per-10 bp "pitch" of the helix.
The final insight came when Crick and Watson saw that a complementary pairing of the bases could provide an explanation for Chargaff's puzzling finding. However the structure of the bases had been incorrectly guessed in the textbooks as the enol tautomer when they were more likely to be in the keto form. When Jerry Donohue pointed this fallacy out to Watson, Watson quickly realised that the pairs of adenine and thymine, and guanine and cytosine were almost identical in shape and so would provide equally sized 'rungs' between the two strands. Watson and Crick worked to develop a physical model of the double-helical structure out of wire which they used to confirm that the distances between the molecules were permissable. With the base-pairing, the Watson and Crick quickly converged upon a model, which they announced before Franklin herself had published any of her work.
Franklin was herself two steps away from the solution. She had not guessed the base-pairing and had not appreciated the implications of the symmetry that she had described. However she had been working almost alone and did not have regular contact with a partner like Crick and Watson, and with other experts such as Jerry Donohue. Her notebooks show that she was aware both of Jerry Donohue's work concerning tautomeric forms of bases (she had used the keto forms for three of the bases) and of Chargaff's work.
The disclosure of Franklin's data to Watson has angered some people who believe Franklin did not receive due credit at the time and that she might have discovered the structure on her own before Crick and Watson. In Crick and Watson's famous paper in Nature in 1953, they said that their work had been stimulated by the work of Wilkins and Franklin, whereas it had been the basis of their work. However they had agreed with Wilkins and Franklin that they all should publish papers in the same issue of Nature in support of the proposed structure. Additionally, in his autobiography, The Double Helix, Watson describes Franklin in very unflattering terms (commenting derisively on her lack of "feminine" traits) and all but implies that her work actually impaired that of Wilkins.
Watson, Crick, and Wilkins were awarded the 1962 Nobel Prize for Physiology or Medicine for discovering the molecular structure of DNA, by which time Franklin had died from cancer at 37. Nobel prizes are not awarded posthumously; had she lived, the difficult decision over whom to jointly award the prize would have been complicated as the prize can only be shared between a maximum of three; but because their work could be considered to be chemistry, it is conceivable that Wilkins and Franklin could have been awarded the Nobel Prize for Chemistry instead; see Graeme Hunter's biography of Sir Lawrence Bragg for more information on how scientists were nominated for Nobel Prizes.
Forensic scientists can use DNA located in blood, semen, skin, saliva or hair left at the scene of a crime to identify a possible suspect, a process called genetic fingerprinting or DNA profiling. In DNA profiling the relative lengths of sections of repetitive DNA, such as short tandem repeats and minisatellites, are compared. DNA profiling was developed in 1984 by English geneticist Alec Jeffreys of the University of Leicester, and was first used to convict Colin Pitchfork in 1988 in the Enderby murders case in Leicestershire, England. Many jurisdictions require convicts of certain types of crimes to provide a sample of DNA for inclusion in a computerized database. This has helped investigators solve old cases where the perpetrator was unknown and only a DNA sample was obtained from the scene (particularly in rape cases between strangers). This method is one of the most reliable techniques for identifying a criminal, but is not always perfect, for example if no DNA can be retrieved, or if the scene is contaminated with the DNA of several possible suspects.
DNA plays an important role in computer science, bioinformatics and computational biology, both as a motivating research problem and as a method of computation in itself.
Research on string searching algorithms, which find an occurrence of a sequence of letters inside a larger sequence of letters, was motivated in part by DNA research, where it is used to find specific sequences of nucleotides in a large sequence.Gusfield, Dan. Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, 15 January 1997. ISBN 0521585198. In other applications such as text editors, even simple algorithms for this problem usually suffice, but DNA sequences cause these algorithms to exhibit near-worst-case behavior due to their small number of distinct characters.
Database theory has been influenced by DNA research, which poses special problems for storing and manipulating DNA sequences. Databases specialized for DNA research are called genomic databases, and must address a number of unique technical challenges associated with the operations of approximate matching, sequence comparison, finding repeating patterns, and homology searching.
In 1994, Leonard Adleman of the University of Southern California made headlines when he discovered a way of solving the directed Hamiltonian path problem, an NP-complete problem, using tools from molecular biology, in particular DNA. The new approach, dubbed DNA computing, has practical advantages over traditional computers in power use, space use, and efficiency, due to its ability to highly parallelize the computation (see parallel computing), although there is labor worth mentioning involved in retrieving the answers. A number of other problems, including simulation of various abstract machines, the boolean satisfiability problem, and the bounded version of the Post correspondence problem, have since been analyzed using DNA computing.
Due to its compactness, DNA also has a theoretical role in cryptography, where in particular it allows unbreakable one-time pads to be efficiently constructed and used.Ashish Gehani, Thomas LaBean and John Reif. DNA-Based Cryptography. Proceedings of the 5th DIMACS Workshop on DNA Based Computers, Cambridge, MA, USA, 14–15 June 1999.
If DNA sequences from different species are compared, then the resulting family tree, or phylogeny can be used to study the evolution of these species. This field of phylogenetics is a powerful tool in evolutionary biology. If DNA sequences within a species are compared, population geneticists can glean information on the history of particular populations. This can be used in studies ranging from ecological genetics to anthropology (for example, DNA evidence is also being used to try to identify the Ten Lost Tribes of IsraelLost Tribes of Israel, NOVA, PBS airdate: 22 February 2000. Transcript available from http://www.pbs.org/wgbh/nova/transcripts/2706israel.html (last accessed on 4 March 2006)).
DNA has also been used to look at fairly recent issues of family relationships, such as establishing some manner of familial relationship between the descendants of Sally Hemings and the family of Thomas Jefferson. This usage is closely related to the use of DNA in criminal investigations detailed above. Indeed, some criminal investigations have been solved when DNA from crime scenes has fortuitously matched relatives of the guilty individual.**
حمض نووي ريبي منقوص الأكسجين | DNA | DNA | ДНК | ADN | DNA | DNA | Dna | DNA | Desoxyribonukleinsäure | Desoksüribonukleiinhape | DNA | ADN | DNA | ADN | Acide désoxyribonucléique | ADN | ADN | DNA | Asam deoksiribonukleat | DNA | DNA | ADN | Acidum deoxyribonucleinicum | Dezoksiribonukleīnskābe | Deoksiribonukleorūgštis | DNS (biológia) | ДНК | DNA | DNA | デオキシリボ核酸 | DNA | Deoksyribonukleinsyre | DNA | Kwas deoksyrybonukleinowy | DNA | ADN | Дезоксирибонуклеиновая кислота | ADN | DNA | Deoxyribonukleová kyselina | Deoksiribonukleinska kislina | ДНК | DNK | DNA | DNA | DNA | DNA | ஆக்சிஜனற்ற ரைபோ கரு அமிலம் | ดีเอ็นเอ | DNA | DNA | ДНК | ڈی این اے | 脱氧核糖核酸