Ancient DNA (aDNA) is DNA isolated from ancient sources (typically specimens, but also environmental DNA).[1][2] Due to degradation processes (including cross-linking, deamination and fragmentation)[3] ancient DNA is more degraded in comparison with contemporary genetic material.[4] Genetic material has been recovered from paleo/archaeological and historical skeletal material, mummified tissues, archival collections of non-frozen medical specimens, preserved plant remains, ice and from permafrost cores, marine and lake sediments and excavation dirt.
Even under the best preservation conditions, there is an upper boundary of 0.4–1.5 million years for a sample to contain sufficient DNA for sequencing technologies.[5] The oldest DNA sequenced from physical specimens are from mammoth molars in Siberia over 1 million years old.[6] In 2022, two-million year old genetic material was recovered from sediments in Greenland, and is currently considered the oldest DNA discovered so far.[7][8]
History of ancient DNA studies
1980s
The first study of what would come to be called aDNA was conducted in 1984, when Russ Higuchi and colleagues at the University of California, Berkeley reported that traces of DNA from a museum specimen of the Quagga not only remained in the specimen over 150 years after the death of the individual, but could be extracted and sequenced.[9] Over the next two years, through investigations into natural and artificially mummified specimens, Svante Pääbo confirmed that this phenomenon was not limited to relatively recent museum specimens but could apparently be replicated in a range of mummified human samples that dated as far back as several thousand years.[10][11][12]
The laborious processes that were required at that time to sequence such DNA (through bacterial cloning) were an effective brake on the study of ancient DNA (aDNA) and the field of museomics. However, with the development of the Polymerase Chain Reaction (PCR) in the late 1980s, the field began to progress rapidly.[13][14][15] Double primer PCR amplification of aDNA (jumping-PCR) can produce highly skewed and non-authentic sequence artifacts. Multiple primer, nested PCR strategy was used to overcome those shortcomings.
1990s
The post-PCR era heralded a wave of publications as numerous research groups claimed success in isolating aDNA. Soon a series of incredible findings had been published, claiming authentic DNA could be extracted from specimens that were millions of years old, into the realms of what Lindahl (1993b) has labelled Antediluvian DNA.[16] The majority of such claims were based on the retrieval of DNA from organisms preserved in amber. Insects such as stingless bees,[17][18] termites,[19] and wood gnats,[20] as well as plant[21] and bacterial[22] sequences were said to have been extracted from Dominican amber dating to the Oligocene epoch. Still older sources of Lebanese amber-encased weevils, dating to within the Cretaceous epoch, reportedly also yielded authentic DNA.[23] Claims of DNA retrieval were not limited to amber.
Reports of several sediment-preserved plant remains dating to the Miocene were published.[24][25] Then in 1994, Woodward et al. reported what at the time was called the most exciting results to date[26] — mitochondrial cytochrome b sequences that had apparently been extracted from dinosaur bones dating to more than 80 million years ago. When in 1995 two further studies reported dinosaur DNA sequences extracted from a Cretaceous egg,[27][28] it seemed that the field would revolutionize knowledge of the Earth's evolutionary past. Even these extraordinary ages were topped by the claimed retrieval of 250-million-year-old halobacterial sequences from halite.[29][30]
The development of a better understanding of the kinetics of DNA preservation, the risks of sample contamination and other complicating factors led the field to view these results more skeptically. Numerous careful attempts failed to replicate many of the findings, and all of the decade's claims of multi-million year old aDNA would come to be dismissed as inauthentic.[31]
2000s
Single primer extension amplification was introduced in 2007 to address postmortem DNA modification damage.[32] Since 2009 the field of aDNA studies has been revolutionized with the introduction of much cheaper research techniques.[33] The use of high-throughput Next Generation Sequencing (NGS) techniques in the field of ancient DNA research has been essential for reconstructing the genomes of ancient or extinct organisms. A single-stranded DNA (ssDNA) library preparation method has sparked great interest among ancient DNA (aDNA) researchers.[34][35]
In addition to these technical innovations, the start of the decade saw the field begin to develop better standards and criteria for evaluating DNA results, as well as a better understanding of the potential pitfalls.[31][36]
2020s
Autumn of 2022, the Nobel Prize of Physiology or Medicine was awarded to Svante Pääbo "for his discoveries concerning the genomes of extinct hominins and human evolution".[37] A few days later, on the 7th of December 2022, a study in Nature reported that two-million year old genetic material was found in Greenland, and is currently considered the oldest DNA discovered so far.[7][8]
Problems and errors
Degradation processes
Due to degradation processes (including cross-linking, deamination and fragmentation),[3] ancient DNA is of lower quality than modern genetic material.[4] The damage characteristics and ability of aDNA to survive through time restricts possible analyses and places an upper limit on the age of successful samples.[4] There is a theoretical correlation between time and DNA degradation,[38] although differences in environmental conditions complicate matters. Samples subjected to different conditions are unlikely to predictably align to a uniform age-degradation relationship.[39] The environmental effects may even matter after excavation, as DNA decay-rates may increase,[40] particularly under fluctuating storage conditions.[41] Even under the best preservation conditions, there is an upper boundary of 0.4 to 1.5 million years for a sample to contain sufficient DNA for contemporary sequencing technologies.[5]
Research into the decay of mitochondrial and nuclear DNA in moa bones has modelled mitochondrial DNA degradation to an average length of 1 base pair after 6,830,000 years at −5 °C.[4] The decay kinetics have been measured by accelerated aging experiments, further displaying the strong influence of storage temperature and humidity on DNA decay.[42] Nuclear DNA degrades at least twice as fast as mtDNA. Early studies that reported recovery of much older DNA, for example from Cretaceousdinosaur remains, may have stemmed from contamination of the sample.
Age limit
A critical review of ancient DNA literature through the development of the field highlights that few studies have succeeded in amplifying DNA from remains older than several hundred thousand years.[43] A greater appreciation for the risks of environmental contamination and studies on the chemical stability of DNA have raised concerns over previously reported results. The alleged dinosaur DNA was later revealed to be human Y-chromosome.[44] The DNA reported from encapsulated halobacteria has been criticized based on its similarity to modern bacteria, which hints at contamination,[36] or they may be the product of long-term, low-level metabolic activity.[45]
aDNA may contain a large number of postmortem mutations, increasing with time. Some regions of polynucleotide are more susceptible to this degradation, allowing erroneous sequence data to bypass statistical filters used to check the validity of data.[31] Due to sequencing errors, great caution should be applied to interpretation of population size.[46] Substitutions resulting from deamination of cytosine residues are vastly over-represented in the ancient DNA sequences. Miscoding of C to T and G to A accounts for the majority of errors.[47]
Contamination
Another problem with ancient DNA samples is contamination by modern human DNA and by microbial DNA (most of which is also ancient).[48][49] New methods have emerged in recent years to prevent possible contamination of aDNA samples, including conducting extractions under extreme sterile conditions, using special adapters to identify endogenous molecules of the sample (distinguished from those introduced during analysis), and applying bioinformatics to resulting sequences based on known reads in order to approximate rates of contamination.[50][51]
Authentication of aDNA
Development in the aDNA field in the 2000s increased the importance of authenticating recovered DNA to confirm that it is indeed ancient and not the result of recent contamination. As DNA degrades over time, the nucleotides that make up the DNA may change, especially at the ends of the DNA molecules. The deamination of cytosine to uracil at the ends of DNA molecules has become a way of authentication. During DNA sequencing, the DNA polymerases will incorporate an adenine (A) across from the uracil (U), leading to cytosine (C) to thymine (T) substitutions in the aDNA data.[52] These substitutions increase in frequency as the sample gets older. Frequency measurement of the C-T level, ancient DNA damage, can be made using various software such as mapDamage2.0 or PMDtools [53][54] and interactively on metaDMG.[55] Due to hydrolytic depurination, DNA fragments into smaller pieces, leading to single-stranded breaks. Combined with the damage pattern, this short fragment length can also help differentiate between modern and ancient DNA.[56][57]
Non-human aDNA
Despite the problems associated with 'antediluvian' DNA, a wide and ever-increasing range of aDNA sequences have now been published from a range of animal and plant taxa. Tissues examined include artificially or naturally mummified animal remains,[9][58] bone,[59][60][61][62] shells,[63] paleofaeces,[64][65] alcohol preserved specimens,[66] rodent middens,[67] dried plant remains,[68][69] and recently, extractions of animal and plant DNA directly from soil samples.[70]
Researchers in 2016 measured chloroplast DNA in marine sediment cores, and found diatom DNA dating back to 1.4 million years.[76] This DNA had a half-life significantly longer than previous research, of up to 15,000 years. Kirkpatrick's team also found that DNA only decayed along a half-life rate until about 100 thousand years, at which point it followed a slower, power-law decay rate.[76]
Human aDNA
Due to the considerable anthropological, archaeological, and public interest directed toward human remains, they have received considerable attention from the DNA community. There are also more profound contamination issues, since the specimens belong to the same species as the researchers collecting and evaluating the samples.
Sources
Due to the morphological preservation in mummies, many studies from the 1990s and 2000s used mummified tissue as a source of ancient human DNA. Examples include both naturally preserved specimens, such as the Ötzi the Iceman frozen in a glacier[78] and bodies preserved through rapid desiccation at high altitude in the Andes,[12][79] as well as various chemically treated preserved tissue such as the mummies of ancient Egypt.[80] However, mummified remains are a limited resource. The majority of human aDNA studies have focused on extracting DNA from two sources much more common in the archaeological record: bones and teeth. The bone that is most often used for DNA extraction is the petrous ear bone, since its dense structure provides good conditions for DNA preservation.[81] Several other sources have also yielded DNA, including paleofaeces,[82] and hair.[83][84] Contamination remains a major problem when working on ancient human material.
Ancient pathogen DNA has been successfully retrieved from samples dating to more than 5,000 years old in humans and as long as 17,000 years ago in other species. In addition to the usual sources of mummified tissue, bones and teeth, such studies have also examined a range of other tissue samples, including calcified pleura,[85] tissue embedded in paraffin,[86][87] and formalin-fixed tissue.[88] Efficient computational tools have been developed for pathogen and microorganism aDNA analyses in a small (QIIME[89]) and large scale (FALCON [90]).
Results
Taking preventative measures in their procedure against such contamination though, a 2012 study analyzed bone samples of a Neanderthal group in the El Sidrón cave, finding new insights on potential kinship and genetic diversity from the aDNA.[91] In November 2015, scientists reported finding a 110,000-year-old tooth containing DNA from the Denisovan hominin, an extinctspecies of human in the genus Homo.[92][93]
The research has added new complexity to the peopling of Eurasia. A study from 2018 [94] showed that a Bronze Age mass migration had greatly impacted the genetic makeup of the British Isles, bringing with it the Bell Beaker culture from mainland Europe.
It has also revealed new information about links between the ancestors of Central Asians and the indigenous peoples of the Americas. In Africa, older DNA degrades quickly due to the warmer tropical climate, although, in September 2017, ancient DNA samples, as old as 8,100 years old, have been reported.[95]
Moreover, ancient DNA has helped researchers to estimate modern human divergence.[96] By sequencing African genomes from three Stone Age hunter gatherers (2000 years old) and four Iron Age farmers (300 to 500 years old), Schlebusch and colleagues were able to push back the date of the earliest divergence between human populations to 350,000 to 260,000 years ago.
^ abPääbo S (1986). "Molecular genetic investigations of ancient human remains". Cold Spring Harbor Symposia on Quantitative Biology. 51 (Pt 1): 441–6. doi:10.1101/SQB.1986.051.01.053. PMID3107879.
^Cano RJ, Poinar H, Poinar Jr GO (1992a). "Isolation and partial characterisation of DNA from the bee Problebeia dominicana (Apidae:Hymenoptera) in 25–40 million year old amber". Med Sci Res. 20: 249–51.
^Cano RJ, Poinar HN, Roubik DW, Poinar Jr GO (1992b). "Enzymatic amplification and nucleotide sequencing of portions of the 18S rRNA gene of the bee Problebeia dominicana (Apidae:Hymenoptera) isolated from 25–40 million year old Dominican amber". Med Sci Res. 20: 619–22.
^Golenberg EM (September 1991). "Amplification and analysis of Miocene plant fossil DNA". Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 333 (1268): 419–26, discussion 426–7. doi:10.1098/rstb.1991.0092. PMID1684052.
^An CC, Li Y, Zhu YX (1995). "Molecular cloning and sequencing of the 18S rDNA from specialized dinosaur egg fossil found in Xixia Henan, China". Acta Sci Nat Univ Pekinensis. 31: 140–47.
^Li Y, An C-C, Zhu Y-X (1995). "DNA isolation and sequence analysis of dinosaur DNA from Cretaceous dinosaur egg in Xixia Henan, China". Acta Sci Nat Univ Pekinensis. 31: 148–52.
^
Grass RN, Heckel R, Puddu M, Paunescu D, Stark WJ (February 2015). "Robust chemical preservation of digital information on DNA in silica with error-correcting codes". Angewandte Chemie. 54 (8): 2552–5. doi:10.1002/anie.201411378. PMID25650567.
^Michelsen C, Pedersen MW, Fernandez-Guerra A, Zhao L, Petersen TC, Korneliussen TS (9 December 2022). "metaDMG – A Fast and Accurate Ancient DNA Damage Toolkit for Metagenomic Data". bioRxiv: 2022.12.06.519264. doi:10.1101/2022.12.06.519264. S2CID254536966.
^Poinar HN, Hofreiter M, Spaulding WG, Martin PS, Stankiewicz BA, Bland H, et al. (July 1998). "Molecular coproscopy: dung and diet of the extinct ground sloth Nothrotheriops shastensis". Science. 281 (5375): 402–6. Bibcode:1998Sci...281..402P. doi:10.1126/science.281.5375.402. PMID9665881.
^Orlando L, Ginolhac A, Zhang G, Froese D, Albrechtsen A, Stiller M, et al. (July 2013). "Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse". Nature. 499 (7456): 74–8. Bibcode:2013Natur.499...74O. doi:10.1038/nature12323. PMID23803765. S2CID4318227.
^Baker LE (2001). Mitochondrial DNA haplotype and sequence analysis of historic Choctaw and Menominee hair shaft samples (PhD thesis). University of Tennessee, Knoxville.
^Donoghue HD, Spigelman M, Zias J, Gernaey-Child AM, Minnikin DE (1998). "Mycobacterium tuberculosis complex DNA in calcified pleura from remains 1400 years old". Lett Appl Microbiol. 27 (5): 265–69. doi:10.1046/j.1472-765x.1998.t01-8-00449.x. PMID9830142.
^Lalueza-Fox C, Rosas A, de la Rasilla M (January 2012). "Palaeogenetic research at the El Sidrón Neanderthal site". Annals of Anatomy - Anatomischer Anzeiger. Special Issue: Ancient DNA. 194 (1): 133–7. doi:10.1016/j.aanat.2011.01.014. hdl:10261/79609. PMID21482084.
Orlando L (June 2014). "A 400,000-year-old mitochondrial genome questions phylogenetic relationships amongst archaic hominins: using the latest advances in ancient genomics, the mitochondrial genome sequence of a 400,000-year-old hominin has been deciphered". BioEssays. 36 (6): 598–605. doi:10.1002/bies.201400018. PMID24706482. S2CID35786511.