Genome Taxonomy Database

Genome Taxonomy Database
Content
DescriptionProposed prokaryotic nomenclature
Contact
Research centerAustralian Centre for Ecogenomics, University of Queensland
Authors
  • Phil Hugenholtz
  • Maria Chuvochina
  • Christian Rinke
Primary citationPMID 30148503
Release date2018
Access
Websitegtdb.ecogenomic.org
Miscellaneous
LicenseCC BY-SA 4.0
VersionR07/RS207 (8 April 2022)
Curation policymixed

The Genome Taxonomy Database (GTDB) is an online database that maintains information on a proposed nomenclature of prokaryotes, following a phylogenomic approach based on a set of conserved single-copy proteins. In addition to resolving paraphyletic groups, this method also reassigns taxonomic ranks algorithmically, updating names in both cases.[1] Information for archaea was added in 2020,[2] along with a species classification based on average nucleotide identity.[3] Each update incorporates new genomes as well as automated and manual curation of the taxonomy.[4]

An open-source tool called GTDB-Tk is available to classify draft genomes into the GTDB hierarchy.[5] The GTDB system, via GTDB-Tk, has been used to catalogue not-yet-named bacteria in the human gut microbiome and other metagenomic sources.[6][7]

The GTDB is incorporated into the Bergey's Manual of Systematics of Archaea and Bacteria in 2019 as its phylogenomic resource.[8]

Methodology

The genomes used to construct the phylogeny are obtained from NCBI (RefSeq and Genbank), and GTDB releases are indexed to RefSeq releases, starting with release 76. Importantly and increasingly, this dataset includes draft genomes of uncultured microorganisms obtained from metagenomes and single cells, ensuring improved genomic representation of the microbial world. All genomes are independently quality controlled using CheckM before inclusion in GTDB.[9]

Genomes first undergo gene calling to extract genes. The taxonomy is based on trees inferred with FastTree from an aligned concatenated set of 120 single copy marker proteins for Bacteria under a WAG model, and with IQ-TREE from a concatenated set of 53 (since RS207; 122 before) marker proteins for Archaea under the PMSF model. Additional marker sets are also used to cross-validate tree topologies including concatenated ribosomal proteins and ribosomal RNA genes.[9] The relative evolutionary divergence (RED) metric, which determines the taxonomic ranks used, is derived from the two main trees by the PhyloRank program.[1]

Species are deliminated using average nucleotide identity and alignment fraction, both calculated by skani. For species existing in a previous release, GTDB compares the quality and position of two genomes and may decide to switch to a new species representative genome.[9]

Taxomony comes from the following sources:

GTDB personnel curates the taxonomy from the aforementioned sources by checking them against the results of PhyloRank and the tree.

  • The tree node corresponding to a taxon name may have a RED inappropriate for its rank. The name may either be moved onto another node or (by changing the Latin suffix) into a different rank.[1]
    • Splitting may happen on the level of species or genera if the divergence turns out too high. Doing so creates new taxa.[3]
  • The taxon may turn out to be polyphyletic. The curator first restricts the taxon to the clade containing its type material. A new taxon is created for each of the other clades.[1]

For the each new taxon, the curators try to find a proposed name in literature for it. If there is no name proposed, the taxon is given a placeholder name by adding a suffix to the original name, e.g. Lactobacillus gasseri_A. After "Z" comes "AA".[1]

Contents of the database

Each release contains:[10]

  • Taxonomy tables containing the assignment of all included genome assemblies to the phylum-to-species taxonomy. (One per domain.)
  • Files containing the metadata given to each genome assembly, including original taxonomy from NCBI, original strain identifier, GTDB taxonomy, quality estimates, and presence of important genes (tRNA and rRNA). (One per domain.)
  • Species tree Newick files containing the species-representative genomes (1 per species), built as described in the previous section. (One per domain.)
  • For species-representative genomes:
    • alignments of marker genes identified from these genomes
    • file containing one 16S rRNA sequence from each species
    • tarballs containing amino acid and nucleotide versions of all predicted genes in these genomes
    • tarball containing the full contents of all these genomes
  • For all genomes that pass quality check:
    • alignments of marker genes identified from these genomes
    • file containing all 16S rRNA sequences identified from these genomes
  • Auxiliary files; see the full FILE_DESCRIPTIONS.txt。

The web interface displays a tree based on the taxonomy (not the entire Newick file), down to the genome assembly level. Each genome assembly has a page detailing its metadata and a history of how it's classified in each GTDB release. There is a search functionality.

See also

References

  1. ^ a b c d e f g h Parks, DH; Chuvochina, M; Waite, DW; Rinke, C; Skarshewski, A; Chaumeil, PA; Hugenholtz, P (November 2018). "A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life". Nature Biotechnology. 36 (10): 996–1004. bioRxiv 10.1101/256800. doi:10.1038/nbt.4229. PMID 30148503. S2CID 52093100.
  2. ^ Rinke, Christian; Chuvochina, Maria; Mussig, Aaron J.; Chaumeil, Pierre-Alain; Davín, Adrián A.; Waite, David W.; Whitman, William B.; Parks, Donovan H.; Hugenholtz, Philip (21 June 2021). "A standardized archaeal taxonomy for the Genome Taxonomy Database" (PDF). Nature Microbiology. 6 (7): 946–959. doi:10.1038/s41564-021-00918-8. ISSN 2058-5276. PMID 34155373. S2CID 235595884.
  3. ^ a b Parks, DH; Chuvochina, M; Chaumeil, PA; Rinke, C; Mussig, AJ; Hugenholtz, P (September 2020). "A complete domain-to-species taxonomy for Bacteria and Archaea". Nature Biotechnology. 38 (9): 1079–1086. bioRxiv 10.1101/771964. doi:10.1038/s41587-020-0501-8. PMID 32341564. S2CID 216560589.
  4. ^ For information on each update, see relevant change logs. For notable, paper-worthy changes, see "Cite GTDB" section on the About page.
  5. ^ Chaumeil, PA; Mussig, AJ; Hugenholtz, P; Parks, DH (15 November 2019). "GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database". Bioinformatics. 36 (6): 1925–1927. doi:10.1093/bioinformatics/btz848. PMC 7703759. PMID 31730192.
  6. ^ Almeida, Alexandre; Nayfach, Stephen; Boland, Miguel; Strozzi, Francesco; Beracochea, Martin; Shi, Zhou Jason; Pollard, Katherine S.; Sakharova, Ekaterina; Parks, Donovan H.; Hugenholtz, Philip; Segata, Nicola; Kyrpides, Nikos C.; Finn, Robert D. (20 July 2020). "A unified catalog of 204,938 reference genomes from the human gut microbiome". Nature Biotechnology. 39 (1): 105–114. doi:10.1038/s41587-020-0603-3. PMC 7801254. PMID 32690973.
  7. ^ Nayfach, Stephen; et al. (9 November 2020). "A genomic catalog of Earth's microbiomes". Nature Biotechnology. 39 (4): 499–509. doi:10.1038/s41587-020-0718-6. PMC 8041624. PMID 33169036.
  8. ^ "Incorporation of Phylogenomics into BMSAB". Bergey's Manual Trust.
  9. ^ a b c "METHODS.txt (GTDB release 220)". data.gtdb.ecogenomic.org. 2024.
  10. ^ "220.0/FILE_DESCRIPTIONS.txt".

Further reading

  • AnnoTree - third-party tool for visualization of genome annotations using the GTDB (R95 or R214) species tree.

Read other articles:

Gocheok Sky Dome Informasi stadionPemilikKota SeoulLokasiLokasi430, Gyeongin-ro, Gocheok-dong, Guro-gu, Seoul, Korea SelatanKoordinat37°29′53.6″N 126°52′02.1″E / 37.498222°N 126.867250°E / 37.498222; 126.867250Koordinat: 37°29′53.6″N 126°52′02.1″E / 37.498222°N 126.867250°E / 37.498222; 126.867250KonstruksiMulai pembangunanFebruari 2009Dibuat2009–2015Dibuka15 September 2015Biaya pembuatan240 miliar wonArsitekIlgeon Arch...

 

 

Penyuntingan Artikel oleh pengguna baru atau anonim untuk saat ini tidak diizinkan.Lihat kebijakan pelindungan dan log pelindungan untuk informasi selengkapnya. Jika Anda tidak dapat menyunting Artikel ini dan Anda ingin melakukannya, Anda dapat memohon permintaan penyuntingan, diskusikan perubahan yang ingin dilakukan di halaman pembicaraan, memohon untuk melepaskan pelindungan, masuk, atau buatlah sebuah akun. Artikel ini membutuhkan rujukan tambahan agar kualitasnya dapat dipastikan. Mohon...

 

 

Japanese casual-wear designer, manufacturer and retailer Uniqlo Co., Ltd.株式会社ユニクロUniqlo flagship store in UmedaCompany typeSubsidiaryIndustryFashionFounded2 June 1984; 39 years ago (1984-06-02)Headquarters717-1, Sayama, Yamaguchi City, Yamaguchi 754-0894, JapanKey peopleTadashi Yanai (CHM, Pres., CEO)Takahiro Wakabayashi (SVP)ProductsClothingaccessoriesNumber of employees30,000+ employeesParentFast Retailing Co., Ltd.(2005–present)Websiteuniqlo.com Uniqlo C...

Giambattista Tubi, nama lain Jean-Baptiste Tuby (lahir di Roma, 1635 - meninggal di Paris, 9 Agustus 1700) adalah seorang pematung Prancis asal Italia. Kehidupan Tuby sampai di Paris pada tahun 1660, lantas bekerja di bawah Charles Le Brun di Pabrik Gobelin. Berdasarkan rancangan Le Brun, Tuby merealisasikan tatahan batu granit di panggung besar choir gereja Saint-Severin di Paris. Dia bekerja bersama Antoine Coysevox, di bawah pimpinan Mazarin; untuknya, ia mengerjakan figur patung La Fidél...

 

 

Синелобый амазон Научная классификация Домен:ЭукариотыЦарство:ЖивотныеПодцарство:ЭуметазоиБез ранга:Двусторонне-симметричныеБез ранга:ВторичноротыеТип:ХордовыеПодтип:ПозвоночныеИнфратип:ЧелюстноротыеНадкласс:ЧетвероногиеКлада:АмниотыКлада:ЗавропсидыКласс:Пт�...

 

 

Egyptian nationalist ideology Taha Hussein, one of the chief promulgators of Pharaonism. Pharaonism was an ideology that rose to prominence in Egypt in the 1920s and 1930s. A version of Egyptian nationalism, it argued for the existence of an Egyptian national continuity from ancient times to the modern era, stressing the role of ancient Egypt and incorporating anti-colonial sentiment.[1] Pharaonism's most notable advocate was Taha Hussein. The movement largely faded by the 1940s, havi...

Lagu Kebangsaan Lebanonالنشيد الوطني اللبنانيLagu kebangsaan  LebanonPenulis lirikRashid NakhleKomponisWadih Sabra, 1925Penggunaan12 Juli 1927Sampel audioKulluna lil watan (instrumental)berkasbantuan Sampel audioLagu kebangsaan Lebanonberkasbantuan Koullouna Lilouataan Lil Oula Lil Alam adalah lagu kebangsaan Lebanon, dengan lirik dikarang oleh Rachid Nakhlé dan lagu oleh Wadih Sabra. Lirik Arab كلنـا للوطـن للعـلى للعـلم ملء عين الزّ...

 

 

此條目可参照英語維基百科相應條目来扩充。 (2021年5月6日)若您熟悉来源语言和主题,请协助参考外语维基百科扩充条目。请勿直接提交机械翻译,也不要翻译不可靠、低品质内容。依版权协议,译文需在编辑摘要注明来源,或于讨论页顶部标记{{Translated page}}标签。 约翰斯顿环礁Kalama Atoll 美國本土外小島嶼 Johnston Atoll 旗幟颂歌:《星條旗》The Star-Spangled Banner約翰斯頓環礁�...

 

 

الشهيد ديدوش مراد   معلومات شخصية الميلاد 13 يوليو 1927(1927-07-13)المرادية،  الجزائر الوفاة 18 يناير 1955 (27 سنة)زيغود يوسف  الجنسية جزائري والدان ديدوش سعيد الحياة العملية المهنة عسكري  الحزب حزب الشعب الجزائري  اللغات العربية  الخدمة العسكرية الرتبة عقيد  المعار�...

توين بريدج     الإحداثيات 45°32′41″N 112°19′54″W / 45.544722222222°N 112.33166666667°W / 45.544722222222; -112.33166666667   [1] تقسيم إداري  البلد الولايات المتحدة[2]  التقسيم الأعلى مقاطعة ماديسون  خصائص جغرافية  المساحة 2.627079 كيلومتر مربع2.475635 كيلومتر مربع (1 أبريل 2010)  �...

 

 

First Summit of the Non-Aligned Movement, Belgrade The Socialist Federal Republic of Yugoslavia was one of the founding members of the Non-Aligned Movement. Its capital, Belgrade, was the host of the First Summit of the Non-Aligned Movement in early September 1961. The city also hosted the Ninth Summit in September 1989. Non-alignment and active participation in the movement was the corner-stone of the Cold War foreign policy and ideology of the Yugoslav federation.[1] As the only Eur...

 

 

Dual impedance and dual network are terms used in electronic network analysis. The dual of an impedance Z {\displaystyle Z} is its reciprocal, or algebraic inverse Z ′ = 1 Z {\displaystyle Z'={\frac {1}{Z}}} . For this reason, the dual impedance is also called the inverse impedance. Another way of stating this is that the dual of Z {\displaystyle Z} is the admittance Y ′ = Z ′ {\displaystyle Y'=Z'} . The dual of a network is the network whose impedances are the duals of ...

Church in Lancashire, EnglandSt Mary's Church, Yealand ConyersWest end of St Mary's Church, Yealand ConyersSt Mary's Church, Yealand ConyersLocation in the City of Lancaster district54°09′38″N 2°45′41″W / 54.1605°N 2.7614°W / 54.1605; -2.7614OS grid referenceSD 504,741LocationYealand Conyers, LancashireCountryEnglandDenominationRoman CatholicWebsiteSt Mary, Yealand ConyersHistoryFounder(s)Richard GillowDedicationSaint MaryArchitectureArchitect(s)E. G....

 

 

Neighbors 2: Sorority RisingPoster resmiSutradaraNicholas StollerProduser Seth Rogen Evan Goldberg James Weaver Ditulis oleh Andrew Jay Cohen Brendan O'Brien Nicholas Stoller Evan Goldberg Seth Rogen Pemeran Seth Rogen Zac Efron Rose Byrne Chloë Grace Moretz Dave Franco Ike Barinholtz Penata musikMichael AndrewsSinematograferBrandon TrostPenyuntingZene BakerPerusahaanproduksi Good Universe Perfect World Pictures Point Grey Pictures DistributorUniversal PicturesTanggal rilis 20 Mei 2016...

 

 

الخطوط الجوية الفلبينية   إياتاPR  إيكاوPAL  رمز النداء؟؟ تاريخ الإنشاء 1941  الجنسية الفلبين  المطارات الثانوية مطار كلارك الدولي[1]مطار ماكتان سيبو الدولي[2]مطار فرانسيسكو بانجوي الدولي (Davao)[3] مدن التركيز مطار كاليبو الدوليمطار تايوان تاويوان الدول�...

Katedral Blagoveshchensk Eparki Blagoveshchensk dan Tynda adalah sebuah eparki Gereja Ortodoks Rusia yang terletak di Blagoveshchensk dan Tynda, Federasi Rusia. Eparki tersebut didirikan pada 1993.[1] Ordinaris Gabriel (Stebluczenko), 1994–2011[2] Lucjan (Kucenko), sejak 2011[3] Referensi ^ http://www.patriarchia.ru/db/text/31102.html ^ Гавриил, архиепископ (на покое) (Стеблюченко Юрий Григорьевич) ^ Лукиан, а...

 

 

American lecturer, photographer and journalist (1858–1923) George Wharton JamesBorn27 September 1858Lincolnshire, EnglandDied1923Occupationlecturer, photographer, journalistSubjectCalifornia and the American Southwest George Wharton James (27 September 1858[1] – 8 November 1923)[2] was an American popular lecturer, photographer, journalist and editor. Born in Lincolnshire, England, he emigrated to the United States as a young man after being ordained as a Methodist ministe...

 

 

Questa voce sull'argomento Formula 1 è solo un abbozzo. Contribuisci a migliorarla secondo le convenzioni di Wikipedia. Segui i suggerimenti del progetto di riferimento. Motori ModerniFornitore dimotori Stagioni disputate1985-1987, 1990 GP disputati48 GP vinti0 Pole position0 Giri più veloci{{{giri veloci}}} Motori Moderni è stata un'azienda specializzata nella progettazione e costruzione di motori da competizione, in particolare per Formula 1 dove fu attiva tra il 1985 e il 1987 e d...

Questa voce o sezione sull'argomento Tunisia non cita le fonti necessarie o quelle presenti sono insufficienti. Puoi migliorare questa voce aggiungendo citazioni da fonti attendibili secondo le linee guida sull'uso delle fonti. شعار تونس Lo stemma della Tunisia (شعار تونس) è stato adottato il 21 giugno 1956. Da allora ha subito alcune piccole modifiche, la più recente delle quali risalente al 1963. Descrizione Lo scudo in oro è bipartito; nella prima partizione c'è u...

 

 

Questa voce sull'argomento calciatori cechi è solo un abbozzo. Contribuisci a migliorarla secondo le convenzioni di Wikipedia. Segui i suggerimenti del progetto di riferimento. Vlastimil HrubýNazionalità Rep. Ceca Altezza187 cm Peso88 kg Calcio Ruoloportiere Squadra Zbrojovka Brno CarrieraGiovanili  Zbrojovka Brno Squadre di club1 2004 Zbrojovka Brno2004-2005→ Tatran Kohoutovice2005→  Xaverov2005-2006 Zbrojovka Brno2006-2007→  Dosta Bystrc2007...