1000 Genomes Project

The 1000 Genomes Project (1KGP), which took place from January 2008 to 2015, was an international research effort to establish the most detailed catalogue of human genetic variation at the time. Scientists planned to sequence the genomes of at least one thousand anonymous healthy participants from a number of different ethnic groups within the following three years, using newly developed technologies. In 2010, the project finished its pilot phase, which was described in detail in a publication in the journal Nature.[1] In 2012, the sequencing of 1092 genomes was announced in a Nature publication.[2] In 2015, two papers in Nature reported the results and completion of the project, along with opportunities for future research.[3][4]

Many rare variations, restricted to closely related groups, were identified, and eight structural-variation classes were analyzed.[5]

The project united multidisciplinary research teams from institutes around the world, including China, Italy, Japan, Kenya, Nigeria, Peru, the United Kingdom, and the United States. These teams contributed to the sequence dataset and to a refined human genome map, both freely accessible through public databases to the scientific community and the general public alike.[2]

The International Genome Sample Resource was created to host and expand on the data set after the project's end.[6]

Changes in the number and order of genes (A-D) create genetic diversity within and between populations.

Background

Since the completion of the Human Genome Project, advances in human population genetics and comparative genomics have enabled further insight into genetic diversity.[7] Understanding of structural variations (insertions/deletions (indels), copy number variations (CNVs), retroelements), single-nucleotide polymorphisms (SNPs), and natural selection was being solidified.[8][9][10][11]

The diversity of human genetic variation, such as indels, was being uncovered, and human genomic variation was under active investigation.[citation needed]

Natural selection

The project also aimed to provide evidence that can be used to explore the impact of natural selection on population differences. Patterns of DNA polymorphisms can be used to reliably detect signatures of selection and may help to identify genes that might underlie variation in disease resistance or drug metabolism.[12][13] Such insights could improve understanding of phenotypic variations, genetic disorders and Mendelian inheritance, and of their effects on the survival and reproduction of different human populations.

Project description

Goals

The 1000 Genomes Project was designed to bridge the gap in knowledge between rare genetic variants that have a severe effect, predominantly on simple traits (e.g. cystic fibrosis, Huntington disease), and common genetic variants that have a mild effect and are implicated in complex traits (e.g. cognition, diabetes, heart disease).[14]

The primary goal of this project was to create a complete and detailed catalogue of human genetic variations, which can be used for association studies relating genetic variation to disease. The consortium aimed to discover more than 95% of the variants (e.g. SNPs, CNVs, indels) with minor allele frequencies as low as 1% across the genome and 0.1–0.5% in gene regions, as well as to estimate the population frequencies, haplotype backgrounds and linkage disequilibrium patterns of variant alleles.[15]
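As a rough illustration of why a sample of about a thousand genomes suits a 1% frequency target, the sketch below computes the probability that an allele of a given frequency is present at least once in the sample. It is a simplified model, not the consortium's power calculation: it assumes 2N independently sampled chromosomes from N diploid individuals and ignores sequencing error and coverage, so presence in the sample is only an upper bound on detection. The function name is illustrative.

```python
def prob_variant_sampled(maf: float, n_individuals: int) -> float:
    """Probability that an allele with frequency `maf` occurs at least once
    among the 2 * n_individuals chromosomes of a diploid sample.

    Assumption: chromosomes are sampled independently from the population.
    """
    if not 0.0 <= maf <= 1.0:
        raise ValueError("maf must be between 0 and 1")
    n_chromosomes = 2 * n_individuals
    # Complement of "the allele is absent from every sampled chromosome".
    return 1.0 - (1.0 - maf) ** n_chromosomes


if __name__ == "__main__":
    for maf in (0.01, 0.005, 0.001):
        p = prob_variant_sampled(maf, 1000)
        print(f"allele frequency {maf}: P(present in sample) = {p:.4f}")
```

Under these assumptions, an allele at 1% frequency is almost certain to appear in a sample of 1,000 people, while one at 0.1% is missed in a noticeable fraction of samples, which is consistent with the lower frequency targets being limited to gene regions sequenced more deeply.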

Secondary goals included the support of better SNP and probe selection for genotyping platforms in future studies and the improvement of the human reference sequence. The completed database was expected to be a useful tool for studying regions under selection and variation in multiple populations, and for understanding the underlying processes of mutation and recombination.[15]

Outline

The human genome consists of approximately 3 billion DNA base pairs and is estimated to carry around 20,000 protein-coding genes. In designing the study, the consortium needed to address several critical issues regarding the project metrics, such as technology challenges, data quality standards and sequence coverage.[15]

Over the course of the next three years,[clarification needed] scientists at the Sanger Institute, BGI Shenzhen and the National Human Genome Research Institute's Large-Scale Sequencing Network planned to sequence a minimum of 1,000 human genomes. Because of the large amount of sequence data required, recruitment of additional participants continued.[14]

Almost 10 billion bases were to be sequenced per day over the two-year production phase, equating to more than two human genomes every 24 hours. The intended sequence dataset was to comprise 6 trillion DNA bases, 60-fold more sequence data than had been published in DNA databases at the time.[14]
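The figures in this paragraph can be cross-checked with a short calculation. The sketch below assumes a production phase of exactly 730 days and uses the approximate genome size of 3 billion base pairs given earlier in this section. "Genomes per day" here means genome equivalents at 1× coverage, not finished genomes.

```python
GENOME_SIZE_BP = 3_000_000_000      # approximate human genome size in base pairs
TOTAL_BASES = 6_000_000_000_000     # intended dataset: 6 trillion bases
PRODUCTION_DAYS = 2 * 365           # assumption: two years of 365 days each

# Average daily output needed to reach the target within the production phase.
bases_per_day = TOTAL_BASES / PRODUCTION_DAYS

# The same throughput expressed as genome equivalents (1x coverage) per day.
genomes_per_day = bases_per_day / GENOME_SIZE_BP

if __name__ == "__main__":
    print(f"bases per day:   {bases_per_day:.3e}")
    print(f"genomes per day: {genomes_per_day:.2f}")
```

The result, a little over 8 billion bases and between two and three genome equivalents per day, agrees with the "almost 10 billion" and "more than two human genomes" figures quoted above.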

To determine the final design of the full project, three pilot studies were to be carried out within the first year. The first pilot was to sequence 180 people from 3 major geographic groups at low coverage (2×). In the second pilot, the genomes of two nuclear families (both parents and an adult child) were to be sequenced at deep coverage (20× per genome). The third pilot involved sequencing the coding regions (exons) of 1,000 genes in 1,000 people at deep coverage (20×).[14][15]
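Coverage is the average number of times each base of the genome is read, so the raw sequence needed for a pilot is roughly the number of individuals times the coverage times the genome size. The sketch below applies that relation to the first two pilots as planned. It assumes a genome size of 3 billion base pairs and six individuals in the second pilot (two families of three); the third pilot is omitted because the total length of the targeted exons is not given here. The function name is illustrative.

```python
GENOME_SIZE_BP = 3_000_000_000  # approximate human genome size in base pairs


def bases_required(n_individuals: int, coverage: int,
                   target_size_bp: int = GENOME_SIZE_BP) -> int:
    """Approximate raw bases needed to sequence `n_individuals`
    at an average depth of `coverage` over a target of `target_size_bp`."""
    return n_individuals * coverage * target_size_bp


if __name__ == "__main__":
    pilot_1 = bases_required(180, 2)   # low coverage, many individuals
    pilot_2 = bases_required(6, 20)    # deep coverage, two trios
    print(f"pilot 1: {pilot_1:.2e} bases")
    print(f"pilot 2: {pilot_2:.2e} bases")
```

The comparison shows the design trade-off: under these assumptions, sequencing 180 people at 2× needs about three times as much raw sequence as sequencing six people at 20×.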

It was estimated that the project would likely cost more than $500 million if standard DNA sequencing technologies were used. Several newer technologies (e.g. Solexa, 454, SOLiD) were to be applied, lowering the expected costs to between $30 million and $50 million. The major support was to be provided by the Wellcome Trust Sanger Institute in Hinxton, England; the Beijing Genomics Institute, Shenzhen (BGI Shenzhen), China; and the NHGRI, part of the National Institutes of Health (NIH).[14]

In keeping with the Fort Lauderdale principles, all genome sequence data (including variant calls) was made freely available as the project progressed and can be downloaded via FTP from the 1000 Genomes Project webpage.

Human genome samples

Locations of population samples of 1000 Genomes Project.[16] Each circle represents the number of sequences in the final release.

Based on the overall goals for the project, the samples were to be chosen to provide power in populations where association studies for common diseases were being carried out. Furthermore, the samples did not need to have medical or phenotype information, since the proposed catalogue was intended as a basic resource on human variation.[15]

For the pilot studies, human genome samples from the HapMap collection were to be sequenced. It was considered useful to focus on samples that had additional data available (such as ENCODE sequence, genome-wide genotypes, fosmid-end sequence, structural variation assays, and gene expression) so that the results could be compared with those from other projects.[15]

Complying with extensive ethical procedures, the 1000 Genomes Project was then to use samples from volunteer donors. The following populations were to be included in the study: Yoruba in Ibadan (YRI), Nigeria; Japanese in Tokyo (JPT); Chinese in Beijing (CHB); Utah residents with ancestry from northern and western Europe (CEU); Luhya in Webuye, Kenya (LWK); Maasai in Kinyawa, Kenya (MKK); Toscani in Italy (TSI); Peruvians in Lima, Peru (PEL); Gujarati Indians in Houston (GIH); Chinese in metropolitan Denver (CHD); people of Mexican ancestry in Los Angeles (MXL); and people of African ancestry in the southwestern United States (ASW).[14]

ID   Place             Population
ASW  United States*    African Ancestry in Southwestern USA
ACB  Barbados*         African Caribbean in Barbados
BEB  Bangladesh        Bengali in Bangladesh
GBR  United Kingdom    British from England and Scotland
CDX  China             Chinese Dai in Xishuangbanna, China
CLM  Colombia          Colombian in Medellín, Colombia
ESN  Nigeria           Esan in Nigeria
FIN  Finland           Finnish in Finland
GWD  The Gambia        Gambian in Western Division, Mandinka
GIH  United States*    Gujarati Indians in Houston, Texas, United States
CHB  China             Han Chinese in Beijing, China
CHS  China             Han Chinese South, China
IBS  Spain             Iberian populations in Spain
ITU  United Kingdom*   Indian Telugu in the U.K.
JPT  Japan             Japanese in Tokyo, Japan
KHV  Vietnam           Kinh in Ho Chi Minh City, Vietnam
LWK  Kenya             Luhya in Webuye, Kenya
MSL  Sierra Leone      Mende in Sierra Leone
MXL  United States*    Mexican Ancestry in Los Angeles, California, United States
PEL  Peru              Peruvian in Lima, Peru
PUR  Puerto Rico       Puerto Rican in Puerto Rico
PJL  Pakistan          Punjabi in Lahore, Pakistan
STU  United Kingdom*   Sri Lankan Tamil in the U.K.
TSI  Italy             Toscani in Italia
YRI  Nigeria           Yoruba in Ibadan, Nigeria
CEU  United States*    Utah residents with Northern and Western European ancestry from the CEPH collection

* Population that was collected in diaspora

Community meeting

Data generated by the 1000 Genomes Project is widely used by the genetics community, making the first 1000 Genomes Project publication one of the most cited papers in biology.[17] To support this user community, the project held a community analysis meeting in July 2012 that included talks highlighting key project discoveries, their impact on population genetics and human disease studies, and summaries of other large-scale sequencing studies.[18]

Project findings

Pilot phase

The pilot phase consisted of three projects:

  • low-coverage whole-genome sequencing of 179 individuals from 4 populations
  • high-coverage sequencing of 2 trios (mother-father-child)
  • exon-targeted sequencing of 697 individuals from 7 populations

It was found that, on average, each person carries around 250–300 loss-of-function variants in annotated genes and 50–100 variants previously implicated in inherited disorders. Based on the two trios, it is estimated that the rate of de novo germline mutation is approximately 10⁻⁸ per base per generation.[1]
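To put the mutation rate in perspective, the sketch below converts it into an expected number of new mutations per child. This is a back-of-the-envelope estimate rather than a figure reported by the project: it assumes a uniform rate across the genome, a haploid genome size of 3 billion base pairs, and two inherited copies (one from each parent). The function name is illustrative.

```python
import math

MUTATION_RATE = 1e-8            # de novo mutations per base per generation (approximate)
GENOME_SIZE_BP = 3_000_000_000  # approximate haploid genome size in base pairs


def expected_de_novo_mutations(rate: float = MUTATION_RATE,
                               genome_size_bp: int = GENOME_SIZE_BP,
                               ploidy: int = 2) -> float:
    """Expected count of new mutations per individual, assuming the rate
    applies uniformly to every base of each inherited genome copy."""
    return rate * genome_size_bp * ploidy


if __name__ == "__main__":
    print(f"expected de novo mutations per child: {expected_de_novo_mutations():.0f}")
```

Under these assumptions the estimate is on the order of tens of new mutations per generation, out of billions of inherited bases.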

References

  1. ^ a b Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, et al. (October 2010). "A map of human genome variation from population-scale sequencing". Nature. 467 (7319): 1061–73. Bibcode:2010Natur.467.1061T. doi:10.1038/nature09534. PMC 3042601. PMID 20981092.
  2. ^ a b Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, et al. (November 2012). "An integrated map of genetic variation from 1,092 human genomes". Nature. 491 (7422): 56–65. Bibcode:2012Natur.491...56T. doi:10.1038/nature11632. PMC 3498066. PMID 23128226.
  3. ^ Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, et al. (October 2015). "A global reference for human genetic variation". Nature. 526 (7571): 68–74. Bibcode:2015Natur.526...68T. doi:10.1038/nature15393. PMC 4750478. PMID 26432245.
  4. ^ Sudmant PH, Rausch T, Gardner EJ, Handsaker RE, Abyzov A, Huddleston J, et al. (October 2015). "An integrated map of structural variation in 2,504 human genomes". Nature. 526 (7571): 75–81. Bibcode:2015Natur.526...75.. doi:10.1038/nature15394. PMC 4617611. PMID 26432246.
  5. ^ "Variety of life". Nature News & Comment. 2015-09-30. Retrieved 2015-10-15.
  6. ^ "1000 Genomes Project | Scientific Computing and Data". Mount Sinai School of Medicine. 2020-07-07. Retrieved 2023-10-01.
  7. ^ Nielsen R (October 2010). "Genomics: In search of rare human variants". Nature. 467 (7319): 1050–1. Bibcode:2010Natur.467.1050N. doi:10.1038/4671050a. PMID 20981085.
  8. ^ JC Long, Human Genetic Variation: The mechanisms and results of microevolution, American Anthropological Association (2004)
  9. ^ Anzai T, Shiina T, Kimura N, Yanagiya K, Kohara S, Shigenari A, et al. (June 2003). "Comparative sequencing of human and chimpanzee MHC class I regions unveils insertions/deletions as the major path to genomic divergence". Proceedings of the National Academy of Sciences of the United States of America. 100 (13): 7708–13. Bibcode:2003PNAS..100.7708A. doi:10.1073/pnas.1230533100. PMC 164652. PMID 12799463.
  10. ^ Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, et al. (November 2006). "Global variation in copy number in the human genome". Nature. 444 (7118): 444–54. Bibcode:2006Natur.444..444R. doi:10.1038/nature05329. PMC 2669898. PMID 17122850.
  11. ^ Barreiro LB, Laval G, Quach H, Patin E, Quintana-Murci L (March 2008). "Natural selection has driven population differentiation in modern humans". Nature Genetics. 40 (3): 340–5. doi:10.1038/ng.78. PMID 18246066. S2CID 205357396.
  12. ^ EE Harris et al., The molecular signature of selection underlying human adaptations, Yearbook of Physical Anthropology 49: 89-130 (2006)
  13. ^ Bamshad M, Wooding SP (February 2003). "Signatures of natural selection in the human genome". Nature Reviews. Genetics. 4 (2): 99–111. doi:10.1038/nrg999. PMID 12560807. S2CID 13722452.
  14. ^ a b c d e f G Spencer, International Consortium Announces the 1000 Genomes Project, EMBARGOED (2008) http://www.nih.gov/news/health/jan2008/nhgri-22.htm
  15. ^ a b c d e f Meeting Report: A Workshop to Plan a Deep Catalog of Human Genetic Variation, (2007) http://www.1000genomes.org/sites/1000genomes.org/files/docs/1000Genomes-MeetingReport.pdf
  16. ^ Oleksyk TK, Brukhin V, O'Brien SJ (2015). "The Genome Russia project: closing the largest remaining omission on the world Genome map". GigaScience. 4: 53. doi:10.1186/s13742-015-0095-0. PMC 4644275. PMID 26568821.
  17. ^ C. King (2012) The Hottest Research of 2011. Science Watch http://archive.sciencewatch.com/newsletter/2012/201203/hottest_research_2012/
  18. ^ 1000 Genomes Project Community Analysis Meeting http://1000gconference.sph.umich.edu/
