Gene family

Phylogenetic tree of the Mup gene family

A gene family is a set of several similar genes, formed by duplication of a single original gene, and generally with similar biochemical functions. One such family are the genes for human hemoglobin subunits; the ten genes are in two clusters on different chromosomes, called the α-globin and β-globin loci. These two gene clusters are thought to have arisen as a result of a precursor gene being duplicated approximately 500 million years ago.[1]

Genes are categorized into families based on shared nucleotide or protein sequences. Phylogenetic techniques can be used as a more rigorous test. The positions of exons within the coding sequence can be used to infer common ancestry. Knowing the sequence of the protein encoded by a gene can allow researchers to apply methods that find similarities among protein sequences that provide more information than similarities or differences among DNA sequences.

If the genes of a gene family encode proteins, the term protein family is often used in an analogous manner to gene family.

The expansion or contraction of gene families along a specific lineage can be due to chance, or can be the result of natural selection.[2] To distinguish between these two cases is often difficult in practice. Recent work uses a combination of statistical models and algorithmic techniques to detect gene families that are under the effect of natural selection.[3]

The HUGO Gene Nomenclature Committee (HGNC) creates nomenclature schemes using a "stem" (or "root") symbol for members of a gene family (by homology or function), with a hierarchical numbering system to distinguish the individual members.[4][5] For example, for the peroxiredoxin family, PRDX is the root symbol, and the family members are PRDX1, PRDX2, PRDX3, PRDX4, PRDX5, and PRDX6.

Basic structure

Gene phylogeny as lines within grey species phylogeny. Top: An ancestral gene duplication produces two paralogs (histone H1.1 and 1.2). A speciation event produces orthologs in the two daughter species (human and chimpanzee). Bottom: in a separate species (E. coli), a gene has a similar function (histone-like nucleoid-structuring protein) but has a separate evolutionary origin and so is an analog.

One level of genome organization is the grouping of genes into several gene families.[6][7] Gene families are groups of related genes that share a common ancestor. Members of gene families may be paralogs or orthologs. Gene paralogs are genes with similar sequences from within the same species while gene orthologs are genes with similar sequences in different species. Gene families are highly variable in size, sequence diversity, and arrangement. Depending on the diversity and functions of the genes within the family, families can be classified as multigene families or superfamilies.[6][8]

Multigene families typically consist of members with similar sequences and functions, though a high degree of divergence (at the sequence and/or functional level) does not lead to the removal of a gene from a gene family. Individual genes in the family may be arranged close together on the same chromosome or dispersed throughout the genome on different chromosomes. Due to the similarity of their sequences and their overlapping functions, individual genes in the family often share regulatory control elements.[6][8] In some instances, gene members have identical (or nearly identical) sequences. Such families allow for massive amounts of gene product to be expressed in a short time as needed. Other families allow for similar but specific products to be expressed in different cell types or at different stages of an organism's development.[6]

Superfamilies are much larger than single multigene families. Superfamilies contain up to hundreds of genes, including multiple multigene families as well as single, individual gene members. The large number of members allows superfamilies to be widely dispersed with some genes clustered and some spread far apart. The genes are diverse in sequence and function displaying various levels of expression and separate regulation controls.[6][8]

Some gene families also contain pseudogenes, sequences of DNA that closely resemble established gene sequences but are non-functional.[9] Different types of pseudogenes exist. Non-processed pseudogenes are genes that acquired mutations over time becoming non-functional. Processed pseudogenes are genes that have lost their function after being moved around the genome by retrotransposition.[8][9] Pseudogenes that have become isolated from the gene family they originated in, are referred to as orphans.[6]

Formation

Gene families arose from multiple duplications of an ancestral gene, followed by mutation and divergence.[6] Duplications can occur within a lineage (e.g., humans might have two copies of a gene that is found only once in chimpanzees) or they are the result of speciation. For example, a single gene in the ancestor of humans and chimpanzees now occurs in both species and can be thought of as having been 'duplicated' via speciation. As a result of duplication by speciation, a gene family might include 15 genes, one copy in each of 15 different species.

Duplication

In the formation of gene families, four levels of duplication exist: 1) exon duplication and shuffling, 2) entire gene duplication, 3) multigene family duplication, and 4) whole genome duplication. Exon duplication and shuffling gives rise to variation and new genes. Genes are then duplicated to form multigene families which duplicate to form superfamilies spanning multiple chromosomes. Whole genome duplication doubles the number of copies of every gene and gene family.[6] Whole genome duplication or polyploidization can be either autopolyploidization or alloploidization. Autopolyploidization is the duplication of the same genome and allopolyploidization is the duplication of two closely related genomes or hybridized genomes from different species.[8]

Duplication occurs primarily through uneven crossing over events in meiosis of germ cells. (1,2) When two chromosomes misalign, crossing over - the exchange of gene alleles - results in one chromosome expanding or increasing in gene number and the other contracting or decreasing in gene number. The expansion of a gene cluster is the duplication of genes that leads to larger gene families.[6][8]

Relocation

Gene members of a multigene family or multigene families within superfamilies exist on different chromosomes due to relocation of those genes after duplication of the ancestral gene. Transposable elements play a role in the movement of genes. Transposable elements are recognized by inverted repeats at their 5' and 3' ends. When two transposable elements are close enough in the same region on a chromosome, they can form a composite transposon. The protein transposase recognizes the outermost inverted repeats, cutting the DNA segment. Any genes between the two transposable elements are relocated as the composite transposon jumps to a new area of the genome.[6]

Reverse transcription is another method of gene movement. An mRNA transcript of a gene is reversed transcribed, or copied, back into DNA. This new DNA copy of the mRNA is integrated into another part of the genome, resulting in gene family members being dispersed.[8]

A special type of multigene family is implicated in the movement of gene families and gene family members. LINE (Long INterspersed Elements) and SINE (Short INterspersed Elements) families are highly repetitive DNA sequences spread all throughout the genome. The LINEs contain a sequence that encodes a reverse transcriptase protein. This protein aids in copying the RNA transcripts of LINEs and SINEs back into DNA, and integrates them into different areas of the genome. This self-perpetuates the growth of LINE and SINE families. Due to the highly repetitive nature of these elements, LINEs and SINEs when close together also trigger unequal crossing over events which result in single-gene duplications and the formation of gene families.[6][8]

Divergence

Non-synonymous mutations resulting in the substitution of amino acids, increase in duplicate gene copies. Duplication gives rise to multiple copies of the same gene, giving a level of redundancy where mutations are tolerated. With one functioning copy of the gene, other copies are able to acquire mutations without being extremely detrimental to the organisms. Mutations allow duplicate genes to acquire new or different functions.[8]

Concerted evolution

Some multigene families are extremely homogenous, with individual genes members sharing identical or almost identical sequences. The process by which gene families maintain high homogeneity is Concerted evolution. Concerted evolution occurs through repeated cycles of unequal crossing over events and repeated cycles of gene transfer and conversion. Unequal crossing over leads to the expansion and contraction of gene families. Gene families have an optimal size range that natural selection acts towards. Contraction deletes divergent gene copies and keeps gene families from becoming too large. Expansion replaces lost gene copies and prevents gene families from becoming too small. Repeat cycles of gene transfer and conversion increasingly make gene family members more similar.[6]

In the process of gene transfer, allelic gene conversion is biased. Mutant alleles spreading in a gene family towards homogeneity is the same process of an advantageous allele spreading in a population towards fixation. Gene conversion also aids in creating genetic variation in some cases.[10]

Evolution

Gene families, part of a hierarchy of information storage in a genome, play a large role in the evolution and diversity of multicellular organisms. Gene families are large units of information and genetic variability.[6] Over evolutionary time, gene families have expanded and contracted with genes within a family duplicating and diversifying into new genes, and genes being lost. An entire gene family may also be lost, or gained through de novo gene birth, by such extensive divergence such that a gene is considered part of a new family, or by horizontal gene transfer. When the number of genes per genome remains relatively constant, this implies that genes are gained and lost at relatively same rates. There are some patterns in which genes are more likely to be lost vs. which are more likely to duplicate and diversify into multiple copies.[11]

An adaptive expansion of a single gene into many initially identical copies occurs when natural selection would favour additional gene copies. This is the case when an environmental stressor acts on a species. Gene amplification is more common in bacteria and is a reversible process. Contraction of gene families commonly results from accumulation of loss of function mutations. A nonsense mutation which prematurely halts gene transcription becomes fixed in the population, leading to the loss of genes. This process occurs when changes in the environment render a gene redundant.[7]

Functional family

In addition to classification by evolution (structural gene family), the HGNC also makes "gene families" by function in their stem nomenclature.[12] As a result, a stem can also refer to genes that have the same function, often part of the same protein complex. For example, BRCA1 and BRCA2 are unrelated genes that are both named for their role in breast cancer and RPS2 and RPS3 are unrelated ribosomal proteins found in the same small subunit.

The HGNC also maintains a "gene group" (formerly "gene family") classification. A gene can be a member of multiple groups, and all groups form a hierarchy. As with the stem classification, both structural and functional groups exist.[4][5]

See also

References

  1. ^ Nussbaum, Robert L.; McInnes, Roderick R.; Willard, Huntington F. (2016). Thompson & Thompson Genetics in Medicine (8th ed.). Philadelphia, PA: Elsevier. p. 25. ISBN 978-1-4377-0696-3.
  2. ^ Hartl, D.L. and Clark A.G. 2007. Principles of population genetics. Chapter 7, page 372.
  3. ^ Demuth, Jeffery P.; Bie, Tijl De; Stajich, Jason E.; Cristianini, Nello; Hahn, Matthew W.; Borevitz, Justin (20 December 2006). "The Evolution of Mammalian Gene Families". PLOS ONE. 1 (1): e85. Bibcode:2006PLoSO...1...85D. doi:10.1371/journal.pone.0000085. PMC 1762380. PMID 17183716.
  4. ^ a b Daugherty, LC; Seal, RL; Wright, MW; Bruford, EA (Jul 5, 2012). "Gene family matters: expanding the HGNC resource". Human Genomics. 6 (1): 4. doi:10.1186/1479-7364-6-4. PMC 3437568. PMID 23245209.
  5. ^ a b HGNC. "Gene group help". Retrieved 2020-10-13.
  6. ^ a b c d e f g h i j k l m Hartwell, Leland H.; et al. (2011). Genetics : from genes to genomes (4th ed.). New York: McGraw-Hill. ISBN 978-0-07-352526-6.
  7. ^ a b Demuth, JP; Hahn, MW (January 2009). "The life and death of gene families". BioEssays. 31 (1): 29–39. doi:10.1002/bies.080085. PMID 19153999. S2CID 9528185.
  8. ^ a b c d e f g h i Ohta, Tomoka (2008). "Gene families: multigene families and superfamilies". eLS. doi:10.1038/npg.els.0005126. ISBN 978-0-470-01590-2.
  9. ^ a b Nussbaum, Robert L; et al. (2015). Genetics in Medicine (8 ed.). Philadelphia: Elsevier. ISBN 978-1-4377-0696-3.
  10. ^ Ohta, T (30 September 2010). "Gene conversion and evolution of gene families: an overview". Genes. 1 (3): 349–56. doi:10.3390/genes1030349. PMC 3966226. PMID 24710091.
  11. ^ James, Jennifer E; Nelson, Paul G; Masel, Joanna (4 April 2023). "Differential Retention of Pfam Domains Contributes to Long-term Evolutionary Trends". Molecular Biology and Evolution. 40 (4): msad073. doi:10.1093/molbev/msad073. PMC 10089649. PMID 36947137.
  12. ^ "What is a stem symbol?". HGNC FAQ.

Read other articles:

U.S. Army general Maria BarrettOfficial portrait, 2022AllegianceUnited StatesService/branchUnited States ArmyYears of service1988–presentRankLieutenant GeneralCommands heldUnited States Army Cyber CommandArmy Network Enterprise Technology Command160th Signal BrigadeBattles/warsGulf WarWar in AfghanistanAwardsArmy Distinguished Service MedalDefense Superior Service MedalLegion of Merit (3)Bronze Star MedalRelationsMajor General Paula Lodi (sister) Maria Lodi Barrett is a United States A...

 

Об экономическом термине см. Первородный грех (экономика). ХристианствоБиблия Ветхий Завет Новый Завет Евангелие Десять заповедей Нагорная проповедь Апокрифы Бог, Троица Бог Отец Иисус Христос Святой Дух История христианства Апостолы Хронология христианства Ран�...

 

Battle of the American Civil War Battle of ChustenahlahPart of the Trans-Mississippi Theater of theAmerican Civil WarDateDecember 26, 1861 (1861-12-26)LocationOsage County, OklahomaResult Confederate victoryBelligerents CreeksSeminoles CSA (Confederacy)Commanders and leaders Opothleyahola Douglas H. CooperJames M. McIntoshStand WatieStrength 1,700 men 1,380 menCasualties and losses 250 killed and wounded 180 people captured 9 killed40 wounded vteOperations in the Indian Territory Round M...

George C. ContantBornApril 10, 1864Mankato, Minnesota, USDied1930 (aged 65–66)OccupationReformed outlawSpouse(s)Therese Landgraff [1887-1898]Alice Brown Warbinton (1913-1928)RelativesBrother John Sontag George C. Contant, aka George Sontag (April 10, 1864 – 1930), was an outlaw of the American West known mostly for train robberies. Like his older brother, John Sontag, he was originally from Mankato, Minnesota. Background Contant was the younger of two sons of Jacob Contant and the former ...

 

Former state electoral district of New South Wales, Australia Uralla-Walcha was an electoral district of the Legislative Assembly in the Australian state of New South Wales, including the towns of Uralla and Walcha. It was originally created in 1894, when multi-member districts were abolished,[1] and the three member district of New England was largely divided between Uralla-Walcha, Armidale and Bingara.[2][3] The district was abolished in 1904 as a result of the 1903 ...

 

2023 film by Shantanu Bagchi Mission MajnuOfficial release posterDirected byShantanu BagchiWritten byAseem ArroraSumit BathejaParveez ShaikhStory byParveez Shaikh Aseem ArroraProduced byRonnie ScrewvalaAmar ButalaGarima MehtaStarringSidharth MalhotraRashmika MandannaParmeet SethiSharib HashmiKumud MishraRajit KapurCinematographyBijitesh DeEdited byNitin BaidSiddharth S. PandeMusic byScore:Ketan SodhaSongs:Tanishk BagchiRochak KohliRaghav SacharArkoProductioncompaniesRSVP MoviesGuilty By Assoc...

 本表是動態列表,或許永遠不會完結。歡迎您參考可靠來源來查漏補缺。 潛伏於中華民國國軍中的中共間諜列表收錄根據公開資料來源,曾潛伏於中華民國國軍、被中國共產黨聲稱或承認,或者遭中華民國政府調查審判,為中華人民共和國和中國人民解放軍進行間諜行為的人物。以下列表以現今可查知時間為準,正確的間諜活動或洩漏機密時間可能早於或晚於以下所歸�...

 

King of Etruria from 1801 to 1803 Louis IPortrait by Goya, 1800King of EtruriaReign21 March 1801 – 27 May 1803PredecessorFerdinand III as Grand Duke of TuscanySuccessorLouis IIBorn(1773-07-05)5 July 1773Piacenza, Duchy of ParmaDied27 May 1803(1803-05-27) (aged 29)Florence, Kingdom of EtruriaBurialEl EscorialConsortMaria Luisa of SpainIssueCharles II, Duke of ParmaMaria Luisa Carlota, Hereditary Princess of SaxonyNamesItalian: Ludovico Francesco Filiberto di Borbone-ParmaSpanish: Luis F...

 

Stamp of the Romanian People's Republic commemorating the Grivița strike of 1933, provoked by poor laboral conditions as a result of the Great Depression in Romania. The Great Depression (Romanian: Marea Criză Economică or, rarely, Marea Depresie) of 1929–1933, which affected the whole world, had several consequences in the Kingdom of Romania. Romania had been among the winner countries of World War I. It received several new territories (Bessarabia, Bukovina and Transylvania), with man...

Astronomical object emitting X-rays X-rays start at ~0.008 nm and extend across the electromagnetic spectrum to ~8 nm, over which Earth's atmosphere is opaque. Astrophysical X-ray sources are astronomical objects with physical properties which result in the emission of X-rays. Several types of astrophysical objects emit X-rays. They include galaxy clusters, black holes in active galactic nuclei (AGN), galactic objects such as supernova remnants, stars, and binary stars containing a white dwar...

 

هذه المقالة يتيمة إذ تصل إليها مقالات أخرى قليلة جدًا. فضلًا، ساعد بإضافة وصلة إليها في مقالات متعلقة بها. (مايو 2022) ألفا، بيتا ، الإصدار المرشح - إنتاج البرمجيات باللغة السويدية إدارة الإصدار (مكتبة البنية التحتية لتقانة المعلومات) (بالإنجليزية: Release management (ITIL))‏ تتمحور إدار...

 

Questa voce sull'argomento stagioni delle società calcistiche italiane è solo un abbozzo. Contribuisci a migliorarla secondo le convenzioni di Wikipedia. Segui i suggerimenti del progetto di riferimento. Associazione Calcio IntraStagione 1933-1934Sport calcio Squadra Verbania Sportiva Prima Divisione6º posto nel girone C 1932-1933 1934-1935 Si invita a seguire il modello di voce Questa pagina raccoglie i dati riguardanti l'Associazione Calcio Intra nelle competizioni ufficiali d...

つのちょう 津野町 四国カルスト・天狗高原町役場位置 津野町旗 国 日本地方 四国地方都道府県 高知県郡 高岡郡市町村コード 39411-4法人番号 2000020394114 面積 197.85km2総人口 4,877人 [編集](推計人口、2024年9月1日)人口密度 24.6人/km2隣接自治体 須崎市、吾川郡仁淀川町、高岡郡中土佐町、佐川町、越知町、檮原町、四万十町愛媛県上浮穴郡久万高原町町の木 ヒノキ町�...

 

English composer For other people named Peter Philips, see Peter Philips (disambiguation). This article includes a list of references, related reading, or external links, but its sources remain unclear because it lacks inline citations. Please help improve this article by introducing more precise citations. (March 2018) (Learn how and when to remove this message) Peter PhilipsBorn1560Devonshire or London, Kingdom of EnglandDied1628Brussels, Spanish NetherlandsOccupation(s)Organist, virginals ...

 

1988 studio album by Bajaga i InstruktoriProdavnica tajniStudio album by Bajaga i InstruktoriReleased1988RecordedStudio M, Studio 1 Radio Novi Sad, Novi SadAutumn 1988Genre Pop rock folk rock LabelPGP-RTBProducerSaša HabićBajaga i Instruktori chronology Jahači magle(1986) Prodavnica tajni(1988) Neka svemir čuje nemir(1989) Professional ratingsReview scoresSourceRatingRitam[1]Muzika.hr (2010 reissue review)[2] Prodavnica tajni (trans. The Shop of Secrets) is the fou...

This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Château de Clagny – news · newspapers · books · scholar · JSTOR (May 2011) (Learn how and when to remove this message) 3D reconstruction: the Chateau seen from the entrance The Château de Clagny was a French country house that stood northeast of the Palace of...

 

Sono detti transuranici (dal latino « al di là dell'uranio ») gli elementi chimici con numero atomico maggiore di 92. Sono detti inoltre superpesanti alcuni elementi artificiali stabili con numero atomico superiore a 103, detti anche transattinoidi. Elementi transuranici Elemento Numero atomico nettunio 93 plutonio 94 americio 95 curio 96 berkelio 97 californio 98 einsteinio 99 fermio 100 mendelevio 101 nobelio 102 laurenzio 103 rutherfordio 104 dubnio 105 seaborgio 106 bohrio 10...

 

Railway track gauge Track gauge By transport mode Rapid transit Tram Miniature Scale model By size (list)   Minimum   Fifteen inch 381 mm (15 in)   Narrow   600 mm 600 mm (1 ft 11+5⁄8 in) Two foot 610 mm (2 ft) Two foot three inch 686 mm (2 ft 3 in)   750 mm 750 mm (2 ft 5+1⁄2 in) Bosnian gauge 760 mm (2 ft 5+15⁄16 in) Two foot six inch 762 mm (2 ft 6 in)   Swedish three foot 891 mm (2 ft 11+3⁄32 in) 900 mm 900 mm (2 ft 11+7⁄16 ...

Coat worn over street dress For other uses, see Overcoat (disambiguation). Overcoat (left) and topcoat (right) from The Gazette of Fashion, 1872 An overcoat is a type of long coat intended to be worn as the outermost garment, which usually extends below the knee. Overcoats are most commonly used in winter when warmth is more important. They are sometimes confused with or referred to as topcoats, which are shorter and end at or above the knees. Topcoats and overcoats together are known as oute...

 

Conservative ecologist political party in Germany You can help expand this article with text translated from the corresponding article in German. (August 2015) Click [show] for important translation instructions. View a machine-translated version of the German article. Machine translation, like DeepL or Google Translate, is a useful starting point for translations, but translators must revise errors as necessary and confirm that the translation is accurate, rather than simply copy-pastin...