UniGene

UniGene
Content
Descriptiontranscriptome
Contact
Research centerNCBI
Access
Websitehttps://www.ncbi.nlm.nih.gov/unigene

UniGene was a NCBI database of the transcriptome and thus, despite the name, not primarily a database for genes. Each entry is a set of transcripts that appear to stem from the same transcription locus (i.e. gene or expressed pseudogene). Information on protein similarities, gene expression, cDNA clones, and genomic location is included with each entry.

Descriptions of the UniGene transcript based and genome based build procedures are available.

A detailed description of UniGene database

The UniGene resource, developed at NCBI, clusters ESTs and other mRNA sequences, along with coding sequences (CDSs) annotated on genomic DNA, into subsets of related sequences. In most cases, each cluster is made up of sequences produced by a single gene, including alternatively spliced transcripts. However, some genes may be represented by more than one cluster. The clusters are organism specific and are currently available for human, mouse, rat, zebrafish, and cattle. They are built in several stages, using an automatic process based on special sequence comparison algorithms. First, the nucleotide sequences are searched for contaminants, such as mitochondrial, ribosomal, and vector sequence, repetitive elements, and low-complexity sequences. After a sequence is screened, it must contain at least 100 bases to be a candidate for entry into UniGene. mRNA and genomic DNA are clustered first into gene links. A second sequence comparison links ESTs to each other and to the gene links. At this stage, all clusters are ‘‘anchored,’’ and contain either a sequence with a polyadenylation site or two ESTs labeled as coming from the 3 end of a clone. Clone-based edges are added by linking the 5 and 3 ESTs that derive from the same clone. In some cases, this linking may merge clusters identified at a previous stage. Finally, unanchored ESTs and gene clusters of size 1 (which may represent rare transcripts) are compared with other UniGene clusters at lower stringency. The UniGene build is updated weekly, and the sequences that make up a cluster may change. Thus, it is not safe to refer to a UniGene cluster by its cluster identifier; instead, one should use the GenBank accession numbers of the sequences in the cluster.

As of July 2000, the human subset of UniGene contained 1.7 million sequences in 82,000 clusters; 98% of these clustered sequences were ESTs, and the remaining 2% were from mRNAs or CDSs annotated on genomic DNA. These human clusters could represent fragments of up to 82,000 unique human genes, implying that many human genes are now represented in a UniGene cluster. (This number is undoubtedly an overestimate of the number of genes in the human genome, as some genes may be represented by more than one cluster.) Only 1.4% of clusters totally lack ESTs, implying that most human genes are represented by at least one EST. Conversely, it appears that the majority of human genes have been identified only by ESTs; only 16% of clusters contain either an mRNA or a CDS annotated on a genomic DNA. Because fewer ESTs are available for mouse, rat, and zebrafish, the UniGene clusters are not as representative of the unique genes in the genome. Mouse UniGene contains 895,000 sequences in 88,000 clusters, and rat UniGene contains 170,000 sequences in 37,000 clusters.

A new UniGene resource, HomoloGene, includes curated and calculated orthologs and homologs for genes from human, mouse, rat, and zebrafish. Calculated orthologs and homologs are the result of nucleotide sequence comparisons between all UniGene clusters for each pair of organisms. Homologs are identified as the best match between a UniGene cluster in one organism and a cluster in a second organism. When two sequences in different organisms are best matches to one another (a reciprocal best match), the UniGene clusters corresponding to the pair of sequences are considered putative orthologs. A special symbol indicates that UniGene clusters in three or more organisms share a mutually consistent ortholog relationship. The calculated orthologs and homologs are considered putative, since they are based only on sequence comparisons. Curated orthologs are provided by the Mouse Genome Database (MGD) at the Jackson Laboratory and the Zebrafish Information Database (ZFIN) at the University of Oregon and can also be obtained from the scientific literature. Queries to UniGene are entered into a text box on any of the UniGene pages. Query terms can be, for example, the UniGene identifier, a gene name, a text term that is found somewhere in the UniGene record, or the accession number of an EST or gene sequence in the cluster. For example, the cluster entitled ‘‘A disintegrin and metalloprotease domain 10’’ that contains the sequence for human ADAM10 can be retrieved by entering ADAM10, disintegrin, AF009615 (the GenBank accession number of ADAM10), or H69859 (the GenBank accession number of an EST in the cluster). To query a specific part of the UniGene record, use the @ symbol. For example, @gene(symbol) looks for genes with the name of the symbol enclosed in the parentheses, @chr(num) searches for entries that map to chromosome num, @lib(id) returns entries in a cDNA library identified by id, and @pid(id) se- lects entries associated with a GenBank protein identifier id.

The query results page contains a list of all UniGene clusters that match the query. Each cluster is identified by an identifier, a description, and a gene symbol, if available. Cluster identifiers are prefixed with Hs for Homo sapiens, Rn for Rattus norvegicus, Mm for Mus musculus, or Dn for Danio rerio. The descriptions of UniGene clusters are taken from LocusLink, if available, or from the title of a sequence in the cluster. The UniGene report page for each cluster links to data from other NCBI resources (Fig. 12.5). At the top of the page are links to LocusLink, which provides descriptive information about genetic loci (Pruitt et al., 2000), OMIM, a catalog of human genes and genetic disorders, and HomoloGene. Next are listed similarities between the translations of DNA sequences in the cluster and protein sequences from model organisms, including human, mouse, rat, fruit fly, and worm. The subsequent section describes relevant mapping information. It is followed by ‘‘expression information,’’ which lists the tissues from which the ESTs in the cluster have been created, along with links to the SAGE database. Sequences making up the cluster are listed next, along with a link to download these sequences.

It is important to note that clusters that contain ESTs only (i.e., no mRNAs or annotated CDSs) will be missing some of these fields, such as LocusLink, OMIM, and mRNA/Gene links. UniGene titles for such clusters, such as ‘‘EST, weakly similar to ORF2 contains a reverse transcriptase domain [H. sapiens],’’ are derived from the title of a characterized protein with which the translated EST sequence aligns. The cluster title might be as simple as ‘‘EST’’ if the ESTs share no significant similarity with characterized proteins.[1]

Retirement of UniGene

On February 1, 2019, the NCBI announced that it was retiring the UniGene database because "reference genomes are available for most organisms with a sizable research community. Consequently, the usage of and need for UniGene has dropped significantly."[2] Access to the UniGene builds will remain available through FTP.

  • NCBI Gene database NCBI database cataloging individual genes
  • HomoloGene NCBI database which stores groups of homologous genes from different organisms

See also

References

  1. ^ Andreas D. Baxevanis and B. F. Francis Ouellette|BIOINFORMATICS A Practical Guide to the Analysis of Genes and Proteins(2001 2nd edition)||JOHN WILEY & SONS, INC.|ISBN 0-471-38391-0|ISBN 978-0-471-38391-8 |
  2. ^ "NCBI to Retire the UniGene". February 2019. Retrieved 12 February 2019.

Read other articles:

Nan Sandar Hla HtunNan Sandar Hla Htun pada 2019 dalam acara Vivo.Nama asalနန်းစန္ဒာလှထွန်းLahirNan Sandar Hla Htun22 Juni 1993 (umur 30)Mongpawn, Shan State, MyanmarPendidikanUniversitas TaunggyiPekerjaanPemeran, peraga busana, mantan ratu kecantikanTahun aktif2012–kiniTinggi165 cm (5 ft 5 in) Nan Sandar Hla Htun (bahasa Burma: နန်းစန္ဒာလှထွန်း; juga disebut Nan Sandar Hla Tun atau မနန�...

 

Ath-Thufail bin al-Haritsradhiyallahu anhuNama asalالطفيل بن الحارثLahir586Meninggal653KebangsaanSuku QuraisyKabilah Bani MuththalibAnakAmirOrang tuaAl-Harits bin al-Muththalib (ayah)Sukhailah binti Khuza'i (ibu) Ath-Thufail bin al-Harits (Arab: الطفيل بن الحارثcode: ar is deprecated , lahir tahun 38 sebelum hijrah (586) - wafat 32 H (653, usia 67)) adalah sahabat Nabi Muhammad dari kaum muhajirin.[1] Ath-Thufail bersama dua saudara kandungnya, Ubaida...

 

La bibliografia (termine mutuato dal greco βιβλιογραφία[1], composto di βιβλίον, biblìon, libro, e γράφω, gràpho, io scrivo, che però aveva il significato di (tra)scrizione di libri, diverso da quello moderno di libro sui libri[2]) enumerativa (o sistematica) si può intendere: l'elenco di libri, saggi, riviste, articoli su un particolare argomento o su uno specifico autore; l'elenco di pubblicazioni usate e citate nella stesura specialmente di un sa...

صندوق النقد الدولي   الاختصار (بالإسبانية: FMI)‏،  و(بالأستورية: FMI)‏،  و(بالأراغونية: FMI)‏،  و(بالكتالونية: FMI)‏،  و(بالإيطالية: FMI)‏،  و(بالرومانية: FMI)‏،  و(بالرومانشية: FMI)‏،  و(بالبرتغالية: FMI)‏،  و(بالإسبرانتو: IMF)‏،  و(بالألمانية: IMA)‏،  و(بالأفريق...

 

Komando Resor Militer 181/Praja Vira TamaLambang Korem 181/PVTDibentuk17 Mei 1963Negara IndonesiaCabangTNI Angkatan DaratTipe unitKorem Tipe APeranSatuan TeritorialBagian dariKodam XVIII/KSRMarkasSorong, Papua Barat DayaPelindungTentara Nasional IndonesiaMotoPraja Vira TamaBaret H I J A U MaskotKakatua RajaUlang tahun17 MeiPertempuranOperasi TrikoraOperasi NemangkawiTokohKomandanBrigadir Jenderal TNI Totok Sutriono, S.Sos.M.MKepala StafKolonel Inf. Christian Pieter Sipahelut Ko...

 

Pensil warna Pensil warna adalah media seni yang dibuat dari inti berpigmen kecil yang terbungkus dalam cangkang silinder kayu seperti halnya pensil. Namun, berbeda dengan pensil grafit dan pensil arang, inti pensil warna berbahan dasar lilin atau minyak dan mengandung berbagai proporsi pigmen, zat tambahan, dan bahan pengikat.[1] Pensil warna yang larut dalam air (pensil cat air), pensil pastel, serta inti berwarna untuk pensil mekanik juga tersedia di pasaran. Pensil warna dibuat da...

Ecuadorian television network Television channel RTSCountryEcuadorHeadquartersGuayaquilProgrammingPicture format1080i HDTVOwnershipOwnerAlbavisión (Telecuatro Guayaquil C.A.)LinksWebsitewww.rts.com.ecAvailabilityTerrestrialDigital VHFChannel 4.1/4.2 (Guayaquil) RedTeleSistema (RTS), is a private television station in Ecuador. The channel is owned by Albavisión. The channel is the oldest television station to operate in Ecuador since its inception, HCJB-TV signed on in Quito in 1959 but shut...

 

Bagian dari seri artikel mengenaiSejarah Jepang PeriodePaleolitiksebelum 14.000 SMJōmon14.000–300 SMYayoi300 SM – 250 MKofun250–538Asuka538–710Nara710–794Heian794–1185Kamakura1185–1333Restorasi Kemmu1333–1336Muromachi (Ashikaga) Nanboku-chōSengoku 1336–1573Azuchi–Momoyama Perdagangan dengan Nanban 1568–1603Edo (Tokugawa) SakokuPersetujuan KanagawaBakumatsu 1603–1868Meiji Perang BoshinRestorasiPerang Sino-Jepang PertamaPemberontakan BoxerPerang Rusia-Jepang 1868–191...

 

Local museum in Hertford, Hertfordshire, England View of Hertford Museum. Hertford Museum is a local museum in Hertford, the county town of Hertfordshire, England.[1] The museum first opened in 1903 and is located in a 17th-century town house with a Jacobean-style knot garden.[1] The galleries on the ground floor present the early history of the museum. Objects include exotic animals, fossils, and Japanese armour. The first floor presents the town and people of Hertford. The c...

Badan Pendidikan dan Pelatihan Keuangan Kementerian Keuangan Republik IndonesiaGambaran umumDibentuk1974Bidang tugasmelaksanakan pendidikan dan pelatihan di bidang keuangan negaraPegawai1316 orang[1]Susunan organisasiKepalaAndin HadiyantoSitus webhttps://bppk.kemenkeu.go.id Badan Pendidikan dan Pelatihan Keuangan (BPPK) adalah unit Eselon 1 yang bertanggungjawab dalam pengembangan SDM pengelola keuangan dan kekayaan negara melalui penyelenggaraan pendidikan dan pelatihan. Untuk m...

 

Pour les articles homonymes, voir ADD. Aéroport d'Addis-Abeba Boleአዲስ አበባ ቦሌ ዓለም አቀፍ አውሮፕላን ማረፊያ Vue du terminal principal Localisation Pays Éthiopie Ville Addis-Abeba Coordonnées 8° 58′ 40″ nord, 38° 47′ 58″ est Altitude 2 334 m (7 656 ft) Informations aéronautiques Code IATA ADD Code OACI HAAB Type d'aéroport Civil Pistes Direction Longueur Surface 07/25 3 800 m (12 467 ft) Asph...

 

2000 single by Ricky Martin Not to be confused with She Bangs the Drums. She BangsSingle by Ricky Martinfrom the album Sound Loaded B-side Por Arriba, Por Abajo Amor ReleasedSeptember 22, 2000 (2000-09-22)Studio Sony Music, New York City Hit Factory Criteria & Gentleman's Club, Miami WallyWorld & Capitol, Hollywood Aireborne, Indianapolis Quad Recordings, Nashville GenreLatin popdance-popsalsaLength 4:42 (album version) 4:06 (radio edit) LabelColumbiaSongwriter(s) Desmo...

2020年夏季奥林匹克运动会波兰代表團波兰国旗IOC編碼POLNOC波蘭奧林匹克委員會網站olimpijski.pl(英文)(波兰文)2020年夏季奥林匹克运动会(東京)2021年7月23日至8月8日(受2019冠状病毒病疫情影响推迟,但仍保留原定名称)運動員206參賽項目24个大项旗手开幕式:帕维尔·科热尼奥夫斯基(游泳)和马娅·沃什乔夫斯卡(自行车)[1]闭幕式:卡罗利娜·纳亚(皮划艇)&#...

 

Bangladeshi public service broadcaster Bangladesh Sangbad Sangstha (BSS)Native nameবাংলাদেশ সংবাদ সংস্থা (বাসস)Company typeNational News AgencyIndustryNews agencyFoundedJanuary 1972HeadquartersDhaka, BangladeshKey peopleAbul Kalam Azad (MD and CEO)Websitebssnews.net Bangladesh Sangbad Sangstha (BSS) is the news agency of Bangladesh. BSS was established on 1 January, 1972 by the Government of Bangladesh soon after the Liberation War.[1] Abu...

 

Cereseto komune di Italia Tempat Negara berdaulatItaliaDaerah di ItaliaPiemonteProvinsi di ItaliaProvinsi Alessandria NegaraItalia Ibu kotaCereseto PendudukTotal389  (2023 )GeografiLuas wilayah10,44 km² [convert: unit tak dikenal]Ketinggian280 m Berbatasan denganMoncalvo Ozzano Monferrato Pontestura Ponzano Monferrato Sala Monferrato Serralunga di Crea Treville Ottiglio SejarahHari liburpatronal festival Informasi tambahanKode pos15020 Zona waktuUTC+1 UTC+2 Kode telepon0142 ID ISTA...

Features from accelerated segment test (FAST) is a corner detection method, which could be used to extract feature points and later used to track and map objects in many computer vision tasks. The FAST corner detector was originally developed by Edward Rosten and Tom Drummond, and was published in 2006.[1] The most promising advantage of the FAST corner detector is its computational efficiency. Referring to its name, it is indeed faster than many other well-known feature extraction me...

 

Azmi BisharaLahir22 Juli 1956 (umur 67)Tempat lahirNazaret, IsraelKnesset14, 15, 16, 17 Azmi Bishara (dengarkanⓘ bahasa Arab: عزمي بشارة dengarkanⓘ, Ibrani: עזמי בשארה dengarkanⓘ, lahir 22 Juli 1956 di Nazaret, Israel), mantan anggota Knesset (parlemen Israel), adalah seorang intelektual, akademisi, politikus, dan penulis asal Palestina.[1] Pada tahun 2007, Bishara mengungsi dari Israel dan mengundurkan diri dari Knesset setelah ditanyai oleh polis...

 

Ethnic group of South America Chaco (tribe) redirects here. For the people of Chaco Canyon, New Mexico, see Chaco Culture National Historical Park § Ancestral Puebloans. Ethnic group Gran Chaco peopleArea of the Gran ChacoTotal population300,000 (est. 2010)Regions with significant populationsArgentina, Brazil, Bolivia, ParaguayLanguagesSee textReligiontraditional tribal religion, Catholicism, Protestantism, atheism The indigenous Gran Chaco people consist of approximately thirty-five tr...

Santa Barbara International Film Festival Premio a Premios a la excelencia en logros cinematográficos.Ubicación Santa BárbaraEstados UnidosHistoriaPrimera entrega 1990Sitio web oficial[editar datos en Wikidata] El Festival Internacional de Cine de Santa Bárbara (en inglés Santa Bárbara International Film Festival o, abreviadamente, SBIFF) es una organización no lucrativa dedicada a exhibir cine independiente estadounidense e internacional, para premiar a los cineastas indepen...

 

Native America tribe in southwest Oklahoma Ethnic group Apache Tribe of OklahomaPlains ApacheNá'ishą[1]Vanessa Jennings, a Plains Apache/Kiowa/Gila River Pima artist and traditionalistTotal population2,263Regions with significant populationsUnited States (Oklahoma)LanguagesEnglish, formerly Plains Apache languageReligionIndigenous religion, Native American Church, ChristianityRelated ethnic groupsfellow Apache, Navajo, and Tsuutʼina[1] The Plains Apache are a small Southern...