Sequence logo

A sequence logo showing the most conserved bases around the initiation codon from all human mRNAs (Kozak consensus sequence). Note that the initiation codon is not drawn to scale, otherwise the letters AUG would each have a height of 2 bits.

In bioinformatics, a sequence logo is a graphical representation of the sequence conservation of nucleotides (in a strand of DNA/RNA) or amino acids (in protein sequences).[1] A sequence logo is created from a collection of aligned sequences and depicts the consensus sequence and diversity of the sequences. Sequence logos are frequently used to depict sequence characteristics such as protein-binding sites in DNA or functional units in proteins.

Overview

A sequence logo consists of a stack of letters at each position. The relative sizes of the letters indicate their frequency in the sequences. The total height of the letters depicts the information content of the position, in bits.

Logo creation

To create sequence logos, related DNA, RNA or protein sequences, or DNA sequences that have common conserved binding sites, are aligned so that the most conserved parts create good alignments. A sequence logo can then be created from the conserved multiple sequence alignment. The sequence logo will show how well residues are conserved at each position: the higher the number of residues, the higher the letters will be, because the better the conservation is at that position. Different residues at the same position are scaled according to their frequency. The height of the entire stack of residues is the information measured in bits. Sequence logos can be used to represent conserved DNA binding sites, where transcription factors bind.

The information content (y-axis) of position is given by:[2]

for amino acids,
for nucleic acids,

where is the uncertainty (sometimes called the Shannon entropy) of position

Here, is the relative frequency of base or amino acid at position , and is the small-sample correction for an alignment of letters.[2][3] The height of letter in column is given by

The approximation for the small-sample correction, , is given by:

where is 4 for nucleotides, 20 for amino acids, and is the number of sequences in the alignment.

A consensus logo is a simplified variation of a sequence logo that can be embedded in text format. Like a sequence logo, a consensus logo is created from a collection of aligned protein or DNA/RNA sequences and conveys information about the conservation of each position of a sequence motif or sequence alignment[1][4] . However, a consensus logo displays only conservation information, and not explicitly the frequency information of each nucleotide or amino acid at each position. Instead of a stack made of several characters, denoting the relative frequency of each character, the consensus logo depicts the degree of conservation of each position using the height of the consensus character at that position.

A sequence logo for the LexA-binding motif of several Gram-positive species.
A consensus logo for the LexA-binding motif of several Gram-positive species.

Advantages and drawbacks

The main, and obvious, advantage of consensus logos over sequence logos is their ability to be embedded as text in any Rich Text Format supporting editor/viewer and, therefore, in scientific manuscripts. As described above, the consensus logo is a cross between sequence logos and consensus sequences. As a result, compared to a sequence logo, the consensus logo omits information (the relative contribution of each character to the conservation of that position in the motif/alignment). Hence, a sequence logo should be used preferentially whenever possible. That being said, the need to include graphic figures in order to display sequence logos has perpetuated the use of consensus sequences in scientific manuscripts, even though they fail to convey information on both conservation and frequency.[5] Consensus logos represent therefore an improvement over consensus sequences whenever motif/alignment information has to be constrained to text.

Extensions

Hidden Markov models (HMMs) not only consider the information content of aligned positions in an alignment, but also of insertions and deletions. In an HMM sequence logo used by Pfam, three rows are added to indicate the frequencies of occupancy (presence) and insertion, as well as the expected insertion length.[6]

A sequence logo for TALE-likes. Note the reduced occupancy (blue) at the position one and the occasional insertion at position 19 (red).

See also

References

  1. ^ a b Schneider TD; Stephens RM (1990). "Sequence Logos: A New Way to Display Consensus Sequences". Nucleic Acids Res. 18 (20): 6097–6100. doi:10.1093/nar/18.20.6097. PMC 332411. PMID 2172928.
  2. ^ a b Schneider TD; Stormo GD (1986). "Information content of binding sites on nucleotide sequences" (PDF). Journal of Molecular Biology. 188 (3): 415–431. doi:10.1016/0022-2836(86)90165-8. PMID 3525846.
  3. ^ Basharin GP (1959). "On a statistical estimate for the entropy of a sequence of independent random variables". Theory of Probability and Its Applications. 4 (3): 333–336. doi:10.1137/1104033.
  4. ^ Anzaldi LJ; Muñoz-Fernández D; Erill I. (2012). "BioWord: a sequence manipulation suite for Microsoft Word". BMC Bioinformatics. 13 (124): 124. doi:10.1186/1471-2105-13-124. PMC 3546851. PMID 22676326.
  5. ^ Schneider TD (2002). "Consensus Sequence Zen". Appl Bioinform. 1 (3): 111–119. PMC 1852464. PMID 15130839.
  6. ^ Wheeler, Travis J; Clements, Jody; Finn, Robert D (13 January 2014). "Skylign: a tool for creating informative, interactive logos representing sequence alignments and profile hidden Markov models". BMC Bioinformatics. 15 (1): 7. doi:10.1186/1471-2105-15-7. PMC 3893531. PMID 24410852.

Read other articles:

Arthur Ernest PercivalArthur Ernest Percival semasa menjadi Komandan Umum Gerakan Tanah Melayu Desember 1941PengabdianBritania RayaLama dinas1914 – 1946PangkatLetnan JenderalKomandanGeneral Officer Commanding Malaya BritaniaPerang/pertempuranPerang Dunia I Pertempuran Somme Serangan Musim Semi Perang Saudara Rusia Perang Anglo-Irlandia Perang Dunia II Pertempuran Malaya Pertempuran Singapura PenghargaanKompanyon Ordo BathOrdo Pengabdian TerhormatOrdo Imperium BritaniaMilitary CrossCroi...

 

 

Artikel ini tidak memiliki referensi atau sumber tepercaya sehingga isinya tidak bisa dipastikan. Tolong bantu perbaiki artikel ini dengan menambahkan referensi yang layak. Tulisan tanpa sumber dapat dipertanyakan dan dihapus sewaktu-waktu.Cari sumber: Kurun – berita · surat kabar · buku · cendekiawan · JSTOR Kurun dalam penggunaan umum adalah suatu periode waktu sembarang yang ditentukan oleh manusia. Para ahli geologi menggunakan kurun sebagai subdiv...

 

 

Kim Tae-heePotret Kim Tae-hee, 2022Lahir29 Maret 1980 (umur 44)Busan, Korea SelatanAlmamaterSeoul National University (B.A. Fashion Design)PekerjaanAktrisTahun aktif2001 (2001)–sekarangAgenBS Company (Korea)[1] Sweet Power (Jepang)Suami/istriRain ​(m. 2017)​Anak2KerabatLee Wan (adik laki-laki)[2]Nama KoreaHangul김태희 Hanja金泰希 Alih AksaraGim Tae-huiMcCune–ReischauerKim T'ae-hŭi Tanda tangan Kim Tae-hee (Hangul:...

You can help expand this article with text translated from the corresponding article in Turkish. (January 2022) Click [show] for important translation instructions. View a machine-translated version of the Turkish article. Machine translation, like DeepL or Google Translate, is a useful starting point for translations, but translators must revise errors as necessary and confirm that the translation is accurate, rather than simply copy-pasting machine-translated text into the English Wiki...

 

 

Credit Company in Japan You can help expand this article with text translated from the corresponding article in Japanese. (November 2023) Click [show] for important translation instructions. Machine translation, like DeepL or Google Translate, is a useful starting point for translations, but translators must revise errors as necessary and confirm that the translation is accurate, rather than simply copy-pasting machine-translated text into the English Wikipedia. Consider adding a topic t...

 

 

This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages) Some of this article's listed sources may not be reliable. Please help improve this article by looking for better, more reliable sources. Unreliable citations may be challenged and removed. (December 2021) (Learn how and when to remove this message) The neutrality of this article is disputed. Relevant discussion may be found on the talk page...

土库曼斯坦总统土库曼斯坦国徽土库曼斯坦总统旗現任谢尔达尔·别尔德穆哈梅多夫自2022年3月19日官邸阿什哈巴德总统府(Oguzkhan Presidential Palace)機關所在地阿什哈巴德任命者直接选举任期7年,可连选连任首任萨帕尔穆拉特·尼亚佐夫设立1991年10月27日 土库曼斯坦土库曼斯坦政府与政治 国家政府 土库曼斯坦宪法 国旗 国徽 国歌 立法機關(英语:National Council of Turkmenistan) ...

 

 

Railroad station in Bryan, Ohio, US Bryan, OHBryan station in January 2019General informationLocationPaige and Lynn StreetBryan, OhioUnited StatesCoordinates41°28′49″N 84°33′06″W / 41.4803°N 84.5517°W / 41.4803; -84.5517Owned byAmtrak, City of Bryan, Norfolk Southern RailwayPlatforms1 side platformTracks3ConstructionParkingYesAccessibleYesOther informationStation codeAmtrak: BYNHistoryOpened1980PassengersFY 20224,262[1] (Amtrak) Servic...

 

 

Військово-музичне управління Збройних сил України Тип військове формуванняЗасновано 1992Країна  Україна Емблема управління Військово-музичне управління Збройних сил України — структурний підрозділ Генерального штабу Збройних сил України призначений для планува...

فرانك أتشيمبونغ (بالإنجليزية: Frank Acheampong)‏  معلومات شخصية الاسم الكامل فرانك أوبوكو أتشيمبونغ الميلاد 16 أكتوبر 1993 (العمر 30 سنة)غانا الطول 1.69 م (5 قدم 6 1⁄2 بوصة) مركز اللعب مهاجم الجنسية غانا  معلومات النادي النادي الحالي شنجن الرقم 7 مسيرة الشباب سنوات فريق 20...

 

 

2007 single by Billy JoelAll My LifeSingle by Billy JoelB-sideYou're My HomeReleasedFebruary 14, 2007 (2007-02-14)RecordedDecember 29, 2006StudioLegacy StudiosGenreTraditional popLength5:19LabelSonySongwriter(s)Billy JoelProducer(s)Phil RamoneBilly Joel singles chronology Hey Girl (1997) All My Life (2007) Christmas in Fallujah (2007) All My Life is a song by Billy Joel, his first new song of original material with lyrics he had written since 1993's River of Dreams. The song, p...

 

 

This is a list of provincial parks in Southwestern Ontario. These provincial parks are maintained by Ontario Parks. For a list of other provincial parks in Ontario, see the List of provincial parks in Ontario. Bruce County Name Established Commons category Picture Coordinates Black Creek Provincial Park 1989 44°58′24″N 81°21′49″W / 44.973333333333°N 81.363611111111°W / 44.973333333333; -81.363611111111 Cabot Head Provincial Nature Reserve 1985 45°12′35�...

Voce principale: Novara Calcio. Questa voce sull'argomento stagioni delle società calcistiche italiane è solo un abbozzo. Contribuisci a migliorarla secondo le convenzioni di Wikipedia. Segui i suggerimenti del progetto di riferimento. Associazione Calcio NovaraStagione 1934-1935Sport calcio Squadra Novara Allenatore Árpád Weisz Presidente Guido Beldì Serie B2º posto nel girone A. Maggiori presenzeCampionato: Bercellino (29) Miglior marcatoreCampionato: Romano (30) StadioStad...

 

 

Peta menunjukkan lokasi Sallapadan. Sallapadan adalah munisipalitas yang terletak di provinsi Abra, Filipina. Pada tahun 2011, munisipalitas ini memiliki populasi sebesar 6.584 jiwa atau 1.290 rumah tangga.[1] Pembagian wilayah Sallapadan terbagi menjadi 9 barangay, yaitu: Barangay Penduduk (2007) Bazar 590 Bilabila 535 Gangal (Pob.) 909 Maguyepyep 1,251 Naguilian 721 Saccaang 716 Sallapadan Barrio 303 Subusob 824 Ud-udiao 521 Referensi ^ Local Governance Performance Management System...

 

 

  关于續作,請見「超級槍彈辯駁2 再會了絕望學園」。   关于外傳,請見「絕對絕望少女 槍彈辯駁Another Episode」。 弹丸论破 希望學園與絕望高中生 ダンガンロンパ 希望の学園と絶望の高校生 Danganronpa: Trigger Happy Havoc 假名 だんがんろんぱ きぼうのがくえんとぜつぼうのこうこうせい 罗马字 Dangan-Ronpa: Kibō no Gakuen to Zetsubō no Kōkōsei 類型 高速推理動作遊戲&...

State of North Dakota Staat van de Verenigde Staten (Details) Coördinaten 47°30'NB, 100°30'WL Algemeen Oppervlakte 183.272 km² (2,4% water) Inwoners 683.932 (3,83 inw./km²) Hoofdstad Bismarck Politiek Gouverneur Doug Burgum (R)(sinds 2016) Overig Tijdzone −6 / −7 Toegetreden 2 november 1889 Bijnaam Peace Garden State Website nd.gov Detailkaart Portaal    Verenigde Staten North Dakota (Nederlands: Noord-Dakota) is een van de staten van de Verenigde Staten. De standaard...

 

 

German-American architect (1883–1969) Walter GropiusPortrait by Louis Held, c. 1919BornWalter Adolph Georg Gropius(1883-05-18)18 May 1883Berlin, Kingdom of Prussia, German EmpireDied5 July 1969(1969-07-05) (aged 86)Boston, Massachusetts, U.S.OccupationArchitectSpouses Alma Mahler ​ ​(m. 1915; div. 1920)​ Ise Gropius ​(m. 1923)​ Children2, including ManonAwards AIA Gold Medal (1959) Albert Medal (196...

 

 

French polymath (1588–1648) The ReverendMarin MersenneOMBorn(1588-09-08)8 September 1588Oizé, Kingdom of FranceDied1 September 1648(1648-09-01) (aged 59)Paris, Kingdom of FranceOther namesMarinus MersennusKnown forMersenne primesMersenne's conjectureMersenne's lawsAcousticsScientific careerFieldsMathematics, physics Marin Mersenne, OM (also known as Marinus Mersennus or le Père Mersenne; French: [maʁɛ̃ mɛʁsɛn]; 8 September 1588 – 1 September 1648) was a Fren...

Holy Roman Emperor from 1619 to 1637 Ferdinand IIPortrait of the Archduke, c. 1614Holy Roman Emperor (more...) Reign28 August 1619 – 15 February 1637Coronation9 September 1619 Frankfurt CathedralPredecessorMatthiasSuccessorFerdinand IIIBorn9 July 1578 (NS: (1578-07-19)19 July 1578)Graz, Duchy of Styria, Holy Roman EmpireDied15 February 1637(1637-02-15) (aged 58)Vienna, Archduchy of Austria, Holy Roman EmpireBurial Mausoleum in Graz (body) Augustinian Church (heart) Spouses Maria ...

 

 

  هذه المقالة عن جزيرة بافن. لمعانٍ أخرى، طالع خليج بافن. بافن (جزيرة)   أصل التسمية وليام بافين  تاريخ الاكتشاف 1576  معلومات جغرافية   المنطقة أرخبيل القطب الشمالي الكندي  الإحداثيات 69°N 72°W / 69°N 72°W / 69; -72   [1] [2] الأرخبيل أرخبيل القطب ال...