An intron is any nucleotide sequence within a gene that is removed by RNA splicing to produce the final RNA product of the gene.[1][2] The term intron refers both to the DNA sequence within a gene and to the corresponding sequence in RNA transcripts.[3]
The sequences which are joined together in the final RNA after RNA splicing are exons. In genes that make proteins, they code for the amino acids in the final polypeptide.
Introns are in the genes of most organisms and many viruses. They can be in a wide range of genes, including those that generate proteins, ribosomal RNA (rRNA), and transfer RNA (tRNA). RNA splicing takes place after transcription and before translation.
Introns: parts of a gene which are cut out of the RNA and discarded; the non-coding bits.
Exons: parts of a gene which are expressed; the bits which code for amino-acid sequences in a protein.
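The removal of introns and joining of exons can be sketched in a few lines of Python. This is only a toy illustration: the sequence, the intron positions and the function name `splice` are invented for the example, and a real cell finds splice sites with molecular machinery, not with a list of coordinates.

```python
def splice(pre_rna, introns):
    """Remove the given introns and join the remaining exons.

    pre_rna: the unspliced transcript as a string.
    introns: list of (start, end) positions, 0-based, end not included.
    """
    exons = []
    position = 0
    for start, end in sorted(introns):
        if start < position or end > len(pre_rna):
            raise ValueError("introns must not overlap or run past the transcript")
        exons.append(pre_rna[position:start])  # keep the exon before this intron
        position = end                         # skip over the intron
    exons.append(pre_rna[position:])           # keep the last exon
    return "".join(exons)


# Made-up transcript: exons in upper case, introns in lower case.
pre_rna = "AUGGCC" + "guaaguuuuag" + "UUUGGA" + "guccccag" + "UAA"
introns = [(6, 17), (23, 31)]

mature_rna = splice(pre_rna, introns)
print(mature_rna)
```

In this example only the upper-case exon letters remain in `mature_rna`; the lower-case intron letters are thrown away.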
There are many unanswered questions about introns. It is unclear whether introns serve some specific function, or whether they are selfish DNA which reproduces itself as a parasite.[5]
Recent studies of entire eukaryotic genomes have shown that the length and density (introns per gene) of introns vary considerably between related species. There are four or five different kinds of intron. Some introns represent mobile genetic elements (transposons).
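Intron density and average intron length are simple to work out once the introns of each gene are known. The sketch below assumes the data is given as a dictionary of gene names and intron lengths; the two "species", their genes and all the numbers are made up for illustration and are not real measurements.

```python
def intron_stats(genes):
    """Return (introns per gene, mean intron length) for a set of genes.

    genes: dict mapping a gene name to a list of its intron lengths
           in nucleotides (an empty list means the gene has no introns).
    """
    if not genes:
        raise ValueError("at least one gene is needed")
    lengths = [n for intron_lengths in genes.values() for n in intron_lengths]
    density = len(lengths) / len(genes)
    mean_length = sum(lengths) / len(lengths) if lengths else 0.0
    return density, mean_length


# Hypothetical data for two related species.
species_a = {"gene1": [120, 85], "gene2": [300], "gene3": []}
species_b = {"gene1": [1500, 2300, 900], "gene2": [4100, 700], "gene3": [1200]}

density_a, mean_a = intron_stats(species_a)
density_b, mean_b = intron_stats(species_b)
print(density_a, round(mean_a, 1))
print(density_b, round(mean_b, 1))
```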
Alternative splicing of introns within a gene allows a single gene to make a variety of protein isoforms. Thus multiple related proteins can be generated from a single gene and a single precursor mRNA transcript. The control of alternative RNA splicing is performed by a complex network of signalling molecules. In humans, ~95% of genes with more than one exon are alternatively spliced.[6]
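One simple kind of alternative splicing is exon skipping, where a middle exon is sometimes left out of the final RNA. The sketch below lists every product that exon skipping could give for a made-up gene. It assumes that the first and last exons are always kept and that every combination is possible; in a real cell only some combinations are made, and which ones is tightly controlled. The exon sequences and the function name are invented for the example.

```python
from itertools import combinations


def exon_skipping_isoforms(exons):
    """List the RNAs made by keeping or skipping each middle exon.

    exons: list of exon sequences in gene order (at least two).
    The first and last exons are always kept (an assumption of this sketch).
    """
    first, *middle, last = exons
    isoforms = []
    # Go from "keep all middle exons" down to "skip all middle exons".
    for kept_count in range(len(middle), -1, -1):
        for kept in combinations(middle, kept_count):
            isoforms.append("".join([first, *kept, last]))
    return isoforms


# Made-up gene with four exons.
exons = ["AUGGCC", "UUUGGA", "CCGAAA", "UAA"]
isoforms = exon_skipping_isoforms(exons)
for rna in isoforms:
    print(rna)
```

With two middle exons there are four possible products, so one precursor transcript can give several related RNAs.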
References
Alberts, Bruce (2008). Molecular biology of the cell. New York: Garland Science. ISBN 978-0-8153-4105-5.
Stryer, Lubert; Berg, Jeremy Mark; Tymoczko, John L. (2007). Biochemistry. San Francisco: W.H. Freeman. ISBN 978-0-7167-6766-4.
Pan, Q.; Shai, O.; Lee, L.J.; Frey, B.J.; Blencowe, B.J. (2008). Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nature Genetics 40 (12): 1413–1415. doi:10.1038/ng.259. PMID 18978789.