The different types of splicing mutations in genes. Mutations within the splicing regions of genes can lead to a defective transcript and protein. Depending on where exactly the mutation occurs and which "cryptic" splice site near the original site is chosen for splicing, the specific defect in the transcript and protein will vary. Frequently, splicing mutations will lead to exon skipping, intron inclusion, exon extension/truncation, and premature termination in the resulting transcript. The various defects in the transcript will in turn result in different kinds of disruption in the amino acid sequence of the protein.
The Shapiro—Senapathy algorithm (S&S) is a computational method for identifying splice sites in eukaryotic genes. The algorithm employs a Position Weight Matrix (PWM) scoring formula to predict donor and acceptor splice sites in any given gene. This methodology has been used to discover splice sites and disease-causing splice site mutations in the human genome, and has become a standard tool in clinical genomics.
The S&S algorithm has been cited in thousands of clinical studies, according to Google Scholar. It has also formed the basis of widely used software, including Human Splicing Finder,[1] SROOGLE,[2] and Alamut,[3] which identify splice sites and splice site mutations that cause disease. The algorithm has uncovered splicing mutations in diseases ranging from cancers to inherited disorders, and predicted the deleterious effects of these mutations including exon skipping, intron retention, and cryptic splice site activation.
The algorithm
A splice site defines the boundary between a coding exon and a non-coding intron in eukaryotic genes. The S&S algorithm employs a sliding window, corresponding to the length of the splice site motif, to scan a gene sequence and detect potential splice sites. For each sliding window, the algorithm calculates a score by comparing the nucleotide sequence to a Position Weight Matrix (PWM) derived from known splice sites. This formula generates a percentile score, indicating the likelihood that a given sequence functions as a donor or acceptor splice site.
The majority of disease-causing mutations in the human genome are located in splice sites. Clinical genomics studies analyze the splice site scores generated by the S&S algorithm to predict the consequences of splice site mutations including exon skipping and intron retention. The algorithm’s sensitivity to single-nucleotide changes allows it to determine mutations that may impact RNA splicing and contribute to disease.
In addition to identifying real splice sites, the S&S algorithm has been used to discover cryptic splice sites — alternative splice sites activated by mutations — which may disrupt normal splicing. The algorithm detects mutations that lead to the activation of cryptic splice sites, which may be located proximal to real splice sites or deep within non-coding introns. It has thus been used to determine the causes of numerous diseases that are due to cryptic splicing.
Specific mutations in different splice sites in various genes causing breast cancer (e.g., BRCA1, PALB2), ovarian cancer (e.g., SLC9A3R1, COL7A1, HSD17B7), colon cancer (e.g., APC, MLH1, DPYD), colorectal cancer (e.g., COL3A1, APC, HLA-A), skin cancer (e.g., COL17A1, XPA, POLH), and Fanconi anemia (e.g., FANC, FANA) have been uncovered. The mutations in the donor and acceptor splice sites in different genes causing a variety of cancers that have been identified by S&S are shown in Table 1.
Creates a novel 5’ splice site that results in a 4 nucleotide deletion of the 3’ end of exon 16[44]
Table 1. Mutations in the donor and acceptor splice sites in different genes
Discovery of genes causing inherited disorders using S&S
Specific mutations in different splice sites in various genes that cause inherited disorders, including, for example, Type 1 diabetes (e.g., PTPN22, TCF1 (HCF-1A)), hypertension (e.g., LDL, LDLR, LPL), Marfan syndrome (e.g., FBN1, TGFBR2, FBN2), cardiac diseases (e.g., COL1A2, MYBPC3, ACTC1), eye disorders (e.g., EVC, VSX1) have been uncovered. A few example mutations in the donor and acceptor splice sites in different genes causing a variety of inherited disorders identified using S&S are shown in Table 2.
Xeroderma pigmentosum, an autosomal recessive disorder is caused by faulty proteins formed due to new preferred splice donor site identified using S&S algorithm and resulted in defective nucleotide excision repair.[27]
Type I Bartter syndrome (BS) is caused by mutations in the gene SLC12A1. S&S algorithm helped in disclosing the presence of two novel heterozygous mutations c.724 + 4A > G in intron 5 and c.2095delG in intron 16 leading to complete exon 5 skipping.[28]
Mutations in the MYH gene, which is responsible for removing the oxidatively damaged DNA lesion are cancer-susceptible in the individuals. The IVS1+5C plays a causative role in the activation of a cryptic splice donor site and the alternative splicing in intron 1, S&S algorithm shows, guanine (G) at the position of IVS+5 is well conserved (at the frequency of 84%) among primates. This also supported the fact that the G/C SNP in the conserved splice junction of the MYH gene causes the alternative splicing of intron 1 of the β type transcript.[29]
Splice site scores were calculated according to S&S to find EBV infection in X-linked lymphoproliferative disease.[57] Identification of Familial tumoral calcinosis (FTC) is an autosomal recessive disorder characterized by ectopic calcifications and elevated serum phosphate levels and it is because of aberrant splicing.[58]
Application of S&S in hospitals for clinical practice and research
The Shapiro–Senapathy (S&S) algorithm has played a significant role in advancing the diagnosis and treatment of human diseases through its application in modern clinical genomics. With the widespread adoption of next-generation sequencing (NGS) technologies, the S&S algorithm is now routinely integrated into clinical practice by geneticists and diagnostic laboratories. It is implemented in various computational tools such as Human Splicing Finder (HSF),[1] Splice Site Finder (SSF),[59] and Alamut Visual,[3] which assist in interpreting the functional impact of genetic variants on RNA splicing.
The algorithm is particularly useful in identifying pathogenic splice site mutations in cases where the clinical presentation is unclear or where conventional diagnostic methods have failed to identify a causative gene. Its utility has been demonstrated across diverse patient cohorts, including individuals from different ethnic backgrounds with various cancers and inherited genetic disorders. The following are selected examples illustrating its application in clinical research.
Cancers
Cancer type
Publication title
Year
Ethnicity
Number of patients
1
Hereditary Breast Cancer
Uncovering the clinical relevance of unclassified variants in DNA repair genes: a focus on BRCA negative Tunisian cancer families[60]
2024
Tunisian
67 Patients
2
Basel Cell Carcinoma
PTCH1 gene variants rs357564, rs2236405, rs2297086 and rs41313327, mRNA and tissue expression in basal cell carcinoma patients from Western Mexico[61]
2024
Western Mexico
250 Patients
290 Control
3
Non-Small Cell Lung Cancer
Associations between telomere attrition, genetic variants in telomere maintenance genes, and non-small cell lung cancer risk in the Jammu and Kashmir population of North India[62]
2023
India
162 Patients 561 Controls
4
Prostate Cancer
Somatic and germline aberrations in homologous recombination repair genes in Chinese prostate cancer patients[63]
2023
Chinese
721 Patients
5
Colorectal Cancer
Lynch-like syndrome is as frequent as Lynch syndrome in early-onset nonfamilial nonpolyposis colorectal cancer[64]
2023
Argentina
102 patients
6
Colorectal Cancer
Germline Variants of CYBA and TRPM4 Predispose to Familial Colorectal Cancer[65]
2022
Poland
15 Families
7
Hereditary Ovarian Cancer
The Genetic and Molecular Analyses of RAD51C and RAD51D Identifies Rare Variants Implicated in Hereditary Ovarian Cancer from a Genetically Unique Population[66]
2022
French Canadians
17 Families
53 Patients
8
Thymic Carcinoma
Mutation profile and immunoscore signature in thymic carcinomas: An exploratory study and review of the literature[67]
2021
Italian
15 Patients
9
Neurofibromatosis Type 1
Simultaneous Detection of NF1, SPRED1, LZTR1, and NF2 Gene Mutations by Targeted NGS in an Italian Cohort of Suspected NF1 Patients[68]
2020
Italian
250 Patients
10
Neuroendocrine Pancreatic Tumor
Identification of new candidate genes and signalling pathways associated with the development of neuroendocrine pancreatic tumours based on next generation sequencing data[69]
2020
Caucasian
24 Patients
11
Oral Cancer
Polymorphic variants of drug-metabolizing enzymes alter the risk and survival of oral cancer patients[70]
2020
Indian
909 Controls
539 Patients
12
Endometrial Cancer
Targeted sequencing of genes associated with the mismatch repair pathway in patients with endometrial cancer[71]
2020
Australia
199 patients
13
Breast cancer
The germline mutational landscape of BRCA1 and BRCA2 in Brazil[72]
2018
Brazil
649 Patients
14
Hereditary non-polyposis colorectal cancer
Prevalence and characteristics of hereditary non-polyposis colorectal cancer (HNPCC) syndrome in immigrant Asian colorectal cancer patients[10]
2017
Asian Immigrant
143 Patients
15
Renal cell cancer
Genetic screening of the FLCN gene identify six novel variants and a Danish founder mutation[73]
2016
Danish
143 individuals
16
Nevoid basal cell carcinoma syndrome
Nevoid basal cell carcinoma syndrome caused by splicing mutations in the PTCH1 gene[7]
2016
Japanese
10 Patients
17
Prostate cancer
Identification of Two Novel HOXB13 Germline Mutations in Portuguese Prostate Cancer Patients[74]
2015
Portuguese
462 Patients, 132 Controls
18
Colorectal adenomatous polyposis
Identification of Novel Causative Genes for Colorectal Adenomatous Polyposis
2015
German
181 Patients,531 Controls
Inherited disorders
Disease name
Publication title
Year
Ethnicity
Number of patients
1
Congenital Myopathy
Exome sequencing in undiagnosed congenital myopathy reveals new genes and refines genes–phenotypes correlations[75]
2024
Multiple Population
310 Families (429 patients)
2
Neurodevelopmental Delay and Neurodevelopmental Comorbidities
Phenotypic and genetic analysis of children with unexplained neurodevelopmental delay and neurodevelopmental comorbidities in a Chinese cohort using trio-based whole-exome sequencing[76]
2024
Chinese
163 Patients
3
Congenital Cataracts
Evaluation of Genetic Testing in a Cohort of Diverse Pediatric Patients in the United States with Congenital Cataracts[77]
2023
Chicago, USA
52 Patients
4
X-linked hypophosphatemia
A genetic study of a Brazilian cohort of patients with X-linked hypophosphatemia reveals no correlation between genotype and phenotype[78]
2023
Brazil
41 Patients
5
Hereditary Cerebellar Ataxia
Molecular Characterization of Portuguese Patients with Hereditary Cerebellar Ataxia[79]
2022
Portuguese
19 Families (30 Individual)
6
Stargardt disease
ABCA4 c.859-25A>G, a Frequent Palestinian Founder Mutation Affecting the Intron 7 Branchpoint, Is Associated With Early-Onset Stargardt Disease[80]
2022
Palestinian
175 patients
7
Hearing Impairment & Retinal Dystrophy
Unraveling the genetic complexities of combined retinal dystrophy and hearing impairment[81]
2021
Mexican & Iranian
59 patients
8
Angelman syndrome
New genes involved in Angelman syndrome-like: Expanding the genetic spectrum[82]
2021
Spain
14 patients
9
Acute intermittent porphyria
Molecular Analysis of 55 Spanish Patients with Acute Intermittent Porphyria[83]
2020
Spanish
55 patients
10
Hearing loss
Novel Loss-of-Function Variants in CDC14A are Associated with Recessive Sensorineural Hearing Loss in Iranian and Pakistani Patients[84]
2020
Iranian & Pakistani
2 Families
11
Non-syndromic hearing loss
GJB2 and GJB6 Genetic Variant Curation in an Argentinean Non-Syndromic Hearing-Impaired Cohort[85]
2020
Argentinean
600 patients
12
Inherited retinal diseases
Molecular genetic analysis using targeted NGS analysis of 677 individuals with retinal dystrophy[86]
Genetic Evidence Supporting the Role of the Calcium Channel, CACNA1S, in Tooth Cusp and Root Patterning[87]
2018
Thai families
11 Patients,18 Controls
14
Unclear speech developmental delay
Progressive SCAR14 with unclear speech, developmental delay, tremor, and behavioral problems caused by a homozygous deletion of the SPTBN2 pleckstrin homology domain[88]
Genetics of Age-related Macular Degeneration and Stargardt disease in South African populations[93]
2015
African Populations
32 Patients
S&S - Algorithm for identifying splice sites, exons and split genes
The Shapiro–Senapathy algorithm (SSA) was developed to identify splice sites in uncharacterized genomic sequences, with early applications in the Human Genome Project.[94][95] The method introduced a Position Weight Matrix (PWM)-based approach to analyze splicing sequences across eukaryotic organisms, marking the first computational framework to systematically define splice sites using probabilistic scoring.
Key innovations of the algorithm included:
Exon Detection – Exons were defined as sequences bounded by acceptor and donor splice sites with S&S scores above a threshold, requiring an open reading frame (ORF) for validation.
Gene Prediction – The method enabled the identification of complete genes by assembling predicted exons, forming a basis for later gene-finding tools.
Mutation Analysis – The algorithm distinguishes deleterious splice-site mutations (which disrupt protein function by lowering S&S scores) from neutral variations. This capability allowed researchers to study disease-linked cryptic splice sites in humans, animals, and plants.
SSA's PWM-based framework influenced subsequent computational methods, including machine learning and neural network approaches, for splice-site prediction and alternative splicing research. It remains a foundational tool in genomics and disease studies.
Discovering the mechanisms of aberrant splicing in diseases
The Shapiro–Senapathy algorithm has been used to determine the various aberrant splicing mechanisms in genes due to deleterious mutations in the splice sites, which cause numerous diseases. Deleterious splice site mutations impair the normal splicing of the gene transcripts, and thereby make the encoded protein defective. A mutant splice site can become “weak” compared to the original site, due to which the mutated splice junction becomes unrecognizable by the spliceosomal machinery. This can lead to the skipping of the exon in the splicing reaction, resulting in the loss of that exon in the spliced mRNA (exon-skipping). On the other hand, a partial or complete intron could be included in the mRNA due to a splice site mutation that makes it unrecognizable (intron inclusion). A partial exon-skipping or intron inclusion can lead to premature termination of the protein from the mRNA, which will become defective leading to diseases. The S&S has thus paved the way to determine the mechanisms by which a deleterious mutation could lead to a defective protein, resulting in different diseases depending on which gene is affected.
lead to exon skipping, intron inclusion, or the use of a cryptic splice site, resulting in either a truncated protein or a protein lacking a small region of the coding sequence[98]
An example of splicing aberration (exon skipping) caused by a mutation in the donor splice site in the exon 8 of MLH1 gene that led to colorectal cancer is given below. This example shows that a mutation in a splice site within a gene can lead to a profound effect in the sequence and structure of the mRNA, and the sequence, structure and function of the encoded protein, leading to disease.
Exon Skipping caused by a donor mutation in the gene MLH1 leading to colorectal cancer. The generation of a mRNA from a split gene involves the transcription of the gene into the primary RNA transcript, and the precise removal of the introns and the joining of the exons from the primary RNA transcript. A deleterious mutation within the splicing signals (donor or acceptor splice sites) can affect the recognition of the correct splice junction and lead to an aberration in the joining of the authentic exons. Depending on if the mutation occurs within the donor or the acceptor site, and the particular base that is mutated within the splice sequence, the aberration could lead to the skipping of a complete or partial exon, or the inclusion of a partial intron or a cryptic exon in the mRNA produced by the splicing process. Either of these situations will usually lead to a premature stop codon in the mRNA and result in a completely defective protein. The S&S algorithm aids in determining which splice site and exon in a gene are mutated, and the S&S score of the mutated splice site aids in determining the type of splicing aberration and the resulting mRNA structure and sequence. The example gene MLH1 affected in colorectal cancer is shown in the figure. It was found using the S&S algorithm that a mutation in the donor splice site in exon 8 led to the skipping of the exon 8. The mRNA thus lacks the sequence corresponding to exon 8 (sequence positions are shown in the figure). This causes a frame shift in the mRNA coding sequence at amino acid position 226, leading to premature protein truncation at amino acid position 233. This mutated protein is completely defective, which has led to colorectal cancer in the patient.
S&S in cryptic splice sites research and medical applications
The proper identification of splice sites has to be highly precise as the consensus splice sequences are very short and there are many other sequences similar to the authentic splice sites within gene sequences, which are known as cryptic, non-canonical, or pseudo splice sites. When an authentic or real splice site is mutated, any cryptic splice sites present close to the original real splice site could be erroneously used as authentic site, resulting in an aberrant mRNA. The erroneous mRNA may include a partial sequence from the neighboring intron or lose a partial exon, which may result in a premature stop codon. The result may be a truncated protein that would have lost its function completely.
Shapiro–Senapathy algorithm can identify the cryptic splice sites, in addition to the authentic splice sites. Cryptic sites can often be stronger than the authentic sites, with a higher S&S score. However, due to the lack of an accompanying complementary donor or acceptor site, this cryptic site will not be active or used in a splicing reaction. When a neighboring real site is mutated to become weaker than the cryptic site, then the cryptic site may be used instead of the real site, resulting in a cryptic exon and an aberrant transcript.
Numerous diseases have been caused by cryptic splice site mutations or usage of cryptic splice sites due to the mutations in authentic splice sites.[100][101][102][103][104]
The mRNA splicing plays a fundamental role in gene functional regulation. Very recently, it has been shown that A to G conversions at splice sites can lead to mRNA mis-splicing in Arabidopsis.[110] The splicing and exon–intron junction prediction coincided with the GT/AG rule (S&S) in the Molecular characterization and evolution of carnivorous sundew (Drosera rotundifolia L.) class V b-1,3-glucanase.[111] Unspliced (LSDH) and spliced (SSDH) transcripts of NAD+ dependent sorbitol dehydroge nase (NADSDH) of strawberry (Fragaria ananassa Duch., cv. Nyoho) were investigated for phytohormonal treatments.[112]
Ambra1 is a positive regulator of autophagy, a lysosome-mediated degradative process involved both in physiological and pathological conditions. Nowadays, this function of Ambra1 has been characterized only in mammals and zebrafish.[106] Diminution of rbm24a or rbm24b gene products by morpholino knockdown resulted in significant disruption of somite formation in mouse and zebrafish.[107] Dr.Senapathy algorithm used extensively to study intron-exon organization of fut8 genes. The intron-exon boundaries of Sf9 fut8 were in agreement with the consensus sequence for the splicing donor and acceptor sites concluded using S&S.[108]
^ abHoudayer, Claude (2011), "In Silico Prediction of Splice-Affecting Nucleotide Variants", In Silico Tools for Gene Discovery, Methods in Molecular Biology, vol. 760, Humana Press, pp. 269–281, doi:10.1007/978-1-61779-176-5_17, ISBN9781617791758, PMID21780003
^Damiola, Francesca; Schultz, Inès; Barjhoux, Laure; Sornin, Valérie; Dondon, Marie-Gabrielle; Eon-Marchais, Séverine; Marcou, Morgane; Caron, Olivier; Gauthier-Villars, Marion (2015-11-12). "Mutation analysis of PALB2 gene in French breast cancer families". Breast Cancer Research and Treatment. 154 (3): 463–471. doi:10.1007/s10549-015-3625-7. ISSN0167-6806. PMID26564480. S2CID12852074.
^ abDudley, Beth; Brand, Randall E.; Thull, Darcy; Bahary, Nathan; Nikiforova, Marina N.; Pai, Reetesh K. (August 2015). "Germline MLH1 Mutations Are Frequently Identified in Lynch Syndrome Patients With Colorectal and Endometrial Carcinoma Demonstrating Isolated Loss of PMS2 Immunohistochemical Expression". The American Journal of Surgical Pathology. 39 (8): 1114–1120. doi:10.1097/pas.0000000000000425. ISSN0147-5185. PMID25871621. S2CID26069072.
^Mensenkamp, Arjen R.; Vogelaar, Ingrid P.; van Zelst–Stams, Wendy A.G.; Goossens, Monique; Ouchene, Hicham; Hendriks–Cornelissen, Sandra J.B.; Kwint, Michael P.; Hoogerbrugge, Nicoline; Nagtegaal, Iris D. (March 2014). "Somatic Mutations in MLH1 and MSH2 Are a Frequent Cause of Mismatch-Repair Deficiency in Lynch Syndrome-Like Tumors". Gastroenterology. 146 (3): 643–646.e8. doi:10.1053/j.gastro.2013.12.002. ISSN0016-5085. PMID24333619.
^ abvan der Post, Rachel S.; Vogelaar, Ingrid P.; Manders, Peggy; van der Kolk, Lizet E.; Cats, Annemieke; van Hest, Liselotte P.; Sijmons, Rolf; Aalfs, Cora M.; Ausems, Margreet G.E.M. (October 2015). "Accuracy of Hereditary Diffuse Gastric Cancer Testing Criteria and Outcomes in Patients With a Germline Mutation in CDH1". Gastroenterology. 149 (4): 897–906.e19. doi:10.1053/j.gastro.2015.06.003. ISSN0016-5085. PMID26072394.
^Castiglia, Daniele; Pagani, Elena; Alvino, Ester; Vernole, Patrizia; Marra, Giancarlo; Cannavò, Elda; Jiricny, Josef; Zambruno, Giovanna; D'Atri, Stefania (June 2003). "Biallelic somatic inactivation of the mismatch repair gene MLH1 in a primary skin melanoma". Genes, Chromosomes and Cancer. 37 (2): 165–175. doi:10.1002/gcc.10193. ISSN1045-2257. PMID12696065. S2CID1228058.
^ abSidwell, R.U.; Sandison, A.; Wing, J.; Fawcett, H.D.; Seet, J-E.; Fisher, C.; Nardo, T.; Stefanini, M.; Lehmann, A.R. (July 2006). "A novel mutation in the XPA gene associated with unusually mild clinical features in a patient who developed a spindle cell melanoma". British Journal of Dermatology. 155 (1): 81–88. doi:10.1111/j.1365-2133.2006.07272.x. ISSN0007-0963. PMID16792756. S2CID42003864.
^ abNozu, Kandai; Iijima, Kazumoto; Kawai, Kazuo; Nozu, Yoshimi; Nishida, Atsushi; Takeshima, Yasuhiro; Fu, Xue Jun; Hashimura, Yuya; Kaito, Hiroshi (10 July 2009). "In vivo and in vitro splicing assay of SLC12A1 in an antenatal salt-losing tubulopathy patient with an intronic mutation". Human Genetics. 126 (4): 533–538. doi:10.1007/s00439-009-0697-7. ISSN0340-6717. PMID19513753. S2CID20181541.
^Becker, A. J.; Löbach, M.; Klein, H.; Normann, S.; Nöthen, M. M.; von Deimling, A.; Mizuguchi, M.; Elger, C. E.; Schramm, J. (March 2001). "Mutational analysis of TSC1 and TSC2 genes in gangliogliomas". Neuropathology and Applied Neurobiology. 27 (2): 105–114. doi:10.1046/j.0305-1846.2001.00302.x. ISSN0305-1846. PMID11437991. S2CID9696988.
^Schick, Volker; Majores, Michael; Engels, Gudrun; Spitoni, Sylvia; Koch, Arend; Elger, Christian E.; Simon, Matthias; Knobbe, Christiane; Blümcke, Ingmar (2006-09-30). "Activation of Akt independent of PTEN and CTMP tumor-suppressor gene mutations in epilepsy-associated Taylor-type focal cortical dysplasias". Acta Neuropathologica. 112 (6): 715–725. doi:10.1007/s00401-006-0128-y. ISSN0001-6322. PMID17013611. S2CID35008161.
^Muller, Danièle; Mazoyer, Sylvie; Stoppa-Lyonnet, Dominique; Sinilnikova, Olga M.; Andrieu, Nadine; Fricker, Jean-Pierre; Bignon, Yves-Jean; Longy, Michel; Lasset, Christine (2015-12-01). "Mutation analysis of PALB2 gene in French breast cancer families". Breast Cancer Research and Treatment. 154 (3): 463–471. doi:10.1007/s10549-015-3625-7. ISSN1573-7217. PMID26564480. S2CID12852074.
^Masunaga, Takuji; Ogawa, Junki; Akiyama, Masashi; Nishikawa, Takeji; Shimizu, Hiroshi; Ishiko, Akira (2017). "Compound heterozygosity for novel splice site mutations of ITGA6 in lethal junctional epidermolysis bullosa with pyloric atresia". The Journal of Dermatology. 44 (2): 160–166. doi:10.1111/1346-8138.13575. ISSN1346-8138. PMID27607025. S2CID3934121.
^Hansen, Thomas vO; Nielsen, Finn C.; Gerdes, Anne-Marie; Ousager, Lilian B.; Jensen, Uffe B.; Skytte, Anne-Bine; Albrechtsen, Anders; Rossing, Maria (February 2017). "Genetic screening of the FLCN gene identify six novel variants and a Danish founder mutation". Journal of Human Genetics. 62 (2): 151–157. doi:10.1038/jhg.2016.118. ISSN1435-232X. PMID27734835. S2CID24558301.
^Jääskeläinen, Pertti; Kuusisto, Johanna; Miettinen, Raija; Kärkkäinen, Päivi; Kärkkäinen, Satu; Heikkinen, Sami; Peltola, Paula; Pihlajamäki, Jussi; Vauhkonen, Ilkka (4 November 2002). "Mutations in the cardiac myosin-binding protein C gene are the predominant cause of familial hypertrophic cardiomyopathy in eastern Finland". Journal of Molecular Medicine. 80 (7): 412–422. doi:10.1007/s00109-002-0323-9. ISSN0946-2716. PMID12110947. S2CID7089974.
^Attanasio, M; Lapini, I; Evangelisti, L; Lucarini, L; Giusti, B; Porciani, MC; Fattori, R; Anichini, C; Abbate, R (2008-04-23). "FBN1 mutation screening of patients with Marfan syndrome and related disorders: detection of 46 novel FBN1 mutations". Clinical Genetics. 74 (1): 39–46. doi:10.1111/j.1399-0004.2008.01007.x. ISSN0009-9163. PMID18435798. S2CID205406696.
^Kit, Oleg I.; Trifanov, Vladimir S.; Petrusenko, Nataliya A.; Gvaldin, Dmitry Y.; Kutilin, Denis S.; Timoshkina, Nataliya N. (June 2020). "Identification of new candidate genes and signalling pathways associated with the development of neuroendocrine pancreatic tumours based on next generation sequencing data". Molecular Biology Reports. 47 (6): 4233–4243. doi:10.1007/s11033-020-05534-z. PMID32451928.
^Rossing, Maria; Albrechtsen, Anders; Skytte, Anne-Bine; Jensen, Uffe B; Ousager, Lilian B; Gerdes, Anne-Marie; Nielsen, Finn C; Hansen, Thomas vO (2016-10-13). "Genetic screening of the FLCN gene identify six novel variants and a Danish founder mutation". Journal of Human Genetics. 62 (2): 151–157. doi:10.1038/jhg.2016.118. ISSN1434-5161. PMID27734835. S2CID24558301.
^Yıldız Bölükbaşı, Esra; Afzal, Muhammad; Mumtaz, Sara; Ahmad, Nafees; Malik, Sajid; Tolun, Aslıhan (2017-06-21). "Progressive SCAR14 with unclear speech, developmental delay, tremor, and behavioral problems caused by a homozygous deletion of the SPTBN2 pleckstrin homology domain". American Journal of Medical Genetics Part A. 173 (9): 2494–2499. doi:10.1002/ajmg.a.38332. ISSN1552-4825. PMID28636205. S2CID5586800.
^Abdelkreem, Elsayed; Akella, Radha Rama Devi; Dave, Usha; Sane, Sudhir; Otsuka, Hiroki; Sasai, Hideo; Aoyama, Yuka; Nakama, Mina; Ohnishi, Hidenori (2016-12-08), "Clinical and Mutational Characterizations of Ten Indian Patients with Beta-Ketothiolase Deficiency", JIMD Reports, 35, Springer Berlin Heidelberg: 59–65, doi:10.1007/8904_2016_26, ISBN9783662558324, PMC5585108, PMID27928777
^Davoodi-Semiromi, Abdoreza; Lanyon, George W.; Davidson, Rosemary; Connor, Michael J. (2000-11-06). "Aberrant RNA splicing in the hMSH2 gene: Molecular identification of three aberrant RNA in Scottish patients with colorectal cancer in the West of Scotland". American Journal of Medical Genetics. 95 (1): 49–52. doi:10.1002/1096-8628(20001106)95:1<49::aid-ajmg10>3.0.co;2-p. ISSN1096-8628. PMID11074494.
^van den Hurk, José A. J. M.; van de Pol, Dorien J. R.; Wissinger, Bernd; van Driel, Marc A.; Hoefsloot, Lies H.; de Wijs, Ilse J.; van den Born, L. Ingeborgh; Heckenlively, John R.; Brunner, Han G. (2003-06-25). "Novel types of mutation in the choroideremia (CHM) gene: a full-length L1 insertion and an intronic mutation activating a cryptic exon". Human Genetics. 113 (3): 268–275. doi:10.1007/s00439-003-0970-0. ISSN0340-6717. PMID12827496. S2CID23750723.
^Infante, Joana B.; Alvelos, Maria I.; Bastos, Margarida; Carrilho, Francisco; Lemos, Manuel C. (January 2016). "Complete androgen insensitivity syndrome caused by a novel splice donor site mutation and activation of a cryptic splice donor site in the androgen receptor gene". The Journal of Steroid Biochemistry and Molecular Biology. 155 (Pt A): 63–66. doi:10.1016/j.jsbmb.2015.09.042. ISSN0960-0760. PMID26435450. S2CID33393364.
^Niba, E.; Nishuda, A.; Tran, V.; Vu, D.; Matsumoto, M.; Awano, H.; Lee, T.; Takeshima, Y.; Nishio, H. (June 2016). "Cryptic splice site activation by a splice donor site mutation of dystrophin intron 64 is determined by intronic splicing regulatory elements". Neuromuscular Disorders. 26: S96. doi:10.1016/j.nmd.2016.06.042. ISSN0960-8966. S2CID54267534.
^Qadah, Talal; Finlayson, Jill; Joly, Philippe; Ghassemifar, Reza (2013-11-25). "Molecular and Cellular Analysis of a NovelHBA2Mutation (HBA2: c.94A>G) Shows Activation of a Cryptic Splice Site and Generation of a Premature Termination Codon". Hemoglobin. 38 (1): 13–18. doi:10.3109/03630269.2013.858639. ISSN0363-0269. PMID24274170. S2CID28120011.
^ abGasparini, Fabio; Skobo, Tatjana; Benato, Francesca; Gioacchini, Giorgia; Voskoboynik, Ayelet; Carnevali, Oliana; Manni, Lucia; Valle, Luisa Dalla (2016-02-01). "Characterization of Ambra1 in asexual cycle of a non-vertebrate chordate, the colonial tunicate Botryllus schlosseri, and phylogenetic analysis of the protein group in Bilateria". Molecular Phylogenetics and Evolution. 95: 46–57. Bibcode:2016MolPE..95...46G. doi:10.1016/j.ympev.2015.11.001. ISSN1055-7903. PMID26611831.
^Feng, Jiayue; Li, Jing; Liu, Hong; Gao, Qinghua; Duan, Ke; Zou, Zhirong (2012-10-03). "Isolation and Characterization of a Calcium-Dependent Protein Kinase Gene, FvCDPK1, Responsive to Abiotic Stress in Woodland Strawberry (Fragaria vesca)". Plant Molecular Biology Reporter. 31 (2): 443–456. doi:10.1007/s11105-012-0513-8. ISSN0735-9640. S2CID14378361.
^Philip, Anna; Syamaladevi, Divya P.; Chakravarthi, M.; Gopinath, K.; Subramonian, N. (2013-03-19). "5′ Regulatory region of ubiquitin 2 gene from Porteresia coarctata makes efficient promoters for transgene expression in monocots and dicots". Plant Cell Reports. 32 (8): 1199–1210. Bibcode:2013PCelR..32.1199P. doi:10.1007/s00299-013-1416-3. ISSN0721-7714. PMID23508257. S2CID12170634.