Minor allele frequency

Minor allele frequency (MAF) is the frequency at which the second most common allele occurs in a given population. They play a surprising role in heritability since MAF variants which occur only once, known as "singletons", drive an enormous amount of selection.[1]

Single nucleotide polymorphisms (SNPs) with a minor allele frequency of 0.05 (5%) or greater were targeted by the HapMap project.[2]

MAF is widely used in population genetics studies because it provides information to differentiate between common and rare variants in the population. As an example, a 2015 study sequenced the whole genomes of 2,120 Sardinian individuals. The authors classified the variants found in the study in three classes according to their MAF. It was observed that rare variants (MAF < 0.05) appeared more frequently in coding regions than common variants (MAF > 0.05) in this population.[3]

Interpreting MAF data

1. Introduce the reference of a SNP of interest, as an example: rs429358, in a database (dbSNP or other).

2. Find MAF/MinorAlleleCount link. MAF/MinorAlleleCount: C=0.1506/754 (1000 Genomes, where number of genomes sampled = N = 2504);[4] where C is the minor allele for that particular locus; 0.1506 is the frequency of the C allele (MAF), i.e. 15% within the 1000 Genomes database; and 754 is the number of times this SNP has been observed in the population of the study. To find the number, note that , where is to account for diploidy.

See also

References

  1. ^ Hernandez, Ryan D.; Uricchio, Lawrence H.; Hartman, Kevin; Ye, Chun; Dahl, Andrew; Zaitlen, Noah (September 2019). "Ultrarare variants drive substantial cis heritability of human gene expression". Nature Genetics. 51 (9): 1349–1355. doi:10.1038/s41588-019-0487-7. ISSN 1546-1718. PMC 6730564. PMID 31477931.
  2. ^ The International HapMap Consortium (2005). "A haplotype map of the human genome". Nature. 437 (7063): 1299–1320. Bibcode:2005Natur.437.1299T. doi:10.1038/nature04226. PMC 1880871. PMID 16255080.
  3. ^ Sidore, C., y colaboradores (2015). "Genome sequencing elucidates Sardinian genetic architecture and augments association analyses for lipid and blood inflammatory markers". Nature Genetics. 47 (11): 1272–1281. doi:10.1038/ng.3368. PMC 4627508. PMID 26366554.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  4. ^ National Center for Biotechnology Information: New SNP Attributes