BioSamples

BioSamples
Content
DescriptionA database containing aggregated information pertaining to reference samples and samples stored in the European Bioinformatics Institute assay databases.
Data types
captured
Biological sample metadata
OrganismsAll
Contact
Research centerEuropean Bioinformatics Institute.
AuthorsMikhail Gostev
Primary citationGostev & al. (2012)[1]
Release date2011
Access
Data formatXML, RDF
WebsiteEBI page, NCBI page
Download URLEBI FTP
Web service URLREST
Sparql endpointBioSD Sparql
Tools
WebSample display, advanced search by samples and groups, sorting by columns, links to assay database record
Miscellaneous
LicenseUnrestricted
VersioningYes
Data release
frequency
Daily
Curation policyYes (manual)
Bookmarkable
entities
Yes - samples and sample groups

BioSamples (BioSD) is a database at European Bioinformatics Institute for the information about the biological samples used in sequencing.[1]

It stores submitter-supplied metadata about the biological materials from which data stored in the National Center for Biotechnology Information’s (NCBI) primary data archives are derived. NCBI’s archives hosts data pertaining to diverse types of samples from many species, and as such the BioSample database is similarly diverse. Examples of a BioSample include a primary tissue biopsy, an individual organism or an environmental isolate.

The BioSamples database captures sample metadata in a structured way by encouraging use of controlled sample attribute field name vocabularies. This metadata is key in giving the sample data context, allowing it to be more fully understood, reused, and enables aggregation of disparate data sets.

Sample metadata is linked to relevant experimental data across many archival databases relieving submitter burden by enabling one-time submission of sample description. They then can reference that sample, when necessary, when making data deposits to other archives.

BioSample records are indexed and searchable, supporting cross-database queries by sample description.

History

The BioSamples database was launched in 2011 to help aggregate and standardise sample metadata. Historically, each archive had created its own convention for sample metadata collection. These usually were limited in their standardisation and had no method to indicate when the a sample was used across multiple data sets. In addition to this, there is a growing awareness amongst the research community that sample metadata is vital for understanding the underlying data. Further, chances for re-use, aggregation and integration of data are increased with improved metadata. The database was initially populated with existing descriptions extracted from SRA, EST, GSS and dbGaP.[2] As of May 2013, the database hosts almost 2 million BioSample records encompassing 18,000 species.[3]

Content

The BioSamples database has doubled in size since January 2012 when 1 million samples were described in the BioSamples database, as of October 2013 2,846,137 samples are available as 80,232 groups. [4] The rapid growth is predominantly due to new data sources, and increased volume of data from existing sources. New data sources include 22,288 samples from The Cancer Genome Atlas, and 920,441 samples from the Catalogue of Somatic Mutation in Cancer (COSMIC). [5]

Attributes define the material under investigation using structured name: value pairs, for example:

tissue: liver
collection date: 31-Jan-2013

After specifying the sample type, the user is presented with a list of required and optional attribute fields to fill in, as well as the opportunity to supply any number of custom descriptive attributes. The BioSample database is extendible in that new types and attributes can be added as new standards develop. In addition to BioSample type and attributes, each BioSample record also contains:

IDs An identifier block that lists not only the BioSample accession assigned to that record, but also any other external sample identifier, such as that issued by the source database or repository.
Organism The organism name and taxonomy identifier. The full taxonomic tree is displayed and searchable.
Title BioSample title. A title is auto-generated if one is not supplied by the submitter.
Description [optional] A free text field in which to store non-structured information about the sample.
Links [optional] URL to link to relevant information on external sites.
Owner Submitter information, including name and affiliation where available.
Dates Information about when the record was submitted, released, and last updated.
Access Statement about whether the record is fully public or controlled access

The full list and definitions of BioSample types and attributes is available for preview and download.[6]

Data Access

There are a number of ways in which the database can be accessed. The initial release of BioSD to the public only provided access to the database through a web interface. This web interface was subsequently updated in November 2012 and then again in March 2013 following the EBI site-wide re-launch. In February 2013, a public Application Programming Interface (API) was released using a Representational state transfer (REST) system. In October 2013, as a part of the EBI's new RDF platform a SPARQL endpoint was released, providing access to the data in the RDF format. Additionally, the database can be downloaded through EBI's FTP service.[7]

Web Interface

The web interface allows users to access the BioSD database through a web browser. It provides functionality for both searching by sample groups and by samples themselves. The search features incremental search to assist users by providing them with possible search terms as they type. Advanced search is provided and allows users to search by applying the binary terms, AND, OR and NOT, to their search terms. Additionally, a wildcard character can be used to match any combination of characters including no characters. A question mark character can also be used to match any single character.[8] Examples of these can be seen in the following table:

Search query Example results
mo*se "mouse", "moose", "mose", "mofoobarse"
mo?se "mouse", "moose", "motse"

The web interface also allows users to select search results and view further details of that search result. The detailed view provides further information and makes available a link to the assay database(s) from which the data was sourced. Ordering by columns is also provided.

Application Programming Interface

The API provides a suitable method for retrieving data in a programmatic way. It uses a RESTful system that allows users to query URI endpoints and receive XML as results. The API has URI endpoints for a number of different types of requests. These requests can be used to, find specific samples, find specific groups, search for groups, search for samples and to search for samples within a group.[9]


SPARQL Endpoint

The SPARQL endpoint allows users to search the database in a more comprehensive way than the standard web interface whilst still being usable from a web browser.[10] Through this interface, far more complex queries can be made to further enable users in their searches. However, there is an increased learning curve with this method of accessing the data. The SPARQL endpoint returns results in the RDF format which was initially designed with metadata in mind and is thus suited to the needs of BioSD.[11]

Development

The development team forms a part of Helen Parkinson's team at EMBL-EBI and contains software engineers and web developers who are assisted with domain specific knowledge by ontologists and bioinformaticians.

The primary programming language used on the project is the Java programming language. To aid the development of the project, the development teams uses the integrated development environment, IntelliJ IDEA which is provided by JetBrains. Other tools used in the project include Bamboo for continuous integration and the management of software releases. Additionally, YourKit is a Java profiler which helps optimise and eliminate bugs in the BioSD project.[12]

The project is developed as an open-source project with all source code being freely available on GitHub.[13]

Funding

Currently the primary funding for the BioSD database development and maintenance is provided by the European Molecular Biology Laboratory (EMBL) core budget which is in turn funded by its 20 member countries.[1] There has also been additional contributions from the European Commission in the form of a number of grants.[14] Further funding has come from the Human Induced Pluripotent Stem Cells Initiative provided by the Wellcome Trust and the Medical Research Council and from the EBiSC Innovative Medicines Initiative.[15]

See also

References

  1. ^ a b c Gostev, Mikhail; Faulconbridge Adam; Brandizi Marco; Fernandez-Banet Julio; Sarkans Ugis; Brazma Alvis; Parkinson Helen (Jan 2012). "The BioSample Database (BioSD) at the European Bioinformatics Institute". Nucleic Acids Res. 40 (1). England: D64-70. doi:10.1093/nar/gkr937. PMC 3245134. PMID 22096232.
  2. ^ "About biosharing database of Genotypes and Phenotypes (dbGaP)" (HTML). Retrieved 11 September 2014.
  3. ^ Barrett, Tanya (14 November 2013). "The NCBI Handbook [Internet] 2nd edition". Retrieved 11 September 2014.
  4. ^ Faulconbridge, Adam; Tony Burdett; Marco Brandizi; Mikhail Gostev; Rui Pereira; Drashtti Vasant; Ugis Sarkans; Alvis Brazma; Helen Parkinson (20 November 2013). "Updates to BioSamples database at European Bioinformatics Institute". Nucleic Acids Research. 42 (Database issue). England: D50-2. doi:10.1093/nar/gkt1081. PMC 3965081. PMID 24265224.
  5. ^ Shepherd, R; Beare D; Bamford S; Cole CG; Ward S; Bindal N; Gunasekaran P; Jia M; Kok CY; et al. (23 May 2011). "Data mining using the Catalogue of Somatic Mutations in Cancer BioMart". Database (Oxford). 2011. England: bar018. doi:10.1093/database/bar018. PMC 3263736. PMID 21609966.
  6. ^ "BioSample Template Generator". EMBL-EBI (HTML). Retrieved 11 September 2014.
  7. ^ "BioSamples News". EMBL-EBI (HTML). Archived from the original on 10 September 2014. Retrieved 11 September 2014.
  8. ^ "How to search BioSamples Database". EMBL-EBI (HTML). Archived from the original on 11 September 2014. Retrieved 11 September 2014.
  9. ^ "BioSamples API Overview". EMBL-EBI (HTML). Retrieved 29 September 2018.
  10. ^ "BioSamples Database SPARQL Endpoint". EMBL-EBI (HTML). Retrieved 11 September 2014.
  11. ^ "Biosamples Database RDF". EMBL-EBI (HTML). Retrieved 11 September 2014.
  12. ^ "About BioSamples". EMBL-EBI (HTML). Retrieved 10 September 2014.
  13. ^ "EBI BioSamples Database GitHub Project". GitHub (HTML). Retrieved 10 September 2014.
  14. ^ Faulconbridge, A.; Burdett, T.; Brandizi, M.; Gostev, M.; Pereira, R.; Vasant, D.; Sarkans, U.; Brazma, A.; Parkinson, H. (2013). "Updates to BioSamples database at European Bioinformatics Institute". Nucleic Acids Research. 42 (D1): D50 – D52. doi:10.1093/nar/gkt1081. ISSN 0305-1048. PMC 3965081. PMID 24265224.
  15. ^ "BioSamples: Quick tour". EMBL-EBI (HTML). Archived from the original on 10 September 2014. Retrieved 10 September 2014.

Read other articles:

British business magnate (born 1950) Not to be confused with Richard Bronson or Richard Brandon. SirRichard BransonBranson in 2015BornRichard Charles Nicholas Branson (1950-07-18) 18 July 1950 (age 73)London, EnglandOccupationsEntrepreneurauthorYears active1966–presentKnown forFounder of the Virgin GroupSpouses Kristen Tomassi ​ ​(m. 1972; div. 1979)​ Joan Templeman ​(m. 1989)​ Children3 (1 deceased)...

 

 

Hindu temple in Bangladesh Kal Bhairab Temple God Kal Bhairab Part of a series onHinduism Hindus History Timeline Origins Hindu synthesis (500/200 BCE-300 CE) History Indus Valley Civilisation Historical Vedic religion Dravidian folk religion Śramaṇa Tribal religions in India Traditions Major traditions Shaivism Shaktism Smartism Vaishnavism List Deities Trimurti Brahma Vishnu Shiva Tridevi Saraswati Lakshmi Parvati Other major Devas / Devis Vedic: Agni Ashvins Chandra Indra Praja...

 

 

Youth detention center in Pennsylvania Glen Mills Schools logo The Glen Mills Schools was a youth detention center for juvenile delinquents located near Glen Mills in Thornbury Township, Delaware County, Pennsylvania, United States,[1] for boys between 12 and 21 years of age. The school was founded in 1826[2] and was the oldest surviving school of its type in the United States until all residents were ordered removed on March 25, 2019, by the Pennsylvania Department of Hu...

Wouter Bos Deputi Perdana Menteri BelandaMasa jabatan22 Februari 2007 – 23 Februari 2010Menjabat dengan André RouvoetPerdana MenteriJan Peter Balkenende PendahuluGerrit ZalmPenggantiAndré RouvoetMenteri KeuanganMasa jabatan22 Februari 2007 – 23 Februari 2010Perdana MenteriJan Peter Balkenende PendahuluGerrit ZalmPenggantiJan Kees de JagerSekretaris Negara untuk KeuanganMasa jabatan24 Maret 2000 – 22 Juli 2002Perdana MenteriWim Kok PendahuluWillem VermeendPen...

 

 

Mercedes-Benz Kelas-BInformasiProdusenMercedes-BenzMasa produksi2005-sekarangBodi & rangkaKelasMPV subkompak eksekutifBentuk kerangkaHatchback 5-pintuTata letakMesin depan, penggerak roda depanMobil terkaitMercedes-Benz Kelas-AMercedes-Benz Kelas-CLA Mercedes-Benz Kelas-B merupakan sebuah MPV kompak yang diciptakan produsen otomotif Jerman, Mercedes-Benz, sejak tahun 2005. Di Indonesia, mobil ini dijual dalam satu varian yaitu B200. Generasi Pertama (W245; 2005) Generasi Pertama (W24...

 

 

Greek businessman and philanthropist Ioannis PapafisPortrait of Ioannis PapafisBorn1792Thessaloniki, Ottoman GreeceDied1886 (aged 93–94)MaltaNationalityGreekKnown fornational benefactor of Greece Ioannis Papafis or Giovanni di Niccolò Pappaffy (Greek: Ιωάννης Παπάφης; 1792 – 1886) was a Greek businessman and philanthropist, prominent for helping in the funding of the Greek War of Independence and in financing crucial sectors of independent Greece after its suc...

Alternate uniforms in Major League Baseball Not to be confused with City Connection or CityConnect WIFI. City Connect is a brand name for a line of alternate uniforms made by Nike, Inc. for Major League Baseball (MLB) teams. The uniforms feature different color schemes, typefaces, and graphic elements compared with the teams' typical home and away uniforms. The uniforms are designed to reflect the cultural aspects of each team's home city.[1] Of MLB's 30 teams, 20 have a City Connect ...

 

 

Thomas Lemar Thomas Lemar bermain untuk timnas prancis pada pergelaran piala dunia 2018Informasi pribadiNama lengkap Thomas LemarTanggal lahir 12 November 1995 (umur 28)Tempat lahir Baie-Mahault, GuadeloupeTinggi 172 m (564 ft 4 in)Posisi bermain GelandangInformasi klubKlub saat ini Atletico MadridNomor 11Karier senior*Tahun Tim Tampil (Gol)2011 –2015 Caen II 55 (4)2013–2015 Caen 32 (1)2015–2018 Monaco 90 (16)2018– Atlético Madrid 86 (5)Tim nasional2016 – Pranc...

 

 

Cet article est une ébauche concernant l’archéologie et l’Algérie. Vous pouvez partager vos connaissances en l’améliorant (comment ?) selon les recommandations des projets correspondants. Bir el-Ater Noms Nom arabe بئر العاتر Nom amazigh ⴱⵉⵔ ⵍⵄⴰⵜⴻⵔ Administration Pays Algérie Wilaya Tébessa Daïra Bir el-Ater Code postal 12001 Code ONS 1202 Démographie Population 77 727 hab. (2008[1]) Densité 51 hab./km2 Géographie Coordonnées 34...

X-Men vs. Street FightervideogiocoScreenshot di un combattimento: M. Bison vs MagnetoPiattaformaArcade, PlayStation, Sega Saturn, Windows Data di pubblicazione 1996 (arcade) 1997 (Saturn) 1998 (PlayStation) 2001 (Windows)[1] GenerePicchiaduro a incontri TemaX-Men OrigineGiappone SviluppoCapcom PubblicazioneCapcom, Virgin Interactive (PSX Europa), i-Dream Soft (PC) Modalità di giocoGiocatore singolo, multigiocatore locale (2) SupportoCD-ROM Requisiti di s...

 

 

Ця стаття потребує додаткових посилань на джерела для поліпшення її перевірності. Будь ласка, допоможіть удосконалити цю статтю, додавши посилання на надійні (авторитетні) джерела. Зверніться на сторінку обговорення за поясненнями та допоможіть виправити недоліки. Мат...

 

 

Species of fish Siganus punctatus Conservation status Least Concern  (IUCN 3.1)[1] Scientific classification Domain: Eukaryota Kingdom: Animalia Phylum: Chordata Class: Actinopterygii Order: Perciformes Family: Siganidae Genus: Siganus Species: S. punctatus Binomial name Siganus punctatus(Schneider & Forster, 1801) Synonyms[2] Amphacanthus punctatus Schneider & Forster, 1801 Teuthis punctata (Schneider & Forster, 1801) Teuthis punctatus (Schneider & F...

Major river in central United States For other uses, see Missouri (disambiguation). Missouri RiverPekitanoui,[1] Big Muddy,[2] Mighty Mo, Wide Missouri, Kícpaarukstiʾ,[3] Mnišoše[4][5]The Missouri River in MontanaMap of the Missouri River and its tributaries inNorth AmericaEtymologyThe Missouri tribe, whose name in turn meant people with wooden canoes[1]Native nameMnišóše (Lakota)[4][5]LocationCountryUnited StatesStat...

 

 

يفتقر محتوى هذه المقالة إلى الاستشهاد بمصادر. فضلاً، ساهم في تطوير هذه المقالة من خلال إضافة مصادر موثوق بها. أي معلومات غير موثقة يمكن التشكيك بها وإزالتها. (فبراير 2016) جمال الدين الأفغاني النوع تاريخي تأليف احمد رائف إخراج جلال غنيم بطولة محمود ياسين,إبراهيم الصلال,أشرف ع...

 

 

Ne doit pas être confondu avec Ézéchias. Pour les articles homonymes, voir Ézéchiel (homonymie). ÉzéchielLe prophète Ézéchiel, par Michel-Ange (1510)dans la chapelle Sixtine.FonctionProphèteJudaïsmeChristianismeIslamBiographieNaissance 622 av. J.-C.JérusalemDécès 571 av. J.-C.BabyloneNationalité Israélite de la tribu de LéviActivité Troisième des quatre grands prophètesPériode d'activité VIe siècle av. J.-C.Autres informationsÉtape de canonisation SaintFête 10 avril...

Language spoken in Uttar Pradesh, india Kannaujiकन्नौजीNative toIndiaRegionKannaujNative speakers9.5 million (2001)[1]Language familyIndo-European Indo-IranianIndo-AryanCentral ZoneWestern HindiKannaujiWriting systemDevanagariLanguage codesISO 639-3bjjGlottologkana1281Area depicting Kannauji speaking region in Uttar Pradesh, India. Kannauji is an Indo-Aryan language spoken in the Kannauj region of the Indian state of Uttar Pradesh. Kannauji is closely related t...

 

 

Paraguayans of African descent This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Afro-Paraguayans – news · newspapers · books · scholar · JSTOR (May 2020) (Learn how and when to remove this message) Ethnic group Afro-ParaguayansAfro-Mestizo Paraguayan working in Emboscada.Total populationabout 8 thousand[1...

 

 

American journalist (died 2019) For other uses, see Raúl Ruiz. Ruíz in 1983 Raul Ruíz (11 July 1940 – 13 June 2019[1][2]) was an American journalist, professor, and political activist for Chicano civil rights during the Chicano movement and for the Peace movement of the 1960s and '70s. Biography Ruiz was born in El Paso, Texas but moved to Los Angeles in his teen years. He attended California State University, Los Angeles (Cal State LA) where he earned both a bachelor's d...

Lukisan Departure of the Israelites (Keberangkatan orang Israel), oleh David Roberts, 1829. Peristiwa Keluar dari Mesir (Ibrani: יציאת מצרים, translit. Yeẓi’at Miẓrayim, har. 'Keluar dari Mesir'; bahasa Yunani: ἔξοδος, translit. eksodos, har. 'Jalan ke Luar') adalah sebuah kisah pembentuk bangsa Israel.[1] Kisahnya menceritakan perbudakan yang dialami oleh orang-orang Israel dan kaburnya mereka kemudian dari Mesir, tur...

 

 

American philosopher Avital RonellBorn (1952-04-15) 15 April 1952 (age 72)Prague, Czechoslovakia (now Czech Republic)Alma materRutgers Preparatory SchoolMiddlebury CollegePrinceton UniversityEra20th-/21st-century philosophyRegionWestern philosophySchoolContinental philosophy, critical theory, deconstruction, existentialism, hermeneutics, post-structuralismDoctoral advisorStanley CorngoldMain interestsAddiction,[1] deficiency,[2] dictation,[3] disappearance of...