Self-supervised learning

Self-supervised learning (SSL) is a paradigm in machine learning in which a model is trained on a task that uses the data itself to generate supervisory signals, rather than relying on external labels provided by humans. In the context of neural networks, self-supervised learning aims to leverage inherent structures or relationships within the input data to create meaningful training signals. SSL tasks are designed so that solving them requires capturing essential features or relationships in the data. The input data is typically augmented or transformed in a way that creates pairs of related samples: one sample serves as the input, and the other is used to formulate the supervisory signal. The augmentation can involve introducing noise, cropping, rotation, or other transformations. Self-supervised learning more closely imitates the way humans learn to classify objects.[1]
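
As an illustration of how a transformation can supply its own supervisory signal, the following minimal sketch rotates a small grid by a random number of quarter turns and uses that number as a pseudo-label. The function names, the toy 2×2 "image", and the choice of rotation as the pretext task are assumptions made for this example, not part of any particular published method.

```python
import random

def rotate90(grid):
    """Rotate a 2-D grid (a list of rows) 90 degrees clockwise."""
    return [list(row) for row in zip(*grid[::-1])]

def make_rotation_example(grid, rng):
    """Return (rotated_grid, pseudo_label); the label is the number of quarter turns."""
    k = rng.randrange(4)  # 0, 1, 2 or 3 quarter turns
    rotated = [list(row) for row in grid]
    for _ in range(k):
        rotated = rotate90(rotated)
    return rotated, k

# Toy "image": no human label is needed, the transformation provides the target.
image = [[1, 2],
         [3, 4]]
rng = random.Random(0)
rotated, label = make_rotation_example(image, rng)
print(rotated, label)
```

A model trained to predict the label from the rotated grid has to pick up on the orientation of the content, which is the kind of structure such a pretext task is meant to expose.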

The typical SSL method is based on an artificial neural network or another model such as a decision list.[2] The model learns in two steps. First, it is trained on an auxiliary or pretext classification task using pseudo-labels, which helps to initialize the model parameters.[3][4] Second, the actual task is performed with supervised or unsupervised learning.[5][6][7] Other auxiliary tasks involve pattern completion from masked input patterns (for example, silent pauses in speech or image portions masked in black).
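
Pattern completion from masked input can be sketched as follows. The example hides a random subset of tokens and records the hidden originals as prediction targets. The `[MASK]` placeholder, the function name, and the masking probability are hypothetical choices for illustration only.

```python
import random

MASK = "[MASK]"  # hypothetical placeholder token used for this sketch

def mask_tokens(tokens, mask_prob, rng):
    """Return (masked_tokens, targets); targets maps each masked position to its original token."""
    masked = []
    targets = {}
    for i, tok in enumerate(tokens):
        if rng.random() < mask_prob:
            masked.append(MASK)   # the model only sees the placeholder here
            targets[i] = tok      # the original token becomes the training target
        else:
            masked.append(tok)
    return masked, targets

tokens = ["the", "cat", "sat", "on", "the", "mat"]
masked, targets = mask_tokens(tokens, mask_prob=0.5, rng=random.Random(0))
print(masked)
print(targets)
```

The same idea carries over to other modalities by replacing tokens with audio frames or image patches.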

Self-supervised learning has produced promising results in recent years and has found practical application in audio processing, where it is used by Facebook and others for speech recognition.[8]

Types

Autoassociative self-supervised learning

Autoassociative self-supervised learning is a specific category of self-supervised learning where a neural network is trained to reproduce or reconstruct its own input data.[9] In other words, the model is tasked with learning a representation of the data that captures its essential features or structure, allowing it to regenerate the original input.

The term "autoassociative" comes from the fact that the model is essentially associating the input data with itself. This is often achieved using autoencoders, which are a type of neural network architecture used for representation learning. Autoencoders consist of an encoder network that maps the input data to a lower-dimensional representation (latent space), and a decoder network that reconstructs the input data from this representation.

The training process involves presenting the model with input data and requiring it to reconstruct the same data as closely as possible. The loss function used during training typically penalizes the difference between the original input and the reconstructed output. By minimizing this reconstruction error, the autoencoder learns a meaningful representation of the data in its latent space.
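
The training loop described above can be sketched with a deliberately tiny linear autoencoder in plain Python. The 2-D toy data, the 1-D latent code, the initial weights, the learning rate, and the step count are all assumptions chosen to keep the example short; practical autoencoders use nonlinear layers and an automatic differentiation library.

```python
def reconstruct(x, enc, dec):
    """Encode a 2-D input to a 1-D latent code, then decode it back to 2-D."""
    z = sum(w * xi for w, xi in zip(enc, x))  # encoder
    return z, [w * z for w in dec]            # decoder

def reconstruction_loss(data, enc, dec):
    """Mean squared reconstruction error over the data set."""
    total = 0.0
    for x in data:
        _, x_hat = reconstruct(x, enc, dec)
        total += sum((h - xi) ** 2 for h, xi in zip(x_hat, x))
    return total / len(data)

def train(data, enc, dec, lr=0.01, steps=200):
    """Plain gradient descent on the reconstruction loss; returns weights and loss history."""
    enc, dec = list(enc), list(dec)
    n = len(data)
    losses = [reconstruction_loss(data, enc, dec)]
    for _ in range(steps):
        g_enc = [0.0] * len(enc)
        g_dec = [0.0] * len(dec)
        for x in data:
            z, x_hat = reconstruct(x, enc, dec)
            residual = [h - xi for h, xi in zip(x_hat, x)]
            # Error signal propagated back from the output to the latent code.
            back = sum(r * w for r, w in zip(residual, dec))
            for j, r in enumerate(residual):
                g_dec[j] += 2 * r * z / n
            for k, xk in enumerate(x):
                g_enc[k] += 2 * back * xk / n
        enc = [w - lr * g for w, g in zip(enc, g_enc)]
        dec = [w - lr * g for w, g in zip(dec, g_dec)]
        losses.append(reconstruction_loss(data, enc, dec))
    return enc, dec, losses

# Toy data lying on the line x2 = 2 * x1, so a single latent dimension suffices.
data = [[1.0, 2.0], [2.0, 4.0], [-1.0, -2.0], [0.5, 1.0]]
enc, dec, losses = train(data, enc=[0.1, 0.1], dec=[0.1, 0.1])
print(losses[0], losses[-1])
```

Because the toy data is one-dimensional in structure, the single latent value is enough to reconstruct each point, and the reconstruction error falls as training proceeds.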

Contrastive self-supervised learning

For a binary classification task, training data can be divided into positive examples and negative examples. Positive examples are those that match the target. For example, when learning to identify birds, the positive training data are the pictures that contain birds. Negative examples are those that do not.[10] Contrastive self-supervised learning uses both positive and negative examples. The loss function in contrastive learning minimizes the distance between positive sample pairs while maximizing the distance between negative sample pairs.[10]
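
One common way to express this objective is a pairwise margin loss, sketched below. The specific form (squared distance for positive pairs, squared hinge on a margin for negative pairs), the margin value, and the function names are assumptions for illustration; other contrastive losses exist and the text above does not prescribe one.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def contrastive_loss(a, b, is_positive, margin=1.0):
    """Pull positive pairs together; push negative pairs at least `margin` apart."""
    d = euclidean(a, b)
    if is_positive:
        return d ** 2                  # any remaining distance is penalised
    return max(0.0, margin - d) ** 2   # only negatives closer than the margin are penalised

print(contrastive_loss([0.0, 0.0], [3.0, 4.0], is_positive=True))
print(contrastive_loss([0.0, 0.0], [0.2, 0.0], is_positive=False))
```

Under this formulation a negative pair that is already farther apart than the margin contributes nothing, so training effort concentrates on negatives that are still confusable with positives.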

Non-contrastive self-supervised learning

Non-contrastive self-supervised learning (NCSSL) uses only positive examples. Counterintuitively, NCSSL converges on a useful local minimum rather than reaching a trivial solution with zero loss. In the binary classification example, such a trivial solution would be to classify every example as positive. Effective NCSSL requires an extra predictor on the online side, with no gradients back-propagated through the target side.[10]
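
A loss of the kind used in BYOL-style methods can be sketched as the squared distance between unit-normalized vectors, which equals 2 minus twice their cosine similarity. In the sketch below, `prediction` stands for the output of the online-side predictor and `target` for the output of the target side; both names are assumptions. Plain Python has no automatic differentiation, so the rule that the target receives no gradient appears only as a comment.

```python
import math

def normalize(v):
    """Scale a non-zero vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def noncontrastive_loss(prediction, target):
    """Squared distance between unit vectors, i.e. 2 - 2 * cosine similarity."""
    p = normalize(prediction)
    # In a real training setup the target would be treated as a constant here
    # (no gradient flows back through the target side).
    t = normalize(target)
    return sum((a - b) ** 2 for a, b in zip(p, t))

print(noncontrastive_loss([1.0, 0.0], [2.0, 0.0]))
print(noncontrastive_loss([1.0, 0.0], [0.0, 1.0]))
```

Only positive pairs enter this loss; no negative examples are compared.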

Comparison with other forms of machine learning

SSL belongs to supervised learning methods insofar as the goal is to generate a classified output from the input. At the same time, however, it does not require the explicit use of labeled input-output pairs. Instead, correlations, metadata embedded in the data, or domain knowledge present in the input are implicitly and autonomously extracted from the data. These supervisory signals, generated from the data, can then be used for training.[1]

SSL is similar to unsupervised learning in that it does not require labels in the sample data. Unlike unsupervised learning, however, learning is not done using inherent data structures.

Semi-supervised learning combines supervised and unsupervised learning, requiring only a small portion of the learning data to be labeled.[4]

In transfer learning, a model designed for one task is reused on a different task.[11]

Training an autoencoder intrinsically constitutes a self-supervised process, because the output pattern needs to become an optimal reconstruction of the input pattern itself. However, in current jargon, the term 'self-supervised' has become associated with classification tasks that are based on a pretext-task training setup. This involves the (human) design of such pretext task(s), unlike the case of fully self-contained autoencoder training.[9]

In reinforcement learning, self-supervised learning from a combination of losses can create abstract representations in which only the most important information about the state is kept, in compressed form.[12]

Examples

Self-supervised learning is particularly suitable for speech recognition. For example, Facebook developed wav2vec, a self-supervised algorithm, to perform speech recognition using two deep convolutional neural networks that build on each other.[8]

Google's Bidirectional Encoder Representations from Transformers (BERT) model is used to better understand the context of search queries.[13]

OpenAI's GPT-3 is an autoregressive language model that can be used in language processing. It can be used to translate texts or answer questions, among other things.[14]

Bootstrap Your Own Latent (BYOL) is an NCSSL method that produced excellent results on ImageNet and on transfer and semi-supervised benchmarks.[15]

The Yarowsky algorithm is an example of self-supervised learning in natural language processing. From a small number of labeled examples, it learns to predict which word sense of a polysemous word is being used at a given point in text.

DirectPred is an NCSSL method that sets the predictor weights directly instead of learning them via gradient updates.[10]

Self-GenomeNet is an example of self-supervised learning in genomics.[16]

References

  1. ^ a b Bouchard, Louis (25 November 2020). "What is Self-Supervised Learning? | Will machines ever be able to learn like humans?". Medium. Retrieved 9 June 2021.
  2. ^ Yarowsky, David (1995). "Unsupervised Word Sense Disambiguation Rivaling Supervised Methods". Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics. Cambridge, MA: Association for Computational Linguistics: 189–196. doi:10.3115/981658.981684. Retrieved 1 November 2022.
  3. ^ Doersch, Carl; Zisserman, Andrew (October 2017). "Multi-task Self-Supervised Visual Learning". 2017 IEEE International Conference on Computer Vision (ICCV). IEEE. pp. 2070–2079. arXiv:1708.07860. doi:10.1109/iccv.2017.226. ISBN 978-1-5386-1032-9. S2CID 473729.
  4. ^ a b Beyer, Lucas; Zhai, Xiaohua; Oliver, Avital; Kolesnikov, Alexander (October 2019). "S4L: Self-Supervised Semi-Supervised Learning". 2019 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE. pp. 1476–1485. arXiv:1905.03670. doi:10.1109/iccv.2019.00156. ISBN 978-1-7281-4803-8. S2CID 167209887.
  5. ^ Doersch, Carl; Gupta, Abhinav; Efros, Alexei A. (December 2015). "Unsupervised Visual Representation Learning by Context Prediction". 2015 IEEE International Conference on Computer Vision (ICCV). IEEE. pp. 1422–1430. arXiv:1505.05192. doi:10.1109/iccv.2015.167. ISBN 978-1-4673-8391-2. S2CID 9062671.
  6. ^ Zheng, Xin; Wang, Yong; Wang, Guoyou; Liu, Jianguo (April 2018). "Fast and robust segmentation of white blood cell images by self-supervised learning". Micron. 107: 55–71. doi:10.1016/j.micron.2018.01.010. ISSN 0968-4328. PMID 29425969. S2CID 3796689.
  7. ^ Gidaris, Spyros; Bursuc, Andrei; Komodakis, Nikos; Pérez, Patrick; Cord, Matthieu (October 2019). "Boosting Few-Shot Visual Learning with Self-Supervision". 2019 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE. pp. 8058–8067. arXiv:1906.05186. doi:10.1109/iccv.2019.00815. ISBN 978-1-7281-4803-8. S2CID 186206588.
  8. ^ a b "Wav2vec: State-of-the-art speech recognition through self-supervision". ai.facebook.com. Retrieved 9 June 2021.
  9. ^ a b Kramer, Mark A. (1991). "Nonlinear principal component analysis using autoassociative neural networks" (PDF). AIChE Journal. 37 (2): 233–243. Bibcode:1991AIChE..37..233K. doi:10.1002/aic.690370209.
  10. ^ a b c d "Demystifying a key self-supervised learning technique: Non-contrastive learning". ai.facebook.com. Retrieved 5 October 2021.
  11. ^ Littwin, Etai; Wolf, Lior (June 2016). "The Multiverse Loss for Robust Transfer Learning". 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. pp. 3957–3966. arXiv:1511.09033. doi:10.1109/cvpr.2016.429. ISBN 978-1-4673-8851-1. S2CID 6517610.
  12. ^ Francois-Lavet, Vincent; Bengio, Yoshua; Precup, Doina; Pineau, Joelle (2019). "Combined Reinforcement Learning via Abstract Representations". Proceedings of the AAAI Conference on Artificial Intelligence. arXiv:1809.04506.
  13. ^ "Open Sourcing BERT: State-of-the-Art Pre-training for Natural Language Processing". Google AI Blog. 2 November 2018. Retrieved 9 June 2021.
  14. ^ Wilcox, Ethan; Qian, Peng; Futrell, Richard; Kohita, Ryosuke; Levy, Roger; Ballesteros, Miguel (2020). "Structural Supervision Improves Few-Shot Learning and Syntactic Generalization in Neural Language Models". Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Stroudsburg, PA, USA: Association for Computational Linguistics. pp. 4640–4652. arXiv:2010.05725. doi:10.18653/v1/2020.emnlp-main.375. S2CID 222291675.
  15. ^ Grill, Jean-Bastien; Strub, Florian; Altché, Florent; Tallec, Corentin; Richemond, Pierre H.; Buchatskaya, Elena; Doersch, Carl; Pires, Bernardo Avila; Guo, Zhaohan Daniel; Azar, Mohammad Gheshlaghi; Piot, Bilal (10 September 2020). "Bootstrap your own latent: A new approach to self-supervised Learning". arXiv:2006.07733 [cs.LG].
  16. ^ Gündüz, Hüseyin Anil; Binder, Martin; To, Xiao-Yin; Mreches, René; Bischl, Bernd; McHardy, Alice C.; Münch, Philipp C.; Rezaei, Mina (11 September 2023). "A self-supervised deep learning method for data-efficient training in genomics". Communications Biology. 6 (1): 928. doi:10.1038/s42003-023-05310-2. ISSN 2399-3642. PMC 10495322. PMID 37696966.

Further reading

  • Balestriero, Randall; Ibrahim, Mark; Sobal, Vlad; Morcos, Ari; Shekhar, Shashank; Goldstein, Tom; Bordes, Florian; Bardes, Adrien; Mialon, Gregoire; Tian, Yuandong; Schwarzschild, Avi; Wilson, Andrew Gordon; Geiping, Jonas; Garrido, Quentin; Fernandez, Pierre (24 April 2023). "A Cookbook of Self-Supervised Learning". arXiv:2304.12210 [cs.LG].
