Sparse dictionary learning (also known as sparse coding or SDL) is a representation learning method which aims to find a sparse representation of the input data in the form of a linear combination of basic elements as well as those basic elements themselves. These elements are called atoms, and they compose a dictionary. Atoms in the dictionary are not required to be orthogonal, and they may be an over-complete spanning set. This problem setup also allows the dimensionality of the signals being represented to be higher than any one of the signals being observed. These two properties lead to having seemingly redundant atoms that allow multiple representations of the same signal, but also provide an improvement in sparsity and flexibility of the representation.
One of the most important applications of sparse dictionary learning is in the field of compressed sensing or signal recovery. In compressed sensing, a high-dimensional signal can be recovered with only a few linear measurements, provided that the signal is sparse or near-sparse. Since not all signals satisfy this condition, it is crucial to find a sparse representation of that signal such as the wavelet transform or the directional gradient of a rasterized matrix. Once a matrix or a high-dimensional vector is transferred to a sparse space, different recovery algorithms like basis pursuit, CoSaMP,[1] or fast non-iterative algorithms[2] can be used to recover the signal.
One of the key principles of dictionary learning is that the dictionary has to be inferred from the input data. The emergence of sparse dictionary learning methods was stimulated by the fact that in signal processing, one typically wants to represent the input data using a minimal amount of components. Before this approach, the general practice was to use predefined dictionaries such as Fourier or wavelet transforms. However, in certain cases, a dictionary that is trained to fit the input data can significantly improve the sparsity, which has applications in data decomposition, compression, and analysis, and has been used in the fields of image denoising and classification, and video and audio processing. Sparsity and overcomplete dictionaries have immense applications in image compression, image fusion, and inpainting.
Problem statement
Given the input dataset we wish to find a dictionary and a representation such that both is minimized and the representations are sparse enough. This can be formulated as the following optimization problem:
, where ,
is required to constrain so that its atoms would not reach arbitrarily high values allowing for arbitrarily low (but non-zero) values of . controls the trade off between the sparsity and the minimization error.
The minimization problem above is not convex because of the ℓ0-"norm" and solving this problem is NP-hard.[3] In some cases L1-norm is known to ensure sparsity[4] and so the above becomes a convex optimization problem with respect to each of the variables and when the other one is fixed, but it is not jointly convex in .
Properties of the dictionary
The dictionary defined above can be "undercomplete" if or "overcomplete" in case with the latter being a typical assumption for a sparse dictionary learning problem. The case of a complete dictionary does not provide any improvement from a representational point of view and thus isn't considered.
Undercomplete dictionaries represent the setup in which the actual input data lies in a lower-dimensional space. This case is strongly related to dimensionality reduction and techniques like principal component analysis which require atoms to be orthogonal. The choice of these subspaces is crucial for efficient dimensionality reduction, but it is not trivial. And dimensionality reduction based on dictionary representation can be extended to address specific tasks such as data analysis or classification. However, their main downside is limiting the choice of atoms.
Overcomplete dictionaries, however, do not require the atoms to be orthogonal (they will never have a basis anyway) thus allowing for more flexible dictionaries and richer data representations.
An overcomplete dictionary which allows for sparse representation of signal can be a famous transform matrix (wavelets transform, fourier transform) or it can be formulated so that its elements are changed in such a way that it sparsely represents the given signal in a best way. Learned dictionaries are capable of giving sparser solutions as compared to predefined transform matrices.
Algorithms
As the optimization problem described above can be solved as a convex problem with respect to either dictionary or sparse coding while the other one of the two is fixed, most of the algorithms are based on the idea of iteratively updating one and then the other.
The problem of finding an optimal sparse coding with a given dictionary is known as sparse approximation (or sometimes just sparse coding problem). A number of algorithms have been developed to solve it (such as matching pursuit and LASSO) and are incorporated in the algorithms described below.
Method of optimal directions (MOD)
The method of optimal directions (or MOD) was one of the first methods introduced to tackle the sparse dictionary learning problem.[5] The core idea of it is to solve the minimization problem subject to the limited number of non-zero components of the representation vector:
Here, denotes the Frobenius norm. MOD alternates between getting the sparse coding using a method such as matching pursuit and updating the dictionary by computing the analytical solution of the problem given by where is a Moore-Penrose pseudoinverse. After this update is renormalized to fit the constraints and the new sparse coding is obtained again. The process is repeated until convergence (or until a sufficiently small residue).
MOD has proved to be a very efficient method for low-dimensional input data requiring just a few iterations to converge. However, due to the high complexity of the matrix-inversion operation, computing the pseudoinverse in high-dimensional cases is in many cases intractable. This shortcoming has inspired the development of other dictionary learning methods.
K-SVD is an algorithm that performs SVD at its core to update the atoms of the dictionary one by one and basically is a generalization of K-means. It enforces that each element of the input data is encoded by a linear combination of not more than elements in a way identical to the MOD approach:
This algorithm's essence is to first fix the dictionary, find the best possible under the above constraint (using Orthogonal Matching Pursuit) and then iteratively update the atoms of dictionary in the following manner:
The next steps of the algorithm include rank-1 approximation of the residual matrix , updating and enforcing the sparsity of after the update. This algorithm is considered to be standard for dictionary learning and is used in a variety of applications. However, it shares weaknesses with MOD being efficient only for signals with relatively low dimensionality and having the possibility for being stuck at local minima.
One can also apply a widespread stochastic gradient descent method with iterative projection to solve this problem.[6] The idea of this method is to update the dictionary using the first order stochastic gradient and project it on the constraint set . The step that occurs at i-th iteration is described by this expression:
, where is a random subset of and is a gradient step.
Lagrange dual method
An algorithm based on solving a dual Lagrangian problem provides an efficient way to solve for the dictionary having no complications induced by the sparsity function.[7] Consider the following Lagrangian:
, where is a constraint on the norm of the atoms and are the so-called dual variables forming the diagonal matrix .
We can then provide an analytical expression for the Lagrange dual after minimization over :
.
After applying one of the optimization methods to the value of the dual (such as Newton's method or conjugate gradient) we get the value of :
Solving this problem is less computational hard because the amount of dual variables is a lot of times much less than the amount of variables in the primal problem.
Parametric training methods are aimed to incorporate the best of both worlds — the realm of analytically constructed dictionaries and the learned ones.[9] This allows to construct more powerful generalized dictionaries that can potentially be applied to the cases of arbitrary-sized signals. Notable approaches include:
Translation-invariant dictionaries.[10] These dictionaries are composed by the translations of the atoms originating from the dictionary constructed for a finite-size signal patch. This allows the resulting dictionary to provide a representation for the arbitrary-sized signal.
Multiscale dictionaries.[11] This method focuses on constructing a dictionary that is composed of differently scaled dictionaries to improve sparsity.
Sparse dictionaries.[12] This method focuses on not only providing a sparse representation but also constructing a sparse dictionary which is enforced by the expression where is some pre-defined analytical dictionary with desirable properties such as fast computation and is a sparse matrix. Such formulation allows to directly combine the fast implementation of analytical dictionaries with the flexibility of sparse approaches.
Many common approaches to sparse dictionary learning rely on the fact that the whole input data (or at least a large enough training dataset) is available for the algorithm. However, this might not be the case in the real-world scenario as the size of the input data might be too big to fit it into memory. The other case where this assumption can not be made is when the input data comes in a form of a stream. Such cases lie in the field of study of online learning which essentially suggests iteratively updating the model upon the new data points becoming available.
A dictionary can be learned in an online manner the following way:[13]
This method allows us to gradually update the dictionary as new data becomes available for sparse representation learning and helps drastically reduce the amount of memory needed to store the dataset (which often has a huge size).
Applications
The dictionary learning framework, namely the linear decomposition of an input signal using a few basis elements learned from data itself, has led to state-of-art[citation needed] results in various image and video processing tasks. This technique can be applied to classification problems in a way that if we have built specific dictionaries for each class, the input signal can be classified by finding the dictionary corresponding to the sparsest representation.
It also has properties that are useful for signal denoising since usually one can learn a dictionary to represent the meaningful part of the input signal in a sparse way but the noise in the input will have a much less sparse representation.[14]
Sparse dictionary learning has been successfully applied to various image, video and audio processing tasks as well as to texture synthesis[15] and unsupervised clustering.[16] In evaluations with the Bag-of-Words model,[17][18] sparse coding was found empirically to outperform other coding approaches on the object category recognition tasks.
Dictionary learning is used to analyse medical signals in detail. Such medical signals include those from electroencephalography (EEG), electrocardiography (ECG), magnetic resonance imaging (MRI), functional MRI (fMRI), continuous glucose monitors [19] and ultrasound computer tomography (USCT), where different assumptions are used to analyze each signal.
^Needell, D.; Tropp, J.A. (2009). "CoSaMP: Iterative signal recovery from incomplete and inaccurate samples". Applied and Computational Harmonic Analysis. 26 (3): 301–321. arXiv:0803.2392. doi:10.1016/j.acha.2008.07.002.
^Donoho, David L. (2006-06-01). "For most large underdetermined systems of linear equations the minimal 𝓁1-norm solution is also the sparsest solution". Communications on Pure and Applied Mathematics. 59 (6): 797–829. doi:10.1002/cpa.20132. ISSN1097-0312. S2CID8510060.
^Engan, K.; Aase, S.O.; Hakon Husoy, J. (1999-01-01). "Method of optimal directions for frame design". 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258). Vol. 5. pp. 2443–2446 vol.5. doi:10.1109/ICASSP.1999.760624. ISBN978-0-7803-5041-0. S2CID33097614.
Gereja ScientologyBangunan Scientology di Los Angeles, CaliforniaTanggal pendirian1954Kantor pusatGold BaseLokasiRiverside County, CaliforniaKetua Pusat Teknologi KeagamaanDavid MiscavigeSitus webwww.scientology.org Gereja Scientology adalah organisasi aliran kepercayaan baru yang muncul pada tahun 1954. Organisasi ini didirikan oleh Laffayette Ronald Hubbard. Aliran kepercayaan ini memiliki banyak pengikut dari kalangan selebritas Hollywood, seperti John Travolta, Jason Lee, Tom Cruise (kelu...
Alsco Uniforms 300SirkuitLas Vegas Motor SpeedwayLas Vegas, NevadaSponsorAlsco[1]Lomba pertama1997Jarak tempuh300 mil (482,803 km)Lap200Tahap 1: 45Tahap 2: 45Tahap akhir: 110Nama sebelumnyaLas Vegas 300 (1997) Sam’s Town Las Vegas 300 (1998) Sam's Town 300 (1999-2013) Boyd Gaming 300 (2014-2020) Alsco Uniforms 300 adalah balapan mobil NASCAR Xfinity Series yang diadakan setiap tahun di Las Vegas Motor Speedway di Las Vegas, Nevada. Lomba ini pertama kali terjadi pada tahun 1997...
Manufacturer of business jets LearjetFounded1962[1]FounderBill LearDefunct2021FateAcquired by Bombardier Aerospace in 1990, production ceased in 2021HeadquartersWichita, KansasParentBombardier Aviation (1990–2021)SubsidiariesBrantly Helicopter Corporation (1966–1969) Learjet was a manufacturer of business jets for civilian and military use based in Wichita, Kansas, United States. Founded in the late 1950s by William Powell Lear as Swiss American Aviation Corporation, it became a s...
Artikel ini sebatang kara, artinya tidak ada artikel lain yang memiliki pranala balik ke halaman ini.Bantulah menambah pranala ke artikel ini dari artikel yang berhubungan atau coba peralatan pencari pranala.Tag ini diberikan pada Februari 2023. Tympanocryptis tolleyi TaksonomiKerajaanAnimaliaFilumChordataKelasReptiliaOrdoSquamataFamiliAgamidaeGenusTympanocryptisSpesiesTympanocryptis tolleyi DistribusiEndemikAustralia lbs Tympanocryptis tolleyi adalah sebuah spesies agama yang ditemukan di Au...
Jean LaurentJuan Laurent, dari La Ilustración Nacional (1887)Lahir(1816-07-23)23 Juli 1816GarchizyMeninggal24 November 1886(1886-11-24) (umur 70)MadridTempat pemakamanCementerio de la AlmudenaNama lainJuan Laurent MinierPekerjaanFotografer Madrid, Plaza de toros (1874) Torre Nueva Zaragoza, (skt. 1875). Menara ini dihancurkan pada 1892–1893. Bangunan ini adalah menara jam, dibangun dari batu bata dengan gaya Mudéjar pada awal abad keenam belas Jean Laurent atau, dalam bahasa Sp...
Musical based on the novel of the same name The NotebookBroadway Promotional PosterMusicIngrid MichaelsonLyricsIngrid MichaelsonBookBekah BrunstetterBasisThe Notebook by Nicholas SparksProductions2022 Chicago 2024 Broadway The Notebook is a musical with music and lyrics by Ingrid Michaelson and a book by Bekah Brunstetter. It is based on the 1996 novel of the same name, written by Nicholas Sparks. The musical opened on Broadway on March 14, 2024 at the Gerald Schoenfeld Theatre.[1] Su...
Private liberal arts college in Williamstown, Massachusetts This article is about the college in Williamstown, Massachusetts. For other uses, see Williams College (disambiguation). Williams CollegeMottoE liberalitate E. Williams, armigeri (Latin)Motto in EnglishThrough the Generosity of E. Williams, EsquireTypePrivate liberal arts collegeEstablished1793; 231 years ago (1793)AccreditationNECHEAcademic affiliationsAICUMAnnapolis GroupCLACCOFHEOberlin GroupSpace-grantEndow...
Chu Lai Air Base Part of Chu Lai Base AreaChu Lai, Quảng Nam Province in VietnamChu Lai Air BaseCoordinates15°27′18″N 108°41′24″E / 15.455°N 108.69°E / 15.455; 108.69 (Chu Lai Base Area)Site informationOperatorUnited States Marine Corps (USMC)United States Army (US ARMY)Site historyBuilt1965 (1965)In use1965-1972 (1972)Battles/warsVietnam WarOperation StarliteAirfield informationElevation1,759 feet (536 m) AMSL Runw...
American short television program This article is about the American version. For the British version, see As the Bell Rings (British TV series). For the Australian version, see As the Bell Rings (Australian TV series). As the Bell RingsCreated byChris ThompsonStarring Demi Lovato Tony Oller Carlson Young Gabriella Rodriguez Collin Cole Seth Ginsberg Lindsey Black Country of originUnited StatesOriginal languageEnglishNo. of seasons2No. of episodes36 (list of episodes)ProductionCamera setupMul...
Japanese professional wrestler This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages) This biography of a living person needs additional citations for verification. Please help by adding reliable sources. Contentious material about living persons that is unsourced or poorly sourced must be removed immediately from the article and its talk page, especially if potentially libelous.Find sources:...
County in Ireland This article is about County Wexford in Ireland. For the county in Michigan, see Wexford County, Michigan. County in Leinster, IrelandCounty Wexford Contae Loch GarmanCounty Coat of armsNickname: The Model CountyMotto(s): Exemplar Hiberniae (Latin)An example to IrelandSampla na hÉireannLocation in IrelandCoordinates: 52°30′N 6°45′W / 52.5°N 6.75°W / 52.5; -6.75CountryIrelandProvinceLeinsterRegionSouthernEstablished1210 ...
D. Affandi Karier militerDinas/cabang TNI Angkatan DaratMasa dinas1945-1966Pangkat Brigadir Jenderal TNINRP11860SatuanInfanteriSunting kotak info • L • B Brigadir Jenderal TNI (Purn.) D. Affandi adalah seorang perwira tinggi angkatan darat dan pernah menjabat Komandan Pusat Pendidikan Infanteri. Karier Pada tahun 1949, ketika masa-masa terakhir dari Revolusi Nasional Indonesia, Affandi ditunjuk sebagai Kepala Bagian Pertahanan pada salah satu Sub Territorium Commando (STC) ...
Artikel ini perlu diwikifikasi agar memenuhi standar kualitas Wikipedia. Anda dapat memberikan bantuan berupa penambahan pranala dalam, atau dengan merapikan tata letak dari artikel ini. Untuk keterangan lebih lanjut, klik [tampil] di bagian kanan. Mengganti markah HTML dengan markah wiki bila dimungkinkan. Tambahkan pranala wiki. Bila dirasa perlu, buatlah pautan ke artikel wiki lainnya dengan cara menambahkan [[ dan ]] pada kata yang bersangkutan (lihat WP:LINK untuk keterangan lebih lanjut...
Home video game console by Nintendo Not to be confused with WIIU. Wii UA Wii U (right) and Wii U GamePadCodenameProject Café[1]DeveloperNintendo IRD, NTDManufacturerNintendo, Foxconn, Mitsumi[2]TypeHome video game consoleGenerationEighthRelease dateNA: November 18, 2012PAL: November 30, 2012JP: December 8, 2012Introductory priceUS$299/¥26,250 (Basic Set) (Deluxe/Premium Set (later bundles))US$349/¥31,500 (Deluxe/Premium Set)DiscontinuedWW: January 31, 2017Units soldWorldwid...
Hinduism by country Africa Algeria Angola Benin Botswana Burkina Faso Burundi Cameroon Cape Verde Central African Republic Chad Comoros Democratic Republic of the Congo Republic of the Congo Djibouti Egypt Equatorial Guinea Eritrea Eswatini Ethiopia Gabon Gambia Ghana Guinea Guinea-Bissau Ivory Coast Kenya Lesotho Liberia Libya Madagascar Malawi Mali Mauritania Mauritius Morocco Western Sahara Mozambique Namibia Niger Nigeria Rwanda São Tomé and Príncipe Senegal Seychelles Sierra Leone So...
Armenian amateur boxer Hovhannes BachkovBachkov at the 2016 OlympicsBorn (1992-12-02) 2 December 1992 (age 31)Gyumri, ArmeniaStatisticsWeight classLightweight, Light welterweightWeight(s)64 kg (141 lb)Height173 cm (5 ft 8 in)[1] Medal record Men's amateur boxing Representing Armenia Olympic Games 2020 Tokyo Lightweight World Championships 2017 Hamburg Light welterweight 2019 Yekaterinburg Light welterweight 2021 Belgrade Light welterweight 2023 Tash...
Questa voce sull'argomento centri abitati del Baden-Württemberg è solo un abbozzo. Contribuisci a migliorarla secondo le convenzioni di Wikipedia. Korbcomune Korb – Veduta LocalizzazioneStato Germania Land Baden-Württemberg DistrettoStoccarda CircondarioRems-Murr TerritorioCoordinate48°50′30″N 9°21′40″E48°50′30″N, 9°21′40″E (Korb) Altitudine296 m s.l.m. Superficie8,45 km² Abitanti10 385[1] (2005-12-31) Densità1 228,9...