In functional analysis, a reproducing kernel Hilbert space (RKHS) is a Hilbert space of functions in which point evaluation is a continuous linear functional. Specifically, a Hilbert space $H$ of functions from a set $X$ (to $\mathbb{R}$ or $\mathbb{C}$) is an RKHS if, for each $x \in X$, there exists a function $K_x \in H$ such that for all $f \in H$,
$$\langle f, K_x \rangle = f(x).$$
The function $K_x$ is called the reproducing kernel, and it reproduces the value of $f$ at $x$ via the inner product.
An immediate consequence of this property is that convergence in norm implies uniform convergence on any subset of $X$ on which $\|K_x\|$ is bounded. However, the converse does not necessarily hold. Often the set $X$ carries a topology, and $\|K_x\|$ depends continuously on $x \in X$, in which case convergence in norm implies uniform convergence on compact subsets of $X$.
It is not entirely straightforward to construct natural, non-trivial examples of Hilbert spaces that are not RKHSs.[1] Some examples, however, have been found.[2][3]
While, formally, $L^2$ spaces are defined as Hilbert spaces of equivalence classes of functions, this definition can trivially be extended to a Hilbert space of functions by choosing a (total) function as a representative for each equivalence class. However, no choice of representatives can make this space an RKHS ($K_0$ would need to be the non-existent Dirac delta function). Nevertheless, there are RKHSs in which the norm is an $L^2$-norm, such as the space of band-limited functions (see the example below).
An RKHS is associated with a kernel that reproduces every function in the space in the sense that for every $x$ in the set on which the functions are defined, "evaluation at $x$" can be performed by taking an inner product with a function determined by the kernel. Such a reproducing kernel exists if and only if every evaluation functional is continuous.
These spaces have wide applications, including complex analysis, harmonic analysis, and quantum mechanics. Reproducing kernel Hilbert spaces are particularly important in the field of statistical learning theory because of the celebrated representer theorem which states that every function in an RKHS that minimises an empirical risk functional can be written as a linear combination of the kernel function evaluated at the training points. This is a practically useful result as it effectively simplifies the empirical risk minimization problem from an infinite dimensional to a finite dimensional optimization problem.
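As an illustration of the representer theorem, the following is a minimal sketch of kernel ridge regression, assuming a Gaussian kernel and a squared-error risk with regularization weight `lam` (both are choices made here for illustration, as are all function names). The minimiser is sought directly in the form $f(x) = \sum_i \alpha_i K(x, x_i)$, so only the $n$ coefficients $\alpha_i$ need to be computed.

```python
import numpy as np

def gaussian_kernel(a, b, gamma=20.0):
    """Gram matrix K[i, j] = exp(-gamma * (a[i] - b[j])**2) for 1-D inputs."""
    diff = a[:, None] - b[None, :]
    return np.exp(-gamma * diff ** 2)

def fit_kernel_ridge(x_train, y_train, lam=1e-3, gamma=20.0):
    """Minimise (1/n) * sum (f(x_i) - y_i)^2 + lam * ||f||_H^2.

    By the representer theorem the minimiser is a kernel expansion over the
    training points, with coefficients solving (K + lam * n * I) alpha = y.
    """
    n = len(x_train)
    K = gaussian_kernel(x_train, x_train, gamma)
    return np.linalg.solve(K + lam * n * np.eye(n), y_train)

def predict(x_new, x_train, alpha, gamma=20.0):
    """Evaluate f(x) = sum_i alpha[i] * K(x, x_train[i])."""
    return gaussian_kernel(x_new, x_train, gamma) @ alpha

x_train = np.linspace(0.0, 1.0, 20)
y_train = np.sin(2.0 * np.pi * x_train)
alpha = fit_kernel_ridge(x_train, y_train)
y_fit = predict(x_train, x_train, alpha)
```

The infinite-dimensional search over the RKHS has been replaced by a single $20 \times 20$ linear solve.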
For ease of understanding, we provide the framework for real-valued Hilbert spaces. The theory can be easily extended to spaces of complex-valued functions and hence include the many important examples of reproducing kernel Hilbert spaces that are spaces of analytic functions.[5]
Definition
Let $X$ be an arbitrary set and $H$ a Hilbert space of real-valued functions on $X$, equipped with pointwise addition and pointwise scalar multiplication. The evaluation functional over the Hilbert space of functions $H$ is a linear functional that evaluates each function at a point $x$,
$$L_x : f \mapsto f(x) \quad \forall f \in H.$$
We say that $H$ is a reproducing kernel Hilbert space if, for all $x$ in $X$, $L_x$ is continuous at every $f$ in $H$ or, equivalently, if $L_x$ is a bounded operator on $H$, i.e. there exists some $M_x > 0$ such that
$$|L_x(f)| := |f(x)| \le M_x \, \|f\|_H \quad \forall f \in H. \tag{1}$$
Although $M_x < \infty$ is assumed for all $x \in X$, it might still be the case that $\sup_x M_x = \infty$.
While property (1) is the weakest condition that ensures both the existence of an inner product and the evaluation of every function in $H$ at every point in the domain, it does not lend itself to easy application in practice. A more intuitive definition of the RKHS can be obtained by observing that this property guarantees that the evaluation functional can be represented by taking the inner product of $f$ with a function $K_x$ in $H$. This function is the so-called reproducing kernel for the Hilbert space $H$, from which the RKHS takes its name. More formally, the Riesz representation theorem implies that for all $x$ in $X$ there exists a unique element $K_x$ of $H$ with the reproducing property,
$$f(x) = L_x(f) = \langle f, K_x \rangle_H \quad \forall f \in H. \tag{2}$$
Since $K_x$ is itself a function defined on $X$ with values in the field $\mathbb{R}$ (or $\mathbb{C}$ in the case of complex Hilbert spaces) and as $K_x$ is in $H$ we have that
$$K_x(y) = L_y(K_x) = \langle K_x, K_y \rangle_H,$$
where $K_y \in H$ is the element in $H$ associated to $L_y$.
This allows us to define the reproducing kernel of $H$ as a function $K : X \times X \to \mathbb{R}$ (or $\mathbb{C}$ in the complex case) by
$$K(x, y) = \langle K_x, K_y \rangle_H.$$
From this definition it is easy to see that $K : X \times X \to \mathbb{R}$ (or $\mathbb{C}$ in the complex case) is both symmetric (resp. conjugate symmetric) and positive definite, i.e.
$$\sum_{i,j=1}^{n} c_i c_j K(x_i, x_j) = \sum_{i=1}^{n} c_i \left\langle K_{x_i}, \sum_{j=1}^{n} c_j K_{x_j} \right\rangle_H = \left\| \sum_{i=1}^{n} c_i K_{x_i} \right\|_H^2 \ge 0$$
for every $n \in \mathbb{N}$, $x_1, \dots, x_n \in X$, and $c_1, \dots, c_n \in \mathbb{R}$.[6] The Moore–Aronszajn theorem (see below) is a sort of converse to this: if a function $K$ satisfies these conditions then there is a Hilbert space of functions on $X$ for which it is a reproducing kernel.
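The symmetry and positive definiteness conditions can be checked numerically on any finite set of points. The sketch below assumes a Gaussian kernel on $\mathbb{R}^2$ (an illustrative choice, as are the helper names) and verifies that the Gram matrix is symmetric, has no negative eigenvalues beyond rounding error, and gives a non-negative quadratic form.

```python
import numpy as np

def gaussian(x, y):
    # Illustrative kernel choice: K(x, y) = exp(-||x - y||^2)
    return np.exp(-np.sum((x - y) ** 2))

def gram_matrix(kernel, points):
    # G[i, j] = K(x_i, x_j)
    return np.array([[kernel(p, q) for q in points] for p in points])

rng = np.random.default_rng(0)
points = rng.normal(size=(8, 2))      # eight points x_1, ..., x_8 in R^2
G = gram_matrix(gaussian, points)

c = rng.normal(size=8)                # arbitrary real coefficients c_1, ..., c_8
quad_form = float(c @ G @ c)          # sum_{i,j} c_i c_j K(x_i, x_j)
min_eig = float(np.linalg.eigvalsh(G).min())
```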
Examples
The simplest example of a reproducing kernel Hilbert space is the space $L^2(X, \mu)$ where $X$ is a set and $\mu$ is the counting measure on $X$. For $x \in X$, the reproducing kernel $K_x$ is the indicator function of the one point set $\{x\} \subset X$.
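A nontrivial example is the space of bandlimited functions. Fix a cutoff frequency $0 < a < \infty$ and define the Hilbert space
$$H = \{ f \in L^2(\mathbb{R}) \mid \operatorname{supp}(F) \subset [-a, a] \},$$
where $L^2(\mathbb{R})$ is the set of square integrable functions, and $F(\omega) = \int_{-\infty}^{\infty} f(t) e^{-i\omega t} \, dt$ is the Fourier transform of $f$. As the inner product, we use
$$\langle f, g \rangle_{L^2} = \int_{-\infty}^{\infty} f(x) \, \overline{g(x)} \, dx.$$
Since this is a closed subspace of $L^2(\mathbb{R})$, it is a Hilbert space. Moreover, the elements of $H$ are smooth functions on $\mathbb{R}$ that tend to zero at infinity, essentially by the Riemann-Lebesgue lemma. In fact, the elements of $H$ are the restrictions to $\mathbb{R}$ of entire holomorphic functions, by the Paley–Wiener theorem.
From the Fourier inversion theorem, we have
$$f(x) = \frac{1}{2\pi} \int_{-a}^{a} F(\omega) e^{i x \omega} \, d\omega.$$
It then follows by the Cauchy–Schwarz inequality and Plancherel's theorem that, for all $x$,
$$|f(x)| \le \frac{1}{2\pi} \sqrt{2a \int_{-a}^{a} |F(\omega)|^2 \, d\omega} = \sqrt{\frac{a}{\pi}} \, \|f\|_{L^2}.$$
This inequality shows that the evaluation functional is bounded, so $H$ is an RKHS. The kernel function is
$$K_x(y) = \frac{\sin(a(y - x))}{\pi (y - x)},$$
whose Fourier transform equals $e^{-i\omega x}$ for $\omega \in [-a, a]$ and $0$ otherwise. Consequently, by Plancherel's theorem,
$$\langle f, K_x \rangle_{L^2} = \int_{-\infty}^{\infty} f(y) \, \overline{K_x(y)} \, dy = \frac{1}{2\pi} \int_{-a}^{a} F(\omega) e^{i \omega x} \, d\omega = f(x).$$
Thus we obtain the reproducing property of the kernel.
Note that $K_x$ in this case is the "bandlimited version" of the Dirac delta function, and that $K_x(y)$ converges to $\delta(y - x)$ in the weak sense as the cutoff frequency $a$ tends to infinity.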
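The reproducing property of the bandlimited kernel $K_x(y) = \sin(a(y - x)) / (\pi (y - x))$ can be checked numerically. The sketch below assumes the cutoff $a = 2$ and the test function $f(y) = \sin(by)/(\pi y)$ with $b = 1 \le a$, whose Fourier transform is the indicator of $[-b, b]$; the integral is approximated by a Riemann sum on a truncated grid, so agreement is only approximate.

```python
import numpy as np

a = 2.0   # cutoff frequency of the space H (assumed value)
b = 1.0   # bandwidth of the test function; b <= a, so f lies in H

def f(y):
    # f(y) = sin(b y) / (pi y); note np.sinc(t) = sin(pi t) / (pi t)
    return (b / np.pi) * np.sinc(b * y / np.pi)

def K(x, y):
    # K_x(y) = sin(a (y - x)) / (pi (y - x))
    return (a / np.pi) * np.sinc(a * (y - x) / np.pi)

h = 0.01
y = np.arange(-2000.0, 2000.0 + h, h)   # truncated integration grid
x0 = 0.7
inner = float(np.sum(f(y) * K(x0, y)) * h)   # approximates <f, K_x0> in L^2
point_value = float(f(x0))                   # f evaluated directly at x0
```

Up to the truncation error of the grid, `inner` and `point_value` agree.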
Moore–Aronszajn theorem
We have seen how a reproducing kernel Hilbert space defines a reproducing kernel function that is both symmetric and positive definite. The Moore–Aronszajn theorem goes in the other direction; it states that every symmetric, positive definite kernel defines a unique reproducing kernel Hilbert space. The theorem first appeared in Aronszajn's Theory of Reproducing Kernels, although he attributes it to E. H. Moore.
Theorem. Suppose K is a symmetric, positive definite kernel on a set X. Then there is a unique Hilbert space of functions on X for which K is a reproducing kernel.
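Proof. For all $x$ in $X$, define $K_x = K(x, \cdot)$. Let $H_0$ be the linear span of $\{K_x : x \in X\}$. Define an inner product on $H_0$ by
$$\left\langle \sum_{j=1}^{n} b_j K_{y_j}, \sum_{i=1}^{m} a_i K_{x_i} \right\rangle_{H_0} = \sum_{i=1}^{m} \sum_{j=1}^{n} a_i b_j K(y_j, x_i),$$
which implies $K(x, y) = \langle K_x, K_y \rangle_{H_0}$.
The symmetry of this inner product follows from the symmetry of $K$ and the non-degeneracy follows from the fact that $K$ is positive definite.
Let $H$ be the completion of $H_0$ with respect to this inner product. Then $H$ consists of functions of the form
$$f(x) = \sum_{i=1}^{\infty} a_i K_{x_i}(x) \quad \text{where} \quad \lim_{n \to \infty} \sup_{p \ge 0} \left\| \sum_{i=n}^{n+p} a_i K_{x_i} \right\|_{H_0} = 0.$$
The reproducing property (2) can now be checked:
$$\langle f, K_x \rangle_H = \sum_{i=1}^{\infty} a_i \langle K_{x_i}, K_x \rangle_{H_0} = \sum_{i=1}^{\infty} a_i K(x_i, x) = f(x).$$
To prove uniqueness, let $G$ be another Hilbert space of functions for which $K$ is a reproducing kernel. For every $x$ and $y$ in $X$, (2) implies that
$$\langle K_x, K_y \rangle_H = K(x, y) = \langle K_x, K_y \rangle_G.$$
By linearity, $\langle \cdot, \cdot \rangle_H = \langle \cdot, \cdot \rangle_G$ on the span of $\{K_x : x \in X\}$. Then $H \subset G$ because $G$ is complete and contains $H_0$ and hence contains its completion.
Now we need to prove that every element of $G$ is in $H$. Let $f$ be an element of $G$. Since $H$ is a closed subspace of $G$, we can write $f = f_H + f_{H^\perp}$ where $f_H \in H$ and $f_{H^\perp} \in H^\perp$. Now if $x \in X$ then, since $K$ is a reproducing kernel of $G$ and $H$:
$$f(x) = \langle K_x, f \rangle_G = \langle K_x, f_H \rangle_G + \langle K_x, f_{H^\perp} \rangle_G = \langle K_x, f_H \rangle_G = \langle K_x, f_H \rangle_H = f_H(x),$$
where we have used the fact that $K_x$ belongs to $H$ so that its inner product with $f_{H^\perp}$ in $G$ is zero.
This shows that $f = f_H$ in $G$ and concludes the proof.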
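The construction in the proof can be made concrete for finite linear combinations. In the sketch below an element of the span of the kernel sections is stored as a coefficient vector over a few centres, assuming a Gaussian kernel (the kernel, centres, and coefficients are illustrative choices). The inner product reduces to a quadratic form in the Gram matrix, and the reproducing property yields the bound $|f(x)| \le \sqrt{K(x, x)} \, \|f\|$.

```python
import numpy as np

def k(x, y):
    # Illustrative symmetric positive definite kernel on the real line
    return np.exp(-(x - y) ** 2)

centers = np.array([-1.0, 0.0, 0.5, 2.0])   # points x_i
coeffs = np.array([0.3, -1.2, 0.8, 0.5])    # coefficients a_i

def evaluate(x):
    # f(x) = sum_i a_i K(x_i, x)
    return float(np.sum(coeffs * k(centers, x)))

# Gram matrix of the centres; <f, g> = a^T G b for coefficient vectors a, b
G = k(centers[:, None], centers[None, :])
norm_sq = float(coeffs @ G @ coeffs)         # ||f||^2

# <f, K_{x_1}> computed from the definition of the inner product
inner_with_section = float(coeffs @ G[:, 1])

# Bound on point evaluation implied by Cauchy-Schwarz
x_test = 0.25
bound = float(np.sqrt(k(x_test, x_test) * norm_sq))
```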
Integral operators and Mercer's theorem
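We may characterize a symmetric positive definite kernel $K$ via the integral operator using Mercer's theorem and obtain an additional view of the RKHS. Let $X$ be a compact space equipped with a strictly positive finite Borel measure $\mu$ and $K : X \times X \to \mathbb{R}$ a continuous, symmetric, and positive definite function. Define the integral operator $T_K : L^2(X) \to L^2(X)$ as
$$[T_K f](\cdot) = \int_X K(\cdot, t) f(t) \, d\mu(t),$$
where $L^2(X)$ is the space of square integrable functions with respect to $\mu$.
Mercer's theorem states that the spectral decomposition of the integral operator $T_K$ of $K$ yields a series representation of $K$ in terms of the eigenvalues and eigenfunctions of $T_K$. This then implies that $K$ is a reproducing kernel so that the corresponding RKHS can be defined in terms of these eigenvalues and eigenfunctions. We provide the details below.
Under these assumptions $T_K$ is a compact, continuous, self-adjoint, and positive operator. The spectral theorem for self-adjoint operators implies that there is an at most countable decreasing sequence $(\sigma_i)_i \ge 0$ such that $\lim_{i \to \infty} \sigma_i = 0$ and
$T_K \varphi_i(x) = \sigma_i \varphi_i(x)$, where the $\{\varphi_i\}$ form an orthonormal basis of $L^2(X)$. By the positivity of $T_K$, $\sigma_i > 0$ for all $i$. One can also show that $T_K$ maps continuously into the space of continuous functions $C(X)$ and therefore we may choose continuous functions as the eigenvectors, that is, $\varphi_i \in C(X)$ for all $i$. Then by Mercer's theorem $K$ may be written in terms of the eigenvalues and continuous eigenfunctions as
$$K(x, y) = \sum_{j=1}^{\infty} \sigma_j \, \varphi_j(x) \, \varphi_j(y)$$
for all $x, y \in X$ such that
$$\lim_{n \to \infty} \sup_{u, v} \left| K(u, v) - \sum_{j=1}^{n} \sigma_j \, \varphi_j(u) \, \varphi_j(v) \right| = 0.$$
The above series representation is referred to as a Mercer kernel or Mercer representation of $K$.
Furthermore, it can be shown that the RKHS $H$ of $K$ is given by
$$H = \left\{ f \in L^2(X) \,\Bigg|\, \sum_{i=1}^{\infty} \frac{\langle f, \varphi_i \rangle_{L^2}^2}{\sigma_i} < \infty \right\},$$
where the inner product of $H$ is given by
$$\langle f, g \rangle_H = \sum_{i=1}^{\infty} \frac{\langle f, \varphi_i \rangle_{L^2} \, \langle g, \varphi_i \rangle_{L^2}}{\sigma_i}.$$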
This representation of the RKHS has application in probability and statistics, for example to the Karhunen-Loève representation for stochastic processes and kernel PCA.
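A discretised version of the Mercer representation can be computed with a symmetric eigendecomposition. The sketch below assumes $X = [0, 1]$ with the uniform measure, a Gaussian kernel, and a midpoint grid of $n = 50$ points (all illustrative choices); the integral operator is approximated by the Gram matrix divided by $n$, so the eigenpairs are only approximations of those of $T_K$.

```python
import numpy as np

n = 50
x = (np.arange(n) + 0.5) / n                      # midpoint grid on [0, 1]
G = np.exp(-(x[:, None] - x[None, :]) ** 2)       # K(x_i, x_k)

# Discretised operator: (T_K f)(x_i) ~ (1/n) * sum_k K(x_i, x_k) f(x_k)
sigma, V = np.linalg.eigh(G / n)
order = np.argsort(sigma)[::-1]                   # sort eigenvalues decreasingly
sigma, V = sigma[order], V[:, order]
phi = np.sqrt(n) * V    # columns approximate L^2-orthonormal eigenfunctions

def mercer_sum(m):
    # Partial sum  sum_{j < m} sigma_j * phi_j(x_i) * phi_j(x_k)
    return (phi[:, :m] * sigma[:m]) @ phi[:, :m].T

err_full = float(np.max(np.abs(G - mercer_sum(n))))   # all n terms
err_10 = float(np.max(np.abs(G - mercer_sum(10))))    # first 10 terms only
```

With all $n$ terms the discrete kernel matrix is recovered up to rounding, and the rapid decay of the eigenvalues means a short partial sum is already a close uniform approximation.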
Feature maps
A feature map is a map $\varphi : X \to F$, where $F$ is a Hilbert space which we will call the feature space. The first sections presented the connection between bounded/continuous evaluation functions, positive definite functions, and integral operators, and in this section we provide another representation of the RKHS in terms of feature maps.
Every feature map defines a kernel via
$$K(x, y) = \langle \varphi(x), \varphi(y) \rangle_F. \tag{3}$$
Clearly $K$ is symmetric, and positive definiteness follows from the properties of the inner product in $F$. Conversely, every positive definite function and corresponding reproducing kernel Hilbert space has infinitely many associated feature maps such that (3) holds.
For example, we can trivially take $F = H$ and $\varphi(x) = K_x$ for all $x \in X$. Then (3) is satisfied by the reproducing property. Another classical example of a feature map relates to the previous section regarding integral operators by taking $F = \ell^2$ and $\varphi(x) = (\sqrt{\sigma_i} \, \varphi_i(x))_i$.
This connection between kernels and feature maps provides us with a new way to understand positive definite functions and hence reproducing kernels as inner products in $F$. Moreover, every feature map can naturally define an RKHS by means of the definition of a positive definite function.
Lastly, feature maps allow us to construct function spaces that reveal another perspective on the RKHS. Consider the linear space
$$H_\varphi = \{ f : X \to \mathbb{R} \mid \exists w \in F, \; f(x) = \langle w, \varphi(x) \rangle_F, \; \forall x \in X \}.$$
We can define a norm on $H_\varphi$ by
$$\|f\|_\varphi = \inf \{ \|w\|_F : w \in F, \; f(x) = \langle w, \varphi(x) \rangle_F, \; \forall x \in X \}.$$
It can be shown that $H_\varphi$ is an RKHS with kernel defined by $K(x, y) = \langle \varphi(x), \varphi(y) \rangle_F$. This representation implies that the elements of the RKHS are inner products of elements in the feature space and can accordingly be seen as hyperplanes. This view of the RKHS is related to the kernel trick in machine learning.[7]
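A small example makes the correspondence between feature maps and kernels explicit. The sketch below uses a hypothetical feature map from $\mathbb{R}^2$ into $F = \mathbb{R}^3$ (chosen here for illustration, not taken from the text) whose induced kernel has the closed form $(x \cdot y)^2$, and evaluates one element of the associated function space as an inner product with a weight vector $w$.

```python
import numpy as np

def feature_map(x):
    # Hypothetical feature map from R^2 into the feature space F = R^3
    return np.array([x[0] ** 2, np.sqrt(2.0) * x[0] * x[1], x[1] ** 2])

def kernel(x, y):
    # Closed form of <feature_map(x), feature_map(y)>: the quadratic kernel
    return float(np.dot(x, y)) ** 2

x = np.array([1.0, 2.0])
y = np.array([3.0, -1.0])

k_via_features = float(feature_map(x) @ feature_map(y))   # inner product in F
k_direct = kernel(x, y)                                    # no features needed

# An element of the function space: f(x) = <w, feature_map(x)>
w = np.array([0.5, -1.0, 2.0])
f_x = float(w @ feature_map(x))
```

Evaluating `kernel` never forms the feature vectors, which is the observation behind the kernel trick.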
Properties
Useful properties of RKHSs:
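Let $(X_i)_{i=1}^{p}$ be a sequence of sets and $(K_i)_{i=1}^{p}$ be a collection of corresponding positive definite functions on $(X_i)_{i=1}^{p}$. It then follows that
$$K((x_1, \ldots, x_p), (y_1, \ldots, y_p)) = K_1(x_1, y_1) \cdots K_p(x_p, y_p)$$
is a kernel on $X = X_1 \times \dots \times X_p$.
Let $X_0 \subset X$; then the restriction of $K$ to $X_0 \times X_0$ is also a reproducing kernel.
Consider a normalized kernel $K$ such that $K(x, x) = 1$ for all $x \in X$. Define a pseudo-metric on $X$ as
$$d_K(x, y) = \|K_x - K_y\|_H^2 = 2(1 - K(x, y)) \quad \forall x, y \in X.$$
By the Cauchy–Schwarz inequality,
$$K(x, y)^2 \le K(x, x) \, K(y, y) = 1 \quad \forall x, y \in X.$$
This inequality allows us to view $K$ as a measure of similarity between inputs. If $x, y \in X$ are similar then $K(x, y)$ will be closer to 1, while if $x, y \in X$ are dissimilar then $K(x, y)$ will be closer to 0.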
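Two of these properties are easy to check numerically: the product of kernels on a product space is again a kernel, and for a normalized kernel the quantity $\|K_x - K_y\|^2$ reduces to $2(1 - K(x, y))$. The sketch below assumes two one-dimensional Gaussian kernels as the factors (an illustrative choice; all names are hypothetical).

```python
import numpy as np

def gaussian(x, y):
    # Normalised kernel: gaussian(x, x) == 1 for every x
    return np.exp(-0.5 * (x - y) ** 2)

def product_kernel(p, q):
    # Kernel on X_1 x X_2 built as K_1(x_1, y_1) * K_2(x_2, y_2)
    return gaussian(p[0], q[0]) * gaussian(p[1], q[1])

def d_K(p, q, kernel):
    # ||K_p - K_q||^2 expanded with the reproducing property
    return kernel(p, p) - 2.0 * kernel(p, q) + kernel(q, q)

rng = np.random.default_rng(1)
pts = rng.normal(size=(6, 2))                 # six points in X_1 x X_2
G = np.array([[product_kernel(p, q) for q in pts] for p in pts])
min_eig = float(np.linalg.eigvalsh(G).min())  # non-negative for a kernel

similarity = float(product_kernel(pts[0], pts[1]))
dist = float(d_K(pts[0], pts[1], product_kernel))
```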
We also provide examples of Bergman kernels. Let $X$ be finite and let $H$ consist of all complex-valued functions on $X$. Then an element of $H$ can be represented as an array of complex numbers. If the usual inner product is used, then $K_x$ is the function whose value is 1 at $x$ and 0 everywhere else, and $K(x, y)$ can be thought of as an identity matrix since
$$K(x, y) = \begin{cases} 1 & x = y \\ 0 & x \ne y. \end{cases}$$
Lastly, the space of band limited functions in $L^2(\mathbb{R})$ with bandwidth $2a$ is an RKHS with reproducing kernel
$$K(x, y) = \frac{\sin a(x - y)}{\pi (x - y)}.$$
Extension to vector-valued functions
In this section we extend the definition of the RKHS to spaces of vector-valued functions, as this extension is particularly important in multi-task learning and manifold regularization. The main difference is that the reproducing kernel $\Gamma$ is a symmetric function that is now a positive semi-definite matrix for every $x, y$ in $X$. More formally, we define a vector-valued RKHS (vvRKHS) as a Hilbert space of functions $f : X \to \mathbb{R}^T$ such that for all $c \in \mathbb{R}^T$ and $x \in X$
$$\Gamma_x c \, (y) = \Gamma(x, y) \, c \in H \quad \text{for } y \in X$$
and
$$\langle f, \Gamma_x c \rangle_H = f(x)^\top c.$$
This second property parallels the reproducing property for the scalar-valued case. This definition can also be connected to integral operators, bounded evaluation functions, and feature maps as we saw for the scalar-valued RKHS. We can equivalently define the vvRKHS as a vector-valued Hilbert space with a bounded evaluation functional and show that this implies the existence of a unique reproducing kernel by the Riesz representation theorem. Mercer's theorem can also be extended to address the vector-valued setting and we can therefore obtain a feature map view of the vvRKHS. Lastly, it can also be shown that the closure of the span of $\{\Gamma_x c : x \in X, c \in \mathbb{R}^T\}$ coincides with $H$, another property similar to the scalar-valued case.
We can gain intuition for the vvRKHS by taking a component-wise perspective on these spaces. In particular, we find that every vvRKHS is isometrically isomorphic to a scalar-valued RKHS on a particular input space. Let $\Lambda = \{1, \dots, T\}$. Consider the space $X \times \Lambda$ and the corresponding reproducing kernel
$$\gamma : X \times \Lambda \times X \times \Lambda \to \mathbb{R}. \tag{4}$$
As noted above, the RKHS associated to this reproducing kernel is given by the closure of the span of $\{\gamma_{(x,t)} : x \in X, t \in \Lambda\}$ where
$$\gamma_{(x,t)}(y, s) = \gamma((x, t), (y, s))$$
for every set of pairs $(x, t), (y, s) \in X \times \Lambda$.
The connection to the scalar-valued RKHS can then be made by the fact that every matrix-valued kernel can be identified with a kernel of the form of (4) via
$$\Gamma(x, y)_{(t,s)} = \gamma((x, t), (y, s)).$$
Moreover, every kernel with the form of (4) defines a matrix-valued kernel with the above expression. Now letting the map $D : H_\Gamma \to H_\gamma$ be defined as
$$(Df)(x, t) = \langle f(x), e_t \rangle_{\mathbb{R}^T},$$
where $e_t$ is the $t$-th component of the canonical basis for $\mathbb{R}^T$, one can show that $D$ is bijective and an isometry between $H_\Gamma$ and $H_\gamma$.
While this view of the vvRKHS can be useful in multi-task learning, this isometry does not reduce the study of the vector-valued case to that of the scalar-valued case. In fact, this isometry procedure can make both the scalar-valued kernel and the input space too difficult to work with in practice as properties of the original kernels are often lost.[11][12][13]
An important class of matrix-valued reproducing kernels are separable kernels, which can be factorized as the product of a scalar-valued kernel and a $T$-dimensional symmetric positive semi-definite matrix. In light of our previous discussion these kernels are of the form
$$\gamma((x, t), (y, s)) = K(x, y) \, K_T(t, s)$$
for all $x, y$ in $X$ and $t, s$ in $\{1, \dots, T\}$. As the scalar-valued kernel encodes dependencies between the inputs, we can observe that the matrix-valued kernel encodes dependencies among both the inputs and the outputs.
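For a separable kernel, the Gram matrix over $n$ inputs and $T$ outputs is the Kronecker product of the scalar Gram matrix with the output matrix. The sketch below assumes a Gaussian scalar kernel and a hand-picked $2 \times 2$ positive semi-definite output matrix `B` (both illustrative), and checks that the resulting $nT \times nT$ matrix is positive semi-definite.

```python
import numpy as np

def scalar_kernel(x, y):
    # Illustrative scalar-valued kernel encoding similarity between inputs
    return np.exp(-(x - y) ** 2)

inputs = np.array([0.0, 0.4, 1.5])         # n = 3 input points
B = np.array([[1.0, 0.6],
              [0.6, 1.0]])                 # T = 2 outputs; B[t, s] plays K_T(t, s)
T = B.shape[0]

K_x = scalar_kernel(inputs[:, None], inputs[None, :])

# Entry ((i, t), (j, s)) of the full Gram matrix is K(x_i, x_j) * B[t, s],
# stored at row i * T + t and column j * T + s.
full_gram = np.kron(K_x, B)
min_eig = float(np.linalg.eigvalsh(full_gram).min())
```

Because the full matrix factorises, its eigenvalues are the pairwise products of those of `K_x` and `B`, which is one reason separable kernels are convenient in practice.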
We lastly remark that the above theory can be further extended to spaces of functions with values in function spaces but obtaining kernels for these spaces is a more difficult task.[14]
Connection between RKHSs and the ReLU function
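The ReLU function is commonly defined as $f(x) = \max\{0, x\}$ and is a mainstay in the architecture of neural networks, where it is used as an activation function. One can construct a ReLU-like nonlinear function using the theory of reproducing kernel Hilbert spaces. Below, we derive this construction and indicate how it relates to the representation power of neural networks with ReLU activations.
We will work with the Hilbert space $\mathcal{H} = L_2^1(0)[0, \infty)$ of absolutely continuous functions with $f(0) = 0$ and square integrable (i.e. $L^2$) derivative. It has the inner product
$$\langle f, g \rangle_{\mathcal{H}} = \int_0^\infty f'(x) \, g'(x) \, dx.$$
To construct the reproducing kernel it suffices to consider a dense subspace, so let $f \in C^1[0, \infty)$ and $f(0) = 0$.
The Fundamental Theorem of Calculus then gives
$$f(y) = \int_0^y f'(x) \, dx = \int_0^\infty G(x, y) \, f'(x) \, dx = \langle K_y, f \rangle_{\mathcal{H}},$$
where
$$G(x, y) = \begin{cases} 1, & x < y \\ 0, & \text{otherwise} \end{cases}$$
and $K_y'(x) = G(x, y)$, $K_y(0) = 0$, i.e.
$$K(x, y) = K_y(x) = \int_0^x G(z, y) \, dz = \begin{cases} x, & 0 \le x < y \\ y, & \text{otherwise} \end{cases} = \min(x, y).$$
This implies $K_y = K(\cdot, y)$ reproduces $f$.
Moreover, the minimum function on $X \times X = [0, \infty) \times [0, \infty)$ has the following representations with the ReLU function:
$$\min(x, y) = x - \operatorname{ReLU}(x - y) = y - \operatorname{ReLU}(y - x).$$
Using this formulation, we can apply the representer theorem to the RKHS, which may be used to argue for the optimality of ReLU activations in neural network settings.[citation needed]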
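Both the ReLU representation of the minimum kernel and its reproducing property can be checked numerically. The sketch below assumes the test function $f(x) = 1 - e^{-x}$, which satisfies $f(0) = 0$ and has a square integrable derivative, and approximates the inner product $\int_0^\infty K_y'(x) f'(x) \, dx$ by a midpoint sum on a truncated grid.

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def min_kernel(x, y):
    # K(x, y) = min(x, y) written with a single ReLU
    return x - relu(x - y)

# 1) The two ReLU representations agree with min(x, y) on random inputs.
rng = np.random.default_rng(2)
xs = rng.uniform(0.0, 5.0, size=100)
ys = rng.uniform(0.0, 5.0, size=100)
identity_holds = bool(
    np.allclose(min_kernel(xs, ys), np.minimum(xs, ys))
    and np.allclose(ys - relu(ys - xs), np.minimum(xs, ys))
)

# 2) Reproducing property: <K_y, f> = integral of K_y'(x) f'(x) dx = f(y).
h = 1e-3
grid = (np.arange(20000) + 0.5) * h      # midpoints covering [0, 20]
y0 = 1.3
f_prime = np.exp(-grid)                  # derivative of f(x) = 1 - exp(-x)
k_prime = (grid < y0).astype(float)      # K_y'(x) = G(x, y) = 1 if x < y else 0
inner = float(np.sum(k_prime * f_prime) * h)
f_at_y0 = 1.0 - np.exp(-y0)
```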
Alpay, D., and T. M. Mills. "A family of Hilbert spaces which are not reproducing kernel Hilbert spaces." J. Anal. Appl. 1.2 (2003): 107–111.
Z. Pasternak-Winiarski, "On weights which admit reproducing kernel of Bergman type", International Journal of Mathematics and Mathematical Sciences, vol. 15, Issue 1, 1992.
T. Ł. Żynda, "On weights which admit reproducing kernel of Szegő type", Journal of Contemporary Mathematical Analysis (Armenian Academy of Sciences), 55, 2020.
De Vito, Ernest, Umanita, Veronica, and Villa, Silvia. "An extension of Mercer theorem to vector-valued measurable kernels," arXiv:1110.4017, June 2013.
Steinwart, Ingo; Scovel, Clint (2012). "Mercer's theorem on general domains: On the interaction between measures, kernels, and RKHSs". Constr. Approx. 35 (3): 363–417. doi:10.1007/s00365-012-9153-3. MR2914365. S2CID253885172.
Rosasco, Lorenzo and Poggio, Thomas. "A Regularization Tour of Machine Learning – MIT 9.520 Lecture Notes" Manuscript, Dec. 2014.