Kantorovich in 1948 proposed calculating the smallest eigenvalue of a symmetric matrix by steepest descent using a direction of a scaled gradient of a Rayleigh quotient in a scalar product, with the step size computed by minimizing the Rayleigh quotient in the linear span of the vectors and , i.e. in a locally optimal manner. Samokish[1] proposed applying a preconditioner to the residual vector to generate the preconditioned direction and derived asymptotic, as approaches the eigenvector, convergence rate bounds. D'yakonov suggested[2] spectrally equivalent preconditioning and derived non-asymptotic convergence rate bounds. Block locally optimal multi-step steepest descent for eigenvalue problems was described in.[3] Local minimization of the Rayleigh quotient on the subspace spanned by the current approximation, the current residual and the previous approximation, as well as its block version, appeared in.[4] The preconditioned version was analyzed in [5] and.[6]
The costs per iteration and the memory use are competitive with those of the Lanczos method, computing a single extreme eigenpair of a symmetric matrix.
Linear convergence is theoretically guaranteed and practically observed.
Accelerated convergence due to direct preconditioning, in contrast to the Lanczos method, including variable and non-symmetric as well as fixed and positive definite preconditioning.
which results in finding largest (or smallest) eigenpairs of
The direction of the steepest ascent, which is the gradient, of the generalized Rayleigh quotient is positively proportional to the vector
called the eigenvector residual. If a preconditioner is available, it is applied to the residual and gives the vector
called the preconditioned residual. Without preconditioning, we set
and so . An iterative method
or, in short,
is known as preconditioned steepest ascent (or descent), where the scalar
is called the step size. The optimal step size can be determined by maximizing the Rayleigh quotient, i.e.,
(or in case of minimizing),
in which case the method is called locally optimal.
Three-term recurrence
To dramatically accelerate the convergence of the locally optimal preconditioned steepest ascent (or descent), one extra vector can be added to the two-term recurrence relation to make it three-term:
(use in case of minimizing). The maximization/minimization of the Rayleigh quotient in a 3-dimensional subspace can be performed numerically by the Rayleigh–Ritz method. Adding more vectors, see, e.g., Richardson extrapolation, does not result in significant acceleration[8] but increases computation costs, so is not generally recommended.
Numerical stability improvements
As the iterations converge, the vectors and become nearly linearly dependent, resulting in a precision loss and making the Rayleigh–Ritz method numerically unstable in the presence of round-off errors. The loss of precision may be avoided by substituting the vector with a vector , that may be further away from , in the basis of the three-dimensional subspace , while keeping the subspace unchanged and avoiding orthogonalization or any other extra operations.[8] Furthermore, orthogonalizing the basis of the three-dimensional subspace may be needed for ill-conditioned eigenvalue problems to improve stability and attainable accuracy.
Krylov subspace analogs
This is a single-vector version of the LOBPCG method—one of possible generalization of the preconditionedconjugate gradient linear solvers to the case of symmetric eigenvalue problems.[8] Even in the trivial case and the resulting approximation with will be different from that obtained by the Lanczos algorithm, although both approximations will belong to the same Krylov subspace.
Practical use scenarios
Extreme simplicity and high efficiency of the single-vector version of LOBPCG make it attractive for eigenvalue-related applications under severe hardware limitations, ranging from spectral clustering based real-time anomaly detection via graph partitioning on embedded ASIC or FPGA to modelling physical phenomena of record computing complexity on exascale TOP500 supercomputers.
Block version
Summary
Subsequent eigenpairs can be computed one-by-one via single-vector LOBPCG supplemented with an orthogonal deflation or simultaneously as a block. In the former approach, imprecisions in already computed approximate eigenvectors additively affect the accuracy of the subsequently computed eigenvectors, thus increasing the error with every new computation. Iterating several approximate eigenvectors together in a block in a locally optimal fashion in the block version of the LOBPCG.[8] allows fast, accurate, and robust computation of eigenvectors, including those corresponding to nearly-multiple eigenvalues where the single-vector LOBPCG suffers from slow convergence. The block size can be tuned to balance numerical stability vs. convergence speed vs. computer costs of orthogonalizations and the Rayleigh-Ritz method on every iteration.
Core design
The block approach in LOBPCG replaces single-vectors and with block-vectors, i.e. matrices and , where, e.g., every column of approximates one of the eigenvectors. All columns are iterated simultaneously, and the next matrix of approximate eigenvectors is determined by the Rayleigh–Ritz method on the subspace spanned by all columns of matrices and . Each column of is computed simply as the preconditioned residual for every column of The matrix is determined such that the subspaces spanned by the columns of and of are the same.
Numerical stability vs. efficiency
The outcome of the Rayleigh–Ritz method is determined by the subspace spanned by all columns of matrices and , where a basis of the subspace can theoretically be arbitrary. However, in inexact computer arithmetic the Rayleigh–Ritz method becomes numerically unstable if some of the basis vectors are approximately linearly dependent. Numerical instabilities typically occur, e.g., if some of the eigenvectors in the iterative block already reach attainable accuracy for a given computer precision and are especially prominent in low precision, e.g., single precision.
The art of multiple different implementation of LOBPCG is to ensure numerical stability of the Rayleigh–Ritz method at minimal computing costs by choosing a good basis of the subspace. The arguably most stable approach of making the basis vectors orthogonal, e.g., by the Gram–Schmidt process, is also the most computational expensive. For example, LOBPCG implementations,[9][10] utilize unstable but efficient Cholesky decomposition of the normal matrix, which is performed only on individual matrices and , rather than on the whole subspace. The constantly increasing amount of computer memory allows typical block sizes nowadays in the range, where the percentage of compute time spend on orthogonalizations and the Rayleigh-Ritz method starts dominating.
Locking of previously converged eigenvectors
Block methods for eigenvalue problems that iterate subspaces commonly have some of the iterative eigenvectors converged faster than others that motivates locking the already converged eigenvectors, i.e., removing them from the iterative loop, in order to eliminate unnecessary computations and improve numerical stability. A simple removal of an eigenvector may likely result in forming its duplicate in still iterating vectors. The fact that the eigenvectors of symmetric eigenvalue problems are pair-wise orthogonal suggest keeping all iterative vectors orthogonal to the locked vectors.
Locking can be implemented differently maintaining numerical accuracy and stability while minimizing the compute costs. For example, LOBPCG implementations,[9][10] follow,[8][11] separating hard locking, i.e. a deflation by restriction, where the locked eigenvectors serve as a code input and do not change, from soft locking, where the locked vectors do not participate in the typically most expensive iterative step of computing the residuals, however, fully participate in the Rayleigh—Ritz method and thus are allowed to be changed by the Rayleigh—Ritz method.
Modifications, LOBPCG II
LOBPCG includes all columns of matrices and into the Rayleigh–Ritz method resulting in an up to -by- eigenvalue problem needed to solve and up to dot products to compute at every iteration, where denotes the block size — the number of columns. For large block sizes this starts dominating compute and I/O costs and limiting parallelization, where multiple compute devices are running simultaneously.
The original LOBPCG paper[8] describes a modification, called LOBPCG II, to address such a problem running the single-vector version of the LOBPCG method for each desired eigenpair with the Rayleigh-Ritz procedure solving of 3-by-3 projected eigenvalue problems. The global Rayleigh-Ritz procedure for all eigenpairs is on every iteration but only on the columns of the matrix , thus reducing the number of the necessary dot products to from and the size of the global projected eigenvalue problem to -by- from -by- on every iteration.
Reference [12] goes further applying the LOBPCG algorithm to each approximate eigenvector separately, i.e., running the unblocked version of the LOBPCG method for each desired eigenpair for a fixed number of iterations. The Rayleigh-Ritz procedures in these runs only need to solve a set of 3 × 3 projected eigenvalue problems. The global Rayleigh-Ritz procedure for all desired eigenpairs is only applied periodically at the end of a fixed number of unblocked LOBPCG iterations.
Such modifications may be less robust compared to the original LOBPCG. Individually running branches of the single-vector LOBPCG may not follow continuous iterative paths flipping instead and creating duplicated approximations to the same eigenvector. The single-vector LOBPCG may be unsuitable for clustered eigenvalues, but separate small-block LOBPCG runs require determining their block sizes automatically during the process of iterations since the number of the clusters of eigenvalues and their sizes may be unknown a priori.
Convergence theory and practice
LOBPCG by construction is guaranteed[8] to minimize the Rayleigh quotient not slower than the block steepest gradient descent, which has a comprehensive convergence theory. Every eigenvector is a stationary point of the Rayleigh quotient, where the gradient vanishes. Thus, the gradient descent may slow down in a vicinity of any eigenvector, however, it is guaranteed to either converge to the eigenvector with a linear convergence rate or, if this eigenvector is a saddle point, the iterative Rayleigh quotient is more likely to drop down below the corresponding eigenvalue and start converging linearly to the next eigenvalue below. The worst value of the linear convergence rate has been determined[8] and depends on the relative gap between the eigenvalue and the rest of the matrix spectrum and the quality of the preconditioner, if present.
For a general matrix, there is evidently no way to predict the eigenvectors and thus generate the initial approximations that always work well. The iterative solution by LOBPCG may be sensitive to the initial eigenvectors approximations, e.g., taking longer to converge slowing down as passing intermediate eigenpairs. Moreover, in theory, one cannot guarantee convergence necessarily to the smallest eigenpair, although the probability of the miss is zero. A good quality randomGaussian function with the zero mean is commonly the default in LOBPCG to generate the initial approximations. To fix the initial approximations, one can select a fixed seed for the random number generator.
LOBPCG can be trivially adapted for computing several largest singular values and the corresponding singular vectors (partial SVD), e.g., for iterative computation of PCA, for a data matrix D with zero mean, without explicitly computing the covariance matrix DTD, i.e. in matrix-free fashion. The main calculation is evaluation of a function of the product DT(D X) of the covariance matrix DTD and the block-vector X that iteratively approximates the desired singular vectors. PCA needs the largest eigenvalues of the covariance matrix, while LOBPCG is typically implemented to calculate the smallest ones. A simple work-around is to negate the function, substituting -DT(D X) for DT(D X) and thus reversing the order of the eigenvalues, since LOBPCG does not care if the matrix of the eigenvalue problem is positive definite or not.[9]
LOBPCG for PCA and SVD is implemented in SciPy since revision 1.4.0[13]
LOBPCG is one of core eigenvalue solvers in PYFEMax and high performance multiphysics finite element software Netgen/NGSolve. LOBPCG from hypre is incorporated into open source lightweight scalable C++ library for finite element methods MFEM, which is used in many projects, including BLAST, XBraid, VisIt, xSDK, the FASTMath institute in SciDAC, and the co-design Center for Efficient Exascale Discretizations (CEED) in the Exascale computing Project.
^Knyazev, Andrew V. (1987). "Convergence rate estimates for iterative methods for mesh symmetric eigenvalue problem". Soviet Journal of Numerical Analysis and Mathematical Modelling. 2 (5): 371–396. doi:10.1515/rnam.1987.2.5.371. S2CID121473545.
^Knyazev, A. V. (1991). "A Preconditioned Conjugate Gradient Method for Eigenvalue Problems and its Implementation in a Subspace". In Albrecht, J.; Collatz, L.; Hagedorn, P.; Velte, W. (eds.). Numerical Treatment of Eigenvalue Problems Vol. 5. International Series of Numerical Mathematics. Vol. 96. pp. 143–154. doi:10.1007/978-3-0348-6332-2_11. ISBN978-3-0348-6334-6.
^Knyazev, Andrew (2017). "Recent implementations, applications, and extensions of the Locally Optimal Block Preconditioned Conjugate Gradient method (LOBPCG)". arXiv:1708.08354 [cs.NA].
^Knyazev, A. V.; Argentati, M. E.; Lashuk, I.; Ovtchinnikov, E. E. (2007). "Block Locally Optimal Preconditioned Eigenvalue Xolvers (BLOPEX) in Hypre and PETSc". SIAM Journal on Scientific Computing. 29 (5): 2224. arXiv:0705.2626. Bibcode:2007arXiv0705.2626K. doi:10.1137/060661624. S2CID266.
^Yamada, S.; Imamura, T.; Machida, M. (2005). 16.447 TFlops and 159-Billion-dimensional Exact-diagonalization for Trapped Fermion-Hubbard Model on the Earth Simulator. Proc. ACM/IEEE Conference on Supercomputing (SC'05). p. 44. doi:10.1109/SC.2005.1. ISBN1-59593-061-2.
^Yamada, S.; Imamura, T.; Kano, T.; Machida, M. (2006). Gordon Bell finalists I—High-performance computing for exact numerical approaches to quantum many-body problems on the earth simulator. Proc. ACM/IEEE conference on Supercomputing (SC '06). p. 47. doi:10.1145/1188455.1188504. ISBN0769527000.
^Yamada, S.; Imamura, T.; Machida, M. (2018). High Performance LOBPCG Method for Solving Multiple Eigenvalues of Hubbard Model: Efficiency of Communication Avoiding Neumann Expansion Preconditioner. Asian Conference on Supercomputing Frontiers. Yokota R., Wu W. (eds) Supercomputing Frontiers. SCFA 2018. Lecture Notes in Computer Science, vol 10776. Springer, Cham. pp. 243–256. doi:10.1007/978-3-319-69953-0_14.
^Yamada, S.; Imamura, T.; Machida, M. (2022). High performance parallel LOBPCG method for large Hamiltonian derived from Hubbard model on multi-GPU systems. SupercomputingAsia (SCA).
^Yang, C.; Meza, J. C.; Lee, B.; Wang, L.-W. (2009). "KSSOLV - a MATLAB toolbox for solving the Kohn-Sham equations". ACM Trans. Math. Softw. 36 (2): 1–35. doi:10.1145/1499096.1499099. S2CID624897.
^Fathurrahman, Fadjar; Agusta, Mohammad Kemal; Saputro, Adhitya Gandaryus; Dipojono, Hermawan Kresno (2020). "PWDFT.jl: A Julia package for electronic structure calculation using density functional theory and plane wave basis". Computer Physics Communications. 256: 107372. Bibcode:2020CoPhC.25607372F. doi:10.1016/j.cpc.2020.107372. S2CID219517717.
^Knyazev, A.; Malyshev, A. (2015). Accelerated graph-based spectral polynomial filters. 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP), Boston, MA. pp. 1–6. arXiv:1509.02468. doi:10.1109/MLSP.2015.7324315.
Томас ҐустафсонСвен Томас Ґустафсон Загальна інформаціяНаціональність шведГромадянство ШвеціяНародження 28 грудня 1959(1959-12-28) (63 роки)Катрінегольм, СедерманландЗріст 175 смВага 70 кгСпортКраїна ШвеціяВид спорту ковзанярський спортДисципліна ковзанярський спорт на дов
Lazare Carnot Lazare Nicolas Marguerite Carnot (* 13. Mai 1753 in Nolay, Burgund, heute Département Côte-d’Or; † 2. August 1823 in Magdeburg) war ein französischer Offizier, Mathematiker und Politiker. Inhaltsverzeichnis 1 Biografie 2 Bedeutung 3 Ehrungen 4 Schriften 5 Siehe auch 6 Literatur 7 Weblinks 8 Einzelnachweise Biografie Carnot wurde in eine bürgerliche Familie geboren. Er hatte 17 Geschwister, von denen ein Bruder zum Procureur général am königlichen Hof aufstieg.[1...
American physician (1807–1881) For the American politician from Maryland, see Albert D. Mackey. Albert G. MackeyAlbert Mackey about 1870Born(1807-03-13)March 13, 1807Charleston, South Carolina, U.S.DiedJune 21, 1881(1881-06-21) (aged 74)Fortress Monroe, Virginia, U.S.NationalityAmericanOccupationPhysicianKnown forPioneering Masonic author and encyclopedian Part of a series onFreemasonry Overview Grand Lodge Masonic lodge Masonic lodge officers Grand Master Prince Hall Freemasonry ...
Daily newspaper published in Omaha, Nebraska This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Omaha World-Herald – news · newspapers · books · scholar · JSTOR (June 2023) (Learn how and when to remove this template message) This article is about the newspaper. For the book publisher, see World Publishing Comp...
ريتشارد ويتشغه (بالهولندية: Richard Witschge) معلومات شخصية الميلاد 20 سبتمبر 1969 (العمر 54 سنة)أمستردام الطول 1.83 م (6 قدم 0 بوصة) مركز اللعب لاعب وسط الجنسية هولندي أخوة وأخوات روب ويتشجه مسيرة الشباب سنوات فريق أياكس أمستردام أياكس أمستردام المسيرة الاحترافية1 سنوات ف...
Lihat pula: Terminal Kenjeran dan Shelter Bulak Terminal Kedung CowekTerminal Penumpang Tipe BPapan Nama Terminal Kedung CowekNama lainTerminal Tambak WediLokasiJalan Tambak Wedi Nomor 2, Kelurahan Kedung Cowek, Kecamatan Bulak, Kota Surabaya, Provinsi Jawa Timur, Kodepos 60125Kawasan Utara Surabaya IndonesiaKoordinat7°12′47″S 112°46′45″E / 7.213027°S 112.779302°E / -7.213027; 112.779302Koordinat: 7°12′47″S 112°46′45″E / 7.213...
نورث لوغان الإحداثيات 41°46′28″N 111°48′57″W / 41.774444444444°N 111.81583333333°W / 41.774444444444; -111.81583333333 تاريخ التأسيس 1890 تقسيم إداري البلد الولايات المتحدة[1] التقسيم الأعلى مقاطعة كاش، يوتا خصائص جغرافية المساحة 18.493175 كيلومتر مربع18.048892 كيلومتر مربع...
2013 Belgian filmMarinaDirected byStijn ConinxWritten byRik D'HietProduced byPeter BouckaertCinematographyLou BerghmansEdited byPhilippe RavoetMusic byMichelino BiscegliaProductioncompanyEyeworksRelease dates 23 August 2013 (2013-08-23) (Montréal World Film Festival) 6 November 2013 (2013-11-06) (Belgium) CountryBelgiumLanguagesDutch, ItalianBox office$4,941,893[1] Marina is a biographical film released in 2013 and directed by Stijn Coninx. The f...
Roman empress and wife of the Roman emperor Antoninus Pius Faustina the ElderAugustaBust of Faustina Major in the Altes Museum (Berlin)Roman empressTenure138 – 140Bornc. 100Diedlate October 140 (aged 40)near Rome, ItalyBurialMausoleum of HadrianSpouseAntoninus PiusIssue Marcus Aurelius Fulvius Antoninus (died before 138) Marcus Galerius Aurelius Antoninus (died before 138) Aurelia Fadilla (died in 135) Annia Galeria Faustina Minor or Faustina the Younger NamesAnnia Galeria FaustinaRegnal na...
Hypothesis in neuroscience proposed by Karl Friston The free energy principle is a theoretical framework suggesting that the brain reduces surprise or uncertainty by making predictions based on internal models and updating them using sensory input. It highlights the brain's objective of aligning its internal model with the external world to enhance prediction accuracy. This principle integrates Bayesian inference with active inference, where actions are guided by predictions and sensory feedb...
Rugby teamOlympicFull nameOlympic Club Rugby Football ClubUnionUSA RugbyNickname(s)Winged 'O' ruggersFounded1908; 115 years ago (1908)League(s)Pacific Rugby Premiership Team kit The Olympic Club RFC is a Pacific Rugby Premiership team based in San Francisco, California, and were the 2019 Division II USA national champions.[1] Known as the OC, or the Winged 'O' ruggers, the rugby team was formed in 1908 under the wing of the San Francisco Olympic Club, an athletic clu...
Doctor y farmacéutico. Das Buch des Lebens por Marsilius Ficinus, Florencia 1508. Anuncio de una farmacia en Roma, Italia La historia de la farmacia es el estudio de la evolución histórica de la farmacia y la farmacéutica. Por lo tanto, los historiadores de la farmacia se ocupan principalmente de la historia de los medicamentos, de la farmacoterapia, de las farmacias y boticas y de la industria farmacéutica, así como de las personas que influyeron en el campo. La historia de la farmacia...
Scenario voor de energietransitie in Duitsland De energietransitie in Duitsland, ook wel de Energiewende genoemd, is het plan om in Duitsland over te schakelen op betaalbare hernieuwbare energie. Hierbij worden ook alle kerncentrales uiterlijk tegen 2022 buiten dienst gesteld. Hiermee wil men de uitstoot van CO2 verminderen met 80 tot 95% tegen 2050, in vergelijking met 1990. Men streeft ernaar om tegen 2050 zestig procent van de energie uit hernieuwbare bronnen te halen. Duitsland investeert...
Mathematical function between groups that preserves multiplication structure This article includes a list of references, related reading, or external links, but its sources remain unclear because it lacks inline citations. Please help to improve this article by introducing more precise citations. (March 2022) (Learn how and when to remove this template message) Depiction of a group homomorphism (h) from G (left) to H (right). The oval inside H is the image of h. N is the kernel of h and aN is...
Esta página cita fontes, mas que não cobrem todo o conteúdo. Ajude a inserir referências. Conteúdo não verificável pode ser removido.—Encontre fontes: ABW • CAPES • Google (N • L • A) (Outubro de 2020) Ninja Slayer ニンジャスレイヤーNinja Sureiyā Ninja SlayerLogótipo da série. Informações gerais Gêneros Ação, Drama, Cyberpunk Ninja Slayer Light novel Escrita por Bradley Bond, Philip Ninj@ Morzez Ilustrad...
American college basketball season 1949–50 Idaho Vandals men's basketballConferencePacific Coast ConferenceRecord15–17 (7–9 PCC)Head coachCharles Finley (3rd season)Home arenaMemorial GymnasiumSeasons← 1948–491950–51 → 1949–50 Pacific Coast Conference men's basketball standings vte Conf Overall Team W L PCT W L PCT North Washington State 11 – 5 .688 19 – 13 .594 Washington 8 – 8 .500...