LINPACK benchmarks

Original author(s): Jack Dongarra, Jim Bunch, Cleve Moler, and Gilbert Stewart
Initial release: 1979
Website: netlib.org/benchmark/hpl/

The LINPACK Benchmarks are a measure of a system's floating-point computing power. Introduced by Jack Dongarra, they measure how fast a computer solves a dense n by n system of linear equations Ax = b, which is a common task in engineering.

The latest version of these benchmarks is used to build the TOP500 list, ranking the world's most powerful supercomputers.[1]

The aim is to approximate how fast a computer will perform when solving real problems. This is a simplification, since no single computational task can reflect the overall performance of a computer system. Nevertheless, LINPACK benchmark performance provides a useful correction to the peak performance quoted by the manufacturer. The peak performance is the maximum theoretical performance a computer can achieve, calculated as the machine's frequency, in cycles per second, times the number of operations per cycle it can perform. The actual performance will always be lower than the peak performance.[2] The performance of a computer is a complex issue that depends on many interconnected variables. The performance measured by the LINPACK benchmark is the number of 64-bit floating-point operations, generally additions and multiplications, that a computer can perform per second, also known as FLOPS. However, a computer's performance when running actual applications is likely to fall well short of the maximum performance it achieves when running the appropriate LINPACK benchmark.[3]
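The peak-performance formula described above can be sketched in a few lines. This is a minimal illustration, not a vendor's method: the function name `peak_flops`, the `cores` multiplier, and the sample machine figures are all assumptions introduced here.

```python
def peak_flops(clock_hz: float, flops_per_cycle: int, cores: int = 1) -> float:
    """Theoretical peak: cycles per second times operations per cycle.

    The `cores` factor is an assumption added for multi-core machines;
    the text above only mentions frequency and operations per cycle.
    """
    return clock_hz * flops_per_cycle * cores


# Hypothetical machine: 2.0 GHz clock, 16 FP64 operations per cycle, 64 cores.
rpeak = peak_flops(2.0e9, 16, 64)
print(f"Rpeak = {rpeak / 1e9:.1f} GFLOPS")
```

A measured LINPACK rate for the same machine would be expected to come out below this figure.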

The name of these benchmarks comes from the LINPACK package, a collection of Fortran subroutines for linear algebra that was widely used in the 1980s and was initially tightly linked to the LINPACK benchmark. The LINPACK package has since been replaced by other libraries.

History

The LINPACK benchmark report first appeared in 1979 as an appendix to the LINPACK users' manual.[4]

LINPACK was designed to help users estimate the time required by their systems to solve a problem using the LINPACK package, by extrapolating the performance results obtained by 23 different computers solving a matrix problem of size 100.

This matrix size was chosen due to memory and CPU limitations at that time:

  • 10,000 floating-point entries from -1 to 1 are randomly generated to fill in a general, dense matrix,
  • then, LU decomposition with partial pivoting is used for the timing.
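The two steps above can be sketched in plain Python. This is only an illustration of the idea, not the original Fortran benchmark: the function names, the fixed random seed, the choice of right-hand side (row sums, so the exact solution is all ones), and timing the solve together with the factorization are assumptions made here.

```python
import random
import time


def lu_factor(a):
    """Factor a square matrix in place with partial (row) pivoting.

    On return, `a` holds U on and above the diagonal and the multipliers
    of the unit lower-triangular L below it. The returned list records
    which original row ended up in each position.
    """
    n = len(a)
    perm = list(range(n))
    for k in range(n):
        # Partial pivoting: pick the largest-magnitude entry in column k.
        p = max(range(k, n), key=lambda i: abs(a[i][k]))
        if a[p][k] == 0.0:
            raise ValueError("matrix is singular")
        if p != k:
            a[k], a[p] = a[p], a[k]
            perm[k], perm[p] = perm[p], perm[k]
        row_k = a[k]
        for i in range(k + 1, n):
            row_i = a[i]
            m = row_i[k] / row_k[k]
            row_i[k] = m  # store the multiplier in place of the zeroed entry
            for j in range(k + 1, n):
                row_i[j] -= m * row_k[j]
    return perm


def lu_solve(a, perm, b):
    """Solve Ax = b from the factors produced by lu_factor."""
    n = len(a)
    x = [b[p] for p in perm]          # apply the row permutation to b
    for i in range(n):                # forward substitution (unit diagonal L)
        for j in range(i):
            x[i] -= a[i][j] * x[j]
    for i in range(n - 1, -1, -1):    # back substitution with U
        for j in range(i + 1, n):
            x[i] -= a[i][j] * x[j]
        x[i] /= a[i][i]
    return x


n = 100
rng = random.Random(0)  # fixed seed, chosen only for repeatability
a = [[rng.uniform(-1.0, 1.0) for _ in range(n)] for _ in range(n)]
b = [sum(row) for row in a]  # right-hand side chosen so that x is all ones

start = time.perf_counter()
perm = lu_factor(a)
x = lu_solve(a, perm, b)
elapsed = time.perf_counter() - start

max_error = max(abs(xi - 1.0) for xi in x)
print(f"n = {n}, elapsed = {elapsed:.4f} s, max error = {max_error:.2e}")
```

An interpreted implementation like this one says little about the hardware; it is meant to show the shape of the computation that the benchmark times.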

Over the years, additional versions were released with different problem sizes, such as matrices of order 300 and 1000, and with different constraints, allowing new optimization opportunities as hardware architectures started to implement matrix-vector and matrix-matrix operations.[5]

Parallel processing was also introduced in the LINPACK Parallel benchmark in the late 1980s.[2]

In 1991, the LINPACK benchmark was modified to solve problems of arbitrary size,[6] enabling high-performance computers (HPC) to approach their asymptotic performance.

Two years later this benchmark was used for measuring the performance of the first TOP500 list.

The benchmarks

LINPACK 100

LINPACK 100 is very similar to the original benchmark published in 1979 along with the LINPACK users' manual. The solution is obtained by Gaussian elimination with partial pivoting, with $\tfrac{2}{3}n^3 + 2n^2$ floating-point operations, where n is 100, the order of the dense matrix A that defines the problem. Its small size and lack of software flexibility do not allow most modern computers to reach their performance limits. However, it can still be useful for predicting performance in numerically intensive user-written code that relies on compiler optimization.[2]
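As a rough illustration of how the operation count turns a measured time into a rate, the sketch below evaluates the formula and divides by an elapsed time. The function names and the 0.5-second timing are hypothetical values chosen for the example.

```python
def linpack_flops(n: int) -> float:
    """Nominal operation count used by the benchmark: (2/3)n^3 + 2n^2."""
    return (2.0 / 3.0) * n**3 + 2.0 * n**2


def gflops(n: int, seconds: float) -> float:
    """Rate in GFLOPS for a solve of order n that took `seconds`."""
    return linpack_flops(n) / seconds / 1e9


print(f"operations for n = 100:  {linpack_flops(100):,.0f}")
print(f"operations for n = 1000: {linpack_flops(1000):,.0f}")
# Hypothetical timing: a solve of order 1000 that took 0.5 seconds.
print(f"rate: {gflops(1000, 0.5):.3f} GFLOPS")
```

Because the count is fixed by n alone, two implementations that take the same time are credited with the same rate, whatever arithmetic they actually perform.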

LINPACK 1000

LINPACK 1000 can provide a performance nearer to the machine's limit because, in addition to offering a bigger problem size (a matrix of order 1000), it permits changes to the algorithm. The only constraints are that the relative accuracy cannot be reduced and that the number of operations is always taken to be $\tfrac{2}{3}n^3 + 2n^2$, with n = 1000.[2]

HPLinpack

The previous benchmarks are not suitable for testing parallel computers,[7] so the Linpack Highly Parallel Computing benchmark, or HPLinpack benchmark, was introduced. In HPLinpack, the size n of the problem can be made as large as needed to optimize the performance results of the machine. Once again, $\tfrac{2}{3}n^3 + 2n^2$ is taken as the operation count, regardless of the algorithm used. Use of the Strassen algorithm is not allowed because it distorts the real execution rate.[8] The accuracy must be such that the following expression is satisfied:

$$\frac{\|Ax - b\|}{\|A\|\,\|x\|\,n\,\varepsilon} \le O(1),$$

where

  • $\varepsilon$ is the machine's precision,
  • n is the size of the problem,[9]
  • $\|\cdot\|$ is a matrix norm (largest row sum[10]), and
  • $O(\cdot)$ corresponds to the big-O notation.
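The accuracy check can be sketched as a scaled residual built from the quantities just listed: the residual of Ax = b divided by the norms of A and x, the problem size n, and the machine precision. In this sketch the largest-row-sum norm is used for the matrix and the largest absolute entry for vectors; the function names, the use of `sys.float_info.epsilon` as the machine precision, and the `THRESHOLD` value standing in for the O(1) bound are assumptions, not values taken from the benchmark rules.

```python
import sys


def matrix_inf_norm(a):
    """Largest row sum of absolute values."""
    return max(sum(abs(v) for v in row) for row in a)


def vector_inf_norm(v):
    """Largest absolute entry."""
    return max(abs(e) for e in v)


def scaled_residual(a, x, b):
    """Residual of Ax = b scaled by the norms of A and x, n, and epsilon."""
    n = len(a)
    r = [sum(a[i][j] * x[j] for j in range(n)) - b[i] for i in range(n)]
    eps = sys.float_info.epsilon  # assumed stand-in for machine precision
    return vector_inf_norm(r) / (matrix_inf_norm(a) * vector_inf_norm(x) * n * eps)


THRESHOLD = 16.0  # hypothetical constant standing in for the O(1) bound

a = [[2.0, 1.0], [1.0, 3.0]]
b = [3.0, 4.0]
exact = [1.0, 1.0]
perturbed = [1.0, 1.001]

print(scaled_residual(a, exact, b) <= THRESHOLD)      # the exact solution passes
print(scaled_residual(a, perturbed, b) <= THRESHOLD)  # the perturbed one fails
```

Dividing by n and the machine precision makes the bound independent of problem size and arithmetic, so the same threshold can be applied to very different runs.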

For each computer system, the following quantities are reported:[2]

  • Rmax: the performance in GFLOPS for the largest problem run on a machine.
  • Nmax: the size of the largest problem run on a machine.
  • N1/2: the size where half the Rmax execution rate is achieved.
  • Rpeak: the theoretical peak performance in GFLOPS for the machine.

These results are used twice a year to compile the TOP500 list of the world's most powerful computers.[1] TOP500 measures these quantities in double-precision floating-point format (FP64). The ratio of Rmax to Rpeak is called parallel efficiency or HPL efficiency.[11] It is typically lower the more nodes a system has, because of communication overhead. For example, a 1990s Cray Y-MP achieved about 90% HPL efficiency,[12] while Frontier achieved about 70% in 2023.[13]
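The efficiency ratio is a single division. The sketch below uses made-up Rmax and Rpeak figures purely for illustration; they are not taken from any TOP500 entry, and the function name is an assumption.

```python
def hpl_efficiency(rmax: float, rpeak: float) -> float:
    """Ratio of measured Rmax to theoretical Rpeak (both in the same unit)."""
    if rpeak <= 0:
        raise ValueError("rpeak must be positive")
    return rmax / rpeak


# Hypothetical system: Rmax = 700 GFLOPS measured, Rpeak = 1000 GFLOPS.
efficiency = hpl_efficiency(700.0, 1000.0)
print(f"HPL efficiency: {efficiency:.0%}")
```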

LINPACK benchmark implementations

The previous section describes the ground rules for the benchmarks. Actual implementations of the program can vary, with examples available in Fortran,[14] C[15] and Java.[16]

HPL

HPL is a portable implementation of HPLinpack written in C. It was originally intended as a guideline but is now widely used to provide data for the TOP500 list, though other technologies and packages can be used. HPL generates a linear system of equations of order n and solves it using LU decomposition with partial row pivoting. It requires installed implementations of MPI and either BLAS or VSIPL to run.[17]

Coarsely, the algorithm has the following characteristics:[18][19]

  • cyclic data distribution in 2D blocks
  • LU factorization using the right-looking variant with various depths of look-ahead
  • recursive panel factorization
  • six different panel broadcasting variants
  • bandwidth reducing swap-broadcast algorithm
  • backward substitution with look-ahead of depth 1
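The first item in the list, cyclic distribution in 2D blocks, can be illustrated with a small mapping function. This is a generic sketch of a block-cyclic layout, not code from HPL: the function name `owner`, the parameter names, and the 8 by 8 example with 2 by 2 blocks on a 2 by 2 process grid are assumptions made here.

```python
def owner(i: int, j: int, nb: int, p: int, q: int) -> tuple:
    """Process-grid coordinates owning matrix entry (i, j).

    The matrix is cut into nb-by-nb blocks, and the blocks are dealt out
    cyclically over a p-by-q grid of processes, in both dimensions.
    """
    return ((i // nb) % p, (j // nb) % q)


# Hypothetical layout: 8x8 matrix, 2x2 blocks, 2x2 process grid.
n, nb, p, q = 8, 2, 2, 2
for i in range(n):
    print(" ".join("{}{}".format(*owner(i, j, nb, p, q)) for j in range(n)))
```

Each printed pair is the (row, column) coordinate of the owning process. Dealing blocks out cyclically means every process keeps a share of the trailing submatrix as the factorization advances, which helps keep the work balanced.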

Criticism

The LINPACK benchmark is said to have succeeded for three reasons: the scalability[20] of HPLinpack, the fact that it generates a single number, which makes results easy to compare, and the extensive historical database associated with it.[21] However, soon after its release, the LINPACK benchmark was criticized for providing performance levels "generally unobtainable by all but a very few programmers who tediously optimize their code for that machine and that machine alone",[22] because it tests only the solution of dense linear systems, which are not representative of all the operations usually performed in scientific computing.[23] Jack Dongarra, the main driving force behind the LINPACK benchmarks, said that they emphasize only "peak" CPU speed and number of CPUs, and that not enough stress is given to local bandwidth and the network.[24]

Thom Dunning, Jr., director of the National Center for Supercomputing Applications, had this to say about the LINPACK benchmark: "The Linpack benchmark is one of those interesting phenomena -- almost anyone who knows about it will deride its utility. They understand its limitations but it has mindshare because it's the one number we've all bought into over the years."[25]

According to Dongarra, "the organizers of the Top500 are actively looking to expand the scope of the benchmark reporting" because "it is important to include more performance characteristic and signatures for a given system".[26] One of the possibilities that is being considered to extend the benchmark for the TOP500 is the HPC Challenge Benchmark Suite.[27] With the advent of petascale computers, traversed edges per second have started to emerge as a complementary metric to FLOPS measured by LINPACK. Another such metric is the HPCG benchmark, proposed by Dongarra.[28]

The running time issue

According to Jack Dongarra, the running time required to obtain good performance results with HPLinpack is expected to increase. At a conference held in 2010, he said he expected running times of 2.5 days in "a few years".[29]

References

  1. ^ a b "The Linpack Benchmark, TOP500 Supercomputing Sites". Retrieved 2015-02-10.
  2. ^ a b c d e Dongarra, Jack J.; Luszczek, Piotr; Petitet, Antoine (2003), "The LINPACK Benchmark: past, present and future" (PDF), Concurrency and Computation: Practice and Experience, 15 (9), John Wiley & Sons, Ltd.: 803–820, doi:10.1002/cpe.728, S2CID 1900724
  3. ^ Jack Dongarra interview by Sander Olson, archived from the original on 2016-03-04, retrieved 2012-01-13
  4. ^ Dongarra, J.J.; Moler, C.B.; Bunch, J.R.; Stewart, G.W. (1979), LINPACK: users' guide, SIAM, ISBN 9780898711721
  5. ^ Dongarra, Jack (1988), "The LINPACK Benchmark: An explanation", Supercomputing (PDF), Lecture Notes in Computer Science, vol. 297, Springer Berlin/Heidelberg, pp. 456–474, doi:10.1007/3-540-18991-2_27, ISBN 978-3-540-18991-6
  6. ^ High Performance Linpack Benchmark (PDF), retrieved 2015-02-10
  7. ^ Bailey, D.H.; Barszcz, E.; Barton, J.T.; Browning, D.S.; Carter, R.L.; Dagum, L.; Fatoohi, R.A.; Frederickson, P.O.; Lasinski, T.A.; Schreiber, R.S.; Simon, H.D.; Venkatakrishnan, V.; Weeratunga, S.K. (1991). "The NAS parallel benchmarks---summary and preliminary results". Proceedings of the 1991 ACM/IEEE conference on Supercomputing - Supercomputing '91. pp. 158–165. doi:10.1145/125826.125925. ISBN 0897914597. S2CID 18046345.
  8. ^ "LINPACK FAQ - Can I use Strassen's Method when doing the matrix multiples in the HPL benchmark or for the Top500 run?". Retrieved 2015-02-10.
  9. ^ "LINPACK FAQ - To what accuracy must be the solution conform?". Retrieved 2015-02-10.
  10. ^ "LINPACK dlange". Retrieved 2024-03-09.
  11. ^ "High-Performance Linpack (HPL) benchmarking - Theoretical Peak Performance Rpeak". Retrieved 2023-11-15.
  12. ^ "Y-MP C916/12256". TOP500. 1995-12-01. Retrieved 2023-11-15.
  13. ^ "Frontier - HPE Cray EX235a, AMD Optimized 3rd Generation EPYC 64C 2GHz, AMD Instinct MI250X, Slingshot-11". TOP500. 2023-11-12. Retrieved 2023-11-15.
  14. ^ "Linpack benchmark program in Fortran". Retrieved 2015-02-10.
  15. ^ "Linpack benchmark program in C". Retrieved 2015-02-10.
  16. ^ "Linpack benchmark program in Java". Retrieved 2015-02-10.
  17. ^ "HPL - A Portable Implementation of the High-Performance Linpack Benchmark for Distributed-Memory Computers". Retrieved 2015-02-10.
  18. ^ "HPL algorithm".
  19. ^ "HPL overview". Retrieved 2015-02-10.
  20. ^ "AN INTERVIEW WITH SUPERCOMPUTING LEGEND JACK DONGARRA". 2002-05-24.
  21. ^ Haigh, Thomas (2004). "An interview with Jack J. Dongarra" (PDF). LINPACK is a benchmark that people often cite because there's such a historical data base of information there, because it's fairly easy to run, it's fairly easy to understand, and it captures in some sense the best and worst of programming.
  22. ^ Hammond, Steven (1995), Beyond Machoflops: Getting MPPs Into the Production Environment
  23. ^ Gahvari, Hormozd; Hoemmen, Mark; Demmel, James; Yelick, Katherine (2006), "Benchmarking Sparse Matrix-Vector Multiply in Five Minutes", SPEC Benchmark Workshop (PDF)
  24. ^ Dongarra, Jack J. (2007), "The HPC Challenge Benchmark: A Candidate for Replacing Linpack in the Top500?", SPEC Benchmark Workshop (PDF)
  25. ^ Christopher Mims (2010-11-08). "Why China's New Supercomputer Is Only Technically the World's Fastest". Retrieved 2011-09-22.
  26. ^ Meuer, Martin (2002-05-24). "AN INTERVIEW WITH SUPERCOMPUTING LEGEND JACK DONGARRA". Retrieved 2022-12-01.
  27. ^ Luszczek, Piotr; Dongarra, Jack J.; Koester, David; Rabenseifner, Rolf; Lucas, Bob; Kepner, Jeremy; Mccalpin, John; Bailey, David; Takahashi, Daisuke (2005), Introduction to the HPC Challenge Benchmark Suite (PDF)
  28. ^ Hemsoth, Nicole (June 26, 2014). "New HPC Benchmark Delivers Promising Results". HPCWire. Retrieved 2022-12-01.
  29. ^ Dongarra, Jack J. (2010). LINPACK Benchmark with Time Limits on Multicore & GPU Based Accelerators (PDF). International Supercomputing Conference.
