Amdahl's law

Amdahl's Law demonstrates the theoretical maximum speedup of an overall system and the concept of diminishing returns. If exactly 50% of the work can be parallelized, the best possible speedup is 2 times. If 95% of the work can be parallelized, the best possible speedup is 20 times. According to the law, even with an infinite number of processors, the speedup is constrained by the unparallelizable portion.

In computer architecture, Amdahl's law (or Amdahl's argument[1]) is a formula that shows how much faster a task can be completed when you add more resources to the system.

The law can be stated as:

"the overall performance improvement gained by optimizing a single part of a system is limited by the fraction of time that the improved part is actually used".[2]

It is named after computer scientist Gene Amdahl, and was presented at the American Federation of Information Processing Societies (AFIPS) Spring Joint Computer Conference in 1967.

Amdahl's law is often used in parallel computing to predict the theoretical speedup when using multiple processors.

Definition

In the context of Amdahl's law, speedup can be defined as: [3]

Amdahl's law can be formulated in the following way: [4]

where

  • represents the total speedup of a program
  • represents time spent on the code where parallelism is used
  • represents the extent of the improvement

The is frequently much lower than one might expect. For instance, if a programmer enhances a part of the code that represents 10% of the total execution time (i.e. of 0.10) and achieves a of 10,000, then becomes 1.11 which means only 11% improvement in total speedup of the program. So, despite a massive improvement in one section, the overall benefit is quite small. In another example, if the programmer optimizes a section that accounts for 99% of the execution time (i.e. of 0.99) with a speedup factor of 100 (i.e. of 100), the only reaches 50. This indicates that half of the potential performance gain is lost due to the remaining 1% of execution time that was not improved. [4]

Implications

Followings are implications of Amdahl's law: [5][6]

  • Balance efforts between improving both parallelizable and non-parallelizable parts of a tasks to get the best overall performance improvement.
  • Diminishing Returns: Adding more processors gives diminishing returns. Beyond a certain point, adding more processors doesn't significantly increase speedup.
  • Limited Speedup: Even with many processors, there's a limit to how much faster a task can be completed due to parts of the task that cannot be parallelized.

Limitations

Followings are limitations of Amdahl's law: [7][3][8]

  • Assumption of Fixed Workload: Amdahl's Law assumes that the workload remains constant. It doesn't account for dynamic or increasing workloads, which can impact the effectiveness of parallel processing.
  • Overhead Ignored: Amdahl's Law neglects overheads associated with concurrency, including coordination, synchronization, inter-process communication, and concurrency control.
  • Neglecting extrinsic factors: Amdahl's Law addresses computational parallelism, neglecting extrinsic factors such as data persistence, I/O operations, and memory access overheads, and assumes idealized conditions.
  • Scalability Issues: While it highlights the limits of parallel speedup, it doesn't address practical scalability issues, such as the cost and complexity of adding more processors.
  • Non-Parallelizable Work: Amdahl's Law emphasizes the non-parallelizable portion of the task as a bottleneck but doesn’t provide solutions for reducing or optimizing this portion.
  • Assumes Homogeneous Processors: It assumes that all processors are identical and contribute equally to speedup, which may not be the case in heterogeneous computing environments.

Amdahl's law applies only to the cases where the problem size is fixed. In practice, as more computing resources become available, they tend to get used on larger problems (larger datasets), and the time spent in the parallelizable part often grows much faster than the inherently serial work. In this case, Gustafson's law gives a less pessimistic and more realistic assessment of the parallel performance.[9]

Universal Scalability Law (USL), developed by Neil J. Gunther, extends the Amdahl's law and accounts for the additional overhead due to inter-process communication. USL quantifies scalability based on parameters such as contention and coherency. [10]

Derivation

A task executed by a system whose resources are improved compared to an initial similar system can be split up into two parts:

  • a part that does not benefit from the improvement of the resources of the system;
  • a part that benefits from the improvement of the resources of the system.

An example is a computer program that processes files. A part of that program may scan the directory of the disk and create a list of files internally in memory. After that, another part of the program passes each file to a separate thread for processing. The part that scans the directory and creates the file list cannot be sped up on a parallel computer, but the part that processes the files can.

The execution time of the whole task before the improvement of the resources of the system is denoted as . It includes the execution time of the part that would not benefit from the improvement of the resources and the execution time of the one that would benefit from it. The fraction of the execution time of the task that would benefit from the improvement of the resources is denoted by . The one concerning the part that would not benefit from it is therefore . Then:

It is the execution of the part that benefits from the improvement of the resources that is accelerated by the factor after the improvement of the resources. Consequently, the execution time of the part that does not benefit from it remains the same, while the part that benefits from it becomes:

The theoretical execution time of the whole task after the improvement of the resources is then:

Amdahl's law gives the theoretical speedup in latency of the execution of the whole task at fixed workload , which yields

Parallel programs

If 30% of the execution time may be the subject of a speedup, p will be 0.3; if the improvement makes the affected part twice as fast, s will be 2. Amdahl's law states that the overall speedup of applying the improvement will be:

For example, assume that we are given a serial task which is split into four consecutive parts, whose percentages of execution time are p1 = 0.11, p2 = 0.18, p3 = 0.23, and p4 = 0.48 respectively. Then we are told that the 1st part is not sped up, so s1 = 1, while the 2nd part is sped up 5 times, so s2 = 5, the 3rd part is sped up 20 times, so s3 = 20, and the 4th part is sped up 1.6 times, so s4 = 1.6. By using Amdahl's law, the overall speedup is

Notice how the 5 times and 20 times speedup on the 2nd and 3rd parts respectively don't have much effect on the overall speedup when the 4th part (48% of the execution time) is accelerated by only 1.6 times.

Serial programs

Assume that a task has two independent parts, A and B. Part B takes roughly 25% of the time of the whole computation. By working very hard, one may be able to make this part 5 times faster, but this reduces the time of the whole computation only slightly. In contrast, one may need to perform less work to make part A perform twice as fast. This will make the computation much faster than by optimizing part B, even though part B's speedup is greater in terms of the ratio, (5 times versus 2 times).

For example, with a serial program in two parts A and B for which TA = 3 s and TB = 1 s,

  • if part B is made to run 5 times faster, that is s = 5 and p = TB/(TA + TB) = 0.25, then
  • if part A is made to run 2 times faster, that is s = 2 and p = TA/(TA + TB) = 0.75, then

Therefore, making part A to run 2 times faster is better than making part B to run 5 times faster. The percentage improvement in speed can be calculated as

  • Improving part A by a factor of 2 will increase overall program speed by a factor of 1.60, which makes it 37.5% faster than the original computation.
  • However, improving part B by a factor of 5, which presumably requires more effort, will achieve an overall speedup factor of 1.25 only, which makes it 20% faster.

Optimizing the sequential part of parallel programs

If the non-parallelizable part is optimized by a factor of , then

It follows from Amdahl's law that the speedup due to parallelism is given by

When , we have , meaning that the speedup is measured with respect to the execution time after the non-parallelizable part is optimized.

When ,

If , and , then:

Transforming sequential parts of parallel programs into parallelizable

Next, we consider the case wherein the non-parallelizable part is reduced by a factor of , and the parallelizable part is correspondingly increased. Then

It follows from Amdahl's law that the speedup due to parallelism is given by

Relation to the law of diminishing returns

Amdahl's law is often conflated with the law of diminishing returns, whereas only a special case of applying Amdahl's law demonstrates law of diminishing returns. If one picks optimally (in terms of the achieved speedup) what is to be improved, then one will see monotonically decreasing improvements as one improves. If, however, one picks non-optimally, after improving a sub-optimal component and moving on to improve a more optimal component, one can see an increase in the return. Note that it is often rational to improve a system in an order that is "non-optimal" in this sense, given that some improvements are more difficult or require larger development time than others.

Amdahl's law does represent the law of diminishing returns if one is considering what sort of return one gets by adding more processors to a machine, if one is running a fixed-size computation that will use all available processors to their capacity. Each new processor added to the system will add less usable power than the previous one. Each time one doubles the number of processors the speedup ratio will diminish, as the total throughput heads toward the limit of 1/(1 − p).

This analysis neglects other potential bottlenecks such as memory bandwidth and I/O bandwidth. If these resources do not scale with the number of processors, then merely adding processors provides even lower returns.

An implication of Amdahl's law is that to speed up real applications which have both serial and parallel portions, heterogeneous computing techniques are required.[11] There are novel speedup and energy consumption models based on a more general representation of heterogeneity, referred to as the normal form heterogeneity, that support a wide range of heterogeneous many-core architectures. These modelling methods aim to predict system power efficiency and performance ranges, and facilitates research and development at the hardware and system software levels.[12][13]

See also

References

  1. ^ Rodgers, David P. (June 1985). "Improvements in multiprocessor system design". ACM SIGARCH Computer Architecture News. 13 (3). New York, NY, USA: ACM: 225–231 [p. 226]. doi:10.1145/327070.327215. ISBN 0-8186-0634-7. ISSN 0163-5964. S2CID 7083878.
  2. ^ Reddy, Martin (2011). API Design for C++. Burlington, Massachusetts: Morgan Kaufmann Publishers. p. 210. doi:10.1016/C2010-0-65832-9. ISBN 978-0-12-385003-4. LCCN 2010039601. OCLC 666246330.
  3. ^ a b Computer Architecture: A Quantitative Approach. Morgan Kaufmann. ISBN 978-8178672663.
  4. ^ a b Bakos, Jason D. (2016-01-01), Bakos, Jason D. (ed.), "Chapter 2 - Multicore and data-level optimization: OpenMP and SIMD", Embedded Systems, Boston: Morgan Kaufmann, pp. 49–103, doi:10.1016/b978-0-12-800342-8.00002-x, ISBN 978-0-12-800342-8, retrieved 2024-11-18
  5. ^ The Art of Multiprocessor Programming, Revised Reprint. Morgan Kaufmann. ISBN 9780123973375.
  6. ^ Programming Many-Core Chips. ISBN 9781441997395.
  7. ^ Amdahl, Gene M. (1967-04-18). "Validity of the single processor approach to achieving large scale computing capabilities". Proceedings of the April 18-20, 1967, spring joint computer conference. AFIPS '67 (Spring). New York, NY, USA: Association for Computing Machinery: 483–485. doi:10.1145/1465482.1465560. ISBN 978-1-4503-7895-6.
  8. ^ Parallel Computer Architecture A Hardware/Software Approach. Elsevier Science. ISBN 9781558603431.
  9. ^ McCool, Michael; Reinders, James; Robison, Arch (2013). Structured Parallel Programming: Patterns for Efficient Computation. Elsevier. p. 61. ISBN 978-0-12-415993-8.
  10. ^ Gunther, Neil (2007). Guerrilla Capacity Planning: A Tactical Approach to Planning for Highly Scalable Applications and Services. ISBN 978-3540261384.
  11. ^ Hill, Mark D.; Marty, Michael R. (2008). "Amdahl's Law in the Multicore Era". Computer. 41 (7): 33–38. CiteSeerX 10.1.1.221.8635. doi:10.1109/MC.2008.209.
  12. ^ Rafiev, Ashur; Al-Hayanni, Mohammed A. N.; Xia, Fei; Shafik, Rishad; Romanovsky, Alexander; Yakovlev, Alex (2018-07-01). "Speedup and Power Scaling Models for Heterogeneous Many-Core Systems". IEEE Transactions on Multi-Scale Computing Systems. 4 (3): 436–449. doi:10.1109/TMSCS.2018.2791531. ISSN 2332-7766. S2CID 52287374.
  13. ^ Al-hayanni, Mohammed A. Noaman; Xia, Fei; Rafiev, Ashur; Romanovsky, Alexander; Shafik, Rishad; Yakovlev, Alex (July 2020). "Amdahl's law in the context of heterogeneous many-core systems – a survey". IET Computers & Digital Techniques. 14 (4): 133–148. doi:10.1049/iet-cdt.2018.5220. ISSN 1751-8601. S2CID 214415079.

Further reading

Read other articles:

Часть серии статей о Холокосте Идеология и политика Расовая гигиена · Расовый антисемитизм · Нацистская расовая политика · Нюрнбергские расовые законы Шоа Лагеря смерти Белжец · Дахау · Майданек · Малый Тростенец · Маутхаузен ·&...

 

 

Saint-Gaudens double eagle ($20)Amerika SerikatNilai20 Dolar AmerikaMassa33.431 gDiameter34.1 mm (1.342 in)TepiTulisan E PLURIBUS UNUMKomposisi90% emas, 10% tembagaEmas.96750 ons troyTahun pencetakan1907–1933Tanda cetakD, S, Tanda mint terletak di bagian belakang, tepat di bawah tanggal. Philadelphia Mint, koin spesimen tanpa tanda mint.Depan DesainDewi Liberty membawa obor di tangan kanannya, dan ranting zaitun di tangan kirinya dengan latar belakang matahari terbi...

 

 

العلاقات التشادية التوفالية تشاد توفالو   تشاد   توفالو تعديل مصدري - تعديل   العلاقات التشادية التوفالية هي العلاقات الثنائية التي تجمع بين تشاد وتوفالو.[1][2][3][4][5] مقارنة بين البلدين هذه مقارنة عامة ومرجعية للدولتين: وجه المقارنة تشاد توف...

Jam Kiamat pada tahun 2020, digambarkan 100 detik menuju tengah malam Jam Kiamat (Inggris: Doomsday Clock) adalah jam simbolis yang mewakili kemungkinan risiko bencana global buatan manusia. Simbol ini dikelola sejak tahun 1947 oleh Bulletin of the Atomic Scientists di University of Chicago, Amerika Serikat. Semakin dekat mereka mengatur jam hingga tengah malam, semakin dekat mereka percaya dunia mengalami bencana global. Pada awalnya, Jam Kiamat, yang menggantung pada dinding kantor Bull...

 

 

Belief in certain typical characteristics for a grouping of people A 19th-century British children's book informs its readers that the Dutch are a very industrious race, and that Chinese children are very obedient to their parents. An ethnic stereotype or racial stereotype involves part of a system of beliefs about typical characteristics of members of a given ethnic group, their status, societal and cultural norms. A national stereotype does the same for a given nationality. The stereotypin...

 

 

Scaled Composites, LLCJenisDivisiIndustriPerusahaan dirgantaraDidirikanMojave, California (1982)PendiriBurt RutanKantorpusatMojave, CaliforniaTokohkunciCory Bird, presidenKevin Mickey, presiden emeritusProdukDesain, peralatan, dan pembuatan pesawat, spesialisasi di desain, analisis, dan fabrikasi struktur kompositPendapatan$20-30 jutaKaryawanlebih dari 200 orangIndukNorthrop GrummanSitus webwww.scaled.com Scaled Composites (terkadang hanya disebut sebagai Scaled) adalah sebuah perusahaan...

Voce principale: Associazione Calcio Legnano. A.C. LegnanoStagione 1967-1968Sport calcio Squadra Legnano Allenatore Carlo Facchini Presidente Augusto Terreni Serie C8º posto nel girone A Maggiori presenzeCampionato: Piacentini e Tomy (36) Miglior marcatoreCampionato: Mario Tomy (12) StadioComunale 1966-1967 1968-1969 Si invita a seguire il modello di voce Questa pagina raccoglie le informazioni riguardanti l'Associazione Calcio Legnano nelle competizioni ufficiali della stagione 19...

 

 

NANO HAXAGONAL SATELIT INASAT-1 Bagan satelit pertama Indonesia, INASAT-1 Launch Date TBD Launcher PSLV (reference) Orbit Altitude 800 km Sun-Synchronous (Inclination:98,57) Dimension 352 x 340 (Hexagonal) Mass -22 kg Power Maximum 26,3W Altitude Control Active Magnetorquering (Magnetorque) Frequency Band Uplink/Downlink: 436Mhz Main Computer RABIT 3000, 29,4Mhz Payloads: Inasat-1 Protocol System Experiment (IPSE) - Power System Modul Experiment (PSME) - On Board TCC Experiment (OB...

 

 

Therapy?I Therapy in concerto nel 2016 Paese d'origine Irlanda del Nord GenereRock alternativoAlternative metal Periodo di attività musicale1989 – in attività EtichettaWiiija, A&M, Ark 21, Spitfire, Demolition Records Album pubblicati15 Sito ufficiale Modifica dati su Wikidata · Manuale I Therapy? sono un gruppo alternative metal nordirlandese. Sono stati formati nel 1989 dal chitarrista/cantante Andy Cairns di Ballyclare e dal batterista Fyfe Ewing di La...

Pour les articles homonymes, voir Merci. Ordre de Notre-Dame-de-la-Merci Ordre de droit pontifical Approbation pontificale 17 janvier 1235par Grégoire IX Institut militaire, clérical (1317), mendiant (1690) Type ordre rédempteur et apostolique Règle Règle de saint Augustin But à l'origine : rachat des captifs (traite orientale) ;aujourd'hui : missions, travail social, visite des malades, des prisonniers Structure et histoire Fondation 10 août 1218Barcelone Fondateur Pie...

 

 

Andreas Pereira Pereira al Manchester Utd nel 2021 Nazionalità  Belgio Brasile (dal 2015) Altezza 177 cm Peso 66 kg Calcio Ruolo Centrocampista Squadra  Fulham CarrieraGiovanili 2003-2005 KVSK Utd2005-2011 PSV2011-2014 Manchester UtdSquadre di club1 2014-2016 Manchester Utd5 (0)2016-2017→  Granada35 (5)2017-2018→  Valencia23 (1)2018-2020 Manchester Utd40 (2)2020-2021→  Lazio26 (1)2021-2022→  Flamengo31 (7)[1]2022-&#...

 

 

此條目需要补充更多来源。 (2021年7月4日)请协助補充多方面可靠来源以改善这篇条目,无法查证的内容可能會因為异议提出而被移除。致使用者:请搜索一下条目的标题(来源搜索:美国众议院 — 网页、新闻、书籍、学术、图像),以检查网络上是否存在该主题的更多可靠来源(判定指引)。 美國眾議院 United States House of Representatives第118届美国国会众议院徽章 众议院旗...

UK Parliamentary by-election 1874 County Dublin by-election ← 1874 18 March 1874 1880 →   Candidate Thomas Edward Taylor Charles Stewart Parnell Party Irish Conservative Home Rule Popular vote 2,183 1,235 Percentage 63.9% 36.1% Location of County Dublin constituency within Ireland MP before election Thomas Edward Taylor Irish Conservative Subsequent MP Thomas Edward Taylor Irish Conservative A by-election was held on 18 March 1874 in County Dublin due to the in...

 

 

Ассеманієве Євангеліє Дата створення / заснування 10 століття Місце розташування Ватиканська бібліотека Мова твору або назви староцерковнослов’янська мова З матеріалу пергамент У збірках Ватиканська бібліотека[1] Інвентарний номер Vat. Slav. 3 Абетка глаголиця Повн�...

 

 

Weingart StadiumLocation1301 Avenida Cesar Chavez Monterey Park CA 91754 (on campus of East Los Angeles College)OwnerEast Los Angeles CollegeCapacity22,355ConstructionOpened1951Construction cost$3.1 millionTenantsEast Los Angeles College (football, men's and women's soccer[1]) (CCCAA) (1951–present)CSULA Diablos (1958–1961, 1970–1971)Los Angeles Aztecs (NASL) (1974)Olympic Field Hockey (IOC) (1984)Los Angeles Salsa (APSL) (1993–1994)East LA Classic (CIF) Weingart Stadium (form...

1957 film Victor and VictoriaGerman film posterGermanViktor und Viktoria Directed byKarl AntonWritten byReinhold Schünzel (1933 screenplay)Curt J. BraunProduced byWaldemar FrankStarringJohanna von KoczianGeorg ThomallaJohannes HeestersCinematographyWilly WintersteinEdited byAnnemarie RokossMusic byHeino GazeProductioncompanyCentral-Europa FilmDistributed byPrisma FilmRelease date 16 April 1957 (1957-04-16) Running time107 minutesCountryWest GermanyLanguageGerman Victor and Vic...

 

 

تراقيا الشرقية Doğu Trakya موقع إقليم تراقيا الشرقية (لون أزرق) الإحداثيات 41°09′13″N 27°22′00″E / 41.15361111°N 27.36666667°E / 41.15361111; 27.36666667   تقسيم إداري  البلد تركيا (1922–)  التقسيم الأعلى منطقة مرمرة  أهم المدن إسطنبول خصائص جغرافية  المساحة 23٬764 كم2 (9٬175 ميل2) ع�...

 

 

Pour les articles homonymes, voir Poitrenaud. Clément Poitrenaud Clément Poitrenaud en 2014. Fiche d'identité Naissance 20 mai 1982 (42 ans)[1]Castres (France) Taille 1,88 m (6′ 2″) Poste Arrière, centre Carrière en junior PériodeÉquipe  1989-2001 Stade toulousain Carrière en senior PériodeÉquipeM (Pts)a 2001-20162017 Stade toulousainSharks 356 (343)[2],[3] 4 (0)[3] Carrière en équipe nationale PériodeÉquipeM (Pts)b 2001-2012 France 47 (35) Carrière d'e...

アンドラーシュ2世II András ハンガリー国王 アンドラーシュ2世の肖像画(14世紀)在位 1205年5月7日 - 1235年4月23日戴冠式 1205年5月29日(セーケシュフェヘールヴァール)出生 1177年死去 1235年9月21日埋葬 ハンガリー王国、イグリシュ修道院配偶者 ゲルトルート・フォン・アンデクス  ヨランド・ド・クルトネー  ベアトリーチェ・デステ子女 後述家名 アールパード�...

 

 

The following highways are numbered 571: Canada Highway 571 Cuba Jarahueca–El Mamey Road (4-571) Ireland R571 road (Ireland) United Kingdom A571 road (Great Britain) B571 road United States LA 571 MD 571 (former) MS 571 Route 571 PR-571 FM 571 Preceded by570 Lists of highways571 Succeeded by572 vteList of highways numbered ...0–9 0 1 1A 1B 1D 1X 2 2A 2N 3 3A 3B 3C 3E 3G 4 4A 5 5A 5B 6 6A 6N 7 7A 7B 7C 8 9 9A 9B 9E 9W 10–16 10 10A 10N 11 11A 11B 11C 12 12A ...