Pipeline (computing)

In computing, a pipeline, also known as a data pipeline, is a set of data processing elements connected in series, where the output of one element is the input of the next one. The elements of a pipeline are often executed in parallel or in time-sliced fashion. Some amount of buffer storage is often inserted between elements.

Concept and motivation

Pipelining is a commonly used concept in everyday life. For example, in the assembly line of a car factory, each specific task—such as installing the engine, installing the hood, and installing the wheels—is often done by a separate work station. The stations carry out their tasks in parallel, each on a different car. Once a car has had one task performed, it moves to the next station. Variations in the time needed to complete the tasks can be accommodated by "buffering" (holding one or more cars in a space between the stations) and/or by "stalling" (temporarily halting the upstream stations), until the next station becomes available.

Suppose that assembling one car requires three tasks that take 20, 10, and 15 minutes, respectively. Then, if all three tasks were performed by a single station, the factory would output one car every 45 minutes. By using a pipeline of three stations, the factory would output the first car in 45 minutes, and then a new one every 20 minutes.

As this example shows, pipelining does not decrease the latency, that is, the total time for one item to go through the whole system. It does however increase the system's throughput, that is, the rate at which new items are processed after the first one.

In computing

In computing, a pipeline or data pipeline[1] is a set of data processing elements connected in series, where the output of one element is the input of the next one. The elements of a pipeline are often executed in parallel or in time-sliced fashion. Some amount of buffer storage is often inserted between elements.

Computer-related pipelines include:

Design considerations

Balancing the stages

Since the throughput of a pipeline cannot be better than that of its slowest element, the designer should try to divide the work and resources among the stages so that they all take the same time to complete their tasks. In the car assembly example above, if the three tasks took 15 minutes each, instead of 20, 10, and 15 minutes, the latency would still be 45 minutes, but a new car would then be finished every 15 minutes, instead of 20.

Buffering

Under ideal circumstances, if all processing elements are synchronized and take the same amount of time to process, then each item can be received by each element just as it is released by the previous one, in a single clock cycle. That way, the items will flow through the pipeline at a constant speed, like waves in a water channel. In such "wave pipelines",[2] no synchronization or buffering is needed between the stages, besides the storage needed for the data items.

More generally, buffering between the pipeline stages is necessary when the processing times are irregular, or when items may be created or destroyed along the pipeline. For example, in a graphics pipeline that processes triangles to be rendered on the screen, an element that checks the visibility of each triangle may discard the triangle if it is invisible, or may output two or more triangular pieces of the element if they are partly hidden. Buffering is also needed to accommodate irregularities in the rates at which the application feeds items to the first stage and consumes the output of the last one.

The buffer between two stages may be simply a hardware register with suitable synchronization and signalling logic between the two stages. When a stage A stores a data item in the register, it sends a "data available" signal to the next stage B. Once B has used that data, it responds with a "data received" signal to A. Stage A halts, waiting for this signal, before storing the next data item into the register. Stage B halts, waiting for the "data available" signal, if it is ready to process the next item but stage A has not provided it yet.

If the processing times of an element are variable, the whole pipeline may often have to stop, waiting for that element and all the previous ones to consume the items in their input buffers. The frequency of such pipeline stalls can be reduced by providing space for more than one item in the input buffer of that stage. Such a multiple-item buffer is usually implemented as a first-in, first-out queue. The upstream stage may still have to be halted when the queue gets full, but the frequency of those events will decrease as more buffer slots are provided. Queueing theory can tell the number of buffer slots needed, depending on the variability of the processing times and on the desired performance.

Nonlinear pipelines

If some stage takes (or may take) much longer than the others, and cannot be sped up, the designer can provide two or more processing elements to carry out that task in parallel, with a single input buffer and a single output buffer. As each element finishes processing its current data item, it delivers it to the common output buffer, and takes the next data item from the common input buffer. This concept of "non-linear" or "dynamic" pipeline is exemplified by shops or banks that have two or more cashiers serving clients from a single waiting queue.

Dependencies between items

In some applications, the processing of an item Y by a stage A may depend on the results or effect of processing a previous item X by some later stage B of the pipeline. In that case, stage A cannot correctly process item Y until item X has cleared stage B.

This situation occurs very often in instruction pipelines. For example, suppose that Y is an arithmetic instruction that reads the contents of a register that was supposed to have been modified by an earlier instruction X. Let A be the stage that fetches the instruction operands, and B be the stage that writes the result to the specified register. If stage A tries to process instruction Y before instruction X reaches stage B, the register may still contain the old value, and the effect of Y would be incorrect.

In order to handle such conflicts correctly, the pipeline must be provided with extra circuitry or logic that detects them and takes the appropriate action. Strategies for doing so include:

  • Stalling: Every affected stage, such as A, is halted until the dependency is resolved—that is, until the required information is available and/or the required state has been achieved.
  • Reordering items: Instead of stalling, stage A may put item Y aside and look for any subsequent item Z in its input stream that does not have any dependencies pending with any earlier item. In instruction pipelines, this technique is called out-of-order execution.
  • Guess and backtrack: One important example of item-to-item dependency is the handling of a conditional branch instruction X by an instruction pipeline. The first stage A of the pipeline, that fetches the next instruction Y to be executed, cannot perform its task until X has fetched its operand and determined whether the branch is to be taken or not. That may take many clock cycles, since the operand of X may in turn depend on previous instructions that fetch data from main memory.
Rather than halt while waiting for X to be finished, stage A may guess whether the branch will be taken or not, and fetch the next instruction Y based on that guess. If the guess later turns out to be incorrect (hopefully rarely), the system would have to backtrack and resume with the correct choice. Namely, all the changes that were made to the machine's state by stage A and subsequent stages based on that guess would have to be undone, the instructions following X already in the pipeline would have to be flushed, and stage A would have to restart with the correct instruction pointer. This branch prediction strategy is a special case of speculative execution.

Typical software implementations

To be effectively implemented, data pipelines need a CPU scheduling strategy to dispatch work to the available CPU cores, and the usage of data structures on which the pipeline stages will operate on. For example, UNIX derivatives may pipeline commands connecting various processes' standard IO, using the pipes implemented by the operating system. Some operating systems[example needed] may provide UNIX-like syntax to string several program runs in a pipeline, but implement the latter as simple serial execution, rather than true pipelining—namely, by waiting for each program to finish before starting the next one.[citation needed]

Lower level approaches may rely on the threads provided by the operating system to schedule work on the stages: both thread pool-based implementations or on a one-thread-per-stage are viable, and exist.[3]

Other strategies relying on cooperative multitasking exist, that do not need multiple threads of execution and hence additional CPU cores, such as using a round-robin scheduler with a coroutine-based framework. In this context, each stage may be instantiated with its own coroutine, yielding control back to the scheduler after finishing its round task. This approach may need careful control over the process' stages to avoid them abuse their time slice.

Costs and drawbacks

A pipelined system typically requires more resources (circuit elements, processing units, computer memory, etc.) than one that executes one batch at a time, because its stages cannot share those resources, and because buffering and additional synchronization logic may be needed between the elements.

Moreover, the transfer of items between separate processing elements may increase the latency, especially for long pipelines.

The additional complexity cost of pipelining may be considerable if there are dependencies between the processing of different items, especially if a guess-and-backtrack strategy is used to handle them. Indeed, the cost of implementing that strategy for complex instruction sets has motivated some radical proposals to simplify computer architecture, such as RISC and VLIW. Compilers also have been burdened with the task of rearranging the machine instructions so as to improve the performance of instruction pipelines.

New technologies

It's true that in recent years the demands on applications and their underlying hardware have been significant. For example, building pipelines with single node applications that trawl through the data row by row is no longer feasible with the volume and variety of big data. However, with the advent of data analytics engines such as Hadoop, or more recently Apache Spark, it's been possible to distribute large datasets across multiple processing nodes, allowing applications to reach heights of efficiency several hundred times greater than was thought possible before. The effect of this today is that even a mid-level PC using distributed processing in this fashion can handle the building and running of big data pipelines.[4]

See also

References

  1. ^ Data Pipeline Development Archived 2018-05-24 at the Wayback Machine Published by Dativa, retrieved 24 May, 2018
  2. ^ O. Hauck; Sorin A. Huss; M. Garg (1999). "Two-phase asynchronous wave-pipelines and their application to a 2D-DCT". Proceedings. Fifth International Symposium on Advanced Research in Asynchronous Circuits and Systems. pp. 219–228. doi:10.1109/ASYNC.1999.761536. ISBN 0-7695-0031-5. S2CID 206515615.
  3. ^ "MTDP". GitHub. September 2022.
  4. ^ What is a Data Pipeline? Published by Data Pipelines, retrieved 11 March 2021

Bibliography

Read other articles:

Часть серии статей о Холокосте Идеология и политика Расовая гигиена · Расовый антисемитизм · Нацистская расовая политика · Нюрнбергские расовые законы Шоа Лагеря смерти Белжец · Дахау · Майданек · Малый Тростенец · Маутхаузен ·&...

 

artikel ini perlu dirapikan agar memenuhi standar Wikipedia. Tidak ada alasan yang diberikan. Silakan kembangkan artikel ini semampu Anda. Merapikan artikel dapat dilakukan dengan wikifikasi atau membagi artikel ke paragraf-paragraf. Jika sudah dirapikan, silakan hapus templat ini. (Pelajari cara dan kapan saatnya untuk menghapus pesan templat ini) 'Tak' FujimotoNama asalTakashi FujimotoLahir12 Juli 1939 (umur 84)San Diego, California, Amerika SerikatTempat tinggalSanta Fe, New Mexi...

 

Untuk kegunaan lain, lihat Kasino (disambiguasi). CasinoPoster film CasinoSutradaraMartin ScorseseProduserBarbara De FinaDitulis olehNicholas Pileggi & Martin Scorsese (skenario) Nicholas Pileggi (buku)PemeranRobert De NiroJoe PesciSharon StoneFrank VincentDon RicklesPasquale CajanoJames WoodsAlan KingNaratorRobert De NiroFrank VincentJoe PesciSinematograferRobert RichardsonPenyuntingThelma SchoonmakerDistributorUniversal PicturesTanggal rilis22 November 1995Durasi178 menitNegara Ame...

العلاقات الجزائرية اليابانية   الجزائر   اليابان تعديل مصدري - تعديل   العلاقات الجزائرية اليابانية (بالإنجليزية: Algeria–Japan relations)‏ ، هي العلاقات الدبلوماسية الرسمية بين اليابان والجزائر ، وتقع السفارة الجزائرية في قلب العاصمة اليابانية طوكيو ، وكما تقع مقر ال...

 

Koin Apodakos Apodakos adalah seorang raja Karakene,[1][2] sebuah kerajaan yang diduga vasal dari Kekaisaran Partia. Apodakos dikenal dari koin perak dan perunggunya, hanya beberapa di antaranya yang diberi tanggal.[3] Koin-koin tersebut berasal dari tahun 110-109 hingga 104-103 SM. Pada 124 SM, Hispaosines, raja pertama Karakene meninggal. Setelah kematiannya, jandanya Thalassia mencoba menempatkan putra mereka ke atas takhta. Namun, peristiwa seputar suksesi diketahu...

 

American musical Mack & MabelOriginal Broadway RecordingMusicJerry HermanLyricsJerry HermanBookMichael StewartProductions1974 Broadway 1995 West End 2006 West End revival Mack & Mabel is a musical with a book by Michael Stewart and music and lyrics by Jerry Herman. The plot involves the tumultuous romantic relationship between Hollywood director Mack Sennett and Mabel Normand (transformed from an artist's model to a waitress from Flatbush, Brooklyn for the musical), who became one of ...

John Everett Millais' Kemenangan Ya Tuhan! (1871) menggambarkan Musa mengangkat tangannya selama Pertempuran dari Rafidim, dibantu oleh Hur (kiri) dan Harun. Hur (Ibrani: חור) adalah seorang sahabat Musa dan Harun sebagaimana dicatat dalam Alkitab Ibrani atau Perjanjian Lama di Alkitab Kristen. Dia adalah anggota dari Suku Yehuda. Identitasnya masih belum jelas dalam Taurat itu sendiri, tetapi diuraikan dalam komentari kerabian. Orang lain bernama Hur juga disebutkan dalam Alkitab. Hur...

 

Pepper LunchペッパーランチIndustriRestoranDidirikan1994KantorpusatTokyo, JepangTokohkunciKunio Ichinose (pendiri)ProdukBistik, ayam, minuman berkarbonasi dan lain-lainSitus webpepperlunch.com Restoran Pepper Lunch di Shunde, China Pepper Lunch (ペッパーランチcode: ja is deprecated , Peppā-ranchi) adalah waralaba restoran bistik cepat saji terkenal di wilayah Tokyo. Terdapat lebih dari 200 restoran di Jepang, Hong Kong, Korea Selatan, China, Taiwan, Malaysia, Australia, Indonesi...

 

ХристианствоБиблия Ветхий Завет Новый Завет Евангелие Десять заповедей Нагорная проповедь Апокрифы Бог, Троица Бог Отец Иисус Христос Святой Дух История христианства Апостолы Хронология христианства Раннее христианство Гностическое христианство Вселенские соборы Н...

Данио-рерио Научная классификация Домен:ЭукариотыЦарство:ЖивотныеПодцарство:ЭуметазоиБез ранга:Двусторонне-симметричныеБез ранга:ВторичноротыеТип:ХордовыеПодтип:ПозвоночныеИнфратип:ЧелюстноротыеГруппа:Костные рыбыКласс:Лучепёрые рыбыПодкласс:Новопёрые рыбыИн�...

 

هذه المقالة تحتاج للمزيد من الوصلات للمقالات الأخرى للمساعدة في ترابط مقالات الموسوعة. فضلًا ساعد في تحسين هذه المقالة بإضافة وصلات إلى المقالات المتعلقة بها الموجودة في النص الحالي. (يونيو 2023) يفتقر محتوى هذه المقالة إلى الاستشهاد بمصادر. فضلاً، ساهم في تطوير هذه المقالة ...

 

本條目存在以下問題,請協助改善本條目或在討論頁針對議題發表看法。 此條目需要編修,以確保文法、用詞、语气、格式、標點等使用恰当。 (2013年8月6日)請按照校對指引,幫助编辑這個條目。(幫助、討論) 此條目剧情、虛構用語或人物介紹过长过细,需清理无关故事主轴的细节、用語和角色介紹。 (2020年10月6日)劇情、用語和人物介紹都只是用於了解故事主軸,輔助�...

  لمعانٍ أخرى، طالع مخباط (توضيح). مخباط (سلاح)   النوع سلاح كليل  [لغات أخرى]‏  تعديل مصدري - تعديل   رجلان يتقاتلان بمخباطين مُسنَّنين طويلي المقبض. المِخباط[1] هو سلاح يتكون من رأسِ ضربٍ متصل مع مقبض بحبل مرن أو شريطة أو سلسلة. كانت الميزة التكتيكية ال�...

 

English marine painter This article's lead section may be too short to adequately summarize the key points. Please consider expanding the lead to provide an accessible overview of all important aspects of the article. (September 2020) Portrait of John Wilson Carmichael by Thomas Ellerby, 1839 (South Shields Museum & Art Gallery). James John Wilson Carmichael (9 June 1800 – 1868), also known as John Carmichael was a British marine painter. Life Carmichael was born at the Ouseburn, in New...

 

Intelligence agency of Mongolia General Intelligence Agency of MongoliaТагнуулын ерөнхий газарThe Ulaanbaatar headquarters of the GIAAgency overviewFormed1922Preceding agencyInternal Security AgencyJurisdictionGovernment of MongoliaHeadquartersUlaanbaatarAgency executivePeljee Odonbaatar, GIA DirectorWebsiteGeneral Intelligence Agency The General Intelligence Agency of Mongolia (GIA; Mongolian language: Тагнуулын ерөнхий газар) is the intelligence age...

Malay-language system of styles, titles and honorifics This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages) This article includes a list of general references, but it lacks sufficient corresponding inline citations. Please help to improve this article by introducing more precise citations. (February 2009) (Learn how and when to remove this message) This article needs additional citations for v...

 

Kiandra as built, in her original livery History NameKiandra, Kirrule and Kubu OperatorSydney Ferries LimitedSydney Harbour Transport Board Port of registrySydney BuilderMorrison & Sinclair Ltd, at Balmain CostKirrule: £16,415, Kiandra: £17,087, and Kubu: £17,138.[2] LaunchedKirrule 1910, Kiandra 1911, Kubu 1912 Out of serviceKirrule and Kiandra 1951, Kubu 1959 IdentificationKubu: O/N 131525[1] General characteristics Tonnage258 tonnes Length42.7 m Beam9.2 m Propulsion...

 

Conference on computer graphics For the Special Interest Group, see ACM SIGGRAPH. SIGGRAPH ConferenceStatusActiveGenreComputer graphics conferenceFrequencyAnnualCountryUS, CanadaAsia: Australia, China, India, Japan, Singapore, South Korea, ThailandInaugurated1974; 50 years ago (1974)Most recent2024 (Denver)Next event2025 (Vancouver)Organized byACM SIGGRAPHWebsitesiggraph.org An exhibitor at SIGGRAPH 2023 demoing a full-body motion capture suit. Hydraulophone, presented at SI...

British businessman (born 1951) This article's tone or style may not reflect the encyclopedic tone used on Wikipedia. See Wikipedia's guide to writing better articles for suggestions. (February 2010) (Learn how and when to remove this message) Victor ChandlerBorn (1951-04-18) 18 April 1951 (age 73)[1]EducationHighgate School Millfield SchoolOccupationBookmakerTitleFormer chairman, Victor Chandler InternationalRelativesWilliam Chandler (grandfather)Websitewww.betvictor.com Victor ...

 

Si ce bandeau n'est plus pertinent, retirez-le. Cliquez ici pour en savoir plus. Cet article présente des problèmes à corriger. Vous pouvez aider à l'améliorer ou bien discuter des problèmes sur sa page de discussion. Il ne cite pas suffisamment ses sources. Vous pouvez indiquer les passages à sourcer avec {{référence nécessaire}} ou {{Référence souhaitée}}, et inclure les références utiles en les liant aux notes de bas de page. (Marqué depuis mars 2024) Son texte doit être w...