Audio-to-video synchronization

Audio-to-video synchronization (AV synchronization, also known as lip sync, or by the lack of it: lip-sync error, lip flap) refers to the relative timing of audio (sound) and video (image) parts during creation, post-production (mixing), transmission, reception and play-back processing. AV synchronization can be an issue in television, videoconferencing, or film.

In industry terminology, the lip-sync error is expressed as the amount of time the audio departs from perfect synchronization with the video where a positive time number indicates the audio leads the video and a negative number indicates the audio lags the video.[1] This terminology and standardization of the numeric lip-sync error is utilized in the professional broadcast industry as evidenced by the various professional papers,[2] standards such as ITU-R BT.1359-1, and other references below.

Digital or analog audio video streams or video files usually contain some sort of synchronization mechanism, either in the form of interleaved video and audio data or by explicit relative timestamping of data.

Sources of error

There are different ways in which the AV-sync can get incorrectly synchronized.

During creation AV-sync errors happen because of internal AV-sync error due to different signal processing delays between image and sound in video camera and microphone. The AV-sync delay is normally fixed. External AV-sync errors can occur if a microphone is placed far away from the sound source, the audio will be out of sync because the speed of sound is much lower than the speed of light. If the sound source is 340 meters from the microphone, then the sound arrives approximately 1 second later than the light. The AV-sync delay increases with distance. During mixing of video clips normally either the audio or video needs to be delayed so they are synchronized. The AV-sync delay is static but can vary with the individual clip. Video editing effects can delay video causing it to lag the audio.

Transmission (broadcasting), reception and playback that can get introduce AV-sync errors. A video camera with built-in microphones or line-in may not delay sound and video paths by the same amount. Solid-state video cameras (e.g. charge-coupled device (CCD) and CMOS image sensors) can delay the video signal by one or more frames. Audio and video signal processing circuitry exists with significant (and potentially non-constant) delays in television systems. Particular video signal processing circuitry that is widely used and contributes significant video delays include frame synchronizers, digital video effects processors, video noise reduction, format converters and compression systems.

Processing circuits format conversion and deinterlace processing in video monitors can add one or more frames of video delay. A video monitor with built-in speakers or line-out may not delay sound and video paths equally. Some video monitors contain internal user-adjustable audio delays to aid in correction of errors.

Some transmission protocols like RTP require an out-of-band method for synchronizing media streams. In some RTP systems, each media stream has its own timestamp using an independent clock rate and per-stream randomized starting value. A RTCP Sender Report (SR) may be needed for each stream in order to synchronize streams.[3]

Effect of no explicit AV-sync timing

When a digital or analog AV system stream does not have a synchronization method or mechanism, the stream may become out of sync. In film movies these timing errors are most commonly caused by worn films skipping over the movie projector sprockets because the film has torn sprocket holes. Errors can also be caused by the projectionist misthreading the film in the projector.

Synchronization errors have become a significant problem in the digital television industry because of the use of large amounts of video signal processing in television production, television broadcasting and pixelated television displays such as LCD, DLP and plasma displays. Pixelated displays utilize complex video signal processing to convert the resolution of the incoming video signal to the native resolution of the pixelated display, for example converting standard definition video to be displayed on a high definition display. Synchronization problems are commonly caused when significant amounts of video processing is performed on the video part of the television program. Typical sources of significant video delays in the television field include video synchronizers and video compression encoders and decoders. Particularly troublesome encoders and decoders are used in MPEG compression systems utilized for broadcasting digital television and storing television programs on consumer and professional recording and playback devices.

In broadcast television, it is not unusual for lip-sync error to vary by over 100 ms (several video frames) from time to time. AV-sync is commonly corrected and maintained with an audio synchronizer. Television industry standards organizations have established acceptable amounts of audio and video timing error and suggested practices related to maintaining acceptable timing.[4][1] The EBU Recommendation R37 "The relative timing of the sound and vision components of a television signal" states that end-to-end audio/video sync should be within +40 ms and -60 ms (audio before/after video, respectively) and that each stage should be within +5 ms and -15 ms.[5]

Viewer experience of incorrectly synchronized AV-sync

The result typically leaves a filmed or televised character's mouth movements mismatching spoken dialog, hence the term lip flap or lip-sync error. The resulting audio-video sync error can be annoying to the viewer and may even cause the viewer to not enjoy the program, decrease the effectiveness of the program or lead to a negative perception of the speaker on the part of the viewer.[6] The potential loss of effectiveness is of particular concern for product commercials and political candidates. Television industry standards organizations, such as the Advanced Television Systems Committee, have become involved in setting standards for audio-video sync errors.[4]

Because of these annoyances, AV-sync error is a concern to the television programming industry, including television stations, networks, advertisers and program production companies. Unfortunately, the advent of high-definition flat-panel display technologies (LCD, DLP and plasma), which can delay video more than audio, has moved the problem into the viewer's home and beyond the control of the television programming industry alone. Consumer product companies now offer audio-delay adjustments to compensate for video-delay changes in TVs, soundbars and A/V receivers,[7] and several companies manufacture dedicated digital audio delays made exclusively for lip-sync error correction.

Recommendations

For television applications, the Advanced Television Systems Committee recommends that audio should lead video by no more than 15 ms and audio should lag video by no more than 45 ms.[4] However, the ITU performed strictly controlled tests with expert viewers and found that the threshold for detectability is 45 ms lead to 125 ms lag.[1] For film, acceptable lip sync is considered to be no more than 22 milliseconds in either direction.[5][8]

The Consumer Electronics Association has published a set of recommendations for how digital television receivers should implement A/V sync.[9]

SMPTE ST2064

SMPTE standard ST2064, published in 2015,[10] provides technology to reduce or eliminate lip-sync errors in digital television. The standard utilizes audio and video fingerprints taken from a television program. The fingerprints can be recovered and used to correct the accumulated lip-sync error. When fingerprints have been generated for a TV program, and the required technology is incorporated, the viewer's television set has the ability to continuously measure and correct lip-sync errors.[11][12]

Timestamps

Presentation time stamps (PTS) are embedded in MPEG transport streams to precisely signal when each audio and video segment is to be presented and avoid AV-sync errors. However, these timestamps are often added after the video undergoes frame synchronization, format conversion and preprocessing, and thus the lip sync errors created by these operations will not be corrected by the addition and use of timestamps.[13][14][15][16]

The Real-time Transport Protocol clocks media using origination timestamps on an arbitrary timeline. A real-time clock such as one delivered by the Network Time Protocol or Precision Time Protocol and described in the Session Description Protocol[17] associated with the media may be used to synchronize media. A server may then be used for synchronization between multiple receivers.[18]

See also

References

  1. ^ a b c "ITU-R BT.1359-1, Relative Timing of Sound and Vision for Broadcasting" (PDF). ITU. 1998. Retrieved 30 May 2015.
  2. ^ Patrick Waddell; Graham Jones; Adam Goldberg. "Audio/Video Standards and Solutions A Status Report" (PDF). ATSC. Archived from the original (PDF) on 17 February 2016. Retrieved 4 April 2012.
  3. ^ RFC 3550
  4. ^ a b c IS-191: Relative Timing of Sound and Vision for Broadcast Operations, ATSC, 2003-06-26, archived from the original on 2012-03-21
  5. ^ a b "The relative timing of the sound and vision components of a television signal" (PDF).
  6. ^ Byron Reeves; David Voelker (October 1993). "Effects of Audio-Video Asynchrony on Viewer's Memory, Evaluation of Content and Detection Ability" (PDF). Archived from the original (PDF) on 2 October 2008. Retrieved 2008-10-19.
  7. ^ "Lip-sync error: Causes, solutions". Retrieved 2024-06-13.
  8. ^ Sara Kudrle; et al. (July 2011). "Fingerprinting for Solving A/V Synchronization Issues within Broadcast Environments". Motion Imaging Journal. SMPTE. Appropriate A/V sync limits have been established and the range that is considered acceptable for film is +/- 22 ms. The range for video, according to the ATSC, is up to 15 ms lead time and about 45 ms lag time
  9. ^ Consumer Electronics Association. "CEA-CEB20 R-2013: A/V Synchronization Processing Recommended Practice". Archived from the original on 2015-05-30.
  10. ^ ST 2064:2015 - SMPTE Standard - Audio to Video Synchronization Measurement, SMPTE, 2015
  11. ^ SMPTE Standards Update: The Lip-Sync Challenge, SMPTE, 10 December 2013, archived from the original on 2021-12-15
  12. ^ SMPTE Standards Update: The Lip-Sync Challenge (PDF), SMPTE, 10 December 2013, archived from the original (PDF) on 2016-08-26, retrieved 2016-06-09
  13. ^ "MPEG-2 Systems FAQ: 19. Where are the PTSs and DTSs inserted?". Archived from the original on 2008-07-26. Retrieved 2007-12-27.
  14. ^ Arpi (7 May 2003). "MPlayer-G2-dev: mpeg container's timing (PTS values)".
  15. ^ "birds-eye.net: DTS - Decode Time Stamp".
  16. ^ "SVCD2DVD: Author and burn DVDs: AVI to DVD, DivX to DVD, Xvid to DVD, MPEG to DVD, SVCD to DVD, VCD to DVD, PAL to NTSC conversion, HDTV2DVD, HDTV to DVD, BLURAY". www.svcd2dvd.com.
  17. ^ A. Williams; K. Gross; et al. (June 2014). RTP Clock Source Signalling. Internet Engineering Task Force. doi:10.17487/RFC7273. RFC 7273. Proposed Standard.
  18. ^ R. van Brandenburg; et al. (June 2014). Inter-Destination Media Synchronization (IDMS) Using the RTP Control Protocol (RTCP). Internet Engineering Task Force. doi:10.17487/RFC7272. RFC 7272. Proposed Standard.

Further reading

Read other articles:

Мерія Харкова Міський голова Харкова — головна посадова особа Харкова, що представляє його інтереси, обирається на 5 років та здійснює свої повноваження на постійній основі. Очолює виконавчий комітет міської ради, головує на засіданнях міської ради. На цей час посаду ...

 

Samdech Akeak Moha Thomak PothisalChea SimGCRS NMChea Sim pada 2012 Presiden SenatMasa jabatan25 Maret 1999 – 8 Juni 2015Penguasa monarkiNorodom SihanoukNorodom SihamoniPerdana MenteriHun SenWakil PresidenSay ChhumTep Ngorn PendahuluDidirikan kembali (Jabatan terakhir kali dipegang oleh Peter Khoy Saukam)PenggantiSay ChhumPresiden Partai Rakyat KambojaMasa jabatan17 Oktober 1991 – 8 Juni 2015WakilHun Sen PendahuluHeng Samrin sebagai Sekretaris Jenderal Partai Revolusione...

 

Town in Virginia, United StatesAshland, VirginiaTownAshland Town HallNickname: The Center of the Universe[1]Location in Hanover County and the state of VirginiaCoordinates: 37°45′34″N 77°28′38″W / 37.75944°N 77.47722°W / 37.75944; -77.47722CountryUnited StatesStateVirginiaCountyHanoverFounded1858Government • TypeCouncil-Manager • MayorJames R. Foley • Town ManagerJoshua FarrarArea[2] • To...

I Cuntrera-Caruana è stata una Famiglia di Cosa Nostra che ha ottenuto una posizione chiave nel traffico di stupefacenti e nel riciclaggio di denaro sporco tra gli anni ottanta e novanta. La stampa italiana del periodo li ribattezzò come i Rothschild della Mafia o i banchieri di Cosa Nostra. Indice 1 Storia 2 Elementi di spicco 3 Note 4 Bibliografia Storia Nel secondo dopoguerra i Cuntrera e i Caruana svolgevano le occupazioni di campieri e gabellotti nelle tenute del barone Agnello nei pre...

 

Questa voce o sezione sull'argomento informatica non cita le fonti necessarie o quelle presenti sono insufficienti. Puoi migliorare questa voce aggiungendo citazioni da fonti attendibili secondo le linee guida sull'uso delle fonti. Segui i suggerimenti del progetto di riferimento. Interno di un centro di elaborazione dati con armadi e relativi rack. Un centro elaborazione dati (CED) o in inglese data center è una funzione all'interno di un'organizzazione (impresa, ente pubblico, associ...

 

طاحونة مائية طلا أبادمعلومات عامةنوع المبنى طاحونة مائيةالمكان قوجد[1][2] المنطقة الإدارية مقاطعة كاشمر[2] البلد  إيرانالصفة التُّراثيَّةتصنيف تراثي المعالم الوطنية الإيرانية[1][2] (2005 – ) التفاصيل التقنيةمواد البناء طابوق الطين التصميم والإنشاءالنم...

Astronomical diagram graphing two colour indices A color–color diagram is a means of comparing the colors of an astronomical object at different wavelengths. Astronomers typically observe at narrow bands around certain wavelengths, and objects observed will have different brightnesses in each band. The difference in brightness between two bands is referred to as color. On color–color diagrams, the color defined by two wavelength bands is plotted on the horizontal axis, and the color defin...

 

Durga PujaNama resmidurg pujaNama lainAkalbodhan, Vijaya Dashami, Dashain, and DussehraDirayakan olehOrang HinduJenisHinduMulainavratriTanggalScript error: The function "getRawValue" does not exist.Tahun 2024date missing (please add)Terkait denganDussehra Durgapuja - The Festival of Bengalies Durga Puja (diucapkan [ˈd̪ʊɾga 'puja], bahasa Bengali: দুর্গাপূজা, bahasa Assam: দুৰ্গা পূজা, bahasa Oriya: ଦୁର୍ଗା ପ�...

 

U.S. presidential administration from 2017 to 2021 For a chronological guide, see Timeline of the Donald Trump presidency. This article may be too long to read and navigate comfortably. Consider splitting content into sub-articles, condensing it, or adding subheadings. Please discuss this issue on the article's talk page. (April 2024)Presidency of Donald TrumpJanuary 20, 2017 – January 20, 2021CabinetSee listPartyRepublicanElection2016SeatWhite House← Barack ObamaJoe Bid...

2006 California lieutenant gubernatorial election ← 2002 November 7, 2006 2010 →   Nominee John Garamendi Tom McClintock Party Democratic Republican Popular vote 4,189,584 3,845,858 Percentage 49.12% 45.09% County resultsGaramendi:      40–50%      50–60%      60–70%      70–80%McClintock:      40–50%     &#...

 

PT Bakrie Indo Infrastructure TbkIndustriInfrastrukturDidirikan2008 (2008) di Jakarta, IndonesiaKantorpusatJakarta, IndonesiaTokohkunciAD Erlangga, Direktur Chandra Devi Muharam, Head Of Legal & Admin Krisnaraga Syarfuan, Direktur Bambang Banyudoyo, Direktur Bakrie Oil & Gas Infrastructure Ali Herman, Direktur Utama Bakrie Power.IndukBakrie & BrothersAnakusahaPT Bakrie Power PT Bakrie Oil & Gas Infrastructure PT Bakrie Toll Indonesia PT Bakrie Port Indonesia PT Bakrie Tel...

 

King of France from 1547 to 1559 Henry II1559 portraitKing of France (more...) Reign31 March 1547 – 10 July 1559Coronation25 July 1547PredecessorFrancis ISuccessorFrancis IIDuke of BrittanyReign10 August 1536 – 31 March 1547PredecessorFrancis IIISuccessorPosition abolished (Brittany absorbed into the crown lands of France)BornHenry, Duke of Orléans31 March 1519Château de Saint-Germain-en-LayeDied10 July 1559 (aged 40)Hôtel des TournellesBurial13 August 1559Saint Denis BasilicaSpouse Ca...

Historical society and museum in West Chester, Pennsylvania Chester County History CenterEstablished1893Location225 North High StreetWest Chester, PennsylvaniaTypeHistorical/MuseumDirectorConor HeppWebsitemycchc.org Horticultural HallU.S. Historic districtContributing property Built1848[1]ArchitectThomas U. WalterPart ofWest Chester Downtown Historic District (ID85001447[2])Designated CPJuly 2, 1985 Chester County History Center (CCHC), formerly the Chester County Histori...

 

vteLists of United Kingdom locations Aa-Ak Al Am-Ar As-Az Bab-Bal Bam-Bap Bar Bas-Baz Bea-Bem Ben-Bez Bi Bla-Blac Blad-Bly Boa-Bot Bou-Boz Bra Bre-Bri Bro-Bron Broo-Brt Bru-Bun Bur-Bz Ca-Cap Car-Cd Ce-Chap Char-Che Chi-Ck Cl-Cn Co-Col Com-Cor Cos-Cou Cov-Coy Cra Cre-Croc Croe-Cros Crot-Croz Cru-Cu Cw-Cz Da-Dam Dan-Ddu De-Dee Deo-Dn Do-Dor Dos-Doz Dr Ds-Dz Ea-Eass East A-D East E-L East M-Y Eat-Ee Ef-El Em-Ez Fa-Fe Ff-Fn Fo Fr-Fz Gab-Gan Gao-Gar Gas-Gaz Ge-Gl Gm-Gq Gr-Gred Gree-Gz Ha-Ham Han-...

 

Latvian trainer aircraft VEF I-17 VEF I-17 at the VEF factory, 1940 Role Trainer aircraftType of aircraft National origin Latvia Manufacturer VEF Designer Kārlis Irbītis First flight 1940 Introduction 1940 Primary users Latvian Air ForceLuftwaffe Number built 6 VEF I-17 was a Latvian trainer aircraft (intended also as a fighter) designed in 1939 by Kārlis Irbītis. The I-17 was test flown in early 1940 and almost immediately accepted by Latvian Air Force. It was produced by the VEF fa...

Військово-музичне управління Збройних сил України Тип військове формуванняЗасновано 1992Країна  Україна Емблема управління Військово-музичне управління Збройних сил України — структурний підрозділ Генерального штабу Збройних сил України призначений для планува...

 

Sports event in Sacramento, California International athletics championship event1981 USA Outdoor Track and Field ChampionshipsDatesJune 20–21Host citySacramento, California, United StatesVenueHughes Stadium Sacramento City College← 1980 1982 → The 1981 USA Outdoor Track and Field Championships took place between June 20–21 at Hughes Stadium on the campus of Sacramento City College in Sacramento, California. The 20K racewalk was held May 3 in Kenosha, Wisconsin. The decathlon ...

 

Norwegian biologist Hanna Resvoll-Holmsen Hanna Marie Resvoll-Holmsen (née Resvoll) (11 September 1873 in Vågå, Oppland – 13 March 1943 in Oslo) was a Norwegian botanist – a female pioneer in Norwegian natural history education and nature conservation together with her sister, Thekla Resvoll. Life Hanna Resvoll-Holmsen suffered much from illness in her childhood and school attendance after her 12th year was sporadic. She took a high school exam in 1902, at which time she had also an un...

37th BSFC Awards December 11, 2016 Best Film: La La Land The 37th Boston Society of Film Critics Awards, honoring the best in filmmaking in 2016, were given on December 11, 2016.[1] Winners Damien Chazelle, Best Director winner Casey Affleck, Best Actor winner Isabelle Huppert, Best Actress winner Mahershala Ali, Best Supporting Actor winner Best Film: La La Land Best Director: Damien Chazelle – La La Land Runner-up: Kenneth Lonergan – Manchester by the Sea Best Actor: Casey Affl...

 

Events at the2005 World ChampionshipsTrack events100 mmenwomen200 mmenwomen400 mmenwomen800 mmenwomen1500 mmenwomen5000 mmenwomen10,000 mmenwomen100 m hurdleswomen110 m hurdlesmen400 m hurdlesmenwomen3000 msteeplechasemenwomen4 × 100 m relaymenwomen4 × 400 m relaymenwomenRoad eventsMarathonmenwomen20 km walkmenwomen50 km walkmenField eventsHigh jumpmenwomenPole vaultmenwomenLong jumpmenwomenTriple jumpmenwomenShot putmenwomenDiscus throwmenwomenHammer throwmenwomenJavelin throwmenwomenComb...