Computer facial animation

Computer facial animation is primarily an area of computer graphics that encapsulates methods and techniques for generating and animating images or models of a character face. The character can be a human, a humanoid, an animal, a legendary creature or character, etc. Due to its subject and output type, it is also related to many other scientific and artistic fields from psychology to traditional animation. The importance of human faces in verbal and non-verbal communication and advances in computer graphics hardware and software have caused considerable scientific, technological, and artistic interests in computer facial animation.

Although development of computer graphics methods for facial animation started in the early-1970s, major achievements in this field are more recent and happened since the late 1980s.

The body of work around computer facial animation can be divided into two main areas: techniques to generate animation data, and methods to apply such data to a character. Techniques such as motion capture and keyframing belong to the first group, while morph targets animation (more commonly known as blendshape animation) and skeletal animation belong to the second. Facial animation has become well-known and popular through animated feature films and computer games but its applications include many more areas such as communication, education, scientific simulation, and agent-based systems (for example online customer service representatives). With the recent advancements in computational power in personal and mobile devices, facial animation has transitioned from appearing in pre-rendered content to being created at runtime.

History

Human facial expression has been the subject of scientific investigation for more than one hundred years. Study of facial movements and expressions started from a biological point of view. After some older investigations, for example by John Bulwer in the late 1640s, Charles Darwin's book The Expression of the Emotions in Men and Animals can be considered a major departure for modern research in behavioural biology.

Computer based facial expression modelling and animation is not a new endeavour. The earliest work with computer based facial representation was done in the early-1970s. The first three-dimensional facial animation was created by Parke in 1972. In 1973, Gillenson developed an interactive system to assemble and edit line drawn facial images. in 1974, Parke developed a parameterized three-dimensional facial model.

One of the most important attempts to describe facial movements was Facial Action Coding System (FACS). Originally developed by Carl-Herman Hjortsjö[1] in the 1960s and updated by Ekman and Friesen in 1978, FACS defines 46 basic facial Action Units (AUs). A major group of these Action Units represent primitive movements of facial muscles in actions such as raising brows, winking, and talking. Eight AU's are for rigid three-dimensional head movements, (i.e. turning and tilting left and right and going up, down, forward and backward). FACS has been successfully used for describing desired movements of synthetic faces and also in tracking facial activities.

The early-1980s saw the development of the first physically based muscle-controlled face model by Platt and the development of techniques for facial caricatures by Brennan. In 1985, the animated short film Tony de Peltrie was a landmark for facial animation. This marked the first time computer facial expression and speech animation were a fundamental part of telling the story.

The late-1980s saw the development of a new muscle-based model by Waters, the development of an abstract muscle action model by Magnenat-Thalmann and colleagues, and approaches to automatic speech synchronization by Lewis and Hill. The 1990s have seen increasing activity in the development of facial animation techniques and the use of computer facial animation as a key storytelling component as illustrated in animated films such as Toy Story (1995), Antz (1998), Shrek, and Monsters, Inc. (both 2001), and computer games such as Sims. Casper (1995), a milestone in this decade, was the first movie in which a lead actor was produced exclusively using digital facial animation.

The sophistication of the films increased after 2000. In The Matrix Reloaded and The Matrix Revolutions, dense optical flow from several high-definition cameras was used to capture realistic facial movement at every point on the face. Polar Express (film) used a large Vicon system to capture upward of 150 points. Although these systems are automated, a large amount of manual clean-up effort is still needed to make the data usable. Another milestone in facial animation was reached by The Lord of the Rings, where a character specific shape base system was developed. Mark Sagar pioneered the use of FACS in entertainment facial animation, and FACS based systems developed by Sagar were used on Monster House, King Kong, and other films.

Techniques

Generating facial animation data

The generation of facial animation data can be approached in different ways: 1.) marker-based motion capture on points or marks on the face of a performer, 2.) markerless motion capture techniques using different type of cameras, 3.) audio-driven techniques, and 4.) keyframe animation.

  • Motion capture uses cameras placed around a subject. The subject is generally fitted either with reflectors (passive motion capture) or sources (active motion capture) that precisely determine the subject's position in space. The data recorded by the cameras is then digitized and converted into a three-dimensional computer model of the subject. Until recently, the size of the detectors/sources used by motion capture systems made the technology inappropriate for facial capture. However, miniaturization and other advancements have made motion capture a viable tool for computer facial animation. Facial motion capture was used extensively in Polar Express by Imageworks where hundreds of motion points were captured. This film was very accomplished and while it attempted to recreate realism, it was criticized for having fallen in the 'uncanny valley', the realm where animation realism is sufficient for human recognition and to convey the emotional message but where the characters fail to be perceived as realistic. The main difficulties of motion capture are the quality of the data which may include vibration as well as the retargeting of the geometry of the points.
  • Markerless motion capture aims at simplifying the motion capture process by avoiding encumbering the performer with markers. Several techniques came out recently leveraging different sensors, among which standard video cameras, Kinect and depth sensors or other structured-light based devices. Systems based on structured light may achieve real-time performance without the use of any markers using a high speed structured light scanner. The system is based on a robust offline face tracking stage which trains the system with different facial expressions. The matched sequences are used to build a person-specific linear face model that is subsequently used for online face tracking and expression transfer.
  • Audio-driven techniques are particularly well fitted for speech animation. Speech is usually treated in a different way to the animation of facial expressions, this is because simple keyframe-based approaches to animation typically provide a poor approximation to real speech dynamics. Often visemes are used to represent the key poses in observed speech (i.e. the position of the lips, jaw and tongue when producing a particular phoneme), however there is a great deal of variation in the realisation of visemes during the production of natural speech. The source of this variation is termed coarticulation which is the influence of surrounding visemes upon the current viseme (i.e. the effect of context). To account for coarticulation current systems either explicitly take into account context when blending viseme keyframes [2] or use longer units such as diphone, triphone, syllable or even word and sentence-length units. One of the most common approaches to speech animation is the use of dominance functions introduced by Cohen and Massaro. Each dominance function represents the influence over time that a viseme has on a speech utterance. Typically the influence will be greatest at the center of the viseme and will degrade with distance from the viseme center. Dominance functions are blended together to generate a speech trajectory in much the same way that spline basis functions are blended together to generate a curve. The shape of each dominance function will be different according to both which viseme it represents and what aspect of the face is being controlled (e.g. lip width, jaw rotation etc.). This approach to computer-generated speech animation can be seen in the Baldi talking head. Other models of speech use basis units which include context (e.g. diphones, triphones etc.) instead of visemes. As the basis units already incorporate the variation of each viseme according to context and to some degree the dynamics of each viseme, no model of coarticulation is required. Speech is simply generated by selecting appropriate units from a database and blending the units together. This is similar to concatenative techniques in audio speech synthesis. The disadvantage to these models is that a large amount of captured data is required to produce natural results, and whilst longer units produce more natural results the size of database required expands with the average length of each unit. Finally, some models directly generate speech animations from audio. These systems typically use hidden Markov models or neural nets to transform audio parameters into a stream of control parameters for a facial model. The advantage of this method is the capability of voice context handling, the natural rhythm, tempo, emotional and dynamics handling without complex approximation algorithms. The training database is not needed to be labeled since there are no phonemes or visemes needed; the only needed data is the voice and the animation parameters.
  • Keyframe animation is the least automated of the processes to create animation data although it delivers the maximum amount of control over the animation. It is often used in combination with other techniques to deliver the final polish to the animation. The keyframe data can be made of scalar values defining the morph targets coefficients or rotation and translation values of the bones in models with a bone based rig. Often to speed up the keyframe animation process a control rig is used by the animation. The control rig represents a higher level of abstraction that can act on multiple morph targets coefficients or bones at the same time. For example, a "smile" control can act simultaneously on the mouth shape curving up and the eyes squinting.

Applying facial animation to a character

The main techniques used to apply facial animation to a character are: 1.) morph targets animation, 2.) bone driven animation, 3.) texture-based animation (2D or 3D), and 4.) physiological models.

  • Morph targets (also called "blendshapes") based systems offer a fast playback as well as a high degree of fidelity of expressions. The technique involves modeling portions of the face mesh to approximate expressions and visemes and then blending the different sub meshes, known as morph targets or blendshapes. Perhaps the most accomplished character using this technique was Gollum, from The Lord of the Rings. Drawbacks of this technique are that they involve intensive manual labor and are specific to each character. Recently, new concepts in 3D modeling have started to emerge. Recently, a new technology departing from the traditional techniques starts to emerge, such as Curve Controlled Modeling[3] that emphasizes the modeling of the movement of a 3D object instead of the traditional modeling of the static shape.
  • Bone driven animation is very broadly used in games. The bones setup can vary between few bones to close to a hundred to allow all subtle facial expressions. The main advantages of bone driven animation is that the same animation can be used for different characters as long as the morphology of their faces is similar, and secondly they do not require loading in memory all the Morph targets data. Bone driven animation is most widely supported by 3D game engines. Bone driven animation can be used for both 2D and 3D animation. For example, it is possible to rig and animate using bones a 2D character using Adobe Flash.
Screenshot from "Kara" animated short by Quantic Dream
  • Texture-based animation uses pixel color to create the animation on the character face. 2D facial animation is commonly based upon the transformation of images, including both images from still photography and sequences of video. Image morphing is a technique which allows in-between transitional images to be generated between a pair of target still images or between frames from sequences of video. These morphing techniques usually consist of a combination of a geometric deformation technique, which aligns the target images, and a cross-fade which creates the smooth transition in the image texture. An early example of image morphing can be seen in Michael Jackson's video for "Black Or White". In 3D animation texture based animation can be achieved by animating the texture itself or the UV mapping. In the latter case a texture map of all the facial expression is created and the UV map animation is used to transition from one expression to the next.
  • Physiological models, such as skeletal muscle systems and physically based head models, form another approach in modeling the head and face.[4] Here, the physical and anatomical characteristics of bones, tissues, and skin are simulated to provide a realistic appearance (e.g. spring-like elasticity). Such methods can be very powerful for creating realism but the complexity of facial structures make them computationally expensive, and difficult to create. Considering the effectiveness of parameterized models for communicative purposes (as explained in the next section), it may be argued that physically based models are not a very efficient choice in many applications. This does not deny the advantages of physically based models and the fact that they can even be used within the context of parameterized models to provide local details when needed.

Face animation languages

Many face animation languages are used to describe the content of facial animation. They can be input to a compatible "player" software which then creates the requested actions. Face animation languages are closely related to other multimedia presentation languages such as SMIL and VRML. Due to the popularity and effectiveness of XML as a data representation mechanism, most face animation languages are XML-based. For instance, this is a sample from Virtual Human Markup Language (VHML):

 <vhml>
   <person disposition="angry">
     First I speak with an angry voice and look very angry,
     <surprised intensity="50">
       but suddenly I change to look more surprised.
     </surprised>
   </person>
 </vhml>

More advanced languages allow decision-making, event handling, and parallel and sequential actions. The Face Modeling Language (FML) is an XML-based language for describing face animation.[5] FML supports MPEG-4 Face Animation Parameters (FAPS), decision-making and dynamic event handling, and typical programming constructs such as loops. It is part of the iFACE system.[5] The following is an example from FML:

 <fml>
   <act>
     <par>
 	<hdmv type="yaw" value="15" begin="0" end="2000" />
 	<expr type="joy" value="-60" begin="0" end="2000" />
     </par>
     <excl event_name="kbd" event_value="" repeat="kbd;F3_up" >
 	<hdmv type="yaw" value="40" begin="0" end="2000" event_value="F1_up" />
 	<hdmv type="yaw" value="-40" begin="0" end="2000" event_value="F2_up" />
     </excl>
   </act>
 </fml>

See also

References

  1. ^ Hjortsjö, CH (1969). Man's face and mimic language Archived 2022-08-06 at the Wayback Machine.
  2. ^ Learning Audio-Driven Viseme Dynamics for 3D Face Animation
  3. ^ Ding, H.; Hong, Y. (2003). "NURBS curve controlled modeling for facial animation". Computers and Graphics. 27 (3): 373–385. doi:10.1016/S0097-8493(03)00033-5.
  4. ^ Lucero, J.C.; Munhall, K.G. (1999). "A model of facial biomechanics for speech production". Journal of the Acoustical Society of America. 106 (5): 2834–2842. Bibcode:1999ASAJ..106.2834L. doi:10.1121/1.428108. PMID 10573899.
  5. ^ a b "iFACE". Carleton University. 6 June 2007. Archived from the original on 6 June 2007. Retrieved 16 June 2019.

Further reading

Read other articles:

Questa voce o sezione sull'argomento Svizzera non cita le fonti necessarie o quelle presenti sono insufficienti. Puoi migliorare questa voce aggiungendo citazioni da fonti attendibili secondo le linee guida sull'uso delle fonti. Segui i suggerimenti del progetto di riferimento. A differenza degli Stati di tipo centralista, nella Confederazione Svizzera i cantoni sono completamente liberi di decidere la propria suddivisione amministrativa. Pertanto esistono diverse strutture e terminolog...

 

Маршрутизатор, як приклад вбудованих систем. Відмічені складові позначають: мікропроцесор (4), RAM (6), та флеш-пам'ять (7). Вбудо́вана, вбудо́вувана систе́ма (англ. embedded system) — спеціалізована комп'ютерна система або обчислювальний пристрій, призначений для виконання обмеже...

 

Soviet Army CupMost recent season or competition:1989–90SportFootballFounded1945First season1946Ceased1990No. of teamsVariousCountryBulgariaLastchampion(s)CSKA SofiaMost titlesLevski SofiaPromotion toUEFA Cup Winners' Cup (1962–1982)RelatedcompetitionsBulgarian Cup The Soviet Army Cup (Bulgarian: Купа на Съветската армия, romanized: Kupa na Savetskata armiya) was an annual knockout football cup tournament held in Bulgaria between 1946 and 1990. Until 1982, it was...

هويزو 惠州市  خريطة الموقع تاريخ التأسيس 1988  تقسيم إداري البلد  الصين[1][2] التقسيم الأعلى غوانغدونغ  خصائص جغرافية إحداثيات 23°06′41″N 114°24′55″E / 23.11147°N 114.41523°E / 23.11147; 114.41523   [3] المساحة 10922 كم² الارتفاع 15 السكان التعداد السكاني 4598402 نسمة (إحص...

 

Peter Ofori-Quaye Nazionalità  Ghana Altezza 183 cm Peso 72 kg Calcio Ruolo Attaccante Termine carriera 2012 Carriera Giovanili 1991-1995 Ajax Cape Town Squadre di club1 1995-1997 Kalamata34 (8)1997-2003 Olympiakos80 (24)2004-2005 Liberty Prof.29 (21)2005-2007 OFI Creta45 (9)2007-2008 Ironi K. Shmona36 (35)2008-2010 AEL Limassol11 (3)2010-2011 Hearts of Oak? (?)2011-2012 Bechem Utd? (?) Nazionale 1997-1999 Ghana U-206 (6)1998-2007 Ghana18 (2...

 

Artikel ini sebatang kara, artinya tidak ada artikel lain yang memiliki pranala balik ke halaman ini.Bantulah menambah pranala ke artikel ini dari artikel yang berhubungan atau coba peralatan pencari pranala.Tag ini diberikan pada Januari 2023. Beberapa atau seluruh referensi dari artikel ini mungkin tidak dapat dipercaya kebenarannya. Bantulah dengan memberikan referensi yang lebih baik atau dengan memeriksa apakah referensi telah memenuhi syarat sebagai referensi tepercaya. Referensi yang t...

此條目可参照英語維基百科相應條目来扩充。 (2021年5月6日)若您熟悉来源语言和主题,请协助参考外语维基百科扩充条目。请勿直接提交机械翻译,也不要翻译不可靠、低品质内容。依版权协议,译文需在编辑摘要注明来源,或于讨论页顶部标记{{Translated page}}标签。 约翰斯顿环礁Kalama Atoll 美國本土外小島嶼 Johnston Atoll 旗幟颂歌:《星條旗》The Star-Spangled Banner約翰斯頓環礁�...

 

هنودمعلومات عامةنسبة التسمية الهند التعداد الكليالتعداد قرابة 1.21 مليار[1][2]تعداد الهند عام 2011ق. 1.32 مليار[3]تقديرات عام 2017ق. 30.8 مليون[4]مناطق الوجود المميزةبلد الأصل الهند البلد الهند  الهند نيبال 4,000,000[5] الولايات المتحدة 3,982,398[6] الإمار...

 

Troupes de fusiliers motorisésМотострелковые войска Emblème des forces terrestres russes. Création 1992 Pays Russie Branche Forces terrestres russes et forces côtières russes Type Arme (corps militaire) Rôle Infanterie mécanisée Composée de 10 divisions, 23 brigades et 11 régiments Ancienne dénomination Troupes de fusiliers motorisés soviétiques Guerres guerre civile du Tadjikistan (1992-1997) ; première (1994-1996) et seconde guerres de Tché...

Census-designated place in New York, United StatesPearl RiverCensus-designated placeEdward Salyer House on South Middletown Road in Pearl River.Location in Rockland County and the state of New YorkPearl RiverLocation within the state of New YorkCoordinates: 41°3′32.8″N 74°1′12.9″W / 41.059111°N 74.020250°W / 41.059111; -74.020250CountryUnited StatesStateNew YorkCountyRocklandArea[1] • Total7.19 sq mi (18.63 km2) •&...

 

Vertebra lumbarPosisi tulang belakang bawah pada manusia (ditunjukkan pada merah). Terdiri dari lima tulang, dari atas ke bawah, mereka ialah L1, L2, L3, L4, dan L5.Vertebra lumbar biasa.RincianPengidentifikasiBahasa Latinvertebrae lumbalesMeSHD008159TA98A02.2.04.001TA21068FMA9921Daftar istilah anatomi tulang[sunting di Wikidata]Pada anatomi manusia, tulang punggung bawah atau tulang punggung lumbar adalah serangkaian lima vertebra diantara tulang rusuk dan pelvis. Mereka merupakan segmen...

 

Italian cyclist Michele BartoliBartoli at the 2009 Tour of FlandersPersonal informationFull nameMichele BartoliNicknameIl Leoncino delle Fiandre (in Italian)(The Little Lion of Flanders) (in English)Born (1970-05-27) 27 May 1970 (age 53)Pisa, ItalyHeight1.79 m (5 ft 10 in)Weight65 kg (143 lb; 10 st 3 lb)Team informationCurrent teamRetiredDisciplineRoadRoleRiderRider typeClassics specialistProfessional teams1992–1995Mercatone Uno–Medeg...

Concert hall in Perth, Western Australia Perth Concert HallGeneral informationTypeConcert hallArchitectural styleBrutalistTown or cityPerth, Western AustraliaCountryAustraliaCoordinates31°57′29″S 115°51′50″E / 31.958°S 115.86385°E / -31.958; 115.86385 (Perth Concert Hall)Current tenantsWest Australian Symphony OrchestraVarious other events and performancesConstruction started1971; 53 years ago (1971)Opened26 January 1973CostA$3,201...

 

هذه المقالة يتيمة إذ تصل إليها مقالات أخرى قليلة جدًا. فضلًا، ساعد بإضافة وصلة إليها في مقالات متعلقة بها. (يونيو 2018) جوزيبي فارفانيللي معلومات شخصية الميلاد 13 فبراير 1915   بِروجَة  تاريخ الوفاة سنة 1979   الجنسية إيطاليا (18 يونيو 1946–1979) مملكة إيطاليا (13 فبراير 1915–18 يوني...

 

Pour les articles homonymes, voir Clwyd. Clwyd Carte du comté dans le découpage de 2010. Administration Pays Royaume-Uni Nation Pays de Galles Comté Denbigh (partiel) Flintshire (partiel) Merioneth Chef-lieu Mold Type Comté préservé (depuis 1996) Zone de gouvernement local (1974-1996) Assemblée Mandat Conseil de comté de la Clwyd Active de 1974 à 1996 Lord-lieutenant Mandat Henry George Fetherstonhaugh Depuis 2013 Haut-shérif Mandat Karen Ann Farrell-Thornley 2024-2025 Démographie...

French cyclist Axel DomontDomont in 2015Personal informationFull nameAxel DomontBorn (1990-08-07) 7 August 1990 (age 34)Valence, FranceHeight1.79 m (5 ft 10 in)Weight65 kg (143 lb; 10.2 st)Team informationCurrent teamRetiredDisciplineRoadRoleRiderRider typeClimberAmateur teams2009UC Aubenas2010–2012Chambéry CF2011Ag2r–La Mondiale (stagiaire) Professional team2013–2020Ag2r–La Mondiale[1] Axel Domont (born 7 August 1990) is a Fr...

 

2007 Philippine television drama series This article is about the 2007 TV series. For the 1988 Filipino song, see Sana Maulit Muli (song). For the 1995 film, see Sana Maulit Muli (film). Sana Maulit MuliTitle cardAlso known asChancesGenreDramaRomanceCreated byABS-CBN StudiosWritten byPhilip KingDirected byJerome Chavez Pobocan Claudio Tots Sanchez-Mariscal IVStarringKim ChiuGerald AndersonErich GonzalesJake CuencaTheme music composerGary ValencianoAngeli ValencianoEnding themeSana Maulit Muli...

 

東高円寺駅 蚕糸の森公園方面1番出入口(2019年8月30日撮影) ひがしこうえんじ Higashi-koenji ◄M 03 新高円寺 (0.9 km) (1.0 km) 新中野 M 05► 所在地 東京都杉並区和田三丁目55-42北緯35度41分52.8秒 東経139度39分28.2秒 / 北緯35.698000度 東経139.657833度 / 35.698000; 139.657833 (東高円寺駅)座標: 北緯35度41分52.8秒 東経139度39分28.2秒 / 北緯35.69800...

« Évian » redirige ici. Pour les autres significations, voir Évian (homonymie). Évian-les-Bains De haut en bas, de gauche à droite : Évian-les-Bains vue depuis le Léman ; Le Palais Lumière et l'hôtel de ville d'Évian ; Le Casino d'Évian ; La Buvette Cachat ; L'Hôtel Royal d'Évian ; La promenade du port ; Le port d'Évian Blason Logo Administration Pays France Région Auvergne-Rhône-Alpes Département Haute-Savoie Arrondissement Thon...

 

الخطوط الجوية الموزمبيقية   إياتاTM  إيكاوLAM  رمز النداءMOZAMBIQUE تاريخ الإنشاء 26 أغسطس 1936  الجنسية موزمبيق  موقع ويب الموقع الرسمي  تعديل مصدري - تعديل   الخطوط الجوية الموزمبيقية أو لام[1] (بالبرتغالية: LAM Linhas Aéreas de Moçambique‏) هي الناقل الوطني لموزمبيق.[2&...