In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems.[1] Some scripts support one and only one writing system and language, for example, Armenian. Other scripts support many different writing systems; for example, the Latin script supports English, French, German, Italian, Vietnamese, Latin itself, and several other languages. Some languages have been written in more than one writing system and thus in more than one script; for example, Turkish was written in the Arabic script until the early 20th century, when it switched to the Latin script. More or less complementary to scripts are symbols and Unicode control characters.
The unified diacritical characters and unified punctuation characters frequently have the "common" or "inherited" script property. However, the individual scripts often have their own punctuation and diacritics, so that many scripts include not only letters but also diacritic and other marks, punctuation, numerals and even their own idiosyncratic symbols and space characters.
Unicode 16.0 defines 168 separate scripts, including 99 modern scripts and 69 ancient or historic scripts.[2][3] More scripts are in the process of being encoded or have been tentatively allocated for encoding in roadmaps.[4]
Definition and classification
When multiple languages make use of the same script, there are frequently some differences, particularly in diacritics and other marks. For example, Swedish and English both use the Latin script. However, Swedish includes the character å (sometimes called a Swedish O), while English has no such character. Nor does English make use of the diacritic combining ring above for any character. In general, the languages sharing the same scripts share many of the same characters. Despite these peripheral differences in the Swedish and English writing systems, they are said to use the same Latin script. Thus, the Unicode abstraction of scripts is a basic organizing technique. The differences among different alphabets or writing systems remain and are supported through Unicode’s flexible scripts, combining marks and collation algorithms.
Script versus writing system
The term "writing system" is sometimes treated as a synonym for "script". However, it can also refer to the specific concrete writing system supported by a script. For example, the Vietnamese writing system is supported by the Latin script. A writing system may also cover more than one script; for example, the Japanese writing system makes use of the Han, Hiragana and Katakana scripts.
Most writing systems can be broadly divided into several categories: logographic, syllabic, alphabetic (or segmental), abugida, abjad and featural; however, all features of any of these may be found in any given writing system in varying proportions, often making it difficult to purely categorize a system. The term complex system is sometimes used to describe those where the admixture makes classification problematic.
Unicode supports all of these types of writing systems through its numerous scripts. Unicode also adds further properties to characters to help differentiate the various characters and the ways they behave within Unicode text-processing algorithms.
Special script property values
In addition to explicit or specific script properties, Unicode uses three special values:[5]
Common
Unicode can assign a character in the UCS to a single script only. However, many characters (those that are not part of a formal natural-language writing system or are unified across many writing systems) may be used in more than one script, for example currency signs, symbols, numerals and punctuation marks. In these cases Unicode defines them as belonging to the "common" script (ISO 15924 code Zyyy).
Inherited
Many diacritics and non-spacing combining characters may be applied to characters from more than one script. In these cases Unicode assigns them to the "inherited" script (ISO 15924 code Zinh), which means that they have the same script class as the base character with which they combine, and so in different contexts they may be treated as belonging to different scripts. For example, U+0308 ◌̈ COMBINING DIAERESIS may combine either with U+0065 e LATIN SMALL LETTER E to create a Latin ë or with U+0435 е CYRILLIC SMALL LETTER IE to create the Cyrillic ё. In the former case, it inherits the Latin script of the base character, whereas in the latter case, it inherits the Cyrillic script of the base character.
Unknown
The value of "unknown" script (ISO 15924 code Zzzz) is given to unassigned, private-use, noncharacter, and surrogate code points.
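The behaviour of the three special values can be illustrated with a minimal Python sketch. Python's standard `unicodedata` module does not expose the Script property, so the `SCRIPT_RANGES` table below is a small hand-written excerpt assumed for this example (the full data is published in the Unicode Character Database), and `script_of` and `resolve_scripts` are hypothetical helper names rather than part of any library.

```python
import unicodedata

# Illustrative excerpt only: (first, last, script value).
# This table is an assumption made for the example and covers
# just a handful of code point ranges.
SCRIPT_RANGES = [
    (0x0020, 0x0040, "Common"),     # space, punctuation, digits
    (0x0041, 0x005A, "Latin"),      # A-Z
    (0x0061, 0x007A, "Latin"),      # a-z
    (0x0300, 0x036F, "Inherited"),  # combining diacritical marks
    (0x0410, 0x044F, "Cyrillic"),   # basic Cyrillic letters
]


def script_of(ch):
    """Return the script value of one character, using the excerpt above."""
    # Unassigned and noncharacter (Cn), private-use (Co) and surrogate (Cs)
    # code points receive the "Unknown" value.
    if unicodedata.category(ch) in ("Cn", "Co", "Cs"):
        return "Unknown"
    cp = ord(ch)
    for first, last, script in SCRIPT_RANGES:
        if first <= cp <= last:
            return script
    raise LookupError(f"U+{cp:04X} is not covered by this excerpt")


def resolve_scripts(text):
    """Replace each "Inherited" value with the script of the preceding character."""
    resolved = []
    previous = None
    for ch in text:
        script = script_of(ch)
        if script == "Inherited" and previous is not None:
            script = previous
        resolved.append(script)
        previous = script
    return resolved


print(resolve_scripts("e\u0308"))       # diaeresis after Latin e
print(resolve_scripts("\u0435\u0308"))  # diaeresis after Cyrillic ie
print(script_of("$"), script_of("\ue000"))
```

In this sketch the same combining diaeresis resolves to Latin in the first call and to Cyrillic in the second, while the dollar sign stays Common and the private-use code point U+E000 is reported as Unknown.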
Character categories within scripts
Unicode provides a general category property for each character, so in addition to belonging to a script, every character also has a general category. Scripts typically include letter characters of several kinds: uppercase letters, lowercase letters and modifier letters. A few precomposed ligatures, such as ǲ (U+01F2), are categorized as titlecase letters. Such titlecase ligatures are all in the Latin and Greek scripts and are all compatibility characters, and therefore Unicode discourages their use by authors. It is unlikely that new titlecase letters will be added in the future.
Most writing systems do not differentiate between uppercase and lowercase letters. In those scripts, all letters are categorized as "other letter" or "modifier letter". Ideographs such as Unihan ideographs are also categorized as "other letter". A few scripts do differentiate between uppercase and lowercase, however, including Latin, Cyrillic, Greek, Armenian, Georgian, and Deseret. Even in these scripts, some letters are neither uppercase nor lowercase.
Scripts can also contain any other general category character such as marks (diacritic and otherwise), numbers (numerals), punctuation, separators (word separators such as spaces), symbols and non-graphical format characters. These are included in a particular script when they are unique to that script. Other such characters are generally unified and included in the punctuation or diacritic blocks. However, the bulk of characters in any script (other than the common and inherited scripts) are letters.
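The general categories described in this section can be inspected with Python's standard `unicodedata.category` function, which returns the two-letter category code of a character. The following is a minimal sketch; the `CATEGORY_LABELS` table and the `describe` helper are names introduced here for illustration only.

```python
import unicodedata

# Letter categories mentioned in the text, keyed by their two-letter codes.
CATEGORY_LABELS = {
    "Lu": "uppercase letter",
    "Ll": "lowercase letter",
    "Lt": "titlecase letter",
    "Lm": "modifier letter",
    "Lo": "other letter",
}


def describe(ch):
    """Return (code, label) for the general category of a single character."""
    code = unicodedata.category(ch)
    return code, CATEGORY_LABELS.get(code, "not a letter")


# Latin A and a, the titlecase ligature U+01F2, a modifier letter,
# a Han ideograph, a Hebrew letter, a combining mark, a digit and a space.
for ch in ["A", "a", "\u01F2", "\u02B0", "\u4E2D", "\u05D0", "\u0308", "5", " "]:
    code, label = describe(ch)
    print(f"U+{ord(ch):04X} {code} {label}")
```

The Han ideograph and the Hebrew letter both fall under "other letter" because their scripts have no case distinction, while the combining mark, digit and space belong to the mark, number and separator categories respectively.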
List of encoded scripts
As of version 16.0, Unicode defines 168 scripts (called "Alias" or "Property value alias") based on the ISO 15924 list. In addition, Unicode assigns the name "Common" to ISO 15924's Zyyy code for undetermined scripts, "Inherited" to ISO 15924's Zinh code for inherited scripts, and "Unknown" to ISO 15924's Zzzz code for uncoded scripts. Some script codes defined by ISO 15924 are not used in Unicode, including Zsym (Symbols) and Zmth (Mathematical notation).
Unicode uses the "Property Value Alias" (Alias) as the script name. These Alias names are part of Unicode and are published informatively next to ISO 15924. An alias script name may be used in a character name; for example, the alias Palmyrene (ISO 15924 code Palm) appears in U+10860 𐡠 PALMYRENE LETTER ALEPH.
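The appearance of an alias in a character name can be checked with Python's standard `unicodedata.name` function. The sketch below uses a hypothetical helper, `name_uses_alias`, and assumes a Python build whose Unicode database includes the Palmyrene block. It is not a way to determine a character's script in general, since many character names do not begin with their script's alias.

```python
import unicodedata


def name_uses_alias(ch, alias):
    """True if the character's Unicode name starts with the given script alias."""
    # Aliases use underscores between words; character names use spaces.
    prefix = alias.upper().replace("_", " ")
    return unicodedata.name(ch, "").startswith(prefix)


print(unicodedata.name("\U00010860"))
print(name_uses_alias("\U00010860", "Palmyrene"))
print(name_uses_alias("\u4E2D", "Han"))
```

The last call returns False because Han ideographs are named CJK UNIFIED IDEOGRAPH followed by their code point, which shows why names are only an informal hint to the script.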
The project Missing Scripts—with contributors from the Mainz University of Applied Sciences, the L’Atelier national de recherche typographique (ANRT) in Nancy, and the University of California, Berkeley—has compiled a list of 131 scripts that have not yet been encoded in The Unicode Standard, out of a total of 294 recognized scripts according to the current state of research.[6]