Unicode Script

In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems.

Unicode Script Unicode Script Unicode Script
Unicode Script Unicode Script
Unicode Script Unicode Script Unicode Script ழ்
Unicode Script ع‎‎ ש‎‎ Д A‎

Some scripts support one and only one writing system and language, for example, Armenian. Other scripts support many different writing systems; for example, the Latin script supports English, French, German, Italian, Vietnamese, Latin itself, and several other languages. Some languages make use of multiple alternate writing systems and thus also use several scripts; for example, in Turkish, the Arabic script was used before the 20th century but transitioned to Latin in the early part of the 20th century. More or less complementary to scripts are symbols and Unicode control characters.

The unified diacritical characters and unified punctuation characters frequently have the "common" or "inherited" script property. However, the individual scripts often have their own punctuation and diacritics, so that many scripts include not only letters but also diacritic and other marks, punctuation, numerals and even their own idiosyncratic symbols and space characters.

Unicode 15.1 defines 161 separate scripts, including 94 modern scripts and 67 ancient or historic scripts. More scripts are in the process for encoding or have been tentatively allocated for encoding in roadmaps.

Definition and classification

When multiple languages make use of the same script, there are frequently some differences, particularly in diacritics and other marks. For example, Swedish and English both use the Latin script. However, Swedish includes the character å (sometimes called a Swedish O), while English has no such character. Nor does English make use of the diacritic combining ring above for any character. In general, the languages sharing the same scripts share many of the same characters. Despite these peripheral differences in the Swedish and English writing systems, they are said to use the same Latin script. Thus, the Unicode abstraction of scripts is a basic organizing technique. The differences among different alphabets or writing systems remain and are supported through Unicode’s flexible scripts, combining marks and collation algorithms.

Script versus writing system

Writing system is sometimes treated as a synonym for "script". However, it also can be used as the specific concrete writing system supported by a script. For example, the Vietnamese writing system is supported by the Latin script. A writing system may also cover more than one script; for example, the Japanese writing system makes use of the Han, Hiragana and Katakana scripts.

Most writing systems can be broadly divided into several categories: logographic, syllabic, alphabetic (or segmental), abugida, abjad and featural; however, all features of any of these may be found in any given writing system in varying proportions, often making it difficult to purely categorize a system. The term complex system is sometimes used to describe those where the admixture makes classification problematic.

Unicode supports all of these types of writing systems through its numerous scripts. Unicode also adds further properties to characters to help differentiate the various characters and the ways they behave within Unicode text-processing algorithms.

Special script property values

In addition to explicit or specific script properties, Unicode uses three special values:

    Common
    Unicode can assign a character in the UCS to a single script only. However, many characters—those that are not part of a formal natural-language writing system or are unified across many writing systems—may be used in more than one script (for example, currency signs, symbols, numerals and punctuation marks). In these cases Unicode defines them as belonging to the "common" script (ISO 15924 code "Zyyy").
    Inherited
    Many diacritics and non-spacing combining characters may be applied to characters from more than one script. In these cases Unicode assigns them to the "inherited" script (ISO 15924 code Zinh), which means that they have the same script class as the base character with which they combine, and so in different contexts they may be treated as belonging to different scripts. For example, U+0308  ̈  COMBINING DIAERESIS may combine either with U+0065 e LATIN SMALL LETTER E to create a Latin ë or with U+0435 е CYRILLIC SMALL LETTER IE for the Cyrillic ё. In the former case, it inherits the Latin script of the base character, whereas in the latter case, it inherits the Cyrillic script of the base character.
    Unknown
    The value of "unknown" script (ISO 15924 code Zzzz) is given to unassigned, private-use, noncharacter, and surrogate code points.

Character categories within scripts

Unicode provides a general category property for each character. So in addition to belonging to a script every character also has a general category. Typically scripts include letter characters including: uppercase letters, lowercase letter and modifier letters. Some characters are considered titlecase letters for a few precomposed ligatures such as Dz (U+01F2). Such titlecase ligatures are all in the Latin and Greek scripts and are all compatibility characters, and therefore Unicode discourages their use by authors. It is unlikely that new titlecase letters will be added in the future.

Most writing systems do not differentiate between uppercase and lowercase letters. For those scripts all letters are categorized as "other letter" or "modifier letter". Ideographs such as Unihan ideographs are also categorized as "other letters". A few scripts do differentiate between uppercase and lowercase however: Latin, Cyrillic, Greek, Armenian, Georgian, and Deseret. Even for these scripts there are some letters that are neither uppercase nor lowercase.

Scripts can also contain any other general category character such as marks (diacritic and otherwise), numbers (numerals), punctuation, separators (word separators such as spaces), symbols and non-graphical format characters. These are included in a particular script when they are unique to that script. Other such characters are generally unified and included in the punctuation or diacritic blocks. However, the bulk of characters in any script (other than the common and inherited scripts) are letters.

List of scripts in Unicode

Unicode defines over a hundred script names (called "Alias" or "Property value alias"), based on the ISO 15924 list. Unicode uses the "Common" script name for ISO 15924's Zyyy (code for undetermined script), "Inherited" for ISO 15924's Zinh (code for inherited script), and "Unknown" for ISO 15924's Zzzz (code for uncoded script). Not used are, among others, the ISO 15924 script codes: Zsym (Symbols) and Zmth (Mathematical notation). These are considered not to be scripts in Unicode sense.

ISO 15924 Script in Unicode[e]
Code ISO number ISO formal name Directionality Unicode Alias[f] Version Characters Notes Description
Adlm 166 Adlam right-to-left script Unicode Script  Adlam 9.0 88 Ch 19.9
Afak 439 Afaka varies ZZ— Not in Unicode, proposal is explored
Aghb 239 Caucasian Albanian left-to-right Unicode Script  Caucasian Albanian 7.0 53 Ancient/historic Ch 8.11
Ahom 338 Ahom, Tai Ahom left-to-right Unicode Script  Ahom 8.0 65 Ancient/historic Ch 15.16
Arab 160 Arabic right-to-left script Unicode Script  Arabic 1.0 1,368 Ch 9.2
Aran 161 Arabic (Nastaliq variant) mixed ZZ— Typographic variant of Arabic (see § Arab)
Armi 124 Imperial Aramaic right-to-left script Unicode Script  Imperial Aramaic 5.2 31 Ancient/historic Ch 10.4
Armn 230 Armenian left-to-right Unicode Script  Armenian 1.0 96 Ch 7.6
Avst 134 Avestan right-to-left script Unicode Script  Avestan 5.2 61 Ancient/historic Ch 10.7
Bali 360 Balinese left-to-right Unicode Script  Balinese 5.0 124 Ch 17.3
Bamu 435 Bamum left-to-right Unicode Script  Bamum 5.2 657 Ch 19.6
Bass 259 Bassa Vah left-to-right Unicode Script  Bassa Vah 7.0 36 Ancient/historic Ch 19.7
Batk 365 Batak left-to-right Unicode Script  Batak 6.0 56 Ch 17.6
Beng 325 Bengali (Bangla) left-to-right Unicode Script  Bengali 1.0 96 Ch 12.2
Bhks 334 Bhaiksuki left-to-right Unicode Script  Bhaiksuki 9.0 97 Ancient/historic Ch 14.3
Blis 550 Blissymbols varies ZZ— Not in Unicode, proposal is explored
Bopo 285 Bopomofo left-to-right, right-to-left script Unicode Script  Bopomofo 1.0 77 Ch 18.3
Brah 300 Brahmi left-to-right Unicode Script  Brahmi 6.0 115 Ancient/historic Ch 14.1
Brai 570 Braille left-to-right Unicode Script  Braille 3.0 256 Ch 21.1
Bugi 367 Buginese left-to-right Unicode Script  Buginese 4.1 30 Ch 17.2
Buhd 372 Buhid left-to-right Unicode Script  Buhid 3.2 20 Ch 17.1
Cakm 349 Chakma left-to-right Unicode Script  Chakma 6.1 71 Ch 13.11
Cans 440 Unified Canadian Aboriginal Syllabics left-to-right Unicode Script  Canadian Aboriginal 3.0 726 Ch 20.2
Cari 201 Carian left-to-right, right-to-left script Unicode Script  Carian 5.1 49 Ancient/historic Ch 8.5
Cham 358 Cham left-to-right Unicode Script  Cham 5.1 83 Ch 16.10
Cher 445 Cherokee left-to-right Unicode Script  Cherokee 3.0 172 Ch 20.1
Chis 298 Chisoi left-to-right ZZ— Not in Unicode, proposal is mature
Chrs 109 Chorasmian right-to-left script, top-to-bottom Unicode Script  Chorasmian 13.0 28 Ancient/historic Ch 10.8
Cirt 291 Cirth varies ZZ— Not in Unicode
Copt 204 Coptic left-to-right Unicode Script  Coptic 1.0 137 Ancient/historic, disunified from Greek in 4.1 Ch 7.3
Cpmn 402 Cypro-Minoan left-to-right Cypro Minoan 14.0 99 Ancient/historic Ch 8.4
Cprt 403 Cypriot syllabary right-to-left script Unicode Script  Cypriot 4.0 55 Ancient/historic Ch 8.3
Cyrl 220 Cyrillic left-to-right Unicode Script  Cyrillic 1.0 506 Includes typographic variant Old Church Slavonic (see § Cyrs) Ch 7.4
Cyrs 221 Cyrillic (Old Church Slavonic variant) varies ZZ— Typographic variant of Cyrillic (see § Cyrl); Ancient/historic
Deva 315 Devanagari (Nagari) left-to-right Unicode Script  Devanagari 1.0 164 Ch 12.1
Diak 342 Dives Akuru left-to-right Unicode Script  Dives Akuru 13.0 72 Ancient/historic Ch 15.15
Dogr 328 Dogra left-to-right Unicode Script  Dogra 11.0 60 Ancient/historic Ch 15.18
Dsrt 250 Deseret (Mormon) left-to-right Unicode Script  Deseret 3.1 80 Ch 20.4
Dupl 755 Duployan shorthand, Duployan stenography left-to-right Unicode Script  Duployan 7.0 143 Ch 21.6
Egyd 070 Egyptian demotic mixed ZZ— Not in Unicode
Egyh 060 Egyptian hieratic mixed ZZ— Not in Unicode
Egyp 050 Egyptian hieroglyphs right-to-left script, left-to-right Unicode Script  Egyptian Hieroglyphs 5.2 1,110 Ancient/historic Ch 11.4
Elba 226 Elbasan left-to-right Unicode Script  Elbasan 7.0 40 Ancient/historic Ch 8.10
Elym 128 Elymaic right-to-left script Unicode Script  Elymaic 12.0 23 Ancient/historic Ch 10.9
Ethi 430 Ethiopic (Geʻez) left-to-right Unicode Script  Ethiopic 3.0 523 Ch 19.1
Gara 164 Garay right-to-left ZZ— Not in Unicode, approved for version 16.0
Geok 241 Khutsuri (Asomtavruli and Nuskhuri) left-to-right Unicode Script  Georgian Unicode groups Khutsori, Asomtavruli and Nuskhuri into 'Georgian' (see § Geok). Similarly, Mkhedruli and Mtavruli are 'Georgian' (see § Geor) Ch 7.7
Geor 240 Georgian (Mkhedruli and Mtavruli) left-to-right Unicode Script  Georgian 1.0 173 In Unicode this also includes Nuskhuri (Geok) Ch 7.7
Glag 225 Glagolitic left-to-right Unicode Script  Glagolitic 4.1 134 Ancient/historic Ch 7.5
Gong 312 Gunjala Gondi left-to-right Unicode Script  Gunjala Gondi 11.0 63 Ch 13.15
Gonm 313 Masaram Gondi left-to-right Unicode Script  Masaram Gondi 10.0 75 Ch 13.14
Goth 206 Gothic left-to-right Unicode Script  Gothic 3.1 27 Ancient/historic Ch 8.9
Gran 343 Grantha left-to-right Unicode Script  Grantha 7.0 85 Ancient/historic Ch 15.14
Grek 200 Greek left-to-right Unicode Script  Greek 1.0 518 Directionality sometimes as boustrophedon Ch 7.2
Gujr 320 Gujarati left-to-right Unicode Script  Gujarati 1.0 91 Ch 12.4
Gukh 397 Gurung Khema left-to-right ZZ— Not in Unicode, approved for version 16.0
Guru 310 Gurmukhi left-to-right Unicode Script  Gurmukhi 1.0 80 Ch 12.3
Hanb 503 Han with Bopomofo (alias for Han + Bopomofo) mixed ZZ— See § Hani, § Bopo
Hang 286 Hangul (Hangŭl, Hangeul) left-to-right, vertical right-to-left Unicode Script  Hangul 1.0 11,739 Hangul syllables relocated in 2.0 Ch 18.6
Hani 500 Han (Hanzi, Kanji, Hanja) top-to-bottom, columns right-to-left (historically) Han 1.0 99,030 Ch 18.1
Hano 371 Hanunoo (Hanunóo) left-to-right, bottom-to-top Unicode Script  Hanunoo 3.2 21 Ch 17.1
Hans 501 Han (Simplified variant) varies ZZ— Subset of Han (Hanzi, Kanji, Hanja) (see § Hani)
Hant 502 Han (Traditional variant) varies ZZ— Subset of § Hani
Hatr 127 Hatran right-to-left script Unicode Script  Hatran 8.0 26 Ancient/historic Ch 10.12
Hebr 125 Hebrew right-to-left script Unicode Script  Hebrew 1.0 134 Ch 9.1
Hira 410 Hiragana vertical right-to-left, left-to-right Unicode Script  Hiragana 1.0 381 Ch 18.4
Hluw 080 Anatolian Hieroglyphs (Luwian Hieroglyphs, Hittite Hieroglyphs) left-to-right Unicode Script  Anatolian Hieroglyphs 8.0 583 Ancient/historic Ch 11.6
Hmng 450 Pahawh Hmong left-to-right Unicode Script  Pahawh Hmong 7.0 127 Ch 16.11
Hmnp 451 Nyiakeng Puachue Hmong left-to-right Unicode Script  Nyiakeng Puachue Hmong 12.0 71 Ch 16.12
Hrkt 412 Japanese syllabaries (alias for Hiragana + Katakana) vertical right-to-left, left-to-right Unicode Script  Katakana or Hiragana See § Hira, § Kana Ch 18.4
Hung 176 Old Hungarian (Hungarian Runic) right-to-left script Unicode Script  Old Hungarian 8.0 108 Ancient/historic Ch 8.8
Inds 610 Indus (Harappan) mixed ZZ— Not in Unicode, proposal is explored
Ital 210 Old Italic (Etruscan, Oscan, etc.) right-to-left script, left-to-right Unicode Script  Old Italic 3.1 39 Ancient/historic Ch 8.6
Jamo 284 Jamo (alias for Jamo subset of Hangul) varies ZZ— Subset of § Hang
Java 361 Javanese left-to-right Unicode Script  Javanese 5.2 90 Ch 17.4
Jpan 413 Japanese (alias for Han + Hiragana + Katakana) varies ZZ— See § Hani, § Hira and § Kana
Jurc 510 Jurchen left-to-right ZZ— Not in Unicode
Kali 357 Kayah Li left-to-right Unicode Script  Kayah Li 5.1 47 Ch 16.9
Kana 411 Katakana vertical right-to-left, left-to-right Unicode Script  Katakana 1.0 321 Ch 18.4
Kawi 368 Kawi left-to-right Unicode Script  Kawi 15.0 86 Ancient/historic Ch 17.9
Khar 305 Kharoshthi right-to-left script Unicode Script  Kharoshthi 4.1 68 Ancient/historic Ch 14.2
Khmr 355 Khmer left-to-right Unicode Script  Khmer 3.0 146 Ch 16.4
Khoj 322 Khojki left-to-right Unicode Script  Khojki 7.0 65 Ancient/historic Ch 15.7
Kitl 505 Khitan large script left-to-right ZZ— Not in Unicode
Kits 288 Khitan small script vertical right-to-left Unicode Script  Khitan Small Script 13.0 471 Ancient/historic Ch 18.12
Knda 345 Kannada left-to-right Unicode Script  Kannada 1.0 91 Ch 12.8
Kore 287 Korean (alias for Hangul + Han) left-to-right ZZ— See § Hani, § Hang
Kpel 436 Kpelle left-to-right ZZ— Not in Unicode, proposal is explored
Krai 396 Kirat Rai left-to-right ZZ— Not in Unicode, approved for version 16.0
Kthi 317 Kaithi left-to-right Unicode Script  Kaithi 5.2 68 Ancient/historic Ch 15.2
Lana 351 Tai Tham (Lanna) left-to-right Unicode Script  Tai Tham 5.2 127 Ch 16.7
Laoo 356 Lao left-to-right Unicode Script  Lao 1.0 83 Ch 16.2
Latf 217 Latin (Fraktur variant) varies ZZ— Typographic variant of Latin (see § Latn)
Latg 216 Latin (Gaelic variant) left-to-right ZZ— Typographic variant of Latin (see § Latn)
Latn 215 Latin left-to-right Unicode Script  Latin 1.0 1,481 See also: Latin script in Unicode Ch 7.1
Leke 364 Leke left-to-right ZZ— Not in Unicode
Lepc 335 Lepcha (Róng) left-to-right Unicode Script  Lepcha 5.1 74 Ch 13.12
Limb 336 Limbu left-to-right Unicode Script  Limbu 4.0 68 Ch 13.6
Lina 400 Linear A left-to-right Unicode Script  Linear A 7.0 341 Ancient/historic Ch 8.1
Linb 401 Linear B left-to-right Unicode Script  Linear B 4.0 211 Ancient/historic Ch 8.2
Lisu 399 Lisu (Fraser) left-to-right Unicode Script  Lisu 5.2 49 Ch 18.9
Loma 437 Loma left-to-right ZZ— Not in Unicode, proposal is explored
Lyci 202 Lycian left-to-right Unicode Script  Lycian 5.1 29 Ancient/historic Ch 8.5
Lydi 116 Lydian right-to-left script Unicode Script  Lydian 5.1 27 Ancient/historic Ch 8.5
Mahj 314 Mahajani left-to-right Unicode Script  Mahajani 7.0 39 Ancient/historic Ch 15.6
Maka 366 Makasar left-to-right Unicode Script  Makasar 11.0 25 Ancient/historic Ch 17.8
Mand 140 Mandaic, Mandaean right-to-left script Unicode Script  Mandaic 6.0 29 Ch 9.5
Mani 139 Manichaean right-to-left script Unicode Script  Manichaean 7.0 51 Ancient/historic Ch 10.5
Marc 332 Marchen left-to-right Unicode Script  Marchen 9.0 68 Ancient/historic Ch 14.5
Maya 090 Mayan hieroglyphs mixed ZZ— Not in Unicode
Medf 265 Medefaidrin (Oberi Okaime, Oberi Ɔkaimɛ) left-to-right Unicode Script  Medefaidrin 11.0 91 Ch 19.10
Mend 438 Mende Kikakui right-to-left script Unicode Script  Mende Kikakui 7.0 213 Ch 19.8
Merc 101 Meroitic Cursive right-to-left script Unicode Script  Meroitic Cursive 6.1 90 Ancient/historic Ch 11.5
Mero 100 Meroitic Hieroglyphs right-to-left script Unicode Script  Meroitic Hieroglyphs 6.1 32 Ancient/historic Ch 11.5
Mlym 347 Malayalam left-to-right Unicode Script  Malayalam 1.0 118 Ch 12.9
Modi 324 Modi, Moḍī left-to-right Unicode Script  Modi 7.0 79 Ancient/historic Ch 15.12
Mong 145 Mongolian vertical left-to-right, left-to-right Unicode Script  Mongolian 3.0 168 Mong includes Clear and Manchu scripts Ch 13.5
Moon 218 Moon (Moon code, Moon script, Moon type) mixed ZZ— Not in Unicode, proposal is explored
Mroo 264 Mro, Mru left-to-right Unicode Script  Mro 7.0 43 Ch 13.8
Mtei 337 Meitei Mayek (Meithei, Meetei) left-to-right Unicode Script  Meetei Mayek 5.2 79 Ch 13.7
Mult 323 Multani left-to-right Unicode Script  Multani 8.0 38 Ancient/historic Ch 15.10
Mymr 350 Myanmar (Burmese) left-to-right Unicode Script  Myanmar 3.0 223 Ch 16.3
Nagm 295 Nag Mundari left-to-right Unicode Script  Nag Mundari 15.0 42
Nand 311 Nandinagari left-to-right Unicode Script  Nandinagari 12.0 65 Ancient/historic Ch 15.13
Narb 106 Old North Arabian (Ancient North Arabian) right-to-left script Unicode Script  Old North Arabian 7.0 32 Ancient/historic Ch 10.1
Nbat 159 Nabataean right-to-left script Unicode Script  Nabataean 7.0 40 Ancient/historic Ch 10.10
Newa 333 Newa, Newar, Newari, Nepāla lipi left-to-right Unicode Script  Newa 9.0 97 Ch 13.3
Nkdb 085 Naxi Dongba (na²¹ɕi³³ to³³ba²¹, Nakhi Tomba) left-to-right ZZ— Not in Unicode
Nkgb 420 Naxi Geba (na²¹ɕi³³ gʌ²¹ba²¹, 'Na-'Khi ²Ggŏ-¹baw, Nakhi Geba) left-to-right ZZ— Not in Unicode, proposal is explored
Nkoo 165 N’Ko right-to-left script Unicode Script  NKo 5.0 62 Ch 19.4
Nshu 499 Nüshu vertical right-to-left Unicode Script  Nushu 10.0 397 Ch 18.8
Ogam 212 Ogham bottom-to-top, left-to-right Unicode Script  Ogham 3.0 29 Ancient/historic Ch 8.14
Olck 261 Ol Chiki (Ol Cemet’, Ol, Santali) left-to-right Unicode Script  Ol Chiki 5.1 48 Ch 13.10
Onao 296 Ol Onal left-to-right ZZ— Not in Unicode, approved for version 16.0
Orkh 175 Old Turkic, Orkhon Runic right-to-left script Unicode Script  Old Turkic 5.2 73 Ancient/historic Ch 14.8
Orya 327 Oriya (Odia) left-to-right Unicode Script  Oriya 1.0 91 Ch 12.5
Osge 219 Osage left-to-right Unicode Script  Osage 9.0 72 Ch 20.3
Osma 260 Osmanya left-to-right Unicode Script  Osmanya 4.0 40 Ch 19.2
Ougr 143 Old Uyghur mixed Old Uyghur 14.0 26 Ancient/historic Ch 14.11
Palm 126 Palmyrene right-to-left script Unicode Script  Palmyrene 7.0 32 Ancient/historic Ch 10.11
Pauc 263 Pau Cin Hau left-to-right Unicode Script  Pau Cin Hau 7.0 57 Ch 16.13
Pcun 015 Proto-Cuneiform left-to-right ZZ— Not in Unicode
Pelm 016 Proto-Elamite left-to-right ZZ— Not in Unicode
Perm 227 Old Permic left-to-right Unicode Script  Old Permic 7.0 43 Ancient/historic Ch 8.13
Phag 331 Phags-pa vertical left-to-right Unicode Script  Phags-pa 5.0 56 Ancient/historic Ch 14.4
Phli 131 Inscriptional Pahlavi right-to-left script Unicode Script  Inscriptional Pahlavi 5.2 27 Ancient/historic Ch 10.6
Phlp 132 Psalter Pahlavi right-to-left script Unicode Script  Psalter Pahlavi 7.0 29 Ancient/historic Ch 10.6
Phlv 133 Book Pahlavi mixed ZZ— Not in Unicode
Phnx 115 Phoenician right-to-left script Unicode Script  Phoenician 5.0 29 Ancient/historic[g] Ch 10.3
Piqd 293 Klingon (KLI pIqaD) left-to-right Unicode Script  ZZ— Rejected for inclusion in Unicode
Plrd 282 Miao (Pollard) left-to-right Unicode Script  Miao 6.1 149 Ch 18.10
Prti 130 Inscriptional Parthian right-to-left script Unicode Script  Inscriptional Parthian 5.2 30 Ancient/historic Ch 10.6
Psin 103 Proto-Sinaitic mixed ZZ— Not in Unicode
Qaaa-Qabx 900-949 Reserved for private use (range) ZZ— Not in Unicode
Ranj 303 Ranjana left-to-right ZZ— Not in Unicode
Rjng 363 Rejang (Redjang, Kaganga) left-to-right Unicode Script  Rejang 5.1 37 Ch 17.5
Rohg 167 Hanifi Rohingya right-to-left script Unicode Script  Hanifi Rohingya 11.0 50 Ch 16.14
Roro 620 Rongorongo mixed ZZ— Not in Unicode, proposal is explored
Runr 211 Runic left-to-right, boustrophedon Unicode Script  Runic 3.0 86 Ancient/historic Ch 8.7
Samr 123 Samaritan right-to-left script, top-to-bottom Unicode Script  Samaritan 5.2 61 Ch 9.4
Sara 292 Sarati mixed ZZ— Not in Unicode
Sarb 105 Old South Arabian right-to-left script Unicode Script  Old South Arabian 5.2 32 Ancient/historic Ch 10.2
Saur 344 Saurashtra left-to-right Unicode Script  Saurashtra 5.1 82 Ch 13.13
Sgnw 095 SignWriting vertical left-to-right Unicode Script  SignWriting 8.0 672 Ch 21.7
Shaw 281 Shavian (Shaw) left-to-right Unicode Script  Shavian 4.0 48 Ch 8.15
Shrd 319 Sharada, Śāradā left-to-right Unicode Script  Sharada 6.1 96 Ch 15.3
Shui 530 Shuishu left-to-right ZZ— Not in Unicode
Sidd 302 Siddham, Siddhaṃ, Siddhamātṛkā left-to-right Unicode Script  Siddham 7.0 92 Ancient/historic Ch 15.5
Sidt 180 Sidetic right-to-left ZZ— Not in Unicode, proposal is mature
Sind 318 Khudawadi, Sindhi left-to-right Unicode Script  Khudawadi 7.0 69 Ch 15.9
Sinh 348 Sinhala left-to-right Unicode Script  Sinhala 3.0 111 Ch 13.2
Sogd 141 Sogdian horizontal and vertical writing in East Asian scripts, top-to-bottom Unicode Script  Sogdian 11.0 42 Ancient/historic Ch 14.10
Sogo 142 Old Sogdian right-to-left script Unicode Script  Old Sogdian 11.0 40 Ancient/historic Ch 14.9
Sora 398 Sora Sompeng left-to-right Unicode Script  Sora Sompeng 6.1 35 Ch 15.17
Soyo 329 Soyombo left-to-right Unicode Script  Soyombo 10.0 83 Ancient/historic Ch 14.7
Sund 362 Sundanese left-to-right Unicode Script  Sundanese 5.1 72 Ch 17.7
Sunu 274 Sunuwar left-to-right ZZ— Not in Unicode, approved for version 16.0
Sylo 316 Syloti Nagri left-to-right Unicode Script  Syloti Nagri 4.1 45 Ancient/historic Ch 15.1
Syrc 135 Syriac right-to-left script Unicode Script  Syriac 3.0 88 Includes typographic variants Estrangelo (see § Syre), Western (§ Syrj), and Eastern (§ Syrn) Ch 9.3
Syre 138 Syriac (Estrangelo variant) mixed ZZ— Typographic variant of Syriac (see § Syrc)
Syrj 137 Syriac (Western variant) mixed ZZ— Typographic variant of Syriac (see § Syrc)
Syrn 136 Syriac (Eastern variant) mixed ZZ— Typographic variant of Syriac (see § Syrc)
Tagb 373 Tagbanwa left-to-right Unicode Script  Tagbanwa 3.2 18 Ch 17.1
Takr 321 Takri, Ṭākrī, Ṭāṅkrī left-to-right Unicode Script  Takri 6.1 68 Ch 15.4
Tale 353 Tai Le left-to-right Unicode Script  Tai Le 4.0 35 Ch 16.5
Talu 354 New Tai Lue left-to-right Unicode Script  New Tai Lue 4.1 83 Ch 16.6
Taml 346 Tamil left-to-right Unicode Script  Tamil 1.0 123 Ch 12.6
Tang 520 Tangut vertical right-to-left, left-to-right Unicode Script  Tangut 9.0 6,914 Ancient/historic Ch 18.11
Tavt 359 Tai Viet left-to-right Unicode Script  Tai Viet 5.2 72 Ch 16.8
Tayo 380 Tai Yo top-to-bottom, columns right-to-left ZZ— Not in Unicode, proposal is mature
Telu 340 Telugu left-to-right Unicode Script  Telugu 1.0 100 Ch 12.7
Teng 290 Tengwar left-to-right ZZ— Not in Unicode
Tfng 120 Tifinagh (Berber) left-to-right, right-to-left script, top-to-bottom, bottom-to-top Unicode Script  Tifinagh 4.1 59 Ch 19.3
Tglg 370 Tagalog (Baybayin, Alibata) left-to-right Unicode Script  Tagalog 3.2 23 Ch 17.1
Thaa 170 Thaana right-to-left script Unicode Script  Thaana 3.0 50 Ch 13.1
Thai 352 Thai left-to-right Unicode Script  Thai 1.0 86 Ch 16.1
Tibt 330 Tibetan left-to-right Unicode Script  Tibetan 2.0 207 Added in 1.0, removed in 1.1 and reintroduced in 2.0 Ch 13.4
Tirh 326 Tirhuta left-to-right Unicode Script  Tirhuta 7.0 82 Ch 15.11
Tnsa 275 Tangsa left-to-right Tangsa 14.0 89 Ch 13.18
Todr 229 Todhri right-to-left ZZ— Not in Unicode, approved for version 16.0
Tols 299 Tolong Siki left-to-right ZZ— Not in Unicode, proposal is mature
Toto 294 Toto left-to-right Toto 14.0 31 Ch 13.17
Tutg 341 Tulu-Tigalari left-to-right ZZ— Not in Unicode, approved for version 16.0
Ugar 040 Ugaritic left-to-right Unicode Script  Ugaritic 4.0 31 Ancient/historic Ch 11.2
Vaii 470 Vai left-to-right Unicode Script  Vai 5.1 300 Ch 19.5
Visp 280 Visible Speech left-to-right ZZ— Not in Unicode
Vith 228 Vithkuqi left-to-right Vithkuqi 14.0 70 Ancient/historic Ch 8.12
Wara 262 Warang Citi (Varang Kshiti) left-to-right Unicode Script  Warang Citi 7.0 84 Ch 13.9
Wcho 283 Wancho left-to-right Unicode Script  Wancho 12.0 59 Ch 13.16
Wole 480 Woleai mixed ZZ— Not in Unicode, proposal is explored
Xpeo 030 Old Persian left-to-right Unicode Script  Old Persian 4.1 50 Ancient/historic Ch 11.3
Xsux 020 Cuneiform, Sumero-Akkadian left-to-right Unicode Script  Cuneiform 5.0 1,234 Ancient/historic Ch 11.1
Yezi 192 Yezidi right-to-left script Unicode Script  Yezidi 13.0 47 Ancient/historic Ch 9.6
Yiii 460 Yi left-to-right Unicode Script  Yi 3.0 1,220 Ch 18.7
Zanb 339 Zanabazar Square (Zanabazarin Dörböljin Useg, Xewtee Dörböljin Bicig, Horizontal Square Script) left-to-right Unicode Script  Zanabazar Square 10.0 72 Ancient/historic Ch 14.6
Zinh 994 Code for inherited script Inherited 657
Zmth 995 Mathematical notation ZZ— Not a 'script' in Unicode
Zsym 996 Symbols ZZ— Not a 'script' in Unicode
Zsye 993 Symbols (emoji variant) ZZ— Not a 'script' in Unicode
Zxxx 997 Code for unwritten documents ZZ— Not a 'script' in Unicode
Zyyy 998 Code for undetermined script Common 8,306
Zzzz 999 Code for uncoded script Unknown 964,234 In Unicode: All other code points
Notes
References

Missing scripts in Unicode

With each new version of Unicode, new writing systems are added to the international character code. According to a statement by linguist Dr Deborah Anderson of UC Berkeley, there are over 100 writing systems that have not yet been included in Unicode.

According to a list of the project Missing Scripts by the University of Applied Sciences Mainz, Germany, the ANRT Nancy, France and UC Berkeley, USA, there are 294 known writing systems of mankind according to the current state of research (January 2022). 131 of them have not yet been encoded in Unicode, i.e. cannot yet be used on a computer or mobile phone.

See also

References

Tags:

Unicode Script Definition and classificationUnicode Script Character categories within scriptsUnicode Script List of scripts in UnicodeUnicode Script Missing scripts in UnicodeUnicode ScriptArmenian languageControl characterEnglish alphabetFrench alphabetGerman alphabetItalian alphabetLatin alphabetLatin script in UnicodeLetter (alphabet)Ottoman Turkish alphabetTurkish languageUnicodeUnicode symbolsVietnamese languageWriting systemWritten language

🔥 Trending searches on Wiki English:

Stormy DanielsDonald TrumpBreaking BadDevin HaneyWindows 10 version historyTimothée ChalametSeppukuAshley JuddLady GagaPremier LeagueXXX (film series)Darién GapThe SimpsonsBob MarleyAmber HeardSwitzerlandWorld War IMartin Luther King Jr.Seven deadly sinsJoe AlwynIndira GandhiJürgen KloppChet HolmgrenList of most-streamed artists on SpotifyMoulin RougeVelma (TV series)WrestleMania XLNet neutralityKalanithi MaranThe Beekeeper (2024 film)Algebraic notation (chess)2024 Indian general electionFallout (video game)Solomon IslandsShai Gilgeous-AlexanderEmma RobertsAustraliaTaylor SwiftSnapchatBangladeshThe Eras TourArthur the King2024 Mutua Madrid Open – Men's singlesCharlie SheenNazi GermanyBitcoin protocolO. J. SimpsonHenry VIIIRishi Sunak2019 Indian general electionRobert Downey Jr.Pat CumminsGuy RitchieByeon Woo-seokKyle MacLachlanVoyager 1VietnamList of Marvel Cinematic Universe films2024 AFC U-23 Asian CupApple Inc.RobloxCarnation RevolutionAeroflot Flight 593IndonesiaAntónio de Oliveira SalazarRonald ReaganYellowstone (American TV series)Arne SlotBharatiya Janata PartyTom AndersonTom CruiseSean Foley (director)Bill CosbyTitanic🡆 More