Latin Extended-B | |
---|---|
Range | U+0180..U+024F (208 code points) |
Plane | BMP |
Scripts | Latin |
Major alphabets |
Africa alphabet Pan-Nigerian Americanist Khoisan Pinyin Romanian |
Assigned | 208 code points |
Unused | 0 reserved code points |
Unicode version history | |
1.0.0 | 113 (+113) |
1.1 | 148 (+35) |
3.0 | 178 (+30) |
3.2 | 179 (+1) |
4.0 | 183 (+4) |
4.1 | 194 (+11) |
5.0 | 208 (+14) |
Note: |
Latin Extended-B is a block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points U+0180..U+01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block was expanded, and another 65 characters were added. In version 3.0, the last thirty available code points in the block were assigned.
The Latin Extended-B block contains ten subheadings for groups of characters: Non-European and historic Latin, African letters for clicks, Croatian digraphs matching Serbian Cyrillic letters, Pinyin diacritic-vowel combinations, Phonetic and historic letters, Additions for Slovenian and Croatian, Additions for Romanian, Miscellaneous additions, Additions for Livonian, and Additions for Sinology. The Non-European and historic, African clicks, Croatian digraphs, Pinyin, and the first part of the Phonetic and historic letters were present in Unicode 1.0; additional Phonetic and historic letters were added for version 3.0; and other Phonetic and historic, as well as the rest of the sub-blocks were the characters added for version 1.1.
The Non-European and historic Latin subheading contains the first 64 characters of the block, and includes various variant letters for use in Zhuang, Americanist phonetic transcription, African languages, and other Latin script alphabets. It does not contain any standard letters with diacritics.
The four African letters for clicks are used in Khoisan orthography.
The Croatian digraphs matching Serbian Cyrillic letters are three sets of three case mappings (lower case, upper case, and title case) of Latin digraphs used for compatibility with Cyrillic texts, the difference between the Croatian and Serbian languages largely being Latin vs. Cyrillic writing.
The 16 Pinyin diacritic-vowel combinations are used to represent the standard Mandarin Chinese vowel sounds with tone marks.
The 35 Phonetic and historic letters are largely various standard and variant Latin letters with diacritic marks.
The 24 Additions for Slovenian and Croatian are all standard Latin letters with unusual diacritics, like the double grave and inverted breve.
The Additions for Romanian are 4 characters that were erroneously unified as having a cedilla, when they have a comma below. The conflation of S and T with cedilla vs. comma below continues to plague Romanian language implementation up to the present.
The Miscellaneous additions subheading contains 39 characters of various description and origin.