ISO/IEC 8859-14:1998, part of the ISO/IEC 8859 series of ASCII-based character encodings, is commonly known as Latin-8 or Celtic. Published in 1998, it was designed to support Celtic languages including Irish, Manx, Scottish Gaelic, Welsh, Cornish, and Breton. The IANA prefers this charset name when combined with the C0 and C1 control codes from ISO/IEC 6429. Extensions include CeltScript’s Extended Latin-8, while Microsoft and FreeDOS have assigned code pages Windows-28604 and 58163 respectively to ISO-8859-14.
History
ISO-8859-14 was originally proposed for the Sami languages.3 ISO 8859-12 was proposed for Celtic.4 Later, ISO 8859-12 was proposed for Devanagari, so the Celtic proposal was changed to ISO 8859-14. The Sami proposal was changed to ISO 8859-15,5 but it got rejected as an ISO/IEC 8859 part, although it was registered as ISO-IR-197.6
The original proposal used a different arrangement of points 0xA1–BF.7 At the committee draft stage of the specification, a dotless i was included at 0xAE,8 which was changed to a registered trademark sign (matching ISO-8859-1) in the final publication.
ISO-IR-182, an earlier (registered in 1994) modification of ISO-8859-1, had added the letters Ẁ, Ẃ, Ẅ, Ỳ, Ÿ, Ŵ, Ŷ and their lowercase forms (except for ÿ, which was already included) for Welsh language use.9 The final published version of ISO-8859-14 includes these letters in the same positions which they appear at in ISO-IR-182.
Codepage layout
Differences from ISO-8859-1 have the Unicode code point number below the character.
ISO/IEC 8859-1410110 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | ||||||||||||||||
1x | ||||||||||||||||
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | |
8x | ||||||||||||||||
9x | ||||||||||||||||
Ax | NBSP | Ḃ1E02 | ḃ1E03 | £ | Ċ010A | ċ010B | Ḋ1E0A | § | Ẁ1E80 | © | Ẃ1E82 | ḋ1E0B | Ỳ1EF2 | SHY | ® | Ÿ0178 |
Bx | Ḟ1E1E | ḟ1E1F | Ġ0120 | ġ0121 | Ṁ1E40 | ṁ1E41 | ¶ | Ṗ1E56 | ẁ1E81 | ṗ1E57 | ẃ1E83 | Ṡ1E60 | ỳ1EF3 | Ẅ1E84 | ẅ1E85 | ṡ1E61 |
Cx | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
Dx | Ŵ0174 | Ñ | Ò | Ó | Ô | Õ | Ö | Ṫ1E6A | Ø | Ù | Ú | Û | Ü | Ý | Ŷ0176 | ß |
Ex | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
Fx | ŵ0175 | ñ | ò | ó | ô | õ | ö | ṫ1E6B | ø | ù | ú | û | ü | ý | ŷ0177 | ÿ |
Draft layout
The first draft had positions A0-BF different. It did not include the pilcrow sign, but included the cent sign instead at its Latin-1 position. Later, it was ruled that the pilcrow sign was more common, so the pilcrow sign remains at its Latin-1 position, and the cent sign was removed instead.
Differences from ISO-8859-14 have the Unicode code point below them.
ISO/IEC 8859-14 draft proposal12 changed rows only0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
Ax | NBSP | Ḃ1E02 | ¢00A2 | £ | ḃ1E03 | Ċ010A | ċ010B | § | Ẁ | © | Ẃ | Ṡ1E60 | Ỳ | SHY | ® | Ÿ |
Bx | Ḋ1E0A | ḋ1E0B | Ḟ1E1E | ḟ1E1F | Ġ0120 | ġ0121 | Ṁ1E40 | ṁ1E41 | ẁ | Ṗ1E56 | ẃ | ṡ1E61 | ỳ | Ẅ | ẅ | ṗ1E57 |
External links
- ISO/IEC 8859-14:1998
- ISO-IR 199 Celtic Supplementary Latin Set (May 1, 1998, submitted by Irish body NSAI/AGITS/WG6)
References
"SheetJS/js-codepage". GitHub. 12 October 2021. https://github.com/SheetJS/js-codepage/blob/cbdc9486c175be0e555f8b2573c531bb801ee457/codepage.md ↩
"Cpi/CPIISO/Codepage.TXT at master · FDOS/Cpi". GitHub. https://github.com/FDOS/cpi/blob/master/CPIISO/codepage.txt ↩
Everson, Michael. "Proposed ISO 8859-14 (later 15)". http://www.evertype.com/pics/emonopics/latin9.jpg ↩
Everson, Michael. "Proposed ISO 8859-12 (later 14)". http://www.evertype.com/pics/emonopics/latin7.jpg ↩
Everson, Michael (1996-06-19). Proposal for a new part of ISO/IEC 8859: Latin alphabet No. 9 (Sámi). /wiki/Michael_Everson ↩
Swedish Institute for Standards (1997-01-24). ISO-IR-197: Sami supplementary Latin set (PDF). ITSCJ/IPSJ. /wiki/Swedish_Standards_Institute ↩
Everson, Michael. "Proposed ISO 8859-12 (later 14)". http://www.evertype.com/pics/emonopics/latin7.jpg ↩
Everson, Michael (1997-05-05). "ISO/IEC CD 8859-14:1997 — Latin alphabet No. 8 (Celtic)" (Committee Draft). http://www.evertype.com/standards/iso8859/8859-14.html ↩
British Standards Institution (1994-03-16). ISO-IR-182: Welsh variant of Latin Alphabet No. 1 (right-hand part) (PDF). ITSCJ/IPSJ. /wiki/British_Standards_Institution ↩
Kuhn, Markus; Whistler, Ken (1999-07-27). "ISO/IEC 8859-14:1998 to Unicode". 8859 to Unicode mapping tables. Unicode, Inc. https://www.unicode.org/Public/MAPPINGS/ISO8859/DatedVersions/8859-14-1998.TXT ↩
International Components for Unicode (ICU), iso-8859_14-1998.ucm, 1999-07-27 https://github.com/unicode-org/icu/blob/master/icu4c/source/data/mappings/iso-8859_14-1998.ucm ↩
Everson, Michael. "Proposed ISO 8859-12 (later 14)". http://www.evertype.com/pics/emonopics/latin7.jpg ↩