Windows-1258 is a code page used in Microsoft Windows for representing Vietnamese text. It uses combining diacritical marks to cover the many letter-tone combinations of Vietnamese, rather than encoding every combination as a single precomposed character. Unlike Vietnamese standards such as TCVN 5712 and other encodings such as VISCII and VNI, Windows-1258 closely resembles Windows-1252, with specific adaptations for Vietnamese letters and the đồng sign (₫). IBM supports it as CCSID 1258. Modern applications prefer UTF-8, which is more widely compatible and avoids the Unicode normalization issues that arise in round-trip conversions to and from Windows-1258.
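The normalization issue can be illustrated with a short Python sketch. It relies on the `cp1258` codec from the standard library; the helper names (`to_cp1258_form`, `encode_vietnamese`, `decode_vietnamese`) are hypothetical and not part of any library. The sketch assumes the code page carries five combining tone marks (grave, acute, tilde, hook above, dot below) and that circumflex, breve, and horn are only available as part of precomposed base letters, so plain NFD is not enough: the tone mark has to be split off while the rest of the letter is recomposed.

```python
import unicodedata

# The five combining tone marks that Windows-1258 stores as separate bytes:
# grave, acute, tilde, hook above, dot below.
TONE_MARKS = {"\u0300", "\u0301", "\u0303", "\u0309", "\u0323"}


def to_cp1258_form(text: str) -> str:
    """Rewrite text so each tone is a trailing combining mark (hypothetical helper)."""
    out = []
    for ch in unicodedata.normalize("NFC", text):
        decomposed = unicodedata.normalize("NFD", ch)
        tones = "".join(c for c in decomposed if c in TONE_MARKS)
        base = "".join(c for c in decomposed if c not in TONE_MARKS)
        # Recompose the base so that e + circumflex becomes the single letter
        # ê, which the code page stores as one byte.
        out.append(unicodedata.normalize("NFC", base) + tones)
    return "".join(out)


def encode_vietnamese(text: str) -> bytes:
    """Encode text to Windows-1258 after splitting off tone marks."""
    return to_cp1258_form(text).encode("cp1258")


def decode_vietnamese(data: bytes) -> str:
    """Decode Windows-1258 bytes and recompose letters with NFC.

    The result can differ from the original string if that string was not
    in NFC to begin with, which is the round-trip problem described above.
    """
    return unicodedata.normalize("NFC", data.decode("cp1258"))


# "Vi\u1ec7t" is "Việt" with a precomposed e + circumflex + dot below.
encoded = encode_vietnamese("Vi\u1ec7t")
restored = decode_vietnamese(encoded)
```

Encoding the precomposed string directly with `"Vi\u1ec7t".encode("cp1258")` raises `UnicodeEncodeError`, because U+1EC7 has no single-byte equivalent in the code page.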
Character set
The following table shows Windows-1258. Each character is shown with its Unicode equivalent.
Windows-1258[6][7][8][9][10][11]
Code page 1129
IBM's code page 1129 (CCSID 1129 and euro sign extended CCSID 1163)[12][13][14] is similar to code page 1258, but with the following differences:
Code page 1129 (differences from code page 1258)[15][16][17][18][19][20]
   | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
8x | ||||||||||||||||
9x | ||||||||||||||||
Ax | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | œ | © | ª | « | ¬ | SHY | ® | ¯ |
Bx | ° | ± | ² | ³ | Ÿ | µ | ¶ | · | Œ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
Differences from Windows-1258 in rows Ax and Bx: 0xA8 (œ), 0xB4 (Ÿ), and 0xB8 (Œ).
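Because the Ax and Bx rows match Windows-1258 everywhere except at 0xA8, 0xB4, and 0xB8, a code page 1129 decoder can be sketched as a thin layer over Python's standard `cp1258` codec. The function `decode_cp1129` and the table `CP1129_OVERRIDES` are hypothetical names introduced for illustration. The handling of bytes 0x80 to 0x9F is an assumption: the table above leaves those rows empty, and this sketch treats them as C1 control codes with the same numeric value.

```python
# Byte values where code page 1129 differs from Windows-1258 in rows Ax and Bx,
# taken from the table above.
CP1129_OVERRIDES = {
    0xA8: "\u0153",  # œ (Windows-1258 has the diaeresis here)
    0xB4: "\u0178",  # Ÿ (Windows-1258 has the acute accent here)
    0xB8: "\u0152",  # Œ (Windows-1258 has the cedilla here)
}


def decode_cp1129(data: bytes) -> str:
    """Decode code page 1129 bytes by patching the cp1258 codec (a sketch)."""
    out = []
    for b in data:
        if b in CP1129_OVERRIDES:
            out.append(CP1129_OVERRIDES[b])
        elif 0x80 <= b <= 0x9F:
            # Assumption: these positions are C1 controls in code page 1129.
            out.append(chr(b))
        else:
            # Every other byte is decoded exactly as in Windows-1258.
            out.append(bytes([b]).decode("cp1258"))
    return "".join(out)


ligatures = decode_cp1129(b"\xa8\xb4\xb8")
d_stroke = decode_cp1129(b"\xd0\xf0")
```

Decoding byte by byte keeps the sketch short; a production converter would more likely build a full 256-entry table once and reuse it.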
External links
- IANA Charset Name Registration of windows-1258
- Michael Kaplan's blog describing the Windows 1258 encoding behavior
References
1. "Code page 1258 information document". Archived from the original on 2016-03-03. https://web.archive.org/web/20160303213408/http://www-01.ibm.com/software/globalization/cp/cp01258.html
2. "CCSID 1258 information document". Archived from the original on 2014-11-29. https://web.archive.org/web/20141129211140/http://www-01.ibm.com/software/globalization/ccsid/ccsid1258.html
3. "CCSID 5354 information document". Archived from the original on 2014-11-29. https://web.archive.org/web/20141129224353/http://www-01.ibm.com/software/globalization/ccsid/ccsid5354.html
4. Kaplan, Michael S. (2005-04-19). "A few of the gotchas of MultiByteToWideChar". Sorting it all out. http://archives.miloush.net/michkap/archive/2005/04/19/409566.html
5. Kaplan, Michael S. (2005-04-19). "A few of the gotchas of MultiByteToWideChar". Sorting it all out. http://archives.miloush.net/michkap/archive/2005/04/19/409566.html
6. Steele, Shawn (1998-04-15). "cp1258 to Unicode table". Microsoft. https://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1258.TXT
7. Unicode mappings of windows 1258 with "best fit". https://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WindowsBestFit/bestfit1258.txt
8. Code Page CPGID 01258 (pdf), IBM. https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01258.pdf
9. Code Page CPGID 01258 (txt), IBM. https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01258.txt
10. International Components for Unicode (ICU), ibm-1258_P100-1997.ucm, 2002-12-03. https://github.com/unicode-org/icu/blob/master/icu4c/source/data/mappings/ibm-1258_P100-1997.ucm
11. International Components for Unicode (ICU), ibm-5354_P100-1998.ucm, 2002-12-03. https://github.com/unicode-org/icu/blob/master/icu4c/source/data/mappings/ibm-5354_P100-1998.ucm
12. "Code page 1129 information document". Archived from the original on 2010-09-21. https://web.archive.org/web/20100921000629/http://www-01.ibm.com/software/globalization/cp/cp01129.html
13. "CCSID 1129 information document". Archived from the original on 2016-03-27. https://web.archive.org/web/20160327161405/http://www-01.ibm.com/software/globalization/ccsid/ccsid1129.html
14. "CCSID 1163 information document". Archived from the original on 2014-11-29. https://web.archive.org/web/20141129225256/http://www-01.ibm.com/software/globalization/ccsid/ccsid1163.html
15. Lunde, Ken (13 January 2009). "Appendix L: Vietnamese Character Sets" (PDF). CJKV Information Processing (2nd ed.). ISBN 978-0-596-51447-1.
16. Code Page CPGID 01129 (pdf), IBM. https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01129.pdf
17. Code Page CPGID 01129 (txt), IBM. https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01129.txt
18. International Components for Unicode (ICU), ibm-1129_P100-1997.ucm, 2002-12-03. https://github.com/unicode-org/icu/blob/master/icu4c/source/data/mappings/ibm-1129_P100-1997.ucm
19. Code Page CPGID 01163 (pdf), IBM. https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01163.pdf
20. Code Page CPGID 01163 (txt), IBM. https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01163.txt