Menu
Home Explore People Places Arts History Plants & Animals Science Life & Culture Technology
On this page
KOI8-R
8-bit character set and encoding extending the 7-bit ASCII set for the Latin script to the the Cyrillic script for the Russian language

KOI8-R (RFC 1489) is an 8-bit character encoding derived from the KOI-8 encoding by the programmer Andrei Chernov in 1993 and designed to cover Russian, which uses the Russian subset of a Cyrillic script. KOI-8, on its turn, is an 8-bit extension of the KOI-7 encoding, which inherited a phonetic correspondence of Russian and Latin letters from the MTK-2 teletype code. As a result, Russian Cyrillic letters in KOI8-R are in pseudo-Latin alphabetical order rather than the normal Cyrillic one like in ISO 8859-5. Although this may seem unnatural, this has the useful effect that if the 8th bit is stripped, the text remains partially readable in any ASCII-based encoding (including KOI8-R itself) as a case-reversed transliteration. For example, "Код для обмена и обработки информации" (the Russian meaning of the "KOI" acronym) becomes kOD DLQ OBMENA I OBRABOTKI INFORMACII.

KOI-8 stands for 8-bitnyy kod dlya obmena i obrabotki informatsii (Russian: 8-битный код для обмена и обработки информации) which means "8-Bit Code for Information Interchange". In Microsoft Windows, KOI8-R is assigned the code page number 20866. In IBM, KOI8-R is assigned code page 878. KOI8-R also happens to cover Bulgarian.

It lacks proper quotation marks for these languages: both «...» and the Bulgarian „...“. Windows-1251 does support these, as well as more letters, and has thus become more popular. KOI8-R is used by less than 0.004% of websites, mostly Russian and Bulgarian. Unicode and UTF-8 is preferred to single-byte Cyrillic encodings in modern applications, Unicode contains 436 Cyrillic letters including for Old Cyrillic.

We don't have any images related to KOI8-R yet.
We don't have any YouTube videos related to KOI8-R yet.
We don't have any PDF documents related to KOI8-R yet.
We don't have any Books related to KOI8-R yet.
We don't have any archived web articles related to KOI8-R yet.

Character set

The following table shows the KOI8-R encoding. Each character is shown with its equivalent Unicode code point.

KOI8-R4567
0123456789ABCDEF
0x
1x
2x SP !"#$%&'()*+,-./
3x0123456789:;<=>?
4x@ABCDEFGHIJKLMNO
5xPQRSTUVWXYZ[\]^_
6x`abcdefghijklmno
7xpqrstuvwxyz{|}~
8x─2500│2502┌250C┐2510└2514┘2518├251C┤2524┬252C┴2534┼253C▀2580▄2584█2588▌258C▐2590
9x░2591▒2592▓2593⌠2320■25A0∙2219√221A≈2248≤2264≥2265NBSP⌡2321°00B0²00B2·00B7÷00F7
Ax═2550║2551╒2552ё0451╓2553╔2554╕2555╖2556╗2557╘2558╙2559╚255A╛255B╜255C╝255D╞255E
Bx╟255F╠2560╡2561Ё0401╢2562╣2563╤2564╥2565╦2566╧2567╨2568╩2569╪256A╫256B╬256C©00A9
Cxю044Eа0430б0431ц0446д0434е0435ф0444г0433х0445и0438й0439к043Aл043Bм043Cн043Dо043E
Dxп043Fя044Fр0440с0441т0442у0443ж0436в0432ь044Cы044Bз0437ш0448э044Dщ0449ч0447ъ044A
ExЮ042EА0410Б0411Ц0426Д0414Е0415Ф0424Г0413Х0425И0418Й0419К041AЛ041BМ041CН041DО041E
FxП041FЯ042FР0420С0421Т0422У0423Ж0416В0412Ь042CЫ042BЗ0417Ш0428Э042DЩ0429Ч0427Ъ042A

See also

Further reading

References

  1. (in Russian) ГОСТ 19768-74 (СТ СЭВ 358-76). Машины вычислительные и система обработки данных. Коды 8-битные для обмена и обработки информации.

  2. "SBCS code page information - CPGID: 00878 / Name: Russian internet koi8-r". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. IBM. C-H 3-3220-050. Archived from the original on 2017-02-18. Retrieved 2017-02-18. https://www-01.ibm.com/software/globalization/cp/cp00878.html

  3. "CCSID information document; CCSID 878; KOI8-R CYRILLIC". IBM. Retrieved 2017-02-18. https://www-01.ibm.com/software/globalization/ccsid/ccsid878.html

  4. Richter, Helmut (2016-01-04) [1999-08-18]. "KOI8-R.TXT". 2.0. Retrieved 2016-12-09. http://ftp.unicode.org/Public/MAPPINGS/VENDORS/MISC/KOI8-R.TXT

  5. Code Page CPGID 00878 (pdf) (PDF), IBM https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00878.pdf

  6. Code Page CPGID 00878 (txt), IBM https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00878.txt

  7. International Components for Unicode (ICU), ibm-878_P100-1996.ucm, 2002-12-03 https://github.com/unicode-org/icu/blob/master/icu4c/source/data/mappings/ibm-878_P100-1996.ucm