Menu
Home Explore People Places Arts History Plants & Animals Science Life & Culture Technology
On this page
Basic Latin (Unicode block)
Unicode character block

The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character.

The Basic Latin block was included in its present form from version 1.0.0 of the Unicode Standard, without addition or alteration of the character repertoire. Its block name in Unicode 1.0 was ASCII.

We don't have any images related to Basic Latin (Unicode block) yet.
We don't have any YouTube videos related to Basic Latin (Unicode block) yet.
We don't have any PDF documents related to Basic Latin (Unicode block) yet.
We don't have any Books related to Basic Latin (Unicode block) yet.
We don't have any archived web articles related to Basic Latin (Unicode block) yet.

Table of characters

CodeResultDescriptionAcronym
C0 controls
U+0000Null characterNUL
U+0001Start of HeadingSOH
U+0002Start of TextSTX
U+0003End-of-text characterETX
U+0004End-of-transmission characterEOT
U+0005Enquiry characterENQ
U+0006Acknowledge characterACK
U+0007Bell characterBEL
U+0008BackspaceBS
U+0009Horizontal tabHT
U+000ALine feedLF
U+000BVertical tabVT
U+000CForm feedFF
U+000DCarriage returnCR
U+000EShift OutSO
U+000FShift InSI
U+0010Data Link EscapeDLE
U+0011Device Control 1DC1
U+0012Device Control 2DC2
U+0013Device Control 3DC3
U+0014Device Control 4DC4
U+0015Negative-acknowledge characterNAK
U+0016Synchronous IdleSYN
U+0017End of Transmission BlockETB
U+0018Cancel characterCAN
U+0019End of MediumEM
U+001ASubstitute characterSUB
U+001BEscape characterESC
U+001CFile SeparatorFS
U+001DGroup SeparatorGS
U+001ERecord SeparatorRS
U+001FUnit SeparatorUS
ASCII punctuation and symbols
U+0020 SpaceSP
U+0021!Exclamation markEXC
U+0022"Quotation markQUO
U+0023#Number sign
U+0024$Dollar sign
U+0025%Percent sign
U+0026&Ampersand
U+0027'Apostrophe
U+0028(Left parenthesis
U+0029)Right parenthesis
U+002A*Asterisk
U+002B+Plus sign
U+002C,Comma
U+002D-Hyphen-minus
U+002E.Full stop or period
U+002F/Solidus or Slash
ASCII digits
U+00300Digit Zero
U+00311Digit One
U+00322Digit Two
U+00333Digit Three
U+00344Digit Four
U+00355Digit Five
U+00366Digit Six
U+00377Digit Seven
U+00388Digit Eight
U+00399Digit Nine
ASCII punctuation and symbols
U+003A:Colon
U+003B;Semicolon
U+003C<Less-than sign
U+003D=Equal sign
U+003E>Greater-than sign
U+003F?Question mark
U+0040@At sign or Commercial at
Uppercase Latin alphabet
U+0041ALatin Capital letter A
U+0042BLatin Capital letter B
U+0043CLatin Capital letter C
U+0044DLatin Capital letter D
U+0045ELatin Capital letter E
U+0046FLatin Capital letter F
U+0047GLatin Capital letter G
U+0048HLatin Capital letter H
U+0049ILatin Capital letter I
U+004AJLatin Capital letter J
U+004BKLatin Capital letter K
U+004CLLatin Capital letter L
U+004DMLatin Capital letter M
U+004ENLatin Capital letter N
U+004FOLatin Capital letter O
U+0050PLatin Capital letter P
U+0051QLatin Capital letter Q
U+0052RLatin Capital letter R
U+0053SLatin Capital letter S
U+0054TLatin Capital letter T
U+0055ULatin Capital letter U
U+0056VLatin Capital letter V
U+0057WLatin Capital letter W
U+0058XLatin Capital letter X
U+0059YLatin Capital letter Y
U+005AZLatin Capital letter Z
ASCII punctuation and symbols
U+005B[Left Square Bracket
U+005C\Backslash [A]
U+005D]Right Square Bracket
U+005E^Circumflex accent
U+005F_Low line
U+0060`Grave accent
Lowercase Latin alphabet
U+0061aLatin Small Letter A
U+0062bLatin Small Letter B
U+0063cLatin Small Letter C
U+0064dLatin Small Letter D
U+0065eLatin Small Letter E
U+0066fLatin Small Letter F
U+0067gLatin Small Letter G
U+0068hLatin Small Letter H
U+0069iLatin Small Letter I
U+006AjLatin Small Letter J
U+006BkLatin Small Letter K
U+006ClLatin Small Letter L
U+006DmLatin Small Letter M
U+006EnLatin Small Letter N
U+006FoLatin Small Letter O
U+0070pLatin Small Letter P
U+0071qLatin Small Letter Q
U+0072rLatin Small Letter R
U+0073sLatin Small Letter S
U+0074tLatin Small Letter T
U+0075uLatin Small Letter U
U+0076vLatin Small Letter V
U+0077wLatin Small Letter W
U+0078xLatin Small Letter X
U+0079yLatin Small Letter Y
U+007AzLatin Small Letter Z
ASCII punctuation and symbols
U+007B{Left Curly Bracket
U+007C|Vertical bar
U+007D}Right Curly Bracket
U+007E~Tilde
Control character
U+007FDeleteDEL
A The letter U+005C (\) may show up as a Yen(¥) or Won(₩) sign in Japanese/Korean fonts mistaking Unicode (especially UTF-8) as a legacy character set which replaced the backslash with these signs.5

Subheadings

The C0 Controls and Basic Latin block contains six subheadings.6

C0 controls

The C0 Controls, referred to as C0 ASCII control codes in version 1.0, are inherited from ASCII and other 7-bit and 8-bit encoding schemes. The Alias names for C0 controls are taken from the ISO/IEC 6429:1992 standard.7

ASCII punctuation and symbols

This subheading refers to standard punctuation characters, simple mathematical operators, and symbols like the dollar sign, percent, ampersand, underscore, and pipe.8

ASCII digits

The ASCII Digits subheading contains the standard European number characters 1–9 and 0.9

Uppercase Latin alphabet

The Uppercase Latin alphabet subheading contains the standard 26-letter unaccented Latin alphabet in the majuscule.10

Lowercase Latin alphabet

The Lowercase Latin Alphabet subheading contains the standard 26-letter unaccented Latin alphabet in the minuscule.11

Control character

The Control Character subheading contains the "Delete" character.12

Number of symbols, letters and control codes

The table below shows the number of letters, symbols and control codes in each of the subheadings in the C0 Controls and Basic Latin block.

SubheadingNumber of symbolsRange of characters
C0 controls32 control codesU+0000 to U+001F
ASCII punctuation and symbols33 punctuation marks and symbolsU+0020 to U+002F, U+003A to U+0040, U+005B to U+0060 and U+007B to U+007E
ASCII digits10 digitsU+0030 to U+0039
Uppercase Latin Alphabet26 unaccented Latin letters in the majuscule.U+0041 to U+005A
Lowercase Latin Alphabet26 unaccented Latin letters in the minuscule.U+0061 to U+007A
Control character1 control code containing the "Delete" character.U+007F

Chart

C0 Controls and Basic Latin13Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+000xNULSOHSTXETXEOTENQACKBEL BS  HT  LF  VT  FF  CR  SO  SI 
U+001xDLEDC1DC2DC3DC4NAKSYNETBCAN EM SUBESC FS  GS  RS  US 
U+002x SP !"#$%&'()*+,-./
U+003x0123456789:;<=>?
U+004x@ABCDEFGHIJKLMNO
U+005xPQRSTUVWXYZ[\]^_
U+006x`abcdefghijklmno
U+007xpqrstuvwxyz{|}~DEL

Variants

Several of the characters are defined to render as a standardized variant if followed by variant indicators.

A variant is defined for a zero with a short diagonal stroke: U+0030 DIGIT ZERO, U+FE00 VS1 (0︀).1415

Twelve characters (#, *, and the digits) can be followed by U+FE0E VS15 or U+FE0F VS16 to create emoji variants.16171819 They are keycap base characters, for example #️⃣ (U+0023 NUMBER SIGN U+FE0F VS16 U+20E3 COMBINING ENCLOSING KEYCAP). The VS15 version is "text presentation" while the VS16 version is "emoji-style".20

Emoji variation sequences
U+0023002A0030003100320033003400350036003700380039
base#*0123456789
base+VS15+keycap#︎⃣*︎⃣0︎⃣1︎⃣2︎⃣3︎⃣4︎⃣5︎⃣6︎⃣7︎⃣8︎⃣9︎⃣
base+VS16+keycap#️⃣*️⃣0️⃣1️⃣2️⃣3️⃣4️⃣5️⃣6️⃣7️⃣8️⃣9️⃣

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Basic Latin block:

VersionFinal code points21CountUTC IDL2 IDWG2 IDDocument
1.0.0U+0000..007F128(to be determined)
UTC/1999-013Karlsson, Kent (1999-05-27), Tildes and micro sign decompositions
L2/99-176RMoore, Lisa (1999-11-04), "Micro Sign Case Mappings", Minutes from the joint UTC/L2 meeting in Seattle, June 8-10, 1999
L2/04-145Starner, David (2004-04-30), C with stroke character examples from BAE report 1884 (Dorsey)
L2/04-202Anderson, Deborah (2004-06-07), Slashed C Feedback
N3046Suignard, Michel (2006-02-22), Improving formal definition for control characters
N3103 (pdf, doc)Umamaheswaran, V. S. (2006-08-25), "M48.33", Unconfirmed minutes of WG 2 meeting 48, Mountain View, CA, USA; 2006-04-24/27
L2/11-043Freytag, Asmus; Karlsson, Kent (2011-02-02), Proposal to correct mistakes and inconsistencies in certain property assignments for super and subscripted letters
L2/11-160PRI #181 Changing General Category of Twelve Characters, 2011-05-02
L2/11-261R2Moore, Lisa (2011-08-16), "Consensus 128-C3", UTC #128 / L2 #225 Minutes, Accept Ken Whistler's recommendations in L2/11-281 on name aliases for control characters with the addition of the abbreviations BEL and NUL.
L2/11-4382223N4182Edberg, Peter (2011-12-22), Emoji Variation Sequences (Revision of L2/11-429)
L2/15-107Moore, Lisa (2015-05-12), "Consensus 143-C5", UTC #143 Minutes, Add the 12 keycap sequences in emoji-data.txt as provisional named sequences in Unicode 8.0.
L2/15-268Beeton, Barbara; Freytag, Asmus; Iancu, Laurențiu; Sargent, Murray (2015-10-30), Proposal to Represent the Slashed Zero Variant of Empty Set
L2/15-3012425Pournader, Roozbeh (2015-11-01), A proposal for 278 standardized variation sequences for emoji
L2/15-254Moore, Lisa (2015-11-16), "B.12.1.2 Proposal to Represent the Slashed Zero Variant of Empty Set", UTC #145 Minutes
L2/17-294N4914Lunde, Ken (2017-08-14), Proposal to add standardized variation sequence for U+FF10 FULLWIDTH DIGIT ZERO
L2/22-019Scherer, Markus; et al. (2022-01-19), "F.2 F4: U+0019 in ISO vs. NameAliases.txt vs. chart/NamesList.txt", UTC #170 properties feedback & recommendations
L2/22-016Constable, Peter (2022-04-21), "Consensus 170-C24", UTC #170 Minutes, For U+0019, add a Name alias "EM" of type abbreviation, for Unicode version 15.0.

See also

  • Internet portal
  • Language portal

References

  1. "block.txt". The Unicode Consortium. Retrieved 2023-03-23. https://www.unicode.org/Public/UCD/latest/ucd/Blocks.txt

  2. "C0 Controls and Basic Latin" (PDF). The Unicode Standard, Version 15.0. Unicode, Inc. 2022. Retrieved March 22, 2023. https://www.unicode.org/charts/PDF/U0000.pdf

  3. The Unicode Standard Version 1.0, Volume 1. Addison-Wesley Publishing Company, Inc. 1990. ISBN 0-201-56788-1. 0-201-56788-1

  4. "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium. https://www.unicode.org/versions/Unicode1.0.0/CodeCharts2.pdf

  5. Michael S. Kaplan (2005-09-17). "When is a backslash not a backslash?". Sorting it all Out. Microsoft. Archived from the original on 2010-06-12. Also available at: http://archives.miloush.net/michkap/archive/2005/09/17/469941.html https://web.archive.org/web/20100612050134/http://blogs.msdn.com/b/michkap/archive/2005/09/17/469941.aspx

  6. "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013. https://www.unicode.org/Public/6.2.0/charts/CodeCharts.pdf

  7. "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013. https://www.unicode.org/Public/6.2.0/charts/CodeCharts.pdf

  8. "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013. https://www.unicode.org/Public/6.2.0/charts/CodeCharts.pdf

  9. "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013. https://www.unicode.org/Public/6.2.0/charts/CodeCharts.pdf

  10. "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013. https://www.unicode.org/Public/6.2.0/charts/CodeCharts.pdf

  11. "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013. https://www.unicode.org/Public/6.2.0/charts/CodeCharts.pdf

  12. "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013. https://www.unicode.org/Public/6.2.0/charts/CodeCharts.pdf

  13. As of Unicode version 16.0

  14. Beeton, Barbara; Freytag, Asmus; Iancu, Laurențiu; Sargent, Murray (2015-10-30). "L2/15-268: Proposal to Represent the Slashed Zero Variant of Empty Set" (PDF). https://www.unicode.org/L2/L2015/15268-slashed-zero.pdf

  15. "UTS #51 Emoji Variation Sequences". The Unicode Consortium. https://unicode.org/Public/UNIDATA/emoji/emoji-variation-sequences.txt

  16. Edberg, Peter (2011-12-22). "L2/11-438: Emoji Variation Sequences (Revision of L2/11-429)" (PDF). https://www.unicode.org/L2/L2011/11438-emoji-var.pdf

  17. Pournader, Roozbeh (2015-11-01). "L2/15-301: A proposal for 278 standardized variation sequences for emoji" (PDF). https://www.unicode.org/L2/L2015/15301-emoji-sequences.pdf

  18. "UTR #51: Unicode Emoji". Unicode Consortium. 2023-09-05. http://unicode.org/reports/tr51/

  19. "UCD: Emoji Data for UTR #51". Unicode Consortium. 2023-02-01. https://unicode.org/Public/UNIDATA/emoji/emoji-data.txt

  20. "UTS #51 Emoji Variation Sequences". The Unicode Consortium. https://unicode.org/Public/UNIDATA/emoji/emoji-variation-sequences.txt

  21. Proposed code points and characters names may differ from final code points and names

  22. See also L2/10-458, L2/11-414, L2/11-415, and L2/11-429 https://www.unicode.org/L2/L2010/10458-emoji-var.pdf

  23. Refer to the history section of the Miscellaneous Symbols and Pictographs block for additional emoji-related documents /wiki/Miscellaneous_Symbols_and_Pictographs#History

  24. See also L2/15-198 and L2/15-275 https://www.unicode.org/L2/L2015/15198-varseq-text-emoji.pdf

  25. Refer to the history section of the Miscellaneous Symbols and Pictographs block for additional emoji-related documents /wiki/Miscellaneous_Symbols_and_Pictographs#History