The
Unicode code block Latin-1 Supplement
extends the basic 26 letter pairs of
ASCII by providing additional letters for major languages of
Europe. Like ASCII, the Latin-1 set also includes a
miscellaneous set of
punctuation and
mathematical signs.
The languages supported by the Latin-1 supplement include Danish, Dutch, Faroese, Finnish, Flemish, German, Icelandic, Irish, Italian, Norwegian, Portuguese, Spanish and Swedish.
U+00AA ª feminine ordinal indicator
and
U+00BA º masculine ordinal indicator
can be depicted with an underscore, but many modern fonts show them as superscripted Latin letters with no underscore. In sorting and searching, these characters should be treated as weakly equivalent to their Latin character equivalents.
All diacritics in this block are unambiguously spacing characters. For combining or non-spacing diacritics, see the Combining Diacritical Marks code block.
Unicode's
Latin-1 Supplement code block reserves the
128 code points from U+0080 to U+00FF, of which all 128 are currently assigned.
Basic Latin <-- Latin-1 Supplement --> Latin Extended A
All the characters in this code block were added in Unicode 1.1
Number of characters in each General Category :
Letter, Uppercase Lu : 30
Letter, Lowercase Ll : 35
Number, Other No : 6
Punctuation, Initial quote Pi : 1
Punctuation, Final quote Pf : 1
Punctuation, Other Po : 3
Symbol, Math Sm : 4
Symbol, Currency Sc : 4
Symbol, Modifier Sk : 4
Symbol, Other So : 6
Separator, Space Zs : 1
Other, Control Cc : 32
Other, Format Cf : 1
Number of characters in each Bidirectional Category :
Left To Right L : 65
European Number EN : 3
European Number Terminator ET : 6
Common Number Separator CS : 1
Boundary Neutral BN : 32
Paragraph Separator B : 1
Other Neutral ON : 20
The columns below should be interpreted as :
- The Unicode code for the character
- The character in question
- The Unicode name for the character
- The Unicode General Category for the character
- The Unicode Bidirectional Category for the character
If the characters below show up poorly, or not at all, see Unicode Support for possible solutions.
Latin-1 Supplement
C1 controls
Alias names are those for ISO/IEC 6429:1992.
- U+0080 control Cc BN
- U+0081 control Cc BN
- U+0082 break permitted here Cc BN
- ref U+200B zero width space (General Punctuation)
- U+0083 no break here Cc BN
- ref U+2060 word joiner (General Punctuation)
- U+0084 control Cc BN
- * formerly known as INDEX
- U+0085
next line Cc B
- aka next line (nel)
- U+0086 start of selected area Cc BN
- U+0087 end of selected area Cc BN
- U+0088 character tabulation set Cc BN
- U+0089 character tabulation with justification Cc BN
- U+008A line tabulation set Cc BN
- U+008B partial line forward Cc BN
- U+008C partial line backward Cc BN
- U+008D reverse line feed Cc BN
- U+008E single shift two Cc BN
- U+008F single shift three Cc BN
- U+0090 device control string Cc BN
- U+0091 private use one Cc BN
- U+0092 private use two Cc BN
- U+0093 set transmit state Cc BN
- U+0094 cancel character Cc BN
- U+0095 message waiting Cc BN
- U+0096 start of guarded area Cc BN
- U+0097 end of guarded area Cc BN
- U+0098 start of string Cc BN
- U+0099 control Cc BN
- U+009A single character introducer Cc BN
- U+009B control sequence introducer Cc BN
- U+009C string terminator Cc BN
- U+009D operating system command Cc BN
- U+009E privacy message Cc BN
- U+009F application program command Cc BN
Latin-1 punctuation and symbols
Based on ISO/IEC 8859-1 (aka Latin-1) from here.
- U+00A0 no break space Zs CS
- html
- sgml
- * commonly abbreviated as NBSP
- ref U+0020 space (Basic Latin)
- ref U+2007 figure space (General Punctuation)
- ref U+202F narrow no break space (General Punctuation)
- ref U+2060 word joiner (General Punctuation)
- ref U+FEFF zero width no break space (Arabic Presentation Forms B)
- U+00A1 ¡ inverted exclamation mark Po ON
- html ¡
- sgml ¡
- * Spanish, Asturian, Galician
- ref U+0021 ! exclamation mark (Basic Latin)
- U+00A2 ¢ cent sign Sc ET
- html ¢
- sgml ¢
- * Cent
- U+00A3 £ pound sign Sc ET
- html £
- sgml £
- aka pound sterling, irish punt, italian lira, turkish lira, etc.
- * Pound
- ref U+20A4 ₤ lira sign (Currency Symbols)
- ref U+10192 𐆒 Roman semuncia sign (Ancient Symbols)
- U+00A4 ¤ currency sign Sc ET
- html ¤
- sgml ¤
- * other currency symbol characters: 20A0-20B5
- ref U+0024 $ dollar sign (Basic Latin)
- U+00A5 ¥ yen sign Sc ET
- html ¥
- sgml ¥
- aka yuan sign
- * Yen
- * glyph may have one or two crossbars
- U+00A6 ¦ broken bar So ON
- html ¦
- sgml ¦
- aka broken vertical bar (1.0)
- aka parted rule (in typography)
- U+00A7 § section sign So ON
- html §
- sgml §
- * paragraph sign in some European usage
- U+00A8 ¨ diaeresis Sk ON
- html ¨
- sgml ¨ ¨ ¨
- * this is a spacing character
- ref U+0308 ̈ combining diaeresis (Combining Diacritical Marks)
- U+00A9 © copyright sign So ON
- html ©
- sgml ©
- ref U+2117 ℗ sound recording copyright (Letterlike Symbols)
- ref U+24B8 Ⓒ circled Latin capital letter C (Enclosed Alphanumerics)
- U+00AA ª feminine ordinal indicator Ll L
- html ª
- sgml ª
- * Spanish
- U+00AB « left pointing double angle quotation mark Pi ON
- html «
- sgml «
- aka left guillemet
- aka chevrons (in typography)
- * usually opening, sometimes closing
- ref U+226A ≪ much less than (Mathematical Operators)
- ref U+300A 《 left double angle bracket (CJK Symbols and Punctuation)
- U+00AC ¬ not sign Sm ON
- html ¬
- sgml ⫬ ¬
- aka angled dash (in typography)
- ref U+2310 ⌐ reversed not sign (Miscellaneous Technical)
- U+00AD soft hyphen Cf BN
- html ­
- sgml ­
- aka discretionary hyphen
- * commonly abbreviated as SHY
- U+00AE ® registered sign So ON
- html ®
- sgml ®
- aka registered trade mark sign (1.0)
- ref U+24C7 Ⓡ circled Latin capital letter R (Enclosed Alphanumerics)
- U+00AF ¯ macron Sk ON
- html ¯
- sgml ® ¯ ®
- aka overline, APL overbar
- * this is a spacing character
- ref U+02C9 ˉ modifier letter macron (Spacing Modifier Letters)
- ref U+0304 ̄ combining macron (Combining Diacritical Marks)
- ref U+0305 ̅ combining overline (Combining Diacritical Marks)
- U+00B0 ° degree sign So ET
- html °
- sgml °
- * this is a spacing character
- ref U+02DA ˚ ring above (Spacing Modifier Letters)
- ref U+030A ̊ combining ring above (Combining Diacritical Marks)
- ref U+2070 ⁰ superscript zero (Superscripts and Subscripts)
- ref U+2218 ∘ ring operator (Mathematical Operators)
- U+00B1 ± plus minus sign Sm ET
- html ±
- sgml ± ± ±
- ref U+2213 ∓ minus or plus sign (Mathematical Operators)
- U+00B2 ² superscript two No EN
- html ²
- sgml ²
- aka squared
- * other superscript digit characters: 2070-2079
- ref U+00B9 ¹ superscript one (Latin-1 Supplement)
- U+00B3 ³ superscript three No EN
- html ³
- sgml ³
- aka cubed
- ref U+00B9 ¹ superscript one (Latin-1 Supplement)
- U+00B4 ´ acute accent Sk ON
- html ´
- sgml ´
- * this is a spacing character
- ref U+02B9 ʹ modifier letter prime (Spacing Modifier Letters)
- ref U+02CA ˊ modifier letter acute accent (Spacing Modifier Letters)
- ref U+0301 ́ combining acute accent (Combining Diacritical Marks)
- ref U+2032 ′ prime (General Punctuation)
- U+00B5 µ micro sign Ll L
- html µ
- sgml µ
- U+00B6 ¶ pilcrow sign So ON
- html ¶
- sgml ¶
- aka paragraph sign
- * section sign in some European usage
- ref U+204B ⁋ reversed pilcrow sign (General Punctuation)
- ref U+2761 ❡ curved stem paragraph sign ornament (Dingbats)
- U+00B7 · middle dot Po ON
- html ·
- sgml · · ·
- aka midpoint (in typography)
- aka Georgian comma
- aka Greek middle dot (ano teleia)
- ref U+0387 · Greek ano teleia (Greek and Coptic)
- ref U+2022 • bullet (General Punctuation)
- ref U+2024 ․ one dot leader (General Punctuation)
- ref U+2027 ‧ hyphenation point (General Punctuation)
- ref U+2219 ∙ bullet operator (Mathematical Operators)
- ref U+22C5 ⋅ dot operator (Mathematical Operators)
- ref U+30FB ・ Katakana middle dot (Katakana)
- U+00B8 ¸ cedilla Sk ON
- html ¸
- sgml ¸
- * this is a spacing character
- * other spacing accent characters: 02D8-02DB
- ref U+0327 ̧ combining cedilla (Combining Diacritical Marks)
- U+00B9 ¹ superscript one No EN
- html ¹
- sgml ¹
- ref U+00B2 ² superscript two (Latin-1 Supplement)
- ref U+00B3 ³ superscript three (Latin-1 Supplement)
- U+00BA º masculine ordinal indicator Ll L
- html º
- sgml º
- * Spanish
- U+00BB » right pointing double angle quotation mark Pf ON
- html »
- sgml »
- aka right guillemet
- * usually closing, sometimes opening
- ref U+226B ≫ much greater than (Mathematical Operators)
- ref U+300B 》 right double angle bracket (CJK Symbols and Punctuation)
- U+00BC ¼ vulgar fraction one quarter No ON
- html ¼
- sgml ¼
- * bar may be horizontal or slanted
- * other fraction characters: 2153-215E
- U+00BD ½ vulgar fraction one half No ON
- html ½
- sgml ½ ½
- * bar may be horizontal or slanted
- U+00BE ¾ vulgar fraction three quarters No ON
- html ¾
- sgml ¾
- * bar may be horizontal or slanted
- U+00BF ¿ inverted question mark Po ON
- html ¿
- sgml ¿
- aka turned question mark
- * Spanish
- ref U+003F ? question mark (Basic Latin)
- ref U+2E2E ⸮ reversed question mark (Supplemental Punctuation)
Letters
- U+00C0 À Latin capital letter A with grave Lu L
- html À
- sgml À
- U+00C1 Á Latin capital letter A with acute Lu L
- html Á
- sgml Á
- U+00C2 Â Latin capital letter A with circumflex Lu L
- html Â
- sgml Â
- U+00C3 Ã Latin capital letter A with tilde Lu L
- html Ã
- sgml Ã
- U+00C4 Ä Latin capital letter A with diaeresis Lu L
- html Ä
- sgml Ä
- U+00C5 Å Latin capital letter A with ring above Lu L
- html Å
- sgml Å
- ref U+212B Å angstrom sign (Letterlike Symbols)
- U+00C6 Æ Latin capital letter ae Lu L
- html Æ
- sgml Æ
- aka Latin capital ligature ae (1.0)
- U+00C7 Ç Latin capital letter C with cedilla Lu L
- html Ç
- sgml Ç
- U+00C8 È Latin capital letter E with grave Lu L
- html È
- sgml È
- U+00C9 É Latin capital letter E with acute Lu L
- html É
- sgml É
- U+00CA Ê Latin capital letter E with circumflex Lu L
- html Ê
- sgml Ê
- U+00CB Ë Latin capital letter E with diaeresis Lu L
- html Ë
- sgml Ë
- U+00CC Ì Latin capital letter I with grave Lu L
- html Ì
- sgml Ì
- U+00CD Í Latin capital letter I with acute Lu L
- html Í
- sgml Í
- U+00CE Î Latin capital letter I with circumflex Lu L
- html Î
- sgml Î
- U+00CF Ï Latin capital letter I with diaeresis Lu L
- html Ï
- sgml Ï
- U+00D0 Ð Latin capital letter eth Lu L
- html Ð
- sgml Ð
- ref U+00F0 ð Latin small letter eth (Latin-1 Supplement)
- ref U+0110 Đ Latin capital letter D with stroke (Latin Extended A)
- ref U+0189 Ɖ Latin capital letter african d (Latin Extended B)
- U+00D1 Ñ Latin capital letter N with tilde Lu L
- html Ñ
- sgml Ñ
- U+00D2 Ò Latin capital letter O with grave Lu L
- html Ò
- sgml Ò
- U+00D3 Ó Latin capital letter O with acute Lu L
- html Ó
- sgml Ó
- U+00D4 Ô Latin capital letter O with circumflex Lu L
- html Ô
- sgml Ô
- U+00D5 Õ Latin capital letter O with tilde Lu L
- html Õ
- sgml Õ
- U+00D6 Ö Latin capital letter O with diaeresis Lu L
- html Ö
- sgml Ö
Mathematical operator
- U+00D7 × multiplication sign Sm ON
- html ×
- sgml ×
- aka z notation cartesian product
Letters
- U+00D8 Ø Latin capital letter O with stroke Lu L
- html Ø
- sgml Ø
- aka o slash
- ref U+2205 ∅ empty set (Mathematical Operators)
- U+00D9 Ù Latin capital letter U with grave Lu L
- html Ù
- sgml Ù
- U+00DA Ú Latin capital letter U with acute Lu L
- html Ú
- sgml Ú
- U+00DB Û Latin capital letter U with circumflex Lu L
- html Û
- sgml Û
- U+00DC Ü Latin capital letter U with diaeresis Lu L
- html Ü
- sgml Ü
- U+00DD Ý Latin capital letter Y with acute Lu L
- html Ý
- sgml Ý
- U+00DE Þ Latin capital letter thorn Lu L
- html Þ
- sgml Þ
- U+00DF ß Latin small letter sharp s Ll L
- html ß
- sgml ß
- aka eszett
- * German
- * uppercase is "SS"
- * in origin a ligature of 017F and 0073
- ref U+03B2 β Greek small letter beta (Greek and Coptic)
- ref U+1E9E ẞ Latin capital letter sharp s (Latin Extended Additional)
- U+00E0 à Latin small letter A with grave Ll L
- html à
- sgml à
- U+00E1 á Latin small letter A with acute Ll L
- html á
- sgml á
- U+00E2 â Latin small letter A with circumflex Ll L
- html â
- sgml â
- U+00E3 ã Latin small letter A with tilde Ll L
- html ã
- sgml ã
- * Portuguese
- U+00E4 ä Latin small letter A with diaeresis Ll L
- html ä
- sgml ä
- U+00E5 å Latin small letter A with ring above Ll L
- html å
- sgml å
- * Danish, Norwegian, Swedish, Walloon
- U+00E6 æ Latin small letter ae Ll L
- html æ
- sgml æ
- aka Latin small ligature ae (1.0)
- aka ash (from old english ?sc)
- * Danish, Norwegian, Icelandic, Faroese, Old English, French, IPA
- ref U+0153 œ Latin small ligature oe (Latin Extended A)
- ref U+04D5 ӕ Cyrillic small ligature a ie (Cyrillic)
- U+00E7 ç Latin small letter C with cedilla Ll L
- html ç
- sgml ç
- U+00E8 è Latin small letter E with grave Ll L
- html è
- sgml è
- U+00E9 é Latin small letter E with acute Ll L
- html é
- sgml é
- U+00EA ê Latin small letter E with circumflex Ll L
- html ê
- sgml ê
- U+00EB ë Latin small letter E with diaeresis Ll L
- html ë
- sgml ë
- U+00EC ì Latin small letter I with grave Ll L
- html ì
- sgml ì
- * Italian, Malagasy
- U+00ED í Latin small letter I with acute Ll L
- html í
- sgml í
- U+00EE î Latin small letter I with circumflex Ll L
- html î
- sgml î
- U+00EF ï Latin small letter I with diaeresis Ll L
- html ï
- sgml ï
- U+00F0 ð Latin small letter eth Ll L
- html ð
- sgml ð
- * Icelandic, Faroese, Old English, IPA
- ref U+00D0 Ð Latin capital letter eth (Latin-1 Supplement)
- ref U+03B4 δ Greek small letter delta (Greek and Coptic)
- ref U+2202 ∂ partial differential (Mathematical Operators)
- U+00F1 ñ Latin small letter N with tilde Ll L
- html ñ
- sgml ñ
- U+00F2 ò Latin small letter O with grave Ll L
- html ò
- sgml ò
- U+00F3 ó Latin small letter O with acute Ll L
- html ó
- sgml ó
- U+00F4 ô Latin small letter O with circumflex Ll L
- html ô
- sgml ô
- U+00F5 õ Latin small letter O with tilde Ll L
- html õ
- sgml õ
- * Portuguese, Estonian
- U+00F6 ö Latin small letter O with diaeresis Ll L
- html ö
- sgml ö
Mathematical operator
- U+00F7 ÷ division sign Sm ON
- html ÷
- sgml ÷ ÷
- ref U+2215 ∕ division slash (Mathematical Operators)
- ref U+2223 ∣ divides (Mathematical Operators)
Letters
- U+00F8 ø Latin small letter O with stroke Ll L
- html ø
- sgml ø
- aka o slash
- * Danish, Norwegian, Faroese, IPA
- U+00F9 ù Latin small letter U with grave Ll L
- html ù
- sgml ù
- * French, Italian
- U+00FA ú Latin small letter U with acute Ll L
- html ú
- sgml ú
- U+00FB û Latin small letter U with circumflex Ll L
- html û
- sgml û
- U+00FC ü Latin small letter U with diaeresis Ll L
- html ü
- sgml ü
- U+00FD ý Latin small letter Y with acute Ll L
- html ý
- sgml ý
- * Czech, Slovak, Icelandic, Faroese, Welsh, Malagasy
- U+00FE þ Latin small letter thorn Ll L
- html þ
- sgml þ
- * Icelandic, Old English, phonetics
- * Runic letter borrowed into Latin script
- ref U+16A6 ᚦ Runic letter thurisaz thurs thorn (Runic)
- U+00FF ÿ Latin small letter Y with diaeresis Ll L
- html ÿ
- sgml ÿ
- * French
- ref U+0178 Ÿ Latin capital letter Y with diaeresis (Latin Extended A)
http://unicode.org
Some prose may have been lifted verbatim from unicode.org,
as is permitted by their terms of use at http://www.unicode.org/copyright.html