Description Code Entity name =================================== ============ ============== quotation mark " --> " " --> " ampersand & --> & & --> & less-than sign < --> < < --> < greater-than sign > --> > > --> > Description Char Code Entity name =================================== ==== ============ ============== euro sign &128; --> undefined &129; --> single low-9 quotation mark &130; --> latin small letter f with hook &131; --> double low-9 quotation mark &132; --> horizontal ellipsis &133; --> dagger &134; --> double dagger &135; --> modifier letter circumflex accent &136; --> per mille sign &137; --> latin capital letter s with caron &138; --> single left-pointing angle quote mark &139; --> latin capital ligature oe &140; --> undefined &141; --> latin capital letter z with caron &142; --> undefined &143; --> undefined &144; --> left single quotation mark &145; --> right single quotation mark &146; --> left double quotation mark &147; --> right double quotation mark &148; --> bullet &149; --> en dash &150; --> em dash &151; --> small tilde &152; --> trade mark sign &153; --> latin small letter s with caron &154; --> single right-pointing angle quote mark &155; --> latin small ligature oe &156; --> undefined &157; --> latin small letter z with caron &158; --> latin capital letter y with diaeresis &159; --> non-breaking space   --> --> inverted exclamation ¡ --> ¡ ¡ --> ¡ cent sign ¢ --> ¢ ¢ --> ¢ pound sterling £ --> £ £ --> £ general currency sign ¤ --> ¤ ¤ --> ¤ yen sign ¥ --> ¥ ¥ --> ¥ broken vertical bar ¦ --> ¦ ¦ --> ¦ Non-standard &brkbar; --> &brkbar; section sign § --> § § --> § umlaut (dieresis) ¨ --> ¨ ¨ --> ¨ Non-standard ¨ --> ¨ copyright © --> © © --> © feminine ordinal ª --> ª ª --> ª left angle quote, guillemotleft « --> « « --> « not sign ¬ --> ¬ ¬ --> ¬ soft hyphen ­ --> ­ --> registered trademark ® --> ® ® --> ® macron accent ¯ --> ¯ ¯ --> ¯ Non-standard &hibar; --> &hibar; degree sign ° --> ° ° --> ° plus or minus ± --> ± ± --> ± superscript two ² --> ² ² --> ² superscript three ³ --> ³ ³ --> ³ acute accent ´ --> ´ ´ --> ´ micro sign µ --> µ µ --> µ paragraph sign ¶ --> ¶ ¶ --> ¶ middle dot · --> · · --> · cedilla ¸ --> ¸ ¸ --> ¸ superscript one ¹ --> ¹ ¹ --> ¹ masculine ordinal º --> º º --> º right angle quote, guillemotright » --> » » --> » fraction one-fourth ¼ --> ¼ ¼ --> ¼ fraction one-half ½ --> ½ ½ --> ½ fraction three-fourths ¾ --> ¾ ¾ --> ¾ inverted question mark ¿ --> ¿ ¿ --> ¿ capital A, grave accent À --> À À --> À capital A, acute accent Á --> Á Á --> Á capital A, circumflex accent  -->   -->  capital A, tilde à --> à à --> à capital A, dieresis or umlaut mark Ä --> Ä Ä --> Ä capital A, ring Å --> Å Å --> Å capital AE diphthong (ligature) Æ --> Æ Æ --> Æ capital C, cedilla Ç --> Ç Ç --> Ç capital E, grave accent È --> È È --> È capital E, acute accent É --> É É --> É capital E, circumflex accent Ê --> Ê Ê --> Ê capital E, dieresis or umlaut mark Ë --> Ë Ë --> Ë capital I, grave accent Ì --> Ì Ì --> Ì capital I, acute accent Í --> Í Í --> Í capital I, circumflex accent Î --> Î Î --> Î capital I, dieresis or umlaut mark Ï --> Ï Ï --> Ï capital Eth, Icelandic Ð --> Ð Ð --> Ð Non-standard Đ --> Đ capital N, tilde Ñ --> Ñ Ñ --> Ñ capital O, grave accent Ò --> Ò Ò --> Ò capital O, acute accent Ó --> Ó Ó --> Ó capital O, circumflex accent Ô --> Ô Ô --> Ô capital O, tilde Õ --> Õ Õ --> Õ capital O, dieresis or umlaut mark Ö --> Ö Ö --> Ö multiply sign × --> × × --> × capital O, slash Ø --> Ø Ø --> Ø capital U, grave accent Ù --> Ù Ù --> Ù capital U, acute accent Ú --> Ú Ú --> Ú capital U, circumflex accent Û --> Û Û --> Û capital U, dieresis or umlaut mark Ü --> Ü Ü --> Ü capital Y, acute accent Ý --> Ý Ý --> Ý capital THORN, Icelandic Þ --> Þ Þ --> Þ small sharp s, German (sz ligature) ß --> ß ß --> ß small a, grave accent à --> à à --> à small a, acute accent á --> á á --> á small a, circumflex accent â --> â â --> â small a, tilde ã --> ã ã --> ã small a, dieresis or umlaut mark ä --> ä ä --> ä small a, ring å --> å å --> å small ae diphthong (ligature) æ --> æ æ --> æ small c, cedilla ç --> ç ç --> ç small e, grave accent è --> è è --> è small e, acute accent é --> é é --> é small e, circumflex accent ê --> ê ê --> ê small e, dieresis or umlaut mark ë --> ë ë --> ë small i, grave accent ì --> ì ì --> ì small i, acute accent í --> í í --> í small i, circumflex accent î --> î î --> î small i, dieresis or umlaut mark ï --> ï ï --> ï small eth, Icelandic ð --> ð ð --> ð small n, tilde ñ --> ñ ñ --> ñ small o, grave accent ò --> ò ò --> ò small o, acute accent ó --> ó ó --> ó small o, circumflex accent ô --> ô ô --> ô small o, tilde õ --> õ õ --> õ small o, dieresis or umlaut mark ö --> ö ö --> ö division sign ÷ --> ÷ ÷ --> ÷ small o, slash ø --> ø ø --> ø small u, grave accent ù --> ù ù --> ù small u, acute accent ú --> ú ú --> ú small u, circumflex accent û --> û û --> û small u, dieresis or umlaut mark ü --> ü ü --> ü small y, acute accent ý --> ý ý --> ý small thorn, Icelandic þ --> þ þ --> þ small y, dieresis or umlaut mark ÿ --> ÿ ÿ --> ÿ
-->
"what your browser does with it"
-->
"what your browser does with it"
÷ -->
÷
" in the 4th column, this means your browser
doesn't know about the entity name "divide" and just puts it
literally.
This table grew out of an overview of the "ISO
Latin-1 Character Set" overview related to the Hyper-G Text Format
(HTF).
The entity names &brkbar;
and Đ
seem to be unique to HTF.
The entity name &hibar;
has been supported by X Mosaic
but seems to be replaced with ¯
.
The entity names ¨
and ¨
should
be equivalent.
The standards stuff:
The
HTML 2.0 Standard
includes a section on
Character Entity Sets
and an overview on the
HTML Coded Character Set
(The entity names are derived from ISO 8879).
Or have a look at the
Latin-1 Character Entities
as listed in an draft for the
HTML 3.0 specification.
The
Appendix II
of CERN's
HTML+ Discussion Document
contains a
table
(in PostScript format) of the proposed character entities for HTML+ and their
corresponding character codes for Unicode and the Adobe Latin-1 & Symbol
character sets.
Please note that there is nothing wrong with using characters of ISO Latin-1 above 127: the normal transmission protocol for the WWW, HTTP/1.0, uses the 8bit ISO latin-1 as default encoding. (Thanks to Roman Czyborra for pointing this out!)
Other information:
server/ddx/sun/Compose.list
.