summaryrefslogtreecommitdiffstats
path: root/test/iso-8859-1a.html
diff options
context:
space:
mode:
Diffstat (limited to 'test/iso-8859-1a.html')
-rw-r--r--test/iso-8859-1a.html275
1 files changed, 275 insertions, 0 deletions
diff --git a/test/iso-8859-1a.html b/test/iso-8859-1a.html
new file mode 100644
index 0000000..972329d
--- /dev/null
+++ b/test/iso-8859-1a.html
@@ -0,0 +1,275 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
+<!-- X-URL: http://www.ramsch.org/martin/uni/fmi-hp/iso8859-1.html -->
+<!-- Date: Tue, 28 Dec 2004 20:24:09 GMT -->
+<!-- Last-Modified: Mon, 15 May 2000 09:37:37 GMT -->
+<HTML>
+<HEAD>
+<TITLE>Martin Ramsch - iso8859-1 table</TITLE>
+<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
+<BASE HREF="http://www.ramsch.org/martin/uni/fmi-hp/iso8859-1.html">
+</HEAD>
+
+<BODY>
+
+<H1 ALIGN=center>iso8859-1 table, with cp-1252</H1>
+
+<PRE>
+Description Code Entity name
+=================================== ============ ==============
+quotation mark &amp;#34; --> &#34; &amp;quot; --> &quot;
+ampersand &amp;#38; --> &#38; &amp;amp; --> &amp;
+less-than sign &amp;#60; --> &#60; &amp;lt; --> &lt;
+greater-than sign &amp;#62; --> &#62; &amp;gt; --> &gt;
+
+Description Char Code Entity name
+=================================== ==== ============ ==============
+euro sign &amp;128; --> &#128;
+undefined &amp;129; --> &#129;
+single low-9 quotation mark &amp;130; --> &#130;
+latin small letter f with hook &amp;131; --> &#131;
+double low-9 quotation mark &amp;132; --> &#132;
+horizontal ellipsis &amp;133; --> &#133;
+dagger &amp;134; --> &#134;
+double dagger &amp;135; --> &#135;
+modifier letter circumflex accent &amp;136; --> &#136;
+per mille sign &amp;137; --> &#137;
+latin capital letter s with caron &amp;138; --> &#138;
+single left-pointing angle quote mark &amp;139; --> &#139;
+latin capital ligature oe &amp;140; --> &#140;
+undefined &amp;141; --> &#141;
+latin capital letter z with caron &amp;142; --> &#142;
+undefined &amp;143; --> &#143;
+
+undefined &amp;144; --> &#144;
+left single quotation mark &amp;145; --> &#145;
+right single quotation mark &amp;146; --> &#146;
+left double quotation mark &amp;147; --> &#147;
+right double quotation mark &amp;148; --> &#148;
+bullet &amp;149; --> &#149;
+en dash &amp;150; --> &#150;
+em dash &amp;151; --> &#151;
+small tilde &amp;152; --> &#152;
+trade mark sign &amp;153; --> &#153;
+latin small letter s with caron &amp;154; --> &#154;
+single right-pointing angle quote mark &amp;155; --> &#155;
+latin small ligature oe &amp;156; --> &#156;
+undefined &amp;157; --> &#157;
+latin small letter z with caron &amp;158; --> &#158;
+latin capital letter y with diaeresis &amp;159; --> &#159;
+
+non-breaking space &amp;#160; --> &#160; &amp;nbsp; --> &nbsp;
+inverted exclamation &amp;#161; --> &#161; &amp;iexcl; --> &iexcl;
+cent sign &amp;#162; --> &#162; &amp;cent; --> &cent;
+pound sterling &amp;#163; --> &#163; &amp;pound; --> &pound;
+general currency sign &amp;#164; --> &#164; &amp;curren; --> &curren;
+yen sign &amp;#165; --> &#165; &amp;yen; --> &yen;
+broken vertical bar &amp;#166; --> &#166; &amp;brvbar; --> &brvbar;
+ Non-standard &amp;brkbar; --> &brkbar;
+section sign &amp;#167; --> &#167; &amp;sect; --> &sect;
+umlaut (dieresis) &amp;#168; --> &#168; &amp;uml; --> &uml;
+ Non-standard &amp;die; --> &die;
+copyright &amp;#169; --> &#169; &amp;copy; --> &copy;
+feminine ordinal &amp;#170; --> &#170; &amp;ordf; --> &ordf;
+left angle quote, guillemotleft &amp;#171; --> &#171; &amp;laquo; --> &laquo;
+not sign &amp;#172; --> &#172; &amp;not; --> &not;
+soft hyphen &amp;#173; --> &#173; &amp;shy; --> &shy;
+registered trademark &amp;#174; --> &#174; &amp;reg; --> &reg;
+macron accent &amp;#175; --> &#175; &amp;macr; --> &macr;
+ Non-standard &amp;hibar; --> &hibar;
+degree sign &amp;#176; --> &#176; &amp;deg; --> &deg;
+plus or minus &amp;#177; --> &#177; &amp;plusmn; --> &plusmn;
+superscript two &amp;#178; --> &#178; &amp;sup2; --> &sup2;
+superscript three &amp;#179; --> &#179; &amp;sup3; --> &sup3;
+acute accent &amp;#180; --> &#180; &amp;acute; --> &acute;
+micro sign &amp;#181; --> &#181; &amp;micro; --> &micro;
+paragraph sign &amp;#182; --> &#182; &amp;para; --> &para;
+middle dot &amp;#183; --> &#183; &amp;middot; --> &middot;
+cedilla &amp;#184; --> &#184; &amp;cedil; --> &cedil;
+superscript one &amp;#185; --> &#185; &amp;sup1; --> &sup1;
+masculine ordinal &amp;#186; --> &#186; &amp;ordm; --> &ordm;
+right angle quote, guillemotright &amp;#187; --> &#187; &amp;raquo; --> &raquo;
+fraction one-fourth &amp;#188; --> &#188; &amp;frac14; --> &frac14;
+fraction one-half &amp;#189; --> &#189; &amp;frac12; --> &frac12;
+fraction three-fourths &amp;#190; --> &#190; &amp;frac34; --> &frac34;
+inverted question mark &amp;#191; --> &#191; &amp;iquest; --> &iquest;
+capital A, grave accent &amp;#192; --> &#192; &amp;Agrave; --> &Agrave;
+capital A, acute accent &amp;#193; --> &#193; &amp;Aacute; --> &Aacute;
+capital A, circumflex accent &amp;#194; --> &#194; &amp;Acirc; --> &Acirc;
+capital A, tilde &amp;#195; --> &#195; &amp;Atilde; --> &Atilde;
+capital A, dieresis or umlaut mark &amp;#196; --> &#196; &amp;Auml; --> &Auml;
+capital A, ring &amp;#197; --> &#197; &amp;Aring; --> &Aring;
+capital AE diphthong (ligature) &amp;#198; --> &#198; &amp;AElig; --> &AElig;
+capital C, cedilla &amp;#199; --> &#199; &amp;Ccedil; --> &Ccedil;
+capital E, grave accent &amp;#200; --> &#200; &amp;Egrave; --> &Egrave;
+capital E, acute accent &amp;#201; --> &#201; &amp;Eacute; --> &Eacute;
+capital E, circumflex accent &amp;#202; --> &#202; &amp;Ecirc; --> &Ecirc;
+capital E, dieresis or umlaut mark &amp;#203; --> &#203; &amp;Euml; --> &Euml;
+capital I, grave accent &amp;#204; --> &#204; &amp;Igrave; --> &Igrave;
+capital I, acute accent &amp;#205; --> &#205; &amp;Iacute; --> &Iacute;
+capital I, circumflex accent &amp;#206; --> &#206; &amp;Icirc; --> &Icirc;
+capital I, dieresis or umlaut mark &amp;#207; --> &#207; &amp;Iuml; --> &Iuml;
+capital Eth, Icelandic &amp;#208; --> &#208; &amp;ETH; --> &ETH;
+ Non-standard &amp;Dstrok; --> &Dstrok;
+capital N, tilde &amp;#209; --> &#209; &amp;Ntilde; --> &Ntilde;
+capital O, grave accent &amp;#210; --> &#210; &amp;Ograve; --> &Ograve;
+capital O, acute accent &amp;#211; --> &#211; &amp;Oacute; --> &Oacute;
+capital O, circumflex accent &amp;#212; --> &#212; &amp;Ocirc; --> &Ocirc;
+capital O, tilde &amp;#213; --> &#213; &amp;Otilde; --> &Otilde;
+capital O, dieresis or umlaut mark &amp;#214; --> &#214; &amp;Ouml; --> &Ouml;
+multiply sign &amp;#215; --> &#215; &amp;times; --> &times;
+capital O, slash &amp;#216; --> &#216; &amp;Oslash; --> &Oslash;
+capital U, grave accent &amp;#217; --> &#217; &amp;Ugrave; --> &Ugrave;
+capital U, acute accent &amp;#218; --> &#218; &amp;Uacute; --> &Uacute;
+capital U, circumflex accent &amp;#219; --> &#219; &amp;Ucirc; --> &Ucirc;
+capital U, dieresis or umlaut mark &amp;#220; --> &#220; &amp;Uuml; --> &Uuml;
+capital Y, acute accent &amp;#221; --> &#221; &amp;Yacute; --> &Yacute;
+capital THORN, Icelandic &amp;#222; --> &#222; &amp;THORN; --> &THORN;
+small sharp s, German (sz ligature) &amp;#223; --> &#223; &amp;szlig; --> &szlig;
+small a, grave accent &amp;#224; --> &#224; &amp;agrave; --> &agrave;
+small a, acute accent &amp;#225; --> &#225; &amp;aacute; --> &aacute;
+small a, circumflex accent &amp;#226; --> &#226; &amp;acirc; --> &acirc;
+small a, tilde &amp;#227; --> &#227; &amp;atilde; --> &atilde;
+small a, dieresis or umlaut mark &amp;#228; --> &#228; &amp;auml; --> &auml;
+small a, ring &amp;#229; --> &#229; &amp;aring; --> &aring;
+small ae diphthong (ligature) &amp;#230; --> &#230; &amp;aelig; --> &aelig;
+small c, cedilla &amp;#231; --> &#231; &amp;ccedil; --> &ccedil;
+small e, grave accent &amp;#232; --> &#232; &amp;egrave; --> &egrave;
+small e, acute accent &amp;#233; --> &#233; &amp;eacute; --> &eacute;
+small e, circumflex accent &amp;#234; --> &#234; &amp;ecirc; --> &ecirc;
+small e, dieresis or umlaut mark &amp;#235; --> &#235; &amp;euml; --> &euml;
+small i, grave accent &amp;#236; --> &#236; &amp;igrave; --> &igrave;
+small i, acute accent &amp;#237; --> &#237; &amp;iacute; --> &iacute;
+small i, circumflex accent &amp;#238; --> &#238; &amp;icirc; --> &icirc;
+small i, dieresis or umlaut mark &amp;#239; --> &#239; &amp;iuml; --> &iuml;
+small eth, Icelandic &amp;#240; --> &#240; &amp;eth; --> &eth;
+small n, tilde &amp;#241; --> &#241; &amp;ntilde; --> &ntilde;
+small o, grave accent &amp;#242; --> &#242; &amp;ograve; --> &ograve;
+small o, acute accent &amp;#243; --> &#243; &amp;oacute; --> &oacute;
+small o, circumflex accent &amp;#244; --> &#244; &amp;ocirc; --> &ocirc;
+small o, tilde &amp;#245; --> &#245; &amp;otilde; --> &otilde;
+small o, dieresis or umlaut mark &amp;#246; --> &#246; &amp;ouml; --> &ouml;
+division sign &amp;#247; --> &#247; &amp;divide; --> &divide;
+small o, slash &amp;#248; --> &#248; &amp;oslash; --> &oslash;
+small u, grave accent &amp;#249; --> &#249; &amp;ugrave; --> &ugrave;
+small u, acute accent &amp;#250; --> &#250; &amp;uacute; --> &uacute;
+small u, circumflex accent &amp;#251; --> &#251; &amp;ucirc; --> &ucirc;
+small u, dieresis or umlaut mark &amp;#252; --> &#252; &amp;uuml; --> &uuml;
+small y, acute accent &amp;#253; --> &#253; &amp;yacute; --> &yacute;
+small thorn, Icelandic &amp;#254; --> &#254; &amp;thorn; --> &thorn;
+small y, dieresis or umlaut mark &amp;#255; --> &#255; &amp;yuml; --> &yuml;
+</PRE>
+<!-- removed: second /PRE, a hack for HotJava 1.0 preBeta 1 -->
+<HR>
+
+<STRONG>How to read</STRONG> this table. The columns are
+<DL COMPACT>
+<DT>1st:<DD>textual <EM>description</EM> of the character
+<DT>2nd:<DD>character inserted directly into the HTML page as <EM>one
+ byte</EM>
+<DT>3rd:<DD>character written as <EM>numeric HTML entity</EM>, in the
+ format:<BR>"how it looks literally" <CODE>--&gt;</CODE>
+ "what your browser does with it"
+<DT>4th:<DD>character written as <EM>symbolic HTML entity</EM>, in the
+ format:<BR>"how it looks literally" <CODE>--&gt;</CODE>
+ "what your browser does with it"
+</DL>
+
+So for example, if you see something like "<CODE>&amp;divide; -->
+&amp;divide;</CODE>" in the 4th column, this means your browser
+doesn't know about the entity name "divide" and just puts it
+literally.
+
+<P>
+<STRONG>This table</STRONG> grew out of an overview of the "ISO
+Latin-1 Character Set" overview related to the Hyper-G Text Format
+(<A HREF="http://www.hyperwave.de/HTFdoc">HTF</A>).
+
+The entity names <CODE>&amp;brkbar;</CODE> and <CODE>&amp;Dstrok;</CODE>
+seem to be unique to HTF.
+
+The entity name <CODE>&amp;hibar;</CODE> has been supported by X Mosaic
+but seems to be replaced with <CODE>&amp;macr;</CODE>.
+
+The entity names <CODE>&amp;uml;</CODE> and <CODE>&amp;die;</CODE> should
+be equivalent.
+
+<P><STRONG>The standards stuff:</STRONG>
+The
+<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html-spec/">HTML 2.0 Standard</A>
+includes a section on
+<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html-spec/html-spec_9.html#SEC99">Character Entity Sets</A>
+and an overview on the
+<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html-spec/html-spec_13.html#SEC106">HTML Coded Character Set</A>
+(The entity names are derived from <A HREF="http://www.ucc.ie/info/net/isolat1.html">ISO 8879</A>).
+<BR>
+
+Or have a look at the
+<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html3/latin1.html">Latin-1 Character Entities</A>
+as listed in an draft for the
+<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html3/CoverPage.html">HTML 3.0 specification</A>.
+<BR>
+
+The
+<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/HTMLPlus/htmlplus_59.html">Appendix II</A>
+of CERN's
+<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/HTMLPlus/htmlplus_1.html">HTML+ Discussion Document</A>
+contains a
+<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/HTMLPlus/htmlplus_table.ps">table</A>
+(in PostScript format) of the proposed character entities for HTML+ and their
+corresponding character codes for Unicode and the Adobe Latin-1 &amp; Symbol
+character sets.
+<P>
+
+<STRONG>Please note</STRONG> that there is nothing wrong with using
+characters of ISO Latin-1 above 127: the normal transmission protocol
+for the WWW,
+<A HREF="http://www.w3.org/pub/WWW/Protocols/rfc1945/rfc1945">HTTP/1.0</A>,
+uses the 8bit ISO latin-1 as default encoding.
+(Thanks to Roman
+Czyborra for pointing this out!)
+<P>
+
+<STRONG>Other information:</STRONG>
+<UL>
+
+<LI><STRONG>Kevin J. Brewer</STRONG> has done two very good pages on the subject:
+ <UL>
+ <LI><A HREF="http://www.bbsinc.com/iso8859.html">ASCII - ISO 8859-1 (Latin-1) with HTML 3.0 Entities Table</A> and
+ <LI><A HREF="http://www.bbsinc.com/iso8879.html">ISO 8879 Entities Gopher Menu</A>
+ </UL>
+
+<LI>The excellent overview on the series of
+ <A HREF="http://czyborra.com/charsets/iso8859.html">ISO 8859
+ character sets</A> compiled by Roman Czyborra.
+
+<LI>Also have a look on Alan Flavell's page of
+ <A HREF="http://ppewww.ph.gla.ac.uk/%7Eflavell/iso8859/iso8859-pointers.html">pointers
+ to information about ISO8859</A>. It's written very well!
+
+<LI>Maybe also of interest to you is the
+ <A HREF="ftp://ftp.vlsivie.tuwien.ac.at/pub/8bit/FAQ-ISO-8859-1">ISO
+ 8859-1 FAQ</A> by Michael Gschwind
+ (<A HREF="mailto:mike@vlsivie.tuwien.ac.at">mike@vlsivie.tuwien.ac.at</A>),
+ part of his page on
+ <A HREF="http://www.vlsivie.tuwien.ac.at/mike/i18n.html">Internationalization</A>.
+
+<LI>For users of X11R5 on SunOS systems: the
+ <A HREF="Compose.txt">table over the compose combinations</A>
+ (also coded <A HREF="Compose.html">with entities</A> where possible).
+ It's taken from the MIT X sources in
+ <CODE>server/ddx/sun/Compose.list</CODE>.
+
+<LI>Finally you could have a look at
+ <A HREF="ftp://ds.internic.net/rfc/rfc1345.txt">RFC 1345:
+ Character Mnemonics &amp; Character Sets</A>
+ by K. Simonsen (06/11/92, 103 pages, approx. 240 kbyte).
+
+</UL>
+
+
+<HR>
+
+<ADDRESS><A HREF="http://ramsch.home.pages.de/">Martin Ramsch</A>, 16.02.1994, 07.01.1996, 01.07.1996, 1998-10-09, 2000-05-15</ADDRESS>
+
+</BODY>
+</HTML>