HTML Extremes

The constructs below are a little tricky and might break some quick-and-dirty implementations, but implementations based on libHTML-930106.tar.Z, available from the WWW code archives, parse them correctly. These constructs are not recommended.

Document Structure

The tags for this element have spaces in them.

Another H4

Just in case it missed the close tag with spaces

Header Elements

Body Elements

Delimiter Recognition

Character reference: '&#SPACE;' and È. And character from data: &, and from markup: &.

And-hash from data: &# and from markup &#.
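As a rough illustration of the delimiter rules above, here is a minimal sketch using Python's standard `html.unescape`. It follows HTML5 rules rather than the SGML rules this page targets, so treat it as an approximation. The helper name `decode` and the sample strings are assumptions, not part of the original test.

```python
import html


def decode(text: str) -> str:
    """Resolve character and entity references using HTML5 rules."""
    return html.unescape(text)


# Numeric character reference: code point 200 is È.
print(decode("&#200;"))

# "&" written in markup as &amp; versus "&" that is already plain data.
print(decode("from markup: &amp; and from data: & ."))

# "&#" not followed by a digit is not a reference, so it stays as data.
print(decode("and-hash: &# ."))
```

An ampersand only starts a reference when it is followed by a name or by `#` and a number; otherwise it passes through as data.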

Less-thans as data: < <1 <-)

Less-than-slash as data:

greater-than (pretty much always data): > abc> 0>

entity reference terminated by start tag open: <

character reference terminated by start tag open: >
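The two cases above, a reference that ends at the `<` of a start tag instead of at a `;`, can be sketched with Python's standard `html.unescape`. The original markup is not visible in this rendering, so the input strings below are assumptions. The decoder applies HTML5 rules, which happen to accept these references without the semicolon.

```python
import html

# Hypothetical inputs: a reference with no closing ";" that runs
# straight into the "<" of a start tag.
entity_then_tag = "&lt<b>bold</b>"
charref_then_tag = "&#62<b>bold</b>"

# The reference is resolved; the tag text that follows is untouched.
print(html.unescape(entity_then_tag))   # <<b>bold</b>
print(html.unescape(charref_then_tag))  # ><b>bold</b>
```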

comment: The sample implementation groks.

comment w/space between -- and >:

marked section close without mdc: ]].

processing instruction: The sample implementation treats it as a processing instruction, so you don't see it.

Anchors

spaces around '='

single quoted value

character references and entity references in attribute value literal

quotes in attribute value literal
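The four anchor cases above can be tried against a tolerant parser. This is a minimal sketch using Python's standard `html.parser.HTMLParser`, not the sample implementation mentioned earlier. The class `AttrCollector`, the helper `parse_attrs`, and the sample tags are assumptions made for illustration.

```python
from html.parser import HTMLParser


class AttrCollector(HTMLParser):
    """Record each start tag together with its decoded attributes."""

    def __init__(self):
        super().__init__()
        self.seen = []

    def handle_starttag(self, tag, attrs):
        # attrs arrives as (name, value) pairs with quotes removed
        # and references in the value already resolved.
        self.seen.append((tag, dict(attrs)))


def parse_attrs(markup: str):
    parser = AttrCollector()
    parser.feed(markup)
    parser.close()
    return parser.seen


samples = [
    '<a href = "foo.html">',     # spaces around '='
    "<a name='single'>",         # single quoted value
    '<a href="a&amp;b&#200;">',  # references inside the value
    "<a name='say \"hi\"'>",     # double quotes inside a single quoted value
]
for markup in samples:
    print(markup, "->", parse_attrs(markup))
```

Tag and attribute names come back lowercased, so `HREF` and `href` are treated alike.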

ISO Latin 1 Entities in HTML

This is a machine translation of "ISO 8879:1986//ENTITIES Added Latin 1//EN" to an HTML definition list.
connolly@convex.com
Æ
capital AE diphthong (ligature)
Á
capital A, acute accent
Â
capital A, circumflex accent
À
capital A, grave accent
Å
capital A, ring
Ã
capital A, tilde
Ä
capital A, dieresis or umlaut mark
Ç
capital C, cedilla
Ð
capital Eth, Icelandic
É
capital E, acute accent
Ê
capital E, circumflex accent
È
capital E, grave accent
Ë
capital E, dieresis or umlaut mark
Í
capital I, acute accent
Î
capital I, circumflex accent
Ì
capital I, grave accent
Ï
capital I, dieresis or umlaut mark
Ñ
capital N, tilde
Ó
capital O, acute accent
Ô
capital O, circumflex accent
Ò
capital O, grave accent
Ø
capital O, slash
Õ
capital O, tilde
Ö
capital O, dieresis or umlaut mark
Þ
capital THORN, Icelandic
Ú
capital U, acute accent
Û
capital U, circumflex accent
Ù
capital U, grave accent
Ü
capital U, dieresis or umlaut mark
Ý
capital Y, acute accent
á
small a, acute accent
â
small a, circumflex accent
æ
small ae diphthong (ligature)
à
small a, grave accent
å
small a, ring
ã
small a, tilde
ä
small a, dieresis or umlaut mark
ç
small c, cedilla
é
small e, acute accent
ê
small e, circumflex accent
è
small e, grave accent
ð
small eth, Icelandic
ë
small e, dieresis or umlaut mark
í
small i, acute accent
î
small i, circumflex accent
ì
small i, grave accent
ï
small i, dieresis or umlaut mark
ñ
small n, tilde
ó
small o, acute accent
ô
small o, circumflex accent
ò
small o, grave accent
ø
small o, slash
õ
small o, tilde
ö
small o, dieresis or umlaut mark
ß
small sharp s, German (sz ligature)
þ
small thorn, Icelandic
ú
small u, acute accent
û
small u, circumflex accent
ù
small u, grave accent
ü
small u, dieresis or umlaut mark
ý
small y, acute accent
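The list above shows each character after rendering, so the entity names themselves are not visible here. As a sketch, the mapping from name to character can be looked up in Python's standard `html.entities.name2codepoint` table. The names used below (`AElig`, `Eacute`, `szlig`, `thorn`) are the usual ISO 8879 Added Latin 1 names, and the helper `entity_char` is hypothetical.

```python
from html.entities import name2codepoint


def entity_char(name: str) -> str:
    """Return the character for a named entity, e.g. 'AElig' -> 'Æ'."""
    return chr(name2codepoint[name])


# Entity names are case-sensitive: THORN and thorn are different characters.
for name in ("AElig", "Eacute", "szlig", "THORN", "thorn"):
    print(f"&{name}; -> {entity_char(name)}")
```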