These are a little tricky, and might break some quick-and-dirty
implementations. But they are parsed correctly by implementations
based on libHTML-930106.tar.Z, available from the WWW code archives.
These constructs are not recommended.
Document Structure
The tags for this element have spaces in them.
Another H4 Just in case it missed the close tag with spaces
Header Elements
Body Elements
Character reference: 'SPACE;' and È.
And character from data: &, and from markup: &.
And-hash from data: and from markup &#.
Less-thans as data: < <1 <-)
Less-than-slash as data: 1 -)
greater-than (pretty much always data): > abc> 0>
entity reference terminated by start tag open: <
character reference terminated by start tag open: >
comment: The sample
implementation groks.
comment w/space between -- and >:
marked section close without mdc: ]].
processing instruction: The sample implementation
treats it as a processing instrcution, so you don't see it.
Anchors
spaces around '='
single quoted value
character references
and entity references in attribute value literal
quotes in
attribute value literal
ISO Latin 1 Entities in HTML
This is a machine-translation of "ISO 8879:1986//ENTITIES Added Latin 1//EN"
to an HTML definition list.
connolly@convex.com
- Æ
- capital AE diphthong (ligature)
- Á
- capital A, acute accent
- Â
- capital A, circumflex accent
- À
- capital A, grave accent
- Å
- capital A, ring
- Ã
- capital A, tilde
- Ä
- capital A, dieresis or umlaut mark
- Ç
- capital C, cedilla
- Ð
- capital Eth, Icelandic
- É
- capital E, acute accent
- Ê
- capital E, circumflex accent
- È
- capital E, grave accent
- Ë
- capital E, dieresis or umlaut mark
- Í
- capital I, acute accent
- Î
- capital I, circumflex accent
- Ì
- capital I, grave accent
- Ï
- capital I, dieresis or umlaut mark
- Ñ
- capital N, tilde
- Ó
- capital O, acute accent
- Ô
- capital O, circumflex accent
- Ò
- capital O, grave accent
- Ø
- capital O, slash
- Õ
- capital O, tilde
- Ö
- capital O, dieresis or umlaut mark
- Þ
- capital THORN, Icelandic
- Ú
- capital U, acute accent
- Û
- capital U, circumflex accent
- Ù
- capital U, grave accent
- Ü
- capital U, dieresis or umlaut mark
- Ý
- capital Y, acute accent
- á
- small a, acute accent
- â
- small a, circumflex accent
- æ
- small ae diphthong (ligature)
- à
- small a, grave accent
- å
- small a, ring
- ã
- small a, tilde
- ä
- small a, dieresis or umlaut mark
- ç
- small c, cedilla
- é
- small e, acute accent
- ê
- small e, circumflex accent
- è
- small e, grave accent
- ð
- small eth, Icelandic
- ë
- small e, dieresis or umlaut mark
- í
- small i, acute accent
- î
- small i, circumflex accent
- ì
- small i, grave accent
- ï
- small i, dieresis or umlaut mark
- ñ
- small n, tilde
- ó
- small o, acute accent
- ô
- small o, circumflex accent
- ò
- small o, grave accent
- ø
- small o, slash
- õ
- small o, tilde
- ö
- small o, dieresis or umlaut mark
- ß
- small sharp s, German (sz ligature)
- þ
- small thorn, Icelandic
- ú
- small u, acute accent
- û
- small u, circumflex accent
- ù
- small u, grave accent
- ü
- small u, dieresis or umlaut mark
- ý
- small y, acute accent