Internationalization (at WWW2002) - slide "QA: Validator"

Updated to take character encoding into accout:

Correct character encoding labeling and detection is a prerequesite for validation (and all other operations!)
Works now for encodings such as iso-2022-jp, Shift_JIS,... (the "<" byte is used for the '<' character and other things)
Follows standard priorities (example)
Converts bytes to characters and fails if this is not possible (more examples)
Cannot catch all problems automatically, because some encodings are very similar, but allows trial-and-error with visual (or audio) checking
To be done: Support UTF-16