This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.
E.g. "℁" (U+2101) maps to "a/s". This means "http://ex℁ample℁" becomes "http://exa/samplea/s", except the parsed host is "exa/samplea/s", whereas re-parsing the serialized URL would give "exa"... We should probably fail host parsing if the output contains any label with "/" as a code point, presumably by further overriding the IDNA2003 ToASCII algorithm. Other code points that would change re-parsing and would need to be added: ":", "\", "?", "#". Source: http://krijnhoetmer.nl/irc-logs/whatwg/20130815#l-327
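A minimal sketch of the check proposed above, in Python. It assumes NFKC normalization from the standard library is a close enough stand-in for the IDNA2003 mapping step to reproduce the "℁" case. The names `FORBIDDEN_AFTER_MAPPING`, `map_host` and `check_host` are hypothetical and do not come from any specification or implementation.

```python
import unicodedata

# Code points named in this report as changing how the URL would re-parse.
FORBIDDEN_AFTER_MAPPING = set("/:\\?#")


def map_host(host: str) -> str:
    """Approximate the IDNA2003 mapping step with NFKC (an assumption)."""
    return unicodedata.normalize("NFKC", host)


def check_host(host: str) -> str:
    """Return the mapped host, or raise if mapping introduced a delimiter."""
    mapped = map_host(host)
    bad = sorted(FORBIDDEN_AFTER_MAPPING.intersection(mapped))
    if bad:
        raise ValueError(f"host maps to {mapped!r}, which contains {bad}")
    return mapped


if __name__ == "__main__":
    print(map_host("\u2101"))  # U+2101 decomposes to "a/s"
    try:
        check_host("ex\u2101ample")
    except ValueError as err:
        print("rejected:", err)
```

The check runs on the mapped output rather than the input, since the input itself contains none of the delimiters.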
There is no need to "override" the algorithm. IDNA2003 already includes a flag for that purpose, "UseSTD3ASCIIRules"; see section 4 of RFC 3490.
It does, but that flag excludes far more code points than implementations do and is not compatible with the web. E.g. "_" (U+005F) must not be excluded.
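To illustrate the mismatch, here is a rough Python sketch of the UseSTD3ASCIIRules label check as described in RFC 3490: no ASCII code points other than letters, digits and hyphen, and no leading or trailing hyphen. It is compared against the narrower set of delimiters named in this report. The names `violates_std3` and `URL_DELIMITERS` are hypothetical.

```python
# Delimiters named in this report as affecting URL re-parsing.
URL_DELIMITERS = set("/:\\?#")


def violates_std3(label: str) -> bool:
    """True if the label would fail the STD3 ASCII rules (sketch)."""
    # Any ASCII code point that is not a letter, digit or hyphen is rejected.
    has_non_ldh = any(
        ord(ch) < 0x80 and not (ch.isalnum() or ch == "-") for ch in label
    )
    # Leading or trailing hyphens are rejected as well.
    return has_non_ldh or label.startswith("-") or label.endswith("-")


if __name__ == "__main__":
    for label in ("example", "my_host", "exa/sample"):
        delimiter_hit = bool(URL_DELIMITERS.intersection(label))
        print(label, "STD3 violation:", violates_std3(label),
              "delimiter present:", delimiter_hit)
```

A label such as "my_host" fails the STD3 check even though it contains none of the delimiters, which is the over-exclusion described above.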
https://github.com/whatwg/url/commit/81cdd6704ea695e1619e76794227d2c9d10d2aa7
https://github.com/whatwg/url/commit/0eaf28c5ae63b5b0487cce484f3ce201e0d98494