The form invokes a generic XSLT service that takes a stylesheet and a source document and returns the result.
dcq-test1.html has a number of examples from the DC spec.
Try the tidy service if you have HTML that isn't well-formed (i.e. if it isn't XHTML). For example, the ADAM page isn't well-formed, but the result of running it through tidy is.
I wrote the guts of dc-extract.xsl on my Palm Pilot over drinks with Eric Miller and Dan Brickley in Amsterdam after WWW9, in an effort to show them how easy it is to use XSLT to extract RDF from real-world data.
This version (in testing) lowercases property names correctly (we want dc:title, not dc:Title) and 'dumbs down' constructs such as <meta name="DC.Date.modified" content="2002-11-15" /> to simple RDF properties.
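To make the lowercasing and dumbing-down concrete, here is a rough sketch of the mapping in Python. It is not the stylesheet itself (dc-extract.xsl is XSLT), and the names dumb_down, DCMetaParser and extract_dc are made up for illustration. It assumes that 'dumbing down' means keeping only the element name after the DC. prefix and dropping any qualifier, so DC.Date.modified becomes dc:date; the real stylesheet may treat some cases differently.

```python
from html.parser import HTMLParser


def dumb_down(meta_name):
    """Map a meta name such as 'DC.Date.modified' to 'dc:date'.

    Assumption: the qualifier ('modified') is simply dropped and the
    element name is lowercased. Returns None for non-DC names.
    """
    parts = meta_name.split(".")
    if len(parts) < 2 or parts[0].lower() != "dc" or not parts[1]:
        return None
    return "dc:" + parts[1].lower()


class DCMetaParser(HTMLParser):
    """Collects (property, value) pairs from <meta name="DC..." content="...">."""

    def __init__(self):
        super().__init__()
        self.properties = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser also routes self-closing tags (<meta ... />) here.
        if tag != "meta":
            return
        attr_map = dict(attrs)
        prop = dumb_down(attr_map.get("name") or "")
        content = attr_map.get("content")
        if prop is not None and content is not None:
            self.properties.append((prop, content))


def extract_dc(html):
    """Return the simple DC properties found in an HTML string."""
    parser = DCMetaParser()
    parser.feed(html)
    parser.close()
    return parser.properties


if __name__ == "__main__":
    sample = '<meta name="DC.Date.modified" content="2002-11-15" />'
    for prop, value in extract_dc(sample):
        print(prop, value)
```

Unlike an XSLT processor, Python's html.parser does not require well-formed input, so this sketch skips the tidy step that the real service needs.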