We started doing this to see how some seemingly academic topics shake out when we
have a real live Web of public RDF data files.
- provenance: RDF stores need to keep track of where they found these files
- open world: descriptions are scattered, incomplete, partial
- identity reasoning: tools need to automatically figure out when two files talk
about the same thing