Relationships between Semantic Web and Digital
Libraries
Examples
Future Directions
Lessons Learned
Where we're going from here
Open Discussion
The Semantic Web: What is it?
Many things to many people...
The Semantic Web
An interesting bed-time story...
The Semantic Web: The original web realized
The original proposal of the WWW from 1989 included a
figure showing how information about a Web of relationships
amongst named objects could unify a number of information
management tasks.
The Semantic Web - A Simple Extension to the Current
Web
Web is a set of Resources and Links
Resources and Links are identified by URI's
Uniform Resource Identifiers
Resources may be strongly typed
To a user, this will become even a more exciting
world
Machines will be able to consume machine-readable
information, better enabling computers and people to
work, learn and exchange knowledge more
effectively
Making it cost-effective for people to effectively record
their knowledge.
Focus on machine consumption.
"The Semantic Web is an extension of the
current web in which information is given well-defined
meaning, better enabling computers and people to work in
cooperation." -- Tim Berners-Lee, James Hendler,
Ora Lassila, The Semantic Web
"The bane of my existence is doing things that
I know the computer could do for me." -- Dan
Connolly, The XML Revolution
Ultimate goal - the design of enabling technologies to
support machine facilitated global knowledge exchange
Clarification of design, goals, implementation
guidelines
Position papers, Articles, Presentations, Tutorials,
Tools
W3C Semantic Web Activity - Approach to Deployment
Establish guiding architectural principles
Facilitate the design of enabling standards and
technologies
Focus on short term deployment
Eye toward longer term research issues
Understand policy implications
Foster an environment for effective cooperation and
collaboration
A lot of hard work
And a bit of luck
Semantic Web Principles
Everything Identifiable is on the Semantic Web
All resources have identy
People, places, and things in the physical world will
have online representations identified by Uniform Resource
Identifiers.
URI's facilitate effective integration, active
participation and be contextualized in the Semantic Web.
Semantic Web Principles
Partial Information
The Web is unbounded.
The original design of the Web differed from
traditional hypertext systems in sacrificing link
integrity for scalability.
The Semantic Web it unbounded.
Anyone can say anything about anything.
There will always be more to discover.
Semantic Web Principles
Web of Trust
All statements found on the Web occur in some
context.
Applications need this context in order to evaluate the
trustworthiness of the statements.
The machinery of the Semantic Web does not assert that
all statements found on the Web are "true".
Truth - or more pragmatically, trustworthiness - is
evaluated by each application that processes the information
found on the Web.
Semantic Web Principles
Evolution
The Semantic Web must allow the independent work of
diverse communities to be combined effectively.
Support the ability to add new information without
insisting that the old be modified.
Provide communities the ability to resolve ambiguities
and clarify inconsistencies.
Semantic Web must be based on descriptive conventions
that can expand as human understanding expands.
Semantic Web Principles
Minimalist Design
Make the simple things simple, and the complex things
possible.
Standardize nor more than is necessary.
More than the sum of the parts
Semantic Web
Building on proven ideas
Combines RDF, XML, Xlink, hypertext and metadata
approaches to linked data
Enable simple applications now that plan for future
complexity (eg. Dublin Core, RSS, PRISM, MusicBrainz)
Focussed on general principles of Web automation and data
aggregation
Architectural aims
Define a convention for applications that exchange
metadata on the Web
XML used for "serialization syntax"
Enable vocabulary semantics to be defined by communities
of expertise, not W3C
Provide for the fine-grained mixing of diverse
metadata
Layer Cake
RDF Core Working Group
RDF provides a common framework for representing metadata
across many applications.
Implementor feedback concerning the RDF Model and Syntax
Recommendation points to the need for a number of fixes,
clarifications and improvements to the specification of RDF's
abstract model and XML syntax.
Also tasked to complete the work on RDF vocabulary
description present in the RDF Schema Candidate
Recommendation.
Provide an account of the relationships between the basic
components of RDF (Model, Syntax, Schema) and the larger XML
family of recommendations.
detailed technical discussion of all approaches to
the use of classical logic on the Web for the
representation of data such as inference rules,
ontologies, and complex schemata.
Annotations are stored on annotation servers as metadata
and presented to the user by a client capable of
understanding this metadata and capable of interacting with
an annotation server with the HTTP service protocol.
Clients include editor/browser (Amaya) and browser
plug-ins (Annozilla)
HTTP-based RDF store
Testbed for other collaboration support tools (e.g. peer
review, web of trust, recomender services, WAI, etc.)
W3C Team members: Marja-Riitta Koivunen, Ralph Swick,
Jose Kahan, Eric Prud'hommeaux
Advanced Development - Workflow
Scenario (a): W3T requirement to automate creation,
maintenance of W3C TR Page. Solution: W3C Working Group
Chairs make assertions that a document is at a particular
status by submitting announcement to list.
Advanced Development - Workflow (Announce, Org, Contact
Merge)
Scenario (b): W3C AC Representatives need to identify
status of all documents whose parent organization has sponsored
working group members. Solution (a): Document
information associated with a particular working group is
merged with W3C Contact information.
Formalized means for expressing simple vocabularies
"These ambiguities, redundancies, and deficiencies recall
those attributed by Dr. Franz Kuhn to a certain Chinese
encyclopedia entitled Celestial Emporium of Benevolent
Knowledge. On those remote pages it is written that animals are
divided into (a) those that belong to the Emperor, (b) embalmed
ones, (c) those that are trained, (d) suckling pigs, (e)
mermaids, (f) fabulous ones, (g) stray dogs, (h) those that are
included in this classification, (i) those that tremble as if
they were mad, (j) innumerable ones, (k) those drawn with a
very fine camel's hair brush, (l) others, (m) those that have
just broken a flower vase, (n) those that resemble flies from a
distance." -- Essay: "The Analytical Language of John
Wilkins", in La Nación, 8 February 1942