Copyright ©2001 W3C® (MIT, INRIA, Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply.
This W3C Working Draft revises the specification of the XML syntax of RDF as originally described in RDF Model & Syntax. This document presents the syntax as amended and clarified by the RDF Core Working Group with the specification now based on the XML Information Set along with mapping rules for creating RDF models as described in the RDF Model Theory W3C Working Draft.
This section describes the status of this document at the time of its publication. Other documents may supersede this document. The latest status of this document series is maintained at the W3C.
This is a W3C Working Draft for the RDF Core Working Group produced as part of the W3C Semantic Web Activity. It incorporates decisions made by the Working Group updating the XML syntax for RDF from the original RDF Model & Syntax ([RDFMS]) document and includes a re-representing of the syntax in terms of the XML Information Set with rules for generation of RDF models.
This document is being released for review by W3C members and other interested parties to encourage feedback and comments, especially with regard to how the changes affect existing implementations. This is the current state of an ongoing work on the syntax and mapping process and may not yet record all of the work in the grammar section of the original document.
This is a draft document and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use it as reference material or to cite as other than "work in progress". A list of current W3C Recommendations and other technical documents can be found at http://www.w3.org/TR/.
Comments on this document are invited and should be sent to the public mailing list www-rdf-comments@w3.org. An archive of comments is available at http://lists.w3.org/Archives/Public/www-rdf-comments/.
1 Introduction
2 An XML syntax for RDF
3 Data Model
3.1 Root Node
3.2 Element Node
3.3 End Element Node
3.4 Attribute Node
3.5 Text Node
3.6 Identifier Node
3.7 Information Set Mapping
3.8 The RDF Namespace
3.9 Identifiers
4 Notation
4.1 Terminology
4.2 Grammar Notation
4.3 Notation Forms
5 RDF/XML Grammar
6 Serialising an RDF Graph to RDF/XML
7 Acknowledgments
8 References
A Issues affecting RDF/XML Syntax (Non-Normative)
A.1 Document Issues / Tasks (Non-Normative)
A.2 RDF Core WG Open Issues affecting RDF/XML Syntax (Non-Normative)
A.3 RDF Core WG Decided Issues affecting RDF/XML Syntax (Non-Normative)
A.4 RDF Core WG Postponed Issues affecting RDF/XML Syntax (Non-Normative)
B Syntax Schemas (Non-Normative)
B.1 RELAX NG Syntax Schema (Non-Normative)
B.2 Other Syntax Schemas (Non-Normative)
C Original Grammar (Non-Normative)
D Updated Grammar after RDF Core decisions (Non-Normative)
E Changes (Non-Normative)
This document describes the XML ([XML]) syntax for RDF as originally defined in the RDF Model & Syntax ([RDFMS]) W3C Recommendation. Subsequent implementations of this syntax and comparison of the resulting RDF models have shown that there was ambiguity - implementations generated different models and certain syntax forms were not widely implemented. These issues were generally made as either feedback to the www-rdf-comments@w3.org (archive) or from discussions on the RDF Interest Group list www-rdf-interest@w3.org (archive) .
The RDF Core Working Group is chartered to respond to the need for a number of fixes, clarifications and improvements to the specification of RDF's abstract model and XML syntax. The working group invites feedback from the developer community on the effects of its proposals on existing implementations and documents.
Several decisions including amendments and deletions to the grammar are referred to below. The definitive record of the decisions is the RDF Core WG issues list.
This document re-represents the original EBNF grammar in terms of the XML Information Set ([INFOSET]) items which moves from the rather low-level details, such as particular forms of empty elements. This allows the grammar to be more precisely recorded and the mapping from the XML syntax to the RDF model more clearly shown. The mapping to the RDF model (a graph) is done by emitting statements in the form defined in the N-Triples section of RDF Test Cases ([RDF-TESTS]) Working Draft which creates an RDF model, that has semantics defined by RDF Model Theory ([RDF-MODEL]) Working Draft.
This document illustrates one way to create triples from the XML - any other method that results in the same RDF graph may be used.
In particular:
The RDF Model Theory ([RDF-MODEL]) provides a formal description of RDF. This can be thought of as a graph consisting of nodes and arcs. The node describe resources that can be labelled with URIs, string literals or are blank. The arcs connect the nodes and are all labelled with URIs. This graph is more precisely called a directed edge-labelled graph; each edge is an arc with a direction (an arrow) connecting two nodes. These edges can be described as triples of subject node, at the blunt end of the arrow/arc, property arc and an object node at the sharp end of the arrow/arc. The property arc is also interpreted as an attribute, relationship or predicate of the resource with a value given by the object node content.
In order to encode the graph in XML, the nodes and arcs are turned into XML elements, attributes, element content and attribute values. The URI labels for properties and object nodes are written in XML via XML Namespaces ([XML-NS]) which gives a namespace URI for a short prefix along with namespace-qualified elements and attributes names called local names. The (namespace URI, local name) pair are chosen such that concatenating them forms the original node URI. The URIs labelling subject nodes are stored in XML attribute values. The nodes labelled by string literals (which are always object nodes) become element text content or attribute values.
This transformation turns paths in the graph of the form
Node, Arc, Node, Arc, Node, Arc, ...
into sequences of elements inside elements. This
results in a striping when the elements are written down;
alternating between node elements and
property elements. The Node at the
start of the sequence is always a subject node and turns into
a containing element called an rdf:Description
that
is written at the top level of RDF/XML, below the XML document element
(in this case rdf:RDF
). So the chains of stripes
start at the top of an RDF/XML document and always begin with nodes.
For example, here is a graph written as ASCII saying "there exists
a document (this one) with a title, RDF/XML Syntax Specification (Revised)" and
"this document has an editor, the editor has a name "Dave Beckett"
and a home page http://purl.org/net/dajobe/.
[URI]
is used for a node with a URI,
[]
for a blank node, and
--[property]-->
is used for an arc.
[http://www.w3.org/TR/rdf-syntax-grammar] --[has a title] -> "RDF/XML Syntax Specification (Revised)" --[has an editor]-> [] --[has a name] -> "Dave Beckett" --[has a home page]-> [http://purl.org/net/dajobe/]
Taking one path through the graph:
[http://www.w3.org/TR/rdf-syntax-grammar] --[has an editor]-> [] --[has a home page]-> [http://purl.org/net/dajobe/]
This corresponds to the node/arc stripes:
[http://www.w3.org/TR/rdf-syntax-grammar]
-[has an editor]->
[]
-[has a home page]->
[http://purl.org/net/dajobe/]
In RDF/XML this sequence of 5 nodes and arcs correponds to 5 XML elements:
<rdf:Description> <ex:editor> <rdf:Description> <ex:homePage> <rdf:Description> </rdf:Description> </ex:homePage> </rdf:Description> </ex:editor> </rdf:Description>
Which consists of some nodes with known URIs that can be filled in and others that remain blank:
<rdf:Description rdf:about="http://www.w3.org/TR/rdf-syntax-grammar"> <ex:editor> <rdf:Description> <ex:homePage> <rdf:Description rdf:about="http://purl.org/net/dajobe/"> </rdf:Description> </ex:homePage> </rdf:Description> </ex:editor> </rdf:Description>
There are several abbreviations that can be used to make very
common uses more easy to write down. It is typical for the same
resource to be described with multiple properties and values at the
same time, so multiple child elements can be put inside
rdf:Description
, all of which are properties of that
node.
When the property value is a string it can be encoded more simply as an XML attribute and value, as an attribute of the node element. This is known as a property attribute.
Another very common use is when a node is an instance of a class
with rdf:type
relationship, usually called a typed
node. This shorthand is done by replacing the
rdf:Description
element name with the namespaced-element
corresponding to the URI of the value of the type relationship.
The above forms the basis of the RDF/XML syntax and although there are some other abbreviated forms, such as for generating the RDF list properties and for skipping having to write down a blank element node, which breaks the striping but is useful for, amongst other uses, encoding properties with multiple-values.
The example above filled out and completed, using some abbreviations gives:
<?xml version="1.0"?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:ex="http://example.org/stuff/1.0/"> <rdf:Description rdf:about="http://www.w3.org/TR/rdf-syntax-grammar"> <dc:title>RDF/XML Syntax Specification (Revised)</dc:title> <ex:editor rdf:parseType="Resource"> <ex:fullName>Dave Beckett</ex:fullName> <ex:homePage rdf:resource="http://purl.org/net/dajobe/" /> </ex:editor> </rdf:Description> </rdf:RDF>
For a longer introduction to the RDF/XML striped syntax with a historical perspective, see RDF: Understanding the Striped RDF/XML Syntax ([STRIPEDRDF]).
Note: This section is still under development and the working group is working on an RDF primer document.
This syntax operates on an XML document as a sequence of nodes in document order in the style of [XPATH] Information Set Mapping serialised into document-order. The resulting nodes are intended to be similar to the events that are produced by the [SAX2] XML API. This model is conceptual only and does not mandate any implementation method; in particular [XPATH] is not required.
The syntax does not support non-well-formed XML documents, nor documents that otherwise don't have an XML Information Set; for example, that don't conform to XML Namespaces W3C Recommendation ([XML-NS]).
This specification requires an information set as defined in [INFOSET] which supports at least the following information items and properties:
This specification does not require any destructive alterations to the input information set; no items are added, removed or modified..
This section is intended to satisfy the requirements for Conformance in the [INFOSET] specification.
There are six types of node defined in the following subsections. Most nodes are constructed from an Infoset information item (except for Identifier). The effect of a node constructor is to create a new node with a unique identity, distinct from all other nodes. Nodes have properties, and all have the string-value property that may be part of the node or computed from the string-value of contained nodes.
Created from an Document Information Item and takes the following properties and their values from the element information item: document-element, children and base-uri.
Created from an Element Information Item and takes the following properties and their values from the element information item: local-name, namespace-name, children, attributes and parent. When this node is created from such values, the URI property is defined with a string value of the concatenation of the value of the namespace-name property and the value of the local-name property. On creation the li-counter property is added with initial integer value 1.
The subject property may be added and takes the value of an Identifier node. This is used on elements that deal with one node in the RDF model, this generally being the subject of a statement.
Takes no properties but marks the end of the containing element in the sequence.
Created from an Attribute Information Item and takes the properties local-name, namespace-name and owner element and their values from respective element information item properties. When this node is created from such values, two properties and values are defined. Firstly the string-value property is defined with the normalized value as specified by [XML]. An attribute whose normalized value is a zero-length string is not treated specially: it results in an attribute node whose string-value is a zero-length string. Secondly the URI property is defined with a string value of the concatenation of the value of the namespace-name property and the value of the local-name property.
Created from a sequence of one or more consecutive Character Information Items. Has the single property string-value which has the value of the string made from concatenating the character code property of each of the character information items. [NOTE: Identical to XPath.]
A node for a typed identifier which can have the following three properties: identifier and identifier-type and string-value. These nodes are created by giving two values for the identifier and identifier-type properties. The identifier property takes a string value and the identifier-type property can take values "URI" or "bnodeID".
The string-value property is defined from the other properties as follows: If identifier-type is "URI" then the value is the concatenation of "<", the value of the identifier property and ">". If identifier-type is "bnodeID" then the value is the concatenation of "_:" and the value of the identifier property.
For further information on identifiers in the RDF model, see section 3.9.
To transform the Infoset into the sequence of nodes, each information item is transformed as described above to generate a tree of nodes with properties and values. Each element node is then replaced as described below to turn the tree of nodes into a sequence in document order.
The RDF Namespace URI is
http://www.w3.org/1999/02/22-rdf-syntax-ns#
and is typically used in XML with the prefix rdf
although
this is not required. The namespace contains the following names only:
RDF Description
ID about bagID parseType resource
subject predicate object
Seq Bag Alt Statement Property
type value
li _n
where n is a non-negative integer.
Throughout this document the terminology rdf:name will be used to indicate name is from the RDF namespace and it has a URI of the concatenation of the RDF Namespace URI and name. For example, rdf:type has the URI http://www.w3.org/1999/02/22-rdf-syntax-ns#type
Implementors Note: The names aboutEach and aboutEachPrefix were removed from the language by the RDF Core WG - see the issues rdfms-abouteach and rdfms-abouteachprefix for further information.
The RDF model uses three types of identifiers (or labels) for nodes and arcs in the graph - absolute URI references, literals and unlabelled or blank nodes. The latter are given local identifiers in the N-Triples serialisation of the model in order to represent the graph correctly. These identifiers can be generated and must match the name production in N-Triples.
The URI references can be either given as absolute URIs, relative URIs that have to be resolved from the document URI, or constructed. The constructed URIs in RDF are either made from XML Namespace qualified element or attributes names (QNames) or from the value of rdf:ID or rdf:bagID attribute values.
XML QNames give URIs by concatenating the namespace URI and
the XML local name. For example, if the XML Namespace prefix
foo
has URI http://example.org/somewhere/ then the QName
foo:bar
would correspond to the URI
http://example.org/somewhere/bar. Note that this restricts which
URIs can be made and the same URI can be given in multiple ways.
The rdf:ID and rdf:bagID values generate URIs by considering them as equivalent to the relative URI "#" concatenated with the attribute value. This can then be resolved relative to the document URI to give the absolute URI.
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119 ([KEYWORDS]).
The following notation is used for describing the nodes and grammar EBNF.
Notation | Meaning |
---|---|
property=value | A node property with a given value |
node.property | Returns the value of the given node property |
root(prop1=value1, prop2=value2, ...) |
A root node with properties |
start_element(prop1=value1, prop2=value2, ...) children end_element() |
A sequence of element node with properties, a possibly empty list of nodes as element content and an end element node |
attribute(prop1=value1, prop2=value2, ...) |
An attribute node with properties |
identifier(prop1=value1, prop2=value2, ...) |
An identifier node with properties |
text() | A text node |
base-uri | The value of the base-uri property of the root node |
list(item1, item2, ...); list() | An ordered list of items in document order; an empty list |
set(item1, item2, ...); set() | An unordered set of items; an empty set |
* | Zero or more of preceding term |
? | Zero or one of preceding term |
+ | One or more of preceding term |
A | B | ... | The A, B, ... terms are alternatives. |
A - B | The term A but not the term B |
"ABC" | A string of characters A, B, C in order. |
concat(A, B, ..) | A string created by concatenating the terms in order. |
anyURI | Any legal URI. |
anyString | Any string. |
rdf:X | See section 3.8 |
The following notation forms are used to indicate
A sequence of lines of N-Triples output from a grammar production adding to an RDF model.
If the RDF/XML is a standalone XML content, then the grammar starts with Root Node doc.
If the content is known to be RDF/XML by context, such as when RDF/XML is embedded inside other XML content, then the grammar can either start at Element Node RDF (only when an element is legal at that point in the XML) or at production nodeElementList (only when element content is legal, since this is a list of elements). For such embedded RDF/XML, the base-uri value must be initialised from the containing XML since no Root Node will be available. Note that if such embedding occurs, the grammar may be entered several times but no state is expected to be preserved.
root(document-element=RDF,
children=list(RDF))
start_element(URI = rdf:RDF,
attributes=set())
nodeElementList
end_element()
ws* (nodeElement ws* )*
start_element(URI=anyURI,
attributes=set((idAttr | aboutAttr )?, bagIdAttr?, propertyAttr*))
propertyEltList
end_element()
The processing of some of the attributes have to be done before other work such as dealing with children nodes or other attributes. These can be processed in any order:
The following can then be performed in any order:
e.subject.string-value <a.URI> <a.string-value> .
e.subject.string-value <a.URI> "a.string-value" .
e.subject.string-value <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <e.URI> .
If an attribute a with a.URI = rdf:bagID is present, create a new node n = identifier(identifier=concat(base-uri, "#", a.string-value), identifier-type="URI") and add the following statement to the model:
n.string-value <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag> .
Then for all statements generated above (except the immediately previous statement) are reified with node n using the reification rules in section 5.26.
WARNING The rules for rdf:bagID given here are known to be incomplete in this draft and this is under consideration by the RDF Core working group.
White space as defined by [XML] definition White Space Rule [3] S in section Common Syntactic Constructs
ws* (propertyElt ws* ) *
If element e has e.URI = rdf:li then apply the list expansion rules on element e.parent in section 5.27 to give a new URI u and set the value of e.URI to be u.
Note: It is expected that in future the number of values of the rdf:parseType attribute will increase, and those values will probably be XML namespace-qualified names (QNames).
start_element(URI=anyURI,
attributes=set(idAttr?))
ws* nodeElement ws*
end_element()
For element e, and the single contained nodeElement n the following statement is added to the model:
e.parent.subject.string-value <e.URI> n.subject.string-value .
If the rdf:ID attribute a is given, the above statement is reified with identifier(identifier=concat(base-uri, "#", a.string-value), identifier-type="URI") using the reification rules in section 5.26.
start_element(URI=anyURI,
attributes=set(idAttr?))
text()
end_element()
Note: The empty literal case is defined in production emptyPropertyElt
For element e, and the text node t the following statement is added to the model:
e.parent.subject.string-value <e.URI> "t.string-value" .
If the rdf:ID attribute a is given, the above statement is reified with identifier(identifier=concat(base-uri, "#", a.string-value), identifier-type="URI") using the reification rules in section 5.26.
start_element(URI=anyURI,
attributes=set(idAttr?, parseLiteral))
literal
end_element()
For element e and the literal l, if l is empty then the statement object value is "" and the following statement is added to the model:
e.parent.subject.string-value <e.URI> "" .
Test: Indicated by test009.rdf and test009.nt
Otherwise, the following statement is added to the model:
e.parent.subject.string-value <e.URI> l.string-value .
If the rdf:ID attribute a is given, the above statement is reified with identifier(identifier=concat(base-uri, "#", a.string-value), identifier-type="URI") using the reification rules in section 5.26.
Open Issue: The result of a literal from rdf:parseType="Literal" content has not yet been decided by the RDF Core WG; it is dependent on the resolution of several open issues. One possible method would be to serialise it into a string but that has several problems including use of namespaces. Another could be to use the XML Canonicalisation W3C Recommendation but since this document is a revision of an earlier syntax, it may be difficult to require the use of this newer standard.
start_element(URI=anyURI,
attributes=set(idAttr?, parseResource))
propertyEltList
end_element()
Generate a local blank node identifier i and use it to create a new node n with the value of identifier(identifier=i, identifier-type="bnodeID").
Add the following statement to the model:
e.parent.subject.string-value <e.URI> n.string-value .
Test: Indicated by test004.rdf and test004.nt
If the rdf:ID attribute a is given, the statement above is reified with identifier(identifier=concat(base-uri, "#", a.string-value), identifier-type="URI") using the reification rules in section 5.26.
If the element content c is not an empty, then use node n to create a new sequence of nodes as follows:
start_element(URI=rdf:Description,
subject=n,
attributes=set())
c
end_element()
Then process the resulting sequence using production nodeElement.
start_element(URI=anyURI,
attributes=set(idAttr?, parseOther))
propertyEltList
end_element()
The processing of rdf:parseType string values other than "Resource" or "Literal" is currently to treat the content as if it were "Literal". Processing MUST then continue at production parseTypeLiteralPropertyElt.
Note: It is RECOMMENDED, but not REQUIRED that the rdf:parseType value is made available to user applications, possibly as part of the literal value. This note depends on the resolution of some open RDF Core WG issues so may be clarified further in future drafts.
start_element(URI=anyURI,
attributes=set((idAttr | resourceAttr)?, bagIdAttr?, propertyAttr*))
end_element()
Choose one of the following combinations of allowed attributes. Note in particular that rdf:ID and rdf:resource are alternatives, or both can be omitted and furthermore that bagID cannot be used when there are no propertyAttr given.
If there are no attributes or only the optional rdf:ID attribute i then the following statement is added to the model:
e.parent.subject.string-value <e.URI> "" .
and then if i is given, the above statement is reified with identifier(identifier=concat(base-uri, "#", i.string-value), identifier-type="URI") using the reification rules in section 5.26.
Test: Indicated by test002.rdf and test002.nt
Test: Indicated by test005.rdf and test005.nt
If there is an rdf:resource attribute a and no other attributes then add the following statement to the model:
e.parent.subject.string-value <e.URI> <a.string-value> .
Test: Indicated by test001.rdf and test001.nt
If there is one or more propertyAttr; an rdf:ID attribute, an rdf:resource attribute or neither; and optionally an rdf:bagID attribute b, then:
Then add the following statement to the model:
e.parent.subject.string-value <e.URI> r.string-value .
For all propertyAttr a
If a.URI = rdf:type then the following statement is added to the model:
r.string-value <a.URI> <a.string-value> .
Otherwise the following statement is added to the model:
r.string-value <a.URI> "a.string-value" .
If b is given, create a new node n = identifier(identifier=concat(base-uri, "#", b.string-value), identifier-type="URI") and add the following statement to the model:
n.string-value <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/1999/02/22-rdf-syntax-ns#Bag> .
Then all statements generated from the propertyAttr attributes are reified with node n using the reification rules in section 5.26.
Test: Indicated by test013.rdf and test013.nt
Test: Indicated by test014.rdf and test014.nt
WARNING The use of rdf:ID given here is still under consideration by the RDF Core working group - see the issue rdfms-not-id-and-resource-attr.
WARNING The rules for rdf:bagID given here are known to be incomplete in this draft and this is under consideration by the RDF Core working group.
attribute(URI = rdf:ID
string-value=rdf-id)
Note that the names used as values of rdf:ID and rdf:bagID attributes must be unique in a single RDF/XML document since they come from the same set of names.
attribute(URI = rdf:about
string-value=URI-reference)
attribute(URI = rdf:bagID
string-value=rdf-id)
Note that the names used as values of rdf:ID and rdf:bagID attributes must be unique in a single RDF/XML document since they come from the same set of names.
attribute(URI=anyURI - ( rdf:RDF | rdf:Description | rdf:ID | rdf:about | rdf:bagID | rdf:parseType | rdf:resource | rdf:li),
string-value=anyString)
attribute(URI = rdf:resource
string-value=URI-reference)
attribute(URI = rdf:parseType
string-value="Literal")
attribute(URI = rdf:parseType
string-value="Resource")
attribute(URI = rdf:parseType
string-value=anyString - ("Resource" | "Literal") )
CDATA interpreted as a URI reference defined in Uniform Resource Identifiers (URI) ([URIS]) BNF production URI-reference.
Any XML element content that is allowed according to [XML] definition Content of Elements Rule [43] content. in section 3.1 Start-Tags, End-Tags, and Empty-Element Tags
ISSUE: Should this be changed from any legal XML Nmtoken to be the same as that for XML IDs? In [XML] XML IDs must match Validity constraint: ID which requires the identifiers to match the Name production - a more restricted identifier than Nmtoken.
For a statement with terms s, p and o corresponding to the N-Triples:
s p o .
add the following statements to the model using the given Identifier Node r:
r.string-value <http://www.w3.org/1999/02/22-rdf-syntax-ns#subject> s .
r.string-value <http://www.w3.org/1999/02/22-rdf-syntax-ns#predicate> p .
r.string-value <http://www.w3.org/1999/02/22-rdf-syntax-ns#object> o .
r.string-value <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/1999/02/22-rdf-syntax-ns#Statement> .
For the given element e, generate a new URI u with value concat("http://www.w3.org/1999/02/22-rdf-syntax-ns#_", e.li-counter) property, increment the value of the e.li-counter property by 1 and return u.
It is not possible for all graphs that can be expressed in the RDF Model Theory ([RDF-MODEL]) to be encoded in this syntax. If you do a round trip from RDF/XML to RDF graph and then back to RDF/XML the meaning will be the same but don't expect the RDF/XML that comes out to be exactly the same.
There are two different approaches to serialising RDF.
The basic approach uses the basic RDF syntax from [RDFMS]. In this:
rdf:Description
element, using an rdf:about
attribute.rdf:resource
attribute specifying
the object of the triple.The basic serialisation is recommended for applications in which the output RDF/XML is to be used only in further RDF processing. Where the intent is for the output RDF/XML file to be read by people, the basic serialisation proves unsatisfactory. The basic serialisation does not conform to more restricted sub-dialects of RDF, such as RSS[RSS] or CC/PP[CC/PP]. Hence, it is not appropriate for such applications, for which dialect specific serialisers are needed.
If more human readable output is needed the following factors should be considered:
It is not possible to use the RDF/XML serialisation for serialising an RDF graph in which any triple has a property label which cannot be expressed as a XML namespace-qualified name (QName).
An approach to serialising RDF/XML using the full grammar in a top-down recursive descent fashion is discussed in [UNPARSING].
The following people provided valuable contributions to the document:
This section records local issues to be resolved and issues that were reported to the RDF Core WG related to the XML syntax and their disposition. This section is not the definitive list or description of the latter - see the RDF Core WG issues list. Decided issues may also have associated test cases which can be found in the RDF Test Cases W3C Working Draft.
Add more an introductory section on how the syntax works including examples (graph, XML elements) based on the descriptive version in Dan Brickley's RDF: Understanding the Striped RDF/XML Syntax
rdf:bagID specification is incomplete and wrong.
The resolution texts here are suggestions only and not agreed by the working group.
What triples are generated for nested description elements with bagIDs?
Action: Nested description elements with bagIDs generate the same triples as top-level description elements with bagIDs. Specifically triples generated as a result of the parent propertyElt element do not get reified and included in the bag.
Suggestion that the rdf:value property be replaced by rdf:toString.
Action: This will not be changed in the syntax but the usage of the rdf:value property will be described in the RDF primer and/or Model Theory.
How should a parser process namespaces in a literal which is XML markup?
Action: ?
The mapping of QNames to URI's generates incorrect URI's.
Action: The algorithm to generate URIs for RDF concepts cannot be changed in the current syntax without breaking existing applications. To address the specific example in the issue of XML Schemas, RDF applications can use the namespace URI http://www.w3.org/2000/10/XMLSchema# in RDF/XML to generate the correct XML Schema concept URIs for properties and classes etc. This approach has been successfully with the DAML+OIL using RDF, RDFS and DAML terms along with XML schema data types.
RDF embedded in XHTML and other XML documents is hard to validate
Action: Will not be addressed in the current RDF/XML syntax since it is likely to require changes that would not be backwards compatible. Some help with validation can be found with the schemas for XML validation in Appendix D.
How does xml-base affect RDF
Action: ?
What mime type should RDF Schema and other RDF documents have?
Action: This document [will] defines the syntax for Internet Media Type (or MIME Type) for application/rdf+xml and the registration of this type will be done when this document is stable. NOTE: This is an unregistered type at this time and should not be used in applications. See also the Draft for RDF Media Type registration by Aaron Swartz.
MUST a parser created bags of reified statements for all Description elements?
Action: No, only those which are explicitly reified using an rdf:ID on a propertyElt or by an rdf:bagID on the description.
The propertyElt production 6.12 of the grammar does not allow both an ID attribute and a resource attribute to be specified.
Action: Action: The grammar has[will be] been modified to forbid the use of an rdf:ID attribute on an empty property element. This is consistent with using rdf:ID="attr" as an abbreviation for rdf:about="#attr" and removes the suggestion that it reifys a statement, which it never did in the original grammar form.
What is the difference between using and ID attribute to 'create' a new resource and an about attribute to refer to it?
Action: rdf:ID="attr" is an abbreviation for rdf:about="#attr" and the handling of rdf:ID has been[will be] updated to show this.
On 25th May 2001, the WG decided that ALL attributes must be namespace qualified. There is a description of the decision, including detail on the grammar productions affected and a collection of test cases
Action: Removal of original grammar productions 6.6, 6.7, 6.8, 6.9, 6.11, 6.18, 6.32, 6.33
On 1st June 2001, the WG decided that aboutEachPrefix would be removed from the RDF Model and Syntax Recommendation on the grounds that there is a lack of implementation experience, and it therefore should not be in the recommendation. A future version of RDF may consider support for this feature.
Action: Removal of original grammar production 6.8
On 29th June 2001, the WG decided that containers will match the typed node production in the grammar (production 6.13) and that the container specific productions (productions 6.25 to 6.31) and any references to them be removed from the grammar. rdf:li elements will be translated to rdf:_nnn elements when they are found matching either a propertyElt (production 6.12) or a a typedNode (production 6.13). The decision includes a set of test cases.
Action: Removal of original grammar productions 6.25, 6.26, 6.27, 6.28, 6.29, 6.30, 6.31
On 8th June 2001 the WG decided how empty property elements should be interpreted. The decision is fully represented by the test cases.
Action: Inserted pointers to the the test cases into the grammar at the places where empty property elements are recognised.
On 29th June 2001, the WG decided that rdf:aboutEach attributes are not allowed on an rdf:Description (or typed node) element which is the object of a statement.
Action: None needed - rdf:aboutEach removed from the language on 7th December 2001.
The language describing the syntax is unclear [in section 6]
On 26th October 2001, the WG decided that this issue is closed by the new approach to defining the syntax in this document.
Action: A main goal of this document is to make the syntax clearer and more precise. In particular the grammar section and the pointers to schemas for XML validation help address this.
A formal grammar for RDF.
On 26th October 2001, the WG decided that this issue is closed by the new approach to defining the syntax in this document.
Action: A main goal of this document is to make the syntax clearer and more precise. In particular the grammar section and the pointers to schemas for XML validation help address this.
On 30th November 2001, the WG decided that this issue was closed by the following resolution.
Action: The use of rdf:RDF, rdf:ID, rdf:about, rdf:resource, rdf:bagID, rdf:parseType, rdf:aboutEach and rdf:li except as reserved names as specified in the grammar is an error. [Later rdf:aboutEach was removed from the language on 7th December 2001]
processing rdf:aboutEach requires a processing of sub-property relations.
On 7th December 2001, the WG decided to remove rdf:aboutEach from the language on the grounds it is not widely used, it is not widely implemented correctly, it has confusing interactions with bagID as recorded in rdfms-abouteach, it does not scale as parsers have to save state, this is the wrong layer in which to implement such functionality. (Note: This is an unofficial record of the resolution since at publication date, the issue list has not been updated to record it.)
Action: Removed from the grammar.
On 7th December 2001, the WG decided to remove rdf:aboutEach from the language and consequently this issue was closed.
Action: None needed.
The syntax needs a more convenient way to express the reification of a statement.
On 26th October 2001, the WG decided that this issue was postponed for consideration by a future working group.
Action: None required.
The RDF XML syntax cannot represent all possible Property URI's.
On 26th October 2001, the WG decided that this issue was postponed for consideration by a future working group.
Action: None required.
Suggestion that Qnames should be allowed as values for attributes such as rdf:about.
On 26th October 2001, the WG decided that this issue was postponed for consideration by a future working group.
Action: None required.
The RDF/XML syntax can't represent an an arbritary graph structure.
On 26th October 2001, the WG decided that this issue was postponed for consideration by a future working group.
Action: None required.
Two schema language authors submitted schemas for RDF/XML based on the revised grammar in the previous version of this draft. We include pointers to these schemas for information purposes and an example schema; they are not part of this specification.
This is an example schema in RELAX NG's non-XML format (for ease of reading) but applications should use the standard XML version. These formats are described in RELAX NG ([RELAXNG]) and RELAX NG Non-XML Syntax ([RELAXNG-NX]).
# # RELAX NG Schema (non-XML) for RDF/XML Syntax # # This schema is for information only and NON-NORMATIVE # # It is based on one originally written by James Clark in # http://lists.w3.org/Archives/Public/www-rdf-comments/2001JulSep/0248.html # and updated with later changes. # namespace local = "" namespace rdf = "http://www.w3.org/1999/02/22-rdf-syntax-ns#" datatypes xsd = "http://www.w3.org/2001/XMLSchema-datatypes" start = doc doc = RDF RDF = element rdf:RDF { nodeElementList } nodeElementList = nodeElement* # Should be something like: # ws* , ( nodeElement , ws* )* # but RELAXNG does this by default, ignoring whitespace separating tags. nodeElement = element * - (local:* |rdf:RDF |rdf:ID|rdf:about |rdf:bagID|rdf:parseType|rdf:resource |rdf:li ) { (idAttr | aboutAttr )?, bagIdAttr?, propertyAttr*, propertyEltList } # FIXME: Not sure if it is possible to say "and not things # beginning with _ in the rdf: namespace". ws = " " # Not used in this RELAX NG schema; but should be any legal XML # whitespace defined by http://www.w3.org/TR/2000/REC-xml-20001006#NT-S propertyEltList = propertyElt* # Should be something like: # ws* , ( propertyElt , ws* )* # but RELAXNG does this by default, ignoring whitespace separating tags. propertyElt = resourcePropertyElt | literalPropertyElt | parseTypeLiteralPropertyElt | parseTypeResourcePropertyElt | parseTypeOtherPropertyElt | emptyPropertyElt resourcePropertyElt = element * - (local:* |rdf:RDF|rdf:Description |rdf:ID|rdf:about |rdf:bagID|rdf:parseType|rdf:resource) { idAttr?, nodeElement } literalPropertyElt = element * - (local:* |rdf:RDF|rdf:Description |rdf:ID|rdf:about |rdf:bagID|rdf:parseType|rdf:resource) { idAttr?, text } parseTypeLiteralPropertyElt = element * - (local:* |rdf:RDF|rdf:Description |rdf:ID|rdf:about |rdf:bagID|rdf:parseType|rdf:resource) { idAttr?, parseLiteral, literal } parseTypeResourcePropertyElt = element * - (local:* |rdf:RDF|rdf:Description |rdf:ID|rdf:about |rdf:bagID|rdf:parseType|rdf:resource) { idAttr?, parseResource, propertyEltList } parseTypeOtherPropertyElt = element * - (local:* |rdf:RDF|rdf:Description |rdf:ID|rdf:about |rdf:bagID|rdf:parseType|rdf:resource) { idAttr?, parseOther, any } emptyPropertyElt = element * - (local:* |rdf:RDF|rdf:Description |rdf:ID|rdf:about |rdf:bagID|rdf:parseType|rdf:resource) { (idAttr | resourceAttr)?, bagIdAttr?, propertyAttr* } idAttr = attribute rdf:ID { IDsymbol } aboutAttr = attribute rdf:about { URI-reference } bagIdAttr = attribute rdf:bagID { IDsymbol } propertyAttr = attribute * - (local:* |rdf:RDF|rdf:Description |rdf:ID|rdf:about |rdf:bagID|rdf:parseType|rdf:resource |rdf:li) { string } resourceAttr = attribute rdf:resource { URI-reference } parseLiteral = attribute rdf:parseType { "Literal" } parseResource = attribute rdf:parseType { "Resource" } parseOther = attribute rdf:parseType { text } URI-reference = string literal = any IDsymbol = xsd:NMTOKEN any = mixed { element * { attribute * { text }*, any }* }
Two schema language authors submitted schemas for RDF/XML based on the new grammar in the previous version of this draft. We include pointers to these schemas for information purposes; they are not part of this specification.
This section contains the EBNF grammar of the RDF/XML syntax from RDF Model & Syntax Formal Grammar for RDF section. The only changes made here were to make it legal XHTML via tidy and to change the links to the productions to point to those in the original document.
[6.1] RDF ::= ['<rdf:RDF>'] obj* ['</rdf:RDF>'] [6.2] obj ::= description | container [6.3] description ::= '<rdf:Description' idAboutAttr? bagIdAttr? propAttr* '/>' | '<rdf:Description' idAboutAttr? bagIdAttr? propAttr* '>' propertyElt* '</rdf:Description>' | typedNode [6.4] container ::= sequence | bag | alternative [6.5] idAboutAttr ::= idAttr | aboutAttr | aboutEachAttr [6.6] idAttr ::= ' ID="' IDsymbol '"' [6.7] aboutAttr ::= ' about="' URI-reference '"' [6.8] aboutEachAttr ::= ' aboutEach="' URI-reference '"' | ' aboutEachPrefix="' string '"' [6.9] bagIdAttr ::= ' bagID="' IDsymbol '"' [6.10] propAttr ::= typeAttr | propName '="' string '"' (with embedded quotes escaped) [6.11] typeAttr ::= ' type="' URI-reference '"' [6.12] propertyElt ::= '<' propName idAttr? '>' value '</' propName '>' | '<' propName idAttr? parseLiteral '>' literal '</' propName '>' | '<' propName idAttr? parseResource '>' propertyElt* '</' propName '>' | '<' propName idRefAttr? bagIdAttr? propAttr* '/>' [6.13] typedNode ::= '<' typeName idAboutAttr? bagIdAttr? propAttr* '/>' | '<' typeName idAboutAttr? bagIdAttr? propAttr* '>' propertyElt* '</' typeName '>' [6.14] propName ::= Qname [6.15] typeName ::= Qname [6.16] idRefAttr ::= idAttr | resourceAttr [6.17] value ::= obj | string [6.18] resourceAttr ::= ' resource="' URI-reference '"' [6.19] Qname ::= [ NSprefix ':' ] name [6.20] URI-reference ::= string, interpreted per [URI] [6.21] IDsymbol ::= (any legal XML name symbol) [6.22] name ::= (any legal XML name symbol) [6.23] NSprefix ::= (any legal XML namespace prefix) [6.24] string ::= (any XML text, with "<", ">", and "&" escaped) [6.25] sequence ::= '<rdf:Seq' idAttr? '>' member* '</rdf:Seq>' | '<rdf:Seq' idAttr? memberAttr* '/>' [6.26] bag ::= '<rdf:Bag' idAttr? '>' member* '</rdf:Bag>' | '<rdf:Bag' idAttr? memberAttr* '/>' [6.27] alternative ::= '<rdf:Alt' idAttr? '>' member+ '</rdf:Alt>' | '<rdf:Alt' idAttr? memberAttr? '/>' [6.28] member ::= referencedItem | inlineItem [6.29] referencedItem ::= '<rdf:li' resourceAttr '/>' [6.30] inlineItem ::= '<rdf:li' '>' value </rdf:li>' | '<rdf:li' parseLiteral '>' literal </rdf:li>' | '<rdf:li' parseResource '>' propertyElt* </rdf:li>' [6.31] memberAttr ::= ' rdf:_n="' string '"' (where n is an integer) [6.32] parseLiteral ::= ' parseType="Literal"' [6.33] parseResource ::= ' parseType="Resource"' [6.34] literal ::= (any well-formed XML)
(Note: there are EBNF bugs in the 6.30 production where the </rdf:li> tags are not fully enclosed in quotes as '</rdf:li>')
This section updates the original grammar in Appendix C by amending and deleting various productions according to the recorded RDF Core WG decisions. Some productions are also removed since they are no longer needed, once the above changes are made.
Key:
This text should be added If it is not, your
browser will not display this section properly.
This text should be deleted. If it is not, your
browser will not display this section properly.
Production Number |
Production Name |
Definition |
---|---|---|
6.1 | RDF | "<rdf:RDF>" | description |
6.3 | description | "<rdf:Description" idAboutAttr? bagIdAttr? propAttr* "/>" | "<rdf:Description" idAboutAttr? bagIdAttr? propAttr* ">" propertyElt* "</rdf:Description>" | typedNode |
6.5 | idAboutAttr | idAttr | aboutAttr | aboutEachAttr |
6.6 | idAttr | " rdf:ID=\"" IDsymbol "\"" |
6.7 | aboutAttr | " rdf:about=\"" URI-reference "\"" |
6.8 | aboutEachAttr | " rdf:aboutEach=\"" URI-reference "\"" |
6.9 | bagIdAttr | " rdf:bagID=\"" IDsymbol "\"" |
6.10 | propAttr | typeAttr | propName "=\"" string "\"" (with embedded quotes escaped) |
6.11 | typeAttr | " rdf:type=\"" URI-reference "\"" |
6.12 | propertyElt | "<" propName idAttr? ">" value "</" propName ">" | "<" propName idAttr? parseLiteral ">" literal "</" propName ">" | "<" propName idAttr? parseResource ">" propertyElt* "</" propName ">" | "<" propName idRefAttr? bagIdAttr? propAttr* "/>" |
6.13 | typedNode | "<" typeName idAboutAttr? bagIdAttr? propAttr* "/>" | "<" typeName idAboutAttr? bagIdAttr? propAttr* ">" propertyElt* "</" typeName ">" |
6.14 | propName | Qname |
6.15 | typeName | Qname |
6.16 | idRefAttr | idAttr | resourceAttr |
6.17 | value | |
6.18 | resourceAttr | " rdf:resource=\"" URI-reference "\"" |
6.19 | Qname | |
6.20 | URI-reference | string, interpreted per [URI] |
6.21 | IDsymbol | any legal XML name symbol |
6.22 | name | any legal XML name symbol |
6.23 | NSprefix | any legal XML namespace prefix |
6.24 | string | any XML text, with "<", ">", and "&" escaped |
| "<rdf:Seq" idAttr? memberAttr* "/>" |
||
| "<rdf:Bag" idAttr? memberAttr* "/>" |
||
| "<rdf:Alt" idAttr? memberAttr? "/>" |
||
| "<rdf:li" parseLiteral ">" literal </rdf:li>" | "<rdf:li" parseResource ">" propertyElt* </rdf:li>" |
||
6.32 | parseLiteral | " rdf:parseType=\"Literal\"" |
6.33 | parseResource | " rdf:parseType=\"Resource\"" |
6.34 | literal | any well-formed XML |
Changes since the 06 September 2001 working draft