XQuery 1.0 and XPath 2.0 Formal Semantics

1 Introduction

This document defines the formal semantics of XQuery 1.0 and XPath 2.0. The present document is part of a set of documents that together define the XQuery 1.0 and XPath 2.0 languages:

[XQuery 1.0: An XML Query Language] introduces the XQuery 1.0 language, defines its capabilities from a user-centric view, and defines the language syntax.
[XML Path Language (XPath) 2.0] introduces the XPath 2.0 language, defines its capabilities from a user-centric view, and defines the language syntax.
[Functions and Operators] lists the functions and operators defined for the [XPath/XQuery] language and specifies the required types of their parameters and return value.
[Data Model] formally specifies the data model used by [XPath/XQuery] to represent the content of XML documents. The [XPath/XQuery] language is formally defined by operations on this data model.
[Data Model Serialization] specifies how [XPath/XQuery] data model values are serialized into XML.

The scope and goals for the [XPath/XQuery] language are discussed in the charter of the W3C [XSL/XML Query] Working Group and in the [XPath/XQuery] requirements [XML Query 1.0 Requirements].

This document defines the semantics of [XPath/XQuery] by giving a precise formal meaning to each of the expressions of the [XPath/XQuery] specification in terms of the [XPath/XQuery] data model. This document assumes that the reader is already familiar with the [XPath/XQuery] language. This document defines the formal semantics for XPath 2.0 only when the XPath 1.0 backward compatibility rules are not in effect.

Two important design aspects of [XPath/XQuery] are that it is functional and that it is typed. These two aspects play an important role in the [XPath/XQuery] Formal Semantics.

[XPath/XQuery] is a functional language. [XPath/XQuery] is built from expressions, rather than statements. Every construct in the language (except for the XQuery query prolog) is an expression and expressions can be composed arbitrarily. The result of one expression can be used as the input to any other expression, as long as the type of the result of the former expression is compatible with the input type of the latter expression with which it is composed. Another characteristic of a functional language is that variables are always passed by value, and a variable's value cannot be modified through side effects.

[XPath/XQuery] is a typed language. Types can be imported from one or more XML Schemas that describe the input documents and the output document, and the [XPath/XQuery] language can then perform operations based on these types. In addition, [XPath/XQuery] supports static type analysis. Static type analysis infers the output type of an expression based on the type of its input expressions. In addition to inferring the type of an expression for the user, static typing allows early detection of type errors, and can be used as the basis for certain classes of optimization. The [XPath/XQuery] type system captures most of the features of [Schema Part 1], including global and local element and attribute declarations, complex and simple type definitions, named and anonymous types, derivation by restriction, extension, list and union, substitution groups, and wildcard types. It does not model uniqueness constraints and facet constraints on simple types.

This document is organized as follows. [2 Preliminaries] introduces the notations used to define the [XPath/XQuery] Formal Semantics. These include the formal notations for values in the [XPath/XQuery] data model and for types in XML Schema. The next three sections, [3 Basics], [4 Expressions], and [5 Modules and Prologs], have the same structure as the corresponding sections in the [XQuery 1.0: An XML Query Language] and [XML Path Language (XPath) 2.0] documents. This allows the reader to quickly find the formal definition of a particular language construct. [3 Basics] defines the semantics for basic [XPath/XQuery] concepts, and [4 Expressions] defines the dynamic and static semantics of each [XPath/XQuery] expression. [5 Modules and Prologs] defines the semantics of the [XPath/XQuery] prolog. [7 Additional Semantics of Functions] defines the static semantics of several functions in [Functions and Operators] and gives the dynamic and static semantics of several supporting functions used in this document. The remaining sections, [8 Auxiliary Judgments] and [C Importing Schemas], contain material that supports the formal semantics of [XPath/XQuery]. [8 Auxiliary Judgments] defines formal judgments that relate data model values to types, that relate types to types, and that support the formal definition of validation. These judgments are used in the definition of expressions in [4 Expressions]. Lastly, [C Importing Schemas] specifies how XML Schema documents are imported into the [XPath/XQuery] type system and relates XML Schema types to the [XPath/XQuery] type system.

1.1 Normative and Informative Sections

Certain aspects of language processing are described in this specification as implementation-defined or implementation-dependent.

[Definition: Implementation-defined indicates an aspect that may differ between implementations, but must be specified by the implementor for each particular implementation.]
[Definition: Implementation-dependent indicates an aspect that may differ between implementations, is not specified by this or any W3C specification, and is not required to be specified by the implementor for any particular implementation.]

A language aspect described in this specification as implementation-defined or implementation-dependent may be further constrained by the specifications of a host language in which XPath or XQuery is embedded.

This document contains the normative static semantics of [XPath/XQuery]. The static semantics rules in [3 Basics], [4 Expressions], [5 Modules and Prologs], and [7 Additional Semantics of Functions] are normative. [3.1.1 Static Context] is normative, because it defines the static context used in the static typing rules. [8 Auxiliary Judgments] is normative, because it contains all the judgments necessary for defining SequenceType Matching.

The dynamic semantics of [XPath/XQuery] are normatively defined in [XQuery 1.0: An XML Query Language] and [XML Path Language (XPath) 2.0]. In this document, the dynamic semantics rules in [3 Basics], [4 Expressions], and [5 Modules and Prologs], the examples, and the material labeled as "Note" are provided for explanatory purposes and are not normative.

The mapping rules from XML Schema to the XQuery type system provided in [C Importing Schemas], and the formal semantics of XML Schema validation in [E Auxiliary Judgments for Validation] are informative and do not handle every feature of XML Schema.

2 Preliminaries

This section provides the background necessary to understand the Formal Semantics, introduces the notations that are used, and explains its relationship to other documents.

2.1 Introduction to the Formal Semantics

Why a Formal Semantics? The goal of the formal semantics is to complement the [XPath/XQuery] specification ([XQuery 1.0: An XML Query Language] and [XML Path Language (XPath) 2.0]), by defining the meaning of [XPath/XQuery] expressions with mathematical rigor.

A rigorous formal semantics clarifies the intended meaning of the English specification, ensures that no corner cases are left out, and provides a reference for implementation.

Why use formal notations? Rigor is achieved by the use of formal notations to represent [XPath/XQuery] objects such as expressions, XML values, and XML Schema types, and by the systematic definition of the relationships between those objects to reflect the meaning of the language. In particular, the dynamic semantics relates [XPath/XQuery] expressions to the XML value to which they evaluate, and the static semantics relates [XPath/XQuery] expressions to the XML Schema type that is inferred for that expression.

The Formal Semantics uses several kinds of formal notations to define the relationships between [XPath/XQuery] expressions, XML values, and XML Schema types. This section introduces the notations for judgments, inference rules, and mapping rules as well as the notation for environments, which implement the dynamic and static contexts. The reader already familiar with these notations can skip this section and continue with [2.3 XML Values].

2.1.1 Notations from grammar productions

Grammar productions are used to describe "objects" (values, types, [XPath/XQuery] expressions, etc.) manipulated by the Formal Semantics. The Formal Semantics makes use of several kinds of grammar productions: productions from the [XPath/XQuery] grammar itself, productions for a subset of the [XPath/XQuery] language called the XQuery Core which is used throughout this document, and other productions used for formal specification only such as for the XQuery type system.

XQuery grammar productions describe the XQuery language and expressions. XQuery productions are identified by a number, which corresponds to their number in the [XQuery 1.0: An XML Query Language] document, and are marked with "(XQuery)". For instance, the following production describes FLWOR expressions in XQuery.

[For/FLWOR] Expressions

[33 (XQuery)] FLWORExpr^XQ ::= (ForClause | LetClause)+ WhereClause? OrderByClause? "return" ExprSingle

For the purpose of this document, the differences between the XQuery 1.0 and the XPath 2.0 grammars are mostly irrelevant. By default, this document uses XQuery 1.0 grammar productions. Whenever the grammar for XPath 2.0 differs from the one for XQuery 1.0, the corresponding XPath 2.0 productions are also given. XPath productions are identified by a number, which corresponds to their number in [XML Path Language (XPath) 2.0], and are marked with "(XPath)". For instance, the following production describes for expressions in XPath.

[For/FLWOR] Expressions

[4 (XPath)] ForExpr^XP ::= SimpleForClause "return" ExprSingle

XQuery Core grammar productions describe the XQuery Core. The Core grammar is given in [A Normalized core grammar]. Core productions are identified by a number, which corresponds to their number in [A Normalized core grammar], and are marked with "(Core)". For instance, the following production describes the simpler form of the "FLWOR" expression in the XQuery Core.

Core FLWOR Expressions

[32 (Core)] FLWORExpr ::= (ForClause | LetClause) "return" ExprSingle

The Formal Semantics manipulates "objects" (values, types, expressions, etc.) for which there is no existing grammar production in the [XQuery 1.0: An XML Query Language] document. In these cases, specific grammar productions are introduced. Notably, additional productions are used to describe values in the [Data Model], and to describe the [XPath/XQuery] type system. Formal Semantics productions are identified by a number, and are marked by "(Formal)". For instance, the following production describes global type definitions in the [XPath/XQuery] type system.

Type Definitions

[39 (Formal)] Definition ::= ("define" "element" ElementName OptSubstitution OptNillable TypeReference) | ("define" "attribute" AttributeName TypeReference) | ("define" "type" TypeName TypeDerivation)

Note that grammar productions that are specific to the Formal Semantics (i.e., marked with "(Formal)") are not part of [XPath/XQuery]. They are not accessible to the user and are only used in the course of defining the languages' semantics.

Grammar non-terminals are used extensively in this document to represent objects in judgments (see the next section). As a convenience, non-terminals used in judgments link to the appropriate grammar production.

2.1.2 Notations for judgments

The basic building block of the formal specification is called a judgment. A judgment expresses whether a property holds or not.

For example:

Notation

The judgment

Object is a positive integer

holds if the object Object is a positive integer.

A judgment may hold (if it is true) or not hold (if it is false). For instance '1 is a positive integer' holds and '-1 is a positive integer' does not hold.

Notation

Here are two other example judgments.

The judgment

Expr => Value

holds if the expression Expr yields (or evaluates to) the value Value.

The judgment

Expr : Type

holds if the expression Expr has the type Type.

Most other judgments used in this document are short English sentences intended to reflect their meaning, and written in bold fonts. For instance, the judgment

Axis principal PrincipalNodeKind

holds if PrincipalNodeKind is the principal node kind for the axis Axis.

A judgment can contain symbols and patterns.

Symbols are purely syntactic and are used to write the judgment itself. In general, symbols in a judgment are chosen to reflect its meaning. For example, 'is a positive integer', '=>' and ':' are symbols, the second and third of which should be read "yields" and "has type" respectively.

Patterns are used to represent objects that can be constructed from a given grammar production. In patterns, italicized words correspond to non-terminals in the grammar. The name of those non-terminals is significant, and may be instantiated only to an "object" (a value, a type, an expression, etc.) that can be substituted legally for that non-terminal. For example, 'Expr' is a pattern that stands for every [XPath/XQuery] expression, 'Expr₁ + Expr₂' is a pattern that stands for every addition expression, and 'element a { Value }' is a pattern that stands for every value in the [XPath/XQuery] data model that is an 'a' element.

Non-terminals in a pattern may appear with subscripts (e.g. Expr₁, Expr₂) to distinguish different instances of the same sort of pattern. In some cases, non-terminals in a pattern may have a name that is not exactly the name of that non-terminal, but is based on it. For instance, BaseTypeName is a pattern that stands for a type name, as are TypeName and TypeName₂. This usage is limited, and only occurs to improve the readability of some of the inference rules.

When instantiating the judgment, each pattern must be instantiated to an appropriate sort of "object" (value, type, expression, etc). For example, '3 => 3' and '$x+0 => 3' are both instances of the judgment 'Expr => Value'. Note that in the first judgment, '3' corresponds to both the expression '3' (on the left-hand side of the => symbol) and to the value '3' (on the right-hand side of the => symbol).

In some cases, inference rules may need to use the fact that a certain judgment does not hold. not(Judgment) holds iff Judgment does not hold.

In some cases, a pattern may be instantiated to a value within a finite set of pre-determined values. We may write that set of possible values using the in judgment. For instance, the judgment

Color in { blue, green }

holds if the pattern Color has either the value blue or the value green.
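As an informal illustration of the notions above, a judgment can be thought of as a predicate that either holds or does not hold for given objects. The following Python sketch is not part of the Formal Semantics; the function names (is_positive_integer, holds_not, color_in) are hypothetical and chosen only to mirror the example judgments in this section.

```python
# Illustrative sketch only: each judgment is modelled as a function
# that returns True when the judgment holds and False otherwise.

def is_positive_integer(obj):
    """'Object is a positive integer' holds only for integers greater than zero."""
    return isinstance(obj, int) and not isinstance(obj, bool) and obj > 0


def holds_not(judgment, *objects):
    """not(Judgment) holds if and only if Judgment does not hold."""
    return not judgment(*objects)


def color_in(color, allowed=frozenset({"blue", "green"})):
    """'Color in { blue, green }' holds when color is one of the listed values."""
    return color in allowed
```

Under this reading, is_positive_integer(1) corresponds to the judgment '1 is a positive integer', which holds, and holds_not(is_positive_integer, -1) corresponds to not('-1 is a positive integer'), which also holds.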

2.1.3 Notations for environments

An environment component is a dictionary that maps a symbol (e.g., a function name or a variable name) to an "object" (e.g., a function body, a type, a value). One can access information in an environment component or update it.

If "envComp" is an environment component, then "envComp(symbol)" denotes the "object" to which symbol is mapped. The notation is intentionally similar to function application, because an environment component can be considered a function from the argument symbol to the "object" to which the symbol is mapped.

This document uses environments that group related environment components. If "env" is an environment containing the environment component "envComp", that environment component is denoted "env.envComp". The value that symbol is mapped to in that environment component is denoted "env.envComp(symbol)".

The two main environments used in the Formal Semantics are a dynamic environment (dynEnv), which models the [XPath/XQuery] dynamic context, and a static environment (statEnv), which models the [XPath/XQuery] static context. Both are defined in [3.1 Expression Context].

For example, dynEnv.varValue denotes the dynamic environment component that maps variables to values and dynEnv.varValue(Variable) denotes the value of the variable Variable in the dynamic context.

Environments are used in a judgment to capture some of the context in which the judgment is computed, and most judgments are computed assuming that some environment is given. This assumption is denoted by prefixing the judgment with "env |-". The "|-" symbol is called a "turnstile" and is used in almost all inference rules.

For instance, the judgment

dynEnv |- Expr => Value

is read as: Assuming the dynamic environment dynEnv, the expression Expr yields the value Value.
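The lookup notation and the turnstile can be made concrete with a small sketch. The Python below is an informal model, not the specification's definition: it assumes an environment is a dictionary of components, each component a dictionary from symbols to objects, and it uses a hypothetical toy expression form (an integer literal, a variable name such as "$x", or a tuple ('+', e1, e2)).

```python
# Illustrative sketch only. The environment layout and the toy expression
# encoding are assumptions made for this example.
dyn_env = {
    "varValue": {"$x": 3},  # component mapping variables to values
}


def lookup(env, comp, symbol):
    """env.envComp(symbol): the object that symbol is mapped to."""
    return env[comp][symbol]


def evaluate(env, expr):
    """Evaluate a toy expression: int literal, variable name, or ('+', e1, e2)."""
    if isinstance(expr, int):
        return expr
    if isinstance(expr, str):
        return lookup(env, "varValue", expr)
    _, left, right = expr
    return evaluate(env, left) + evaluate(env, right)


def yields(env, expr, value):
    """'dynEnv |- Expr => Value' holds if expr evaluates to value in env."""
    return evaluate(env, expr) == value
```

With this model, yields(dyn_env, ("+", "$x", 0), 3) plays the role of the judgment 'dynEnv |- $x+0 => 3', which holds only because dynEnv.varValue maps $x to 3.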

Environments can be updated, using the following notation:

"env + envComp(symbol => object)" denotes the new environment that is identical to env except that the environment component envComp has been updated to map symbol to object. The notation symbol => object indicates that symbol is mapped to object in the new environment.
If the environment component contains only a constant value (e.g., the ordering mode, which can only be either ordered or unordered), the following notation is used to set its value: "env + envComp(object)".
The following shorthand is also allowed: "env + envComp(symbol₁ => object₁ ; ... ; symbol_n => object_n)", in which each symbol is mapped to the corresponding object in the new environment.

This notation is equivalent to nested updates, as in "(env + envComp(symbol₁ => object₁) + ... ) + envComp(symbol_n => object_n)".

Updating an environment creates a copy of the original environment and overrides any previous binding that might exist for the same name and the same component in that environment. Updating the environment is used to capture the scope of a symbol (e.g., for variables, namespace prefixes, etc). For instance, in the following expression

  let $x := 1 return
  let $x := $x + 2 return
  $x - 3

each let expression changes the dynamic context by binding a new variable to a new value. Each different context is represented by a different environment. The original environment, in which the expression 1 is evaluated, does not contain any binding for variable $x. This environment is updated a first time with a binding of variable $x to the value 1, and this environment is used for the evaluation of the expression $x + 2. Then it is updated a second time with a binding of variable $x to the value 3, and this environment is used for the evaluation of the expression $x - 3.

Also, note that there are no operations to remove entries from environments. This is never necessary as updating an environment effectively creates a new extended copy of the original environment, leaving the original environment accessible wherever it is in scope along with the updated copy.
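The copy-on-update behaviour described above can be sketched as follows. This Python fragment is an informal illustration under stated assumptions: environments are dictionaries of components, expressions are encoded as hypothetical tagged tuples ("lit", "var", "+", "-", "let"), and the helper names update and evaluate are not taken from the specification.

```python
# Illustrative sketch only: environments are never mutated; an update
# returns a new environment and leaves the original one intact.

def update(env, comp, bindings):
    """env + envComp(symbol1 => object1 ; ... ; symboln => objectn)."""
    new_env = dict(env)
    new_env[comp] = {**env.get(comp, {}), **bindings}  # later bindings override
    return new_env


def evaluate(env, expr):
    """Evaluate a toy expression encoded as a tagged tuple."""
    kind = expr[0]
    if kind == "lit":
        return expr[1]
    if kind == "var":
        return env["varValue"][expr[1]]
    if kind in ("+", "-"):
        left, right = evaluate(env, expr[1]), evaluate(env, expr[2])
        return left + right if kind == "+" else left - right
    if kind == "let":
        _, name, bound, body = expr
        value = evaluate(env, bound)  # uses the environment before the update
        return evaluate(update(env, "varValue", {name: value}), body)
    raise ValueError(f"unknown expression kind: {kind!r}")


# let $x := 1 return let $x := $x + 2 return $x - 3
env0 = {"varValue": {}}
query = ("let", "x", ("lit", 1),
         ("let", "x", ("+", ("var", "x"), ("lit", 2)),
          ("-", ("var", "x"), ("lit", 3))))
result = evaluate(env0, query)
```

Each "let" evaluates its bound expression in the environment it received and evaluates its body in an updated copy, so env0 still has no binding for x after the evaluation. No removal operation is needed for the same reason.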

2.1.4 Notations for inference rules

Inference rules are used to specify how to infer whether a given judgment holds or not. Inference rules express the logical relation between judgments and describe how complex judgments can be concluded from simpler premise judgments.

A logical inference rule is written as a collection of premises and a conclusion, written respectively above and below a dividing line, as follows:

premise₁ ... premise_n
----------------------
conclusion

All premises and the conclusion are judgments. From a logical point of view, an inference rule is a deduction that if the premises hold, then the conclusion holds as well. In that sense, the previous inference rule has a meaning similar to that of the following logical statement.

IF premise₁

AND ...

AND premise_n

THEN conclusion

Here is a simple example of an inference rule, which uses specific instances of the example judgment 'Expr => Value' from above:

$x => 0    3 => 3
-----------------
$x + 3 => 3

This inference rule expresses the following property: if the variable expression '$x' yields the value '0', and the literal expression '3' yields the value '3', then the expression '$x + 3' yields the value '3'.

An inference rule may have no premises above the line, which means that the judgment below the line always holds. For instance:

------
3 => 3

This inference rule expresses the following property: evaluating the literal expression '3' always yields the value '3'.

The two rules above are expressed in terms of specific expressions and values, but usually rules are more abstract. That is, the judgments are not fully instantiated. Here is a rule that says that for any variable reference VarRef that yields the integer value Integer, adding '0' yields the same integer value:

VarRef => Integer
---------------------
VarRef + 0 => Integer

Each occurrence of a given pattern in a particular inference rule must be instantiated to the same "object" within the entire rule. This means that one can talk about "the value of VarRef" instead of the value bound to the first (second, etc.) occurrence of VarRef.
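The requirement that every occurrence of a pattern be instantiated to the same object can be illustrated with a small matcher. The Python below is a hypothetical sketch, not a mechanism defined by this document: judgments are encoded as tuples, and any string starting with an upper-case letter (such as "VarRef" or "Integer") is treated as a pattern. The sketch does not check that an object is of the right sort for its non-terminal.

```python
# Illustrative sketch only: checks whether concrete judgments form an
# instance of an abstract rule, binding each pattern consistently.

def match(pattern, obj, bindings):
    """Return extended bindings if obj instantiates pattern, else None."""
    if isinstance(pattern, str) and pattern[:1].isupper():  # a pattern name
        if pattern in bindings:
            return bindings if bindings[pattern] == obj else None
        return {**bindings, pattern: obj}
    if isinstance(pattern, tuple):
        if not isinstance(obj, tuple) or len(obj) != len(pattern):
            return None
        for sub_pattern, sub_obj in zip(pattern, obj):
            bindings = match(sub_pattern, sub_obj, bindings)
            if bindings is None:
                return None
        return bindings
    return bindings if pattern == obj else None  # a symbol or literal


# VarRef => Integer  /  VarRef + 0 => Integer
RULE = {
    "premises": [("=>", "VarRef", "Integer")],
    "conclusion": ("=>", ("+", "VarRef", 0), "Integer"),
}


def is_instance(rule, premises, conclusion):
    """True if the given judgments instantiate the rule consistently."""
    if len(premises) != len(rule["premises"]):
        return False
    bindings = {}
    pairs = list(zip(rule["premises"], premises))
    pairs.append((rule["conclusion"], conclusion))
    for pattern, judgment in pairs:
        bindings = match(pattern, judgment, bindings)
        if bindings is None:
            return False
    return True
```

For example, the premise '$x => 3' together with the conclusion '$x + 0 => 3' is an instance of the rule, whereas the same premise with the conclusion '$y + 0 => 3' is not, because VarRef would have to stand for two different objects.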

Here is an example of a rule occurring later in this document.

statEnv |- Expr₁ : Type₁    statEnv |- Expr₂ : Type₂
----------------------------------------------------
statEnv |- Expr₁ , Expr₂ : Type₁, Type₂

This rule is read as follows: if two expressions Expr₁ and Expr₂ are known to have the static types Type₁ and Type₂ (the two premises above the line), then it is the case that the sequence expression "Expr₁ , Expr₂" has the static type "Type₁, Type₂", which is the sequence of types Type₁ and Type₂. Note that this inference rule does not modify the static environment.

The following rule defines the static semantics of a "let" expression. The binding of the new variable is captured by an update to the varType component of the original static environment.

statEnv |- VarName of var expands to expanded-QName
statEnv |- Expr₁ : Type₁
statEnv + varType(expanded-QName => Type₁) |- Expr₂ : Type₂
-----------------------------------------------------------
statEnv |- let $VarName := Expr₁ return Expr₂ : Type₂

This rule is read as follows: First, because the variable name is a QName, it is expanded into an expanded QName. Second, the type Type₁ for the "let" input expression Expr₁ is computed. Then the "let" variable, with expanded name expanded-QName and type Type₁, is added to the varType component of the static environment statEnv. Finally, the type Type₂ of Expr₂ is computed in that new environment.

2.1.5 Putting it together

In isolation, each inference rule describes a fragment of the semantics for a given judgment. Put together, inference rules describe possible inferences that can be used to decide whether a particular judgment holds.

For a given judgment, and a set of inference rules, if that judgment can be inferred to be true, the inference succeeds. In most cases, the inference will proceed by proving intermediate judgments, following the consequences from one judgment to the next by applying successive inference rules.

Such inference is a mechanism which can be used to describe both static type analysis and dynamic evaluation. More specifically, performing static typing consists in proving that the following judgment holds for a given expression Expr.

statEnv |- Expr : Type

If the judgment holds for a given type Type, this type is a possible static type for the expression. If there exists no type for which this judgment holds, then static typing fails and a static type error is returned to the user.

Consider the following expression.

  fn:count((1,2,3))

Using the static typing rules given for expressions in the rest of this document, one can deduce that the expression is of type xs:integer through the following inference.

  statEnv |- 1 : xs:integer  (from typing of literals)
  statEnv |- 2 : xs:integer  (from typing of literals)
  --------------------------------------------------- (sequence)
    statEnv |- 1,2 : xs:integer, xs:integer
    statEnv |- 3 : xs:integer
    ----------------------------------------------------- (sequence)
    statEnv |- 1,2,3 : xs:integer, xs:integer, xs:integer

    declare function fn:count($x as item()*) as xs:integer
    statEnv |- xs:integer,xs:integer,xs:integer <: item()*
    ---------------------------------------------------------- (function call)
    statEnv |- fn:count((1,2,3)) : xs:integer

Conversely, consider the following expression.

  fn:nilled((1,2,3))

Using the static typing rules given for expressions in the rest of this document, one can apply inference rules up to the following point.

    ....
    ----------------------------------------------------- (sequence)
    statEnv |- 1,2,3 : xs:integer, xs:integer, xs:integer

However, no rule can infer the type of fn:nilled((1,2,3)), because the static typing rules for function calls hold only if the type of each function argument is a subtype of the expected parameter type. Here, (xs:integer,xs:integer,xs:integer) is not a node type, which is the expected type for the function fn:nilled.
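The contrast between fn:count and fn:nilled can be sketched with a deliberately simplified function-call rule. The Python below is an illustration only: the type hierarchy is reduced to three item types, a sequence type is modelled as an item type plus an occurrence indicator, and the parameter type node()? for fn:nilled is an assumption made for this example (the text above only states that a node type is expected). The subtyping relation defined later in this document is considerably richer.

```python
# Illustrative sketch only: a toy subtype check and function-call rule.
ITEM_SUPERTYPES = {
    "xs:integer": {"xs:integer", "item()"},  # simplified hierarchy
    "node()": {"node()", "item()"},
    "item()": {"item()"},
}
OCCURRENCE_BOUNDS = {"": (1, 1), "?": (0, 1), "*": (0, None), "+": (1, None)}

# name -> ((parameter item type, occurrence indicator), return type)
SIGNATURES = {
    "fn:count": (("item()", "*"), "xs:integer"),
    "fn:nilled": (("node()", "?"), "xs:boolean?"),  # assumed signature
}


def is_subtype(arg_type, param):
    """arg_type is a tuple of item types; param is (item type, occurrence)."""
    item, occurrence = param
    low, high = OCCURRENCE_BOUNDS[occurrence]
    if len(arg_type) < low or (high is not None and len(arg_type) > high):
        return False
    return all(item in ITEM_SUPERTYPES[t] for t in arg_type)


def type_call(name, arg_type):
    """Return the call's static type, or raise TypeError if no rule applies."""
    param, result = SIGNATURES[name]
    if not is_subtype(arg_type, param):
        raise TypeError(f"{name}: argument type {arg_type} is not a subtype "
                        f"of {param[0]}{param[1]}")
    return result


three_integers = ("xs:integer", "xs:integer", "xs:integer")
count_type = type_call("fn:count", three_integers)
```

In this sketch, type_call("fn:nilled", three_integers) raises TypeError, which corresponds to the situation described above in which no inference rule applies and static typing fails.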

Note that in some cases, the inference can only proceed through the appropriate changes to the environment. For instance, consider the following expression.

  let $x := 1 return ($x,$x)

Using the static typing rules given for expressions in the rest of this document, one can deduce that the expression is of type (xs:integer,xs:integer) through the following inference.

statEnv0.varType = ()

  -------------------------- (literal)
  statEnv0 |- 1 : xs:integer

statEnv1 = statEnv0 + varType($x => xs:integer)

     statEnv1.varType($x) = xs:integer
     --------------------------------- (variable reference)
     statEnv1 |- $x : xs:integer

     statEnv1.varType($x) = xs:integer
     --------------------------------- (variable reference)
     statEnv1 |- $x : xs:integer

     ------------------------------------------- (sequence)
     statEnv1 |- ($x,$x) : xs:integer,xs:integer

  -------------------------------------------------------------- (let)
  statEnv0 |- let $x := 1 return ($x,$x) : xs:integer,xs:integer

This example illustrates how each rule is applied to individual sub-expressions, and how the environment is used to maintain the relevant context information.
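The derivation above can be replayed mechanically. The following Python sketch is an informal illustration, assuming a hypothetical tuple encoding of expressions and a static environment represented as a dictionary with a varType component. It implements only the four rules used in the example (literal, variable reference, sequence, let), treats every literal as an integer, and records the name of each rule as its conclusion is reached.

```python
# Illustrative sketch only: a toy static typing function that also
# records which rules were applied, in the order they conclude.

def static_type(stat_env, expr, trace):
    kind = expr[0]
    if kind == "lit":  # integer literals only, by assumption
        result, rule = ("xs:integer",), "literal"
    elif kind == "var":
        # a missing binding raises KeyError: no rule applies
        result, rule = stat_env["varType"][expr[1]], "variable reference"
    elif kind == "seq":
        left = static_type(stat_env, expr[1], trace)
        right = static_type(stat_env, expr[2], trace)
        result, rule = left + right, "sequence"
    elif kind == "let":
        _, name, bound, body = expr
        bound_type = static_type(stat_env, bound, trace)
        # statEnv + varType(name => bound_type), built as a new environment
        extended = {**stat_env,
                    "varType": {**stat_env["varType"], name: bound_type}}
        result, rule = static_type(extended, body, trace), "let"
    else:
        raise TypeError(f"no typing rule applies to {kind!r}")
    trace.append(rule)
    return result


# let $x := 1 return ($x,$x)
stat_env0 = {"varType": {}}
query = ("let", "$x", ("lit", 1), ("seq", ("var", "$x"), ("var", "$x")))
trace = []
inferred = static_type(stat_env0, query, trace)
```

The recorded trace follows the same order as the derivation: the literal rule, two variable references, the sequence rule, and finally the let rule. The original environment stat_env0 is left without any binding for $x.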

2.2 URIs, Namespaces, and Prefixes

The Formal Semantics does not formally specify the adjustment of relative URIs according to a base URI. All URIs used in this document are assumed to be absolute URIs.

The Formal Semantics uses the following namespace prefixes.

fn: for functions and operators from the [Functions and Operators] document.
xs: for XML Schema components and built-in types.
xdt: for [XPath/XQuery] built-in types.

All these prefixes are assumed to be bound to the appropriate URIs.

In addition, the Formal Semantics uses the following special prefixes for specification purposes.

dm: for accessors of the [Data Model].
op: for operators in [Functions and Operators].
fs: for functions and types defined in the formal semantics.

These prefixes are always italicized to emphasize that the corresponding functions, variables, and types are abstract: they are not and cannot be made accessible in [XPath/XQuery]. None of these special prefixes are given an explicit URI, but they behave as if they had one for the purposes of namespace resolution.

2.3 XML Values

The [XPath/XQuery] language is defined over values of the [XPath/XQuery] data model. The [XPath/XQuery] data model is defined normatively in [Data Model]. We define the formal notation that is used in this document to describe and manipulate values in inference rules. Formal values are used for specification purposes only and are not exposed to the [XPath/XQuery] user.

This section gives the grammar for formal values, along with a summary of the corresponding data model properties. In the context of this document, all constraints on values that are specified in [Data Model] are assumed to hold.

2.3.1 Formal values

A value is a sequence of zero or more items. An item is either an atomic value or a node.

An atomic value is a value in the value space of an atomic type, labeled with the name of that atomic type. An atomic type is either a primitive or derived atomic type according to XML Schema [Schema Part 2], xdt:untypedAtomic, or xdt:anyAtomicType.

A node is either an element, an attribute, a document, a text, a comment, or a processing-instruction node.

Element nodes have a type annotation^XQ and contain a complex value or a simple value. Attribute nodes have a type annotation^XQ and contain a simple value. Text nodes always contain one string value of type xdt:untypedAtomic, therefore the corresponding type annotation is omitted in the formal notation of a text node. Document nodes do not have a type annotation and contain a sequence of element, text, comment, or processing-instruction nodes.

A simple value is a sequence of atomic values.

A complex value is a sequence of attribute nodes followed by a sequence of element, text, comment, or processing-instruction nodes.

A type annotation^XQ can be either the QName of a declared type or an anonymous type. An anonymous type corresponds to an XML Schema type for which the schema writer did not provide a name. Anonymous type names are not visible to the user, but are generated during schema validation and used to annotate nodes in the data model. By convention, anonymous type names are written using the fs: Formal Semantics prefix: fs:anon₀, fs:anon₁, etc.

Formal values are defined by the following grammar.

Values

[7 (Formal)] Value ::= Item | (Value "," Value) | ("(" ")")
[21 (Formal)] Item ::= NodeValue | AtomicValue
[22 (Formal)] AtomicValue ::= AtomicValueContent TypeAnnotation?
[1 (Formal)] AtomicValueContent ::= String | Boolean | Decimal | Float | Double | Duration | DateTime | Time | Date | GYearMonth | GYear | GMonthDay | GDay | GMonth | HexBinary | Base64Binary | AnyURI | expanded-QName | NOTATION
[2 (Formal)] TypeAnnotation ::= "of" "type" TypeName
[9 (Formal)] ElementValue ::= "element" ElementName "nilled"? TypeAnnotation? "{" Value "}" ("{" NamespaceBindings "}")?
[10 (Formal)] AttributeValue ::= "attribute" AttributeName TypeAnnotation? "{" SimpleValue "}"
[8 (Formal)] SimpleValue ::= AtomicValue | (SimpleValue "," SimpleValue) | ("(" ")")
[11 (Formal)] DocumentValue ::= "document" "{" Value "}"
[13 (Formal)] CommentValue ::= "comment" "{" String "}"
[14 (Formal)] ProcessingInstructionValue ::= "processing-instruction" NCName "{" String "}"
[12 (Formal)] TextValue ::= "text" "{" String "}"
[20 (Formal)] NodeValue ::= ElementValue | AttributeValue | DocumentValue | TextValue | CommentValue | ProcessingInstructionValue
[3 (Formal)] ElementName ::= QName
[6 (Formal)] AttributeName ::= QName
[23 (Formal)] TypeName ::= QName
[15 (Formal)] NamespaceBindings ::= NamespaceBinding ("," NamespaceBinding)*
[17 (Formal)] NamespaceBinding ::= "namespace" NCName "{" AnyURI "}"

Notation

In that grammar, "String" indicates the value space of xs:string, "Decimal" indicates the value space of xs:decimal, etc.

Elements (resp. attributes) without type annotations are assumed to have the type annotation xs:anyType (resp. xs:anySimpleType). Atomic values without type annotations are assumed to have a type annotation which is the base type for the corresponding value. For instance, "Hello, World!" is equivalent to "Hello, World!" of type xs:string.

Untyped elements (e.g., from well-formed documents) have the type annotation^XQ xdt:untyped, untyped attributes have the type annotation^XQ xdt:untypedAtomic, and untyped atomic values have the type annotation^XQ xdt:untypedAtomic.

An element has an optional "nilled" marker. This marker is present only if the element has been validated against an element type in the schema which is "nillable", and the element has no content and an attribute xsi:nil set to "true".

An element also has a sequence of namespace bindings, which are the set of in-scope namespaces for that element. Each namespace binding is a (prefix, URI) pair. Elements without namespace bindings are assumed to have an empty set of in-scope namespaces.

Note:

In [XPath], the in-scope namespaces of an element node are represented by a collection of namespace nodes arranged on a namespace axis, which is optional and deprecated in [XML Path Language (XPath) 2.0]. XQuery does not support the namespace axis and does not represent namespace bindings in the form of nodes.

2.3.2 Examples of values

A well-formed document

  <fact>The cat weighs <weight units="lbs">12</weight> pounds.</fact>

In the absence of a Schema, this document is represented as

  element fact of type xdt:untyped {
    text { "The cat weighs " },
    element weight of type xdt:untyped {
      attribute units of type xdt:untypedAtomic {
        "lbs" of type xdt:untypedAtomic
      },
      text { "12" }
    },
    text { " pounds." }
  }

A document before and after validation.

  <weight xsi:type="xs:integer">42</weight>

The formal model for values can represent values before and after validation. Before validation, this element is represented as:

  element weight of type xdt:untyped {
    attribute xsi:type of type xdt:untypedAtomic {
      "xs:integer" of type xdt:untypedAtomic
    },
    text { "42" }
  }

After validation, this element is represented as:

  element weight of type xs:integer {
    attribute xsi:type of type xs:QName {
      "xs:integer" of type xs:QName
    },
    42 of type xs:integer
  }

An element with a list type

  <sizes>1 2 3</sizes>

Before validation, this element is represented as:

  element sizes of type xdt:untyped {
    text { "1 2 3" }
  }

Assume the following Schema.

  <xs:element name="sizes" type="sizesType"/>
  <xs:simpleType name="sizesType">
    <xs:list itemType="sizeType"/>
  </xs:simpleType>
  <xs:simpleType name="sizeType">
    <xs:restriction base="xs:integer"/>
  </xs:simpleType>

After validation against this Schema, the element is represented as:

  element sizes of type sizesType {
    1 of type sizeType,
    2 of type sizeType,
    3 of type sizeType
  }

An element with an anonymous type

  <sizes>1 2 3</sizes>

Before validation, this element is represented as:

  element sizes of type xdt:untyped {
    text { "1 2 3" }
  }

Assume the following Schema.

  <xs:element name="sizes">
    <xs:simpleType>
      <xs:list itemType="xs:integer"/>
    </xs:simpleType>
  </xs:element>

After validation, this element is represented as:

  element sizes of type fs:anon1 {
    1 of type xs:integer,
    2 of type xs:integer,
    3 of type xs:integer
  }

where fs:anon₁ stands for the internal anonymous name generated by the system for the sizes element.

A nillable element with xsi:nil set to true

  <sizes xsi:nil="true"/>

Before validation, this element is represented as:

  element sizes of type xdt:untyped {
    attribute xsi:nil of type xdt:untypedAtomic { "true" of type xdt:untypedAtomic }
  }

Assume the following Schema.

  <xs:element name="sizes" type="sizesType" nillable="true"/>

After validation against this Schema, the element is represented as:

  element sizes nilled of type sizesType {
    attribute xsi:nil of type xs:boolean { true of type xs:boolean }
  }

An element with a union type

  <sizes>1 two 3 four</sizes>

Before validation, this element is represented as:

  element sizes of type xdt:untyped {
    text { "1 two 3 four" }
  }

Assume the following Schema:

  <xs:element name="sizes" type="sizesType"/>
  <xs:simpleType name="sizesType">
    <xs:list itemType="sizeType"/>
  </xs:simpleType>
  <xs:simpleType name="sizeType">
    <xs:union memberTypes="xs:integer xs:string"/>
  </xs:simpleType>

After validation against this Schema, the element is represented as:

  element sizes of type sizesType {
    1 of type xs:integer,
    "two" of type xs:string,
    3 of type xs:integer,
    "four" of type xs:string
  }
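As a rough illustration of the validated representation above, the following Python sketch annotates each whitespace-separated token of the sizes content with the first member type of sizeType that accepts it. The names annotate_size and validate_sizes are hypothetical, and the regular expression is only a simplified stand-in for the xs:integer lexical space; real schema validation involves considerably more than this.

```python
import re
from typing import List, Tuple, Union

# Simplified stand-in for the lexical space of xs:integer (an assumption of this sketch).
INTEGER_LEXICAL = re.compile(r"[+-]?[0-9]+")

def annotate_size(token: str) -> Tuple[Union[int, str], str]:
    """Try the member types of sizeType in declaration order: xs:integer, then xs:string."""
    if INTEGER_LEXICAL.fullmatch(token):
        return int(token), "xs:integer"
    return token, "xs:string"

def validate_sizes(text: str) -> List[Tuple[Union[int, str], str]]:
    """Split the list content on whitespace and annotate each item."""
    return [annotate_size(token) for token in text.split()]

annotated = validate_sizes("1 two 3 four")
for value, type_name in annotated:
    print(f"{value!r} of type {type_name}")
```

Each item receives the annotation of the member type that accepted it, not the name of the union type itself, which matches the representation shown above.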

2.4 The [XPath/XQuery] Type System

The [XPath/XQuery] type system is used in the specification of the dynamic and of the static semantics of [XPath/XQuery]. This section introduces formal notations for describing types.

2.4.1 XML Schema and the [XPath/XQuery] Type System

The [XPath/XQuery] type system is based on [Schema Part 1] and [Schema Part 2]. [Schema Part 1] and [Schema Part 2] specify normatively the type information available in [XPath/XQuery]. We define the formal notation that is used in this document to describe and manipulate types in inference rules. Formal types are used for specification purposes only and are not exposed to the [XPath/XQuery] user.

Representation of content models. For the purpose of static typing, the [XPath/XQuery] type system only describes minOccurs, maxOccurs, and minLength, maxLength on list types for the occurrences that correspond to the DTD operators +, *, and ?. Choices are represented using the DTD operator |. All groups are represented using the interleaving operator (&).

Representation of anonymous types. To clarify the semantics, the [XPath/XQuery] type system makes all anonymous types explicit.

Representation of XML Schema simple type facets and identity constraints. For simplicity, XML Schema simple type facets and identity constraints are not formally represented in the [XPath/XQuery] type system. However, an [XPath/XQuery] implementation supporting XML Schema import and validation must take simple type facets and identity constraints into account.

This document describes types in the [XPath/XQuery] type system, as well as the operations and properties over those types which are used to define the [XPath/XQuery] static typing feature. The two most important properties are whether a data instance matches a type, and whether a type is a subtype of another. Those properties are described in [8.3 Judgments for type matching]. This document does not describe all other possible properties over those types.

The mapping from XML Schema into the [XPath/XQuery] type system is given in [C Importing Schemas]. The rest of this section is organized as follows. [2.4.2 Item types] describes item types, [2.4.3 Content models] describes content models, and [2.4.4 Top level definitions] describes top-level type declarations.

2.4.2 Item types

An item type is either an atomic type, an element type, an attribute type, a document node type, a text node type, a comment node type, or a processing instruction type. We distinguish between document nodes, attribute nodes, and nodes that can occur in element content (elements, comments, processing instructions, and text nodes), as we need to refer to element content types later in the formal semantics.

Item Types

[25 (Formal)]	`FormalItemType`	::=	`AtomicTypeName \| NodeType`
[28 (Formal)]	`AtomicTypeName`	::=	`QName`
[26 (Formal)]	`NodeType`	::=	`DocumentType \| AttributeType \| ElementContentType`
[27 (Formal)]	`ElementContentType`	::=	`ElementType \| "comment" \| "processing-instruction" \| "text"`
[29 (Formal)]	`ElementType`	::=	`"element" ElementNameOrWildcard OptTypeSpecifier`
[4 (Formal)]	`ElementNameOrWildcard`	::=	`QName \| "*"`
[5 (Formal)]	`AttributeNameOrWildcard`	::=	`QName \| "*"`
[85 (Formal)]	`OptTypeSpecifier`	::=	`TypeSpecifier?`
[30 (Formal)]	`TypeSpecifier`	::=	`OptNillable TypeReference`
[31 (Formal)]	`AttributeType`	::=	`"attribute" AttributeNameOrWildcard OptTypeReference`
[83 (Formal)]	`OptNillable`	::=	`Nillable?`
[32 (Formal)]	`Nillable`	::=	`"nillable"`
[86 (Formal)]	`OptTypeReference`	::=	`TypeReference?`
[36 (Formal)]	`TypeReference`	::=	`"of" "type" TypeName`
[48 (Formal)]	`DocumentType`	::=	`"document" ("{" Type "}")?`

An element or attribute type has an optional name and an optional type reference. A name alone corresponds to a reference to a global element or attribute declaration. A name with a type reference corresponds to a local element or attribute declaration. The word "element" or "attribute" alone refers to the wildcard types for any element or any attribute. In addition, an element type has an optional nillable flag that indicates whether the element can be nilled or not.

A document type has an optional content type. If no content type is given, then the type is treated as being the wildcard type for documents, i.e., a sequence of text and element nodes. For consistency with element nodes, PIs and comments are not indicated in that wildcard type, but may occur in instances.

Note

Generic node types (e.g., node()), such as those used in the SequenceType production, are interpreted in the type system as a union of the corresponding node types (e.g., element, attribute, text, comment, and processing-instruction nodes) and therefore do not appear in the grammar. The semantics of sequence types is described in [3.5.4 SequenceType Matching].

Examples

The following is a text node type

  text

The following is a type for all elements

  element * of type xs:anyType

The following is a type for all elements of type string

  element * of type xs:string

The following is a type for a nillable element of type string and with name size

  element size nillable of type xs:string

The following is a reference to a global attribute declaration

  attribute sizes

The following is a type for elements with anonymous type fs:anon₁:

  element sizes of type fs:anon1

2.4.3 Content models

Following XML Schema, types in [XPath/XQuery] are composed from item types by optional, one or more, zero or more, all group, sequence, choice, empty sequence (written empty), or empty choice (written none).

The type empty matches the empty sequence. The type none matches no values. none is the identity for choice, that is (Type | none) = Type. The type none is the static type for [7.2.9 The fn:error function].

Types

The [XPath/XQuery] type system includes three binary operators on types: ",", "|" and "&", corresponding respectively to sequence, choice and all groups in Schema. The [XPath/XQuery] type system includes three unary operators on types: "*", "+", and "?", corresponding respectively to zero or more instances of the type, one or more instances of the type, or an optional instance of the type.

The "&" operator builds the "interleaved product" of two types. The type Type₁ & Type₂ matches any sequence that is an interleaving of two sequences of items, Value₁ and Value₂, with Value₁ matching Type₁ and Value₂ matching Type₂. The interleaving of two sequences of items Value₁ and Value₂ is any sequence Value₀ such that there is an ordered partition of Value₀ into the two sub-sequences Value₁ and Value₂. The interleaved product captures the semantics of all groups in XML Schema, but is more general as it applies to arbitrary types. All groups in XML Schema are restricted to apply only on global or local element declarations with minOccurs 0 or 1, and maxOccurs 1.

For example, consider the types Type₁ = xs:integer,xs:integer,xs:integer and Type₂ = xs:string,xs:string. Value₁ = (1,2,3) matches the type Type₁ and Value₂ = ("a","b") matches the type Type₂. Any of the following Value₀ are interleavings of Value₁ and Value₂, and therefore match the type (Type₁ & Type₂):

Value0 = (1,2,3,"a","b")
Value0 = (1,2,"a",3,"b")
Value0 = (1,2,"a","b",3)
Value0 = (1,"a",2,3,"b")
Value0 = (1,"a",2,"b",3)
Value0 = (1,"a","b",2,3)
Value0 = ("a",1,2,3,"b")
Value0 = ("a",1,2,"b",3)
Value0 = ("a",1,"b",2,3)
Value0 = ("a","b",1,2,3)
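The enumeration above can be reproduced mechanically. The following Python sketch is only an illustration of the definition of interleaving, not part of the formal semantics, and the function name interleavings is hypothetical. It generates every sequence that can be partitioned, in order, into the two given sub-sequences.

```python
from typing import Iterator, Sequence, Tuple

def interleavings(left: Sequence, right: Sequence) -> Iterator[Tuple]:
    """Yield every interleaving of left and right.

    The relative order of the items inside each input sequence is preserved.
    """
    if not left:
        yield tuple(right)
        return
    if not right:
        yield tuple(left)
        return
    # Either the next item is taken from the left sequence ...
    for rest in interleavings(left[1:], right):
        yield (left[0],) + rest
    # ... or it is taken from the right sequence.
    for rest in interleavings(left, right[1:]):
        yield (right[0],) + rest

value1 = (1, 2, 3)
value2 = ("a", "b")
results = list(interleavings(value1, value2))
for value0 in results:
    print(value0)
```

For sequences of lengths 3 and 2 with pairwise distinct items, this yields the ten interleavings listed above, one for each choice of the two positions occupied by the items of Value₂.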

Types precedence order. To improve readability when writing types, we assume the following precedence order between operators on types.

#	Operator
1	\| (choice)
2	& (interleaving)
3	, (sequence)
4	*, +, ? (occurrence)

Parentheses can be used to enforce precedence. For instance

  xs:string | xs:integer, xs:float*

is equivalent to

  xs:string | (xs:integer, (xs:float*))

and a different precedence can be obtained by writing

  ((xs:string | xs:integer), xs:float)*

Examples

A sequence of elements

The "," operator builds the "sequence" of two types. For example,

  element title of type xs:string, element year of type xs:integer

is a sequence of an element title of type string followed by an element year of type integer.

The union of two element types

The "|" operator builds the "union" of two types. For example,

  element editor of type xs:string | element bib:author

means either an element editor of type string, or a reference to the global element bib:author.

An all group of two elements

The "&" operator builds the "interleaved product" of two types. For example,

  (element a & element b) =
    element a, element b
  | element b, element a

which specifies that the a and b elements can occur in any order.

An empty type

The following type matches the empty sequence.

  empty

A sequence of zero or more elements

The following type matches zero or more elements each of which can be a surgeon or a plumber.

  (element surgeon | element plumber)*

2.4.4 Top level definitions

Top level definitions correspond to global element declarations, global attribute declarations and type definitions in XML Schema.

Type Definitions

[40 (Formal)]	`Definitions`	::=	`(Definition Definitions)?`
[39 (Formal)]	`Definition`	::=	`("define" "element" ElementName OptSubstitution OptNillable TypeReference) \| ("define" "attribute" AttributeName TypeReference) \| ("define" "type" TypeName TypeDerivation)`
[84 (Formal)]	`OptSubstitution`	::=	`Substitution?`
[41 (Formal)]	`Substitution`	::=	`"substitutes" "for" ElementName`
[33 (Formal)]	`TypeDerivation`	::=	`ComplexTypeDerivation \| AtomicTypeDerivation`
[34 (Formal)]	`ComplexTypeDerivation`	::=	`Derivation? OptMixed "{" Type? "}"`
[35 (Formal)]	`AtomicTypeDerivation`	::=	`"restricts" AtomicTypeName`
[37 (Formal)]	`Derivation`	::=	`("restricts" TypeName) \| ("extends" TypeName)`
[82 (Formal)]	`OptMixed`	::=	`Mixed?`
[38 (Formal)]	`Mixed`	::=	`"mixed"`

A type definition has a name (possibly anonymous) and a type derivation. In the case of a complex type, the derivation indicates whether it is derived by extension or restriction, its base type, and its content model, with an optional flag indicating if it has mixed content.

Example

For instance, the following complex type

 <complexType name="UKAddress">
   <complexContent>
     <extension base="ipo:Address">
       <sequence>
         <element name="postcode" type="ipo:UKPostcode"/>
       </sequence>
       <attribute name="exportCode" type="positiveInteger" fixed="1"/>
     </extension>
   </complexContent>
 </complexType>

is represented as follows

  define type UKAddress extends ipo:Address {
    attribute exportCode of type positiveInteger,
    element postcode of type ipo:UKPostcode
  };

Example

In the case of simple types derived by union or list, the derivation is always a restriction from the base type xs:anySimpleType, and has a content which is a union of the member types, or a repetition of the item type. For instance, the two following simple type declarations

<xsd:simpleType name="listOfMyIntType">
  <xsd:list itemType="myInteger"/>
</xsd:simpleType>

<xsd:simpleType name="zipUnion">
  <xsd:union memberTypes="USState FrenchRegion"/>
</xsd:simpleType>

are represented as follows

define type listOfMyIntType restricts xs:anySimpleType {
  myInteger*
}

define type zipUnion restricts xs:anySimpleType {
  USState | FrenchRegion
}
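The correspondence shown above is regular enough to sketch in code. The following Python fragment is a simplified illustration, assuming the list and union derivations are written with the itemType and memberTypes attributes (nested anonymous simpleType children are not handled). The function name formal_simple_type is hypothetical and the fragment is not the normative mapping, which is given in [C Importing Schemas].

```python
import xml.etree.ElementTree as ET

XSD = "http://www.w3.org/2001/XMLSchema"

def formal_simple_type(xml_text: str) -> str:
    """Render a list or union simpleType declaration in the formal type notation."""
    node = ET.fromstring(xml_text)
    name = node.get("name")
    list_node = node.find(f"{{{XSD}}}list")
    union_node = node.find(f"{{{XSD}}}union")
    if list_node is not None:
        # A list type is a repetition of its item type.
        content = list_node.get("itemType") + "*"
    elif union_node is not None:
        # A union type is a choice between its member types.
        content = " | ".join(union_node.get("memberTypes").split())
    else:
        raise ValueError("only list and union derivations are handled in this sketch")
    return f"define type {name} restricts xs:anySimpleType {{\n  {content}\n}}"

list_xml = (
    '<xsd:simpleType xmlns:xsd="http://www.w3.org/2001/XMLSchema" name="listOfMyIntType">'
    '<xsd:list itemType="myInteger"/></xsd:simpleType>'
)
union_xml = (
    '<xsd:simpleType xmlns:xsd="http://www.w3.org/2001/XMLSchema" name="zipUnion">'
    '<xsd:union memberTypes="USState FrenchRegion"/></xsd:simpleType>'
)
print(formal_simple_type(list_xml))
print(formal_simple_type(union_xml))
```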

Example

In the case of an atomic type, it just indicates its base type. For instance, the following type definition

<xsd:simpleType name="SKU">
 <xsd:restriction base="xsd:string">
  <xsd:pattern value="\d{3}-[A-Z]{2}"/>
 </xsd:restriction>
</xsd:simpleType>

is represented as follows

  define type SKU restricts xsd:string;

Example

When the type derivation is omitted, the type derives by restriction from xs:anyType. For instance:

  define type Bib { element book* } =
  define type Bib restricts xs:anyType { element book* }

Example

Empty content can be indicated with the explicit empty sequence, or omitted, as in:

  define type Bib { } =
  define type Bib { empty }

Global element and attribute declarations always have a name and a reference to a (possibly anonymous) type. A global element declaration also may declare a substitution group for the element and whether the element is nillable.

Example

A type declaration with one element name of type xs:string followed by zero or more elements street of type xs:string.

  define type Address {
    element name of type xs:string,
    element street of type xs:string*
  }

Example

A type declaration with complex content derived by extension

  define type USAddress extends Address {
    element zip of type xs:integer
  }

Example

A type declaration with mixed content

  define type Section mixed {
    (element h1 of type xs:string |
     element p of type xs:string |
     element div of type Section)*
  }

Example

A type declaration with simple content derived by restriction

  define type SKU restricts xs:string

Example

An element declaration

  define element address of type Address

Example

An element declaration with a substitution group

  define element usaddress substitutes for address of type USAddress

Example

An element declaration which is nillable

  define element zip nillable of type xs:integer

2.4.5 Example of a complete Schema

Here is a schema describing purchase orders from [XML Schema Part 0].

  <xsd:schema xmlns:xsd="http://www.w3.org/2001/XMLSchema">
  
   <xsd:annotation>
    <xsd:documentation xml:lang="en">
     Purchase order schema for Example.com.
     Copyright 2000 Example.com. All rights reserved.
    </xsd:documentation>
   </xsd:annotation>
  
   <xsd:element name="purchaseOrder" type="PurchaseOrderType"/>
  
   <xsd:element name="comment" type="xsd:string"/>
  
   <xsd:complexType name="PurchaseOrderType">
    <xsd:sequence>
     <xsd:element name="shipTo" type="USAddress"/>
     <xsd:element name="billTo" type="USAddress"/>
     <xsd:element ref="comment" minOccurs="0"/>
     <xsd:element name="items"  type="Items"/>
    </xsd:sequence>
    <xsd:attribute name="orderDate" type="xsd:date"/>
   </xsd:complexType>
  
   <xsd:complexType name="USAddress">
    <xsd:sequence>
     <xsd:element name="name"   type="xsd:string"/>
     <xsd:element name="street" type="xsd:string"/>
     <xsd:element name="city"   type="xsd:string"/>
     <xsd:element name="state"  type="xsd:string"/>
     <xsd:element name="zip"    type="xsd:decimal"/>
    </xsd:sequence>
    <xsd:attribute name="country" type="xsd:NMTOKEN" fixed="US"/>
   </xsd:complexType>
  
   <xsd:complexType name="Items">
    <xsd:sequence>
     <xsd:element name="item" minOccurs="0" maxOccurs="unbounded">
      <xsd:complexType>
        <xsd:sequence>
         <xsd:element name="productName" type="xsd:string"/>
         <xsd:element name="quantity">
          <xsd:simpleType>
           <xsd:restriction base="xsd:positiveInteger">
            <xsd:maxExclusive value="100"/>
           </xsd:restriction>
          </xsd:simpleType>
         </xsd:element>
         <xsd:element name="USPrice"  type="xsd:decimal"/>
         <xsd:element ref="comment"   minOccurs="0"/>
         <xsd:element name="shipDate" type="xsd:date" minOccurs="0"/>
        </xsd:sequence>
        <xsd:attribute name="partNum" type="SKU" use="required"/>
      </xsd:complexType>
     </xsd:element>
    </xsd:sequence>
   </xsd:complexType>
  
   <!-- Stock Keeping Unit, a code for identifying products -->
   <xsd:simpleType name="SKU">
    <xsd:restriction base="xsd:string">
     <xsd:pattern value="\d{3}-[A-Z]{2}"/>
    </xsd:restriction>
   </xsd:simpleType>
  
  </xsd:schema>

Here is the mapping of the above schema into the [XPath/XQuery] type system.

  declare namespace xsd = "http://www.w3.org/2001/XMLSchema";

  define element purchaseOrder of type PurchaseOrderType;
 
  define element comment of type xsd:string;
  
  define type PurchaseOrderType {
    attribute orderDate of type xsd:date?,
    element shipTo of type USAddress,
    element billTo of type USAddress,
    element comment?,
    element items of type Items
  };

  define type USAddress {
    attribute country of type xsd:NMTOKEN?,
    element name of type xsd:string,
    element street of type xsd:string,
    element city of type xsd:string,
    element state of type xsd:string,
    element zip of type xsd:decimal
  };

  define type Items {
    element item of type fs:anon1*
  };

  define type fs:anon1 {
    attribute partNum of type SKU,
    element productName of type xsd:string,
    element quantity of type fs:anon2,
    element USPrice of type xsd:decimal,
    element comment?,
    element shipDate of type xsd:date?
  };

  define type fs:anon2 restricts xsd:positiveInteger;

  define type SKU restricts xsd:string;

Note that the two anonymous types in the item element declarations are mapped to types with names fs:anon₁ and fs:anon₂.

The following additional definitions illustrate how more advanced XML Schema features (a complex type derived by extension, an anonymous simple type derived by restriction, and substitution groups) are represented in the [XPath/XQuery] type system.

  <complexType name="NYCAddress">
    <complexContent>
     <extension base="USAddress">
      <sequence>
       <element ref="apt"/>
      </sequence>
     </extension>
    </complexContent>
  </complexType>

  <element name="apt">
    <xsd:simpleType>
     <xsd:restriction base="xsd:positiveInteger">
      <xsd:maxExclusive value="10000"/>
     </xsd:restriction>
    </xsd:simpleType>
  </element>

  <element name="usaddress" substitutionGroup="address" type="USAddress"/>
  <element name="nycaddress" substitutionGroup="usaddress" type="NYCAddress"/>

The above definitions are mapped into the [XPath/XQuery] type system as follows:

  define type NYCAddress extends USAddress {
    element apt
  }

  define element apt of type fs:anon3

  define type fs:anon3 restricts xsd:positiveInteger

  define element usaddress  substitutes for address of type USAddress
  define element nycaddress substitutes for usaddress of type NYCAddress

2.5 Functions and operators

The [Functions and Operators] document defines built-in functions available in [XPath/XQuery]. A number of these functions are used to define the [XPath/XQuery] semantics; those functions are listed in [B.1 Functions and Operators used in the Formal Semantics].

Many functions in the [Functions and Operators] document are generic: they perform operations on arbitrary components of the data model, e.g., any kind of node, or any sequence of items. For instance, the fn:unordered function returns its input sequence in an implementation-dependent order. The signature of the fn:unordered function takes arbitrary items as input and output:

  fn:unordered($sourceSeq as item()*) as item()*

As defined, this signature provides little useful type information. For such functions, better type information can often be obtained by having the output type depend on the type of input parameters. For instance, if the function fn:unordered is applied on a sequence of a elements, the result is also a sequence of a elements.

In order to provide better static typing for those functions, specific typing rules are given in [7 Additional Semantics of Functions].

3 Basics

The organization of this section parallels the organization of Section 2 Basics^XQ.

3.1 Expression Context

Introduction

The expression context for a given expression consists of all the information that can affect the result of the expression. This information is organized into the static context and the dynamic context. This section specifies the environments that represent the context information used by [XPath/XQuery] expressions.

3.1.1 Static Context

Notation

We introduce the following auxiliary grammar production to describe function signatures.

[94 (Formal)]	`FunctionSig`	::=	`"declare" "function" QName "(" TypeList? ")" "as" SequenceType`
[95 (Formal)]	`TypeList`	::=	`SequenceType ("," SequenceType)*`

statEnv denotes the environment available during static analysis. Static analysis may extend parts of the static environment. The static environment is also available during dynamic evaluation.

If analysis of an expression relies on some component of the static context that has not been assigned a value, a static error is raised.

The following environment components are part of the static environment:

statEnv.xpath1.0_compatibility

The statEnv.xpath1.0_compatibility environment component corresponds to the XPath 1.0 compatibility flag in the [XPath/XQuery] static context. It specifies whether the semantic rules for backward compatibility with XPath 1.0 are in effect. This document defines the formal semantics for XPath 2.0 only when the XPath 1.0 backward compatibility rules are not in effect.

statEnv.namespace

The statEnv.namespace environment component corresponds to statically known namespaces in the [XPath/XQuery] static context.

The statEnv.namespace environment component maps a namespace prefix (NCName) onto a namespace kind and a namespace URI (URI), or (#UNDECLARED). The namespace kind is either passive or active. The namespace kind determines whether a namespace node is created for an element during element construction. The (#UNDECLARED) value may be used to indicate that the prefix has been undeclared, and may occur only if the implementation supports [XML Names 1.1].

statEnv.default_elem_namespace

The statEnv.default_elem_namespace environment component corresponds to the default element/type namespace in the [XPath/XQuery] static context.

The statEnv.default_elem_namespace environment component contains a namespace URI (a URI) or the null namespace (#NULL-NAMESPACE) and is used for any unprefixed QName appearing in a position where an element or type name is expected.

statEnv.default_function_namespace

The statEnv.default_function_namespace environment component corresponds to the default function namespace in the [XPath/XQuery] static context.

The statEnv.default_function_namespace environment component contains a namespace URI (a URI) or the null namespace (#NULL-NAMESPACE) and is used for any unprefixed QName appearing as the function name in a function call.

statEnv.typeDefn

The statEnv.typeDefn environment component corresponds to the in-scope schema types in the [XPath/XQuery] static context.

The statEnv.typeDefn environment component maps expanded type names (expanded TypeNames) onto their type definition (Definition). A type name may be globally declared or anonymous.

statEnv.elemDecl

The statEnv.elemDecl environment component corresponds to the in-scope element declarations in the [XPath/XQuery] static context.

The statEnv.elemDecl environment component maps expanded element names (expanded ElementNames) onto their declaration (Definition).

statEnv.attrDecl

The statEnv.attrDecl environment component corresponds to the in-scope attribute declarations in the [XPath/XQuery] static context.

The statEnv.attrDecl environment component maps expanded attribute names (expanded AttributeNames) onto their declaration (Definition).

statEnv.varType

The statEnv.varType environment component corresponds to the in-scope variables in the [XPath/XQuery] static context.

The statEnv.varType environment component maps expanded variable names (expanded Variables) to their static type (Type).

The context item static type in the [XPath/XQuery] static context is represented by the binding of the variable $fs:dot to its corresponding type in statEnv.varType.

statEnv.funcType

The statEnv.funcType environment component corresponds to the function signatures part of the [XPath/XQuery] static context.

The statEnv.funcType environment component stores the static type signatures of functions. Because [XPath/XQuery] allows multiple functions with the same name differing in the number of parameters, this environment component maps an expanded QName and an arity to a function signature FunctionSig.
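One way to picture the statEnv.funcType mapping is as a dictionary keyed by the pair of expanded QName and arity. The Python sketch below is illustrative only: the namespace URI, the function pad, and the helper lookup_signature are hypothetical names introduced for this example, and types are kept as plain strings.

```python
from typing import Dict, NamedTuple, Tuple

class ExpandedQName(NamedTuple):
    uri: str
    local: str

class FunctionSig(NamedTuple):
    parameter_types: Tuple[str, ...]
    return_type: str

# Hypothetical namespace and function, used only for illustration.
MY = "http://example.com/my-functions"

# The key combines the expanded name with the arity, so two functions
# with the same name but different parameter counts can coexist.
func_type: Dict[Tuple[ExpandedQName, int], FunctionSig] = {
    (ExpandedQName(MY, "pad"), 1): FunctionSig(("xs:string",), "xs:string"),
    (ExpandedQName(MY, "pad"), 2): FunctionSig(("xs:string", "xs:integer"), "xs:string"),
}

def lookup_signature(name: ExpandedQName, arity: int) -> FunctionSig:
    """Return the signature for the given name and arity, or raise KeyError."""
    return func_type[(name, arity)]

print(lookup_signature(ExpandedQName(MY, "pad"), 2))
```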

statEnv.collations

The statEnv.collations environment component corresponds to the statically known collations in the [XPath/XQuery] static context.

The statEnv.collations environment component maps a unique namespace URI (a URI) to a pair of functions: the first function takes a set of strings and returns a sequence containing those strings in sorted order; and the second function takes two strings, returns true if they are considered equal, and false if not.
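The pair-of-functions view of a collation described above can be sketched as follows. This is a minimal illustration, assuming two made-up collation URIs and two simple comparison policies (codepoint order and a case-insensitive variant); neither URI nor the Collation class comes from the specification.

```python
from typing import Callable, Dict, Iterable, List, NamedTuple

class Collation(NamedTuple):
    # Takes a collection of strings and returns them in sorted order.
    sort: Callable[[Iterable[str]], List[str]]
    # Takes two strings and reports whether they are considered equal.
    equal: Callable[[str, str], bool]

codepoint = Collation(
    sort=lambda strings: sorted(strings),
    equal=lambda a, b: a == b,
)

case_blind = Collation(
    sort=lambda strings: sorted(strings, key=str.casefold),
    equal=lambda a, b: a.casefold() == b.casefold(),
)

# Hypothetical collation URIs, used only for illustration.
collations: Dict[str, Collation] = {
    "http://example.com/collation/codepoint": codepoint,
    "http://example.com/collation/case-blind": case_blind,
}

words = ["B", "a", "c"]
print(collations["http://example.com/collation/codepoint"].sort(words))
print(collations["http://example.com/collation/case-blind"].sort(words))
```

The same representation serves for statEnv.defaultCollation, which holds a single such pair rather than a mapping from URIs.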

statEnv.defaultCollation

The statEnv.defaultCollation environment component corresponds to the default collation in the [XPath/XQuery] static context.

The statEnv.defaultCollation environment component is a pair of functions as described in statEnv.collations above.

statEnv.constructionMode

The statEnv.constructionMode environment component corresponds to the construction mode in the [XPath/XQuery] static context.

The statEnv.constructionMode environment component is one of preserve or strip.

statEnv.orderingMode

The statEnv.orderingMode environment component corresponds to the ordering mode in the [XPath/XQuery] static context.

The statEnv.orderingMode environment component is one of ordered or unordered.

statEnv.defaultEmptySequenceOrder

The statEnv.defaultEmptySequenceOrder environment component corresponds to the default order for empty sequences in the [XPath/XQuery] static context.

The statEnv.defaultEmptySequenceOrder environment component controls whether an empty sequence is interpreted as the greatest value or as the least value during processing of an order by clause in a FLWOR expression. Its value may be greatest or least.
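The effect of the greatest and least settings can be illustrated with a small Python sketch. It assumes an ascending sort over simple keys, with None standing in for the empty sequence; the function name order_keys is hypothetical and the sketch ignores everything else an order by clause does.

```python
from typing import List, Optional

def order_keys(keys: List[Optional[int]], empty_order: str) -> List[Optional[int]]:
    """Sort keys in ascending order; None stands in for the empty sequence."""
    if empty_order not in ("greatest", "least"):
        raise ValueError("empty_order must be 'greatest' or 'least'")

    def sort_key(key: Optional[int]):
        if key is None:
            # Empty keys sort after (greatest) or before (least) all others.
            return (1 if empty_order == "greatest" else -1, 0)
        return (0, key)

    return sorted(keys, key=sort_key)

print(order_keys([3, None, 1], "least"))
print(order_keys([3, None, 1], "greatest"))
```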

statEnv.boundarySpace

The statEnv.boundarySpace environment component corresponds to the boundary-space policy in the [XPath/XQuery] static context.

The statEnv.boundarySpace environment component controls the processing of boundary whitespace by element constructors. Its value may be preserve or strip.

statEnv.copyNamespacesMode

The statEnv.copyNamespacesMode environment component corresponds to the copy-namespaces mode in the [XPath/XQuery] static context.

The statEnv.copyNamespacesMode environment component controls the namespace bindings that are assigned when an existing element node is copied by an element constructor. Its value consists of two parts: preserve or no-preserve, and inherit or no-inherit.

statEnv.baseURI

The statEnv.baseURI environment component corresponds to the base URI in the [XPath/XQuery] static context.

The statEnv.baseURI environment component contains a unique namespace URI (a URI).

statEnv.docType

The statEnv.docType environment component corresponds to the statically known documents in the [XPath/XQuery] static context. It contains the static type for the input documents, and is used to provide the static type to the fn:doc function.

The statEnv.docType environment component contains bindings from input URIs (a URI) to types (a Type).

statEnv.collectionType

The statEnv.collectionType environment component corresponds to the statically known collections in the [XPath/XQuery] static context. It contains the static type for the input collections, and is used to provide the static type to the fn:collection function.

The statEnv.collectionType environment component contains bindings from input URIs (a URI) to types (a Type).

statEnv.defaultCollectionType

The statEnv.defaultCollectionType environment component corresponds to the statically known default collection type in the [XPath/XQuery] static context. It contains the static type for the default collection, and is used to provide the static type to the fn:collection function when called with no arguments.

The statEnv.defaultCollectionType environment component contains a type (a Type).

Note that the boundary-space behavior is not formally specified in this document.

Environments have an initial state when [expression/query] processing begins, containing, for example, the function signatures of all built-in functions. The initial values for the static context are defined in Section C Context Components^XQ and Section C Context Components^XP and are denoted by statEnvDefault in the Formal Semantics.

Here is an example that shows how the static environment is modified in response to a namespace definition.

statEnv + namespace(NCName => (passive, URI)) |- Expr : Type
------------------------------------------------------------
statEnv |- declare namespace NCName = URI; Expr : Type

This rule reads as follows: "the phrase on the bottom (a namespace declaration in the query prolog followed by a sequence of expressions) is well-typed (accepted by the static type inference rules) within an environment statEnv if the sequence of expressions above the line is well-typed in the environment obtained from statEnv by adding the namespace declaration".
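The environment extension written "statEnv + namespace(...)" can be pictured as a non-destructive update: the extended environment is a new value, and the original environment is left unchanged. The following Python sketch is illustrative only. The names StatEnv and with_namespace are hypothetical and are not part of the Formal Semantics, and only the namespace component is modelled.

```python
from dataclasses import dataclass, field, replace
from typing import Mapping, Tuple


@dataclass(frozen=True)
class StatEnv:
    # Maps a prefix to (kind, URI); the rule above uses the kind "passive".
    namespace: Mapping[str, Tuple[str, str]] = field(default_factory=dict)


def with_namespace(env: StatEnv, prefix: str, uri: str) -> StatEnv:
    """Model of statEnv + namespace(prefix => (passive, uri))."""
    extended = dict(env.namespace)       # copy, so env itself is not modified
    extended[prefix] = ("passive", uri)  # add or override the binding
    return replace(env, namespace=extended)


# The prolog declaration extends the environment used to type the body Expr.
outer = StatEnv()
inner = with_namespace(outer, "ex", "http://example.org/ns")
```

In this model, the expression following the declaration would be typed in inner, while outer remains the environment of the enclosing context.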

The helper function fs:active_ns(statEnv) returns all the active in-scope namespaces in the given static environment.

For each attribute and element node in Value, such that the node has name expanded-QName in the namespace URI, the helper function fs:get_static_ns_from_items(statEnv, Value) returns the in-scope namespace that corresponds to URI. This is a reverse-lookup on statEnv.namespace by URI.

3.1.1.1 Resolving QNames to Expanded QNames

A common use of the static environment is to expand a QName by looking up the URI that corresponds to the QName's namespace prefix in the statEnv.namespace environment component and by constructing an expanded-QName^DM, which contains the URI and the QName's local part. Element and type names may be in the null namespace, that is, there is no URI associated with their namespace prefix. The null namespace is denoted by the special value #NULL-NAMESPACE.

The auxiliary judgments below expand an element, type, attribute, variable, or function QName by looking up the namespace prefix in statEnv.namespace or, if the QName is unqualified, by using the appropriate default namespace.

Notation

The judgment

statEnv |- QName of elem/type expands to expanded-QName

holds when the element or type QName expands to the given expanded QName.

The judgment

statEnv |- QName of attr expands to expanded-QName

holds when the attribute QName expands to the given expanded QName.

The judgment

statEnv |- QName of var expands to expanded-QName

holds when the variable QName expands to the given expanded QName.

The judgment

statEnv |- QName of func expands to expanded-QName

holds when the function QName expands to the given expanded QName.

Semantics

Note that none of the inference rules can infer a resolved name in the case where a given namespace prefix is bound to the (#UNDECLARED) value. As a result, namespace resolution will fail if the implementation supports [XML Names 1.1] and a given namespace prefix has been undeclared.

An element or type QName consisting of a prefix NCName and a local part NCName expands to the URI (or the null namespace) corresponding to that prefix and the local part.

statEnv.namespace(NCName₁) = URI-or-#NULL-NAMESPACE
------------------------------------------------------------------------------------
statEnv |- NCName₁:NCName₂ of elem/type expands to (URI-or-#NULL-NAMESPACE, NCName₂)

An element or type QName consisting only of a local part NCName expands to the default element/type namespace and the local part.

statEnv.default_elem_namespace = URI-or-#NULL-NAMESPACE
---------------------------------------------------------------------------
statEnv |- NCName of elem/type expands to (URI-or-#NULL-NAMESPACE, NCName)

An attribute QName consisting of a prefix NCName and a local part NCName expands to the URI (or the null namespace) corresponding to the prefix and the local part.

statEnv.namespace(NCName₁) = URI-or-#NULL-NAMESPACE
-------------------------------------------------------------------------------
statEnv |- NCName₁:NCName₂ of attr expands to (URI-or-#NULL-NAMESPACE, NCName₂)

An attribute QName consisting only of a local part NCName expands to the null namespace and the local part.

statEnv |- NCName of attr expands to (#NULL-NAMESPACE, NCName)

A variable QName consisting of a prefix NCName and a local part NCName expands to the URI that corresponds to the prefix and the local part.

statEnv.namespace(NCName₁) = URI
-----------------------------------------------------------
statEnv |- NCName₁:NCName₂ of var expands to (URI, NCName₂)

A variable QName consisting only of a local part NCName expands to the null namespace and the local part.

statEnv |- NCName of var expands to (#NULL-NAMESPACE, NCName)

A function QName consisting of a prefix NCName and a local part NCName expands to the URI that corresponds to the prefix and the local part.

statEnv.namespace(NCName₁) = URI
------------------------------------------------------------
statEnv |- NCName₁:NCName₂ of func expands to (URI, NCName₂)

A function QName consisting only of a local part NCName expands to the default function namespace URI and the local part.

statEnv.default_function_namespace = URI
--------------------------------------------------
statEnv |- NCName of func expands to (URI, NCName)
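The expansion rules above can be read as a single case analysis on the kind of name and on whether a prefix is present. The following Python sketch is one possible reading, not a normative algorithm. The names StatEnv and expand_qname are hypothetical, statEnv.namespace is simplified to map a prefix directly to a URI, and the default function namespace URI shown is a placeholder.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

NULL_NAMESPACE = "#NULL-NAMESPACE"
UNDECLARED = "#UNDECLARED"


@dataclass
class StatEnv:
    # Simplification: prefix -> URI (or a special value such as #UNDECLARED).
    namespace: Dict[str, str] = field(default_factory=dict)
    default_elem_namespace: str = NULL_NAMESPACE
    default_function_namespace: str = "http://example.org/functions"  # placeholder


def expand_qname(env: StatEnv, qname: str, kind: str) -> Tuple[str, str]:
    """Expand a QName of the given kind to a pair (URI, local part)."""
    if kind not in ("elem/type", "attr", "var", "func"):
        raise ValueError(f"unknown kind of name: {kind!r}")
    prefix, colon, local = qname.partition(":")
    if colon:
        # Prefixed names: every kind looks the prefix up in statEnv.namespace.
        uri = env.namespace.get(prefix)
        if uri is None or uri == UNDECLARED:
            # No rule applies, so expansion fails.
            raise LookupError(f"no namespace bound to prefix {prefix!r}")
        return (uri, local)
    # Unprefixed names: the default depends on the kind of name.
    if kind == "elem/type":
        return (env.default_elem_namespace, qname)
    if kind == "func":
        return (env.default_function_namespace, qname)
    return (NULL_NAMESPACE, qname)  # attr and var
```

Note that unprefixed attribute and variable names never consult a default namespace, which is why those two rules have no premise.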

3.1.2 Dynamic Context

dynEnv denotes the environment available during dynamic evaluation. Dynamic evaluation may extend parts of the dynamic environment.

If evaluation of an expression relies on some component of the dynamic context that has not been assigned a value, a dynamic error is raised.

The following environment components are part of the dynamic environment:

dynEnv.varValue

The dynEnv.varValue environment component corresponds to the variable values, the context item, the context position and the context size in the [XPath/XQuery] evaluation context.

The dynamic value environment component maps an expanded variable name (expanded Variable) to the variable's value (Value) or to the value #IMPORTED(URI), if the variable is defined in the imported module with namespace URI.

dynEnv.funcDefn

The dynEnv.funcDefn environment component corresponds to the function implementations (or definition) part of the [XPath/XQuery] dynamic context.

The dynEnv.funcDefn environment component maps an expanded function name and parameter signature of the form "expanded-QName (Type₁, ..., Type_n)" to the remainder of the corresponding function definition, which is either the value #BUILT-IN for functions defined in [Functions and Operators]; the value #EXTERNAL for externally defined functions; the value #IMPORTED(URI), if the function is defined in the imported module with namespace URI; or, if the function is locally declared, the function's body and a list of variables, which are the function's formal parameters, of the form "(Expr, Variable₁,..., Variable_n)".

The initial function environment component (dynEnvDefault.funcDefn) maps the signatures of the internal functions defined in [B.2 Mapping of Overloaded Internal Functions] and the signatures of the functions defined in [Functions and Operators] to #BUILT-IN.
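As an informal illustration of the shape of dynEnv.funcDefn, the following Python sketch models the component as a mapping whose keys pair an expanded function name with a parameter signature. All function names, URIs, and the helper classify are hypothetical; function bodies are kept as plain text rather than as expressions.

```python
BUILT_IN = "#BUILT-IN"
EXTERNAL = "#EXTERNAL"


def imported(uri):
    """Model of the value #IMPORTED(URI)."""
    return ("#IMPORTED", uri)


# Key: (expanded-QName, parameter types), where expanded-QName is (URI, local part).
# Value: #BUILT-IN, #EXTERNAL, #IMPORTED(URI), or (Expr, Variable1, ..., Variablen).
func_defn = {
    (("http://example.org/builtin", "count"), ("item()*",)): BUILT_IN,
    (("http://example.org/ext", "now"), ()): EXTERNAL,
    (("http://example.org/lib", "area"), ("xs:decimal", "xs:decimal")):
        imported("http://example.org/lib"),
    (("http://example.org/local", "double"), ("xs:integer",)): ("$x + $x", "x"),
}


def classify(entry):
    """Say which of the four alternatives an entry of funcDefn is."""
    if entry == BUILT_IN:
        return "built-in"
    if entry == EXTERNAL:
        return "external"
    if isinstance(entry, tuple) and entry[0] == "#IMPORTED":
        return "imported"
    return "local"  # (body, formal parameter names...)
```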

dynEnv.dateTime

The dynEnv.dateTime environment component corresponds to the current dateTime in the [XPath/XQuery] dynamic context.

dynEnv.timezone

The dynEnv.timezone environment component corresponds to the implicit timezone in the [XPath/XQuery] dynamic context and is used by the timezone related functions in [Functions and Operators].

dynEnv.docValue

The dynEnv.docValue environment component corresponds to the available documents in the [XPath/XQuery] dynamic context. It contains the document nodes corresponding to input documents, and is used to provide the dynamic value of the fn:doc function.

The dynEnv.docValue environment component contains bindings from input URIs (a URI) to documents (a DocumentValue).

dynEnv.collectionValue

The dynEnv.collectionValue environment component corresponds to the available collections in the [XPath/XQuery] dynamic context. It contains the root nodes corresponding to the input collections, and is used to provide the dynamic value of the fn:collection function.

The dynEnv.collectionValue environment component contains bindings from input URIs (a URI) to a sequence of nodes.

dynEnv.defaultCollectionValue

The dynEnv.defaultCollectionValue environment component corresponds to the default collection in the [XPath/XQuery] dynamic context. It contains the sequence of nodes corresponding to the default collection, and is used to provide the dynamic value of the fn:collection function when called with no arguments.

The dynEnv.defaultCollectionValue environment component contains a sequence of nodes.

The initial values for the dynamic context are defined in Section C Context Components^XQ and Section C Context Components^XP. The corresponding initial dynamic environment is denoted by dynEnvDefault in the Formal Semantics.

The following Formal Semantics variables represent the context item, context position, and context size properties of the dynamic context:

Built-in Variable	Represents:
$fs:dot	context item
$fs:position	context position
$fs:last	context size

Within this document, variables with the "fs" prefix are reserved for use in the formal specification. Values of $fs:position and $fs:last can be obtained by invoking the fn:position and fn:last functions, respectively.
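As an informal illustration, the following Python sketch shows how the three variables might be bound when an expression is evaluated once for each item of a sequence. The function focus_bindings is hypothetical, and the sketch assumes the usual XPath convention that positions are counted from 1.

```python
def focus_bindings(sequence):
    """Yield the bindings of $fs:dot, $fs:position and $fs:last for each item."""
    last = len(sequence)  # context size: the same for every item
    for position, item in enumerate(sequence, start=1):
        yield {"fs:dot": item, "fs:position": position, "fs:last": last}


bindings = list(focus_bindings(["a", "b", "c"]))
```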

3.2 Processing Model

This section reviews the processing model for [XPath/XQuery]. The [XPath/XQuery] processing model is defined normatively in Section 2.2 Processing Model^XQ. This section also explains how the main notations (normalization rules, static type inference, and dynamic evaluation) relate to the phases in that processing model.

3.2.1 Processing model

The following figure depicts the [XPath/XQuery] processing model:

Figure 1: Processing Model Overview

This processing model is not intended to describe an actual implementation, although a naive implementation might be based upon it. It does not prescribe an implementation technique, but any implementation should produce the same results as obtained by following this processing model and applying the rest of the Formal Semantics specification.

Query processing consists of two phases: a static analysis phase and a dynamic evaluation phase. Static analysis is further divided into four sub-phases. Each phase consumes the result of the previous phase and generates output for the next phase. For each processing phase, we point to the relevant notations introduced later in the document.

[Definition: The static analysis phase depends on the expression itself and on the static context. The static analysis phase does not depend on input data (other than schemas).]

The purpose of the static analysis phase is to detect errors, e.g., syntax errors or type errors, at compile time rather than at run-time. If no error occurs, the result of static analysis could be some compiled form of [expression/query], suitable for execution by a compiled-[expression/query] processor. Static analysis consists of the following sub-phases:

Parsing. (Step SQ1 in Figure 1). The grammar for the [XPath/XQuery] syntax is defined in [XQuery 1.0: An XML Query Language]. Parsing may generate syntax errors. If no error occurs, an internal operation tree of the parsed query is created.
Static Context Processing. (Steps SQ2, SQ3, and SQ4 in Figure 1). The static semantics of [expression/query] depends on the input static context. The input static context needs to be generated before the [expression/query] can be analysed. In XQuery, the input static context may be defined by the processing environment and by declarations in the Query Prolog (See [5 Modules and Prologs]). In XPath, the input static context is defined by the processing environment. The static context is denoted by statEnv.
Normalization. (Step SQ5 in Figure 1). To simplify the semantics specification, some normalization is performed on the [expression/query]. The [XPath/XQuery] language provides many powerful features that make [expression/query]s simpler to write and use, but are also redundant. For instance, a complex for expression might be rewritten as a composition of several simple for expressions. The language composed of these simpler [expression/query] is called the [XPath/XQuery] Core language and is described by a grammar which is a subset of the XQuery grammar. The grammar of the [XPath/XQuery] Core language is given in [A Normalized core grammar].

During the normalization phase, each [XPath/XQuery] [expression/query] is mapped into its equivalent [expression/query] in the Core. (Note that this has nothing to do with Unicode Normalization, which works on character strings.) Normalization works by recursive application of the normalization rules over a given expression.

Specifically the normalization phase is defined in terms of the static part of the context (statEnv) and a [expression/query] (Expr) abstract syntax tree. Formal notations for the normalization phase are introduced in [3.2.2 Normalization judgment].

After normalization, the full semantics is obtained by giving a semantics to the normalized Core [expression/query]. This is done during the last two phases.
Static type analysis. (Step SQ6 in Figure 1). Static type analysis is optional. If this phase is not supported, then normalization is followed directly by dynamic evaluation.

Static type analysis checks whether each [expression/query] is type safe, and if so, determines its static type. Static type analysis is defined only for Core [expression/query]. Static type analysis works by recursive application of the type inference rules over a given expression.

If the [expression/query] is not type-safe, static type analysis yields a type error. For instance, a comparison between an integer value and a string value might be detected as a type error during the static type analysis. If static type analysis succeeds, it yields an abstract syntax tree where each sub-expression is associated with its static type.

More precisely, the static analysis phase is defined in terms of the static context (statEnv) and a Core [expression/query] (CoreExpr). Formal notations for the static analysis phase are introduced in [3.2.3 Static typing judgment].

Static typing does not imply that the content of XML documents must be rigidly fixed or even known in advance. The [XPath/XQuery] type system accommodates "flexible" types, such as elements that can contain any content. Schema-less documents are handled in [XPath/XQuery] by associating a standard type with the document, such that it may include any legal XML content.

If the static analysis phase succeeds, the dynamic evaluation phase (sometimes also called "execution") evaluates a query on input document(s).

Dynamic Context Processing. (Steps DQ2 and DQ3 in Figure 1). The dynamic semantics of [expression/query] depends on the dynamic input context. The dynamic input context needs to be generated before the [expression/query] can be evaluated. In XQuery, the dynamic input context may be defined by the processing environment and by declarations in the Query Prolog (See [5 Modules and Prologs]). In XPath, the dynamic input context is defined by the processing environment. The dynamic input context is denoted by dynEnv.
Dynamic Evaluation. (Steps DQ4 and DQ5 in Figure 1). This phase computes the value of an [expression/query]. The semantics of evaluation is defined only for Core [expression/query] terms. The formal description of evaluation works by recursive application of the evaluation rules over a given expression. (Note that in practice some implementations may prefer top-down evaluation strategies.) Evaluation may result in a value or a dynamic error, which may be a non-type error or a type error. If static typing of an expression does not raise a type error, then dynamic evaluation of the same expression will not raise a type error (and thus dynamic type checking can be avoided when static typing is enabled). Dynamic evaluation may still raise a non-type error.

The dynamic evaluation phase is defined in terms of the static context (statEnv) and evaluation context (dynEnv), and a Core [expression/query] (CoreExpr). Formal notations for the dynamic evaluation phase are introduced in [3.2.4 Dynamic evaluation judgment].

Static type analysis catches only certain classes of errors. For instance, it can detect a comparison operation applied between incompatible types (e.g., xs:int and xs:date). Some other classes of errors cannot be detected by the static analysis and are only detected at evaluation time. For instance, whether an arithmetic expression on 32 bit integers (xs:int) yields an out-of-bound value can only be detected at run-time by looking at the data.

While implementations are free to implement different processing models, the [XPath/XQuery] static semantics relies on the existence of a static type analysis phase that precedes any access to the input data.

The above processing phases are all internal to the [XPath/XQuery] processor. They do not deal with how the [XPath/XQuery] processor interacts with the outside world, notably how it accesses actual documents and types. A typical [expression/query] engine would support at least three other important processing phases:

Schema Import Processing. The [XPath/XQuery] type system is based on XML Schema. In order to perform dynamic or static typing, the [XPath/XQuery] processor needs to build type descriptions that correspond to the schema(s) of the input documents. This phase is achieved by mapping all schemas required by the [expression/query] into the [XPath/XQuery] type system. The XML Schema import phase is described in [C Importing Schemas].
Data Model Generation. Expressions are evaluated on values in the [Data Model]. XML documents must be loaded into the [Data Model] before the evaluation phase. This is described in the [Data Model] and is not discussed further here.
Serialization. Once the [expression/query] is evaluated, processors might want to serialize the result of the [expression/query] as actual XML documents. Serialization of data model instances is described in [Data Model Serialization] and is not discussed further here.

The parsing phase is not specified formally; the formal semantics does not define a formal model for the syntax trees, but uses the [XPath/XQuery] concrete syntax directly. More details about parsing for XQuery 1.0 can be found in the [XQuery 1.0: An XML Query Language] document and more details about parsing for XPath 2.0 can be found in the [XML Path Language (XPath) 2.0] document. No further discussion of parsing is included here.

3.2.2 Normalization judgment

Normalization is specified using mapping rules, which describe how a [XPath/XQuery] expression is rewritten into an expression in the [XPath/XQuery] Core. Mapping rules are also used in [C Importing Schemas] to specify how XML Schemas are imported into the [XPath/XQuery] type system.

Notation

Mapping rules are written using a square bracket notation, as follows:

[Object]_Subscript
==
Mapped Object

The original "object" is written above the == sign. The rewritten "object" is written beneath the == sign. The subscript is used to indicate what kind of "object" is mapped, and sometimes to pass some information between mapping rules.

Since normalization is always applied in the presence of a static context, the above rule is a shorthand for:

statEnv |- [Object] _Subscript == Mapped Object

The static environment is used in certain normalization rules (e.g. for normalization of function calls). To keep the notation simpler, the static environment is not written in the normalization rules, but it is assumed to be available.

The normalization rule that is used to map "top-level" expressions in the [XPath/XQuery] syntax into expressions in the [XPath/XQuery] Core is:

[Expr]_Expr
==
CoreExpr

which indicates that the expression Expr is normalized to the expression CoreExpr in the [XPath/XQuery] Core (with the implied statEnv).

Example

For instance, the following [expression/query]

    for $i in (1, 2),
        $j in (3, 4)
    return
      element pair { ($i,$j) }

is normalized to the Core expression

    for $i in (1, 2) return
      for $j in (3, 4) return
          element pair { ($i,$j) }

in which the "FLWOR" expression is mapped into a composition of two simpler "for" expressions.
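The rewriting in this example can be sketched as a small recursive function over a toy syntax tree. The Python below is illustrative only and covers just this one case: the For class and the normalize function are hypothetical, input sequences and bodies are kept as plain text, and none of the other normalization rules are modelled.

```python
from dataclasses import dataclass
from typing import Tuple, Union


@dataclass(frozen=True)
class For:
    # Each clause is (variable name, input sequence as source text).
    clauses: Tuple[Tuple[str, str], ...]
    body: Union["For", str]  # a nested For, or any other expression as text


def normalize(expr):
    """Rewrite a multi-clause "for" into nested single-clause "for" expressions."""
    if isinstance(expr, str):
        return expr  # nothing to rewrite in this toy model
    body = normalize(expr.body)
    # Wrap the body from the innermost clause outwards.
    for var, source in reversed(expr.clauses):
        body = For(((var, source),), body)
    return body


query = For((("i", "(1, 2)"), ("j", "(3, 4)")), "element pair { ($i,$j) }")
core = normalize(query)
```

Here core is a For over $i whose body is a For over $j, mirroring the Core expression shown above.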

3.2.3 Static typing judgment

The static semantics is specified using type inference rules, which relate [XPath/XQuery] expressions to types and specify under what conditions an expression is well typed.

Notation

The judgment

statEnv |- Expr : Type

holds when, in the static environment statEnv, the expression Expr has type Type.

Example

The result of static type inference is to associate a static type with every [expression/query], such that any evaluation of that [expression/query] is guaranteed to yield a value that belongs to that type.

For instance, the following expression

   let $v := 3 return $v+5

has type xs:integer. This can be inferred as follows: the input literals '3' and '5' have type integer, so the variable $v also has type integer. Since the sum of two integers is an integer, the complete expression has type integer.

Note

The type of an expression is computed by inference. Static type inference rules define for each kind of expression how to compute the type of the expression given the types of its sub-expressions. Here is a simple example:

statEnv |- Expr₁ : xs:boolean    statEnv |- Expr₂ : Type₂    statEnv |- Expr₃ : Type₃
--------------------------------------------------------------------------------------
statEnv |- if Expr₁ then Expr₂ else Expr₃ : ( Type₂ | Type₃ )

This rule states that if the conditional expression of an "if" expression has type boolean, then the type of the entire expression is one of the two types of its "then" and "else" clauses. Note that the resulting type is represented as a union: '(Type₂|Type₃)'.

The "left half" (the part before the :) of the expression below the line corresponds to some [expression/query], for which a type is computed. If the [expression/query] has been parsed into an internal abstract syntax tree, this usually corresponds to some node in that tree. The expression usually has patterns in it (here Expr₁, Expr₂, and Expr₃) that need to be matched against the children of the node in the abstract syntax tree. The expressions above the line indicate things that need to be computed to use this rule; in this case, the types of the condition expression and the two branches of the if-then-else expression. Once those types are computed (by further applying static inference rules recursively to the expressions on each side), then the type of the expression below the line can be computed. This example illustrates a general feature of the [XPath/XQuery] type system: the type of an expression depends only on the type of its sub-expressions. The overall static type inference algorithm is recursive, following the abstract syntax of the [expression/query]. At each point in the recursion, an appropriate matching inference rule is sought; if at any point there is no applicable rule, then static type inference has failed and the [expression/query] is not type correct.
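The recursive algorithm described above can be sketched for a toy language that contains only literals and conditionals. The Python below is a minimal illustration under stated assumptions, not the type system of this specification: the names infer and StaticTypeError are hypothetical, expressions are encoded as Python literals or as tuples of the form ("if", cond, then, else), and a union type is represented as a set of atomic type names.

```python
class StaticTypeError(Exception):
    """Raised when no inference rule applies."""


def infer(expr):
    """Return the static type of expr as a frozenset of type names (a union)."""
    # bool must be tested before int, because bool is a subclass of int in Python.
    if isinstance(expr, bool):
        return frozenset({"xs:boolean"})
    if isinstance(expr, int):
        return frozenset({"xs:integer"})
    if isinstance(expr, str):
        return frozenset({"xs:string"})
    if isinstance(expr, tuple) and len(expr) == 4 and expr[0] == "if":
        _, cond, then_branch, else_branch = expr
        # Premise 1: the condition must have type xs:boolean.
        if infer(cond) != frozenset({"xs:boolean"}):
            raise StaticTypeError("condition is not of type xs:boolean")
        # Premises 2 and 3, then the conclusion ( Type2 | Type3 ).
        return infer(then_branch) | infer(else_branch)
    raise StaticTypeError("no applicable inference rule")


result = infer(("if", True, 1, "one"))
```

The type of the conditional is computed only from the types of its three sub-expressions, which is the general feature noted above.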

3.2.4 Dynamic evaluation judgment

The dynamic, or operational, semantics is specified using value inference rules, which relate [XPath/XQuery] expressions to values, and in some cases specify the order in which an [XPath/XQuery] expression is evaluated.

Notation

The judgment

statEnv; dynEnv |- Expr => Value

holds when, in the static environment statEnv and dynamic environment dynEnv, the expression Expr yields the value Value.

The static environment is used in certain cases (e.g. for type matching) during evaluation. To keep the notation simpler, the static environment is not written in the dynamic inference rules, but it is assumed to be available.

Example

For instance, the following expression

   let $v := 3 return $v+5

yields the integer value 8. This can be inferred as follows: the input literals '3' and '5' denote the values 3 and 5, respectively, so the variable $v has the value 3. Since the sum of 3 and 5 is 8, the complete expression has the value 8.

Note

As with static type inference, logical inference rules are used to determine the value of each expression, given the dynamic environment and the values of its sub-expressions.

The inference rules used for dynamic evaluation, like those for static type inference, follow a bottom-up recursive structure, computing the value of expressions from the values of their sub-expressions.
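The bottom-up structure can be sketched with a small evaluator for the example above. The Python below is illustrative only: the tuple encoding of expressions, the function evaluate, and the class DynamicError are hypothetical, and the dynamic environment is reduced to a mapping from variable names to values.

```python
class DynamicError(Exception):
    """Raised when evaluation relies on a component that has no value."""


def evaluate(expr, dyn_env):
    """Compute the value of expr from the values of its sub-expressions."""
    kind = expr[0]
    if kind == "lit":                      # ("lit", value)
        return expr[1]
    if kind == "var":                      # ("var", name)
        if expr[1] not in dyn_env:
            raise DynamicError(f"variable ${expr[1]} has no value")
        return dyn_env[expr[1]]
    if kind == "add":                      # ("add", left, right)
        return evaluate(expr[1], dyn_env) + evaluate(expr[2], dyn_env)
    if kind == "let":                      # ("let", name, bound, body)
        _, name, bound, body = expr
        value = evaluate(bound, dyn_env)
        # Extend the environment for the body; dyn_env itself is unchanged.
        return evaluate(body, {**dyn_env, name: value})
    raise ValueError(f"unknown expression kind: {kind!r}")


# let $v := 3 return $v+5
example = ("let", "v", ("lit", 3), ("add", ("var", "v"), ("lit", 5)))
value = evaluate(example, {})
```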

3.3 Error Handling

Expressions can raise errors during static analysis or dynamic evaluation. The [Functions and Operators], [XQuery 1.0: An XML Query Language], and [XML Path Language (XPath) 2.0] documents specify the conditions under which an expression or operator raises an error. The user may raise an error explicitly by calling the fn:error function, which takes an optional item as an argument.

This document does not describe formally the conditions under which dynamic errors are raised. Notably, it does not specify the error codes or the rules about errors and optimization, as described in [XQuery 1.0: An XML Query Language]. Instead, this document describes the rules necessary to statically detect the subset of the [XPath/XQuery] dynamic errors known as type error^XQ.

3.4 Concepts

[XPath/XQuery] is most generally used to process documents. The representation of a document is normatively defined in [Data Model]. The functions used to access documents and collections are normatively defined in [Functions and Operators].

3.4.1 Document Order

Document order is defined in [Data Model].

3.4.2 Atomization

Atomization converts an item sequence into a sequence of atomic values and is implemented by the fn:data function. Atomization is applied to a value when the value is used in a context in which a sequence of atomic values is required.

3.4.3 Effective Boolean Value

If a sequence of items is encountered where a boolean value is expected, the item sequence's effective boolean value is used. The fn:boolean function returns the effective boolean value of an item sequence.

3.4.4 Input Sources

[XPath/XQuery] has a set of functions that provide access to input data. These functions are of particular importance because they provide a way in which an expression can reference a document or a collection of documents. The dynamic semantics of these input functions are described in more detail in [Functions and Operators].

3.4.5 URI Literals

In certain places in the XQuery grammar, a statically known valid absolute URI is required. These places are denoted by the grammatical symbol URILiteral, and are treated as described in [XQuery 1.0: An XML Query Language].

3.5 Types

3.5.1 Predefined Schema Types

All the built-in types of XML Schema are recognized by [XPath/XQuery]. In addition, [XPath/XQuery] recognizes the predefined types xdt:anyAtomicType, xdt:untypedAtomic and xdt:untyped and the duration subtypes xdt:yearMonthDuration and xdt:dayTimeDuration. The definition of those types in the [XPath/XQuery] type system is given below.

[Definition: The following type definition of xs:anyType reflects the semantics of the Ur type from Schema in the [XPath/XQuery] type system.]

  define type xs:anyType restricts xs:anyType {
    attribute * of type xs:anySimpleType*,
    ( xdt:anyAtomicType* | ( element * of type xs:anyType | text | comment | processing-instruction )* )
  }

[Definition: The following type definition of xs:anySimpleType reflects the semantics of the Ur simple type from Schema in the [XPath/XQuery] type system.]

  define type xs:anySimpleType restricts xs:anyType {
    xdt:anyAtomicType*
  }

The name of the Ur simple type is xs:anySimpleType. It is derived by restriction from xs:anyType, and its content is a sequence of values of any atomic type.

[Definition: The following type definition of xdt:anyAtomicType reflects the semantics of xdt:anyAtomicType in the [XPath/XQuery] type system.]

  define type xdt:anyAtomicType restricts xs:anySimpleType {
    ( xs:string
    | xs:boolean
    | xs:decimal
    | xs:float
    | xs:double
    | xs:duration
    | xs:dateTime
    | xs:time
    | xs:date
    | xs:gYearMonth
    | xs:gYear
    | xs:gMonthDay
    | xs:gDay
    | xs:gMonth
    | xs:hexBinary
    | xs:base64Binary
    | xs:anyURI
    | xs:QName
    | xs:NOTATION
    | xdt:untypedAtomic )
  }

[Definition: The following type definitions of the XML Schema primitive types reflect the semantics of the primitive types from Schema in the [XPath/XQuery] type system.]

  define type xs:string       restricts xdt:anyAtomicType
  define type xs:boolean      restricts xdt:anyAtomicType
  define type xs:decimal      restricts xdt:anyAtomicType
  define type xs:float        restricts xdt:anyAtomicType
  define type xs:double       restricts xdt:anyAtomicType
  define type xs:duration     restricts xdt:anyAtomicType
  define type xs:dateTime     restricts xdt:anyAtomicType
  define type xs:time         restricts xdt:anyAtomicType
  define type xs:date         restricts xdt:anyAtomicType
  define type xs:gYearMonth   restricts xdt:anyAtomicType
  define type xs:gYear        restricts xdt:anyAtomicType
  define type xs:gMonthDay    restricts xdt:anyAtomicType
  define type xs:gDay         restricts xdt:anyAtomicType
  define type xs:gMonth       restricts xdt:anyAtomicType
  define type xs:hexBinary    restricts xdt:anyAtomicType
  define type xs:base64Binary restricts xdt:anyAtomicType
  define type xs:anyURI       restricts xdt:anyAtomicType
  define type xs:QName        restricts xdt:anyAtomicType
  define type xs:NOTATION     restricts xdt:anyAtomicType

All of those primitive types derive from xdt:anyAtomicType. Note that the value space of each atomic type (such as xs:string) does not appear. The value space for each type is built-in and is as defined in [Schema Part 2].

[Definition: The type xdt:untypedAtomic is defined as follows.]

  define type xdt:untypedAtomic restricts xdt:anyAtomicType

Note that this rule does not indicate the value space of xdt:untypedAtomic. By definition, xdt:untypedAtomic has the same value space as xs:string.

The following example shows two atomic values. The first one is a value of type string containing "Databases". The second one is an untyped atomic value containing "Databases".

  "Databases" of type xs:string
  "Databases" of type xdt:untypedAtomic

[Definition: The type xdt:untyped is defined as follows.]

  define type xdt:untyped restricts xs:anyType {
    attribute * of type xdt:untypedAtomic*,
    ( element * of type xdt:untyped | text | comment | processing-instruction )*
  }

[Definition: The following type definitions of the XML Schema derived types reflect the semantics of the XML Schema types derived by restriction from another atomic type.]

  define type xs:normalizedString   restricts xs:string
  define type xs:token              restricts xs:normalizedString
  define type xs:language           restricts xs:token
  define type xs:NMTOKEN            restricts xs:token
  define type xs:Name               restricts xs:token
  define type xs:NCName             restricts xs:Name
  define type xs:ID                 restricts xs:NCName
  define type xs:IDREF              restricts xs:NCName
  define type xs:ENTITY             restricts xs:NCName
  define type xs:integer            restricts xs:decimal
  define type xs:nonPositiveInteger restricts xs:integer
  define type xs:negativeInteger    restricts xs:nonPositiveInteger
  define type xs:long               restricts xs:integer
  define type xs:int                restricts xs:long
  define type xs:short              restricts xs:int
  define type xs:byte               restricts xs:short
  define type xs:nonNegativeInteger restricts xs:integer
  define type xs:unsignedLong       restricts xs:nonNegativeInteger
  define type xs:unsignedInt        restricts xs:unsignedLong
  define type xs:unsignedShort      restricts xs:unsignedInt
  define type xs:unsignedByte       restricts xs:unsignedShort
  define type xs:positiveInteger    restricts xs:nonNegativeInteger

Three XML Schema built-in derived types are derived by list, as follows. Note that those derive directly from xs:anySimpleType, since they are derived by list, and that their value space is defined using a "one or more" occurrence indicator.

  define type xs:NMTOKENS restricts xs:anySimpleType { xs:NMTOKEN+ }
  define type xs:IDREFS   restricts xs:anySimpleType { xs:IDREF+ }
  define type xs:ENTITIES restricts xs:anySimpleType { xs:ENTITY+ }

For example, here is an element whose content is of type xs:IDREFS.

  element a of type xs:IDREFS {
    "id1" of type xs:IDREF,
    "id2" of type xs:IDREF,
    "id3" of type xs:IDREF
  }

Note that the type name xs:IDREFS derives from xs:anySimpleType, but not from xs:IDREF. As a consequence, calling the following three XQuery functions with the element a as a parameter succeeds for f1 and f2, but raises a type error for f3.

  declare function f1($x as element(*,xs:anySimpleType)) { $x };
  declare function f2($x as element(*,xs:IDREFS)) { $x };
  declare function f3($x as element(*,xs:IDREF)) { $x };
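
The outcome for f1, f2, and f3 follows from walking the derivation chain of xs:IDREFS. The following Python sketch is illustrative only and is not part of the formal semantics: the `BASE` table and the `derives_from` helper are hypothetical names, and the table holds only the definitions needed for this example.

```python
# Hypothetical, partial derivation table: type name -> the type it restricts.
BASE = {
    "xs:IDREFS": "xs:anySimpleType",  # derived by list, hence directly below xs:anySimpleType
    "xs:anySimpleType": "xs:anyType",
}


def derives_from(type_name, ancestor):
    """Return True if type_name is ancestor, or restricts it directly or transitively."""
    current = type_name
    while current is not None:
        if current == ancestor:
            return True
        current = BASE.get(current)  # None once the top of the table is reached
    return False


# Element a is annotated with xs:IDREFS. Each function accepts it only if
# xs:IDREFS derives from the type name used in its element(*, ...) parameter.
accepted = {
    "f1": derives_from("xs:IDREFS", "xs:anySimpleType"),
    "f2": derives_from("xs:IDREFS", "xs:IDREFS"),
    "f3": derives_from("xs:IDREFS", "xs:IDREF"),
}
```

Because xs:IDREF never appears on the chain that starts at xs:IDREFS, the check for f3 fails, which corresponds to the type error described above.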

[Definition: The totally ordered duration types, xdt:yearMonthDuration and xdt:dayTimeDuration, are derived by restriction from xs:duration.]

  define type xdt:yearMonthDuration restricts xs:duration
  define type xdt:dayTimeDuration   restricts xs:duration

[Definition: In addition, the Formal Semantics uses the additional type fs:numeric. This type is necessary for the specification of some of the XPath type conversion rules. It is defined as follows.]

  define type fs:numeric restricts xdt:anyAtomicType { xs:decimal | xs:float | xs:double }

3.5.2 Typed Value and String Value

The typed value of a node is computed by the fn:data function, and the string value of a node is computed by the fn:string function, both defined in [Functions and Operators]. The normative definitions of typed value and string value are given in [Data Model].

3.5.3 SequenceType Syntax

Introduction

Sequence types can be used in [XPath/XQuery] to refer to an XML Schema type. Sequence types are used to declare the types of function parameters and in several [XPath/XQuery] expressions.

The syntax of sequence types is described by the following grammar productions.

SequenceType

[119 (XQuery)]	`SequenceType^XQ`	::=	`("empty-sequence" "(" ")") \| (ItemType OccurrenceIndicator?)`
[121 (XQuery)]	`ItemType^XQ`	::=	`KindTest \| ("item" "(" ")") \| AtomicType`
[120 (XQuery)]	`OccurrenceIndicator^XQ`	::=	`"?" \| "*" \| "+"`
[122 (XQuery)]	`AtomicType^XQ`	::=	`QName`
[123 (XQuery)]	`KindTest^XQ`	::=	`DocumentTest \| ElementTest \| AttributeTest \| SchemaElementTest \| SchemaAttributeTest \| PITest \| CommentTest \| TextTest \| AnyKindTest`
[125 (XQuery)]	`DocumentTest^XQ`	::=	`"document-node" "(" (ElementTest \| SchemaElementTest)? ")"`
[133 (XQuery)]	`ElementTest^XQ`	::=	`"element" "(" (ElementNameOrWildcard ("," TypeName "?"?)?)? ")"`
[135 (XQuery)]	`SchemaElementTest^XQ`	::=	`"schema-element" "(" ElementDeclaration ")"`
[136 (XQuery)]	`ElementDeclaration^XQ`	::=	`ElementName`
[129 (XQuery)]	`AttributeTest^XQ`	::=	`"attribute" "(" (AttribNameOrWildcard ("," TypeName)?)? ")"`
[131 (XQuery)]	`SchemaAttributeTest^XQ`	::=	`"schema-attribute" "(" AttributeDeclaration ")"`
[132 (XQuery)]	`AttributeDeclaration^XQ`	::=	`AttributeName`
[134 (XQuery)]	`ElementNameOrWildcard^XQ`	::=	`ElementName \| "*"`
[138 (XQuery)]	`ElementName^XQ`	::=	`QName`
[130 (XQuery)]	`AttribNameOrWildcard^XQ`	::=	`AttributeName \| "*"`
[137 (XQuery)]	`AttributeName^XQ`	::=	`QName`
[139 (XQuery)]	`TypeName^XQ`	::=	`QName`
[128 (XQuery)]	`PITest^XQ`	::=	`"processing-instruction" "(" (NCName \| StringLiteral)? ")"`
[127 (XQuery)]	`CommentTest^XQ`	::=	`"comment" "(" ")"`
[126 (XQuery)]	`TextTest^XQ`	::=	`"text" "(" ")"`
[124 (XQuery)]	`AnyKindTest^XQ`	::=	`"node" "(" ")"`

Core Grammar

The Core grammar productions for sequence types are:

[83 (Core)]	`SequenceType`	::=	`("empty-sequence" "(" ")") \| (ItemType OccurrenceIndicator?)`
[85 (Core)]	`ItemType`	::=	`KindTest \| ("item" "(" ")") \| AtomicType`
[84 (Core)]	`OccurrenceIndicator`	::=	`"?" \| "*" \| "+"`
[86 (Core)]	`AtomicType`	::=	`QName`
[87 (Core)]	`KindTest`	::=	`DocumentTest \| ElementTest \| AttributeTest \| SchemaElementTest \| SchemaAttributeTest \| PITest \| CommentTest \| TextTest \| AnyKindTest`
[89 (Core)]	`DocumentTest`	::=	`"document-node" "(" (ElementTest \| SchemaElementTest)? ")"`
[97 (Core)]	`ElementTest`	::=	`"element" "(" (ElementNameOrWildcard ("," TypeName "?"?)?)? ")"`
[99 (Core)]	`SchemaElementTest`	::=	`"schema-element" "(" ElementDeclaration ")"`
[100 (Core)]	`ElementDeclaration`	::=	`ElementName`
[93 (Core)]	`AttributeTest`	::=	`"attribute" "(" (AttribNameOrWildcard ("," TypeName)?)? ")"`
[95 (Core)]	`SchemaAttributeTest`	::=	`"schema-attribute" "(" AttributeDeclaration ")"`
[96 (Core)]	`AttributeDeclaration`	::=	`AttributeName`
[98 (Core)]	`ElementNameOrWildcard`	::=	`ElementName \| "*"`
[102 (Core)]	`ElementName`	::=	`QName`
[94 (Core)]	`AttribNameOrWildcard`	::=	`AttributeName \| "*"`
[101 (Core)]	`AttributeName`	::=	`QName`
[103 (Core)]	`TypeName`	::=	`QName`
[92 (Core)]	`PITest`	::=	`"processing-instruction" "(" (NCName \| StringLiteral)? ")"`
[91 (Core)]	`CommentTest`	::=	`"comment" "(" ")"`
[90 (Core)]	`TextTest`	::=	`"text" "(" ")"`
[88 (Core)]	`AnyKindTest`	::=	`"node" "(" ")"`

The semantics of SequenceTypes is defined by means of normalization rules from SequenceTypes into types in the [XPath/XQuery] type system (see [2.4 The [XPath/XQuery] Type System]).

However, because the [XPath/XQuery] type system is not part of the [XPath/XQuery] syntax, the SequenceType syntax remains part of the [XPath/XQuery] Core. Normalization from SequenceTypes to types is not applied during the normalization phase but whenever a dynamic or static rule requires it. Normalization of SequenceTypes is the only example of normalization that does not yield an expression in the [XPath/XQuery] Core and that occurs on demand in dynamic or static rules.

3.5.4 SequenceType Matching

Introduction

During processing of a query, it is sometimes necessary to determine whether a given value matches a type that was declared using the SequenceType syntax. This process is known as SequenceType matching, and is formally specified in [8.3 Judgments for type matching].

Notation

To define normalization of SequenceTypes to the [XPath/XQuery] type system, the following auxiliary mapping rule is used.

[SequenceType]_sequencetype

Type

specifies that SequenceType is mapped to a Type, in the [XPath/XQuery] type system.

Normalization

OccurrenceIndicators are left unchanged when normalizing SequenceTypes into [XPath/XQuery] types. Each kind of SequenceType component is normalized separately into the [XPath/XQuery] type system.

[ItemType OccurrenceIndicator]_sequencetype

[ItemType]_sequencetype OccurrenceIndicator

The "empty-sequence()" sequence type is mapped to the empty type.

[empty-sequence()]_sequencetype

empty

An atomic type is normalized to itself in the [XPath/XQuery] type system.

[AtomicType]_sequencetype

AtomicType

An "element" SequenceType without content or with a wildcard and no type name is normalized into a wildcard element type.

[element()]_sequencetype

element * of type xs:anyType

[element(*)]_sequencetype

element * of type xs:anyType

An "element" SequenceType with a wildcard and a type name is normalized into a wildcard element type with a corresponding type name. The presence of a "?" after the type name indicates a nillable element.

[element(*,TypeName)]_sequencetype

element * of type TypeName

[element(*,TypeName?)]_sequencetype

element * nillable of type TypeName

An "element" SequenceType with a name and a type name is normalized into an element type with a corresponding type name. The presence of a "?" after the type name indicates a nillable element.

[element(ElementName,TypeName)]_sequencetype

element ElementName of type TypeName

[element(ElementName,TypeName?)]_sequencetype

element ElementName nillable of type TypeName

An "element" SequenceType with only a name is normalized into a nillable element type with a corresponding name. The normalization allows nillable elements because the semantics of SequenceTypes in that case allows it to match every possible element with that name, regardless of its type or nilled property.

[element(ElementName)]_sequencetype

element ElementName nillable of type xs:anyType
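
The element rules above amount to a small case analysis. The following Python sketch is one possible reading, not a normative definition: the function name `normalize_element_test` and its parameters are hypothetical, and types are represented as plain text.

```python
def normalize_element_test(name=None, type_name=None, nillable=False):
    """Mirror the [element(...)]_sequencetype rules and return the type as text.

    name:      None for element(), "*" for a wildcard, otherwise an element name.
    type_name: None when no TypeName is given.
    nillable:  True when the TypeName is followed by "?".
    """
    nillable_part = " nillable" if nillable else ""
    if name is None or name == "*":
        if type_name is None:
            # element() and element(*)
            return "element * of type xs:anyType"
        # element(*,TypeName) and element(*,TypeName?)
        return f"element *{nillable_part} of type {type_name}"
    if type_name is None:
        # element(ElementName): nillable, so any element with that name matches
        return f"element {name} nillable of type xs:anyType"
    # element(ElementName,TypeName) and element(ElementName,TypeName?)
    return f"element {name}{nillable_part} of type {type_name}"
```

For instance, `normalize_element_test("title")` yields the nillable form even though no "?" was written, which reflects the name-only rule.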

A "schema-element" SequenceType with an element declaration is normalized into a reference to the corresponding global element declaration.

[schema-element(ElementName)]_sequencetype

element ElementName

An "attribute" SequenceType without content or with a wildcard and no type name is normalized into a wildcard attribute type.

[attribute()]_sequencetype

attribute * of type xs:anySimpleType

[attribute(*)]_sequencetype

attribute * of type xs:anySimpleType

An "attribute" SequenceType with a wildcard and a type name is normalized into a wildcard attribute type with a corresponding type name.

[attribute(*,TypeName)]_sequencetype

attribute * of type TypeName

An "attribute" SequenceType with a name and a type name is normalized into an attribute type with a corresponding type name.

[attribute(AttributeName,TypeName)]_sequencetype

attribute AttributeName of type TypeName

A "schema-attribute" SequenceType with an attribute declaration is normalized into a reference to the corresponding global attribute declaration.

[schema-attribute(AttributeName)]_sequencetype

attribute AttributeName

A "document-node()" sequence type is normalized into the corresponding document type.

[document-node()]_sequencetype

document { (element * of type xs:anyType | text | comment | processing-instruction)* }

A "document-node" sequence type with an element test (resp. a schema element test) is normalized into the corresponding document type, whose content is the normalization of the element test (resp. schema element test), interleaved with an arbitrary sequence of processing-instruction and comment nodes.

[document-node(ElementTest)]_sequencetype

document { [ElementTest]_sequencetype & ( processing-instruction | comment ) *}

[document-node(SchemaElementTest)]_sequencetype

document { [SchemaElementTest]_sequencetype & ( processing-instruction | comment ) *}

A "processing-instruction()" SequenceType is normalized into the corresponding processing-instruction type.

[processing-instruction()]_sequencetype

processing-instruction

The [XPath/XQuery] type system does not model the target of a processing-instruction, which is treated as a dynamic property. Therefore a "processing-instruction" SequenceType with a string or NCName parameter is normalized into an optional processing-instruction type.

[processing-instruction(StringLiteral)]_sequencetype

processing-instruction?

[processing-instruction(NCName)]_sequencetype

processing-instruction?

A "comment()" SequenceType is normalized into the corresponding comment type.

[comment()]_sequencetype

comment

A "text()" SequenceType is normalized into the corresponding text type.

[text()]_sequencetype

text

The "node()" SequenceType denotes any node. It is normalized into a choice between the corresponding wildcard types for each kind of node.

[node()]_sequencetype

(element * of type xs:anyType | attribute * of type xs:anySimpleType | text | document { (element * of type xs:anyType | text | comment | processing-instruction)* } | comment | processing-instruction)

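The "item()" SequenceType denotes any node or atomic value. It is normalized into a choice between the corresponding wildcard types for each kind of node and the type of all atomic values.

[item()]_sequencetype

(element * of type xs:anyType | attribute * of type xs:anySimpleType | text | document { (element * of type xs:anyType | text | comment | processing-instruction)* } | comment | processing-instruction | xdt:anyAtomicType)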

3.6 Comments

[151 (XQuery)]	`Comment^XQ`	::=	`"(:" (CommentContents \| Comment)* ":)"`
[159 (XQuery)]	`CommentContents^XQ`	::=	`(Char+ - (Char* ('(:' \| ':)') Char*))`

Comments are lexical constructs only; they have no effect on the meaning of the query and therefore have no formal semantics.

3.7 XML-defined Terminals

The following terminals are defined by XML.

[152 (XQuery)]	`PITarget^XQ`	::=	`[http://www.w3.org/TR/REC-xml#NT-PITarget]^XML`
[153 (XQuery)]	`CharRef^XQ`	::=	`[http://www.w3.org/TR/REC-xml#NT-CharRef]^XML`
[154 (XQuery)]	`QName^XQ`	::=	`[http://www.w3.org/TR/REC-xml-names/#NT-QName]^Names`
[155 (XQuery)]	`NCName^XQ`	::=	`[http://www.w3.org/TR/REC-xml-names/#NT-NCName]^Names`
[156 (XQuery)]	`S^XQ`	::=	`[http://www.w3.org/TR/REC-xml#NT-S]^XML`
[157 (XQuery)]	`Char^XQ`	::=	`[http://www.w3.org/TR/REC-xml#NT-Char]^XML`

4 Expressions

This section gives the semantics of all the [XPath/XQuery] expressions. The organization of this section parallels the organization of Section 3 Expressions^XQ.

[31 (XQuery)]	`Expr^XQ`	::=	`ExprSingle ("," ExprSingle)*`
[32 (XQuery)]	`ExprSingle^XQ`	::=	`FLWORExpr \| QuantifiedExpr \| TypeswitchExpr \| IfExpr \| OrExpr`
[1 (XPath)]	`XPath^XP`	::=	`Expr`

For each expression, a short description and the relevant grammar productions are given. The semantics of an expression includes the normalization, static analysis, and dynamic evaluation phases. Recall that normalization rules translate [XPath/XQuery] syntax into Core syntax. In the sections that contain normalization rules, the Core grammar productions into which the expression is normalized are also provided. After normalization, sections on static type inference and dynamic evaluation define the static type and dynamic value for the Core expression.

Core Grammar

The Core grammar productions for expressions are:

[30 (Core)]	`Expr`	::=	`ExprSingle ("," ExprSingle)*`
[31 (Core)]	`ExprSingle`	::=	`FLWORExpr \| TypeswitchExpr \| IfExpr \| OrExpr`

Static Type Analysis

It is a static type error for any expression to have the empty type, except for the following expressions and functions:

Empty parentheses (), which denote the empty sequence.
The fn:data function and all functions in the fs namespace applied to empty parentheses ().
Any function which returns the empty type.

The reason for these exceptions is that they are typically part of the result of normalizing a larger user-level expression and are used to capture the semantics of the user-level expression when applied to the empty sequence.

The rule below enforces the above constraints. It is a static type error if the following conditions hold for a given expression Expr.

statEnv |- Expr : Type

Type <: empty

not(Expr is empty parentheses () or fn:data or any fs function applied to empty parentheses ())

A static type error is raised for expression Expr

In general, static type errors are raised whenever there are no static type inference rules that can compute the type of a given expression. This is the reason for the absence of a formal post-condition in this rule. There is indeed a rule that infers the type for expression Expr; however, the inferred type is empty and a static type error must still be raised.

Example

The above rule is useful in catching common mistakes, such as the misspelling of an element or attribute name or referencing of an element or attribute that does not exist. For instance, the following path expression

  $x/title

raises a static type error if the type of variable $x does not include any title child elements.

4.1 Primary Expressions

Primary expressions are the basic primitives of the language. They include literals, variables, function calls, and parenthesized expressions.

Primary Expressions

Core Grammar

The Core grammar production for primary expressions is:

Primary Expressions

[63 (Core)] PrimaryExpr ::= Literal | VarRef | ParenthesizedExpr | FunctionCall | Constructor

4.1.1 Literals

Introduction

A literal is a direct syntactic representation of an atomic value. [XPath/XQuery] supports two kinds of literals: string literals and numeric literals.

Literals

[85 (XQuery)]	`Literal^XQ`	::=	`NumericLiteral \| StringLiteral`
[86 (XQuery)]	`NumericLiteral^XQ`	::=	`IntegerLiteral \| DecimalLiteral \| DoubleLiteral`
[141 (XQuery)]	`IntegerLiteral^XQ`	::=	`Digits`
[142 (XQuery)]	`DecimalLiteral^XQ`	::=	`("." Digits) \| (Digits "." [0-9]*)`
[143 (XQuery)]	`DoubleLiteral^XQ`	::=	`(("." Digits) \| (Digits ("." [0-9]*)?)) [eE] [+-]? Digits`
[144 (XQuery)]	`StringLiteral^XQ`	::=	`('"' (PredefinedEntityRef \| CharRef \| EscapeQuot \| [^"&])* '"') \| ("'" (PredefinedEntityRef \| CharRef \| EscapeApos \| [^'&])* "'")`
[140 (XQuery)]	`URILiteral^XQ`	::=	`StringLiteral`
[145 (XQuery)]	`PredefinedEntityRef^XQ`	::=	`"&" ("lt" \| "gt" \| "amp" \| "quot" \| "apos") ";"`
[158 (XQuery)]	`Digits^XQ`	::=	`[0-9]+`

Core Grammar

The Core grammar productions for literals are:

Literals

[64 (Core)]	`Literal`	::=	`NumericLiteral \| StringLiteral`
[65 (Core)]	`NumericLiteral`	::=	`IntegerLiteral \| DecimalLiteral \| DoubleLiteral`
[105 (Core)]	`IntegerLiteral`	::=	`Digits`
[106 (Core)]	`DecimalLiteral`	::=	`("." Digits) \| (Digits "." [0-9]*)`
[107 (Core)]	`DoubleLiteral`	::=	`(("." Digits) \| (Digits ("." [0-9]*)?)) [eE] [+-]? Digits`
[108 (Core)]	`StringLiteral`	::=	`('"' (EscapeQuot \| [^"])* '"') \| ("'" (EscapeApos \| [^'])* "'")`
[119 (Core)]	`Digits`	::=	`[0-9]+`

Notation

To define the dynamic semantics of literals, we introduce the following auxiliary judgments.

The judgment

dynEnv |- LiteralExpr has atomic value AtomicValue

holds if the literal expression LiteralExpr corresponds to the value AtomicValue. This judgment yields an atomic value, according to the rules described in [XQuery 1.0: An XML Query Language]. Notably, this judgment deals with the handling of overflows for numeric literals, and the handling of character references and predefined entity references for string literals.

Normalization

Literals are left unchanged through normalization.

[IntegerLiteral]_Expr

IntegerLiteral

[DecimalLiteral]_Expr

DecimalLiteral

[DoubleLiteral]_Expr

DoubleLiteral

[StringLiteral]_Expr

StringLiteral

Static Type Analysis

In the static semantics, the type of an integer literal is simply xs:integer:

statEnv |- IntegerLiteral : xs:integer

Dynamic Evaluation

In the dynamic semantics, an integer literal is evaluated by constructing an atomic value in the data model, which consists of the literal value and its type:

dynEnv |- IntegerLiteral has atomic value Integer

dynEnv |- IntegerLiteral => Integer

The formal definitions of decimal, double, and string literals are analogous to those for integer.

Static Type Analysis

statEnv |- DecimalLiteral : xs:decimal

Dynamic Evaluation

dynEnv |- DecimalLiteral has atomic value Decimal

dynEnv |- DecimalLiteral => Decimal

Static Type Analysis

statEnv |- DoubleLiteral : xs:double

Dynamic Evaluation

dynEnv |- DoubleLiteral has atomic value Double

dynEnv |- DoubleLiteral => Double

Static Type Analysis

statEnv |- StringLiteral : xs:string

Dynamic Evaluation

dynEnv |- StringLiteral has atomic value String

dynEnv |- StringLiteral => String
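
The static types of numeric literals follow directly from their lexical form. As a rough illustration, the following Python sketch transcribes the IntegerLiteral, DecimalLiteral, and DoubleLiteral productions into regular expressions; the function name `static_type_of_numeric_literal` is hypothetical, and overflow handling is deliberately left out.

```python
import re

# Regular expressions transcribed from the literal productions.
INTEGER = re.compile(r"[0-9]+")
DECIMAL = re.compile(r"(\.[0-9]+)|([0-9]+\.[0-9]*)")
DOUBLE = re.compile(r"((\.[0-9]+)|([0-9]+(\.[0-9]*)?))[eE][+-]?[0-9]+")


def static_type_of_numeric_literal(text):
    """Return the static type of a numeric literal, or None if text is not one."""
    if INTEGER.fullmatch(text):
        return "xs:integer"
    if DECIMAL.fullmatch(text):
        return "xs:decimal"
    if DOUBLE.fullmatch(text):
        return "xs:double"
    return None
```

The three productions are mutually exclusive on complete tokens, so the order of the tests does not affect the result.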

4.1.2 Variable References

Introduction

A variable evaluates to the value to which the variable's QName is bound in the dynamic context.

Variable References

[87 (XQuery)]	`VarRef^XQ`	::=	`"$" VarName`
[88 (XQuery)]	`VarName^XQ`	::=	`QName`

Core Grammar

The Core grammar productions for variable references are:

Primary Expressions

[66 (Core)]	`VarRef`	::=	`"$" VarName`
[67 (Core)]	`VarName`	::=	`QName`

Normalization

Variable references are left unchanged through normalization.

[VarRef]_Expr

VarRef

Static Type Analysis

In the static semantics, the type of a variable is simply its type in the static environment statEnv.varType:

statEnv |- VarName of var expands to expanded-QName

statEnv.varType(expanded-QName) = Type

statEnv |- $ VarName : Type

If the variable is not bound in the static environment, a static type error is raised.

Dynamic Evaluation

In the dynamic semantics, a locally declared variable is evaluated by "looking up" its value in dynEnv.varValue:

statEnv |- VarName of var expands to expanded-QName

dynEnv.varValue(expanded-QName) = Value

dynEnv |- $ VarName => Value

In the dynamic semantics, a reference to a variable imported from a module is evaluated by accessing the dynamic context of the module in which the variable is declared.

statEnv |- VarName of var expands to expanded-QName

dynEnv.varValue(expanded-QName) = #IMPORTED(URI)

URI =>_{module_dynEnv} dynEnv₁

dynEnv₁.varValue(expanded-QName) = Value

dynEnv |- $ VarName => Value
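
The two dynamic rules differ only in the indirection through the module's dynamic environment. A minimal Python sketch, assuming environments are modeled as dictionaries; the `Imported` class, `lookup_variable`, and the example URI are hypothetical stand-ins for #IMPORTED(URI) and the module_dynEnv judgment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Imported:
    """Stands for the marker #IMPORTED(URI) stored in dynEnv.varValue."""
    uri: str


def lookup_variable(dyn_env, module_dyn_envs, expanded_qname):
    """Evaluate a variable reference whose name has already been expanded."""
    value = dyn_env["varValue"][expanded_qname]
    if isinstance(value, Imported):
        # Second rule: continue the lookup in the declaring module's environment.
        module_env = module_dyn_envs[value.uri]
        return module_env["varValue"][expanded_qname]
    # First rule: the variable is bound directly.
    return value


MODULE_URI = "http://example.org/m"  # illustrative module URI
module_dyn_envs = {MODULE_URI: {"varValue": {"m:limit": 10}}}
dyn_env = {"varValue": {"local:x": 1, "m:limit": Imported(MODULE_URI)}}
```

Here `lookup_variable(dyn_env, module_dyn_envs, "m:limit")` follows the marker and returns the value bound in the module's own environment.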

4.1.3 Parenthesized Expressions

[89 (XQuery)] ParenthesizedExpr^XQ ::= "(" Expr? ")"

Core Grammar

The Core grammar production for parenthesized expressions is:

[68 (Core)] ParenthesizedExpr ::= "(" Expr? ")"

Empty parentheses () always have the empty type. Remember that it is a static type error for most expressions other than () to have the empty type (see [4 Expressions] for the complete rule).

Static Type Analysis

statEnv |- () : empty

statEnv |- Expr : Type

statEnv |- ( Expr ) : Type

Dynamic Evaluation

Empty parentheses () evaluate to the empty sequence.

dynEnv |- () => ()

dynEnv |- Expr => Value

dynEnv |- ( Expr ) => Value

4.1.4 Context Item Expression

[90 (XQuery)] ContextItemExpr^XQ ::= "."

Introduction

A context item expression evaluates to the context item, which may be either a node or an atomic value.

Normalization

A context item expression is normalized to the built-in variable $fs:dot. Because it can only be bound through the external context or a path expression, there is no need for a specific typing rule to enforce that its value is a singleton item.

[.]_Expr

$fs:dot

4.1.5 Function Calls

Introduction

A function call consists of a QName followed by a parenthesized list of zero or more expressions. In [XPath/XQuery], the actual argument to a function is called an argument and the formal argument of a function is called a parameter. We use the same terminology here.

Function Calls

[93 (XQuery)] FunctionCall^XQ ::= QName "(" (ExprSingle ("," ExprSingle)*)? ")"

Because [XPath/XQuery] implicitly converts the values of function arguments, a normalization step is required.

Core Grammar

The Core grammar production for function calls is:

Function Calls

[71 (Core)] FunctionCall ::= QName "(" (ExprSingle ("," ExprSingle)*)? ")"

Notation

Normalization of function calls uses an auxiliary mapping []_{FunctionArgument(SequenceType)} used to insert conversions of function arguments that depend only on the expected SequenceType of the corresponding parameters. It is defined as follows:

[Expr]_{FunctionArgument(SequenceType)}

[[[Expr]_Expr]_{AtomizeAtomic(SequenceType)}]_{Convert(SequenceType)}

where

[Expr]_{AtomizeAtomic(SequenceType)} denotes

fn:data(Expr) If [SequenceType]_sequencetype <: xdt:anyAtomicType*

Expr Otherwise

which specifies that if the function expects atomic parameters, then fn:data is called to obtain them.

[Expr]_{Convert(SequenceType)} denotes

fs:convert-simple-operand(Expr,PrototypicalValue) If [SequenceType]_sequencetype <: xdt:anyAtomicType*

Expr Otherwise

where PrototypicalValue is a built-in atomic value used to encode the expected atomic type (for instance the value 1.0 if the expected type is xs:decimal). A value is used here since [XPath/XQuery] expressions cannot operate directly on types. Which value is chosen has no impact on the actual semantics; only its atomic type matters.

Note

The fs:convert-simple-operand function takes a PrototypicalValue, which is a value of the target type, to ensure that conversion to base types is possible even though types are not first class objects in [XPath/XQuery].
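
The composition of the two auxiliary mappings can be sketched as follows. This Python fragment is illustrative only: expressions are represented as text, `normalize_function_argument` is a hypothetical name, and the `expects_atomic` flag stands in for the test [SequenceType]_sequencetype <: xdt:anyAtomicType*, which is assumed to have been decided elsewhere.

```python
def normalize_function_argument(core_expr, expects_atomic, prototypical_value=None):
    """Apply AtomizeAtomic, then Convert, to an already normalized argument.

    core_expr:          the result of [Expr]_Expr, as text.
    expects_atomic:     True if the parameter type is a sequence of atomic values.
    prototypical_value: a literal of the expected atomic type, as text.
    """
    if not expects_atomic:
        # Both mappings leave the expression unchanged.
        return core_expr
    atomized = f"fn:data({core_expr})"
    return f"fs:convert-simple-operand({atomized},{prototypical_value})"
```

For a parameter of type xs:decimal, `normalize_function_argument("$x/price", True, "1.0")` wraps the argument in both calls, while a parameter of type node()* leaves the argument untouched.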

Normalization

Each argument expression in a function call is normalized to its corresponding Core expression by applying []_{FunctionArgument(SequenceType)}, where SequenceType is the expected SequenceType of the corresponding parameter.

[ QName (Expr₁, ..., Expr_n) ]_Expr

QName ( [Expr₁]_{FunctionArgument(SequenceType₁)}, ..., [Expr_n]_{FunctionArgument(SequenceType_n)} )

Note that this normalization rule depends on the function signature, which is used to get the types of the function parameters (SequenceType₁,...,SequenceType_n). For user-defined functions, the function signature can be obtained from the XQuery prolog where the function is declared. For built-in functions, the signature is given in the [Functions and Operators] document. For overloaded built-in functions, several signatures may exist; however, because they all correspond to sequences of atomic values, they all result in the same normalization.

Static Type Analysis

Different sets of static typing rules are used to type check function calls depending on which of the following categories they belong to: overloaded internal functions, built-in functions with a specific typing rule, and other built-in and user-defined functions.

The following rule is common to all those categories, and is used to bootstrap type inference, by first looking up the expanded QName for the function, then applying the appropriate set of inference rules depending on the function's category.

statEnv |- QName of func expands to expanded-QName

statEnv |- Expr₁ : Type₁

...

statEnv |- Expr_n : Type_n

statEnv |- expanded-QName(Type₁,...,Type_n) : Type

statEnv |- QName (Expr₁,...,Expr_n) : Type

The following depends on the kind of function call.

If the expanded QName for the function corresponds to one of the overloaded internal fs: functions listed in [B.2 Mapping of Overloaded Internal Functions], the rules in [B.2 Mapping of Overloaded Internal Functions] are applied.
If the expanded QName for the function corresponds to one of the built-in functions with a specialized typing rule, listed in [7 Additional Semantics of Functions], the rules in [7 Additional Semantics of Functions] are applied.
Otherwise, the following general rule is applied.

The rule looks up the function in the static environment and checks that some signature for the function satisfies the following constraint: the type of each actual argument is a subtype of some type that can be promoted to the type of the corresponding function parameter. In this case, the function call is well typed and the result type is the return type specified in the function's signature.

statEnv.funcType(expanded-QName,n) = declare function expanded-QName(Type₁', ..., Type_n') as Type'

statEnv |- Type₁ can be promoted to Type₁'

...

statEnv |- Type_n can be promoted to Type_n'

statEnv |- expanded-QName(Type₁, ..., Type_n) : Type'

The function body itself is not analyzed for each invocation: static typing of the function definition itself guarantees that the function body always returns a value of the declared return type.

Notice that the static context contains at most one function declaration for each function. This is possible since the treatment of overloaded operators is done through a set of specific rules which do not require access to the environment. See [B.2 Mapping of Overloaded Internal Functions].
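
The general rule can be pictured as a lookup followed by one promotion check per argument. The Python sketch below is a simplification under stated assumptions: the `BASE` and `PROMOTIONS` tables cover only a few atomic types, `can_be_promoted` approximates the promotion judgment (which is defined elsewhere in this document) by subtyping plus numeric promotion, and the function `local:half` is a made-up example.

```python
# Hypothetical, partial tables for a handful of atomic types.
BASE = {
    "xs:integer": "xs:decimal",
    "xs:decimal": "xdt:anyAtomicType",
    "xs:float": "xdt:anyAtomicType",
    "xs:double": "xdt:anyAtomicType",
    "xs:string": "xdt:anyAtomicType",
}
PROMOTIONS = {"xs:decimal": ("xs:float", "xs:double"), "xs:float": ("xs:double",)}


class StaticTypeError(Exception):
    """Raised when no inference rule applies."""


def is_subtype(t, u):
    while t is not None:
        if t == u:
            return True
        t = BASE.get(t)
    return False


def can_be_promoted(t, u):
    """Simplified: t is a subtype of u, or of a type that promotes to u."""
    if is_subtype(t, u):
        return True
    current = t
    while current is not None:
        if any(is_subtype(target, u) for target in PROMOTIONS.get(current, ())):
            return True
        current = BASE.get(current)
    return False


def type_of_call(func_type, expanded_qname, arg_types):
    """Return the declared return type if every argument type can be promoted."""
    signature = func_type.get((expanded_qname, len(arg_types)))
    if signature is None:
        raise StaticTypeError(f"unknown function {expanded_qname}#{len(arg_types)}")
    param_types, return_type = signature
    for arg_type, param_type in zip(arg_types, param_types):
        if not can_be_promoted(arg_type, param_type):
            raise StaticTypeError(f"{arg_type} cannot be promoted to {param_type}")
    return return_type


# statEnv.funcType, keyed by (expanded QName, arity); local:half is illustrative.
FUNC_TYPE = {("local:half", 1): (["xs:double"], "xs:double")}
```

Note that the function body plays no role here: only the signature stored in the static environment is consulted.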

Notation

The following auxiliary judgment

dynEnv |- function expanded-QName with types (Type₁,...,Type_n) on values (Value₁,...,Value_n) yields Value

holds when applying the function with expanded QName expanded-QName, and parameters of type (Type₁,...,Type_n) on the values (Value₁,...,Value_n) yields the value Value.

That judgment is defined below for each kind of function (user-defined, built-in, external, and imported functions).

Dynamic Evaluation

The following rule applies to all the different kinds of functions using the previously defined judgment.

dynEnv |- Expr₁ => Value₁

...

dynEnv |- Expr_n => Value_n

statEnv |- QName of func expands to expanded-QName

statEnv.funcType(expanded-QName) = FunctionSig

FunctionSig = declare function expanded-QName(Type₁, ..., Type_n) as Type

statEnv |- Value₁ against Type₁ promotes to Value₁'

...

statEnv |- Value_n against Type_n promotes to Value_n'

dynEnv |- function expanded-QName with types (Type₁,...,Type_n) on values (Value₁',...,Value_n') yields Value

statEnv |- Value against Type promotes to Value'

dynEnv |- QName ( Expr₁, ..., Expr_n ) => Value'

First the function name is expanded, and the expanded name is used to retrieve the function signature from the static environment. Then, the rule evaluates each function argument expression, and the resulting values are promoted according to the expected type for the function. The result of evaluating the function is obtained through the auxiliary judgment previously defined, and the resulting value is promoted according to the expected return type.

If the function is a user-defined function in a main module, the expression body is retrieved from the dynamic environment and used to compute the value of the function. The rule extends dynEnv.varValue by binding each formal variable to its corresponding value, and evaluates the body of the function in the new environment. The resulting value is the value of the function call.

dynEnv.funcDefn(expanded-QName(Type₁, ..., Type_n)) = (Expr, Variable₁, ... , Variable_n)

#MAIN =>_{module_dynEnv} dynEnv₁

dynEnv₁ + varValue( Variable₁ => Value₁; ...; Variable_n => Value_n) |- Expr => Value

dynEnv |- function expanded-QName with types (Type₁,...,Type_n) on values (Value₁,...,Value_n) yields Value

Note that the function body is evaluated in the dynamic environment containing the main module declarations.
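
The choice of environment can be made concrete with a small Python sketch. It assumes that a function body is modeled as a callable over a variable dictionary; `call_user_function` and the example names are hypothetical.

```python
def call_user_function(main_env, defn, values):
    """Evaluate a user-defined function from the main module.

    main_env: the main module's dynamic environment (dynEnv₁ in the rule).
    defn:     a pair (body, parameter names), as stored in funcDefn.
    values:   the already promoted argument values.
    """
    body, params = defn
    # dynEnv₁ + varValue(...): extend a copy, leaving the module environment intact.
    extended = dict(main_env["varValue"])
    extended.update(zip(params, values))
    return body(extended)


# The main module declares one global variable; the body refers to it and to $x.
main_env = {"varValue": {"local:rate": 2}}
scaled = (lambda variables: variables["x"] * variables["local:rate"], ["x"])
result = call_user_function(main_env, scaled, [21])
```

The body sees the module's global variables and its own parameters, but none of the variables that happen to be in scope at the call site.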

The rule for evaluating a function imported from a module is similar to that for evaluating a user-defined function in a main module, except that the function call is evaluated in the dynamic context of the module in which it is declared, and that the appropriate additional type matching must be performed.

dynEnv.funcDefn(expanded-QName(Type₁, ..., Type_n)) = #IMPORTED(URI)

URI =>_{module_statEnv} statEnv₁

URI =>_{module_dynEnv} dynEnv₁

statEnv₁.funcType(expanded-QName) = FunctionSig'

FunctionSig' = declare function expanded-QName(Type₁', ..., Type_n') as Type'

statEnv |- Value₁ matches Type₁'

...

statEnv |- Value_n matches Type_n'

dynEnv₁.funcDefn(expanded-QName(Type₁', ..., Type_n')) = (Expr, Variable₁, ... , Variable_n)

dynEnv₁ + varValue( Variable₁ => Value₁; ...; Variable_n => Value_n) |- Expr => Value

dynEnv |- function expanded-QName with types (Type₁,...,Type_n) on values (Value₁,...,Value_n) yields Value

If the function is a built-in function (resp. special formal semantics function), the value returned by the function is the one specified in [Functions and Operators] (resp. [7 Additional Semantics of Functions]).

dynEnv.funcDefn(expanded-QName(Type₁, ..., Type_n)) = #BUILT-IN

The built-in function expanded-QName (See [Functions and Operators] or [7 Additional Semantics of Functions]) applied to values (Value₁,...,Value_n) yields the value Value

dynEnv |- function expanded-QName with types (Type₁,...,Type_n) on values (Value₁,...,Value_n) yields Value

If the function is an external function, the value returned by the function is implementation-defined.

dynEnv.funcDefn(expanded-QName(Type₁, ..., Type_n)) = #EXTERNAL

The external function expanded-QName applied to values (Value₁,...,Value_n) yields the value Value

dynEnv |- function expanded-QName with types (Type₁,...,Type_n) on values (Value₁,...,Value_n) yields Value

4.2 Path Expressions

Introduction

Path expressions are used to locate nodes within a tree. There are two kinds of path expressions, absolute path expressions and relative path expressions. An absolute path expression is a rooted relative path expression. A relative path expression is composed of a sequence of steps.

Path Expressions

[68 (XQuery)]	`PathExpr^XQ`	::=	`("/" RelativePathExpr?) \| ("//" RelativePathExpr) \| RelativePathExpr`
[69 (XQuery)]	`RelativePathExpr^XQ`	::=	`StepExpr (("/" \| "//") StepExpr)*`

Core Grammar

PathExpr and RelativePathExpr are fully normalized, therefore they have no corresponding productions in the Core. The grammar for path expressions in the Core starts with the StepExpr production.

Normalization

Absolute path expressions are path expressions starting with the / or // symbols, indicating that the expression must be applied on the root node in the current context. The root node in the current context is the greatest ancestor of the context node. The following rules normalize absolute path expressions to relative ones. They use the fn:root function, which returns the greatest ancestor of its argument node. The treat expressions guarantee that the value bound to the context variable $fs:dot is a document node.

[/]_Expr

(fn:root(self::node()) treat as document-node())

[/ RelativePathExpr]_Expr

[((fn:root(self::node())) treat as document-node()) / RelativePathExpr]_Expr

[// RelativePathExpr]_Expr

[((fn:root(self::node())) treat as document-node()) / descendant-or-self::node() / RelativePathExpr]_Expr

[ StepExpr // RelativePathExpr ]_Expr

[StepExpr / descendant-or-self::node() / RelativePathExpr]_Expr

A composite relative path expression (using /) is normalized into a for expression by concatenating the sequences obtained by mapping each node of the left-hand side in document order to the sequence it generates on the right-hand side. The call to the fs:distinct-doc-order-or-atomic-sequence function ensures that the result is in document order without duplicates. The dynamic context is defined by binding the $fs:dot, $fs:sequence, $fs:position and $fs:last variables.

Note that sorting by document order enforces the restriction that input and output sequences contain only nodes, except that the last step in a path expression may return a sequence of atomic values.

[StepExpr / RelativePathExpr]_Expr

fs:apply-ordering-mode (

fs:distinct-doc-order-or-atomic-sequence (

let $fs:sequence as node()* := [StepExpr]_Expr return

let $fs:last := fn:count($fs:sequence) return

for $fs:dot at $fs:position in $fs:sequence return

[RelativePathExpr]_Expr

))
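
The following Python sketch is illustrative only and is not part of the formal semantics. It models the rule for StepExpr / RelativePathExpr on a toy document in which nodes are integers numbered in document order. The names PARENT, child, parent, distinct_doc_order and path are assumptions introduced for this example; fs:apply-ordering-mode and the atomic-value case of the last step are not modelled.

```python
# Toy document (an assumption for illustration): nodes are integers numbered
# in document order, and PARENT maps every non-root node to its parent.
PARENT = {1: 0, 2: 1, 3: 0, 4: 3}


def child(node):
    """Stand-in for child::node() evaluated with `node` as the context node."""
    return [n for n, p in sorted(PARENT.items()) if p == node]


def parent(node):
    """Stand-in for parent::node() evaluated with `node` as the context node."""
    return [PARENT[node]] if node in PARENT else []


def distinct_doc_order(nodes):
    """Stand-in for fs:distinct-doc-order: drop duplicates, sort by document order."""
    return sorted(set(nodes))


def path(left, right_step):
    """Model of [StepExpr / RelativePathExpr]_Expr when all results are nodes."""
    fs_sequence = left                     # let $fs:sequence := [StepExpr]_Expr
    result = []
    for fs_dot in fs_sequence:             # for $fs:dot at $fs:position in $fs:sequence
        result.extend(right_step(fs_dot))  # [RelativePathExpr]_Expr under the new focus
    return distinct_doc_order(result)


parents_of_children = path(child(0), parent)  # models child::node()/parent::node()
grandchildren = path(child(0), child)         # models child::node()/child::node()
```

In this sketch, both children of the root map to the same parent, so the duplicate is removed and parents_of_children is [0]; grandchildren is [2, 4].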

4.2.1 Steps

Note that this section uses some auxiliary judgments which are defined in [8.2 Judgments for step expressions and filtering].

Introduction

Steps

[70 (XQuery)]	`StepExpr^XQ`	::=	`FilterExpr \| AxisStep`
[71 (XQuery)]	`AxisStep^XQ`	::=	`(ReverseStep \| ForwardStep) PredicateList`
[72 (XQuery)]	`ForwardStep^XQ`	::=	`(ForwardAxis NodeTest) \| AbbrevForwardStep`
[75 (XQuery)]	`ReverseStep^XQ`	::=	`(ReverseAxis NodeTest) \| AbbrevReverseStep`
[82 (XQuery)]	`PredicateList^XQ`	::=	`Predicate*`

Core Grammar

The Core grammar productions for XPath steps are:

Steps

[54 (Core)]	`StepExpr`	::=	`PrimaryExpr \| AxisStep`
[55 (Core)]	`AxisStep`	::=	`ReverseStep \| ForwardStep`
[56 (Core)]	`ForwardStep`	::=	`ForwardAxis NodeTest`
[58 (Core)]	`ReverseStep`	::=	`ReverseAxis NodeTest`

Note

Step expressions can be followed by predicates. Normalization of predicates uses the following auxiliary mapping rule: []_Predicates, which is specified in [4.2.2 Predicates]. Normalization for step expressions also uses the following auxiliary mapping rule: []_Axis, which is specified in [4.2.1.1 Axes].

Normalization

Normalization of predicates needs to distinguish between forward steps, reverse steps, and primary expressions.

As explained in the [XPath/XQuery] document, applying a step in XPath changes the focus (or context). The change of focus is made explicit by the normalization rule below, which binds the variable $fs:dot to the node currently being processed, and the variable $fs:position to the position (i.e., the position within the input sequence) of that node.

There are two sets of normalization rules for Predicates. The first set of rules applies when the predicate is a numeric literal or the expression last(). The second set of rules applies to all predicate expressions other than numeric literals and the expression last(). In the first case, the normalization rules provide a more precise static type than if the general rules were applied.

When the predicate expression is a numeric literal or the fn:last function, the following normalization rules apply.

[ForwardStep PredicateList [ NumericLiteral ]]_Expr

let $fs:sequence := fs:apply-ordering-mode(fs:distinct-doc-order( [ForwardStep PredicateList]_Expr )) return

fn:subsequence($fs:sequence,NumericLiteral,1)

[ForwardStep PredicateList [ fn:last() ]]_Expr

let $fs:sequence := fs:apply-ordering-mode(fs:distinct-doc-order( [ForwardStep PredicateList]_Expr )) return

let $fs:last := fn:count($fs:sequence) return

fn:subsequence($fs:sequence,$fs:last,1)

When predicates are applied on a reverse step, the position variable is bound in reverse document order.

[ReverseStep PredicateList [ NumericLiteral ]]_Expr

let $fs:sequence := fs:apply-ordering-mode(fs:distinct-doc-order( [ReverseStep PredicateList]_Expr )) return

let $fs:last := fn:count($fs:sequence) return

let $fs:position := $fs:last - NumericLiteral + 1 return

fn:subsequence($fs:sequence,$fs:position,1)

When the step is a reverse axis, then the last item in the context sequence is the first in document order.

[ReverseStep PredicateList [ fn:last() ]]_Expr

let $fs:sequence := fs:apply-ordering-mode(fs:distinct-doc-order( [ReverseStep PredicateList]_Expr )) return

fn:subsequence($fs:sequence,1,1)

The normalization rules above all use the function fn:subsequence to select a particular item. The static typing rules for this function are defined in [7.2.13 The fn:subsequence function].

When predicates are applied on a forward step, the input sequence is first sorted in document order and duplicates are removed. The context is changed by binding the $fs:dot variable to each node in document order.

[ForwardStep PredicateList [ Expr ]]_Expr

let $fs:sequence := fs:apply-ordering-mode(fs:distinct-doc-order( [ForwardStep PredicateList]_Expr )) return

let $fs:last := fn:count($fs:sequence) return

for $fs:dot at $fs:position in $fs:sequence return

if [Expr]_Predicates then $fs:dot else ()

When predicates are applied on a reverse step, the input sequence is first sorted in document order and duplicates are removed. The context is changed by binding the $fs:dot variable to each node in document order, while the $fs:position variable is bound to the position of that node counted in reverse document order.

[ReverseStep PredicateList [ Expr ]]_Expr

let $fs:sequence := fs:apply-ordering-mode(fs:distinct-doc-order( [ReverseStep PredicateList]_Expr )) return

let $fs:last := fn:count($fs:sequence) return

for $fs:dot at $fs:new in $fs:sequence return

let $fs:position := $fs:last - $fs:new + 1 return

if [Expr]_Predicates then $fs:dot else ()
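
The difference between the two general rules can be illustrated with a small Python sketch. It is not part of the formal semantics. Nodes are modelled as integers whose value is their rank in document order, and the callable predicate(dot, position, last) is a hypothetical stand-in for [Expr]_Predicates; fs:apply-ordering-mode is not modelled.

```python
def forward_step_predicate(nodes, predicate):
    """Model of [ForwardStep PredicateList [ Expr ]]_Expr."""
    fs_sequence = sorted(set(nodes))          # fs:distinct-doc-order
    fs_last = len(fs_sequence)                # fn:count($fs:sequence)
    return [dot
            for fs_position, dot in enumerate(fs_sequence, start=1)
            if predicate(dot, fs_position, fs_last)]


def reverse_step_predicate(nodes, predicate):
    """Model of [ReverseStep PredicateList [ Expr ]]_Expr."""
    fs_sequence = sorted(set(nodes))          # fs:distinct-doc-order
    fs_last = len(fs_sequence)                # fn:count($fs:sequence)
    kept = []
    for fs_new, dot in enumerate(fs_sequence, start=1):
        fs_position = fs_last - fs_new + 1    # position counted from the end
        if predicate(dot, fs_position, fs_last):
            kept.append(dot)
    return kept


def is_first(dot, position, last):
    """Hypothetical predicate corresponding to [position() = 1]."""
    return position == 1


def is_last(dot, position, last):
    """Hypothetical predicate corresponding to [position() = last()]."""
    return position == last


# Three nodes given out of order and with a duplicate.
nodes = [7, 0, 3, 3]
```

With these definitions, the forward rule selects node 0 for is_first, whereas the reverse rule selects node 7, the node that comes last in document order. Both results are still returned in document order.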

Finally, a stand-alone forward or reverse step is normalized by the auxiliary normalization rule for Axis.

[ForwardStep]_Expr

fs:apply-ordering-mode([ForwardStep]_Axis)

[ReverseStep]_Expr

fs:apply-ordering-mode([ReverseStep]_Axis)

Static Type Analysis

The static semantics of an Axis NodeTest pair is obtained by retrieving the type of the context node, and applying the two filters (the Axis, and then the NodeTest with a PrincipalNodeKind) on the result.

statEnv.varType($fs:dot) = Type₁

statEnv |- Type₁ <: [node()]_sequencetype

statEnv |- axis Axis of Type₁ : Type₂

Axis principal PrincipalNodeKind

statEnv |- test NodeTest with PrincipalNodeKind of Type₂ : Type₃

statEnv |- Axis NodeTest : Type₃

Note

Note that the second judgment in the inference rule requires that the context item be a node, guaranteeing that a type error is raised when the context item is an atomic value.

Dynamic Evaluation

The dynamic semantics of an Axis NodeTest pair is obtained by retrieving the context node, and applying the two filters (Axis, then NodeTest) on the result. The application of each filter is expressed through the filter judgment as follows.

dynEnv.varValue($fs:dot) = Value₁

statEnv |- Value₁ matches node

dynEnv |- axis Axis of Value₁ => Value₂

Axis principal PrincipalNodeKind

dynEnv |- test NodeTest with PrincipalNodeKind of Value₂ => Value₃

dynEnv |- Axis NodeTest => fs:distinct-doc-order(Value₃)

Note

Note that the second judgment in the inference rule guarantees that the context item is bound to a node.

4.2.1.1 Axes

Introduction

The XQuery grammar for forward and reverse axes is as follows.

Axes

[73 (XQuery)]	`ForwardAxis^XQ`	::=	`("child" "::") \| ("descendant" "::") \| ("attribute" "::") \| ("self" "::") \| ("descendant-or-self" "::") \| ("following-sibling" "::") \| ("following" "::")`
[76 (XQuery)]	`ReverseAxis^XQ`	::=	`("parent" "::") \| ("ancestor" "::") \| ("preceding-sibling" "::") \| ("preceding" "::") \| ("ancestor-or-self" "::")`

In the case of XPath, forward axes also contain the namespace:: axis.

Core Grammar

The Core grammar productions for XPath axes are:

Axes

[57 (Core)]	`ForwardAxis`	::=	`("child" "::") \| ("descendant" "::") \| ("attribute" "::") \| ("self" "::") \| ("descendant-or-self" "::") \| ("namespace" "::")`
[59 (Core)]	`ReverseAxis`	::=	`("parent" "::") \| ("ancestor" "::") \| ("ancestor-or-self" "::")`

Notation

The normalization of axes uses the following auxiliary mapping rule: []_Axis.

Normalization

The normalization for all axes is specified as follows.

The semantics of the following(-sibling) and preceding(-sibling) axes are expressed by mapping them to Core expressions. All other axes are part of the Core and therefore are left unchanged through normalization.

[following-sibling:: NodeTest]_Axis

[let $e := . return $e/parent::node()/child:: NodeTest [.>>$e]]_Expr

[following:: NodeTest]_Axis

[ancestor-or-self::node()/following-sibling::node()/descendant-or-self::NodeTest]_Expr

All other forward axes are part of the Core [XPath/XQuery] and handled by the normalization rules below:

[child:: NodeTest]_Axis

child:: NodeTest

[attribute:: NodeTest]_Axis

attribute:: NodeTest

[self:: NodeTest]_Axis

self:: NodeTest

[descendant:: NodeTest]_Axis

descendant:: NodeTest

[descendant-or-self:: NodeTest]_Axis

descendant-or-self:: NodeTest

[namespace:: NodeTest]_Axis

namespace:: NodeTest

Reverse axes:

[preceding-sibling:: NodeTest]_Axis

[let $e := . return $e/parent::node()/child:: NodeTest [.<<$e]]_Expr

[preceding:: NodeTest]_Axis

[ancestor-or-self::node()/preceding-sibling::node()/descendant-or-self::NodeTest]_Expr

All other reverse axes are part of the Core [XPath/XQuery] and handled by the normalization rules below:

[parent:: NodeTest]_Axis

parent:: NodeTest

[ancestor:: NodeTest]_Axis

ancestor:: NodeTest

[ancestor-or-self:: NodeTest]_Axis

ancestor-or-self:: NodeTest
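
The rules for the four non-Core axes can be illustrated with a Python sketch over a toy document. It is not part of the formal semantics. Nodes are integers numbered in document order, the node test is fixed to node(), and attribute and namespace nodes are ignored. The names PARENT, children and the axis functions are assumptions introduced for this example.

```python
# Toy document (an assumption for illustration): root 0 has children 1, 2
# and 4, and node 3 is the only child of node 2. Integers give document order.
PARENT = {1: 0, 2: 0, 3: 2, 4: 0}


def children(node):
    return sorted(n for n, p in PARENT.items() if p == node)


def ancestor_or_self(node):
    chain = [node]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def descendant_or_self(node):
    result = [node]
    for c in children(node):
        result.extend(descendant_or_self(c))
    return result


def following_sibling(e):
    # let $e := . return $e/parent::node()/child::node()[. >> $e]
    if e not in PARENT:
        return []
    return [n for n in children(PARENT[e]) if n > e]


def preceding_sibling(e):
    # let $e := . return $e/parent::node()/child::node()[. << $e]
    if e not in PARENT:
        return []
    return [n for n in children(PARENT[e]) if n < e]


def following(node):
    # ancestor-or-self::node()/following-sibling::node()/descendant-or-self::node()
    found = set()
    for a in ancestor_or_self(node):
        for s in following_sibling(a):
            found.update(descendant_or_self(s))
    return sorted(found)


def preceding(node):
    # ancestor-or-self::node()/preceding-sibling::node()/descendant-or-self::node()
    found = set()
    for a in ancestor_or_self(node):
        for s in preceding_sibling(a):
            found.update(descendant_or_self(s))
    return sorted(found)
```

In this sketch, preceding(3) yields [1]: nodes 0 and 2 are ancestors of node 3 and are therefore never reached through a preceding-sibling step.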

4.2.1.2 Node Tests

Introduction

A node test is a condition applied on the nodes selected by an axis step. Node tests are described by the following grammar productions.

Node Tests

[78 (XQuery)]	`NodeTest^XQ`	::=	`KindTest \| NameTest`
[79 (XQuery)]	`NameTest^XQ`	::=	`QName \| Wildcard`
[80 (XQuery)]	`Wildcard^XQ`	::=	`"*" \| (NCName ":" "*") \| ("*" ":" NCName)`

Core Grammar

The Core grammar productions for node tests are:

Node Tests

[60 (Core)]	`NodeTest`	::=	`KindTest \| NameTest`
[61 (Core)]	`NameTest`	::=	`QName \| Wildcard`
[62 (Core)]	`Wildcard`	::=	`"*" \| (NCName ":" "*") \| ("*" ":" NCName)`

Notation

For convenience, we will use the grammar non-terminals Prefix, and LocalPart, both of which are NCNames, in some of the inference rules. They are defined by the following grammar productions.

Prefix and LocalPart

[18 (Formal)]	`Prefix`	::=	`NCName`
[19 (Formal)]	`LocalPart`	::=	`NCName`

4.2.2 Predicates

Introduction

A predicate consists of an expression, called a predicate expression, enclosed in square brackets.

[83 (XQuery)]	`Predicate^XQ`	::=	`"[" Expr "]"`

Notation

Normalization of predicates uses the following auxiliary mapping rule: []_Predicates.

Normalization

Predicates in path expressions are normalized with a special mapping rule:

[Expr]_Predicates

typeswitch ([Expr]_Expr)

case $v as fs:numeric return op:numeric-equal($v, $fs:position)

default $v return fn:boolean($v)

Note that the semantics of predicates whose input expression returns a numeric value also work if that value is not an integer. In those cases, op:numeric-equal returns false when the value is compared to a position. For example, the expression //a[3.4] returns the empty sequence.
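
The predicate mapping rule can be illustrated with a Python sketch. It is not part of the formal semantics. Python's bool() is used as a rough stand-in for fn:boolean and does not reproduce the effective boolean value rules exactly; predicate_holds and select are hypothetical names introduced for this example.

```python
from numbers import Number


def predicate_holds(value, fs_position):
    """Model of [Expr]_Predicates for an already evaluated predicate value."""
    # case $v as fs:numeric: compare the value with the context position.
    # Python booleans are excluded because bool is a subclass of int.
    if isinstance(value, Number) and not isinstance(value, bool):
        return value == fs_position      # op:numeric-equal($v, $fs:position)
    # default: approximate fn:boolean($v) with Python truthiness.
    return bool(value)


def select(sequence, expr):
    """Keep the items for which the predicate holds; expr(dot) evaluates Expr."""
    return [dot
            for fs_position, dot in enumerate(sequence, start=1)
            if predicate_holds(expr(dot), fs_position)]


items = ["a", "b", "c", "d"]
third = select(items, lambda dot: 3)             # models items[3]
none = select(items, lambda dot: 3.4)            # models items[3.4]
not_b = select(items, lambda dot: dot != "b")    # models items[. ne "b"]
```

Under these assumptions, third contains only "c", none is empty because no position equals 3.4, and not_b keeps every item except "b".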

4.2.3 Unabbreviated Syntax

The corresponding section in the [XPath/XQuery] document contains only examples.

4.2.4 Abbreviated Syntax

Abbreviated Syntax

[74 (XQuery)]	`AbbrevForwardStep^XQ`	::=	`"@"? NodeTest`
[77 (XQuery)]	`AbbrevReverseStep^XQ`	::=	`".."`

Normalization

Here are normalization rules for the abbreviated syntax.

[ .. ]_Expr

[parent::node()]_Axis

[ @ NameTest ]_Expr

attribute :: NameTest

[ NodeTest ]_Expr

[child :: NodeTest]_Axis

4.3 Sequence Expressions

Introduction

[XPath/XQuery] supports operators to construct and combine sequences. A sequence is an ordered collection of zero or more items. An item is either an atomic value or a node.

4.3.1 Constructing Sequences

Constructing Sequences

[31 (XQuery)]	`Expr^XQ`	::=	`ExprSingle ("," ExprSingle)*`
[49 (XQuery)]	`RangeExpr^XQ`	::=	`AdditiveExpr ( "to" AdditiveExpr )?`

Core Grammar

The Core grammar production for sequence expressions is:

Core Sequence Expressions

[30 (Core)]	`Expr`	::=	`ExprSingle ("," ExprSingle)*`

Normalization

A sequence expression is normalized into a sequence of normalized single expressions:

[Expr₁ , Expr₂]_Expr

[Expr₁]_Expr, [Expr₂]_Expr

Static Type Analysis

The type of the sequence expression is the sequence over the types of the individual expressions.

statEnv |- Expr₁ : Type₁ statEnv |- Expr₂ : Type₂

statEnv |- Expr₁ , Expr₂ : Type₁, Type₂

Dynamic Evaluation

Each expression in the sequence is evaluated and the resulting values are concatenated into one sequence.

dynEnv |- Expr₁ => Value₁ dynEnv |- Expr₂ => Value₂

dynEnv |- Expr₁, Expr₂ => Value₁, Value₂

Normalization

The range operator is normalized to the fs:to function.

[Expr₁ to Expr₂]_Expr

fs:to(([Expr₁]_Expr),([Expr₂]_Expr))

Static Type Analysis

The static semantics of the fs:to function is defined in [Functions and Operators].

Dynamic Evaluation

The dynamic semantics of the op:to operator is defined in [Functions and Operators].

4.3.2 Filter Expressions

Introduction

Filter Expression

[81 (XQuery)]	`FilterExpr^XQ`	::=	`PrimaryExpr PredicateList`

Core Grammar

There are no Core grammar productions for filter expressions as they are normalized to other Core expressions.

Normalization

When a predicate with a numeric literal or the last() expression is applied on a primary expression, it is normalized using the fn:subsequence function. This results in a more precise static type for those cases.

[PrimaryExpr PredicateList [ NumericLiteral ]]_Expr

let $fs:sequence := [PrimaryExpr PredicateList]_Expr return

fn:subsequence($fs:sequence,NumericLiteral,1)

[PrimaryExpr PredicateList [ last() ]]_Expr

let $fs:sequence := [PrimaryExpr PredicateList]_Expr return

let $fs:last := fn:count($fs:sequence) return

fn:subsequence($fs:sequence,$fs:last,1)

In the general case, when a predicate is applied on a primary expression, it is normalized to a FLWOR expression as follows. The input sequence is processed in sequence order and the context item is bound to each item in the input sequence.

[PrimaryExpr PredicateList [ Expr ]]_Expr

let $fs:sequence := [PrimaryExpr PredicateList]_Expr return

let $fs:last := fn:count($fs:sequence) return

for $fs:dot at $fs:position in $fs:sequence return

if ([Expr]_Predicates) then $fs:dot else ()

Static Type Analysis

There are no additional static type rules for filter expressions.

Dynamic Evaluation

There are no additional dynamic evaluation rules for filter expressions.

4.3.3 Combining Node Sequences

[XPath/XQuery] provides several operators for combining sequences of nodes.

Combining Sequences

[52 (XQuery)]	`UnionExpr^XQ`	::=	`IntersectExceptExpr ( ("union" \| "\|") IntersectExceptExpr )*`
[53 (XQuery)]	`IntersectExceptExpr^XQ`	::=	`InstanceofExpr ( ("intersect" \| "except") InstanceofExpr )*`

Notation

The union, intersect, and except expressions are normalized into function calls to the appropriate functions. The mapping function []_SequenceOp is defined by the following table:

SequenceOp	[SequenceOp]_SequenceOp
"union"	op:union
"\|"	op:union
"intersect"	op:intersect
"except"	op:except

Normalization

Operators for combining node sequences are normalized as follows.

[Expr₁ SequenceOp Expr₂]_Expr

fs:apply-ordering-mode ([SequenceOp]_SequenceOp ( [Expr₁]_Expr, [Expr₂]_Expr ))

Static Type Analysis

The static semantics of the operators that combine sequences are defined in [7.2.14 The op:union, op:intersect, and op:except operators].

Dynamic Evaluation

The dynamic semantics for function calls is given in [4.1.5 Function Calls].
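
The mapping table and the normalization rule for combining node sequences can be illustrated with a Python sketch. It is not part of the formal semantics. Nodes are modelled as integers whose value is their rank in document order, and fs:apply-ordering-mode is treated as the identity, which corresponds to ordered mode only. All function names are assumptions introduced for this example.

```python
def op_union(left, right):
    """Stand-in for op:union: nodes of either operand, in document order."""
    return sorted(set(left) | set(right))


def op_intersect(left, right):
    """Stand-in for op:intersect: nodes present in both operands."""
    return sorted(set(left) & set(right))


def op_except(left, right):
    """Stand-in for op:except: nodes of the left operand not in the right."""
    return sorted(set(left) - set(right))


# Model of the []_SequenceOp mapping table.
SEQUENCE_OP = {
    "union": op_union,
    "|": op_union,
    "intersect": op_intersect,
    "except": op_except,
}


def combine(left, sequence_op, right):
    """Model of [Expr1 SequenceOp Expr2]_Expr in ordered mode."""
    return SEQUENCE_OP[sequence_op](left, right)
```

For example, combine([3, 1], "|", [2, 3]) yields [1, 2, 3]: the shared node appears once and the result is in document order.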

4.4 Arithmetic Expressions

[XPath/XQuery] provides arithmetic operators for addition, subtraction, multiplication, division, and modulus, in their usual binary and unary forms.

Arithmetic Expressions

[50 (XQuery)]	`AdditiveExpr^XQ`	::=	`MultiplicativeExpr ( ("+" \| "-") MultiplicativeExpr )*`
[51 (XQuery)]	`MultiplicativeExpr^XQ`	::=	`UnionExpr ( ("*" \| "div" \| "idiv" \| "mod") UnionExpr )*`
[58 (XQuery)]	`UnaryExpr^XQ`	::=	`("-" \| "+")* ValueExpr`
[59 (XQuery)]	`ValueExpr^XQ`	::=	`ValidateExpr \| PathExpr \| ExtensionExpr`

Core Grammar

The Core grammar production for arithmetic expressions is:

[48 (Core)]	`ValueExpr`	::=	`ValidateExpr \| StepExpr \| ExtensionExpr`

Notation

The mapping functions []_ArithOp and []_UnaryArithOp are defined by the following tables:

ArithOp	[ArithOp]_ArithOp
"+"	fs:`plus`
"-"	fs:`minus`
"*"	fs:`times`
"div"	fs:`div`
"mod"	fs:`mod`

UnaryArithOp	[UnaryArithOp]_UnaryArithOp
"+"	fs:`unary-plus`
"-"	fs:`unary-minus`

Core Grammar

There are no Core grammar productions for arithmetic expressions as they are normalized to other Core expressions.

Normalization

The normalization rules for all the arithmetic operators except idiv first atomize each argument by applying fn:data and then apply the internal function fs:convert-operand to each argument. If the first argument to this function has type xdt:untypedAtomic, then the first argument is cast to a double, otherwise it is returned unchanged. The overloaded internal function corresponding to the arithmetic operator is then applied to the two converted arguments. The table above maps the operators to the corresponding internal function. The mapping from the overloaded internal functions to the corresponding non-overloaded function is given in [B.2 Mapping of Overloaded Internal Functions].

[Expr₁ ArithOp Expr₂]_Expr

[ArithOp]_ArithOp (	fs:`convert-operand`(`fn:data`(([Expr₁]_Expr)), 1.0E0),
	fs:`convert-operand`(`fn:data`(([Expr₂]_Expr)), 1.0E0))

The normalization rules for the idiv operator are similar, but instead of casting arguments with type xdt:untypedAtomic to xs:double, they are cast to xs:integer.

[Expr₁ idiv Expr₂]_Expr

fs:`idiv` (	fs:`convert-operand`(`fn:data`(([Expr₁]_Expr)), 1),
	fs:`convert-operand`(`fn:data`(([Expr₂]_Expr)), 1))

The unary operators are mapped similarly.

[+ Expr]_Expr

fs:unary-plus(fs:convert-operand(fn:data(([Expr]_Expr)), 1.0E0))

[- Expr]_Expr

fs:unary-minus(fs:convert-operand(fn:data(([Expr]_Expr)), 1.0E0))
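
The role of fs:convert-operand in these rules can be illustrated with a Python sketch. It is not part of the formal semantics. The class UntypedAtomic is a hypothetical stand-in for a value of type xdt:untypedAtomic, Python float and int stand in for xs:double and xs:integer, and atomization by fn:data is assumed to have been applied already. The idiv model handles integer operands only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UntypedAtomic:
    """Hypothetical wrapper for an xdt:untypedAtomic value."""
    text: str


def convert_operand(actual, expected):
    """Model of fs:convert-operand as used by the arithmetic rules.

    An untyped value is cast to the type of `expected` (float for 1.0E0,
    int for 1); any other value is returned unchanged.
    """
    if isinstance(actual, UntypedAtomic):
        return type(expected)(actual.text)
    return actual


def plus(left, right):
    """Model of [Expr1 + Expr2]_Expr."""
    return convert_operand(left, 1.0) + convert_operand(right, 1.0)


def idiv(left, right):
    """Model of [Expr1 idiv Expr2]_Expr for integer operands."""
    x = convert_operand(left, 1)
    y = convert_operand(right, 1)
    quotient = abs(x) // abs(y)          # truncate toward zero, not toward -inf
    return quotient if (x < 0) == (y < 0) else -quotient


def unary_minus(operand):
    """Model of [- Expr]_Expr."""
    return -convert_operand(operand, 1.0)
```

In this sketch, plus(UntypedAtomic("1.5"), 2) yields 3.5 because the untyped operand is treated as a double, while idiv(UntypedAtomic("7"), 2) yields 3 because the untyped operand is treated as an integer.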

Static Type Analysis

The static semantics for function calls is given in [4.1.5 Function Calls].

Dynamic Evaluation

The dynamic semantics for function calls is given in [4.1.5 Function Calls].

4.5 Comparison Expressions

Introduction

Comparison expressions allow two values to be compared. [XPath/XQuery] provides three kinds of comparison expressions, called value comparisons, general comparisons, and node comparisons.

Comparison Expressions

[48 (XQuery)]	`ComparisonExpr^XQ`	::=	`RangeExpr ( (ValueComp \| GeneralComp \| NodeComp) RangeExpr )?`
[61 (XQuery)]	`ValueComp^XQ`	::=	`"eq" \| "ne" \| "lt" \| "le" \| "gt" \| "ge"`
[60 (XQuery)]	`GeneralComp^XQ`	::=	`"=" \| "!=" \| "<" \| "<=" \| ">" \| ">="`
[62 (XQuery)]	`NodeComp^XQ`	::=	`"is" \| "<<" \| ">>"`

4.5.1 Value Comparisons

Notation

The mapping function []_ValueOp is defined by the following table:

ValueOp	[ValueOp]_ValueOp
"`eq`"	fs:eq
"`ne`"	fs:ne
"`lt`"	fs:lt
"`le`"	fs:le
"`gt`"	fs:gt
"`ge`"	fs:ge

Core Grammar

There are no Core grammar productions for value comparisons as they are normalized to other Core expressions.

Normalization

The normalization rules for the value comparison operators first atomize each argument by applying fn:data and then apply the internal function fs:convert-operand defined in [7.1.1 The fs:convert-operand function]. If the first argument to this function has type xdt:untypedAtomic, then the first argument is cast to a string, otherwise it is returned unchanged. The overloaded internal function corresponding to the value comparison operator is then applied to the two converted arguments. The table above maps the value operators to the corresponding internal function. The mapping from the overloaded internal functions to the corresponding non-overloaded function is given in [B.2 Mapping of Overloaded Internal Functions].

[Expr₁ ValueOp Expr₂]_Expr

[ValueOp]_ValueOp (	fs:`convert-operand`(`fn:data`(([Expr₁]_Expr)), "string"),
	fs:`convert-operand`(`fn:data`(([Expr₂]_Expr)), "string") )

Static Type Analysis

The static semantics for function calls is given in [4.1.5 Function Calls]. The comparison functions all have return type xs:boolean, as specified in [Functions and Operators].

Dynamic Evaluation

The dynamic semantics for function calls is given in [4.1.5 Function Calls].

4.5.2 General Comparisons

Introduction

General comparisons are defined by adding existential semantics to value comparisons. The operands of a general comparison may be sequences of any length. The result of a general comparison is always true or false.

Notation

For convenience, GeneralOp denotes the operators "=", "!=", "<", "<=", ">", or ">=".

The function []_GeneralOp is defined by the following table:

GeneralOp	[GeneralOp]_GeneralOp
"`=`"	fs:eq
"`!=`"	fs:ne
"`<`"	fs:lt
"`<=`"	fs:le
"`>`"	fs:gt
"`>=`"	fs:ge

Core Grammar

There are no Core grammar productions for general comparisons as they are normalized to existentially quantified Core expressions.

Normalization

The normalization rule for a general comparison expression first atomizes each argument by applying fn:data and then applies the existentially quantified some expression to each sequence. The internal function fs:convert-operand is applied to each pair of atomic values. If the first argument to this function has type xdt:untypedAtomic, then the first argument is cast to the type of the second argument. If the second argument also has type xdt:untypedAtomic, the first argument is cast to a string. The overloaded internal function corresponding to the general comparison operator is then applied to the two converted values.

[Expr₁ GeneralOp Expr₂]_Expr

some $v1 in fn:data(([Expr₁]_Expr)) satisfies

some $v2 in fn:data(([Expr₂]_Expr)) satisfies

let $u1 := fs:convert-operand($v1, $v2) return

let $u2 := fs:convert-operand($v2, $v1) return

[GeneralOp]_GeneralOp ($u1, $u2)
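
The existential semantics of this rule can be illustrated with a Python sketch. It is not part of the formal semantics. The class UntypedAtomic is a hypothetical stand-in for xdt:untypedAtomic, operands are assumed to be already atomized, and convert_operand follows only the description given in this section, simplified to Python str, int and float values.

```python
import operator
from dataclasses import dataclass


@dataclass(frozen=True)
class UntypedAtomic:
    """Hypothetical wrapper for an xdt:untypedAtomic value."""
    text: str


def convert_operand(actual, expected):
    """Simplified model of fs:convert-operand (str, int and float only)."""
    if not isinstance(actual, UntypedAtomic):
        return actual                        # typed values are unchanged
    if isinstance(expected, UntypedAtomic):
        return actual.text                   # both untyped: cast to a string
    return type(expected)(actual.text)       # cast to the other operand's type


# Model of the []_GeneralOp mapping table.
GENERAL_OP = {
    "=": operator.eq, "!=": operator.ne,
    "<": operator.lt, "<=": operator.le,
    ">": operator.gt, ">=": operator.ge,
}


def general_compare(left, general_op, right):
    """Model of [Expr1 GeneralOp Expr2]_Expr on atomized sequences."""
    compare = GENERAL_OP[general_op]
    return any(
        compare(convert_operand(v1, v2), convert_operand(v2, v1))
        for v1 in left       # some $v1 in ... satisfies
        for v2 in right      # some $v2 in ... satisfies
    )
```

Because the comparison is existential, general_compare([1, 2], "=", [1, 2]) and general_compare([1, 2], "!=", [1, 2]) are both true in this sketch, and any comparison with an empty sequence is false.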

4.5.3 Node Comparisons

Core Grammar

There are no Core grammar productions for node comparisons as they are normalized to other Core expressions.

Normalization

The normalization rules for node comparisons map each argument expression and then apply the internal function corresponding to the node comparison operator. The internal functions are defined in [B.2 Mapping of Overloaded Internal Functions].

[Expr₁ is Expr₂]_Expr

fs:is-same-node(([Expr₁]_Expr), ([Expr₂]_Expr))

[Expr₁ << Expr₂]_Expr

fs:node-before(([Expr₁]_Expr), ([Expr₂]_Expr))

[Expr₁ >> Expr₂]_Expr

fs:node-after(([Expr₁]_Expr), ([Expr₂]_Expr))

Static Type Analysis

The static semantics for the internal functions are defined in [B.2 Mapping of Overloaded Internal Functions].

Dynamic Evaluation

The dynamic semantics for the internal functions is defined in [B.2 Mapping of Overloaded Internal Functions].

4.6 Logical Expressions

Introduction

A logical expression is either an and-expression or an or-expression. The value of a logical expression is always one of the boolean values: true or false.

Logical Expressions

[46 (XQuery)]	`OrExpr^XQ`	::=	`AndExpr ( "or" AndExpr )*`
[47 (XQuery)]	`AndExpr^XQ`	::=	`ComparisonExpr ( "and" ComparisonExpr )*`

Core Grammar

The Core grammar productions for logical expressions are:

Core Logical Expressions

[44 (Core)]	`OrExpr`	::=	`AndExpr ( "or" AndExpr )*`
[45 (Core)]	`AndExpr`	::=	`CastableExpr ( "and" CastableExpr )*`

Normalization

The normalization rules for "and" and "or" first get the effective boolean value of each argument, then apply the appropriate Core operator.

[Expr₁ and Expr₂]_Expr

fn:boolean(([Expr₁]_Expr)) and fn:boolean(([Expr₂]_Expr))

[Expr₁ or Expr₂]_Expr

fn:boolean(([Expr₁]_Expr)) or fn:boolean(([Expr₂]_Expr))

Static Type Analysis

The logical expressions require that each subexpression have type xs:boolean. The result type is also xs:boolean.

statEnv |- Expr₁ : xs:boolean statEnv |- Expr₂ : xs:boolean

statEnv |- Expr₁ and Expr₂ : xs:boolean

statEnv |- Expr₁ : xs:boolean statEnv |- Expr₂ : xs:boolean

statEnv |- Expr₁ or Expr₂ : xs:boolean

Dynamic Evaluation

The dynamic semantics of logical expressions is non-deterministic. This non-determinism permits implementations to use short-circuit evaluation strategies when evaluating logical expressions. In the expression, Expr₁ and Expr₂, if either expression raises an error or evaluates to false, the entire expression may raise an error or evaluate to false. In the expression, Expr₁ or Expr₂, if either expression raises an error or evaluates to true, the entire expression may raise an error or evaluate to true.

dynEnv |- Expr_i => false 1 <= i <= 2

dynEnv |- Expr₁ and Expr₂ => false

dynEnv |- Expr₁ => true dynEnv |- Expr₂ => true

dynEnv |- Expr₁ and Expr₂ => true

dynEnv |- Expr_i => true 1 <= i <= 2

dynEnv |- Expr₁ or Expr₂ => true

dynEnv |- Expr₁ => false dynEnv |- Expr₂ => false

dynEnv |- Expr₁ or Expr₂ => false
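
The non-determinism permitted by these rules can be illustrated with a Python sketch. It is not part of the formal semantics. Operands are modelled as zero-argument callables, a dynamic error is modelled as a Python ValueError, and the order parameter is a hypothetical device that selects which operand an implementation chooses to evaluate first.

```python
def eval_and(left, right, order=(0, 1)):
    """Model of Expr1 and Expr2 with an implementation-chosen evaluation order."""
    operands = (left, right)
    for index in order:
        if not operands[index]():   # a false operand decides the result
            return False            # the other operand is never evaluated
    return True


def eval_or(left, right, order=(0, 1)):
    """Model of Expr1 or Expr2 with an implementation-chosen evaluation order."""
    operands = (left, right)
    for index in order:
        if operands[index]():       # a true operand decides the result
            return True             # the other operand is never evaluated
    return False


def fails():
    """Operand whose evaluation raises a dynamic error."""
    raise ValueError("dynamic error")


def outcome(evaluate, left, right, order):
    """Return the boolean result, or the string "error" if evaluation fails."""
    try:
        return evaluate(left, right, order)
    except ValueError:
        return "error"


left_first = outcome(eval_and, fails, lambda: False, (0, 1))
right_first = outcome(eval_and, fails, lambda: False, (1, 0))
```

In this sketch, left_first is "error" and right_first is False: both outcomes are permitted for the same expression, depending on which operand the implementation evaluates first.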

4.7 Constructors

[XPath/XQuery] supports two forms of constructors. Direct constructors support literal XML syntax for elements, attributes, text nodes, processing-instructions and comments. Computed constructors can be used to construct elements, attributes, text nodes, processing-instructions, comments, and document nodes. All direct constructors are normalized into computed constructors, i.e., there are no direct-constructor expressions in the Core.

4.7.1 Direct Element Constructors

Introduction

The static and dynamic semantics of the direct forms of element and attribute constructors are specified on the equivalent computed element and attribute constructors.

Constructors

[94 (XQuery)]	`Constructor^XQ`	::=	`DirectConstructor \| ComputedConstructor`
[95 (XQuery)]	`DirectConstructor^XQ`	::=	`DirElemConstructor \| DirCommentConstructor \| DirPIConstructor`
[96 (XQuery)]	`DirElemConstructor^XQ`	::=	`"<" QName DirAttributeList ("/>" \| (">" DirElemContent* "</" QName S? ">"))`
[101 (XQuery)]	`DirElemContent^XQ`	::=	`DirectConstructor \| CDataSection \| CommonContent \| ElementContentChar`
[148 (XQuery)]	`ElementContentChar^XQ`	::=	`Char - [{}<&]`
[102 (XQuery)]	`CommonContent^XQ`	::=	`PredefinedEntityRef \| CharRef \| "{{" \| "}}" \| EnclosedExpr`
[107 (XQuery)]	`CDataSection^XQ`	::=	`"<![CDATA[" CDataSectionContents "]]>"`
[108 (XQuery)]	`CDataSectionContents^XQ`	::=	`(Char* - (Char* ']]>' Char*))`
[97 (XQuery)]	`DirAttributeList^XQ`	::=	`(S (QName S? "=" S? DirAttributeValue)?)*`
[98 (XQuery)]	`DirAttributeValue^XQ`	::=	`('"' (EscapeQuot \| QuotAttrValueContent)* '"') \| ("'" (EscapeApos \| AposAttrValueContent)* "'")`
[99 (XQuery)]	`QuotAttrValueContent^XQ`	::=	`QuotAttrContentChar \| CommonContent`
[100 (XQuery)]	`AposAttrValueContent^XQ`	::=	`AposAttrContentChar \| CommonContent`
[149 (XQuery)]	`QuotAttrContentChar^XQ`	::=	`Char - ["{}<&]`
[150 (XQuery)]	`AposAttrContentChar^XQ`	::=	`Char - ['{}<&]`
[146 (XQuery)]	`EscapeQuot^XQ`	::=	`'""'`
[147 (XQuery)]	`EscapeApos^XQ`	::=	`"''"`
[29 (XQuery)]	`EnclosedExpr^XQ`	::=	`"{" Expr "}"`

Notation

The auxiliary mapping rules []_{ElementContent}, and []_{ElementContent-unit} are defined in this section and are used for the normalization of the content of direct element constructors.

Core Grammar

The Core grammar productions for constructors are:

Constructors

[72 (Core)]	`Constructor`	::=	`ComputedConstructor`
[73 (Core)]	`ComputedConstructor`	::=	`CompDocConstructor \| CompElemConstructor \| CompAttrConstructor \| CompTextConstructor \| CompCommentConstructor \| CompPIConstructor`
[28 (Core)]	`EnclosedExpr`	::=	`"{" Expr "}"`

There are no Core grammar productions for direct XML element or attribute constructors as they are normalized to computed constructors.

Normalization

We start with the rules for normalizing a direct element constructor's content. Literal XML character data (CDATA) is assumed to be processed directly at parsing level so it does not require any formal treatment. We distinguish between direct element constructors that contain only one element-content unit and those that contain more than one element-content unit. An element-content unit is a contiguous sequence of literal characters (character references, escaped braces, and predefined entity references), one enclosed expression, one direct element constructor, one XML comment, or one XML processing instruction. Here are three direct element constructors that each contain one element-content unit:

<date>{ xs:date("2003-03-18") }</date>

<name>Dizzy Gillespe</name>

<comment><!-- Just a comment --></comment>

The first contains one enclosed expression, the second contains one contiguous sequence of characters, and the third contains one XML comment.

After boundary-space is stripped, the next example contains six element-content units:

<address>
  <!-- Dizzy's address -->
  { 123 }-0A <street>Roosevelt Ave.</street> Flushing, NY { 11368 }
</address>

It contains one XML comment, followed by one enclosed expression that contains the integer 123, one contiguous sequence of characters ("-0A "), one direct XML element constructor, one contiguous sequence of characters (" Flushing, NY "), and one enclosed expression that contains the integer 11368. Evaluation of that constructor will result in the following element.

<address><!-- Dizzy's address -->123-0A <street>Roosevelt Ave.</street> Flushing, NY 11368</address>

Adjacent element-content units are convenient because they permit arbitrary interleaving of text and atomic data. During evaluation, atomic values are converted to text nodes containing the string representations of the atomic values, and then adjacent text nodes are concatenated together. In the example above, the integer 123 is converted to a string and concatenated with "-0A" and the result is a single text node containing "123-0A".

In general, we do not want to convert all atomic values to text nodes, especially when performing static-type analysis, because we lose useful type information. For example, if we normalize the first example above as follows, we lose the important information that the user constructed a date value, not just a text node containing an arbitrary string:

<date>{ xs:date("2003-03-18") }</date>
 (normalization that loses type information) == 
element date { text { "2003-03-18" } }

To preserve useful type information, we distinguish between direct element constructors that contain one element-content unit and those that contain more than one (because multiple element-content units commonly denote concatenation of atomic data and text). Below are two examples of normalization for element constructors.

<date>{ xs:date("2003-03-18") }</date>
 ==
element date { xs:date("2003-03-18") } 

<address>
  <!-- Dizzy's address -->
  { 123 }-0A <street>Roosevelt Ave.</street> Flushing, NY { 11368 }
</address>
 ==
element address {
  fs:item-sequence-to-node-sequence(
    comment { " Dizzy's address "},
    123, 
    text { "-0A "}, 
    element street {"Roosevelt Ave."},
    text { " Flushing, NY "  },
    11368
  )
}

Notation

We introduce the following auxiliary grammar productions.

[92 (Formal)]	`ElementContentUnit`	::=	`DirectConstructor \| EnclosedExpr \| DirCharsUnit`
[93 (Formal)]	`DirCharsUnit`	::=	`(ElementContentChar \| PredefinedEntityRef \| CharRef \| "{{" \| "}}")+`

Given the distinction between direct element constructors that we made above, we give two normalization rules for a direct element constructor's content. If the direct element constructor contains exactly one element-content unit, we simply normalize that unit by applying the normalization rule for the element content:

[ ElementContentUnit ]_{ElementContent-unit}

[ ElementContentUnit ]_{ElementContent}

If the direct element constructor contains more than one element-content unit, we normalize each unit individually and construct a sequence of the normalized results interleaved with empty text nodes. The empty text nodes guarantee that the results of evaluating consecutive element-content units can be distinguished. Then we apply the function fs:item-sequence-to-node-sequence. Section 3.7.1 Direct Element Constructors^XQ specifies the rules for converting a sequence of atomic values and nodes into a sequence of nodes before element construction. The Formal Semantics function fs:item-sequence-to-node-sequence implements these conversion rules.

[ElementContentUnit₁ ..., ElementContentUnit_n]_{ElementContent-unit}, n > 1

fs:item-sequence-to-node-sequence(([ ElementContentUnit₁ ]_{ElementContent} , text { "" }, ..., text { "" }, [ ElementContentUnit_n]_{ElementContent}))
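The two rules for element-content units can be sketched in Python. This is an illustrative model only: the function name `normalize_element_content` and the representation of already-normalized units as plain strings are assumptions made for this example, not part of the Formal Semantics.

```python
from typing import List

EMPTY_TEXT = 'text { "" }'  # the empty text node used as a separator


def normalize_element_content(units: List[str]) -> str:
    """Model of the two []_{ElementContent-unit} rules.

    `units` holds the already-normalized Core expression for each
    element-content unit (a hypothetical string representation).
    """
    if not units:
        raise ValueError("expected at least one element-content unit")
    if len(units) == 1:
        # Exactly one unit: the conversion function is not applied here.
        return units[0]
    # More than one unit: interleave empty text nodes, then convert.
    interleaved: List[str] = []
    for index, unit in enumerate(units):
        if index > 0:
            interleaved.append(EMPTY_TEXT)
        interleaved.append(unit)
    return "fs:item-sequence-to-node-sequence((" + ", ".join(interleaved) + "))"


print(normalize_element_content(["1", "2"]))
# prints: fs:item-sequence-to-node-sequence((1, text { "" }, 2))
```

With a single unit the function returns that unit unchanged, which mirrors the first rule and leaves the unit's static type visible.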

We need to distinguish between multiple element-content units because the rules for converting sequences of atomic values into strings apply to sequences within distinct enclosed expressions. The empty text nodes are eliminated during evaluation of fs:item-sequence-to-node-sequence, when consecutive text nodes are coalesced into a single text node. Each empty text node guarantees that a whitespace character will not be inserted between atomic values computed by distinct enclosed expressions. For example, here is an expression, its normalization, and the resulting XML value:

<example>{ 1 }{ 2 }</example>
 ==
element example { fs:item-sequence-to-node-sequence ((1, text {""}, 2)) }
 ==>
<example>12</example>

In the absence of the empty text node, the expression would evaluate to the following incorrect value:

<example>{ 1 }{ 2 }</example>
 (incorrect normalization) ==
element example { fs:item-sequence-to-node-sequence ((1, 2)) }
 (incorrect value) ==>
<example>1 2</example>
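The effect of the empty text node can be reproduced with a small Python model. It is a sketch under stated assumptions: the `Text` class and the function `item_sequence_to_node_sequence` are hypothetical stand-ins that handle only atomic values and text nodes, and they model only the two behaviors discussed here (joining adjacent atomic values with a space, and coalescing adjacent text nodes). The actual function is specified in [7.1.5 The fs:item-sequence-to-node-sequence function].

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class Text:
    """A text node (hypothetical stand-in for the data model's text node)."""
    content: str


Item = Union[int, str, Text]  # atomic values or text nodes only, for brevity


def item_sequence_to_node_sequence(items: List[Item]) -> List[Text]:
    # Step 1: turn each run of adjacent atomic values into one text node,
    # joining the values with a single space.
    nodes: List[Text] = []
    run: List[str] = []
    for item in items:
        if isinstance(item, Text):
            if run:
                nodes.append(Text(" ".join(run)))
                run = []
            nodes.append(item)
        else:
            run.append(str(item))
    if run:
        nodes.append(Text(" ".join(run)))
    # Step 2: coalesce adjacent text nodes. Every node in this restricted
    # model is a text node, so they all merge; empty text nodes vanish.
    merged = "".join(node.content for node in nodes)
    return [Text(merged)] if merged else []


print(item_sequence_to_node_sequence([1, Text(""), 2]))  # [Text(content='12')]
print(item_sequence_to_node_sequence([1, 2]))            # [Text(content='1 2')]
```

The separator splits the atomic values into two runs, so no space is inserted between them.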

Now that we have explained the normalization rules for direct element content, we give the rules for the two forms of direct XML element constructors. Note that the direct attribute constructors are normalized twice: the rule []_{NamespaceAttrs} normalizes the namespace-declaration attributes, and the rule []_Attribute normalizes all other attributes.

[ < QName AttributeList > DirElemContent* </ QName S? > ]_Expr

element QName { [ AttributeList ]_{NamespaceAttrs} , [ AttributeList ]_Attribute , [ DirElemContent* ]_{ElementContent} }

[ < QName AttributeList /> ]_Expr

element QName { [ AttributeList ]_{NamespaceAttrs} , [ AttributeList ]_Attribute }

Next, we give the normalization rules for each element-content unit. The normalization rule for a contiguous sequence of characters assumes:

that the significant whitespace characters in element constructors have been preserved, as described in [4.7.1.4 Whitespace in Element Content];
that character references have been resolved to individual characters and predefined entity references have been resolved to sequences of characters; and
that the rule is applied to the longest contiguous sequence of characters.

The following normalization rule takes the longest consecutive sequence of individual characters that include literal characters, escaped curly braces, character references, and predefined entity references and normalizes the character sequence as a text node containing the string of characters.

[DirCharsUnit]_{ElementContent}

text { fn:codepoints-to-string(DirCharsUnit) }

XML processing instructions and comments in element content are normalized by applying the standard normalization rules for expressions, which appear in [4.7.2 Other Direct Constructors].

[DirPIConstructor]_{ElementContent}

[DirPIConstructor]_Expr

[DirCommentConstructor]_{ElementContent}

[DirCommentConstructor]_Expr

An enclosed expression in element content is normalized by normalizing each individual expression in its expression sequence and then constructing a sequence of the normalized values:

[ { Expr₁, ..., Expr_n } ]_{ElementContent}

[ Expr₁ ]_Expr , ..., [ Expr_n]_Expr

Static Type Analysis

There are no additional static type rules for direct XML element or attribute constructors.

Dynamic Evaluation

There are no additional dynamic evaluation rules for direct XML element or attribute constructors.

4.7.1.1 Attributes

Like literal XML element constructors, literal XML attribute constructors are normalized to computed attribute constructors.

Notation

The auxiliary mapping rules []_Attribute, []_{AttributeContent}, and []_{AttributeContent-unit} are defined in this section and are used for the normalization of the content of direct attribute constructors.

Normalization

Direct attributes may contain namespace-declaration attributes. The normalization rules for attributes ignore namespace-declaration attributes -- they are handled by the normalization rules in [4.7.1.2 Namespace Declaration Attributes].

An AttributeList is normalized by the following rule, which maps each of the individual attribute-value expressions in the attribute list and constructs a sequence of the normalized values.

[

QName₁ S? = S? '"' AttributeValue₁ '"'

...

QName_n S? = S? '"' AttributeValue_n '"'

]_Attribute

([QName₁ S? = S? '"' AttributeValue₁ '"']_Attribute

...,

[QName_n S? = S? '"' AttributeValue_n '"']_Attribute)

Namespace-declaration attributes, i.e., those attributes whose prefix is xmlns, are ignored by mapping them to the empty sequence.

[Prefix:LocalPart S? = S? '"' AttributeValue '"']_Attribute

(Prefix = xmlns)

()

All attributes that are not namespace-declaration attributes are mapped to computed attribute constructors.

[Prefix:LocalPart S? = S? '"' AttributeValue '"']_Attribute

not(Prefix = xmlns)

attribute [Prefix:LocalPart ]_Expr { [AttributeValue]_{AttributeContent}}

As with literal XML elements, we need to distinguish between direct attribute constructors that contain one attribute-content unit and those that contain multiple attribute-content units, because the rules for converting sequences of atomic values into strings are applied to sequences within distinct enclosed expressions. If the direct attribute constructor contains exactly one attribute-content unit, we simply normalize that unit by applying the normalization rule for the attribute content:

[ AttributeValueContent₁ ]_{AttributeContent-unit}

[AttributeValueContent₁]_{AttributeContent}

If the direct attribute constructor contains more than one attribute-content unit, we normalize each unit individually and construct a sequence of the normalized results interleaved with empty text nodes. The empty text nodes guarantee that the results of evaluating consecutive attribute-content units can be distinguished. Then we apply the function fs:item-sequence-to-untypedAtomic, which applies the appropriate conversion rules to the normalized attribute content:

[ AttributeValueContent₁ ..., AttributeValueContent_n ]_{AttributeContent-unit}, n > 1

fs:item-sequence-to-untypedAtomic(([ AttributeValueContent₁ ]_{AttributeContent} , text { "" }, ..., text {""}, [ AttributeValueContent_n]_{AttributeContent}))

Literal characters, escaped curly braces, character references, and predefined entity references in attribute content are treated as in element content. In addition, the normalization rule for characters in attributes assumes:

that an escaped single or double quote is converted to an individual single or double quote.

The following normalization rule takes the longest consecutive sequence of individual characters that include literal characters, escaped curly braces, character references, predefined entity references, and escaped single and double quotes, and normalizes the character sequence as a string.

[( Char | CharRef | EscapeQuot | EscapeApos | PredefinedEntityRef ) +]_{AttributeContent}

fn:codepoints-to-string(( Char | CharRef | EscapeQuot | EscapeApos | PredefinedEntityRef )+)

We normalize an enclosed expression in attribute content by normalizing each individual expression in its expression sequence and then constructing a sequence of the normalized values:

[ { Expr₀, ..., Expr_n } ]_{AttributeContent}

([ Expr₀ ]_Expr , ..., [ Expr_n]_Expr)

4.7.1.2 Namespace Declaration Attributes

Notation

The auxiliary mapping rules []_{NamespaceAttr} and []_{NamespaceAttrs} are defined in this section and are used for the normalization of namespace-declaration attributes.

Normalization

Direct attributes may contain namespace-declaration attributes. The normalization rules for namespace-declaration attributes ignore all non-namespace attributes -- they are handled by the normalization rules in [4.7.1.1 Attributes].

An AttributeList containing namespace-declaration attributes is normalized by the following rule, which maps each of the individual namespace-declaration attributes in the attribute list and constructs a sequence of the normalized namespace attribute values.

[

QName₁ S? = S? '"' AttributeValue₁ '"'

...

QName_n S? = S? '"' AttributeValue_n '"'

]_{NamespaceAttrs}

([QName₁ S? = S? '"' AttributeValue₁ '"']_{NamespaceAttr}

...,

[QName_n S? = S? '"' AttributeValue_n '"']_{NamespaceAttr})

Attributes whose prefix is not xmlns are ignored by mapping them to the empty sequence.

[Prefix:LocalPart S? = S? '"' AttributeValue '"']_{NamespaceAttr}

not (Prefix = xmlns)

()

Namespace-declaration attributes are normalized to local namespace declarations (CompElemNamespace).

[Prefix:LocalPart S? = S? '"' AttributeValue '"']_{NamespaceAttr}

(Prefix = xmlns)

namespace LocalPart { [AttributeValue]_{AttributeContent}}
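The two passes over the same attribute list can be sketched in Python. This is a minimal model under stated assumptions: attributes are represented as hypothetical `(prefix, local part, value)` triples, every attribute name is prefixed (as in the rules above), and each attribute value is a plain literal string with no enclosed expressions.

```python
from typing import List, Tuple

Attr = Tuple[str, str, str]  # (prefix, local part, attribute value)


def normalize_attribute(attr: Attr) -> List[str]:
    """[]_Attribute: drop namespace declarations, keep everything else."""
    prefix, local, value = attr
    if prefix == "xmlns":
        return []  # mapped to the empty sequence
    return [f'attribute {prefix}:{local} {{ "{value}" }}']


def normalize_namespace_attr(attr: Attr) -> List[str]:
    """[]_{NamespaceAttr}: keep namespace declarations, drop everything else."""
    prefix, local, value = attr
    if prefix != "xmlns":
        return []  # mapped to the empty sequence
    return [f'namespace {local} {{ "{value}" }}']


def normalize_attribute_list(attrs: List[Attr]) -> List[str]:
    """Namespace declarations first, then ordinary attributes."""
    namespaces = [e for a in attrs for e in normalize_namespace_attr(a)]
    ordinary = [e for a in attrs for e in normalize_attribute(a)]
    return namespaces + ordinary


attrs = [("p", "id", "7"), ("xmlns", "p", "http://example.org/p")]
for expr in normalize_attribute_list(attrs):
    print(expr)
# prints:
# namespace p { "http://example.org/p" }
# attribute p:id { "7" }
```

Each attribute is consumed by exactly one of the two mappings, so running both over the full list neither loses nor duplicates an attribute.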

4.7.1.3 Content

The rules for normalizing element content are given above in [4.7.1 Direct Element Constructors].

4.7.1.4 Whitespace in Element Content

Section 3.7.1.4 Boundary Whitespace^XQ describes how whitespace in element and attribute constructors is processed depending on the value of the xmlspace declaration in the query prolog. The Formal Semantics assumes that the rules for handling whitespace are applied prior to the normalization rules, for example, during parsing of a query. Therefore, there are no formal rules for handling whitespace.

4.7.2 Other Direct Constructors

Other Constructors

[105 (XQuery)]	`DirPIConstructor^XQ`	::=	`"<?" PITarget (S DirPIContents)? "?>"`
[106 (XQuery)]	`DirPIContents^XQ`	::=	`(Char* - (Char* '?>' Char*))`
[103 (XQuery)]	`DirCommentConstructor^XQ`	::=	`"<!--" DirCommentContents "-->"`
[104 (XQuery)]	`DirCommentContents^XQ`	::=	`((Char - '-') \| ('-' (Char - '-')))*`

Normalization

A literal XML processing instruction is normalized into a computed processing-instruction constructor; its character content is converted to a string as in attribute content.

[<? NCName Char* ?>]_Expr

[processing-instruction NCName { Char* }]_Expr

A literal XML comment is normalized into a computed comment constructor; its character content is converted to a string as in attribute content.

[<!-- Char* -->]_Expr

[comment { Char* }]_Expr

Static Type Analysis

There are no additional static type rules for direct processing-instruction or comment constructors.

Dynamic Evaluation

There are no additional dynamic evaluation rules for direct processing-instruction or comment constructors.

4.7.3 Computed Constructors

Computed Constructors

4.7.3.1 Computed Element Constructors

Introduction

This section describes the semantics of computed element constructors. Remember that direct element constructors are normalized into computed element constructors. This document does not formally specify how namespaces are copied. The semantics of namespace copying in element constructors can be found in [XQuery 1.0: An XML Query Language].

[111 (XQuery)]	`CompElemConstructor^XQ`	::=	`"element" (QName \| ("{" Expr "}")) "{" ContentExpr? "}"`
[112 (XQuery)]	`ContentExpr^XQ`	::=	`Expr`

Notation

Local namespace declarations may occur explicitly in a computed element constructor or may be the result of normalizing namespace-declaration attributes contained in direct element constructors. For local namespace declarations that occur explicitly in a query, the immediately enclosing expression of the local namespace declaration (CompElemNamespace) must be a computed element constructor; otherwise, as specified in [XPath/XQuery], a static error is raised.

Core Grammar

The Core grammar productions for computed element constructors are:

Computed Element Constructors

[75 (Core)]	`CompElemConstructor`	::=	`"element" (QName \| ("{" Expr "}")) "{" ContentExpr "}"`
[76 (Core)]	`ContentExpr`	::=	`Expr`

Normalization

If the content expression is missing, the computed element constructor is normalized as if its content expression were the empty sequence.

[element QName { }]_Expr

[element QName { () }]_Expr

Computed element constructors are normalized by applying the fs:item-sequence-to-node-sequence function to their content expression.

[element QName { Expr }]_Expr

element QName { fs:item-sequence-to-node-sequence(([Expr]_Expr)) }

When the name of the element is also computed, the normalization rule applies atomization to the name expression.

[element { Expr₁ } { Expr₂ }]_Expr

element { fn:data(([Expr₁]_Expr)) }{ fs:item-sequence-to-node-sequence(([Expr₂]_Expr)) }

Static Type Analysis

The normalization rules for direct element and attribute constructors leave us with only the computed forms of constructors. The static semantics for constructors is defined on all the computed forms. The computed element constructor itself has two forms: one in which the element name is a literal QName, and the other in which the element name is a computed expression.

A computed element constructor creates a new element with either the type annotation^XQ xdt:untyped (in strip construction mode), or with the type annotation^XQ xs:anyType (in preserve construction mode). The content expression must return a sequence of nodes with attribute nodes at the beginning.

statEnv.constructionMode = preserve

statEnv |- Expr : Type

statEnv |- Type <: attribute *, (element | text | comment | processing-instruction) *

statEnv |- element QName { Expr } : element QName of type xs:anyType

statEnv.constructionMode = strip

statEnv |- Expr : Type

statEnv |- Type <: attribute *, (element | text | comment | processing-instruction) *

statEnv |- element QName { Expr } : element QName of type xdt:untyped

In case the element name is computed as well, the name expression must be of type xs:QName, xs:string, or xdt:untypedAtomic.

statEnv.constructionMode = preserve

statEnv |- Expr₁ : Type₁

statEnv |- Type₁ <: (xs:QName | xs:string | xdt:untypedAtomic)

statEnv |- Expr₂ : Type₂

statEnv |- Type₂ <: attribute *, (element | text | comment | processing-instruction) *

statEnv |- element { Expr₁ } { Expr₂ } : element of type xs:anyType

statEnv.constructionMode = strip

statEnv |- Expr₁ : Type₁

statEnv |- Type₁ <: (xs:QName | xs:string | xdt:untypedAtomic)

statEnv |- Expr₂ : Type₂

statEnv |- Type₂ <: attribute *, (element | text | comment | processing-instruction) *

statEnv |- element { Expr₁ } { Expr₂ } : element of type xdt:untyped
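How the construction mode selects the type annotation in the four rules above can be summarized in a short Python sketch. The function name `element_constructor_type` and the string rendering of types are assumptions made for illustration. The sketch covers only the conclusion of each rule and does not model the subtyping premises on the name and content expressions.

```python
from typing import Optional

# Type annotation given to the new element in each construction mode.
ANNOTATION_BY_MODE = {"preserve": "xs:anyType", "strip": "xdt:untyped"}


def element_constructor_type(construction_mode: str,
                             qname: Optional[str] = None) -> str:
    """Static type of a computed element constructor.

    `qname` is the literal element name, or None when the name is computed,
    in which case the element's name is not known statically.
    """
    try:
        annotation = ANNOTATION_BY_MODE[construction_mode]
    except KeyError:
        raise ValueError(
            f"unknown construction mode: {construction_mode!r}") from None
    name = f"element {qname}" if qname is not None else "element"
    return f"{name} of type {annotation}"


print(element_constructor_type("strip", "date"))  # element date of type xdt:untyped
print(element_constructor_type("preserve"))       # element of type xs:anyType
```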

Dynamic Evaluation

The following rules take a computed element constructor expression and construct an element node. The dynamic semantics for computed element constructors is the most complex of all expressions in XQuery. Here is how to read the rule below.

First, the element's content expression is partitioned into the local namespace declarations and all other expressions, and the local namespace declarations are evaluated, yielding a sequence of namespace bindings. The static environment is extended to include the new namespace bindings, which are all active. In Section 3.7.1.2 Namespace Declaration Attributes^XQ, it is implementation-defined whether undeclaration of namespace prefixes (by setting the namespace prefix to the zero-length string) in an element constructor is supported. In the dynamic semantics below, we assume all local namespace declarations declare a binding of a prefix to a URI.

Second, the function fs:item-sequence-to-node-sequence is applied to the element's content expression (excluding local namespace declarations); this function call is evaluated in the new static and dynamic environment. Recall from [4.7.1 Direct Element Constructors] that during normalization, we do not convert the content of direct element constructors that contain one element-content unit. This guarantees that useful type information is preserved for static analysis. Since the conversion function fs:item-sequence-to-node-sequence was not applied to all element constructors during normalization, we have to apply it at evaluation time. (Obviously, it is possible to elide the application of fs:item-sequence-to-node-sequence injected during normalization and the application injected during evaluation.) The resulting value Value₀ must match zero-or-more attributes followed by zero-or-more element, text, processing-instruction or comment nodes.

Third, the namespace bindings are concatenated with the list of active namespaces in the namespace environment statEnv.namespace and the namespaces corresponding to the element's name and all attribute names. The resulting sequence is the sequence of namespace bindings for the element.

Expr = CompElemNamespace₁, ..., CompElemNamespace_n, (Expr₀)

CompElemNamespace₁ = namespace NCName₁ { URI₁ }

...

CompElemNamespace_n = namespace NCName_n { URI_n }

statEnv₁ = statEnv + namespace(NCName₁ => (active, URI₁))

...

statEnv_n = statEnv_n-1 + namespace(NCName_n => (active, URI_n))

statEnv_n; dynEnv |- fs:item-sequence-to-node-sequence(Expr₀) => Value₀

statEnv |- Value₀ matches (attribute*, (element | text | processing-instruction | comment)*)

NamespaceBindings = CompElemNamespace₁, ..., CompElemNamespace_n, fs:active_ns(statEnv.namespace), fs:get_static_ns_from_items(statEnv, Value₀)

statEnv; dynEnv |- element QName { Expr } => Value₀

The dynamic evaluation of an element constructor with a computed name is similar. There is one additional rule that checks that the value of the element's name expression matches xs:QName.

dynEnv |- Expr₁ => Value₀ statEnv |- Value₀ matches xs:QName

Expr₂ = CompElemNamespace₁, ..., CompElemNamespace_n, (Expr₃)

CompElemNamespace₁ = namespace NCName₁ { URI₁ }

...

CompElemNamespace_n = namespace NCName_n { URI_n }

statEnv₁ = statEnv + namespace(NCName₁ => (active, URI₁))

...

statEnv_n = statEnv_n-1 + namespace(NCName_n => (active, URI_n))

statEnv_n; dynEnv |- fs:item-sequence-to-node-sequence(Expr₃) => Value₁

statEnv_n |- Value₁ matches (attribute*, (element | text | processing-instruction | comment)*)

NamespaceBindings = CompElemNamespace₁, ..., CompElemNamespace_n, fs:active_ns(statEnv.namespace), fs:get_static_ns_from_items(statEnv, Value₁)

statEnv; dynEnv |- element { Expr₁ } { Expr₂ } => Value₁
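Two premises of these rules lend themselves to a small Python sketch: extending the namespace environment with the local namespace declarations, and checking that the content value matches attribute*, (element | text | processing-instruction | comment)*. The names `extend_namespaces` and `matches_element_content`, the dictionary used for the namespace environment, and the encoding of node kinds as single letters are all assumptions made for this example.

```python
import re
from typing import Dict, List, Tuple

# One letter per node kind so the content model can be checked with a regex.
KIND_CODE = {"attribute": "a", "element": "e", "text": "t",
             "processing-instruction": "p", "comment": "c"}
CONTENT_MODEL = re.compile(r"a*[etpc]*")

NamespaceEnv = Dict[str, Tuple[str, str]]  # prefix -> (status, URI)


def matches_element_content(kinds: List[str]) -> bool:
    """True if kinds match attribute*, (element|text|pi|comment)*."""
    try:
        encoded = "".join(KIND_CODE[kind] for kind in kinds)
    except KeyError:
        return False  # e.g. a document node or an atomic value
    return CONTENT_MODEL.fullmatch(encoded) is not None


def extend_namespaces(env: NamespaceEnv,
                      declarations: List[Tuple[str, str]]) -> NamespaceEnv:
    """statEnv_i = statEnv_(i-1) + namespace(NCName_i => (active, URI_i))."""
    extended = dict(env)  # leave the incoming environment untouched
    for prefix, uri in declarations:
        extended[prefix] = ("active", uri)
    return extended


print(matches_element_content(["attribute", "text", "element"]))  # True
print(matches_element_content(["text", "attribute"]))             # False
```

Copying the environment before extending it reflects that the new bindings are in scope only for the element's content, not for the enclosing expression.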

4.7.3.2 Computed Attribute Constructors

[113 (XQuery)]	`CompAttrConstructor^XQ`	::=	`"attribute" (QName \| ("{" Expr "}")) "{" Expr? "}"`

Core Grammar

The Core grammar production for computed attribute constructors is:

Computed Attribute Constructors

[77 (Core)]	`CompAttrConstructor`	::=	`"attribute" (QName \| ("{" Expr "}")) "{" Expr "}"`

Normalization

Computed attribute constructors are normalized by mapping their name and content expression in a similar way as computed element constructors. The normalization rule uses the fs:item-sequence-to-untypedAtomic function.

[attribute QName { }]_Expr

[attribute QName { () }]_Expr

[attribute QName { Expr }]_Expr

attribute QName { fs:item-sequence-to-untypedAtomic(([Expr]_Expr)) }

[attribute { Expr₁ } { Expr₂ }]_Expr

attribute { fn:data(([Expr₁]_Expr)) } { fs:item-sequence-to-untypedAtomic(([Expr₂]_Expr)) }

Static Type Analysis

The normalization rules for direct attribute constructors leave us with only the computed form of the attribute constructors. As with computed element constructors, a computed attribute constructor has two forms: one in which the attribute name is a literal QName, and the other in which the attribute name is a computed expression.

In the case of attribute constructors, the type annotation^XQ is always xdt:untypedAtomic.

statEnv |- Expr : Type

statEnv |- Type <: xdt:untypedAtomic

statEnv |- attribute QName { Expr } : attribute QName of type xdt:untypedAtomic

statEnv |- Expr₁ : Type₁

statEnv |- Type₁ <: (xs:QName | xs:string | xdt:untypedAtomic)

statEnv |- Expr₂ : Type₂

statEnv |- Type₂ <: xdt:untypedAtomic

statEnv |- attribute { Expr₁ } { Expr₂ } : attribute of type xdt:untypedAtomic

Dynamic Evaluation

The following rules take a computed attribute constructor expression and construct an attribute node. The rules are similar to those for element constructors. First, the attribute's name is expanded into a qualified name. Second, the function fs:item-sequence-to-untypedAtomic is applied to the content expression and this function call is evaluated in the dynamic environment. Recall from [4.7.1.1 Attributes] that during normalization, we do not convert the content of direct attribute constructors that contain one attribute-content unit. This guarantees that useful type information is preserved for static analysis. Since the conversion function fs:item-sequence-to-untypedAtomic was not applied to all attribute constructors during normalization, we have to apply it at evaluation time. (As before, it is possible to elide the application of fs:item-sequence-to-untypedAtomic injected during normalization and the application injected during evaluation.)

statEnv |- QName of attr expands to expanded-QName

dynEnv |- fs:item-sequence-to-untypedAtomic(Expr) => Value

dynEnv |- attribute QName { Expr } => attribute expanded-QName of type xdt:untypedAtomic { Value }

dynEnv |- Expr₁ => Value₁

statEnv |- Value₁ matches xs:QName

statEnv |- QName₁ = xs:QName(Value₁)

dynEnv |- fs:item-sequence-to-untypedAtomic(Expr₂) => Value₂

dynEnv |- attribute { Expr₁ } { Expr₂ } => attribute QName₁ of type xdt:untypedAtomic { Value₂ }

4.7.3.3 Document Node Constructors

[110 (XQuery)]	`CompDocConstructor^XQ`	::=	`"document" "{" Expr "}"`

Core Grammar

The Core grammar production for a computed document constructor is:

Core computed document constructor

[74 (Core)]	`CompDocConstructor`	::=	`"document" "{" Expr "}"`

Normalization

A document node constructor contains an expression, which must evaluate to a sequence of element, text, comment, or processing-instruction nodes. Section 3.7.3.3 Document Node Constructors^XQ specifies the rules for converting a sequence of atomic values and nodes into a sequence of nodes before document construction. The built-in function [7.1.5 The fs:item-sequence-to-node-sequence function] implements this conversion.

[document { Expr }]_Expr

document { fs:item-sequence-to-node-sequence(([Expr]_Expr)) }

Static Type Analysis

The static semantics checks that the type of the argument expression is a sequence of element, text, processing-instruction, and comment nodes. The type of the entire expression is the most general document type, because the document constructor erases the type annotation^XQ of each of its content nodes.

statEnv |- Expr : Type

statEnv |- Type <: (element | text | processing-instruction | comment)*

statEnv |- document { Expr } : document { Type }

Dynamic Evaluation

The dynamic semantics checks that the argument expression evaluates to a value that is a sequence of element, text, processing-instruction, or comment nodes. The entire expression evaluates to a new document node value. If the construction mode is set to strip, the type annotation^XQ of every node in the content of the document node is erased.

statEnv.constructionMode = preserve

dynEnv |- Expr => Value

statEnv |- Value matches (element | text | processing-instruction | comment)*

dynEnv |- document { Expr } => document { Value }

statEnv.constructionMode = strip

dynEnv |- Expr => Value₁

Value₁ erases to Value₂

statEnv |- Value₂ matches (element | text | processing-instruction | comment)*

dynEnv |- document { Expr } => document { Value₂ }

4.7.3.4 Text Node Constructors

[114 (XQuery)]	`CompTextConstructor^XQ`	::=	`"text" "{" Expr "}"`

Core Grammar

The Core grammar production for a computed text constructor is:

[78 (Core)]	`CompTextConstructor`	::=	`"text" "{" Expr "}"`

Normalization

A text node constructor contains an expression, which must evaluate to an xs:string value. Section 3.7.3.4 Text Node Constructors^XQ specifies the rules for converting a sequence of atomic values into a string prior to construction of a text node. Each node is replaced by its string value. For each adjacent sequence of one or more atomic values returned by an enclosed expression, an untyped atomic value is constructed, containing the canonical lexical representation of all the atomic values, with a single blank character inserted between adjacent values. As a formal specification of these conversion rules is not instructive, [7.1.6 The fs:item-sequence-to-untypedAtomic function] implements this conversion.

[text { Expr }]_Expr

text { (fs:item-sequence-to-untypedAtomic-text(fn:data(([Expr]_Expr)))) cast as xs:string? }

Static Type Analysis

The static semantics checks that the argument expression has type xs:string or empty. The type of the entire expression is an optional text node type, as the text node constructor returns the empty sequence if its argument is the empty sequence.

statEnv |- Expr : xs:string?

statEnv |- text { Expr } : text?

Dynamic Evaluation

If the argument expression returns the empty sequence, the text node constructor returns the empty sequence.

dynEnv |- Expr => ()

dynEnv |- text { Expr } => ()

If the argument expression returns a value of type xs:string, the text node constructor returns a text node with that string as content.

dynEnv |- Expr => Value statEnv |- Value matches xs:string

dynEnv |- text { Expr } => text { Value }
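The two dynamic rules, together with the blank-separated conversion described under Normalization, can be modeled in a few lines of Python. This is a sketch with assumed names (`atomics_to_string`, `text_constructor`): `None` stands for the empty sequence, a dictionary stands for a text node, and Python's `str` is used in place of the canonical lexical representation, which is accurate only for simple values such as integers and strings.

```python
from typing import List, Optional


def atomics_to_string(values: List[object]) -> Optional[str]:
    """Join atomic values with single blanks; empty input stays empty."""
    if not values:
        return None  # models the empty sequence ()
    return " ".join(str(value) for value in values)


def text_constructor(content: Optional[str]) -> Optional[dict]:
    """text { Expr }: () yields (), a string yields a text node."""
    if content is None:
        return None
    return {"kind": "text", "content": content}


print(text_constructor(atomics_to_string([1, 2, "a"])))
# prints: {'kind': 'text', 'content': '1 2 a'}
print(text_constructor(atomics_to_string([])))
# prints: None
```

Returning the empty sequence for empty input is what makes the static type of the constructor the optional type text? rather than text.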

4.7.3.5 Computed Processing Instruction Constructors

[116 (XQuery)]	`CompPIConstructor^XQ`	::=	`"processing-instruction" (NCName \| ("{" Expr "}")) "{" Expr? "}"`

Core Grammar

The Core grammar production for computed processing-instruction constructors is:

[80 (Core)]	`CompPIConstructor`	::=	`"processing-instruction" (NCName \| ("{" Expr "}")) "{" Expr? "}"`

Normalization

Computed processing-instruction constructors are normalized by mapping their name and content expression in the same way that computed element and attribute constructors are normalized.

[processing-instruction NCName { }]_Expr

[processing-instruction NCName { () }]_Expr

[processing-instruction NCName { Expr }]_Expr

processing-instruction NCName { fs:item-sequence-to-untypedAtomic-PI(([Expr]_Expr)) }

[processing-instruction { Expr₁ } { Expr₂ }]_Expr

processing-instruction { fn:data(([Expr₁]_Expr)) } { fs:item-sequence-to-untypedAtomic-PI(([Expr₂]_Expr)) }

Static Type Analysis

The static typing rules for processing-instruction constructors are straightforward.

statEnv |- Expr : xdt:untypedAtomic

statEnv |- processing-instruction NCName { Expr } : processing-instruction

statEnv |- Expr₁ : (xs:NCName | xs:string | xdt:untypedAtomic) statEnv |- Expr₂ : xdt:untypedAtomic

statEnv |- processing-instruction { Expr₁ } { Expr₂ } : processing-instruction

Dynamic Evaluation

The dynamic evaluation rules for computed processing instructions are straightforward.

dynEnv |- Expr => Value statEnv |- Value matches xdt:untypedAtomic

dynEnv |- processing-instruction NCName { Expr } => processing-instruction NCName { Value }

dynEnv |- Expr₁ => Value₁

statEnv |- Value₁ matches xs:NCName

dynEnv |- xs:NCName(Value₁) => NCName₁

dynEnv |- Expr₂ => Value₂ statEnv |- Value₂ matches xdt:untypedAtomic

dynEnv |- processing-instruction { Expr₁ } { Expr₂ } => processing-instruction NCName₁ { Value₂ }

dynEnv |- Expr₁ => Value₁

statEnv |- Value₁ matches xs:string

dynEnv |- xs:NCName(Value₁) => NCName₁

dynEnv |- Expr₂ => Value₂ statEnv |- Value₂ matches xdt:untypedAtomic

dynEnv |- processing-instruction { Expr₁ } { Expr₂ } => processing-instruction NCName₁ { Value₂ }

dynEnv |- Expr₁ => Value₁

statEnv |- Value₁ matches xdt:untypedAtomic

dynEnv |- xs:NCName(Value₁) => NCName₁

dynEnv |- Expr₂ => Value₂ statEnv |- Value₂ matches xdt:untypedAtomic

dynEnv |- processing-instruction { Expr₁ } { Expr₂ } => processing-instruction NCName₁ { Value₂ }

4.7.3.6 Computed Comment Constructors

[115 (XQuery)]	`CompCommentConstructor^XQ`	::=	`"comment" "{" Expr "}"`

Core Grammar

The Core grammar production for computed comment constructors is:

[79 (Core)]	`CompCommentConstructor`	::=	`"comment" "{" Expr "}"`

Normalization

Computed comment constructors are normalized by mapping their content expression.

[comment { Expr }]_Expr

comment { (fs:item-sequence-to-untypedAtomic-comment(([Expr]_Expr))) cast as xs:string }

Static Type Analysis

The static typing rule for computed comment constructors is straightforward.

statEnv |- Expr : xs:string

statEnv |- comment { Expr } : comment

Dynamic Evaluation

The dynamic evaluation rule for computed comment constructors is straightforward.

dynEnv |- Expr => Value statEnv |- Value matches xs:string

dynEnv |- comment { Expr } => comment { Value }

4.7.4 In-scope Namespaces of a Constructed Element

The effect of in-scope namespaces on constructed elements is specified in [4.7.1 Direct Element Constructors] and [4.7.3.1 Computed Element Constructors].

4.8 [For/FLWOR] Expressions

Introduction

[XPath/XQuery] provides [For/FLWOR] expressions for iteration, for binding variables to intermediate results, and for filtering bound variables according to a predicate.

A FLWORExpr in XQuery 1.0 consists of a sequence of ForClauses and LetClauses, followed by an optional WhereClause, followed by an optional OrderByClause, as described by the following grammar productions. Each variable binding may carry an optional type declaration, which specifies the type expected for the variable.

The dynamic semantics of the ordering mode in FLWOR expressions is not specified formally, as it would require the introduction of tuples, which are not supported in the [XPath/XQuery] data model.

[For/FLWOR] Expressions

[33 (XQuery)]	`FLWORExpr^XQ`	::=	`(ForClause \| LetClause)+ WhereClause? OrderByClause? "return" ExprSingle`
[34 (XQuery)]	`ForClause^XQ`	::=	`"for" "$" VarName TypeDeclaration? PositionalVar? "in" ExprSingle ("," "$" VarName TypeDeclaration? PositionalVar? "in" ExprSingle)*`
[36 (XQuery)]	`LetClause^XQ`	::=	`"let" "$" VarName TypeDeclaration? ":=" ExprSingle ("," "$" VarName TypeDeclaration? ":=" ExprSingle)*`
[118 (XQuery)]	`TypeDeclaration^XQ`	::=	`"as" SequenceType`
[35 (XQuery)]	`PositionalVar^XQ`	::=	`"at" "$" VarName`
[37 (XQuery)]	`WhereClause^XQ`	::=	`"where" ExprSingle`
[38 (XQuery)]	`OrderByClause^XQ`	::=	`(("order" "by") \| ("stable" "order" "by")) OrderSpecList`
[39 (XQuery)]	`OrderSpecList^XQ`	::=	`OrderSpec ("," OrderSpec)*`
[40 (XQuery)]	`OrderSpec^XQ`	::=	`ExprSingle OrderModifier`
[41 (XQuery)]	`OrderModifier^XQ`	::=	`("ascending" \| "descending")? ("empty" ("greatest" \| "least"))? ("collation" URILiteral)?`
[4 (XPath)]	`ForExpr^XP`	::=	`SimpleForClause "return" ExprSingle`
[5 (XPath)]	`SimpleForClause^XP`	::=	`"for" "$" VarName "in" ExprSingle ("," "$" VarName "in" ExprSingle)*`

Core Grammar

The Core grammar productions for FLWOR expressions are:

For Expressions

[32 (Core)]	`FLWORExpr`	::=	`(ForClause \| LetClause) "return" ExprSingle`
[33 (Core)]	`ForClause`	::=	`"for" "$" VarName TypeDeclaration? PositionalVar? "in" ExprSingle`
[35 (Core)]	`LetClause`	::=	`"let" "$" VarName TypeDeclaration? ":=" ExprSingle`
[34 (Core)]	`PositionalVar`	::=	`"at" "$" VarName`
[82 (Core)]	`TypeDeclaration`	::=	`"as" SequenceType`
[36 (Core)]	`OrderByClause`	::=	`(("order" "by") \| ("stable" "order" "by")) OrderSpecList`
[37 (Core)]	`OrderSpecList`	::=	`OrderSpec ("," OrderSpec)*`
[38 (Core)]	`OrderSpec`	::=	`ExprSingle OrderModifier`
[39 (Core)]	`OrderModifier`	::=	`("ascending" \| "descending")? ("empty" ("greatest" \| "least"))? ("collation" URILiteral)?`

4.8.1 FLWOR expressions

Notation

For convenience, we introduce the following auxiliary grammar productions.

[87 (Formal)]	`OptTypeDeclaration`	::=	`TypeDeclaration?`
[88 (Formal)]	`OptPositionalVar`	::=	`PositionalVar?`

Notation

Individual [For/FLWOR] clauses are normalized by means of the auxiliary normalization rules:

[FLWORClause]_FLWOR(Expr)

where FLWORClause can be a ForClause, a LetClause, a WhereClause, or an OrderByClause. The OrderByClause is discussed in [4.8.4 Order By and Return Clauses].

Normalized FLWOR expressions restrict a For and Let clause to bind only one variable. Otherwise, the Core FLWOR expression is the same as the XQuery FLWOR expression.

Notation

The auxiliary rule []_FLWOR(Expr) normalizes a For, Let, or Where clause in a FLWORExpr expression. Note that the rule takes the remainder of the FLWOR expression (other For, Let, or Where clauses and the Return clause) as a parameter in Expr.

Normalization

The [For/FLWOR] expressions include the FLWORExpr of XQuery and the ForExpr of XPath. The normalization rule for ForExpr is simple: it unrolls a ForExpr that binds multiple variables into nested ForExprs, each of which binds one variable.

[for VarRef₀ in Expr₀, ..., VarRef_n in Expr_n return Expr ]_Expr

for VarRef₀ in [Expr₀]_Expr return

...

for VarRef_n in [Expr_n]_Expr return

[Expr]_Expr

Full FLWORExpr expressions are normalized to nested Core expressions using two sets of normalization rules. Note that some of the rules also accept ungrammatical FLWORExprs such as "where Expr₁ return Expr₂". This does not matter, as normalization is always applied on parsed [XPath/XQuery] expressions, and ungrammatical FLWORExprs would be rejected by the parser beforehand.

The first set of rules is applied on a full [For/FLWOR] expression, splitting it at the clause level, then applying further normalization on each separate clause.

[ (ForClause | LetClause | WhereClause | OrderByClause) FLWORExpr ]_Expr

[(ForClause | LetClause | WhereClause | OrderByClause)]_FLWOR([FLWORExpr]_Expr)

[ (ForClause | LetClause | WhereClause | OrderByClause) return Expr ]_Expr

[(ForClause | LetClause | WhereClause | OrderByClause)]_FLWOR([Expr]_Expr)

Then each [For/FLWOR] clause is normalized separately. A ForClause may bind more than one variable, whereas a For expression in the [XPath/XQuery] Core binds and iterates over only one variable. Therefore, a ForClause is normalized to nested for expressions:

[

for VarRef₁ OptTypeDeclaration₁ OptPositionalVar₁ in Expr₁,

···,

VarRef_n OptTypeDeclaration_n OptPositionalVar_n in Expr_n

] _FLWOR(Expr)

for VarRef₁ OptTypeDeclaration₁ OptPositionalVar₁ in [Expr₁]_Expr return

···

for VarRef_n OptTypeDeclaration_n OptPositionalVar_n in [ Expr_n ]_Expr return Expr

Note that the additional Expr parameter of the auxiliary normalization rule is used as the final return expression.

Likewise, a LetClause clause is normalized to nested let expressions, each of which binds one variable:

[

let VarRef₁ OptTypeDeclaration₁ := Expr₁,

···,

VarRef_n OptTypeDeclaration_n := Expr_n

]_FLWOR(Expr)

let VarRef₁ OptTypeDeclaration₁ := [Expr₁ ]_Expr return

···

let VarRef_n OptTypeDeclaration_n := [Expr_n]_Expr return Expr

A WhereClause is normalized to an IfExpr, with the else-branch returning the empty sequence:

[ where Expr₁]_FLWOR(Expr)

if ( [Expr₁]_Expr ) then Expr else ()
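
The clause-level rules above can be sketched in Python as a rewrite over a list of clauses. This is only an illustration under simplifying assumptions: the function name `normalize_flwor` and the tuple encoding of clauses are hypothetical, sub-expressions are kept as opaque strings (their own normalization [Expr]_Expr is not modelled), type declarations are carried along inside the variable string, and OrderByClause is left out.

```python
# Illustrative sketch only; normalize_flwor and the clause encoding are
# hypothetical and not part of the specification.

def normalize_flwor(clauses, return_expr):
    """Rewrite a list of FLWOR clauses into a nested Core expression string.

    Each clause is one of:
      ("for",   [(var, expr), ...])   # may bind several variables
      ("let",   [(var, expr), ...])   # may bind several variables
      ("where", condition)
    """
    # Start from the return expression and wrap it clause by clause,
    # innermost clause first, mirroring the Expr parameter of []_FLWOR(Expr).
    result = return_expr
    for kind, payload in reversed(clauses):
        if kind == "where":
            # where -> if ... then Expr else ()
            result = f"if ({payload}) then {result} else ()"
        elif kind == "for":
            # one nested for expression per bound variable
            for var, expr in reversed(payload):
                result = f"for {var} in {expr} return {result}"
        elif kind == "let":
            # one nested let expression per bound variable
            for var, expr in reversed(payload):
                result = f"let {var} := {expr} return {result}"
        else:
            raise ValueError(f"unknown clause kind: {kind}")
    return result


core = normalize_flwor(
    [
        ("for", [("$i as xs:integer", "(1, 2)"), ("$j", "(3, 4)")]),
        ("let", [("$k", "$i + $j")]),
        ("where", "$k >= 5"),
    ],
    "<tuple/>",
)
print(core)
```

Applied to the clauses of the example in this section, with the element constructor abbreviated to `<tuple/>`, the sketch produces the same nesting of for, let, and if as the normalized Core expression.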

Example

The following simple example illustrates how a FLWORExpr is normalized. The for expression in the example below is used to iterate over two collections, binding variables $i and $j to items in these collections. It uses a let clause to bind the local variable $k to the sum of both numbers, and a where clause to select only those pairs whose sum is equal to or greater than the integer 5.

  for $i as xs:integer in (1, 2),
      $j in (3, 4)
  let $k := $i + $j
  where $k >= 5
  return
    <tuple>
       <i> { $i } </i>
       <j> { $j } </j>
    </tuple>

By the first set of rules, this is normalized to (except for the operators and element constructor which are not treated here):

  for $i as xs:integer in (1, 2) return
    for $j in (3, 4) return
      let $k := $i + $j return
        if ($k >= 5) then 
          <tuple>
            <i> { $i } </i>
            <j> { $j } </j>
          </tuple>
        else
          ()

For each binding of $i to an item in the sequence (1 , 2) the inner for expression iterates over the sequence (3 , 4) to produce tuples ordered by the ordering of the outer sequence and then by the ordering of the inner sequence. This Core expression eventually results in the following document fragment:

  (<tuple>
      <i>1</i>
      <j>4</j>
   </tuple>,
   <tuple>
      <i>2</i>
      <j>3</j>
   </tuple>,
   <tuple>
      <i>2</i>
      <j>4</j>
   </tuple>)
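
As a cross-check, the normalized Core expression can be mirrored in Python. This is a minimal sketch and not part of the formal semantics: sequences are modelled as Python lists, each tuple element is modelled as a pair `(i, j)`, and the function name `example_tuples` is hypothetical.

```python
def example_tuples():
    """Mirror the nested for / let / if structure of the normalized example."""
    out = []
    for i in (1, 2):                  # for $i as xs:integer in (1, 2) return
        for j in (3, 4):              #   for $j in (3, 4) return
            k = i + j                 #     let $k := $i + $j return
            # if ($k >= 5) then <tuple>...</tuple> else ()
            out.extend([(i, j)] if k >= 5 else [])
    return out


tuples = example_tuples()
print(tuples)
```

The pair (1, 3) is dropped because its sum is 4, which leaves three pairs in the order of the outer sequence and then of the inner sequence.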

4.8.2 For expression

Static Type Analysis

A single for expression is typed as follows: First Type₁ of the iteration expression Expr₁ is inferred. Then the prime type of Type₁, prime(Type₁), is computed. This is a union over all item types in Type₁ (See [8.4 Judgments for FLWOR and other expressions on sequences]). With the variable component of the static environment statEnv extended with VarRef₁ as type prime(Type₁), the type Type₂ of Expr₂ is inferred. Because the for expression iterates over the result of Expr₁, the final type of the iteration is Type₂ multiplied with the possible number of items in Type₁ (one, ?, *, or +). This number is determined by the auxiliary type-function quantifier(Type₁).

statEnv |- Expr₁ : Type₁

statEnv + varType(VarRef₁ => prime(Type₁)) |- Expr₂ : Type₂

statEnv |- for VarRef₁ in Expr₁ return Expr₂ : Type₂ · quantifier(Type₁)

When a positional variable Variable_pos is present, the static environment is also extended with the positional variable typed as an xs:integer.

statEnv |- Expr₁ : Type₁

statEnv + varType(VarRef₁ => prime(Type₁);VarRef_pos => xs:integer) |- Expr₂ : Type₂

statEnv |- for VarRef₁ at VarRef_pos in Expr₁ return Expr₂ : Type₂ · quantifier(Type₁)

When a type declaration is present, the static semantics also checks that the type of the input expression is a subtype of the declared type and extends the static environment by typing VarRef₁ with type Type₀. This semantics is specified by the following typing rule.

statEnv |- Expr₁ : Type₁

Type₀ = [ SequenceType ]_sequencetype

statEnv |- prime(Type₁) <: Type₀

statEnv + varType(VarRef₁ => Type₀) |- Expr₂ : Type₂

statEnv |- for VarRef₁ as SequenceType in Expr₁ return Expr₂ : Type₂ · quantifier(Type₁)

The last rule covers a for expression that contains both a type declaration and a positional variable. As before, the static environment is also extended with the positional variable typed as xs:integer.

statEnv |- Expr₁ : Type₁

Type₀ = [ SequenceType ]_sequencetype

statEnv |- prime(Type₁) <: Type₀

statEnv + varType(VarRef₁ => Type₀; VarRef_pos => xs:integer) |- Expr₂ : Type₂

statEnv |- for VarRef₁ as SequenceType at VarRef_pos in Expr₁ return Expr₂ : Type₂ · quantifier(Type₁)

Example

For example, if $example is bound to the sequence 10.0, 1.0E1, 10 of type xs:decimal, xs:float, xs:integer, then the query

  for $s in $example
  return $s * 2

is typed as follows:

  (1) prime(xs:decimal, xs:float, xs:integer) =
      xs:decimal | xs:float | xs:integer
  (2) quantifier(xs:decimal, xs:float, xs:integer) = +
  (3) $s : xs:decimal | xs:float | xs:integer
  (4) $s * 2 : 
      xs:decimal | xs:float | xs:integer
  (5) result-type :
      ( xs:decimal | xs:float | xs:integer ) +

This result-type is not the most specific type possible. It does not take into account the order of elements in the input type, and it ignores the individual and overall number of elements in the input type. The most specific type possible is: element out {element one {}}, element out {element two {}}, element out {element three {}}. However, inferring such a specific type for arbitrary input types and arbitrary return clauses requires significantly more complex type inference rules. In addition, if put into the context of an element, the specific type violates the "unique particle attribution" restriction of XML schema, which requires that an element must have a unique content model within a particular context.
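
The two auxiliary type functions used in this example can be approximated as follows. This is a rough sketch under stated assumptions, not the definition given in [8.4 Judgments for FLWOR and other expressions on sequences]: a type is modelled only as a flat sequence of (item type, occurrence indicator) pairs, so unions, interleaving, and nested groups are not handled. The function `for_type` is a hypothetical helper that assumes the body type is a plain union of item types.

```python
# Occurrence indicator -> (minimum, maximum) item count; None means unbounded.
BOUNDS = {"1": (1, 1), "?": (0, 1), "*": (0, None), "+": (1, None)}


def prime(seq_type):
    """Union of all item types in the sequence type, in first-seen order."""
    seen = []
    for item_type, _occurrence in seq_type:
        if item_type not in seen:
            seen.append(item_type)
    return seen


def quantifier(seq_type):
    """Approximate the number of items in the sequence type as 1, ?, * or +."""
    lo, hi = 0, 0
    for _item_type, occurrence in seq_type:
        item_lo, item_hi = BOUNDS[occurrence]
        lo += item_lo
        hi = None if hi is None or item_hi is None else hi + item_hi
    many = hi is None or hi > 1
    if lo >= 1:
        return "+" if many else "1"
    return "*" if many else "?"


def for_type(body_item_types, seq_type):
    """Render Type2 . quantifier(Type1) for a body typed as a plain union."""
    q = quantifier(seq_type)
    return "(" + " | ".join(body_item_types) + ")" + ("" if q == "1" else q)


example = [("xs:decimal", "1"), ("xs:float", "1"), ("xs:integer", "1")]
print(prime(example))
print(quantifier(example))
print(for_type(prime(example), example))
```

For the input type of the example, the sketch reproduces steps (1), (2), and (5): the prime type is the union of the three atomic types, the quantifier is +, and the result type is that union followed by +.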

Dynamic Evaluation

The evaluation of a for expression distinguishes two cases: If the iteration expression Expr₁ evaluates to the empty sequence, then the entire expression evaluates to the empty sequence:

dynEnv |- Expr₁ => ()

dynEnv |- for VarRef₁ OptTypeDeclaration in Expr₁ return Expr₂ => ()

Otherwise, the iteration expression Expr₁ is evaluated to produce the sequence Item₁, ..., Item_n. For each item Item_i in this sequence, the body of the for expression Expr₂ is evaluated in the dynamic environment dynEnv extended with VarRef₁ bound to Item_i. This produces values Value₁, ..., Value_n, which are concatenated to produce the result sequence.

dynEnv |- Expr₁ => Item₁ ,..., Item_n

statEnv |- VarRef of var expands to Variable

dynEnv + varValue(Variable => Item₁) |- Expr₂ => Value₁

···

dynEnv + varValue(Variable => Item_n) |- Expr₂ => Value_n

dynEnv |- for VarRef in Expr₁ return Expr₂ => Value₁ ,..., Value_n

The following rule is the same as the rule above, but includes the optional positional variable VarRef_pos. If present, VarRef_pos is bound to the position of the item in the input sequence, i.e., the value i.

dynEnv |- Expr₁ => Item₁ ,..., Item_n

statEnv |- VarRef of var expands to Variable statEnv |- VarRef_pos of var expands to Variable_pos

dynEnv + varValue(Variable => Item₁; Variable_pos => 1) |- Expr₂ => Value₁

···

dynEnv + varValue(Variable => Item_n; Variable_pos => n) |- Expr₂ => Value_n

dynEnv |- for VarRef at VarRef_pos in Expr₁ return Expr₂ => Value₁ ,..., Value_n

When a type declaration is present, the dynamic semantics also checks that each item in the result of evaluating Expr₁ matches the declared type. This semantics is specified by the following dynamic rule.

dynEnv |- Expr₁ => Item₁ ,..., Item_n

Type₀ = [ SequenceType ]_sequencetype

statEnv |- Item₁ matches Type₀

statEnv |- VarRef of var expands to Variable

dynEnv + varValue(Variable => Item₁) |- Expr₂ => Value₁

···

statEnv |- Item_n matches Type₀

dynEnv + varValue(Variable => Item_n) |- Expr₂ => Value_n

dynEnv |- for VarRef as SequenceType in Expr₁ return Expr₂ => Value₁ ,..., Value_n

The last rule covers a for expression that contains a type declaration and a positional variable.

dynEnv |- Expr₁ => Item₁ ,..., Item_n

Type₀ = [ SequenceType ]_sequencetype

statEnv |- Item₁ matches Type₀

statEnv |- VarRef of var expands to Variable statEnv |- VarRef_pos of var expands to Variable_pos

dynEnv + varValue(Variable => Item₁; Variable_pos => 1) |- Expr₂ => Value₁

···

statEnv |- Item_n matches Type₀

dynEnv + varValue(Variable => Item_n; Variable_pos => n) |- Expr₂ => Value_n

dynEnv |- for VarRef as SequenceType at VarRef_pos in Expr₁ return Expr₂ => Value₁ ,..., Value_n

Note that this definition allows non-deterministic evaluation of the resulting sequence, since the judgments above the inference rule can be evaluated in any order.
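
The four dynamic rules can be summarized in a small Python model. It is a sketch under assumptions rather than a definitive implementation: sequences are Python lists, the body is a callable that receives the item and its position, the declared type is an optional predicate, and the name `eval_for` is hypothetical. The sketch evaluates the iterations from left to right, which is only one of the orders the rules permit, and it raises `TypeError` where the rules have no derivation because an item does not match the declared type.

```python
def eval_for(items, body, matches=None):
    """Evaluate `for $v at $pos in items return body($v, $pos)`.

    items:   the value of Expr1, as a list of items
    body:    callable (item, position) -> list, standing in for Expr2
    matches: optional predicate standing in for `Item matches Type0`
    """
    result = []
    for position, item in enumerate(items, start=1):
        if matches is not None and not matches(item):
            # Sketch-level choice: no rule applies, so signal a type error.
            raise TypeError(f"item at position {position} does not match the declared type")
        # Each Value_i is a sequence; concatenation keeps the result flat.
        result.extend(body(item, position))
    return result


flattened = eval_for([1, 2], lambda i, pos: [i, -i])
numbered = eval_for(["a", "b"], lambda item, pos: [(pos, item)])
empty = eval_for([], lambda item, pos: [item])
print(flattened, numbered, empty)
```

Because each iteration contributes its items with `extend`, the result is a single flat sequence, and an empty input sequence yields the empty sequence without evaluating the body.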

Example

Note that sequences are never nested in the [XPath/XQuery] data model, even when the expression in the return clause itself results in a sequence. For instance, in the following for expression:

  
  for $i in (1,2)
    return (<i> {$i} </i>, <negi> {-$i} </negi>)

each iteration in the for results in a sequence of two elements, which are then concatenated and flattened in the resulting sequence:

  
  (<i>1</i>,
   <negi>-1</negi>,
   <i>2</i>,
   <negi>-2</negi>)

4.8.3 Let Expression

Static Type Analysis

A let expression extends the static environment statEnv with Variable₁ of type Type₁ inferred from Expr₁, and infers the type of Expr₂ in the extended environment to produce the result type Type₂.

statEnv |- Expr₁ : Type₁

statEnv |- VarRef₁ of var expands to Variable₁

statEnv + varType(Variable₁ => Type₁) |- Expr₂ : Type₂

statEnv |- let VarRef₁ := Expr₁ return Expr₂ : Type₂

When a type declaration is present, the static semantics also checks that the type of the input expression is a subtype of the declared type and extends the static environment by typing Variable₁ with type Type₀. This semantics is specified by the following static rule.

statEnv |- Expr₁ : Type₁

Type₀ = [ SequenceType ]_sequencetype

statEnv |- Type₁ <: Type₀

statEnv |- VarRef₁ of var expands to Variable₁

statEnv + varType(Variable₁ => Type₀) |- Expr₂ : Type₂

statEnv |- let VarRef₁ as SequenceType := Expr₁ return Expr₂ : Type₂

Dynamic Evaluation

A let expression extends the dynamic environment dynEnv with Variable₁ bound to Value₁ returned by Expr₁, and evaluates Expr₂ in the extended environment to produce Value₂.

dynEnv |- Expr₁ => Value₁

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Value₁) |- Expr₂ => Value₂

dynEnv |- let VarRef₁ := Expr₁ return Expr₂ => Value₂

When a type declaration is present, the dynamic semantics also checks that the result of evaluating Expr₁ matches the declared type. This semantics is specified as the following dynamic rule.

dynEnv |- Expr₁ => Value₁

Type₀ = [ SequenceType ]_sequencetype

statEnv |- Value₁ matches Type₀

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Value₁) |- Expr₂ => Value₂

dynEnv |- let VarRef₁ as SequenceType := Expr₁ return Expr₂ => Value₂

Example

Note the use of the environments to define the scope of each variable. For instance, in the following nested let expression:

  let $k := 5 return
    let $k := $k + 1 return
      $k+1

the outermost let expression binds variable $k to the integer 5 in the environment, then the expression $k+1 is computed, yielding value 6, to which the second variable $k is bound. The expression then results in the final integer 7.
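
The scoping behaviour described here can be modelled with environments that are extended by copying rather than updated in place. The following Python sketch is illustrative only: environments are dictionaries, expressions are callables over an environment, and the names `extend` and `eval_let` are hypothetical.

```python
def extend(env, name, value):
    """Return a new environment; the original env is left untouched."""
    new_env = dict(env)
    new_env[name] = value
    return new_env


def eval_let(env, name, bound_expr, body):
    """Evaluate `let $name := bound_expr return body`."""
    value = bound_expr(env)                   # Expr1 sees the outer environment
    return body(extend(env, name, value))     # Expr2 sees the extended one


# let $k := 5 return let $k := $k + 1 return $k + 1
result = eval_let(
    {},
    "k",
    lambda env: 5,
    lambda env: eval_let(
        env,
        "k",
        lambda inner: inner["k"] + 1,   # refers to the outer $k, which is 5
        lambda inner: inner["k"] + 1,   # refers to the inner $k, which is 6
    ),
)
print(result)
```

The inner binding shadows the outer one only inside the body of the inner let, which is why the binding expression of the inner let still sees the value 5.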

4.8.4 Order By and Return Clauses

Introduction

The dynamic semantics of the OrderByClause is not specified formally, as doing so would require the introduction of tuples, which are not supported in the [XPath/XQuery] data model. The dynamic semantics of the order-by clause can be found in Section 3.8.3 Order By and Return Clauses^XQ.

Because an OrderByClause does not affect the type of a FLWORExpr expression, the static semantics of a FLWORExpr expression with an OrderByClause is equivalent to the static semantics of an equivalent FLWORExpr in which the OrderByClause is omitted but a gt comparison is applied.

Notation

To define normalization of OrderBy, the following auxiliary mapping rule is used.

[OrderSpecList]_{OrderSpecList}

[LetClause ... LetClause]

which specifies that OrderSpecList is mapped to a sequence of LetClauses.

Normalization

An OrderByClause is normalized to a Let clause, nested For expressions, and atomization, which guarantees that the OrderSpecList is well typed. Note that if evaluated dynamically, the normalization of OrderByClause given here does not express the required sorting semantics, but this normalization does provide the correct static type. Notably, the normalization rule uses the gt operation, which implies that the ordering criteria are typed using the same static typing rules, taking into account existential quantification and atomization.

[ stable? order by OrderSpecList]_FLWOR(Expr)

[OrderSpecList]_{OrderSpecList} return Expr

Each OrderSpec is normalized by the auxiliary atomization normalization rule.

[Expr OrderModifier, OrderSpecList]_{OrderSpecList}

let $fs:new₀ :=

for $fs:new₁ in Expr return

for $fs:new₂ in Expr return

[$fs:new₁ gt $fs:new₂]_Expr

[OrderSpecList]_{OrderSpecList}

4.9 Ordered and Unordered Expressions

Introduction

The purpose of ordered and unordered expressions is to set the ordering mode in the static context to ordered or unordered for a certain region in a query. The specified ordering mode applies to the expression nested inside the curly braces.

[91 (XQuery)]	`OrderedExpr^XQ`	::=	`"ordered" "{" Expr "}"`
[92 (XQuery)]	`UnorderedExpr^XQ`	::=	`"unordered" "{" Expr "}"`

Core Grammar

The Core grammar productions for ordered/unordered expressions are:

[69 (Core)]	`OrderedExpr`	::=	`"ordered" "{" Expr "}"`
[70 (Core)]	`UnorderedExpr`	::=	`"unordered" "{" Expr "}"`

Normalization

OrderedExpr (resp. UnorderedExpr) expressions are normalized to OrderedExpr (resp. UnorderedExpr) expressions in the [XPath/XQuery] Core.

[ordered { Expr }]_Expr

ordered { [Expr]_Expr }

[unordered { Expr }]_Expr

unordered { [Expr]_Expr }

Static Type Analysis

OrderedExpr and UnorderedExpr expressions set the ordering mode in the static context to ordered or unordered.

statEnv₁ = statEnv + orderingMode(ordered)

statEnv₁ |- Expr : Type

statEnv |- ordered { Expr } : Type

statEnv₁ = statEnv + orderingMode(unordered)

statEnv₁ |- Expr : Type

statEnv |- unordered { Expr } : Type

Dynamic Evaluation

OrderedExpr and UnorderedExpr expressions only have an effect on the static context. The effect on the evaluation of their subexpressions is captured using the fs:apply-ordering-mode function, which is introduced during normalization of axis steps, union, intersect, and except expressions, and FLWOR expressions that have no order by clause.

dynEnv |- Expr => Value

dynEnv |- ordered { Expr } => Value

dynEnv |- Expr => Value

dynEnv |- unordered { Expr } => Value

4.10 Conditional Expressions

Introduction

A conditional expression supports conditional evaluation of one of two expressions.

Conditional Expression

[45 (XQuery)]	`IfExpr^XQ`	::=	`"if" "(" Expr ")" "then" ExprSingle "else" ExprSingle`

Core Grammar

The Core grammar production for the conditional expression is:

Core Conditional Expression

[43 (Core)]	`IfExpr`	::=	`"if" "(" Expr ")" "then" ExprSingle "else" ExprSingle`

Normalization

Conditional expressions are normalized as follows.

[if (Expr₁) then Expr₂ else Expr₃]_Expr

if (fn:boolean(([ Expr₁ ]_Expr))) then [Expr₂]_Expr else [Expr₃]_Expr

Static Type Analysis

statEnv |- Expr₁ : xs:boolean statEnv |- Expr₂ : Type₂ statEnv |- Expr₃ : Type₃

statEnv |- if (Expr₁) then Expr₂ else Expr₃ : (Type₂ | Type₃)

Dynamic Evaluation

If the conditional's boolean expression Expr₁ evaluates to true, Expr₂ is evaluated and its value is produced. If the conditional's boolean expression evaluates to false, Expr₃ is evaluated and its value is produced. Note that the existence of two separate evaluation rules ensures that only one branch of the conditional is evaluated.

dynEnv |- Expr₁ => true dynEnv |- Expr₂ => Value₂

dynEnv |- if (Expr₁) then Expr₂ else Expr₃ => Value₂

dynEnv |- Expr₁ => false dynEnv |- Expr₃ => Value₃

dynEnv |- if (Expr₁) then Expr₂ else Expr₃ => Value₃
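
The point that only one branch is evaluated can be illustrated by passing the branches as unevaluated callables. This Python sketch is an assumption-laden illustration: the condition is taken to be a boolean already (the application of fn:boolean is introduced by normalization and is not modelled), and the names `eval_if` and `branch` are hypothetical.

```python
def eval_if(condition, then_branch, else_branch):
    """Evaluate exactly one of the two branches, as in the two rules above."""
    if condition:
        return then_branch()
    return else_branch()


evaluated = []  # records which branches were actually evaluated


def branch(name, value):
    """Build a delayed branch that logs its own evaluation."""
    def thunk():
        evaluated.append(name)
        return value
    return thunk


result = eval_if(3 >= 2, branch("then", ["yes"]), branch("else", ["no"]))
print(result, evaluated)
```

Since the else branch is never called, an expression there that would raise an error has no effect on the result.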

4.11 Quantified Expressions

Introduction

[XPath/XQuery] defines two quantification expressions:

Quantified Expression

[42 (XQuery)]	`QuantifiedExpr^XQ`	::=	`("some" \| "every") "$" VarName TypeDeclaration? "in" ExprSingle ("," "$" VarName TypeDeclaration? "in" ExprSingle)* "satisfies" ExprSingle`
[6 (XPath)]	`QuantifiedExpr^XP`	::=	`("some" \| "every") "$" VarName "in" ExprSingle ("," "$" VarName "in" ExprSingle)* "satisfies" ExprSingle`

Core Grammar

The Core grammar production for quantified expressions is:

[40 (Core)]	`QuantifiedExpr`	::=	`("some" \| "every") "$" VarName TypeDeclaration? "in" ExprSingle ("," "$" VarName TypeDeclaration? "in" ExprSingle)* "satisfies" ExprSingle`

Normalization

The quantified expressions are normalized into nested Core quantified expressions, each of which binds one variable.

[some VarRef₁ in Expr₁, ..., VarRef_n in Expr_n satisfies Expr]_Expr

some VarRef₁ in [Expr₁]_Expr satisfies

some VarRef₂ in [Expr₂]_Expr satisfies

...

some VarRef_n in [Expr_n]_Expr satisfies

fn:boolean(([Expr]_Expr))

[every VarRef₁ in Expr₁, ..., VarRef_n in Expr_n satisfies Expr]_Expr

every VarRef₁ in [Expr₁]_Expr satisfies

every VarRef₂ in [Expr₂]_Expr satisfies

...

every VarRef_n in [Expr_n]_Expr satisfies

fn:boolean(([Expr]_Expr))

Static Type Analysis

The static semantics of the quantified expressions uses the prime operator on types, which is explained in [8.4 Judgments for FLWOR and other expressions on sequences]. These rules are similar to those for For expressions in [4.8.2 For expression].

statEnv |- Expr₁ : Type₁

statEnv |- VarRef₁ of var expands to Variable₁

statEnv + varType(Variable₁ => prime(Type₁)) |- Expr₂ : xs:boolean

statEnv |- some VarRef₁ in Expr₁ satisfies Expr₂ : xs:boolean

The next rule is for SomeExpr with the optional type declaration.

statEnv |- Expr₁ : Type₁

Type₀ = [ SequenceType ]_sequencetype

statEnv |- prime(Type₁) <: Type₀

statEnv |- VarRef₁ of var expands to Variable₁

statEnv + varType(Variable₁ => Type₀) |- Expr₂ : xs:boolean

statEnv |- some VarRef₁ as SequenceType in Expr₁ satisfies Expr₂ : xs:boolean

The next rule is for EveryExpr without the optional type declaration.

statEnv |- Expr₁ : Type₁

statEnv |- VarRef₁ of var expands to Variable₁

statEnv + varType(Variable₁ => prime(Type₁)) |- Expr₂ : xs:boolean

statEnv |- every VarRef₁ in Expr₁ satisfies Expr₂ : xs:boolean

The next rule is for EveryExpr with the optional type declaration.

statEnv |- Expr₁ : Type₁

Type₀ = [ SequenceType ]_sequencetype

statEnv |- prime(Type₁) <: Type₀

statEnv |- VarRef₁ of var expands to Variable₁

statEnv + varType(Variable₁ => Type₀) |- Expr₂ : xs:boolean

statEnv |- every VarRef₁ as SequenceType in Expr₁ satisfies Expr₂ : xs:boolean

Dynamic Evaluation

The existentially quantified "some" expression yields true if any evaluation of the satisfies expression yields true. The existentially quantified "some" expression yields false if every evaluation of the satisfies expression is false. A quantified expression may raise an error if any evaluation of the satisfies expression raises an error. The dynamic semantics of quantified expressions is non-deterministic. This non-determinism permits implementations to use short-circuit evaluation strategies when evaluating quantified expressions.

dynEnv |- Expr₁ => Item₁, ..., Item_n

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Item_i) |- Expr₂ => true 1 <= i <= n

dynEnv |- some VarRef₁ in Expr₁ satisfies Expr₂ => true

The next rule is for SomeExpr with the optional type declaration, in which some evaluation of the satisfies expression yields true.

dynEnv |- Expr₁ => Item₁ ... Item_n

Type₀ = [ SequenceType ]_sequencetype

statEnv |- Item_i matches Type₀ 1 <= i <= n

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Item_i) |- Expr₂ => true

dynEnv |- some VarRef₁ as SequenceType in Expr₁ satisfies Expr₂ => true

The next rule is for SomeExpr without the optional type declaration, in which all evaluations of the satisfies expression yield false.

dynEnv |- Expr₁ => Item₁ ... Item_n

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Item₁) |- Expr₂ => false

...

dynEnv + varValue(Variable₁ => Item_n) |- Expr₂ => false

dynEnv |- some VarRef₁ in Expr₁ satisfies Expr₂ => false

The next rule is for SomeExpr with the optional type declaration, in which all evaluations of the satisfies expression yield false.

dynEnv |- Expr₁ => Item₁ ... Item_n

Type₀ = [ SequenceType ]_sequencetype

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Item₁) |- Expr₂ => false

statEnv |- Item₁ matches Type₀

...

dynEnv + varValue(Variable₁ => Item_n) |- Expr₂ => false

statEnv |- Item_n matches Type₀

dynEnv |- some VarRef₁ as SequenceType in Expr₁ satisfies Expr₂ => false

The universally quantified "every" expression yields false if any evaluation of the satisfies expression yields false. The universally quantified "every" expression yields true if every evaluation of the satisfies expression is true.

dynEnv |- Expr₁ => Item₁ ... Item_n

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Item_i) |- Expr₂ => false 1 <= i <= n

dynEnv |- every VarRef₁ in Expr₁ satisfies Expr₂ => false

The next rule is for EveryExpr with the optional type declaration, in which some evaluation of the satisfies expression yields false.

dynEnv |- Expr₁ => Item₁ ... Item_n

Type₀ = [ SequenceType ]_sequencetype

statEnv |- Item_i matches Type₀

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Item_i) |- Expr₂ => false 1 <= i <= n

dynEnv |- every VarRef₁ as SequenceType in Expr₁ satisfies Expr₂ => false

The next rule is for EveryExpr without the optional type declaration, in which all evaluations of the satisfies expression yield true.

dynEnv |- Expr₁ => Item₁ ... Item_n

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Item₁) |- Expr₂ => true

...

dynEnv + varValue(Variable₁ => Item_n) |- Expr₂ => true

dynEnv |- every VarRef₁ in Expr₁ satisfies Expr₂ => true

The next rule is for EveryExpr with the optional type declaration, in which all evaluations of the satisfies expression yield true.

dynEnv |- Expr₁ => Item₁ ... Item_n

Type₀ = [ SequenceType ]_sequencetype

statEnv |- VarRef₁ of var expands to Variable₁

dynEnv + varValue(Variable₁ => Item₁) |- Expr₂ => true

statEnv |- Item₁ matches Type₀

...

dynEnv + varValue(Variable₁ => Item_n) |- Expr₂ => true

statEnv |- Item_n matches Type₀

dynEnv |- every VarRef₁ as SequenceType in Expr₁ satisfies Expr₂ => true
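
The dynamic rules for "some" and "every" can be approximated with Python's `any` and `all`. This is a sketch, not a normative reading: type declarations are not modelled, the satisfies expression is a callable that already returns a boolean, and the names `eval_some` and `eval_every` are hypothetical. Evaluating from left to right and stopping at the first decisive item is just one of the strategies the non-deterministic rules permit.

```python
def eval_some(items, satisfies):
    """True if some evaluation of the satisfies expression yields true."""
    return any(satisfies(item) for item in items)


def eval_every(items, satisfies):
    """True if every evaluation of the satisfies expression yields true."""
    return all(satisfies(item) for item in items)


# some $x in (1, 2), $y in (2, 3) satisfies $x = $y, as nested quantifiers
nested_some = eval_some([1, 2], lambda x: eval_some([2, 3], lambda y: x == y))

# every $x in (1, 2), $y in (3, 4) satisfies $x < $y
nested_every = eval_every([1, 2], lambda x: eval_every([3, 4], lambda y: x < y))

# Short-circuiting: the second item would raise, but is never evaluated here.
short_circuit = eval_some([1, 0], lambda x: 1 // x == 1)

print(nested_some, nested_every, short_circuit)
```

With an empty input sequence (n = 0), the "all false" rule for some and the "all true" rule for every have no premises left to satisfy, so `eval_some([], ...)` is false and `eval_every([], ...)` is true in this model.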

4.12 Expressions on SequenceTypes

Introduction

Expressions on SequenceTypes are expressions whose semantics depends on the type of some of the sub-expressions to which they are applied. The syntax of SequenceType expressions is described in [3.5.3 SequenceType Syntax].

4.12.1 Instance Of

SequenceType expressions

[54 (XQuery)]	`InstanceofExpr^XQ`	::=	`TreatExpr ( "instance" "of" SequenceType )?`

Introduction

The SequenceType expression "Expr instance of SequenceType" is true if and only if the result of evaluating expression Expr is an instance of the type referred to by SequenceType.

Normalization

An InstanceofExpr expression is normalized into a TypeswitchExpr expression. Note that the following normalization rule uses a variable $fs:new, a newly created variable that must not conflict with any variables already in scope. This variable is necessary to comply with the syntax of typeswitch expressions in the [XPath/XQuery] Core, but is never used.

[Expr instance of SequenceType]_Expr

typeswitch ([ Expr ]_Expr)

case $fs:new as SequenceType return fn:true()

default $fs:new return fn:false()

4.12.2 Typeswitch

SequenceType expressions

[43 (XQuery)]	`TypeswitchExpr^XQ`	::=	`"typeswitch" "(" Expr ")" CaseClause+ "default" ("$" VarName)? "return" ExprSingle`
[44 (XQuery)]	`CaseClause^XQ`	::=	`"case" ("$" VarName "as")? SequenceType "return" ExprSingle`

Introduction

The typeswitch expression chooses one of several expressions to evaluate based on the dynamic type of an input value.

Each branch of a typeswitch expression may have an optional VarRef, which is bound to the value of the input expression. This variable is optional in [XPath/XQuery] but mandatory in the [XPath/XQuery] Core. One of the reasons for having this variable is that it is assigned a specific type for the corresponding branch.

Core Grammar

The Core grammar productions for typeswitch are:

[41 (Core)]	`TypeswitchExpr`	::=	`"typeswitch" "(" Expr ")" CaseClause+ "default" ("$" VarName)? "return" ExprSingle`
[42 (Core)]	`CaseClause`	::=	`"case" ("$" VarName "as")? SequenceType "return" ExprSingle`

Notation

For convenience, we introduce the following auxiliary grammar productions.

[89 (Formal)]	`OptVarRef`	::=	`VarRef?`

Notation

To define normalization of case clauses to the [XPath/XQuery] Core, the following auxiliary mapping rule is used.

[CaseClause]_Case

CaseClause

specifies that CaseClause is mapped to CaseClause, in the [XPath/XQuery] Core.

Normalization

Normalization of a typeswitch expression guarantees that every branch has an associated VarRef. The following normalization rule adds a newly created variable that does not appear in the rest of the query. Note that $fs:new is a newly generated variable that must not conflict with any variables already in scope and that is not used in any of the sub-expressions.

[ case SequenceType return Expr ]_Case

case $fs:new₁ as SequenceType return [ Expr ]_Expr

[ case VarRef as SequenceType return Expr ]_Case

case VarRef as SequenceType return [ Expr ]_Expr

[ default return Expr ]_Case

default $fs:new₁ return [ Expr ]_Expr

[ default VarRef return Expr ]_Case

default VarRef return [ Expr ]_Expr

[

typeswitch ( Expr₀ )

CaseClause₁

···

CaseClause_n

default OptVarRef return Expr_n+1

]_Expr

typeswitch ( [ Expr₀ ]_Expr )

[CaseClause₁]_Case

···

[CaseClause_n]_Case

[ default OptVarRef return Expr_n+1 ]_Case

Notation

The following auxiliary grammar production is used to identify branches of the typeswitch.

CaseRules

[78 (Formal)]	`CaseRules`	::=	`("case" "$" VarName "as" SequenceType "return" Expr CaseRules) \| ("default" "$" VarName "return" Expr)`

The following judgment

statEnv |- Type₁ case CaseRules : Type

is used in the static semantics of typeswitch. It indicates that under the static environment statEnv, and with the input type of the typeswitch being Type₁, the given case rules yield the type Type.

The following judgment

dynEnv |- Value₁ against CaseRules => Value₂

is used in the dynamic semantics of typeswitch. It indicates that under the dynamic environment dynEnv, with the input value of the typeswitch being Value₁, the given case rules yield the value Value₂.

Static Type Analysis

The static typing rules for the typeswitch expression are simple. Each case clause and the default clause of the typeswitch is typed independently. The type of the entire typeswitch expression is the union of the types of all the clauses.

statEnv |- Expr₀ : Type₀

statEnv |- Type₀ case case VarRef₁ as SequenceType₁ return Expr₁ : Type₁

···

statEnv |- Type₀ case case VarRef_n as SequenceType_n return Expr_n : Type_n

statEnv |- Type₀ case default VarRef_n+1 return Expr_n+1 : Type_n+1

statEnv |-

(typeswitch (Expr₀)

case VarRef₁ as SequenceType₁ return Expr₁

···

case VarRef_n as SequenceType_n return Expr_n

default VarRef_n+1 return Expr_n+1)

: (Type₁ | ... | Type_n+1)

To type one case clause, the case variable is assigned the type of the case clause CaseType and the body of the clause is typed in the extended environment. Thus, the type of a case clause is independent of the type of the input expression.

CaseType = [ SequenceType ]_sequencetype

statEnv |- VarRef of var expands to Variable

statEnv + varType(Variable => CaseType ) |- Expr : Type

statEnv |- Type₀ case case VarRef as SequenceType return Expr : Type

To type the default clause, the variable is assigned the type of the input expression and the body of the default clause is typed in the extended environment.

statEnv + varType(VarRef => Type₀ ) |- Expr : Type

statEnv |- Type₀ case default VarRef return Expr : Type
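
The shape of these typing rules can be sketched in Python. The model is deliberately crude and every name in it is an assumption: a type is a frozenset of item type names (occurrence indicators are ignored), the body of each clause is a callable from the type of its variable to the type of its return expression, and `type_typeswitch` is a hypothetical helper.

```python
def type_typeswitch(input_type, cases, default_body):
    """Union of the clause types of a typeswitch.

    input_type:   Type0, the static type of the input expression
    cases:        list of (case_type, body) pairs
    default_body: body of the default clause
    """
    result = frozenset()
    for case_type, body in cases:
        # The case variable is typed with CaseType, independent of Type0.
        result |= body(case_type)
    # The default variable is typed with the input type Type0.
    result |= default_body(input_type)
    return result


INTEGER = frozenset({"xs:integer"})
STRING = frozenset({"xs:string"})
BOOLEAN = frozenset({"xs:boolean"})

inferred = type_typeswitch(
    INTEGER | STRING,
    [
        (INTEGER, lambda var_type: var_type),  # case $v as xs:integer return $v
        (STRING, lambda var_type: BOOLEAN),    # case $v as xs:string return fn:true()
    ],
    lambda var_type: var_type,                 # default $v return $v
)
print(sorted(inferred))
```

In this example the default clause contributes the whole input type, so the inferred union contains xs:string even though the xs:string case itself returns a boolean.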

Dynamic Evaluation

The evaluation of a typeswitch proceeds as follows. First, the input expression is evaluated, yielding an input value. The effective case is the first case clause such that the input value matches the SequenceType in the case clause. The return clause of the effective case is evaluated and the value of the return expression is the value of the typeswitch expression.

dynEnv |- Expr => Value₀

dynEnv |- Value₀ against CaseRules => Value₁

dynEnv |- typeswitch (Expr) CaseRules => Value₁

If the value matches the sequence type, the following rule applies: It extends the dynamic environment by binding the variable Variable to Value₀ and evaluates the body of the return clause.

CaseType = [ SequenceType ]_sequencetype

statEnv |- Value₀ matches CaseType

statEnv |- VarRef of var expands to Variable

dynEnv + varValue(Variable => Value₀) |- Expr => Value₁

dynEnv |- Value₀ against case VarRef as SequenceType return Expr CaseRules => Value₁

If the value does not match the sequence type, the current case is not evaluated, and the remaining case rules are evaluated in order by applying the inference rule recursively.

CaseType = [ SequenceType ]_sequencetype

statEnv |- not(Value₀ matches CaseType)

dynEnv |- Value₀ against CaseRules => Value₁

dynEnv |- Value₀ against case VarRef as SequenceType return Expr CaseRules => Value₁

The last rule states that the default branch of a typeswitch expression always evaluates to the value of its return clause.

statEnv |- VarRef of var expands to Variable

dynEnv + varValue(Variable => Value₀) |- Expr => Value₁

dynEnv |- Value₀ against default VarRef return Expr => Value₁
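
The recursive structure of the "against" judgment can be mirrored in Python. This is a hedged sketch: sequence type matching is replaced by arbitrary predicates (here `isinstance` checks, which are only a stand-in for `matches`), clause bodies are callables that receive the bound value, and the names `against` and `instance_of` are hypothetical. The second function follows the normalization of instance of into a typeswitch given in [4.12.1 Instance Of].

```python
def against(value, case_rules, default_body):
    """Evaluate `value against CaseRules`.

    case_rules:   list of (matches, body) pairs, in source order
    default_body: body of the default clause
    """
    if not case_rules:
        # Only the default clause is left: it always applies.
        return default_body(value)
    (matches, body), rest = case_rules[0], case_rules[1:]
    if matches(value):
        # Effective case: bind the variable to the value, evaluate the body.
        return body(value)
    # Otherwise skip this case and recurse on the remaining rules.
    return against(value, rest, default_body)


def instance_of(value, matches):
    """`Expr instance of SequenceType`, expressed through the typeswitch model."""
    return against(value, [(matches, lambda _v: True)], lambda _v: False)


rules = [
    (lambda v: isinstance(v, int), lambda v: ("integer", v)),
    (lambda v: isinstance(v, str), lambda v: ("string", v)),
]
first = against(7, rules, lambda v: ("other", v))
second = against("x", rules, lambda v: ("other", v))
fallback = against(2.5, rules, lambda v: ("other", v))
print(first, second, fallback)
```

Because the rules are tried in order and the recursion stops at the first match, later case clauses are never consulted once an effective case has been found.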

4.12.3 Cast

Introduction

The cast expression can be used to convert a value to a specific datatype. It changes both the type and value of the result of an expression, and can only be applied to an atomic value.

[57 (XQuery)]	`CastExpr^XQ`	::=	`UnaryExpr ( "cast" "as" SingleType )?`
[117 (XQuery)]	`SingleType^XQ`	::=	`AtomicType "?"?`

Core Grammar

The Core grammar productions for cast expressions are:

[47 (Core)]	`CastExpr`	::=	`ValueExpr ( "cast" "as" SingleType )?`
[81 (Core)]	`SingleType`	::=	`AtomicType "?"?`

Normalization

The normalization of cast applies atomization to its argument. The type declaration asserts that the result is a single atomic value. The second normalization rule applies when the target type is optional.

[Expr cast as AtomicType ]_Expr

let $v as xdt:anyAtomicType := fn:data(([ Expr ]_Expr)) return

$v cast as AtomicType

[Expr cast as AtomicType? ]_Expr

let $v as xdt:anyAtomicType? := fn:data(([ Expr ]_Expr)) return

typeswitch ($v)

case $fs:new as empty-sequence() return ()

default $fs:new return $v cast as AtomicType

Static Type Analysis

The static typing rule of cast expression is as follows. The type of a Core cast expression is always the target type. Note that a cast expression can fail at run-time if the given value cannot be cast to the target type.

statEnv |- Expr cast as AtomicType : AtomicType

Dynamic Evaluation

The dynamic semantics of cast expressions is defined in Section 17 Casting^FO. The semantics of cast expressions depends on the type of the input value and on the target type. For any source and target primitive types, the casting table in Section 17 Casting^FO indicates whether the cast from the source type to the target type is permitted. When a cast is permitted, the detailed dynamic rules for cast in Section 17 Casting^FO are applied. These rules are not specified further here.
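
The following expressions are an informal sketch of how the normalization and dynamic rules combine. The literal values are chosen only for illustration.

```xquery
(: Each operand is atomized first, then cast according to the casting table. :)
"42" cast as xs:integer,     (: the xs:integer value 42 :)
1.9 cast as xs:integer,      (: the xs:integer value 1; the fractional part is truncated :)
() cast as xs:integer?       (: the empty sequence, because the target type is optional :)
```

By contrast, "abc" cast as xs:integer is permitted by the casting table but fails at run-time because the string is not a valid lexical form of xs:integer, and () cast as xs:integer raises a type error because the target type is not optional.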

4.12.4 Castable

[56 (XQuery)] CastableExpr^XQ ::= CastExpr ( "castable" "as" SingleType )?

Castable expressions check whether a value can be cast to a given type.

Core Grammar

The Core grammar production for castable is:

[46 (Core)] CastableExpr ::= CastExpr ( "castable" "as" SingleType )?

Normalization

The normalization of castable simply maps its expression argument.

[Expr castable as AtomicType]_Expr

let $v as xdt:anyAtomicType := fn:data(([ Expr ]_Expr)) return

$v castable as AtomicType

[Expr castable as AtomicType?]_Expr

let $v as xdt:anyAtomicType? := fn:data(([ Expr ]_Expr)) return

$v castable as AtomicType?

Static Type Analysis

The type of a Core castable expression is always a boolean.

statEnv |- Expr castable as AtomicType : xs:boolean

Dynamic Evaluation

If casting succeeds, then the castable expression evaluates to true.

dynEnv |- Expr => Value₁

dynEnv |- Value₁ cast as AtomicType => Value₂

dynEnv |- Expr castable as AtomicType => true

Otherwise, 'castable as' evaluates to false.

dynEnv |- Expr => Value₁

dynEnv |- not(Value₁ cast as AtomicType => Value₂)

dynEnv |- Expr castable as AtomicType => false
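
A typical use of castable, sketched below with illustrative values, is to guard a cast that could otherwise raise a dynamic error.

```xquery
(: "42" can be cast to xs:integer, "abc" cannot; the result is (42, -1). :)
for $s in ("42", "abc")
return
  if ($s castable as xs:integer)
  then $s cast as xs:integer
  else -1
```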

4.12.5 Constructor Functions

Constructor functions provide an alternative syntax for casting.

Normalization

Constructor functions for atomic types are normalized to explicit cast as expressions.

[AtomicType(Expr)]_Expr

[Expr cast as AtomicType? ]_Expr
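
For example, under this normalization the two operands of the comparison below denote the same computation. This is an informal sketch and the date value is arbitrary.

```xquery
(: The constructor call and the explicit cast yield the same xs:date value. :)
xs:date("2007-01-23") eq ("2007-01-23" cast as xs:date?),   (: true :)

(: Because the target type is optional, an empty argument yields the empty sequence. :)
empty(xs:date(()))                                          (: true :)
```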

4.12.6 Treat

[55 (XQuery)] TreatExpr^XQ ::= CastableExpr ( "treat" "as" SequenceType )?

Introduction

The expression "Expr treat as SequenceType" can be used to change the static type of the result of an expression without changing its value. The treat-as expression raises a dynamic error if the dynamic type of the input value does not match the specified type.

Normalization

Treat as expressions are normalized to typeswitch expressions. Note that the following normalization rule uses a variable $fs:new, which is a newly created variable that does not conflict with any variables already in scope.

[Expr treat as SequenceType]_Expr

typeswitch ([ Expr ]_Expr)

case $fs:new as SequenceType return $fs:new

default $fs:new return fn:error()
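
The following query is an informal sketch of the effect of treat as. The variable $x is declared with the static type item(), and the treat as expression narrows that static type to xs:integer so that the addition can be typed.

```xquery
declare variable $x as item() := 5;

(: The value is unchanged; only the static type is narrowed. The result is 6. :)
($x treat as xs:integer) + 1
```

If $x were bound to the string "a" instead, the case clause of the normalized typeswitch would not match and the default clause would raise a dynamic error.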

4.13 Validate Expressions

[63 (XQuery)]	`ValidateExpr^XQ`	::=	`"validate" ValidationMode? "{" Expr "}"`
[64 (XQuery)]	`ValidationMode^XQ`	::=	`"lax" \| "strict"`

Core Grammar

The Core grammar productions for validate are:

[49 (Core)]	`ValidateExpr`	::=	`"validate" ValidationMode? "{" Expr "}"`
[50 (Core)]	`ValidationMode`	::=	`"lax" \| "strict"`

A validate expression validates its argument with respect to the in-scope schema definitions, using the schema validation process described in [Schema Part 1]. The argument to a validate expression must be either an element or a document node. Validation replaces all nodes with new nodes that have their own identity, the type annotation^XQ, and default values created during the validation process.

Normalization

A validate expression with no validation mode is normalized into a validate expression with the validation mode set to strict.

[validate { Expr }]_Expr

validate strict { [Expr]_Expr }

[validate ValidationMode { Expr }]_Expr

validate ValidationMode { [Expr]_Expr }

Static Type Analysis

Static typing of the validate operation is defined by the following rule. Note the use of a subtyping check to ensure that the type of the expression to validate is either an element or a well-formed document node (i.e., with only one root element and no text nodes). The type of the expression to validate may be a union of more than one element type. We apply the with mode judgment to each element type to determine the meaning of that element type with the given validation mode, which yields a new element type. The result type is the union over all new element types.

statEnv |- Expr : Type

statEnv |- Type <: (element | document { ElementType })

statEnv |- prime(Type) = ElementType₁ | ... | ElementType_n

ElementType₁ = element ElementNameOrWildcard₁ OptTypeSpecifier₁

···

ElementType_n = element ElementNameOrWildcard_n OptTypeSpecifier_n

statEnv |- ElementNameOrWildcard₁ with mode ValidationMode resolves to ElementType₁

···

statEnv |- ElementNameOrWildcard_n with mode ValidationMode resolves to ElementType_n

Type₁ = ElementType₁ | ... | ElementType_n

statEnv |- validate ValidationMode { Expr } : Type₁

4.13.1 Validating an Element Node

Dynamic Evaluation

The normative dynamic semantics of validation is specified in Section 3.13 Validate Expressions^XQ. The effect of validation of a data model value is equivalent to:

serialization of the data model, as described in [Data Model Serialization], followed by
validation of the serialized value into a Post-Schema Validated Infoset, as described in [Schema Part 1], followed by
construction of a new data model value, as described in [Data Model].

The above steps are expressed formally by the "erasure" and "annotation" judgments. Formally, validation removes existing type annotations from nodes ("erasure"), and it re-validates the corresponding data model instance, possibly adding new type annotations to nodes ("annotation"). Both erasure and annotation are described formally in [E Auxiliary Judgments for Validation]. Indeed, the conjunction of erasure and annotation provides a formal model for a large part of actual schema validation. The semantics of the validate expression is specified as follows.

In the first premise below, the expression to validate is evaluated. The resulting value must be an element or document node. The second premise constructs a new value in which all existing type annotations have been erased. The third premise determines the element type that corresponds to the element node's name in the given validation mode. The last premise validates the erased element node against that element type, using the annotate as judgment, yielding the final validated element.

statEnv; dynEnv |- Expr => ElementValue₁

ElementValue₁ erases to ElementValue₂

ElementValue₂ = element ElementName₂ of type TypeName₂ { Value }

statEnv |- ElementName₂ with mode ValidationMode resolves to ElementType₂

statEnv |- annotate as ElementType₂ ( ElementValue₂) => ElementValue₃

dynEnv |- validate ValidationMode { Expr } => ElementValue₃
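
The query below sketches a validate expression applied to a constructed element. The namespace URI, the location hint po.xsd, and the element names are hypothetical; the sketch assumes that the imported schema contains a top-level declaration for po:order.

```xquery
(: Hypothetical schema: assumed to declare po:order with a po:quantity child. :)
import schema namespace po = "http://example.com/po" at "po.xsd";

(: The constructed element is erased, then annotated against the element
   type that po:order resolves to in strict mode. :)
validate strict {
  <po:order><po:quantity>3</po:quantity></po:order>
}
```

In strict mode the element name must resolve to a declaration in the in-scope schema definitions; in lax mode an element without such a declaration is still accepted.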

4.13.2 Validating a Document Node

The rule for validating a document node is similar to that for validating an element node.

Dynamic Evaluation

statEnv; dynEnv |- Expr => document { ElementValue₁ }

document { ElementValue₁ } erases to document { ElementValue₂ }

ElementValue₂ = element ElementName₂ of type TypeName₂ { Value }

statEnv |- ElementName₂ with mode ValidationMode resolves to ElementType₂

statEnv |- annotate as document { ElementType₂ } (document { ElementValue₂ }) => document { ElementValue₃ }

dynEnv |- validate ValidationMode { Expr } => document { ElementValue₃ }

4.14 Extension Expressions

Introduction

An extension expression is an expression whose semantics are implementation-defined. An extension expression consists of one or more pragmas, followed by an expression enclosed in curly braces.

[65 (XQuery)]	`ExtensionExpr^XQ`	::=	`Pragma+ "{" Expr? "}"`
[66 (XQuery)]	`Pragma^XQ`	::=	`"(#" S? QName PragmaContents "#)"`
[67 (XQuery)]	`PragmaContents^XQ`	::=	`(Char* - (Char* '#)' Char*))`

Core Grammar

The Core grammar productions for ExtensionExpr are:

[51 (Core)]	`ExtensionExpr`	::=	`Pragma+ "{" Expr? "}"`
[52 (Core)]	`Pragma`	::=	`"(#" S? QName PragmaContents "#)"`
[53 (Core)]	`PragmaContents`	::=	`(Char* - (Char* '#)' Char*))`

Normalization

Extension expressions are normalized as extension expressions in the [XPath/XQuery] Core.

[Pragma+ { Expr }]_Expr

Pragma+ { [Expr]_Expr }

If the extension expression does not contain any expression, this is normalized into an extension expression with a call to the fn:error function.

[Pragma+ { }]_Expr

Pragma+ { fn:error() }

Static Type Analysis

If at least one of the pragmas is recognized, the static semantics are implementation-defined.

If none of the pragmas is recognized, the static semantics are the same as for the input expression. In both cases, the static typing must be applied on the input expression, possibly raising the corresponding type errors.

statEnv |- Expr : Type₁

statEnv |- A Pragma is recognized, yielding the implementation-defined static type Type₂.

statEnv |- Pragma+ { Expr } : Type₂

statEnv |- Expr : Type₁

statEnv |- No Pragma is recognized.

statEnv |- Pragma+ { Expr } : Type₁

Dynamic Evaluation

The QName of a pragma must resolve to a namespace URI and local name, using the statically known namespaces. If at least one of the pragmas is recognized, the dynamic semantics is implementation-defined.

dynEnv |- A Pragma is recognized, yielding the implementation-defined value Value.

dynEnv |- Pragma+ { Expr } => Value

If none of the pragmas is recognized, the dynamic semantics of an ExtensionExpr are the same as evaluating the given expression.

No Pragma is recognized.

dynEnv |- Expr => Value

dynEnv |- Pragma+ { Expr } => Value
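
As an informal sketch, the query below wraps a path expression in a single pragma. The namespace URI, the pragma name ex:use-index, and the document orders.xml are all hypothetical.

```xquery
(: Hypothetical extension namespace and pragma name. :)
declare namespace ex = "http://example.com/extensions";

(# ex:use-index price #) {
  doc("orders.xml")//order[price > 100]
}
```

An implementation that does not recognize ex:use-index evaluates the enclosed expression as if the pragma were absent. An implementation that does recognize it may assign implementation-defined static and dynamic semantics to the whole extension expression.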

5 Modules and Prologs

The organization of this section parallels the organization of Section 4 Modules and Prologs^XQ.

Introduction

XQuery supports modules as defined in Section 4 Modules and Prologs^XQ. A main module^XQ contains a Prolog^XQ followed by a query body^XQ. A query has exactly one main module. In a main module, the query body^XQ can be evaluated, and its value is the result of the query. A library module^XQ contains a module declaration followed by a Prolog^XQ.

The Prolog is a sequence of declarations that affect query processing. The Prolog can be used, for example, to declare namespace prefixes, import types from XML Schemas, and declare functions and variables. Namespace declarations and schema imports always precede function and variable declarations, as specified by the following grammar productions.

Query Module

[1 (XQuery)]	`Module^XQ`	::=	`VersionDecl? (LibraryModule \| MainModule)`
[3 (XQuery)]	`MainModule^XQ`	::=	`Prolog QueryBody`
[4 (XQuery)]	`LibraryModule^XQ`	::=	`ModuleDecl Prolog`
[6 (XQuery)]	`Prolog^XQ`	::=	`((DefaultNamespaceDecl \| Setter \| NamespaceDecl \| Import) Separator)* ((VarDecl \| FunctionDecl \| OptionDecl) Separator)*`
[7 (XQuery)]	`Setter^XQ`	::=	`BoundarySpaceDecl \| DefaultCollationDecl \| BaseURIDecl \| ConstructionDecl \| OrderingModeDecl \| EmptyOrderDecl \| CopyNamespacesDecl`
[8 (XQuery)]	`Import^XQ`	::=	`SchemaImport \| ModuleImport`
[9 (XQuery)]	`Separator^XQ`	::=	`";"`
[30 (XQuery)]	`QueryBody^XQ`	::=	`Expr`

Function declarations are globally scoped, that is, the use of a function name in a function call may precede declaration of the function. Variable declarations are lexically scoped, i.e., variable declarations must precede variable uses.
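
The difference between the two scoping rules can be seen in the following main module, given here as an informal sketch. The initializer of $limit calls local:double before the function is declared, which is permitted because function declarations are globally scoped.

```xquery
xquery version "1.0";

(: The function is used here but declared on the next line. :)
declare variable $limit as xs:integer := local:double(21);

declare function local:double($n as xs:integer) as xs:integer { 2 * $n };

$limit   (: 42 :)
```

A reference to a variable, by contrast, is valid only after that variable's declaration.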

Core Grammar

The Core grammar productions for the prolog are:

Query Module

[1 (Core)]	`Module`	::=	`VersionDecl? (LibraryModule \| MainModule)`
[3 (Core)]	`MainModule`	::=	`Prolog QueryBody`
[4 (Core)]	`LibraryModule`	::=	`ModuleDecl Prolog`
[6 (Core)]	`Prolog`	::=	`((DefaultNamespaceDecl \| Setter \| NamespaceDecl \| Import) Separator)* ((VarDecl \| FunctionDecl \| OptionDecl) Separator)*`
[7 (Core)]	`Setter`	::=	`DefaultCollationDecl \| BaseURIDecl \| ConstructionDecl \| OrderingModeDecl \| EmptyOrderDecl \| CopyNamespacesDecl`
[8 (Core)]	`Import`	::=	`SchemaImport \| ModuleImport`
[9 (Core)]	`Separator`	::=	`";"`
[29 (Core)]	`QueryBody`	::=	`Expr`

Notation

The XQuery Prolog requires that declarations appear in a particular order. In the Formal Semantics, it is simpler to assume the declarations can appear in any order, as it does not change their semantics -- we simply assume that an XQuery parser has enforced the required order.

The Prolog contains a variety of declarations that specify the initial static and dynamic context of the query. The following formal grammar productions represent any Prolog declaration.

Prolog Declarations

[79 (Formal)]	`PrologDeclList`	::=	`(PrologDecl Separator)*`
[80 (Formal)]	`PrologDecl`	::=	`DefaultCollationDecl \| BaseURIDecl \| ConstructionDecl \| OrderingModeDecl \| EmptyOrderDecl \| CopyNamespacesDecl \| SchemaImport \| ModuleImport \| NamespaceDecl \| DefaultNamespaceDecl \| VarDecl \| FunctionDecl \| OptionDecl`

The function []_PrologDecl takes a prolog declaration and maps it into its equivalent declaration in the Core grammar.

[PrologDecl₁]_PrologDecl

PrologDecl₂

The following auxiliary judgments are applied when statically processing the declarations in the prolog. The effect of the judgment is to process each prolog declaration in order, constructing a new static environment from the static environment constructed from previous prolog declarations.

The judgment:

statEnv₁ |- PrologDeclList =>_stat statEnv₂ with PrologDeclList₁

holds if under the static environment statEnv₁, the sequence of prolog declarations PrologDeclList yields the static environment statEnv₂ and the normalized sequence of prolog declarations in the Core grammar.

The judgment:

statEnv₁ |- PrologDecl =>_stat statEnv₂

holds if under the static environment statEnv₁, the single prolog declaration PrologDecl yields the new static environment statEnv₂.

Static Context Processing

Prolog declarations are processed in the order they are encountered. The normalization of a prolog declaration PrologDecl depends on the static context processing of all previous prolog declarations. In turn, static context processing of PrologDecl depends on the normalization of the PrologDecl. For example, because variables are lexically scoped, the normalization and static context processing of a variable declaration depends on the normalization and static context processing of all previous variable declarations. Therefore, the normalization phase and static context processing are interleaved, with normalization preceding static context processing for each prolog declaration.

The following inference rules express this dependency. The first rule specifies that for an empty sequence of prolog declarations, the initial static environment is the default static context.

statEnv |- () =>_stat statEnv with ()

The next rule interleaves normalization and static context processing. The result of static context processing and normalization is a static context and the normalized prolog declarations.

[PrologDecl]_PrologDecl == PrologDecl₁

statEnv |- PrologDecl₁ =>_stat statEnv₁

statEnv₁ |- PrologDeclList =>_stat statEnv₂ with PrologDeclList₁

statEnv |- PrologDecl ; PrologDeclList =>_stat statEnv₂ with PrologDecl₁ ; PrologDeclList₁
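
The interleaving can be illustrated with a short prolog. This is a sketch; the namespace URI is hypothetical. Each declaration is normalized and processed in the static environment produced by the declarations before it.

```xquery
(: Adds the prefix a to the statically known namespaces. :)
declare namespace a = "http://example.com/a";

(: Resolving the name a:x relies on the namespace declaration above. :)
declare variable $a:x as xs:integer := 1;

(: The initializer is typed in an environment that already contains $a:x. :)
declare variable $a:y as xs:integer := $a:x + 1;

$a:y   (: 2 :)
```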

Static Type Analysis

Static typing of a main module follows context processing and normalization. Context processing and normalization of a main module applies the rules above to the prolog, then using the resulting static environment statEnv, the query body is normalized into a Core expression, and the static typing rules are applied to this Core expression.

statEnvDefault |- PrologDeclList =>_stat statEnv with PrologDeclList₁

statEnv |- [QueryBody]_Expr == Expr₂

statEnv |- Expr₂ : Type

PrologDeclList QueryBody : Type

Notation

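Similarly, the judgment:

dynEnv₁ |- PrologDeclList =>_dyn dynEnv₂

holds if under the dynamic environment dynEnv₁, the sequence of prolog declarations PrologDeclList yields the new dynamic environment dynEnv₂.

The judgment: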

dynEnv |- PrologDecl =>_dyn dynEnv₁

holds if under the dynamic environment dynEnv, the single prolog declaration PrologDecl yields the new dynamic environment dynEnv₁.

Dynamic Context Processing

The rules for initializing the dynamic context are as follows. The first rule specifies that for an empty sequence of prolog declarations, the initial dynamic environment is the default dynamic context.

dynEnv |- () =>_dyn dynEnv

The second rule simply computes the dynamic environment by processing the prolog declarations in order.

dynEnv |- PrologDecl =>_dyn dynEnv₁

dynEnv₁ |- PrologDeclList =>_dyn dynEnv₂

dynEnv |- PrologDecl ; PrologDeclList =>_dyn dynEnv₂

Dynamic Evaluation

Dynamic evaluation of a main module applies the rules for dynamic-context processing to the prolog declarations, then using the resulting dynamic environment dynEnv, the dynamic evaluation rules are applied to the normalized query body.

dynEnvDefault |- PrologDeclList =>_dyn dynEnv

dynEnv |- [QueryBody]_Expr == Expr₂

dynEnv |- Expr₂ => Value

PrologDeclList QueryBody => Value

Notation

We define a new judgment that maps a module's URI (or a main module) to the corresponding module's static environment:

(URI | #MAIN) =>_{module_statEnv} statEnv

We also define a new judgment that maps a module's URI (or a main module) to the corresponding module's dynamic environment:

(URI | #MAIN) =>_{module_dynEnv} dynEnv

For a main module, those judgments are defined as follows.

PrologDeclList =>_stat statEnv

#MAIN =>_{module_statEnv} statEnv

PrologDeclList =>_dyn dynEnv

#MAIN =>_{module_dynEnv} dynEnv

For a library module, those judgments are defined in [5.11 Module Import].

5.1 Version Declaration

Introduction

A version declaration specifies the applicable XQuery syntax and semantics for a module. An XQuery implementation must raise a static error when processing a query labeled with a version that the implementation does not support. This document applies to XQuery 1.0 only and does not specify this static error formally.

[2 (XQuery)] VersionDecl^XQ ::= "xquery" "version" StringLiteral ("encoding" StringLiteral)? Separator

Core Grammar

The core grammar production for version declarations is:

[2 (Core)] VersionDecl ::= "xquery" "version" StringLiteral ("encoding" StringLiteral)? Separator

Normalization

Version declarations are left unchanged through normalization.

[VersionDecl]_PrologDecl

VersionDecl

5.2 Module Declaration

Introduction

[5 (XQuery)] ModuleDecl^XQ ::= "module" "namespace" NCName "=" URILiteral Separator

We assume that the static-context processing and dynamic-context processing described in [5 Modules and Prologs] are applied to all library modules before the normalization, static context processing, and dynamic context processing of the main module. That is, at the time an "import module" declaration is processed, we assume that the static and dynamic context of the imported module is already available. This assumption does not require or assume separate compilation of modules. An implementation might process all or some imported modules statically (i.e., before the importing module is identified) or dynamically (i.e., when the importing module is identified and processed).

Core Grammar

The core grammar production for module declarations is:

[5 (Core)] ModuleDecl ::= "module" "namespace" NCName "=" URILiteral Separator

Normalization

Module declarations are left unchanged through normalization.

[ModuleDecl]_PrologDecl

ModuleDecl

Static Context Processing

The effect of a module declaration is to apply the static processing rules defined in [5 Modules and Prologs] to the module's prolog. The resulting static context is then available to any importing module.

The module declaration extends the prolog with a namespace declaration that binds the module's prefix to its URI, then computes the static context for the complete module.

declare namespace NCName = URILiteral ; PrologDeclList =>_stat statEnv

module namespace NCName = URILiteral PrologDeclList

URILiteral =>_{module_statEnv} statEnv

Note that the rule above and the rules for static processing of an "import module" declaration in [5.11 Module Import] are mutually recursive.

Dynamic Context Processing

The dynamic context processing of a module declaration is similar to that of static processing. The module declaration extends the prolog with a namespace declaration that binds the module's prefix to its URI, then computes the dynamic context for the complete module.

(declare namespace NCName = URILiteral PrologDeclList) =>_dyn dynEnv

module namespace NCName = URILiteral PrologDeclList

URILiteral =>_{module_dynEnv} dynEnv

Note that the rule above and the rules for dynamic processing of an "import module" declaration in [5.11 Module Import] are mutually recursive.
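
As an informal sketch, a small library module might look as follows. The namespace URI and the declared names are hypothetical.

```xquery
(: Binds the prefix m to the module's target namespace, as if by
   "declare namespace m = ..." at the start of the prolog. :)
module namespace m = "http://example.com/math";

declare variable $m:pi as xs:decimal := 3.14159;

declare function m:area($r as xs:decimal) as xs:decimal {
  $m:pi * $r * $r
};
```

Processing this module yields the static and dynamic environments that the module_statEnv and module_dynEnv judgments associate with the URI http://example.com/math.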

5.3 Boundary-space Declaration

[11 (XQuery)] BoundarySpaceDecl^XQ ::= "declare" "boundary-space" ("preserve" | "strip")

The boundary-space declaration is not specified formally, as the Formal Semantics is defined on the Core language, which is an abstract, not concrete, syntax and is typically the result of the parsing phase described in [3.2.1 Processing model].

5.4 Default Collation Declaration

[19 (XQuery)] DefaultCollationDecl^XQ ::= "declare" "default" "collation" URILiteral

Core Grammar

The core grammar production for default collation declarations is:

[18 (Core)] DefaultCollationDecl ::= "declare" "default" "collation" URILiteral

Normalization

Default collation declarations are left unchanged through normalization.

[DefaultCollationDecl]_PrologDecl

DefaultCollationDecl

Static Context Processing

The default collation declaration updates the collations environment component within the static environment. The collations environment component is used by several functions in [Functions and Operators], but is not used in the Formal Semantics.

statEnv.collations(URILiteral) = Collation

statEnv₁ = statEnv + defaultCollation(Collation)

statEnv |- declare default collation URILiteral =>_stat statEnv₁

Dynamic Context Processing

The default collation declaration does not affect the dynamic context.

dynEnv |- declare default collation URILiteral =>_dyn dynEnv

5.5 Base URI Declaration

[20 (XQuery)] BaseURIDecl^XQ ::= "declare" "base-uri" URILiteral

Core Grammar

The core grammar production for base URI declarations is:

[19 (Core)] BaseURIDecl ::= "declare" "base-uri" URILiteral

Normalization

Base URI declarations are left unchanged through normalization.

[BaseURIDecl]_PrologDecl

BaseURIDecl

Static Context Processing

A base URI declaration specifies the base URI property of the static context, which is used when resolving relative URIs within a module. A static error is raised if more than one base URI declaration is declared in a query prolog.

statEnv₁ = statEnv + baseURI(URILiteral)

statEnv |- declare base-uri URILiteral =>_stat statEnv₁

Dynamic Context Processing

The base URI declaration does not affect the dynamic context.

dynEnv |- declare base-uri URILiteral =>_dyn dynEnv

5.6 Construction Declaration

[25 (XQuery)] ConstructionDecl^XQ ::= "declare" "construction" ("strip" | "preserve")

Core Grammar

The core grammar production for construction declarations is:

[24 (Core)] ConstructionDecl ::= "declare" "construction" ("strip" | "preserve")

Normalization

Construction declarations are left unchanged through normalization.

[ConstructionDecl]_PrologDecl

ConstructionDecl

Static Context Processing

The construction declaration modifies the construction mode in the static context.

statEnv₁ = statEnv + constructionMode( ConstructionMode)

statEnv |- declare construction ConstructionMode =>_stat statEnv₁

Dynamic Context Processing

The construction declaration does not have any effect on the dynamic context.

dynEnv |- declare construction ConstructionMode =>_dyn dynEnv

5.7 Ordering Mode Declaration

[14 (XQuery)] OrderingModeDecl^XQ ::= "declare" "ordering" ("ordered" | "unordered")

Core Grammar

The core grammar production for ordering mode declarations is:

[13 (Core)] OrderingModeDecl ::= "declare" "ordering" ("ordered" | "unordered")

Normalization

Ordering mode declarations are left unchanged through normalization.

[OrderingModeDecl]_PrologDecl

OrderingModeDecl

5.8 Empty Order Declaration

[15 (XQuery)] EmptyOrderDecl^XQ ::= "declare" "default" "order" "empty" ("greatest" | "least")

Core Grammar

The core grammar production for empty order declarations is:

[14 (Core)] EmptyOrderDecl ::= "declare" "default" "order" "empty" ("greatest" | "least")

Normalization

Empty order declarations are left unchanged through normalization.

[EmptyOrderDecl]_PrologDecl

EmptyOrderDecl

5.9 Copy-Namespaces Declaration

[16 (XQuery)]	`CopyNamespacesDecl^XQ`	::=	`"declare" "copy-namespaces" PreserveMode "," InheritMode`
[17 (XQuery)]	`PreserveMode^XQ`	::=	`"preserve" \| "no-preserve"`
[18 (XQuery)]	`InheritMode^XQ`	::=	`"inherit" \| "no-inherit"`

Core Grammar

The core grammar productions for copy-namespaces declarations are:

[15 (Core)]	`CopyNamespacesDecl`	::=	`"declare" "copy-namespaces" PreserveMode "," InheritMode`
[16 (Core)]	`PreserveMode`	::=	`"preserve" \| "no-preserve"`
[17 (Core)]	`InheritMode`	::=	`"inherit" \| "no-inherit"`

Normalization

Copy-namespaces declarations are left unchanged through normalization.

[CopyNamespacesDecl]_PrologDecl

CopyNamespacesDecl
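
For reference, the setter declarations described in sections 5.3 through 5.9 can appear together at the start of a prolog, as in the sketch below. The particular settings and the base URI are chosen only for illustration.

```xquery
declare boundary-space strip;
declare default collation "http://www.w3.org/2005/xpath-functions/collation/codepoint";
declare base-uri "http://example.com/data/";   (: hypothetical base URI :)
declare construction preserve;
declare ordering unordered;
declare default order empty least;
declare copy-namespaces preserve, inherit;

<result>{ 1 + 1 }</result>
```

Each of these declarations passes through normalization unchanged and may appear at most once in a prolog.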

5.10 Schema Import

Schema Imports

[21 (XQuery)]	`SchemaImport^XQ`	::=	`"import" "schema" SchemaPrefix? URILiteral ("at" URILiteral ("," URILiteral)*)?`
[22 (XQuery)]	`SchemaPrefix^XQ`	::=	`("namespace" NCName "=") \| ("default" "element" "namespace")`

The semantics of Schema Import is described in terms of the [XPath/XQuery] type system. The process of converting an XML Schema into a sequence of type declarations is described in Section [C Importing Schemas]. This section describes how the resulting sequence of type declarations is added into the static context when the Prolog is processed.

Core Grammar

The core grammar productions for schema imports are:

Schema Imports

[20 (Core)]	`SchemaImport`	::=	`"import" "schema" SchemaPrefix? URILiteral ("at" URILiteral ("," URILiteral)*)?`
[21 (Core)]	`SchemaPrefix`	::=	`("namespace" NCName "=") \| ("default" "element" "namespace")`

Normalization

Schema imports are left unchanged through normalization.

[SchemaImport]_PrologDecl

SchemaImport

Notation

For convenience, we introduce the following auxiliary grammar productions.

Location Hints

[16 (Formal)]	`LocationHints`	::=	`"at" URILiteral ("," URILiteral)*`
[90 (Formal)]	`OptLocationHints`	::=	`LocationHints?`

Notation

The following auxiliary judgments are used when processing schema imports.

The judgment:

statEnv₁ |- Definitions =>_type statEnv₂

holds if under the static environment statEnv₁, the sequence of type definitions Definitions yields the new static environment statEnv₂.

The judgment:

statEnv₁ |- Definition =>_type statEnv₂

holds if under the static environment statEnv₁, the single definition Definition yields the new static environment statEnv₂.

Static Context Processing

A schema imported into a query is first mapped into the [XPath/XQuery] type system, which yields a sequence of XQuery type definitions. The rules for mapping the imported schema begin in [C.2 Schemas as a whole]. Each type definition in an imported schema is then added to the static environment.

Definitions = [schema URILiteral OptLocationHints]_Schema

statEnv |- Definitions =>_type statEnv₁

statEnv |- import schema URILiteral OptLocationHints =>_stat statEnv₁

The schema import declaration may also assign an element/type namespace prefix to the URI of the imported schema, or assign the default element namespace to the URI of the imported schema.

Definitions = [schema URILiteral OptLocationHints]_Schema

statEnv |- Definitions =>_type statEnv₁

statEnv₂ = statEnv₁ + namespace(NCName => (passive, URILiteral))

statEnv |- import schema namespace NCName = URILiteral OptLocationHints =>_stat statEnv₂

Definitions = [schema URILiteral OptLocationHints]_Schema

statEnv |- Definitions =>_type statEnv₁

statEnv₂ = statEnv₁ + default_elem_namespace( URILiteral)

statEnv |- import schema default element namespace URILiteral OptLocationHints =>_stat statEnv₂

An empty sequence of type definitions yields the input environment.

statEnv |- () =>_type statEnv

Each type definition is added into the static environment.

statEnv |- Definitions =>_type statEnv₁

statEnv₁ |- Definition₁ =>_type statEnv₂

statEnv |- Definition₁ Definitions =>_type statEnv₂

Each type, element, or attribute declaration is added respectively to the type, element and attribute declarations components of the static environment.

statEnv |- TypeName of elem/type expands to expanded-QName

statEnv₁ = statEnv + typeDefn(expanded-QName => define type TypeName TypeDerivation )

statEnv |- define type TypeName TypeDerivation =>_type statEnv₁

statEnv |- ElementName of elem/type expands to expanded-QName

statEnv₁ = statEnv + elemDecl(expanded-QName => define element ElementName OptSubstitution OptNillable TypeReference)

statEnv |- define element ElementName OptSubstitution OptNillable TypeReference =>_type statEnv₁

statEnv |- AttributeName of attr expands to expanded-QName

statEnv₁ = statEnv + attrDecl(expanded-QName => define attribute AttributeName TypeReference)

statEnv |- define attribute AttributeName TypeReference =>_type statEnv₁

Note that it is a static error to import two schemas that both define the same name in the same symbol space and in the same scope; that is, multiple top-level definitions of the same type, element, or attribute name raise a static error. For instance, a query may not import two schemas that include top-level element declarations for two elements with the same expanded name.

Dynamic Context Processing

The schema import declarations do not affect the dynamic context.

dynEnv |- SchemaImport =>_dyn dynEnv
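
The sketch below shows two of the import forms covered by the rules above. The namespace URIs, the location hint, the document name, and the element name po:order are hypothetical.

```xquery
(: Adds the schema's definitions and binds the prefix po to its namespace. :)
import schema namespace po = "http://example.com/po" at "po.xsd";

(: Adds the schema's definitions and makes its namespace the default
   element namespace for the rest of the module. :)
import schema default element namespace "http://example.com/inv";

(: po:order is assumed to be a top-level element declaration of the first schema. :)
count(doc("orders.xml")//schema-element(po:order))
```

The test schema-element(po:order) is only valid because the first import added the declaration of po:order to the element declarations component of the static environment.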

5.11 Module Import

[23 (XQuery)] ModuleImport^XQ ::= "import" "module" ("namespace" NCName "=")? URILiteral ("at" URILiteral ("," URILiteral)*)?

Introduction

The effect of an "import module" declaration is to extend the importing module's dynamic (and static) context with the global variables (and their types) and the functions (and their signatures) of the imported module. Module import is not transitive: only the global variables and functions declared explicitly in the imported module are available in the importing module. Also, module import does not import schemas; therefore, the importing module must explicitly import any schemas on which the imported global variables or functions depend.
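
As an informal sketch, the main module below imports a hypothetical library module. It assumes that a module with target namespace http://example.com/math exists at the location hint math.xq and that it declares a variable $m:pi and a function m:area, both of type xs:decimal.

```xquery
(: Hypothetical module URI and location hint. :)
import module namespace m = "http://example.com/math" at "math.xq";

(: Both names are available because they are declared in the imported module itself. :)
m:area(2.0) + $m:pi
```

Any variables or functions that the imported module itself obtained through its own imports would not be visible here.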

Core Grammar

The core grammar production for module imports is:

Module Import

[22 (Core)] ModuleImport ::= "import" "module" ("namespace" NCName "=")? URILiteral ("at" URILiteral ("," URILiteral)*)?

Normalization

Module imports are left unchanged through normalization.

[ModuleImport]_PrologDecl

ModuleImport

Notation

The rules below depend on the following auxiliary functions which are used to import the proper fragment of the static context.

The function fs:local-variables(statEnv, URI) returns all the (expanded-QName, Type) pairs in statEnv.varType such that the URI part of the variable's expanded-QName equals the given URI, that is, the variables that are declared locally in the module with the given namespace URI.

The function fs:local-functions(statEnv, URI) returns all the function signatures in statEnv.funcType such that the URI part of the function's expanded-QName equals the given URI, that is, the function signatures that are declared locally in the module with the given namespace URI.

Notation

The following auxiliary judgment is used to extend a given static environment with the static environment from an imported module.

The judgment

statEnv₁ extended with static environment statEnv₂ yields statEnv₃ for uri URILiteral

holds if extending the environment statEnv₁ with the environment statEnv₂ yields the environment statEnv₃ under the given namespace uri URILiteral.

This judgment is defined as follows.

statEnv₃ = statEnv₁ + varType(fs:local-variables(statEnv₂, URILiteral))

statEnv₄ = statEnv₃ + localFunc(fs:local-functions(statEnv₂, URILiteral))

statEnv₁ extended with static environment statEnv₂ yields statEnv₄ for uri URILiteral
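As an informal illustration of the judgment above, the following Python sketch models a static environment as two dictionaries keyed by expanded QNames. The class StatEnv, the helper names, and the representation of an expanded QName as a (namespace URI, local name) pair are assumptions made for this sketch; they are not part of the formal semantics.

```python
from dataclasses import dataclass, field


# Hypothetical model: an expanded QName is a (namespace URI, local name) pair.
@dataclass
class StatEnv:
    var_type: dict = field(default_factory=dict)   # expanded QName -> type
    func_type: dict = field(default_factory=dict)  # (expanded QName, arity) -> signature


def local_variables(env, uri):
    """Variables of env whose expanded QName lies in the namespace uri."""
    return {name: t for name, t in env.var_type.items() if name[0] == uri}


def local_functions(env, uri):
    """Function signatures of env whose expanded QName lies in the namespace uri."""
    return {key: sig for key, sig in env.func_type.items() if key[0][0] == uri}


def extend_static(env1, env2, uri):
    """Return env1 extended with the declarations of env2 that are local to uri."""
    return StatEnv(
        var_type={**env1.var_type, **local_variables(env2, uri)},
        func_type={**env1.func_type, **local_functions(env2, uri)},
    )
```

Because only names in the imported module's own namespace are copied, anything that the imported module itself imported from another namespace is not carried over, which reflects the non-transitivity of module import.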

Notation

The rules below depend on the following auxiliary judgments.

This judgment adds each variable explicitly declared in the imported module to the importing module's dynamic variable environment.

dynEnv₂ = dynEnv₁ + varValue(expanded-QName₁ => #IMPORTED(URI))

dynEnv₂ ; URI |- (expanded-QName₂, Type₂), ···, (expanded-QName_n, Type_n) =>_{import_variables} dynEnv₃

dynEnv₁ ; URI |- (expanded-QName₁, Type₁), ···, (expanded-QName_n, Type_n) =>_{import_variables} dynEnv₃

This judgment adds each function explicitly declared in the imported module to the importing module's dynamic function environment.

dynEnv₂ = dynEnv₁ + funcDefn((expanded-QName₁(Type_1,1, ..., Type_1,n)) => #IMPORTED(URI))

dynEnv₂ ; URI |- (expanded-QName₂(Type_2,1, ..., Type_2,n)), ···, (expanded-QName_k(Type_k,1, ..., Type_k,n)) =>_{import_functions} dynEnv₃

dynEnv₁ ; URI |- (expanded-QName₁(Type_1,1, ..., Type_1,n)), ···, (expanded-QName_k(Type_k,1, ..., Type_k,n)) =>_{import_functions} dynEnv₃

Notation

The following auxiliary judgment is used to extend a given dynamic environment with the dynamic environment from an imported module.

The judgment

dynEnv₁ extended with dynamic environment dynEnv₂ yields dynEnv₃ for uri URILiteral

holds if extending the dynamic environment dynEnv₁ with the dynamic environment dynEnv₂ yields the dynamic environment dynEnv₃ under the given namespace uri URILiteral.

This judgment is defined as follows.

dynEnv₁ ; URILiteral |- fs:local-variables(statEnv₂, URILiteral) =>_{import_variables} dynEnv₃

dynEnv₃ ; URILiteral |- fs:local-functions(statEnv₂, URILiteral) =>_{import_functions} dynEnv₄

dynEnv₁ extended with dynamic environment dynEnv₂ yields dynEnv₄ for uri URILiteral
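The import_variables and import_functions judgments above walk the list of imported names one at a time. A minimal Python sketch of the same idea, written as a loop rather than a recursive rule, is given below. The function name import_names and the tuple used to stand for #IMPORTED(URI) are assumptions made for this sketch.

```python
def import_names(dyn_env, names, uri):
    """Return a copy of dyn_env in which every imported name is bound to a
    marker recording the module (identified by uri) that defines it."""
    extended = dict(dyn_env)
    for name in names:
        # ("#IMPORTED", uri) is a stand-in for the special value #IMPORTED(URI).
        extended[name] = ("#IMPORTED", uri)
    return extended
```

The same helper can be applied first to the imported variables and then to the imported functions, mirroring the two premises of the rule.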

Static Context Processing

The first set of premises below "looks up" the static contexts of all the imported modules, as defined in [5.2 Module Declaration]. The second set of premises extends the input static context with the global variables and function signatures declared in the imported static contexts.

URILiteral₁ =>_{module_statEnv} statEnv₁

...

URILiteral₁ =>_{module_statEnv} statEnv_n

statEnv extended with static environment statEnv₁ yields statEnv₁' for uri URILiteral

...

statEnv_n-1 extended with static environment statEnv_n yields statEnv_n' for uri URILiteral

statEnv |- import module URILiteral₁ LocationHints? =>_stat statEnv_n'

URILiteral₁ =>_{module_statEnv} statEnv₁

...

URILiteral₁ =>_{module_statEnv} statEnv_n

statEnv extended with static environment statEnv₁ yields statEnv₁' for uri URILiteral

...

statEnv_n-1 extended with static environment statEnv_n yields statEnv_n' for uri URILiteral

statEnv' = statEnv_n' + namespace(NCName => (passive, URILiteral))

statEnv |- import module namespace NCName = URILiteral₁ LocationHints? =>_stat statEnv'

Note that the rules above and the rules for processing a library module in [5.2 Module Declaration] above are mutually recursive. It is possible to define the semantics in that way, since XQuery forbids the use of recursive modules.

Dynamic Context Processing

During dynamic context processing, each variable and function name is mapped to the special value #IMPORTED(URI) to indicate that the variable or function is defined in the imported module with the given URI.

The first set of premises below "looks up" the dynamic contexts of all the imported modules, as defined in [5.2 Module Declaration]. The second set of premises extends the input dynamic context with the global variables and functions declared in the imported dynamic contexts.

URILiteral =>_{module_dynEnv} dynEnv₁

...

URILiteral =>_{module_dynEnv} dynEnv_n

dynEnv extended with dynamic environment dynEnv₁ yields dynEnv₁' for uri URILiteral

...

dynEnv_n-1 extended with dynamic environment dynEnv_n yields dynEnv_n' for uri URILiteral

dynEnv |- import module (namespace NCName =)? URILiteral LocationHints? =>_dyn dynEnv_n'

Note that the rule above and the rules for processing a library module in [5.2 Module Declaration] above are mutually recursive. It is possible to define the semantics in that way, since XQuery forbids the use of recursive modules.

5.12 Namespace Declaration

[10 (XQuery)] NamespaceDecl^XQ ::= "declare" "namespace" NCName "=" URILiteral

Core Grammar

The core grammar production for namespace declarations is:

[10 (Core)] NamespaceDecl ::= "declare" "namespace" NCName "=" URILiteral

Normalization

Namespace declarations are left unchanged through normalization.

[NamespaceDecl]_PrologDecl

NamespaceDecl

Static Context Processing

A namespace declaration adds a new (prefix,uri) binding in the namespace component of the static environment. All namespace declarations in the prolog are passive declarations. Namespace declaration attributes of element constructors are active declarations.

statEnv₁ = statEnv + namespace(NCName => (passive, URILiteral))

statEnv |- declare namespace NCName = URILiteral =>_stat statEnv₁

Dynamic Context Processing

The namespace declaration does not affect the dynamic context.

dynEnv |- declare namespace NCName = URILiteral =>_dyn dynEnv

5.13 Default Namespace Declaration

[12 (XQuery)] DefaultNamespaceDecl^XQ ::= "declare" "default" ("element" | "function") "namespace" URILiteral

Core Grammar

The core grammar production for default namespace declarations is:

[11 (Core)] DefaultNamespaceDecl ::= "declare" "default" ("element" | "function") "namespace" URILiteral

Normalization

Default namespace declarations are left unchanged through normalization.

[DefaultNamespaceDecl]_PrologDecl

DefaultNamespaceDecl

Static Context Processing

A default element namespace declaration changes the default element namespace prefix binding in the namespace component of the static environment. If the URI literal is the zero-length string, the default element namespace is set to the null namespace.

statEnv₁ = statEnv + default_elem_namespace(#NULL-NAMESPACE)

statEnv |- declare default element namespace "" =>_stat statEnv₁

not(URILiteral = "") statEnv₁ = statEnv + default_elem_namespace( URILiteral)

statEnv |- declare default element namespace URILiteral =>_stat statEnv₁

A default function namespace declaration changes the default function namespace prefix binding in the namespace component of the static environment. If the URI literal is the zero-length string, the default function namespace is set to the null namespace.

statEnv₁ = statEnv + default_function_namespace(#NULL-NAMESPACE)

statEnv |- declare default function namespace "" =>_stat statEnv₁

not(URILiteral = "") statEnv₁ = statEnv + default_function_namespace( URILiteral)

statEnv |- declare default function namespace URILiteral =>_stat statEnv₁
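The four rules above differ only in which component is updated and in how the zero-length string is treated. The Python sketch below makes that case split explicit. The dictionary representation of the static environment and the use of None for #NULL-NAMESPACE are assumptions made for this sketch.

```python
NULL_NAMESPACE = None  # stand-in for #NULL-NAMESPACE in this sketch


def declare_default_namespace(stat_env, kind, uri_literal):
    """Return a copy of stat_env with the default element or function
    namespace updated; a zero-length URI literal selects the null namespace."""
    if kind == "element":
        key = "default_elem_namespace"
    elif kind == "function":
        key = "default_function_namespace"
    else:
        raise ValueError("kind must be 'element' or 'function'")
    updated = dict(stat_env)
    updated[key] = NULL_NAMESPACE if uri_literal == "" else uri_literal
    return updated
```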

Note that multiple declarations of the same namespace prefix in the Prolog result in a static error. However, a declaration of a namespace in the Prolog can override a prefix that has been predeclared in the static context.

Dynamic Context Processing

Default namespace declarations do not affect the dynamic context.

dynEnv |- DefaultNamespaceDecl =>_dyn dynEnv

5.14 Variable Declaration

[24 (XQuery)] VarDecl^XQ ::= "declare" "variable" "$" QName TypeDeclaration? ((":=" ExprSingle) | "external")

Core Grammar

The core grammar production for variable declarations is:

[23 (Core)] VarDecl ::= "declare" "variable" "$" QName TypeDeclaration? ((":=" ExprSingle) | "external")

Normalization

Normalization of a variable declaration normalizes the variable and its corresponding expression, if it is present.

[ declare variable VarRef as SequenceType := Expr ]_PrologDecl

declare variable VarRef as SequenceType := [Expr]_Expr

If an external variable declaration does not have a type declaration, it is treated as if the type declaration were item()*.

[ declare variable VarRef external ]_PrologDecl

declare variable VarRef as item()* external

[ declare variable VarRef as SequenceType external ]_PrologDecl

declare variable VarRef as SequenceType external

Static Context Processing

A variable declaration updates the variable component of the static context by associating the given variable with a static type.

If a variable declaration has an associated expression but does not have a type declaration, the static type of the variable is the static type of the expression.

statEnv |- VarRef of var expands to Variable

statEnv |- Expr : Type

statEnv₁ = statEnv + varType( Variable => Type)

statEnv |- declare variable VarRef := Expr =>_stat statEnv₁

If the variable declaration has an associated expression and has a type declaration, the static type of the variable is the specified type. The type of the expression must be a subtype of the declared type.

statEnv |- VarRef of var expands to Variable

statEnv |- Type = [SequenceType]_sequencetype

statEnv |- Expr : Type₂

statEnv |- Type₂ <: Type

statEnv₁ = statEnv + varType( Variable => Type)

statEnv |- declare variable VarRef as SequenceType := Expr =>_stat statEnv₁
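The two rules for variable declarations with an initializing expression can be sketched in Python as follows. The subtype check here walks a small, hypothetical derivation table and stands in for the full subtyping judgment; the function names and the dictionary model of varType are likewise assumptions made for this sketch.

```python
# Toy derivation chain (child -> parent); a real implementation would use
# the complete subtyping judgment rather than this table.
PARENT = {"xs:integer": "xs:decimal", "xs:decimal": "xdt:anyAtomicType"}


def is_subtype(actual, expected):
    """True if actual equals expected or is derived from it in PARENT."""
    t = actual
    while t is not None:
        if t == expected:
            return True
        t = PARENT.get(t)
    return False


def declare_variable(var_type, name, expr_type, declared_type=None):
    """Return var_type extended with a binding for name.

    Without a declared type the variable takes the type of its expression;
    with one, the expression type must be a subtype of the declared type."""
    if declared_type is None:
        bound = expr_type
    elif is_subtype(expr_type, declared_type):
        bound = declared_type
    else:
        raise TypeError(f"{expr_type} is not a subtype of {declared_type}")
    return {**var_type, name: bound}
```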

If the variable declaration is external and has a type declaration, the static type of the variable is the specified type.

statEnv |- VarRef of var expands to Variable

statEnv |- Type = [SequenceType]_sequencetype

statEnv₁ = statEnv + varType( Variable => Type)

statEnv |- declare variable VarRef as SequenceType external =>_stat statEnv₁

Dynamic Context Processing

To evaluate a variable declaration, its associated expression is evaluated, and the dynamic context is updated with the variable bound to the resulting value.

dynEnv |- Expr => Value

statEnv |- VarRef of var expands to Variable

dynEnv₁ = dynEnv + varValue( Variable => Value)

dynEnv |- declare variable VarRef as SequenceType := Expr =>_dyn dynEnv₁

Dynamic evaluation does not apply to externally defined variables. The dynamic environment must provide the values of external variables in the initial dynamic context (dynEnvDefault).

dynEnv |- declare variable VarRef as SequenceType external =>_dyn dynEnv

5.15 Function Declaration

Introduction

User-defined functions specify the name of the function, the names and types of the parameters, and the type of the result. The function body defines how the result of the function is computed from its parameters.

Function declarations

[26 (XQuery)] FunctionDecl^XQ ::= "declare" "function" QName "(" ParamList? ")" ("as" SequenceType)? (EnclosedExpr | "external")
[27 (XQuery)] ParamList^XQ ::= Param ("," Param)*
[28 (XQuery)] Param^XQ ::= "$" QName TypeDeclaration?

Core Grammar

The core grammar productions for function declarations are:

Function declarations

[25 (Core)] FunctionDecl ::= "declare" "function" QName "(" ParamList? ")" ("as" SequenceType)? (EnclosedExpr | "external")
[26 (Core)] ParamList ::= Param ("," Param)*
[27 (Core)] Param ::= "$" QName TypeDeclaration?

Notation

The following auxiliary mapping rule is used for the normalization of parameters in function declarations: []_Param.

Parameters without a declared type are given the item()* sequence type.

[VarRef]_Param

VarRef as item()*

[VarRef as SequenceType ]_Param

VarRef as SequenceType

Normalization

The parameter list and body of a user-defined function are all normalized into Core expressions.

[ declare function QName ( ParamList? ) as SequenceType EnclosedExpr ]_PrologDecl

declare function QName ( [ParamList?]_Param ) as SequenceType [EnclosedExpr]_Expr

If the return type of the function is not provided, it is given the item()* sequence type.

[ declare function QName ( ParamList? ) EnclosedExpr ]_PrologDecl

declare function QName ( [ParamList?]_Param ) as item()* [EnclosedExpr]_Expr

Externally defined functions are normalized similarly.

[ declare function QName ( ParamList? ) as SequenceType external]_PrologDecl

declare function QName( [ParamList?]_Param ) as SequenceType external

[declare function QName ( ParamList? ) external ]_PrologDecl

declare function QName ( [ParamList?]_Param ) as item()* external

Static Context Processing

Because functions are mutually referential, all function signatures must be defined in the static environment before static type analysis is applied to the function bodies. This rule also updates the local functions component of the static context to indicate the function is declared within the given module.

statEnv |- QName of func expands to expanded-QName

statEnv₁ = statEnv + funcType(expanded-QName => FunctionDecl)

statEnv₁ |- FunctionDecl : Type_r

statEnv |- FunctionDecl =>_stat statEnv₁

Note that static context processing performs type checking of the function, as defined below. Note also that type checking is done in the new environment to which the function declaration has been added, which ensures that recursive calls are type-checked properly.

Static Type Analysis

The static typing rules for function bodies follow normalization and processing of the static context. The typing rule below constructs a new environment in which each parameter variable has its declared type; the static type of the function's body is then computed under the new environment. The function body's type must be a subtype of the expected return type. If type checking fails, a static type error is raised. Otherwise, static typing of the function has no other effect, as function signatures are already inside the static environment.

statEnv |- VarRef₁ of var expands to Variable₁

...

statEnv |- VarRef_n of var expands to Variable_n

statEnv |- [SequenceType₁]_sequencetype = Type₁

...

statEnv |- [SequenceType_n]_sequencetype = Type_n

statEnv |- [SequenceType_r]_sequencetype = Type_r

statEnv + varType( Variable₁ => Type₁ ;...; Variable_n => Type_n ) |- Expr : Type

statEnv |- Type <: Type_r

statEnv |- declare function QName (VarRef₁ as SequenceType₁, ···, VarRef_n as SequenceType_n) as SequenceType_r { Expr } : Type_r

The bodies of external functions are not available and therefore cannot be type checked. To ensure type soundness, the implementation must guarantee that the value returned by the external function matches the expected return type.

statEnv |- VarRef₁ of var expands to Variable₁

...

statEnv |- VarRef_n of var expands to Variable_n

statEnv |- [SequenceType₁]_sequencetype = Type₁

...

statEnv |- [SequenceType_n]_sequencetype = Type_n

statEnv |- [SequenceType_r]_sequencetype = Type_r

statEnv |- declare function QName ( VarRef₁ as SequenceType₁ , ···, VarRef_n as SequenceType_n ) as SequenceType_r external : Type_r

Dynamic Context Processing

A function declaration updates the dynamic context. The function name with arity N is associated with the given function body. The number of arguments is required, because XQuery permits overloading of function names as long as each function signature has a different number of arguments.

statEnv |- QName of func expands to expanded-QName

statEnv |- VarRef₁ of var expands to Variable₁

...

statEnv |- VarRef_n of var expands to Variable_n

statEnv |- [SequenceType₁]_sequencetype = Type₁

...

statEnv |- [SequenceType_n]_sequencetype = Type_n

dynEnv₁ = dynEnv + funcDefn(expanded-QName(Type₁,...,Type_n) => ( Expr , Variable₁ , ···, Variable_n))

dynEnv |- declare function QName ( VarRef₁ as SequenceType₁, ···, VarRef_n as SequenceType_n ) as SequenceType_r { Expr } =>_dyn dynEnv₁

An external function declaration does not affect the dynamic environment. The implementation must support the declared external functions.

dynEnv |- declare function QName ( VarRef₁ as SequenceType₁, ···, VarRef_n as SequenceType_n ) as SequenceType_r external =>_dyn dynEnv

The dynamic semantics of a function body are applied when the function is called and are described in [4.1.5 Function Calls].
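To illustrate why the number of arguments is part of the key in the function environment, the following Python sketch stores each definition under the pair (name, arity). Keying on arity alone, rather than on the parameter types that appear in the rule, is a simplification made for this sketch, and the helper names are hypothetical.

```python
def declare_function(func_defn, name, params, body):
    """Return func_defn extended with a definition keyed by (name, arity)."""
    return {**func_defn, (name, len(params)): (body, tuple(params))}


def lookup_function(func_defn, name, arg_count):
    """Select the definition whose arity matches the number of arguments."""
    return func_defn[(name, arg_count)]
```

Two declarations that share a name but differ in the number of parameters occupy distinct entries, so a call with one argument and a call with two arguments resolve to different bodies.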

5.16 Option Declaration

[13 (XQuery)] OptionDecl^XQ ::= "declare" "option" QName StringLiteral

Core Grammar

The core grammar production for option declarations is:

[12 (Core)] OptionDecl ::= "declare" "option" QName StringLiteral

Normalization

Option declarations are left unchanged through normalization.

[OptionDecl]_PrologDecl

OptionDecl

6 Conformance

The XQuery Formal Semantics is intended primarily as a component that can be used by [XQuery 1.0: An XML Query Language], or a host language of [XML Path Language (XPath) 2.0]. Therefore, the XQuery Formal Semantics relies on specifications that use it (such as [XPath 2.0], [XSLT 2.0], and [XQuery]) to specify conformance criteria in their respective environments. Specifications that set conformance criteria for their use of the formal semantics must not relax the constraints expressed in this specification.

6.1 Static Typing Feature

This specification normatively defines the static typing feature which can be used in [XQuery 1.0: An XML Query Language] or a host language of [XML Path Language (XPath) 2.0]. The static typing feature is specified using the static typing judgment introduced in [3.2.3 Static typing judgment].

6.1.1 Static Typing Extensions

In some cases, the static typing rules are not very precise (see, for example, the type inference rules for the ancestor axes—parent, ancestor, and ancestor-or-self—and for the function fn:root). If an implementation supports a static typing extension, it must always provide a more precise type than the one defined in this specification.

This constraint is formally expressed as follows. A static typing extension Expr :_ext Type must be such that for every expression Expr the following holds.

statEnv |- Expr : Type

statEnv |- Type' <: Type

statEnv |- Expr :_ext Type'

Note:

It is not recommended for a static typing extension to change the static typing behavior of expressions that specify a type explicitly (treat as, cast as, typeswitch, function parameters, and type declarations in variable bindings), since the purpose of those expressions is to impose a specific type.

7 Additional Semantics of Functions

This section defines the auxiliary functions required to define the formal semantics of [XPath/XQuery], and gives special normalization and static typing rules for some functions in [Functions and Operators].

Remember from [4.1.5 Function Calls] that the following rules operate after namespace resolution for the function name, and directly over the input type of the parameters. In the rest of the section, we will use the following shortcut notations for specific relevant URIs:

FN-URI for functions from the [Functions and Operators] document.
OP-URI for operators from the [Functions and Operators] document.
FS-URI for formal semantics functions.

7.1 Formal Semantics Functions

Introduction

This section gives the definition and semantics of functions that are used in the formal semantics but are not in [Functions and Operators]. Their dynamic semantics are defined in the same informal style as in the [Functions and Operators] document. The static semantics of some formal-semantics functions require custom typing rules.

7.1.1 The `fs:convert-operand` function

fs:convert-operand($actual as item?, $expected as xdt:anyAtomicType) as xdt:anyAtomicType ?

The formal-semantics function fs:convert-operand converts the operands of arithmetic and comparison operators as follows:

If $actual is the empty sequence, returns the empty sequence.
If $actual is of type xdt:untypedAtomic, then
1. if $expected is of type xdt:untypedAtomic, returns $actual cast to xs:string;
2. if $expected is of numeric type, returns $actual cast to xs:double;
3. otherwise returns $actual cast to the type of $expected.
Otherwise, $actual is returned unchanged.
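The case analysis above can be sketched in Python as follows. Atomic values are modelled as (type name, value) pairs and the empty sequence as None; the set NUMERIC lists only the four primitive numeric types, and the cast helper is a toy that parses only xs:double. All of these are assumptions made for this sketch.

```python
# Hypothetical, incomplete set of numeric type names.
NUMERIC = {"xs:integer", "xs:decimal", "xs:float", "xs:double"}


def cast(value, target):
    """Toy cast: retag the value, parsing the lexical form only for xs:double."""
    lexical = value[1]
    if target == "xs:double":
        return (target, float(lexical))
    return (target, lexical)


def convert_operand(actual, expected):
    """Sketch of fs:convert-operand over (type name, value) pairs."""
    if actual is None:                       # empty sequence
        return None
    if actual[0] != "xdt:untypedAtomic":     # typed operands are unchanged
        return actual
    if expected[0] == "xdt:untypedAtomic":
        return cast(actual, "xs:string")
    if expected[0] in NUMERIC:
        return cast(actual, "xs:double")
    return cast(actual, expected[0])
```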

Static Type Analysis

No conversion is needed for numeric (or empty) operands.

statEnv |- Type₁ <: (xs:decimal|xs:float|xs:double)?

statEnv |- (FS-URI,"convert-operand")(Type₁, Type₂) : Type₁

Pairs of untyped atomic operands are converted to strings.

statEnv |- Type₁ <: xdt:untypedAtomic ?

statEnv |- Type₂ <: xdt:untypedAtomic

statEnv |- (FS-URI,"convert-operand")(Type₁, Type₂) : xs:string · quantifier (Type₁)

When an untyped operand is paired with a numeric operand, it is converted to xs:double.

statEnv |- Type₁ <: xdt:untypedAtomic ?

statEnv |- Type₂ <: fs:numeric

statEnv |- (FS-URI,"convert-operand")(Type₁, Type₂) : xs:double · quantifier (Type₁)

Finally, an untyped atomic operand not dealt with by the above rules is converted to the type of the other operand.

statEnv |- Type₁ <: xdt:untypedAtomic ?

statEnv |- Type₂ <: xdt:anyAtomicType

statEnv |- not(Type₂ <: (xdt:untypedAtomic|fs:numeric))

statEnv |- (FS-URI,"convert-operand")(Type₁, Type₂) : Type₂ · quantifier(Type₁)

7.1.2 The `fs:convert-simple-operand` function

fs:convert-simple-operand($actual as item *, $expected as xdt:anyAtomicType) as xdt:anyAtomicType *

The formal-semantics function fs:convert-simple-operand is used to convert the value of the $actual argument such that it matches the type of the $expected argument (or matches a sequence of that type).

The dynamic semantics of this function are as follows:

Each item in the $actual argument that is of type xdt:untypedAtomic is cast to the type of the $expected argument, and the resulting sequence is returned.
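As a rough illustration, the per-item conversion can be written in Python over a list of (type name, lexical value) pairs. Retagging the pair stands in for a real cast, and leaving items of other types untouched is how this sketch reads the rule; both are assumptions.

```python
def convert_simple_operand(actual, expected_type):
    """Retag every xdt:untypedAtomic item with expected_type; keep the rest."""
    return [
        (expected_type, lexical) if type_name == "xdt:untypedAtomic"
        else (type_name, lexical)
        for type_name, lexical in actual
    ]
```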

Static Type Analysis

The following static semantics rules correspond to the dynamic semantics rules given above.

statEnv |- Type₂ <: xdt:anyAtomicType

Type₃ = convert_untypedAtomic(prime(Type₁), Type₂)

statEnv |- (FS-URI,"convert-simple-operand")(Type₁, Type₂) : Type₃ · quantifier(Type₁)

7.1.3 The `fs:distinct-doc-order` function

fs:distinct-doc-order($nodes as node *) as node *

The fs:distinct-doc-order function sorts its input sequence of nodes by document order and removes duplicates.

Static Type Analysis

The fs:distinct-doc-order function expects a sequence of nodes as input. The resulting type is computed using prime and quantifier, which are defined in [8.4 Judgments for FLWOR and other expressions on sequences].

statEnv |- (FS-URI,"distinct-doc-order") ( Type ) : prime(Type) · quantifier(Type)

7.1.4 The `fs:distinct-doc-order-or-atomic-sequence` function

fs:distinct-doc-order-or-atomic-sequence($item as item *) as item*

The fs:distinct-doc-order-or-atomic-sequence function operates on either a homogeneous sequence of nodes or a homogeneous sequence of atomic values. If the input is a sequence of nodes, it sorts those nodes by document order and removes duplicates, using the fs:distinct-doc-order function. If it is a sequence of atomic values, it returns it unchanged.
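The behaviour described above can be sketched in Python. Nodes are modelled by a hypothetical Node class whose order field stands for both node identity and position in document order; anything that is not a Node is treated as an atomic value. Raising an error for a mixed sequence is an assumption of this sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    order: int  # hypothetical stand-in for identity and document position
    name: str


def distinct_doc_order(nodes):
    """Sort nodes by document order and drop duplicate nodes."""
    seen = set()
    result = []
    for node in sorted(nodes, key=lambda n: n.order):
        if node.order not in seen:
            seen.add(node.order)
            result.append(node)
    return result


def distinct_doc_order_or_atomic_sequence(items):
    """Nodes are sorted and de-duplicated; atomic values pass through."""
    if all(isinstance(i, Node) for i in items):
        return distinct_doc_order(items)
    if not any(isinstance(i, Node) for i in items):
        return list(items)
    raise TypeError("mixed sequence of nodes and atomic values")
```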

Static Type Analysis

The fs:distinct-doc-order-or-atomic-sequence function expects either a sequence of nodes or a sequence of atomic values as input. The resulting type is computed using prime and quantifier, which are defined in [8.4 Judgments for FLWOR and other expressions on sequences].

statEnv |- Type <: node*

statEnv |- (FS-URI,"distinct-doc-order-or-atomic-sequence") ( Type ) : prime(Type) · quantifier(Type)

statEnv |- Type <: xdt:anyAtomicType*

statEnv |- (FS-URI,"distinct-doc-order-or-atomic-sequence") ( Type ) : Type

7.1.5 The `fs:item-sequence-to-node-sequence` function

fs:item-sequence-to-node-sequence($items as item *) as node *

The fs:item-sequence-to-node-sequence function converts a sequence of item values to nodes by applying the normative rules in Section 3.7.3.1 Computed Element Constructors^XQ.

Static Type Analysis

statEnv |- (FS-URI,"item-sequence-to-node-sequence") (Type) : attribute*, (element|text|PI|comment)*

7.1.6 The `fs:item-sequence-to-untypedAtomic` function

Introduction

fs:item-sequence-to-untypedAtomic($items as item *) as xdt:untypedAtomic

The fs:item-sequence-to-untypedAtomic function converts a sequence of item values to a string of type xdt:untypedAtomic by applying the normative rules in Section 3.7.3.2 Computed Attribute Constructors^XQ.

Dynamic Evaluation

If the input of the fs:item-sequence-to-untypedAtomic function is an empty sequence, it returns a zero-length string. Otherwise, each atomic value in the input sequence is cast into a string. The individual strings resulting from the previous step are merged into a single string by concatenating them with a single space character between each pair.

Static Type Analysis

There are no special static typing rules for this function.

7.1.7 The `fs:item-sequence-to-untypedAtomic-PI` function

Introduction

fs:item-sequence-to-untypedAtomic-PI($items as item *) as xdt:untypedAtomic

The fs:item-sequence-to-untypedAtomic-PI function converts a sequence of item values to a string of type xdt:untypedAtomic by applying the normative rules in Section 3.7.3.5 Computed Processing Instruction Constructors^XQ.

Dynamic Evaluation

If the input is an empty sequence, the fs:item-sequence-to-untypedAtomic-PI function returns a zero-length string. Otherwise, each atomic value in the input sequence is cast into a string. If any of the resulting strings contains the string "?>", a dynamic error is raised. The individual strings resulting from the previous step are merged into a single string by concatenating them with a single space character between each pair. Leading whitespace is removed from the resulting string.
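The steps above can be sketched in Python as follows. Python's str() stands in for the cast to a string and lstrip() for the removal of leading whitespace, so the sketch only approximates the XML notions of string value and whitespace; the function name and the choice of ValueError for the dynamic error are assumptions.

```python
def item_sequence_to_untyped_atomic_pi(items):
    """Build processing-instruction content from a sequence of atomic values."""
    strings = [str(item) for item in items]       # approximate cast to string
    for s in strings:
        if "?>" in s:
            raise ValueError('content must not contain "?>"')
    # Join with single spaces, then drop leading whitespace.
    return " ".join(strings).lstrip()
```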

Static Type Analysis

There are no special static typing rules for this function.

7.1.8 The `fs:item-sequence-to-untypedAtomic-text` function

Introduction

fs:item-sequence-to-untypedAtomic-text($items as item *) as xdt:untypedAtomic?

The fs:item-sequence-to-untypedAtomic-text function converts a sequence of item values to a string of type xdt:untypedAtomic, or empty, by applying the rules in Section 3.7.3.4 Text Node Constructors^XQ.

Dynamic Evaluation

If the input is the empty sequence, the fs:item-sequence-to-untypedAtomic-text function returns the empty sequence. Otherwise, each atomic value in the input sequence is cast into a string. The individual strings resulting from the previous step are merged into a single string by concatenating them with a single space character between each pair.

Static Type Analysis

There are no special static typing rules for this function.

7.1.9 The `fs:item-sequence-to-untypedAtomic-comment` function

Introduction

fs:item-sequence-to-untypedAtomic-comment($items as item *) as xdt:untypedAtomic

The fs:item-sequence-to-untypedAtomic-comment function converts a sequence of item values to a string of type xdt:untypedAtomic by applying the normative rules in Section 3.7.3.6 Computed Comment Constructors^XQ.

Dynamic Evaluation

If the input is the empty sequence, the fs:item-sequence-to-untypedAtomic-comment function returns a zero-length string. Otherwise, each atomic value in the input sequence is cast into a string. The individual strings resulting from the previous step are merged into a single string by concatenating them with a single space character between each pair. It is a dynamic error if the result of the content expression of a computed comment constructor contains two adjacent hyphens or ends with a hyphen.
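A minimal Python sketch of the comment case follows. As before, str() approximates the cast to a string, and the function name and the use of ValueError for the dynamic error are assumptions made for this sketch.

```python
def item_sequence_to_untyped_atomic_comment(items):
    """Build comment content from a sequence of atomic values."""
    text = " ".join(str(item) for item in items)  # approximate cast and join
    # Comment content may not contain "--" and may not end with "-".
    if "--" in text or text.endswith("-"):
        raise ValueError("invalid comment content")
    return text
```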

Static Type Analysis

There are no special static typing rules for this function.

7.1.10 The `fs:apply-ordering-mode` function

fs:apply-ordering-mode($items as item()*) as item()*

Dynamic Evaluation

If the statEnv.orderingMode is set to ordered, the fs:apply-ordering-mode function is the identity function, returning its input sequence in its original order.

statEnv.orderingMode = ordered dynEnv |- Expr => Value

dynEnv |- fs:apply-ordering-mode(Expr) => Value

If the statEnv.orderingMode is set to unordered, the fs:apply-ordering-mode is equivalent to the fn:unordered function, returning the items from its input sequence in arbitrary order.

statEnv.orderingMode = unordered dynEnv |- fn:unordered(Expr) => Value

dynEnv |- fs:apply-ordering-mode(Expr) => Value

Static Type Analysis

If the ordering context is set to ordered, the static type of the input expression of the fs:apply-ordering-mode function is left unchanged.

statEnv.orderingMode = ordered

statEnv |- (FS-URI,"apply-ordering-mode")(Type) : Type

If the ordering context is set to unordered, the static type of the input expression of the fs:apply-ordering-mode function is computed using the prime and quantifier judgments, as for the fn:unordered function.

statEnv.orderingMode = unordered

statEnv |- (FS-URI,"apply-ordering-mode")(Type) : prime(Type) · quantifier(Type)

7.1.11 The `fs:to` function

fs:to($firstval as xs:integer?, $lastval as xs:integer?) as xs:integer*

The formal semantics function fs:to is a wrapper function for the op:to operator, taking the semantics of the range expression over empty sequences into account.

Dynamic Evaluation

If one of the input parameters for fs:to is the empty sequence, the function returns the empty sequence; otherwise, it returns the result of calling the op:to operator. This semantics is equivalent to the following function declaration.

declare function fs:to($firstval as xs:integer?, $lastval as xs:integer?) as xs:integer* {
  if (fn:empty($firstval) or fn:empty($lastval))
  then ()
  else op:to($firstval,$lastval)
};
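For readers who prefer to experiment outside an XQuery processor, the same behaviour can be approximated in Python. In this sketch None stands for the empty sequence and a list for a sequence of integers; the function name fs_to is hypothetical.

```python
def fs_to(firstval, lastval):
    """Approximate fs:to: an empty operand yields the empty sequence."""
    if firstval is None or lastval is None:
        return []
    # Like op:to, the range is empty when firstval is greater than lastval.
    return list(range(firstval, lastval + 1))
```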

Static Type Analysis

The static type of fs:to does not require any additional static typing rule, and is typed as a function call based on the above signature.

7.2 Standard functions with specific typing rules

Introduction

This section gives special normalization and static typing rules for functions in [Functions and Operators] for which the standard normalization or typing rules are not appropriate. All functions that are not mentioned behave as described in Section [4.1.5 Function Calls]. When given, the static typing rules in this section always give more precise type information than the generic rule based on the function's signature.

7.2.1 The `fn:last` context function

As explained in [3.1.2 Dynamic Context], the fn:last() context function is modeled using the Formal Semantics variable $fs:last.

Normalization

[fn:last()]_Expr

$fs:last

7.2.2 The `fn:position` context function

As explained in [3.1.2 Dynamic Context], the fn:position() context function is modeled using the Formal Semantics variable $fs:position.

Normalization

[fn:position()]_Expr

$fs:position

7.2.3 The `fn:abs`, `fn:ceiling`, `fn:floor`, `fn:round`, and `fn:round-half-to-even` functions

Static Type Analysis

The typing rules for the fn:abs, fn:ceiling, fn:floor, fn:round, and fn:round-half-to-even functions promote their input type to the (least) base primitive numeric type from which the input type is derived. Parameters of type xdt:untypedAtomic are always promoted to xs:double. Instead of writing a separate judgment for each function, we write one rule with function variable F, which is one of the (FN-URI,"abs"), (FN-URI,"ceiling"), (FN-URI,"floor"), (FN-URI,"round"), or (FN-URI,"round-half-to-even") functions.

statEnv |- Type <: xdt:anyAtomicType ?

Type₂ = convert_untypedAtomic(Type, xs:double)

statEnv |- Type₂ can be promoted to Type₁

Type₁ in { xs:integer, xs:decimal, xs:float, xs:double }

statEnv |- F (Type) : Type₁ · quantifier(Type)
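The effect of this rule on a single item type can be sketched in Python. The derivation table below is a small hypothetical excerpt, the occurrence indicator computed by quantifier is ignored, and numeric promotion between primitive types is not modelled, so the sketch only shows how a derived type is mapped to the nearest of the four result types.

```python
TARGETS = ("xs:integer", "xs:decimal", "xs:float", "xs:double")

# Hypothetical excerpt of the type derivation hierarchy (child -> parent).
DERIVED_FROM = {
    "xs:int": "xs:long",
    "xs:long": "xs:integer",
    "xs:integer": "xs:decimal",
}


def result_item_type(arg_type):
    """Item type inferred for fn:abs and the related rounding functions."""
    if arg_type == "xdt:untypedAtomic":
        return "xs:double"          # untyped input is treated as xs:double
    t = arg_type
    while t is not None:
        if t in TARGETS:
            return t                # nearest ancestor among the result types
        t = DERIVED_FROM.get(t)
    raise TypeError(f"{arg_type} is not a numeric type")
```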

7.2.4 The `fn:boolean` function

Static Type Analysis

The fn:boolean function as described in the [Functions and Operators] document takes an empty sequence, a sequence of one or more nodes, or a singleton value of type xs:string, xdt:untypedAtomic or some numeric type. All other values are illegal.

statEnv |- fn:boolean(Type) : xs:boolean

7.2.5 The `fn:collection` and `fn:doc` functions

Introduction

The type inference rules for fn:collection and fn:doc depend on the syntactic form of their input expression. As a result, the corresponding type inference rules must be written directly over the input expression, unlike the other functions in this section.

Static Type Analysis

The fn:collection function, as described in the [Functions and Operators] document, takes a string-valued expression, which denotes a URI, and returns a value.

If the fn:collection function has no parameter, the result type is given by the implementation for the default sequence if it exists.

statEnv |- QName of func expands to (FN-URI,"collection")

statEnv |- Implementation-defined default sequence has type Type

statEnv |- QName() : Type

If the argument to fn:collection is a URILiteral expression which is defined in statEnv.collectionType, then the result type is the type corresponding to the URILiteral in statEnv.collectionType.

statEnv |- QName of func expands to (FN-URI,"collection")

statEnv.collectionType(URILiteral) = Type

statEnv |- QName(URILiteral) : Type

Otherwise, if the argument is not a URI literal, or is a URI literal that is not defined in statEnv.collectionType, then nothing is known about the URI, and the static type is a sequence of nodes:

statEnv |- QName of func expands to (FN-URI,"collection")

statEnv.collectionType(URILiteral) undefined

statEnv |- QName(URILiteral) : node*

statEnv |- QName of func expands to (FN-URI,"collection")

statEnv |- Expr is not a URILiteral

statEnv |- QName(Expr) : node*

The fn:doc function has similar static typing rules but, in addition, requires that the type associated with the URI be a subtype of document:

statEnv |- QName of func expands to (FN-URI,"doc")

statEnv |- statEnv.docType(URILiteral) = Type

statEnv |- Type <: document

statEnv |- QName(URILiteral) : Type

Otherwise, if the argument is not a URI literal or is not defined in the domain of statEnv.docType, then we don't know anything about the URI, and the static type is document:

statEnv |- QName of func expands to (FN-URI,"doc")

statEnv.docType(URILiteral) undefined

statEnv |- QName(URILiteral) : document

statEnv |- QName of func expands to (FN-URI,"doc")

statEnv |- not(Expr = URILiteral)

statEnv |- QName(Expr) : document

7.2.6 The `fn:data` function

Introduction

The fn:data function converts a sequence of items to a sequence of atomic values.

Notation

Inferring the type for the fn:data function is done by applying the data on auxiliary judgment, using the same approach as for the XPath steps.

statEnv |- data on Type₁ : Type₂

Static Type Analysis

The general rule for fn:data is to apply the filter data on to the prime type of its argument type, then apply the quantifier to the result:

statEnv |- data on prime(Type) : Type₁

statEnv |- (FN-URI,"data")(Type) : Type₁ · quantifier(Type)

When applied to none, data on yields none.

statEnv |- data on none : none

When applied to empty, data on yields empty.

statEnv |- data on empty : empty

When applied to the union of two types, data on is applied to each of the two types. The resulting type is computed using prime and quantifier, which are defined in [8.4 Judgments for FLWOR and other expressions on sequences]. This rule is necessary because data on may return a sequence of atomic types.

statEnv |- data on Type₁ : Type₁'

statEnv |- data on Type₂ : Type₂'

statEnv |- data on (Type₁|Type₂) : prime(Type₁'|Type₂') · quantifier(Type₁'|Type₂')

When applied to an atomic type, data on simply returns the atomic type:

statEnv |- Type <: xdt:anyAtomicType

statEnv |- data on Type : Type

When applied to comment or processing instruction node types, data on returns xs:string.

statEnv |- Type <: comment | processing-instruction

statEnv |- data on Type : xs:string

When applied to text and document node types, data on returns xdt:untypedAtomic.

statEnv |- Type <: text | document

statEnv |- data on Type : xdt:untypedAtomic

When applied to element node types with type annotation^XQ xdt:untyped, the data on filter returns xdt:untypedAtomic.

statEnv |- ElementType type lookup of type xdt:untyped

statEnv |- data on ElementType : xdt:untypedAtomic

When applied to an attribute node type, the data on filter returns the attribute's simple type.

statEnv |- AttributeType type lookup of type TypeName

statEnv |- (of type TypeName) expands to Type

statEnv |- data on AttributeType : Type

When applied to an element type whose type annotation^XQ denotes a simple type or a complex type of simple content, data on returns the element's simple type.

statEnv |- ElementType type lookup TypeReference

statEnv |- TypeReference expands to Type

statEnv |- Type <: (attribute *, Type₁) statEnv |- Type₁ <: xdt:anyAtomicType*

statEnv |- data on ElementType : Type₁

When applied to an element type whose type annotation^XQ denotes a complex type of mixed content, the data on filter returns xdt:untypedAtomic.

statEnv |- ElementType type lookup of type TypeName

statEnv |- TypeName of elem/type expands to expanded-QName statEnv.typeDefn(expanded-QName) = define type TypeName Derivation mixed { Type₁ }

statEnv |- data on ElementType : xdt:untypedAtomic

The data on filter is not defined on any element type whose type annotation^XQ denotes a complex type of complex content; applying data on to such a node type therefore raises a static error.

Example

Consider the following variable and its corresponding static type.

    $x : (element price { attribute currency { xs:string }, xs:decimal }
         | element price_code { xs:integer })

Applying the fn:data function on that variable results in the following type.

    fn:data($x) : (xs:decimal | xs:integer)

Because the input type is a choice, applying the data on filter results in a choice of simple types for the output of the fn:data function.
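The case analysis of the data on filter can be sketched in Python. The tuple encoding of types below is hypothetical, and the sketch makes one loud simplification: every simple type is treated as a single atomic type, so the prime and quantifier step in the union rule is omitted and the choice is rebuilt directly.

```python
def data_on(t):
    """Simplified 'data on' filter over tuple-encoded types (hypothetical encoding)."""
    kind = t[0]
    if kind in ("none", "empty", "atomic"):
        # none, empty, and atomic types are returned unchanged.
        return t
    if kind in ("comment", "processing-instruction"):
        return ("atomic", "xs:string")
    if kind in ("text", "document"):
        return ("atomic", "xdt:untypedAtomic")
    if kind == "attribute":
        # ("attribute", simple_type_name): yields the attribute's simple type.
        return ("atomic", t[1])
    if kind == "element":
        # ("element", content_kind, simple_type_name_or_None)
        content_kind, simple_type = t[1], t[2]
        if content_kind in ("untyped", "mixed"):
            return ("atomic", "xdt:untypedAtomic")
        if content_kind == "simple":
            return ("atomic", simple_type)
        # Complex content: the filter is undefined, modeled here as an error.
        raise TypeError("data on is not defined for complex content")
    if kind == "choice":
        # Apply the filter to each alternative of the union.
        return ("choice", data_on(t[1]), data_on(t[2]))
    raise ValueError(f"unknown type form: {kind}")


# The $x example: element price with simple content xs:decimal,
# or element price_code with simple content xs:integer.
x = ("choice",
     ("element", "simple", "xs:decimal"),
     ("element", "simple", "xs:integer"))
result = data_on(x)
```

With this encoding, result is the choice of the atomic types xs:decimal and xs:integer, matching the type shown for fn:data($x).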

7.2.7 The `fn:distinct-values` function

Static Type Analysis

The fn:distinct-values function expects a sequence of atomic values as input and returns a sequence of prime types, which are defined in [8.4 Judgments for FLWOR and other expressions on sequences].

statEnv |- Type <: xdt:anyAtomicType*

statEnv |- (FN-URI,"distinct-values")(Type) : prime(Type) · quantifier(Type)

7.2.8 The `fn:unordered` function

Static Type Analysis

The static semantics for fn:unordered is computed using prime and quantifier, which are defined in [8.4 Judgments for FLWOR and other expressions on sequences]. The type of the argument is determined, and then prime(.) and quantifier(.) are applied to that type.

statEnv |- (FN-URI,"unordered")(Type₁) : prime(Type₁) · quantifier(Type₁)

7.2.9 The `fn:error` function

Static Type Analysis

The fn:error function always has the none type.

statEnv |- (FN-URI,"error")() : none

statEnv |- Type <: xs:QName

statEnv |- (FN-URI,"error")(Type) : none

statEnv |- Type₁ <: xs:QName? statEnv |- Type₂ <: xs:string

statEnv |- (FN-URI,"error")(Type₁,Type₂) : none

statEnv |- Type₁ <: xs:QName? statEnv |- Type₂ <: xs:string

statEnv |- (FN-URI,"error")(Type₁,Type₂,Type₃) : none

7.2.10 The `fn:min`, `fn:max`, `fn:avg`, and `fn:sum` functions

Introduction

The dynamic evaluation rules for aggregate functions convert any item of type xdt:untypedAtomic in the input sequence to xs:double, then attempt to promote all values in the input sequence to values that are comparable. The static typing rules reflect the dynamic rules.

Normalization

The fn:sum function has two forms. The two-argument form takes the input sequence as its first argument and, as its second argument, the value that should be returned if the input sequence is empty. In the one-argument form, the value returned for an empty sequence is the xs:integer value 0, so the one-argument form is normalized to the two-argument form.

[fn:sum(Expr₁)]_Expr

[fn:sum(Expr₁,0)]_Expr

Notation

The type function convert_untypedAtomic takes a prime type and converts all occurrences of the type xdt:untypedAtomic to a target type. It is defined recursively as follows.

convert_untypedAtomic(`xdt:untypedAtomic`, Type)	=	Type
convert_untypedAtomic(FormalItemType, Type)	=	FormalItemType (FormalItemType is not `xdt:untypedAtomic`)
convert_untypedAtomic(`empty`, Type)	=	`empty`
convert_untypedAtomic(`none`, Type)	=	`none`
convert_untypedAtomic(Type₁ | Type₂, Type)	=	convert_untypedAtomic(Type₁, Type) | convert_untypedAtomic(Type₂, Type)
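The recursive definition above translates directly into code. In this Python sketch, a prime type is either a string naming an item type (or "empty" or "none") or a tuple ("|", left, right) for a union; this encoding is an assumption made for illustration.

```python
def convert_untyped_atomic(t, target):
    """Replace every occurrence of xdt:untypedAtomic in prime type t by target."""
    if isinstance(t, tuple):
        # Union case: convert both alternatives recursively.
        _, left, right = t
        return ("|",
                convert_untyped_atomic(left, target),
                convert_untyped_atomic(right, target))
    if t == "xdt:untypedAtomic":
        return target
    # empty, none, and every other item type are left unchanged.
    return t
```

For example, applying the function to xs:integer | xdt:untypedAtomic with target xs:double gives xs:integer | xs:double.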

Notation

The function aggregate_quantifier converts the input type quantifier zero-or-more or zero-or-one to the result type quantifier zero-or-one, and converts the input type quantifier one or one-or-more to the result type quantifier one.

aggregate_quantifier(`?`)	=	`?`
aggregate_quantifier(`*`)	=	`?`
aggregate_quantifier(`1`)	=	`1`
aggregate_quantifier(`+`)	=	`1`
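As a minimal sketch, the table can be written as a Python dictionary. The helper aggregate_result is hypothetical and only shows how the resulting quantifier is attached to a target type written as a string.

```python
# The aggregate_quantifier table from the text, keyed by input quantifier.
AGGREGATE_QUANTIFIER = {"?": "?", "*": "?", "1": "1", "+": "1"}


def aggregate_result(target_type, input_quantifier):
    """Attach aggregate_quantifier(input_quantifier) to the target type."""
    q = AGGREGATE_QUANTIFIER[input_quantifier]
    # The quantifier 1 is conventionally left implicit.
    return target_type if q == "1" else target_type + q
```

An aggregate over a possibly empty input (quantifier ? or *) therefore has an optional result, while an aggregate over a non-empty input yields exactly one value.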

Static Type Analysis

Now we can define the static typing rules for the aggregate functions. First, the input type is converted to a prime type. Second, the type function convert_untypedAtomic is applied to the prime type, yielding a new prime type, in which occurrences of xdt:untypedAtomic are converted to xs:double. Third, the judgment can be promoted to is applied to the new prime type and target type. The result type is combined with the aggregate quantifier of the input type.

For a given aggregate function, instead of writing a separate judgment for each target type, we write one rule using a target type Type₀.

For fn:min and fn:max, the target type Type₀ is either xs:string, xs:integer, xs:decimal, xs:float, xs:double, xs:date, xs:time, xs:dateTime, xdt:yearMonthDuration, or xdt:dayTimeDuration.

Type₁ = prime(Type)

Type₂ = convert_untypedAtomic(Type₁, xs:double)

ItemType₁, ...,ItemType_n = Type₂

Type₀ in { xs:string, xs:integer, xs:decimal, xs:float, xs:double, xs:date, xs:time, xs:dateTime, xdt:yearMonthDuration, xdt:dayTimeDuration }

statEnv |- ItemType_i can be promoted to Type₀ 1 <= i <= n

statEnv |- (FN-URI,"min")(Type) : Type₀ · aggregate_quantifier(quantifier(Type))

Type₁ = prime(Type)

Type₂ = convert_untypedAtomic(Type₁, xs:double)

ItemType₁, ...,ItemType_n = Type₂

Type₀ in { xs:string, xs:integer, xs:decimal, xs:float, xs:double, xs:date, xs:time, xs:dateTime, xdt:yearMonthDuration, xdt:dayTimeDuration }

statEnv |- ItemType_i can be promoted to Type₀ 1 <= i <= n

statEnv |- (FN-URI,"max")(Type) : Type₀ · aggregate_quantifier(quantifier(Type))

For fn:avg, the target type Type₀ is either xs:decimal, xs:float, xs:double, xdt:yearMonthDuration, or xdt:dayTimeDuration.

Type₁ = prime(Type)

Type₂ = convert_untypedAtomic(Type₁, xs:double)

ItemType₁, ...,ItemType_n = Type₂

Type₀ in { xs:decimal, xs:float, xs:double, xdt:yearMonthDuration, xdt:dayTimeDuration }

statEnv |- ItemType_i can be promoted to Type₀ 1 <= i <= n

statEnv |- (FN-URI,"avg")(Type) : Type₀ · aggregate_quantifier(quantifier(Type))

For fn:sum, the target type Type₀ is either xs:integer, xs:decimal, xs:float, xs:double, xdt:yearMonthDuration, or xdt:dayTimeDuration. The second argument in fn:sum is the value that should be returned if the input sequence is empty. The result type is the union of the target type and the type of the second argument. Note that the rule checks that the type for the zero value is consistent with the type of the input sequence.

statEnv |- Type₂ <: xdt:anyAtomicType ?

Type₃ = prime(Type₁)

Type₄ = convert_untypedAtomic(Type₃, xs:double)

ItemType₁, ...,ItemType_n = Type₄

Type₀ in { xs:integer, xs:decimal, xs:float, xs:double, xdt:yearMonthDuration, xdt:dayTimeDuration }

statEnv |- ItemType_i can be promoted to Type₀ 1 <= i <= n

statEnv |- Type₂ <: Type₀

statEnv |- (FN-URI,"sum")(Type₁,Type₂) : Type₀ · aggregate_quantifier(quantifier(Type₁))

7.2.11 The `fn:remove` function

Static Type Analysis

The static type for the fn:remove function is computed using prime and quantifier, which are defined in [8.4 Judgments for FLWOR and other expressions on sequences]. Since one item may be removed from the sequence, the resulting type is made optional.

statEnv |- Type₁ <: xs:integer

statEnv |- (FN-URI,"remove")(Type, Type₁) : prime(Type) · quantifier(Type) · ?
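Making a type optional amounts to multiplying its quantifier by ?. The Python sketch below assumes the quantifier product defined in [8.4 Judgments for FLWOR and other expressions on sequences] and encodes each quantifier as a pair of flags; this encoding is an illustrative choice and is not taken from the specification.

```python
# Each quantifier is encoded as (may be empty, may hold more than one item).
ENCODE = {
    "1": (False, False),
    "?": (True, False),
    "+": (False, True),
    "*": (True, True),
}
DECODE = {flags: q for q, flags in ENCODE.items()}


def quantifier_product(q1, q2):
    """Product of two quantifiers: the result may be empty (or may hold
    many items) if either operand may."""
    (empty1, many1), (empty2, many2) = ENCODE[q1], ENCODE[q2]
    return DECODE[(empty1 or empty2, many1 or many2)]


def remove_result_quantifier(q):
    """Quantifier of fn:remove's result for an input with quantifier q."""
    return quantifier_product(q, "?")
```

Under this reading, an input of exactly one item gives an optional result, and an input of one or more items gives zero or more items.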

7.2.12 The `fn:reverse` function

Static Type Analysis

The static type for the fn:reverse function is computed using prime and quantifier, which are defined in [8.4 Judgments for FLWOR and other expressions on sequences].

statEnv |- (FN-URI,"reverse")(Type) : prime(Type) · quantifier(Type)

7.2.13 The `fn:subsequence` function

Introduction

The fn:subsequence function has special typing rules when its second argument is the numeric literal value 1 or the built-in variable $fs:last. These rules provide better typing for path expressions such as Expr[1] and Expr[fn:last()].

The type inference rules for fn:subsequence depend on the syntactic form of its input expressions. As a result, the corresponding type inference rules must be written directly over the input expressions, unlike the other functions in this section.

Static Type Analysis

If the type of the input expression has exactly one or one-or-more items and the first item is selected, then the type inferred for fn:subsequence is the prime type of the input type.

statEnv |- QName of func expands to (FN-URI,"subsequence")

statEnv |- Expr : Type quantifier(Type) in { 1, + }

statEnv |- QName(Expr, 1, 1) : prime(Type)

If the type of the input expression has zero or more items and fn:subsequence is applied to a numeric literal, $fs:position, or $fs:last, then the static type is zero-or-one of the prime type of the input type. These static typing rules are intended to support more precise typing for the cases where fn:subsequence is the result of normalizing an XPath predicate of the form Expr[NumericLiteral] or Expr[fn:last()]; see [4.2.1 Steps].

statEnv |- QName of func expands to (FN-URI,"subsequence")

statEnv |- Expr : Type quantifier(Type) in { * }

statEnv |- QName(Expr, NumericLiteral, 1) : prime(Type) ?

The same rule applies when the last item in the input sequence is selected.

statEnv |- QName of func expands to (FN-URI,"subsequence")

statEnv |- Expr : Type quantifier(Type) in { * }

statEnv |- QName(Expr, $fs:last, 1) : prime(Type) ?

The same rule applies when an item is selected based on its position in the input sequence.

statEnv |- QName of func expands to (FN-URI,"subsequence")

statEnv |- Expr : Type quantifier(Type) in { * }

statEnv |- QName(Expr, $fs:position, 1) : prime(Type) ?

The last rule applies to all other applications of the fn:subsequence function.

statEnv |- QName of func expands to (FN-URI,"subsequence")

statEnv |- Expr : Type

statEnv |- Expr₁ : xs:double statEnv |- Expr₂ : xs:double

statEnv |- QName(Expr, Expr₁, Expr₂) : prime(Type) · quantifier(Type) · ?

7.2.14 The `op:union`, `op:intersect`, and `op:except` operators

Static Type Analysis

The static semantics for op:union is computed using prime and quantifier, which are defined in [8.4 Judgments for FLWOR and other expressions on sequences]. The type of each argument is determined, and then prime(.) and quantifier(.) are applied to the sequence type (Type₁, Type₂).

statEnv |- (OP-URI,"union")(Type₁, Type₂) : prime(Type₁ , Type₂) · quantifier(Type₁ , Type₂)

The static semantics of op:intersect is analogous to that for op:union. Because an intersection may be empty, the result type is optional.

statEnv |- (OP-URI,"intersect")(Type₁, Type₂) : prime(Type₁, Type₂) · quantifier(Type₁,Type₂) · ?

The static semantics of op:except follows. The type of the second argument is ignored as it does not contribute to the result type. As with op:intersect, the result of op:except may be the empty sequence.

statEnv |- (OP-URI,"except")(Type₁, Type₂) : prime(Type₁) · quantifier(Type₁) · ?

7.2.15 The `fn:insert-before` function

Static Type Analysis

The static type for the fn:insert-before function is computed using prime and quantifier, which are defined in [8.4 Judgments for FLWOR and other expressions on sequences].

statEnv |- Type₂ <: xs:integer

Type₄ = (Type₁,Type₃)

statEnv |- (FN-URI,"insert-before")(Type₁,Type₂,Type₃) : prime(Type₄) · quantifier(Type₄)

7.2.16 The `fn:zero-or-one`, `fn:one-or-more`, and `fn:exactly-one` functions

The functions fn:zero-or-one, fn:one-or-more, and fn:exactly-one check that the cardinality of a sequence is in the expected range. They are useful to override the static type inferred for a given query. For example, in the following query, the user may know that all ISBN numbers are unique and therefore that the function always returns at most one book element. However, the static typing feature cannot infer a precise enough type and will raise a static type error at compile time.

  declare function local:book_with_isbn($isbn as xs:string) as schema-element(book)? {
    //book[@isbn=$isbn]
  };

In that query, the fn:zero-or-one function can be used to tell the type system that the cardinality is known to be zero or one.

  declare function local:book_with_isbn($isbn as xs:string) as schema-element(book)? {
    fn:zero-or-one(//book[@isbn=$isbn])
  };

Static Type Analysis

The static typing rules for those functions always infer a type with the cardinality indicated by that function.

statEnv |- (FN-URI,"zero-or-one")(Type) : prime(Type)?

statEnv |- (FN-URI,"one-or-more")(Type) : prime(Type)+

statEnv |- (FN-URI,"exactly-one")(Type) : prime(Type)

8 Auxiliary Judgments

This section defines auxiliary judgments used in defining the formal semantics. Many auxiliary judgments are used in both static and dynamic inference rules. Those auxiliary judgments that are used in only the static or dynamic semantics are labeled as such.

8.1 Judgments for accessing types

Introduction

This section defines several auxiliary judgments to access components of the [XPath/XQuery] type system. The first two judgments (derives from and substitutes for) are used to access the type and element name hierarchies in an XML Schema. The other judgments (name lookup, type lookup, extended by, adjusts to and expands to) are used to look up the meaning of element or attribute types from the schema. These judgments are used in many expressions, notably in the specification of type matching (See [8.3 Judgments for type matching]), validation (See [E.1 Judgments for the validate expression]), and the static semantics of step expressions (See [8.2 Judgments for step expressions and filtering]).

8.1.1 Derives from

Notation

The judgment

statEnv |- TypeName₁ derives from TypeName₂

holds when TypeName₁ derives from TypeName₂. This judgment formalizes the definition of the derives-from function in Section 2.5.4 SequenceType Matching^XQ.

Example

For example, assuming the extended XML Schema given in section [2.4.5 Example of a complete Schema], then the following judgments hold.

  USAddress            derives from  xs:anyType
  NYCAddress           derives from  USAddress
  NYCAddress           derives from  xs:anyType
  xsd:positiveInteger  derives from  xsd:integer
  xsd:integer          derives from  xs:anySimpleType
  fs:anon3             derives from  xsd:positiveInteger
  fs:anon3             derives from  xsd:integer
  fs:anon3             derives from  xs:anySimpleType
  fs:anon3             derives from  xs:anyType

Note

Derivation is a partial order. It is reflexive and transitive by the definition below.

Semantics

This judgment is specified by the following rules.

Some rules have hypotheses that simply list a type, element, or attribute declaration.

Every type name derives from itself.

statEnv |- TypeName derives from TypeName

Every type name derives from the type it is declared to derive from by restriction or extension.

statEnv |- TypeName of elem/type expands to expanded-QName

statEnv.typeDefn(expanded-QName) = define type TypeName extends BaseTypeName OptMixed { Type }

statEnv |- TypeName derives from BaseTypeName

statEnv |- TypeName of elem/type expands to expanded-QName

statEnv.typeDefn(expanded-QName) = define type TypeName restricts BaseTypeName OptMixed { Type }

statEnv |- TypeName derives from BaseTypeName

The above rules all require that the type names be defined in the static context, but [XPath/XQuery] permits references to "unknown" type names, i.e., type names that are not defined in the static context. An unknown type name might be encountered if a module in which the given type name occurs does not import the schema in which the given type name is defined. In this case, an implementation is allowed (but is not required) to provide an implementation-dependent mechanism for determining whether the unknown type name is the same as or derived by restriction from the expected type name. The following rule formalizes this implementation-dependent mechanism.

"The implementation is able to determine that TypeName₁ is derived by restriction from TypeName₂."

statEnv |- TypeName₁ derives from TypeName₂

The derivation relation is transitive.

statEnv |- TypeName₁ derives from TypeName₂ statEnv |- TypeName₂ derives from TypeName₃

statEnv |- TypeName₁ derives from TypeName₃
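Taken together, the reflexivity, declaration, and transitivity rules describe a walk up the chain of base types. The Python sketch below illustrates this under stated assumptions: BASE_TYPE is a hypothetical table of direct base types (it is not the schema of [2.4.5 Example of a complete Schema]), extension and restriction are not distinguished, and the implementation-dependent rule for unknown type names is not modeled.

```python
# Hypothetical table: type name -> the type it is declared to derive from.
BASE_TYPE = {
    "NYCAddress": "USAddress",
    "USAddress": "Address",
    "Address": "xs:anyType",
    "xs:positiveInteger": "xs:nonNegativeInteger",
    "xs:nonNegativeInteger": "xs:integer",
    "xs:integer": "xs:decimal",
    "xs:decimal": "xs:anySimpleType",
    "xs:anySimpleType": "xs:anyType",
}


def derives_from(name1, name2, base=BASE_TYPE):
    """True when name1 derives from name2 by zero or more declared steps."""
    seen = set()
    current = name1
    # Zero steps covers reflexivity; following the chain covers transitivity.
    while current is not None and current not in seen:
        if current == name2:
            return True
        seen.add(current)  # guard against a malformed, cyclic table
        current = base.get(current)
    return False
```

With this table, derives_from("NYCAddress", "xs:anyType") holds through three declared steps, while derives_from("USAddress", "NYCAddress") does not hold.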

8.1.2 Substitutes for

The substitutes judgment is used to know whether an element name is in the substitution group of another element name.

Notation

The judgment

statEnv |- ElementName₁ substitutes for ElementName₂

holds when ElementName₁ substitutes for ElementName₂.

Example

For example, assuming the extended XML Schema given in section [2.4.5 Example of a complete Schema], then the following judgments hold.

  usaddress   substitutes for  address
  nycaddress  substitutes for  usaddress
  nycaddress  substitutes for  address

Note

Substitution is a partial order. It is reflexive and transitive by the definition below. It is antisymmetric because no cycles are allowed in substitution groups.

Semantics

The substitutes judgment for element names is specified by the following rules.

Every element name substitutes for itself.

statEnv |- ElementName substitutes for ElementName

Every element name substitutes for the element it is declared to substitute for.

statEnv |- ElementName of elem/type expands to expanded-QName

statEnv.elemDecl(expanded-QName) = define element ElementName substitutes for BaseElementName OptNillable TypeReference

statEnv |- ElementName substitutes for BaseElementName

Substitution is transitive.

statEnv |- ElementName₁ substitutes for ElementName₂ statEnv |- ElementName₂ substitutes for ElementName₃

statEnv |- ElementName₁ substitutes for ElementName₃

8.1.3 Element and attribute name lookup (Dynamic)

The name lookup judgment is used in the definition of the matches judgment, which takes a value and a type and determines whether the value matches, or is an instance of, the given type. Both name lookup and matches are used in the dynamic semantics.

The name lookup judgment takes an element (or attribute) name, derived from a node value, and an element (or attribute) type. If the name matches the corresponding name in the type, the judgment yields the type's type reference and, for elements, its nillable property.

Notation

The judgment

statEnv |- ElementName name lookup ElementType yields OptNillable TypeReference

holds when the given element name matches the given element type and requires that the element be nillable as indicated and have the given type reference.

Example

For example, assuming the extended XML Schema given in section [2.4.5 Example of a complete Schema], then the following judgments hold.

  comment    name lookup element comment                          yields of type xsd:string
  size       name lookup element size nillable of type xs:integer yields nillable of type xs:integer
  apt        name lookup element apt                              yields of type fs:anon3
  nycaddress name lookup element address                          yields of type NYCAddress

Note that when the element name is in a substitution group, the name lookup returns the type name corresponding to the original element name (here the type NYCAddress for the element nycaddress, instead of Address for the element address).

Semantics

This judgment is specified by the following rules.

If the element type is a reference to a global element, then name lookup yields the type reference in the element declaration for the given element name. The given element name must be in the substitution group of the global element.

statEnv |- ElementName₁ substitutes for ElementName₂

statEnv |- ElementName₁ of elem/type expands to expanded-QName₁

statEnv.elemDecl(expanded-QName₁) = define element ElementName₁ OptSubstitution OptNillable TypeReference

statEnv |- ElementName₁ name lookup element ElementName₂ yields OptNillable TypeReference

If the given element name matches the element name in the element type, and the element type contains a type reference, then name lookup yields that type reference.

statEnv |- ElementName name lookup element ElementName OptNillable TypeReference yields OptNillable TypeReference

If the element type has no element name but contains a type reference, then name lookup yields the type reference.

statEnv |- ElementName name lookup element TypeReference yields TypeReference

If the element type has no element name and no type reference, then name lookup yields xs:anyType.

statEnv |- ElementName name lookup element yields of type xs:anyType
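The four element rules can be sketched as a single Python function. Everything in the encoding is an assumption made for illustration: an element type is a triple (element name or None, nillable flag, type name or None), ELEM_DECL is a hypothetical table of global element declarations, and the function returns None when the judgment does not hold.

```python
# Hypothetical global element declarations:
# name -> (substitution group head or None, nillable, type name).
ELEM_DECL = {
    "address": (None, False, "Address"),
    "usaddress": ("address", False, "USAddress"),
    "nycaddress": ("usaddress", False, "NYCAddress"),
}


def substitutes_for(name1, name2, decls=ELEM_DECL):
    """Reflexive, transitive walk up the substitution group heads."""
    name = name1
    while name is not None:
        if name == name2:
            return True
        name = decls[name][0] if name in decls else None
    return False


def element_name_lookup(name, elem_type, decls=ELEM_DECL):
    """Return (nillable, type name), or None when the judgment does not hold."""
    et_name, et_nillable, et_type = elem_type
    if et_name is not None and et_type is None:
        # Reference to a global element: the name must be in its substitution
        # group, and the declaration of the given name supplies the result.
        if name in decls and substitutes_for(name, et_name, decls):
            _, nillable, type_name = decls[name]
            return (nillable, type_name)
        return None
    if et_name is not None:
        # Local element type with a name and a type reference.
        return (et_nillable, et_type) if name == et_name else None
    if et_type is not None:
        # No element name, but a type reference.
        return (False, et_type)
    # No element name and no type reference.
    return (False, "xs:anyType")
```

For instance, looking up nycaddress against the type element address yields the type NYCAddress, because the declaration of nycaddress, not that of address, supplies the type reference.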

Notation

The judgment

statEnv |- AttributeName name lookup AttributeType yields TypeReference

holds when matching an attribute with the given attribute name against the given attribute type yields the given type reference.

Example

For example, assuming the extended XML Schema given in section [2.4.5 Example of a complete Schema], then the following judgments hold.

  orderDate  name lookup  attribute orderDate of type xsd:date  yields  of type xsd:date
  orderDate  name lookup  attribute of type xsd:date            yields  of type xsd:date

Semantics

This judgment is specified by the following rules.

If the attribute type is a reference to a global attribute, then name lookup yields the type reference in the attribute declaration for the given attribute name.

statEnv.attrDecl(AttributeName) = define attribute AttributeName TypeReference

statEnv |- AttributeName name lookup attribute AttributeName yields TypeReference

If the given attribute name matches the attribute name in the attribute type, and the attribute type contains a type reference, then name lookup yields that type reference.

statEnv |- AttributeName name lookup attribute AttributeName TypeReference yields TypeReference

If the attribute type has no attribute name but contains a type reference, then name lookup yields the type reference.

statEnv |- AttributeName name lookup attribute TypeReference yields TypeReference

If the attribute type has no attribute name and no type reference, then name lookup yields xs:anySimpleType.

statEnv |- AttributeName name lookup attribute yields of type xs:anySimpleType

8.1.4 Element and attribute type lookup (Static)

The type lookup judgments are used to obtain the appropriate type reference for an attribute or element.

Notation

The judgment

statEnv |- ElementType type lookup OptNillable TypeReference

holds when the element type is optionally nillable and has the given type reference.

Semantics

The element type lookup judgments are specified by the following rules.

A reference to a global element yields the type reference in the global element declaration with the given element name.

statEnv |- ElementName of elem/type expands to expanded-QName

statEnv.elemDecl(expanded-QName) = define element ElementName OptSubstitution OptNillable TypeReference

statEnv |- element ElementName type lookup OptNillable TypeReference

In the case of a local element type, type lookup yields the corresponding type reference.

statEnv |- element ElementName OptNillable TypeReference type lookup OptNillable TypeReference

If the element type has no element name but contains a type reference, then type lookup yields that type reference.

statEnv |- element OptNillable TypeReference type lookup OptNillable TypeReference

If the element type has no element name and no type reference, then type lookup yields xs:anyType.

statEnv |- element type lookup of type xs:anyType

Notation

The judgment

statEnv |- AttributeType type lookup TypeReference

holds when the attribute type has the given type reference.

Semantics

This judgment is specified by the following rules.

A reference to a global attribute yields the type reference in the global attribute declaration with the given attribute name.

statEnv.attrDecl(AttributeName) = define attribute AttributeName TypeReference

statEnv |- attribute AttributeName type lookup TypeReference

If the attribute name is not defined, i.e., it is not declared in the in-scope schema definitions, then the attribute's default type is xdt:untypedAtomic.

statEnv.attrDecl(AttributeName) undefined

statEnv |- attribute AttributeName type lookup of type xdt:untypedAtomic

In the case of a local attribute type, type lookup yields the corresponding type reference.

statEnv |- attribute AttributeName TypeReference type lookup TypeReference

If the attribute type has no attribute name but contains a type reference, then type lookup yields the type reference.

statEnv |- attribute TypeReference type lookup TypeReference

If the attribute type has no attribute name and no type reference, then type lookup yields xs:anySimpleType.

statEnv |- attribute type lookup of type xs:anySimpleType
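The five attribute rules reduce to a short case analysis. In this Python sketch, an attribute type is given by an optional attribute name and an optional type name, and attr_decl is a hypothetical dictionary standing in for statEnv.attrDecl.

```python
def attribute_type_lookup(attr_name, type_name, attr_decl):
    """Return the type name that type lookup yields for an attribute type.

    attr_name and type_name may each be None when absent from the type.
    """
    if type_name is not None:
        # Local attribute type, or a nameless type with a type reference.
        return type_name
    if attr_name is None:
        # No attribute name and no type reference.
        return "xs:anySimpleType"
    # Reference to a global attribute; undeclared names default to untypedAtomic.
    return attr_decl.get(attr_name, "xdt:untypedAtomic")


# Hypothetical in-scope attribute declarations.
decls = {"orderDate": "xs:date"}
```

Given these declarations, the type attribute orderDate looks up to xs:date, while a reference to an undeclared attribute name looks up to xdt:untypedAtomic.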

8.1.5 Extension

Notation

The judgment

statEnv |- Type₁ extended by Type₂ is Type

holds when the result of extending Type₁ by Type₂ is Type. This judgment is used in the definition of type expansion [8.1.9 Type expansion], which expands a type to include the union of all types derived from the given type.

Semantics

This judgment is specified by the following rules.

statEnv |- Type₁ = AttributeAll₁ , ElementContentType₁ statEnv |- Type₂ = AttributeAll₂ , ElementContentType₂

statEnv |- Type₁ extended by Type₂ is (AttributeAll₁ & AttributeAll₂) , ElementContentType₁ , ElementContentType₂

8.1.6 Mixed content

Notation

The judgment

statEnv |- Type₁ mixes to Type₂

holds when the result of creating a mixed content from Type₁ is Type₂.

Semantics

This judgment is specified by the following rule, which interleaves the element content with a sequence of text nodes and adds a union of xdt:anyAtomicType values. The xdt:anyAtomicType sequence is required because it is possible to derive an element containing only atomic values from an element that is mixed.

statEnv |- Type = AttributeAll , ElementContentType

statEnv |- Type mixes to AttributeAll , ( ElementContentType & text* | xdt:anyAtomicType *)

8.1.7 Type adjustment

In the [XPath/XQuery] type system, a complex-type declaration does not include the implicit attributes and nodes that may be included in the type. Type adjustment takes a complex type and adjusts it to include implicit attributes and nodes. In particular, type adjustment:

adds the four (optional) built-in attributes xsi:type, xsi:nil, xsi:schemaLocation, and xsi:noNamespaceSchemaLocation,
interleaves the type with a sequence of comments and processing-instructions, and
if the complex type is mixed, interleaves the type with a sequence of text nodes and xdt:anyAtomicType.

Notation

The judgment

statEnv |- OptMixed Type₁ adjusts to Type₂

holds when the second type is the same as the first after the first has been adjusted as described above.

Semantics

This judgment is specified by the following rules.

If the type is flagged as mixed, then mix the type and extend it by the built-in attributes.

statEnv |- Type₁ mixes to Type₂

statEnv |- Type₂ extended by BuiltInAttributes is Type₃

statEnv |- Type₄ = Type₃ & processing-instruction* & comment*

statEnv |- mixed Type₁ adjusts to Type₄

Otherwise, just extend the type by the built-in attributes.

statEnv |- Type₁ extended by BuiltInAttributes is Type₂

statEnv |- Type₃ = Type₂ & processing-instruction* & comment*

statEnv |- Type₁ adjusts to Type₃
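The way the mixes to, extended by, and adjusts to judgments compose can be sketched by building type expressions as strings. This Python sketch is purely illustrative: a type is split into an attribute part and an element content part, both plain strings, and the built-in attributes are assumed to contribute the element content empty.

```python
# All group of the four built-in attributes, written as one string.
BUILT_IN_ATTRIBUTES = (
    "attribute xsi:type? & attribute xsi:nil? & "
    "attribute xsi:schemaLocation? & attribute xsi:noNamespaceSchemaLocation?"
)


def mixes_to(attrs, content):
    """Interleave the content with text nodes, or allow atomic values."""
    return attrs, f"({content} & text* | xdt:anyAtomicType*)"


def extended_by(attrs1, content1, attrs2, content2):
    """Interleave the attribute parts and concatenate the content parts."""
    return f"({attrs1} & {attrs2})", f"{content1} , {content2}"


def adjusts_to(attrs, content, mixed=False):
    """Apply mixing (if flagged), the built-in attributes, then PIs and comments."""
    if mixed:
        attrs, content = mixes_to(attrs, content)
    attrs, content = extended_by(attrs, content, BUILT_IN_ATTRIBUTES, "empty")
    return f"({attrs} , {content}) & processing-instruction* & comment*"
```

Calling adjusts_to("attribute id", "element a", mixed=True) produces a type whose content part mentions text* and xdt:anyAtomicType*, while the non-mixed call omits both.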

8.1.8 Built-in attributes

Schema defines four built-in attributes that can appear on any element in the document without being explicitly declared in the schema. Those four attributes need to be added inside content models when doing matching. The four built-in attributes of Schema are declared as follows.

  define attribute xsi:type of type xs:QName
  define attribute xsi:nil of type xs:boolean
  define attribute xsi:schemaLocation of type fs:anon1
  define type fs:anon1 { xs:anyURI* }
  define attribute xsi:noNamespaceSchemaLocation of type xs:anyURI

For convenience, a type that is an all group of the four built-in XML Schema attributes is defined.

  BuiltInAttributes =
      attribute xsi:type ?
    & attribute xsi:nil ?
    & attribute xsi:schemaLocation ?
    & attribute xsi:noNamespaceSchemaLocation ?

8.1.9 Type expansion

The expands to judgment is one of the most important static judgments. It is used in the static semantics of the child axis [8.2.2.1 Static semantics of axes], which is used in the definition of many other rules that extract element types from an arbitrary content type.

The judgment takes a type name and computes the union of all types derived from the given type. If the type is nillable, it also makes sure the content model allows the empty sequence. If the type is mixed, it also adjusts the type to include the mixed content model. The judgment relies on the "extended with union interpretation of" judgment ([8.1.10 Union interpretation of derived types]) to recursively compute all derived types.

Notation

The judgment

statEnv |- OptNillable TypeReference expands to Type

holds when expanding the (nillable) type reference results in the given type.

Semantics

This judgment is specified by the following rules.

If the type is nillable, then its expansion is optional.

statEnv |- TypeReference expands to Type

statEnv |- nillable TypeReference expands to Type?

The type definition for the type reference is contained in its expansion.

statEnv |- TypeName of elem/type expands to expanded-QName

statEnv.typeDefn(expanded-QName) = define type TypeName extends BaseTypeName OptMixed { Type₁ }

statEnv |- Type₂ is Type₁ extended with union interpretation of TypeName

statEnv |- OptMixed Type₂ adjusts to Type₃

statEnv |- of type TypeName expands to Type₃

In case the type is xdt:untyped, the type does not need to be adjusted as is required for other XML Schema types. See the corresponding definition in [3.5.1 Predefined Schema Types].

statEnv.typeDefn(xdt:untyped) = define type xdt:untyped extends xs:anyType { Type₁ }

statEnv |- of type xdt:untyped expands to Type₁

8.1.10 Union interpretation of derived types

Notation

The judgment

statEnv |- Type₂ is Type₁ extended with union interpretation of TypeName

holds when the type Type₂ is the expansion of the type name TypeName with definition Type₁ to include all types derived by extension and restriction from the given type name. This rule is recursive, because each type name itself may have other type names that are derived from it. The recursive rules traverse the entire derivation tree, identifying every type name derived from the original type name.

Semantics

This judgment is specified by the following rules.

statEnv.typeDefn(expanded-QName_R,1) = define type TypeName_R,1 restricts TypeName₀ OptMixed_R,1 { Type_R,1 }

· · ·

statEnv.typeDefn(expanded-QName_R,n) = define type TypeName_R,n restricts TypeName₀ OptMixed_R,n { Type_R,n }

statEnv |- Type_R,1' is Type_R,1 extended with union interpretation of TypeName_R,1

· · ·

statEnv |- Type_R,n' is Type_R,n extended with union interpretation of TypeName_R,n

statEnv.typeDefn(expanded-QName_E,1) = define type TypeName_E,1 extends TypeName₀ OptMixed_E,1 { Type_E,1 }

· · ·

statEnv.typeDefn(expanded-QName_E,m) = define type TypeName_E,m extends TypeName₀ OptMixed_E,m { Type_E,m }

statEnv |- Type_E,1' is Type_E,1 extended with union interpretation of TypeName_E,1

· · ·

statEnv |- Type_E,m' is Type_E,m extended with union interpretation of TypeName_E,m

statEnv |- Type₁ is Type₀ extended with union interpretation of TypeName₀

Examples

Note that this expansion does not enforce the Unique Particle Attribution property specified by XML Schema in the resulting content models. Implementations may want to implement an equivalent alternative expansion that enforces that property. For example, expanding type T1 below yields the following type, which is not one-deterministic:

define type T1 { element a }
define type T2 extends T1 { element b }

(element a | element a, element b) is (element a) extended with union interpretation of T1

An implementation might want to infer the equivalent content model that satisfies the Unique Particle Attribution property of XML Schema:

(element a, (() | element b)) is (element a) extended with union interpretation of T1
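
The recursive traversal of the derivation tree can be sketched informally in Python. This is a simplified illustration, not a definitive implementation. It assumes that content models are plain strings, that a type derived by restriction contributes its own content model, and that a type derived by extension contributes the base content followed by its own content, as the T1/T2 example above suggests. The function name and the layout of type_defs are hypothetical.

```python
def union_interpretation(type_name, content, type_defs):
    """Collect the content models of type_name and of all types derived from it.

    type_defs maps a type name to (derivation, base name, content model),
    where derivation is "extends", "restricts", or None for a root type.
    """
    alternatives = [content]
    for name, (derivation, base, body) in type_defs.items():
        if base != type_name:
            continue
        # Assumption: an extension appends to the base content, while a
        # restriction supplies a complete content model of its own.
        derived = body if derivation == "restricts" else f"{content}, {body}"
        # Recurse, since the derived type may itself have derived types.
        alternatives.extend(union_interpretation(name, derived, type_defs))
    return alternatives


type_defs = {
    "T1": (None, None, "element a"),
    "T2": ("extends", "T1", "element b"),
}
expanded = " | ".join(union_interpretation("T1", "element a", type_defs))
```

With these definitions, expanded is the string "element a | element a, element b", which matches the first expansion shown in the example.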

8.2 Judgments for step expressions and filtering

Introduction

Step expressions are one of the elementary operations in [XPath/XQuery]. Steps select nodes reachable from the root of an XML tree. Defining the semantics of step expressions requires a detailed analysis of all the possible cases of axis and node tests.

This section introduces auxiliary judgments used to define the semantics of step expressions. The principal judgment ([8.2.1 Principal Node Kind]) captures the notion of principal node kind in XPath. The Axis judgments ([8.2.2 Auxiliary judgments for axes]) define the static and dynamic semantics of all axes, and the Node Test judgments ([8.2.3 Auxiliary judgments for node tests]) define the static and dynamic semantics of all node tests. The filter judgment accesses the value of an attribute and is used in the definition of validation ([E Auxiliary Judgments for Validation]).

8.2.1 Principal Node Kind

Notation

The following auxiliary grammar production describes principal node types (see [XML Path Language (XPath) 2.0]).

PrincipalNodeKind

[72 (Formal)] PrincipalNodeKind ::= "element" | "attribute" | "namespace"

Notation

The judgment

Axis principal PrincipalNodeKind

holds when PrincipalNodeKind is the principal node kind for Axis.

Example

For example, the following judgments hold.

  child::       principal  element
  descendant::  principal  element
  preceding::   principal  element
  attribute::   principal  attribute
  namespace::   principal  namespace

Semantics

This judgment is specified by the following rules.

The principal node type for the attribute axis is attribute.

attribute:: principal attribute

The principal node type for the namespace axis is namespace.

namespace:: principal namespace

The principal node type for all other axes is element.

Axis != attribute::

Axis != namespace::

Axis principal element
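
The three rules amount to a simple case analysis. A minimal Python sketch follows, assuming that axes are written as strings with the trailing "::" as in the examples; the function name is hypothetical.

```python
def principal_node_kind(axis):
    """Return the principal node kind for an axis such as "child::"."""
    if axis == "attribute::":
        return "attribute"
    if axis == "namespace::":
        return "namespace"
    # Every other axis has element as its principal node kind.
    return "element"
```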

8.2.2 Auxiliary judgments for axes

8.2.2.1 Static semantics of axes

Notation

The following judgment

statEnv |- axis Axis of Type₁ : Type₂

holds when applying the axis Axis on type Type₁ yields the type Type₂.

The following two judgments are used in the definition of axis. The judgment

statEnv |- Type₁ has-node-content Type₂

only applies to a type that is a valid element content type and holds when Type₁ has the content type Type₂. The judgment separates the attribute types from the other node or atomic-valued types of the element content type and yields the non-attribute types.

The judgment

statEnv |- Type₁ has-attribute-content Type₂

only applies to a type that is a valid element content type and holds when Type₁ has attribute types Type₂. The judgment yields the attribute types of the element content type.

Example

For example, the following judgments hold.

  axis child::      of  element of type xs:string   :  text
  axis child::      of  element items of type Items :  element item of type fs:anon1*

  axis child::      of  element purchaseOrder       : 
    element shipTo of type USAddress,
    element billTo of type USAddress,
    element ipo:comment?,
    element items of type Items

  axis attribute::  of  element of type xs:string   :  empty

    attribute partNum of type SKU,
    element item of type fs:anon1*
  has-node-content
    element item of type fs:anon1*

    attribute partNum of type SKU,
    element item of type fs:anon1*
  has-attribute-content
    attribute partNum of type SKU

    (attribute partNum of type SKU,
     element item of type fs:anon1*) |
    (attribute orderDate of type xs:date?,
     element shipTo of type USAddress,
     element billTo of type USAddress,
     element comment?,
     element items of type Items)
  has-node-content
    (element item of type fs:anon1*) |
    (element shipTo of type USAddress,
     element billTo of type USAddress,
     element comment?,
     element items of type Items)

    (attribute partNum of type SKU,
     element item of type fs:anon1*) |
    (attribute orderDate of type xs:date?,
     element shipTo of type USAddress,
     element billTo of type USAddress,
     element comment?,
     element items of type Items)
  has-attribute-content
    (attribute partNum of type SKU) |
    (attribute orderDate of type xs:date?)

8.2.2.1.1 Inference rules for all axes

Semantics

This judgment is specified by the following rules.

The following rules compute the type of the axis expression when applied to each item type in the content model.

statEnv |- axis Axis of Type₁ : Type₂

statEnv |- axis Axis of Type₁ Occurrence : Type₂ Occurrence

statEnv |- axis Axis of Type₁ : Type₃

statEnv |- axis Axis of Type₂ : Type₄

statEnv |- axis Axis of Type₁&Type₂ : Type₃&Type₄

statEnv |- axis Axis of Type₁ : Type₃

statEnv |- axis Axis of Type₂ : Type₄

statEnv |- axis Axis of Type₁,Type₂ : Type₃,Type₄

statEnv |- axis Axis of Type₁ : Type₃

statEnv |- axis Axis of Type₂ : Type₄

statEnv |- axis Axis of Type₁|Type₂ : Type₃|Type₄

statEnv |- axis Axis of none : none

statEnv |- axis Axis of empty : empty
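
These structural rules push the axis through occurrence indicators, interleaving, sequence, and choice until an item type is reached. As an informal sketch, assuming a hypothetical encoding of types as nested Python tuples, the recursion could look as follows. The parameter axis_on_item stands for the per-axis rules on item types and is an assumption of the sketch.

```python
def axis_of(axis_on_item, ty):
    """Apply an axis to a type by recursing over its structure.

    Types are tuples: ("none",), ("empty",), ("item", name),
    ("occ", type, indicator), and ("seq" | "all" | "choice", left, right).
    """
    tag = ty[0]
    if tag in ("none", "empty"):
        return ty
    if tag == "occ":
        # The occurrence indicator is preserved around the result.
        return ("occ", axis_of(axis_on_item, ty[1]), ty[2])
    if tag in ("seq", "all", "choice"):
        # The same constructor is rebuilt from the two component results.
        return (tag, axis_of(axis_on_item, ty[1]), axis_of(axis_on_item, ty[2]))
    # An item type: delegate to the rule for the specific axis.
    return axis_on_item(ty)
```

Passing the identity function as axis_on_item models the self axis, which leaves every node type unchanged.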

The following rules specify how to compute the type of each axis applied to an item type.

8.2.2.1.2 Inference rules for the `self` axis

Semantics

Applying the self axis to a node type results in the same node type.

statEnv |- axis self:: of NodeType : NodeType

8.2.2.1.3 Inference rules for the `child` axis

Semantics

In the case of an element type, the static type of the child axis is obtained by type lookup and expansion of the resulting type. Note that the expands to judgment yields the type that corresponds to a given type name. Because the meaning of a type name includes the definitions of all type names derived by extension and restriction from the given type name, expands to yields the union of all the type definitions of all type names derived from the input type name. Each type in the union contains the complete definition of the type name, i.e., it includes built-in attributes and, if necessary, processing-instruction, comment, and text types.

After type expansion, the judgment has-node-content is applied to each type in the union. The resulting type is the union of all non-attribute types in the expanded type.

statEnv |- ElementType type lookup OptNillable TypeReference

statEnv |- OptNillable TypeReference expands to Type₁ | · · · | Type_n

statEnv |- Type₁ has-node-content Type₁'

· · ·

statEnv |- Type_n has-node-content Type_n'

statEnv |- axis child:: of ElementType : Type₁' | ... | Type_n'

If the type is a sequence of attributes, then the content type is empty.

statEnv |- Type <: attribute*

statEnv |- Type has-node-content empty

If the type consists of attributes followed by a simple type, the content type is zero-or-one text. The resulting type is optional since an expression returning the empty sequence results in no text node being constructed.

Type = Type₁, Type₂

statEnv |- Type₁ <: attribute*

statEnv |- Type₂ <: xdt:anyAtomicType*

statEnv |- Type has-node-content text?

In the case of an element type with complex content type, the content type is simply the non-attribute part of the complex content type.

Type = Type₁, Type₂

statEnv |- Type₁ <: attribute*

statEnv |- Type₂ <: ElementContentType*

statEnv |- Type has-node-content Type₂

In the case of an attribute type, the static type of the child axis is empty.

statEnv |- axis child:: of AttributeType : empty

In the case of a text node type, the static type of the child axis is empty.

statEnv |- axis child:: of text : empty

In the case of a comment node type, the static type of the child axis is empty.

statEnv |- axis child:: of comment : empty

In the case of a processing-instruction node type, the static type of the child axis is empty.

statEnv |- axis child:: of processing-instruction : empty

In case of a document node type, the static type of the child axis is the type of the document node content, interleaved with a sequence of comments and processing-instructions.

statEnv |- axis child:: of document { Type } : Type & processing-instruction* & comment*

8.2.2.1.4 Inference rules for the `attribute` axis

Semantics

The static type for the attribute axis is computed in a way similar to the static type for the child axis. As above, the expands to judgment may yield a union type. After type expansion, the judgment has-attribute-content is applied to each type in the union.

statEnv |- ElementType type lookup OptNillable TypeReference

statEnv |- OptNillable TypeReference expands to Type₁ | · · · | Type_n

statEnv |- Type₁ has-attribute-content Type₁'

· · ·

statEnv |- Type_n has-attribute-content Type_n'

statEnv |- axis attribute:: of ElementType : Type₁' | ... | Type_n'

When applied to an element content type, has-attribute-content yields the part of that type that consists of attribute types.

Type = (Type₁, Type₂)

statEnv |- Type₁ <: attribute*

statEnv |- Type₂ <: ElementContentType* | xdt:anyAtomicType*

statEnv |- Type has-attribute-content Type₁
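
The has-node-content and has-attribute-content judgments both split an element content type into its attribute part and its remaining part. The Python sketch below illustrates that split under simplifying assumptions: a content type is a flat list of item-type strings, attribute types start with the word "attribute", and atomic types are recognized by an "xs:" or "xdt:" prefix. All function names are hypothetical.

```python
def split_content(items):
    """Separate attribute types from the other item types."""
    attributes = [t for t in items if t.startswith("attribute ")]
    content = [t for t in items if not t.startswith("attribute ")]
    return attributes, content


def is_atomic(item):
    # Assumption of this sketch: atomic types carry an xs: or xdt: prefix.
    return item.startswith(("xs:", "xdt:"))


def has_node_content(items):
    _, content = split_content(items)
    if not content:
        return "empty"   # only attributes
    if all(is_atomic(t) for t in content):
        return "text?"   # attributes followed by a simple type
    return ", ".join(content)  # complex content


def has_attribute_content(items):
    attributes, _ = split_content(items)
    return ", ".join(attributes) if attributes else "empty"
```

Union types, as in the purchase-order example, would be handled by applying these functions to each alternative separately.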

In case of an attribute type, the static type of the attribute axis is empty.

statEnv |- axis attribute:: of AttributeType : empty

In case of a text node type, the static type of the attribute axis is empty.

statEnv |- axis attribute:: of text : empty

In case of a comment node type, the static type of the attribute axis is empty.

statEnv |- axis attribute:: of comment : empty

In case of a processing-instruction node type, the static type of the attribute axis is empty.

statEnv |- axis attribute:: of processing-instruction : empty

In case of a document node type, the static type of the attribute axis is empty.

statEnv |- axis attribute:: of document { Type } : empty

8.2.2.1.5 Inference rules for the `parent` axis

Semantics

The type for the parent of an element type, a text node type, a processing-instruction node type, or a comment node type is either an element, a document, or empty.

statEnv |- axis parent:: of element : (element | document)?

statEnv |- axis parent:: of text : (element | document)?

statEnv |- axis parent:: of processing-instruction : (element | document)?

statEnv |- axis parent:: of comment : (element | document)?

The type for the parent of an attribute node is an element or empty.

statEnv |- axis parent:: of AttributeType : element?

The type for the parent of a document node type is always empty.

statEnv |- axis parent:: of DocumentType : empty
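
The parent-axis rules depend only on the kind of the input node type. A minimal Python sketch follows, assuming node kinds are passed as strings; the function name is hypothetical.

```python
def parent_axis_type(node_kind):
    """Static type of parent:: for a given node kind, as a type string."""
    if node_kind == "document":
        return "empty"
    if node_kind == "attribute":
        return "element?"
    if node_kind in ("element", "text", "processing-instruction", "comment"):
        return "(element | document)?"
    raise ValueError(f"unknown node kind: {node_kind}")
```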

8.2.2.1.6 Inference rules for the `namespace` axis

Semantics

The type for the namespace axis is always empty.

statEnv |- axis namespace:: of NodeType : empty

8.2.2.1.7 Inference rules for the `descendant` axis

Semantics

The type for the descendant axis is obtained as the closure of the type of the child axis. This is expressed by the following inference rule.

statEnv |- axis child:: of Type : Type₁

statEnv |- axis child:: of prime(Type₁) : Type₂

...

statEnv |- axis child:: of prime(Type_n) : Type_n+1

statEnv |- prime(Type_n+1) <: prime(Type₁) | ... | prime(Type_n)

statEnv |- axis descendant:: of Type : (prime(Type₁) | ... | prime(Type_n))*

Note

Note that the last premise in the above rule terminates the recursion. The rule computes the n-th type Type_n such that applying the child axis one more time does not add any new item type to the union. This condition is guaranteed to hold at some point, because the number of item types is bounded by all of the item types defined in the in-scope schema definitions.
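
The closure computation can be illustrated as a fixed-point iteration. The Python sketch below is informal and makes two assumptions: item types are represented by strings, and a hypothetical mapping child_types gives, for each item type, the set of item types produced by the child axis (that is, the prime type of the child-axis result).

```python
def descendant_item_types(start, child_types):
    """Return the union of item types reachable through the child axis."""
    seen = set()
    frontier = set(child_types.get(start, ()))
    # Stop once applying the child axis again adds no new item type.
    while not frontier <= seen:
        seen |= frontier
        frontier = {c for t in frontier for c in child_types.get(t, ())}
    return seen


child_types = {
    "element section": {"element title", "element section", "element para"},
    "element title": {"text"},
    "element para": {"text"},
}
reachable = descendant_item_types("element section", child_types)
```

The static type of the descendant axis is then the union of the collected item types with the occurrence indicator *. Termination follows from the argument in the note above, since child_types is finite.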

8.2.2.1.8 Inference rules for the `descendant-or-self` axis

Semantics

The type for the descendant-or-self axis is the union of the type for the self axis and for the descendant axis.

statEnv |- axis descendant:: of Type₁ : Type₂

statEnv |- axis descendant-or-self:: of Type₁ : (prime(Type₁) | prime(Type₂))*

8.2.2.1.9 Inference rules for the `ancestor` axis

Semantics

The type for the ancestor axis is computed in a way similar to that for the descendant axis.

statEnv |- axis ancestor:: of NodeType : (element | document)*

Note that the following rule always results in the type (element | document)*, but this formulation is preferred for consistency, and in case the static typing for the parent axis is improved in a future version.

statEnv |- axis parent:: of Type : Type₁

statEnv |- axis parent:: of prime(Type₁) : Type₂

...

statEnv |- axis parent:: of prime(Type_n) : Type_n+1

statEnv |- prime(Type_n+1) <: prime(Type₁) | ... | prime(Type_n)

statEnv |- axis ancestor:: of Type : (prime(Type₁) | ... | prime(Type_n))*

8.2.2.1.10 Inference rules for the `ancestor-or-self` axis

Semantics

The type for the ancestor-or-self axis is the union of the type for the self axis and for the ancestor axis.

statEnv |- axis ancestor:: of Type₁ : Type₂

statEnv |- axis ancestor-or-self:: of Type₁ : (prime(Type₁) | prime(Type₂))*

8.2.2.2 Dynamic semantics of axes

Notation

The following judgment

dynEnv |- axis Axis of Value₁ => Value₂

holds when applying the axis Axis on Value₁ yields Value₂.

Example

For example, the following judgments hold.

  axis child::      of    element sizes { text { "1 2 3" } }  =>  text { "1 2 3" }

  axis attribute::  of
     element weight of type xs:integer {
       attribute xsi:type of type xs:QName {
         "xs:integer" of type xs:QName
       },
       42 of type xs:integer
     }
  => attribute xsi:type of type xs:QName {
       "xs:integer" of type xs:QName
     }

Semantics

This judgment is specified by the following rules.

The first set of rules is used to process the axis judgment on each individual item in the input sequence.

dynEnv |- axis Axis of () => ()

dynEnv |- axis Axis of Value₁ => Value₃

dynEnv |- axis Axis of Value₂ => Value₄

dynEnv |- axis Axis of Value₁,Value₂ => Value₃,Value₄

The following rules specify how the axis judgment is applied to each Axis.

The self axis just returns the context node.

dynEnv |- axis self:: of NodeValue => NodeValue

The child, parent, attribute, and namespace axes are specified as follows.

dynEnv |- axis child:: of element ElementName { AttributeValue,ElementValue } => ElementValue

dynEnv |- axis attribute:: of element ElementName { AttributeValue,ElementValue } => AttributeValue

dynEnv |- axis parent:: of NodeValue => dm:parent(NodeValue)

Editorial note
The use of the `dm:` should be removed. This can be removed when adding the notion of store in the dynamic rules.

The descendant, descendant-or-self, ancestor, and ancestor-or-self axes are implemented through recursive application of the child and parent axes.

dynEnv |- axis child:: of NodeValue => Value₁

dynEnv |- axis descendant:: of Value₁ => Value₂

dynEnv |- axis descendant:: of NodeValue => Value₁, Value₂

dynEnv |- axis self:: of NodeValue => Value₁

dynEnv |- axis descendant:: of Value₁ => Value₂

dynEnv |- axis descendant-or-self:: of NodeValue => Value₁, Value₂

dynEnv |- axis parent:: of NodeValue => Value₁

dynEnv |- axis ancestor:: of Value₁ => Value₂

dynEnv |- axis ancestor:: of NodeValue => Value₁, Value₂

dynEnv |- axis self:: of NodeValue => Value₁

dynEnv |- axis ancestor:: of Value₁ => Value₂

dynEnv |- axis ancestor-or-self:: of NodeValue => Value₁, Value₂

In all the other cases, the axis application results in an empty sequence.

dynEnv |- axis Axis of NodeValue => () otherwise.
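
The dynamic rules can be illustrated with a small Python sketch over a toy node structure. The Node class, the element helper, and the axis function are hypothetical and stand in for data model values; the parent field plays the role of the dm:parent accessor. The sketch follows the rules literally, so the descendant axis returns the children first and then their descendants, which is not necessarily document order.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(eq=False)
class Node:
    kind: str
    name: str = ""
    attributes: list = field(default_factory=list)
    children: list = field(default_factory=list)
    parent: Optional["Node"] = field(default=None, repr=False)


def element(name, attributes=(), children=()):
    """Build an element node and set the parent of its attributes and children."""
    node = Node("element", name, list(attributes), list(children))
    for n in node.attributes + node.children:
        n.parent = node
    return node


def axis(name, node):
    """Apply the named axis to a single node and return a list of nodes."""
    if name == "self":
        return [node]
    if name == "child":
        return list(node.children)
    if name == "attribute":
        return list(node.attributes)
    if name == "parent":
        return [] if node.parent is None else [node.parent]
    if name in ("descendant", "ancestor"):
        # One step of child (or parent), then recurse on each result.
        step = "child" if name == "descendant" else "parent"
        first = axis(step, node)
        rest = [n for f in first for n in axis(name, f)]
        return first + rest
    if name in ("descendant-or-self", "ancestor-or-self"):
        return [node] + axis(name[: -len("-or-self")], node)
    return []  # all other cases, including the namespace axis


title = element("title", children=[Node("text")])
book = element("book", attributes=[Node("attribute", "id")], children=[title])
```

For a sequence of input nodes, the results for the individual nodes would be concatenated, as in the first set of rules.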

8.2.3 Auxiliary judgments for node tests

A node test may be a name test or a kind test. In the static and dynamic semantics, we begin with name tests, followed by kind tests.

8.2.3.1 Static semantics of node tests

Notation

The following judgment

statEnv |- test NodeTest with PrincipalNodeKind of Type₁ : Type₂

holds when applying the node test NodeTest on the type Type₁ in the context of the given principal node kind yields the type Type₂.

Example

For example, assuming the extended XML Schema given in section [2.4.5 Example of a complete Schema], the following judgment holds.

  test shipTo with element of
    element shipTo of type USAddress,
    element billTo of type USAddress,
    element ipo:comment?,
    element items of type Items
  : element shipTo of type USAddress

Semantics

This judgment is specified by the following rules.

The first set of rules is similar to that for axes, and is used to process each individual item type in the input content model.

statEnv |- test NodeTest with PrincipalNodeKind of Type₁ : Type₂

statEnv |- test NodeTest with PrincipalNodeKind of Type₁ Occurrence : Type₂ Occurrence

statEnv |- test NodeTest with PrincipalNodeKind of Type₁ : Type₃

statEnv |- test NodeTest with PrincipalNodeKind of Type₂ : Type₄

statEnv |- test NodeTest with PrincipalNodeKind of Type₁ & Type₂ : Type₃ & Type₄

statEnv |- test NodeTest with PrincipalNodeKind of Type₁ : Type₃

statEnv |- test NodeTest with PrincipalNodeKind of Type₂ : Type₄

statEnv |- test NodeTest with PrincipalNodeKind of Type₁ , Type₂ : Type₃ , Type₄

statEnv |- test NodeTest with PrincipalNodeKind of Type₁ : Type₃

statEnv |- test NodeTest with PrincipalNodeKind of Type₂ : Type₄

statEnv |- test NodeTest with PrincipalNodeKind of Type₁|Type₂ : Type₃|Type₄

statEnv |- test NodeTest with PrincipalNodeKind of none : none

statEnv |- test NodeTest with PrincipalNodeKind of empty : empty

The following rules specify how the test judgment applies to node tests in the context of a principal node kind. We start with name tests followed by kind tests.

8.2.3.1.1 Name Tests

Name tests on elements and attributes always compute the most specific type possible. For example, if $v is bound to an element with a computed name, the type of $v is element. The static type computed for the expression $v/self::foo is element foo of type xs:anyType, which makes use of foo in the name test to compute a more specific type. Also note that each case of name matching restricts the principal node kind appropriately.

statEnv |- QName₁ of elem/type expands to expanded-QName

statEnv |- QName₂ of elem/type expands to expanded-QName

statEnv |- test QName₂ with element of element QName₁ OptTypeSpecifier : element QName₁ OptTypeSpecifier

statEnv |- QName₂ of elem/type expands to expanded-QName₂

fn:local-name-from-QName(expanded-QName₂) = LocalPart₁

statEnv |- test QName₂ with element of element *:LocalPart₁ OptTypeSpecifier : element QName₂ OptTypeSpecifier

statEnv |- QName₂ of elem/type expands to expanded-QName₂

fn:namespace-uri-from-QName(expanded-QName₂) = statEnv.namespace(Prefix₁)

statEnv |- test QName₂ with element of element Prefix₁:* OptTypeSpecifier : element Prefix₁:LocalPart₂ OptTypeSpecifier

statEnv |- test QName₂ with element of element OptTypeSpecifier : element QName₂ OptTypeSpecifier

statEnv |- QName₁ of elem/type expands to expanded-QName₁

fn:namespace-uri-from-QName( expanded-QName₁ ) = statEnv.namespace(Prefix₁)

LocalPart₂ = fn:local-name-from-QName( expanded-QName₁ )

statEnv |- test *:LocalPart₂ with element of element QName₁ OptTypeSpecifier : element QName₁ OptTypeSpecifier

LocalPart₁ = LocalPart₂

statEnv |- test *:LocalPart₂ with element of element *:LocalPart₁ OptTypeSpecifier : element *:LocalPart₂ OptTypeSpecifier

statEnv |- test *:LocalPart₂ with element of element Prefix₁:* OptTypeSpecifier : element Prefix₁:LocalPart₂ OptTypeSpecifier

statEnv |- test *:LocalPart₂ with element of element OptTypeSpecifier : element *:LocalPart₂ OptTypeSpecifier

statEnv |- QName₁ of elem/type expands to expanded-QName₁

fn:namespace-uri-from-QName( expanded-QName₁) = statEnv.namespace(Prefix₂)

statEnv |- test Prefix₂:* with element of element QName₁ OptTypeSpecifier : element QName₁ OptTypeSpecifier

statEnv |- test Prefix₂:* with element of element *:LocalPart₁ OptTypeSpecifier : element Prefix₂:LocalPart₁ OptTypeSpecifier

statEnv.namespace(Prefix₁) = statEnv.namespace(Prefix₂)

statEnv |- test Prefix₂:* with element of element Prefix₁:* OptTypeSpecifier : element Prefix₁:* OptTypeSpecifier

statEnv |- test Prefix₂:* with element of element OptTypeSpecifier : element Prefix₂:* OptTypeSpecifier

statEnv |- test * with element of element QName OptTypeSpecifier : element QName OptTypeSpecifier

Similar typing rules apply to the attribute name tests:

statEnv |- QName₁ of attr expands to expanded-QName

statEnv |- QName₂ of attr expands to expanded-QName

statEnv |- test QName₂ with attribute of attribute QName₁ OptTypeReference : attribute QName₁ OptTypeReference

statEnv |- QName₂ of attr expands to expanded-QName₂

fn:local-name-from-QName(expanded-QName₂) = LocalPart₁

statEnv |- test QName₂ with attribute of attribute *:LocalPart₁ OptTypeReference : attribute QName₂ OptTypeReference

statEnv |- QName₂ of attr expands to expanded-QName₂

fn:namespace-uri-from-QName(expanded-QName₂) = statEnv.namespace(Prefix₁)

statEnv |- test QName₂ with attribute of attribute Prefix₁:* OptTypeReference : attribute Prefix₁:LocalPart₂ OptTypeReference

statEnv |- test QName₂ with attribute of attribute OptTypeReference : attribute QName₂ OptTypeReference

statEnv |- QName₁ of attr expands to expanded-QName₁

fn:local-name-from-QName( expanded-QName₁ ) = LocalPart₂

statEnv |- test *:LocalPart₂ with attribute of attribute QName₁ OptTypeReference : attribute QName₁ OptTypeReference

LocalPart₁ = LocalPart₂

statEnv |- test *:LocalPart₂ with attribute of attribute *:LocalPart₁ OptTypeReference : attribute *:LocalPart₂ OptTypeReference

statEnv |- test *:LocalPart₂ with attribute of attribute Prefix₁:* OptTypeReference : attribute Prefix₁:LocalPart₂ OptTypeReference

statEnv |- test *:LocalPart₂ with attribute of attribute OptTypeReference : attribute *:LocalPart₂ OptTypeReference

fn:namespace-uri-from-QName( QName₁) = statEnv.namespace(Prefix₂)

statEnv |- test Prefix₂:* with attribute of attribute QName₁ OptTypeReference : attribute QName₁ OptTypeReference

statEnv |- test Prefix₂:* with attribute of attribute *:LocalPart₁ OptTypeReference : attribute Prefix₂:LocalPart₁ OptTypeReference

statEnv.namespace(Prefix₁) = statEnv.namespace(Prefix₂)

statEnv |- test Prefix₂:* with attribute of attribute Prefix₁:* OptTypeReference : attribute Prefix₁:* OptTypeReference

statEnv |- test Prefix₂:* with attribute of attribute OptTypeReference : attribute Prefix₂:* OptTypeReference

statEnv |- test * with attribute of attribute QName OptTypeReference : attribute QName OptTypeReference
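
Taken together, the successful name-test rules for elements and attributes combine the name in the test with the name in the type component by component, keeping whichever side is more specific. The Python sketch below illustrates this reading and is not a definitive implementation. It assumes that a name is a pair (namespace URI, local part) in which None stands for a wildcard, so that a type without a name is (None, None); the function name is hypothetical. The optional type specifier or type reference is carried over unchanged by the rules and is therefore omitted.

```python
def name_test(test, declared):
    """Return the most specific name matching both arguments, or None.

    Each argument is a (namespace URI, local part) pair; None is a wildcard.
    A result of None means that no rule applies (the empty type).
    """
    result = []
    for wanted, known in zip(test, declared):
        if wanted is None or wanted == known:
            result.append(known)    # the type is at least as specific
        elif known is None:
            result.append(wanted)   # the test narrows a wildcard in the type
        else:
            return None             # two different names never match
    return tuple(result)
```

For example, testing the name ("urn:po", "shipTo") against an unnamed element type yields ("urn:po", "shipTo"), which corresponds to the rule that types $v/self::foo as element foo.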

Lastly, if none of the above rules holds, then the name test yields the empty type.

statEnv |- [NameTest]_sequencetype = ElementNameOrWildcard₁ TypeSpecifier₁

statEnv |- not(ElementNameOrWildcard₁ TypeSpecifier₁ <: ElementNameOrWildcard₂ TypeSpecifier₂)

statEnv |- not(ElementNameOrWildcard₂ TypeSpecifier₂ <: ElementNameOrWildcard₁ TypeSpecifier₁)

statEnv |- TypeSpecifier₁ expands to Type₁

statEnv |- TypeSpecifier₂ expands to Type₂

statEnv |- not(Type₁ <: Type₂)

statEnv |- not(Type₂ <: Type₁)

statEnv |- test NameTest with element of ElementNameOrWildcard₂ TypeSpecifier₂ : empty

8.2.3.1.2 Kind Tests

All the rules for typing the document, element, and attribute kind tests are similar. First, the document, element, or attribute test is normalized to the equivalent document, element, or attribute type by applying the []_sequencetype normalization rule to the kind test.

After normalization of the kind test as an XQuery type, that type is compared to the expression's inferred type. If one is a subtype of the other, then the kind test yields the smaller type.

Document kind test

Semantics

If the type of the expression is a subtype of the document kind test, then we are guaranteed that during evaluation, the expression's value will always match the document kind test, and therefore the type of the entire expression is the type of the input expression.

statEnv |- [DocumentTest]_sequencetype = DocumentType

statEnv |- Type₁ <: DocumentType

statEnv |- test DocumentTest with element of Type₁ : Type₁

Conversely, if the type of the document kind test is a subtype of the expression, then during evaluation, the expression's value may or may not match the document kind test, and therefore the type of the entire expression is zero-or-one of the type of the document kind test.

statEnv |- [DocumentTest]_sequencetype = DocumentType

statEnv |- DocumentType <: Type₁

statEnv |- test DocumentTest with element of Type₁ : DocumentType?

If the types of the expression and document kind test are unrelated, then we apply the kind test rule recursively on the element types, which may yield a non-empty type.

statEnv |- [document-node (ElementTest)]_sequencetype = DocumentType

statEnv |- not(Type₁ <: DocumentType or DocumentType <: Type₁)

statEnv |- test ElementTest with element of Type₁ : Type₂

not(Type₂ <: empty)

statEnv |- test document-node (ElementTest) with element of document { Type₁ } : document { Type₂ }

If there is no non-empty type, then the kind test yields the empty type.

statEnv |- [document-node (ElementTest)]_sequencetype = DocumentType

statEnv |- not(Type₁ <: DocumentType or DocumentType <: Type₁)

statEnv |- test ElementTest with element of Type₁ : Type₂

Type₂ <: empty

statEnv |- test document-node (ElementTest) with element of document { Type₁ } : empty

Element kind test

Semantics

The rules for the element kind test are similar to those for the document kind test.

If the type of the expression is a subtype of the element kind test, then we are guaranteed that during evaluation, the expression's element value will always match the element kind test, and therefore the type of the entire expression is the type of the input expression.

statEnv |- [ElementTest]_sequencetype = ElementType

statEnv |- Type₁ <: ElementType

statEnv |- test ElementTest with element of Type₁ : Type₁

Conversely, if the type of the element kind test is a subtype of the expression, then during evaluation, the expression's element value may or may not match the element kind test, and therefore the type of the entire expression is zero-or-one of the type of the element kind test.

statEnv |- [ElementTest]_sequencetype = ElementType

statEnv |- ElementType <: Type₁

statEnv |- test ElementTest with element of Type₁ : ElementType?
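
The first two rules, which are shared by the document, element, and attribute kind tests, can be sketched in Python as follows. The sketch is illustrative only: types are strings, and the subtype relation is a toy one given by a hypothetical SUPERTYPE table rather than the subtyping judgment of the formal semantics.

```python
# Toy subtype hierarchy, assumed for illustration only.
SUPERTYPE = {
    "element shipTo of type USAddress": "element shipTo",
    "element shipTo": "element",
}


def is_subtype(sub, sup):
    """Walk up the toy hierarchy from sub, looking for sup."""
    while sub is not None:
        if sub == sup:
            return True
        sub = SUPERTYPE.get(sub)
    return False


def kind_test(test_type, input_type):
    """Apply the two subtype-based rules; None means they do not apply."""
    if is_subtype(input_type, test_type):
        return input_type          # the test always succeeds
    if is_subtype(test_type, input_type):
        return test_type + "?"     # the test may or may not succeed
    return None                    # unrelated types: other rules are needed
```

When the function returns None, the two types are unrelated and the remaining structural rules for the kind test have to be consulted.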

If the types of the expression and element kind test are unrelated (i.e., neither type is a subtype of the other), then we must compare the structure of the type of the element test with the type of the element expression, as an element type or test may contain wildcards.

In the first case, the element kind test contains an element name and a type name and the input expression's type contains only a type name. If the input expression's content type is a subtype of the element kind test's content type, then the type of the entire expression is zero-or-one of an element with the given name and the input expression's content type.

statEnv |- [ElementTest]_sequencetype = element ElementName₁ TypeSpecifier₁

statEnv |- TypeSpecifier₁ expands to Type₁

statEnv |- TypeSpecifier₂ expands to Type₂

statEnv |- Type₂ <: Type₁

statEnv |- test ElementTest with element of element TypeSpecifier₂ : element ElementName₁ TypeSpecifier₂?

In the second case, the structure of the input types is reversed: The input expression's type contains an element name and a type name and the element kind test's type contains only a type name. If the element kind test's content type is a subtype of the input expression's content type, then the type of the entire expression is zero-or-one of an element with the given name and the element kind test's content type.

statEnv |- [ElementTest]_sequencetype = element TypeSpecifier₁

statEnv |- TypeSpecifier₁ expands to Type₁

statEnv |- TypeSpecifier₂ expands to Type₂

statEnv |- Type₁ <: Type₂

statEnv |- test ElementTest with element of element ElementName₂ TypeSpecifier₂ : element ElementName₂ TypeSpecifier₁?

Lastly, if none of the above rules holds, then the type of the entire expression is empty.

statEnv |- [ElementTest]_sequencetype = ElementNameOrWildcard₁ TypeSpecifier₁

statEnv |- not(ElementNameOrWildcard₁ TypeSpecifier₁ <: ElementNameOrWildcard₂ TypeSpecifier₂)

statEnv |- not(ElementNameOrWildcard₂ TypeSpecifier₂ <: ElementNameOrWildcard₁ TypeSpecifier₁)

statEnv |- TypeSpecifier₁ expands to Type₁

statEnv |- TypeSpecifier₂ expands to Type₂

statEnv |- not(Type₁ <: Type₂)

statEnv |- not(Type₂ <: Type₁)

statEnv |- test ElementTest with element of ElementNameOrWildcard₂ TypeSpecifier₂ : empty
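The case analysis above can be sketched in code. The following Python fragment is only an illustration under simplifying assumptions: an element type is modelled as a hypothetical pair (name, type name) in which a name of None stands for a wildcard, the derivation hierarchy DERIVES is a toy table, and elem_subtype is a simplified stand-in for the subtyping judgment. None of these names come from this specification.

```python
# Hypothetical toy model: an element type is (name, type_name); name None is a wildcard.
DERIVES = {"xs:integer": "xs:decimal", "xs:decimal": "xs:anyAtomicType"}  # assumed hierarchy

def derives_from(t1, t2):
    """True if type name t1 equals t2 or derives from it in the toy hierarchy."""
    while t1 is not None:
        if t1 == t2:
            return True
        t1 = DERIVES.get(t1)
    return False

def elem_subtype(e1, e2):
    """Simplified element subtyping: the names agree (or e2 is a wildcard)
    and the content type of e1 derives from that of e2."""
    (n1, t1), (n2, t2) = e1, e2
    return (n2 is None or n1 == n2) and derives_from(t1, t2)

def static_element_test(test, inp):
    """Return (element type, occurrence) for the test applied to the input type,
    or None when the static type is empty."""
    if elem_subtype(inp, test):          # input type is a subtype of the test
        return (inp, "1")
    if elem_subtype(test, inp):          # test is a subtype of the input type
        return (test, "?")
    (test_name, test_type), (in_name, in_type) = test, inp
    # First structural case: the test has a name, the input has only a type.
    if test_name is not None and in_name is None and derives_from(in_type, test_type):
        return ((test_name, in_type), "?")
    # Second structural case: the input has a name, the test has only a type.
    if test_name is None and in_name is not None and derives_from(test_type, in_type):
        return ((in_name, test_type), "?")
    return None                          # none of the rules holds: empty
```

For instance, testing element a of type xs:decimal against an input of type element * of type xs:integer falls into the first structural case and yields an optional element a of type xs:integer.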

Attribute kind test

Semantics

The rules for the attribute kind test are isomorphic to those for element kind test.

If the type of the expression is a subtype of the attribute kind test, then we are guaranteed that during evaluation, the expression's attribute value will always match the attribute kind test, and therefore the type of the entire expression is the type of the input expression.

statEnv |- [AttributeTest]_sequencetype = AttributeType

statEnv |- Type₁ <: AttributeType

statEnv |- test AttributeTest with attribute of Type₁ : Type₁

Conversely, if the type of the attribute kind test is a subtype of the expression, then during evaluation, the expression's attribute value may or may not match the attribute kind test, and therefore the type of the entire expression is zero-or-one of the type of the attribute kind test.

statEnv |- [AttributeTest]_sequencetype = AttributeType

statEnv |- AttributeType <: Type₁

statEnv |- test AttributeTest with attribute of Type₁ : AttributeType?

If the types of the expression and attribute kind test are unrelated (i.e., neither type is a subtype of the other), then we must compare the structure of the type of the attribute test with the type of the attribute expression, as an attribute type or test may contain wildcards.

In the first case, the attribute kind test contains an attribute name and a type name and the input expression's type contains only a type name. If the input expression's content type is a subtype of the attribute kind test's content type, then the type of the entire expression is zero-or-one of an attribute with the given name and the input expression's content type.

statEnv |- [AttributeTest]_sequencetype = attribute AttributeName₁ TypeReference₁

statEnv |- TypeReference₁ expands to Type₁

statEnv |- TypeReference₂ expands to Type₂

statEnv |- Type₂ <: Type₁

statEnv |- test AttributeTest with attribute of attribute TypeReference₂ : attribute AttributeName₁ TypeReference₂?

In the second case, the structure of the input types is reversed: The input expression's type contains an attribute name and a type name and the attribute kind test's type contains only a type name. If the attribute kind test's content type is a subtype of the input expression's content type, then the type of the entire expression is zero-or-one of an attribute with the given name and the attribute kind test's content type.

statEnv |- [AttributeTest]_sequencetype = attribute TypeReference₁

statEnv |- TypeReference₁ expands to Type₁

statEnv |- TypeReference₂ expands to Type₂

statEnv |- Type₁ <: Type₂

statEnv |- test AttributeTest with attribute of attribute AttributeName₂ TypeReference₂ : attribute AttributeName₂ TypeReference₁?

Lastly, if none of the above rules holds, then the type of the entire expression is empty.

statEnv |- [AttributeTest]_sequencetype = AttributeName₁ TypeReference₁

statEnv |- not(AttributeName₁ TypeReference₁ <: AttributeName₂ TypeReference₂)

statEnv |- not(AttributeName₂ TypeReference₂ <: AttributeName₁ TypeReference₁)

statEnv |- TypeReference₁ expands to Type₁

statEnv |- TypeReference₂ expands to Type₂

statEnv |- not(Type₁ <: Type₂)

statEnv |- not(Type₂ <: Type₁)

statEnv |- test AttributeTest with attribute of AttributeName₂ TypeReference₂ : empty

Processing instruction, comment, and text kind tests.

Semantics

statEnv |- test processing-instruction() with PrincipalNodeKind of processing-instruction : processing-instruction

A processing-instruction node test with a string literal or NCName matches a processing instruction whose target has the given name. Since target matching cannot be checked statically, the static type of the node test is zero-or-one processing instruction.

statEnv |- test processing-instruction(StringLiteral | NCName) with PrincipalNodeKind of processing-instruction : processing-instruction?

statEnv |- test comment() with PrincipalNodeKind of comment : comment

statEnv |- test text() with PrincipalNodeKind of text : text

statEnv |- test node() with PrincipalNodeKind of NodeType : NodeType

If none of the above rules applies, then the node test returns the empty sequence and the following rule applies:

statEnv |- test node() with PrincipalNodeKind of NodeType : empty

8.2.3.2 Dynamic semantics of node tests

Notation

The following judgment

dynEnv |- test NodeTest with PrincipalNodeKind of Value₁ => Value₂

holds when applying the node test NodeTest on Value₁ in the context of the PrincipalNodeKind yields Value₂.

Example

For example, the following judgments hold.

  test node()  with element  of    text { "1 2 3" }  => text { "1 2 3" }
  test size    with element  of    text { "1 2 3" }  => ()

  test foo:*   with element  of
     (element foo:a of type xs:int { 1 },
      element foo:a of type xs:int { 2 },
      element bar:b of type xs:int { 3 },
      element bar:c of type xs:int { 4 },
      element foo:d of type xs:int { 5 })
  => (element foo:a of type xs:int { 1 },
      element foo:a of type xs:int { 2 },
      (),
      (),
      element foo:d of type xs:int { 5 })

Note

The last example illustrates how a test judgment operates on a sequence of nodes, applying the test on each node in the sequence individually, while preserving the structure of the sequence.

Semantics

This judgment is specified by the following rules.

The first set of rules is similar to those for axes, and is used to process the test judgment on each individual item in the input sequence.

dynEnv |- test NodeTest with PrincipalNodeKind of () => ()

dynEnv |- test NodeTest with PrincipalNodeKind of Value₁ => Value₃

dynEnv |- test NodeTest with PrincipalNodeKind of Value₂ => Value₄

dynEnv |- test NodeTest with PrincipalNodeKind of Value₁,Value₂ => Value₃,Value₄

8.2.3.2.1 Name Tests

The following rules specify how the value filter judgment is applied on a name test in the context of a principal node kind.

Semantics

dm:node-kind( NodeValue ) = PrincipalNodeKind

fn:node-name( NodeValue ) = expanded-QName

fn:namespace-uri-from-QName( expanded-QName) = statEnv.namespace(Prefix)

fn:local-name-from-QName( expanded-QName ) = LocalPart

dynEnv |- test Prefix:LocalPart with PrincipalNodeKind of NodeValue => NodeValue

dm:node-kind( NodeValue ) = PrincipalNodeKind

dynEnv |- test * with PrincipalNodeKind of NodeValue => NodeValue

dm:node-kind( NodeValue ) = PrincipalNodeKind

fn:node-name ( NodeValue ) = expanded-QName

fn:namespace-uri-from-QName ( expanded-QName ) = statEnv.namespace(Prefix)

dynEnv |- test Prefix:* with PrincipalNodeKind of NodeValue => NodeValue

dm:node-kind( NodeValue ) = PrincipalNodeKind

fn:node-name ( NodeValue ) = expanded-QName

fn:local-name-from-QName ( expanded-QName ) = LocalPart

dynEnv |- test *:LocalPart with PrincipalNodeKind of NodeValue => NodeValue
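The four name test rules, together with the rules that apply a test to each item of a sequence, can be sketched as follows. This Python fragment is a minimal illustration, not part of the formal semantics: the Node record, the NAMESPACES table (standing in for statEnv.namespace) and the example URIs are all assumptions made for the sketch, and an unprefixed name is assumed to be in no namespace.

```python
from typing import NamedTuple, Optional

class Node(NamedTuple):
    kind: str             # node kind, e.g. "element", "attribute", "text"
    uri: Optional[str]    # namespace URI of the expanded QName, if any
    local: Optional[str]  # local part of the expanded QName, if any

# Assumed stand-in for statEnv.namespace; the URIs are made up for the example.
NAMESPACES = {"foo": "http://example.org/foo", "bar": "http://example.org/bar"}

def matches_name(test: str, principal: str, node: Node) -> bool:
    """Check one node against a name test such as 'foo:a', '*', 'foo:*' or '*:a'."""
    if node.kind != principal:          # dm:node-kind must be the principal node kind
        return False
    if test == "*":
        return True
    prefix, _, local = test.rpartition(":")   # "size" gives ("", "", "size")
    uri_ok = prefix == "*" or node.uri == NAMESPACES.get(prefix)
    local_ok = local == "*" or node.local == local
    return uri_ok and local_ok

def name_test(test: str, principal: str, nodes):
    """Apply the test to each node in order; a non-matching node contributes ()."""
    return [n for n in nodes if matches_name(test, principal, n)]
```

Applied to the five-element sequence of the earlier example, name_test("foo:*", "element", ...) keeps the two foo:a elements and the foo:d element, in their original order.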

8.2.3.2.2 Kind Tests

All the rules for evaluating the document, element, and attribute kind tests are similar. First, the document, element, or attribute test is normalized to the equivalent document, element, or attribute type by applying the []_sequencetype normalization rule. As explained in [3.5.3 SequenceType Syntax], SequenceTypes are normalized to XQuery types whenever a dynamic or static rule requires the corresponding type. The reason for this deviation from the processing model is that the result of SequenceType normalization is not part of the [XPath/XQuery] core syntax.

After normalization of the SequenceType to an XQuery type, the document, element, or attribute value is simply matched against the XQuery type. If the value matches the type, then the judgment yields the value, otherwise the judgment yields the empty sequence.

Document kind test

Semantics

statEnv |- [DocumentTest]_sequencetype = DocumentType

statEnv |- DocumentValue matches DocumentType

dynEnv |- test DocumentTest with element of DocumentValue => DocumentValue

statEnv |- [DocumentTest]_sequencetype = DocumentType

statEnv |- not(DocumentValue matches DocumentType)

dynEnv |- test DocumentTest with element of DocumentValue => ()

Element kind test

Semantics

statEnv |- [ElementTest]_sequencetype = ElementType

statEnv |- ElementValue matches ElementType

dynEnv |- test ElementTest with element of ElementValue => ElementValue

statEnv |- [ElementTest]_sequencetype = ElementType

statEnv |- not(ElementValue matches ElementType)

dynEnv |- test ElementTest with element of ElementValue => ()

Attribute kind test

Semantics

statEnv |- [AttributeTest]_sequencetype = AttributeType

statEnv |- AttributeValue matches AttributeType

dynEnv |- test AttributeTest with attribute of AttributeValue => AttributeValue

statEnv |- [AttributeTest]_sequencetype = AttributeType

statEnv |- not(AttributeValue matches AttributeType)

dynEnv |- test AttributeTest with attribute of AttributeValue => ()

Processing instruction, comment, and text kind tests.

Semantics

dm:node-kind ( NodeValue ) = "processing-instruction"

dynEnv |- test processing-instruction() with PrincipalNodeKind of NodeValue => NodeValue

dm:node-kind ( NodeValue ) = "processing-instruction"

fn:node-name ( NodeValue ) = expanded-QName

fn:local-name-from-QName ( expanded-QName ) = String

dynEnv |- test processing-instruction( StringLiteral ) with PrincipalNodeKind of NodeValue => NodeValue

not(dm:node-kind ( NodeValue ) = "processing-instruction")

dynEnv |- test processing-instruction() with PrincipalNodeKind of NodeValue => ()

dm:node-kind ( NodeValue ) = "comment"

dynEnv |- test comment() with PrincipalNodeKind of NodeValue => NodeValue

not(dm:node-kind ( NodeValue ) = "comment")

dynEnv |- test comment() with PrincipalNodeKind of NodeValue => ()

dm:node-kind ( NodeValue ) = "text"

dynEnv |- test text() with PrincipalNodeKind of NodeValue => NodeValue

not(dm:node-kind ( NodeValue ) = "text")

dynEnv |- test text() with PrincipalNodeKind of NodeValue => ()

The node() node test is true for all nodes. Therefore, the following rule does not have any precondition (remember that an empty upper part in the rule indicates that the rule is always true).

dynEnv |- test node() with PrincipalNodeKind of NodeValue => NodeValue

If none of the above rules applies then the node test returns the empty sequence, and the following dynamic rule is applied:

dynEnv |- test node() with PrincipalNodeKind of NodeValue => ()

8.3 Judgments for type matching

Introduction

XQuery supports type declarations on variable bindings, and several operations on types (typeswitch, instance of, etc). This section describes judgments used for the specification of the semantics of those operations.

The "match" judgment formally specifies type matching. It takes as input a value and a type and either succeeds or fails. It is used in matching parameters against function signatures and type declarations, and in matching values against cases in "typeswitch". An informal description of type matching is given in Section 2.5.4 SequenceType Matching^XQ.
The "subtyping" judgment takes two types and succeeds if all values matching the first type also match the second. It is used to define the static semantics of operations using type matching.

8.3.1 Matches

Notation

The judgment

statEnv |- Value matches Type

holds when the given value matches the given type.

Example

For example, assuming the extended XML Schema given in section [2.4.5 Example of a complete Schema], then the following judgments hold.

  element comment of type xsd:string { "This is not important" }
    matches
  element comment of type xsd:string

  (element apt of type fs:anon3 { 2510 },
   element apt of type fs:anon3 { 2511 })
    matches
  element apt+

  ()
    matches
  element usaddress?

  element usaddress of type USAddress {
    element name of type xsd:string { "The Archive" },
    element street of type xsd:string { "Christopher Street" },
    element city of type xsd:string { "New York" },
    element state of type xsd:string { "NY" },
    element zip of type xsd:decimal { 10210 }
  }
    matches
  element usaddress?

Semantics

We start by giving the inference rules for matching an item value with an item type.

An atomic value matches an atomic type if its type annotation^XQ derives from the atomic type. The value itself is ignored -- this is checked as part of validation.

statEnv |- AtomicTypeName₁ derives from AtomicTypeName₂

statEnv |- AtomicValue of type AtomicTypeName₁ matches AtomicTypeName₂

A text node matches text.

statEnv |- text { String } matches text

A comment node matches comment.

statEnv |- comment { String } matches comment

A processing-instruction node matches processing-instruction.

statEnv |- processing-instruction QName { String } matches processing-instruction

A document node matches a document type if the node's content matches the document type's corresponding content type.

statEnv |- Value matches Type

statEnv |- document { Value } matches document { Type }

The rules for matching an element value with an element type are more complicated. When an element value is not nilled, the element matches an element type if the element name and the element type resolve to some type name, and the element value's type annotation^XQ is derived from the resolved type name. Note that there is no need to check structural constraints on the value since those have been checked during XML Schema validation and the value is assumed to be consistent with its type annotation^XQ.

statEnv |- ElementName name lookup ElementType yields OptNillable of type BaseTypeName

statEnv |- TypeName derives from BaseTypeName

Value filter @xsi:nil => () or false

statEnv |- element ElementName of type TypeName { Value } matches ElementType

Note

Type matching uses the name lookup judgment defined in [8.1.3 Element and attribute name lookup (Dynamic)].

In the case where the element has been nilled, that is, there exists an xsi:nil attribute set to true in the element value, the following rule checks that the type is nillable.

statEnv |- ElementName name lookup ElementType yields nillable of type BaseTypeName

statEnv |- TypeName derives from BaseTypeName

Value filter @xsi:nil => true

statEnv |- element ElementName of type TypeName { Value } matches ElementType

The rule for attributes is similar, but does not require the check for the xsi:nil attribute.

statEnv |- AttributeName name lookup AttributeType yields of type BaseTypeName

statEnv |- TypeName derives from BaseTypeName

statEnv |- attribute AttributeName of type TypeName { Value } matches AttributeType

A type can also be a sequence of items. In that case, the matching rules also need to check whether the constraints described by the type as a regular expression hold. This is specified by the following rules.

The empty sequence matches the empty sequence type.

statEnv |- () matches empty

If two values match two types, then their sequence matches the corresponding sequence type.

statEnv |- Value₁ matches Type₁

statEnv |- Value₂ matches Type₂

statEnv |- Value₁,Value₂ matches Type₁,Type₂

If a value matches a type, then it also matches a choice type where that type is one of the choices.

statEnv |- Value matches Type₁

statEnv |- Value matches Type₁|Type₂

statEnv |- Value matches Type₂

statEnv |- Value matches Type₁|Type₂

If two values match two types, then their interleaving matches the corresponding all group.

statEnv |- Value₁ matches Type₁

statEnv |- Value₂ matches Type₂

statEnv |- Value₁ interleave Value₂ yields Value

statEnv |- Value matches Type₁ & Type₂

An optional type matches a value of that type or the empty sequence.

statEnv |- Value matches (Type | empty)

statEnv |- Value matches Type?

The following rules are used to match a value against a sequence of zero (or one) or more types.

statEnv |- () matches Type*

statEnv |- Value₁ matches Type

statEnv |- Value₂ matches Type*

statEnv |- Value₁, Value₂ matches Type*

statEnv |- Value₁ matches Type

statEnv |- Value₂ matches Type*

statEnv |- Value₁, Value₂ matches Type+
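To make the regular-expression reading of these rules concrete, the following Python sketch matches a flat sequence of item labels against a type. It is only an illustration under stated assumptions: types are encoded as hypothetical nested tuples, an item matches an item type when the labels are equal (type annotations and derivation are ignored), and the all group (&) is left out. The function ends computes every position at which a match starting at position i can stop, which resolves the non-determinism of the choice and repetition rules by exploring all alternatives.

```python
def ends(ty, vals, i):
    """Set of positions j such that vals[i:j] matches the type ty."""
    tag = ty[0]
    if tag == "empty":
        return {i}
    if tag == "item":                      # ("item", label): exactly one item
        return {i + 1} if i < len(vals) and vals[i] == ty[1] else set()
    if tag == "seq":                       # Type1 , Type2
        return {k for j in ends(ty[1], vals, i) for k in ends(ty[2], vals, j)}
    if tag == "choice":                    # Type1 | Type2
        return ends(ty[1], vals, i) | ends(ty[2], vals, i)
    if tag == "opt":                       # Type? is (Type | empty)
        return {i} | ends(ty[1], vals, i)
    if tag == "plus":                      # Type+ is Type , Type*
        return ends(("seq", ty[1], ("star", ty[1])), vals, i)
    if tag == "star":                      # Type*: iterate until no new position appears
        reached, frontier = {i}, {i}
        while frontier:
            frontier = {k for j in frontier for k in ends(ty[1], vals, j)} - reached
            reached |= frontier
        return reached
    raise ValueError(f"unknown type constructor: {tag}")

def matches(vals, ty):
    """True if the whole sequence vals matches ty."""
    return len(vals) in ends(ty, vals, 0)
```

For example, matches(["apt", "apt"], ("plus", ("item", "apt"))) holds, mirroring the judgment for element apt+ given earlier, and matches([], ("opt", ("item", "usaddress"))) holds as well.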

Note

The above definition of type matching, although complete and precise, does not give a simple means to compute type matching. Notably, some of the above rules can be non-deterministic (e.g., the rule for matching of choice or repetition).

The structural component of the [XPath/XQuery] type system can be modeled by regular expressions. Regular expressions can be implemented by means of finite state automata. Computing type matching is then equivalent to checking whether a given sequence of items is recognized by the corresponding finite state automaton. Finite state automata and their relationships to regular expressions have been extensively studied and documented in computer-science literature. The interested reader can consult the relevant literature, for instance [Languages], or [TATA].

8.3.2 Subtype and Type equality

Introduction

This section defines the semantics of subtyping in [XPath/XQuery]. Subtyping is used during the static type analysis, in typeswitch expressions, treat and assert expressions, and to check the correctness of function applications.

Note that intuitive relationships between types, for instance that (Type,()) is equivalent to Type, can be deduced using the subtyping judgment (and algorithm) described here.

Notation

The judgment

statEnv |- Type₁ <: Type₂

holds if the first type is a subtype of the second.

Semantics

This judgment is true if and only if, for every value Value, if Value matches Type₁ holds, then Value matches Type₂ also holds.

Note

It is easy to see that the subtype relation <: is a partial order, i.e. it is reflexive:

statEnv |- Type <: Type

and it is transitive: if,

statEnv |- Type₁ <: Type₂

and,

statEnv |- Type₂ <: Type₃

then,

statEnv |- Type₁ <: Type₃

Finally, two types are equal if each is a subtype of the other, that is, if:

statEnv |- Type₁ <: Type₂

and,

statEnv |- Type₂ <: Type₁

then,

statEnv |- Type₁ = Type₂

Note

The above definition, although complete and precise, does not give a simple means to compute subtyping. Notably, the definition above refers to values, which are not available at static type checking time.

Finite state automata and how to compute operations on those automata, such as inclusion, emptiness or intersection have been extensively studied and documented in the literature. The interested reader can consult the relevant literature on tree grammars, for instance [Languages], or [TATA].

8.4 Judgments for FLWOR and other expressions on sequences

Introduction

Some [XPath/XQuery] operations work on sequences of items. For instance, [For/FLWOR] expressions iterate over a sequence of items, and the fn:unordered function can return all items in a sequence in any order.

Static typing for those operations needs to infer a type acceptable for all the items in the sequence. This sometimes requires approximating the type known for each item individually.

Example

Assume the variable $shipTo is bound to the shipTo element

    <shipTo country="US">
        <name>Alice Smith</name>
        <street>123 Maple Street</street>
        <city>Mill Valley</city>
        <state>CA</state>
        <zip>90952</zip>
    </shipTo>

and has type

   element shipTo of type USAddress

The following query orders all children of the shipTo element by alphabetical order of their content.

   for $x in $shipTo/*
   order by $x/text()
   return $x

resulting in the sequence

    (<street>123 Maple Street</street>,
     <zip>90952</zip>,
     <name>Alice Smith</name>,
     <state>CA</state>,
     <city>Mill Valley</city>)

This operation iterates over the elements in the input sequence returned by the expression $shipTo/*, whose type is the content of a type USAddress.

    (element name of type xsd:string,
     element street of type xsd:string,
     element city of type xsd:string,
     element state of type xsd:string,
     element zip of type xsd:decimal)

During static typing, one must give a type to the variable $x which corresponds to the type of each element in the sequence. Since each item has a different type, one must find an item type which is valid for all cases in the sequence. This can be done by using a choice for the variable $x, as follows

    (element name of type xsd:string |
     element street of type xsd:string |
     element city of type xsd:string |
     element state of type xsd:string |
     element zip of type xsd:decimal)

This type indicates that the type of the variable can be of any of the item types in the input sequence.

The static inference also needs to approximate the number of occurrences of items in the sequence. In this example, there is always more than one item, so the closest occurrence indicator is + for one or more items.

The static inference for this example finally results in the following type.

    (element name of type xsd:string |
     element street of type xsd:string |
     element city of type xsd:string |
     element state of type xsd:string |
     element zip of type xsd:decimal)+

This section defines a prime type, which is a choice of item types. It defines two functions on types that compute the prime type of an arbitrary type and approximate the occurrence of items in an arbitrary type. These functions are used in the static semantics of many expressions, including "for", "some", and "every" expressions, and of many functions, including the "fn:unordered" and "fn:distinct" functions.

Notation

A choice of item types is called a prime type, as described by the following grammar production.

Prime Types

[47 (Formal)] PrimeType ::= FormalItemType | (PrimeType "|" PrimeType)

Notation

The type function prime(Type) extracts all item types from the type Type, and combines them into a choice.

The function quantifier(Type) approximates the possible number of items in Type with the occurrence indicators supported by the [XPath/XQuery] type system (?, +, *).

For interim results, the auxiliary occurrence indicator 1 denotes exactly one occurrence.

Semantics

The prime function is defined by induction as follows.

prime(FormalItemType)	=	FormalItemType
prime(`empty`)	=	`none`
prime(`none`)	=	`none`
prime(Type₁ , Type₂)	=	prime(Type₁) | prime(Type₂)
prime(Type₁ & Type₂)	=	prime(Type₁) | prime(Type₂)
prime(Type₁ | Type₂)	=	prime(Type₁) | prime(Type₂)
prime(Type?)	=	prime(Type)
prime(Type*)	=	prime(Type)
prime(Type+)	=	prime(Type)
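The inductive definition of prime translates directly into a recursive function. The Python sketch below is an illustration only: types are encoded as hypothetical nested tuples, and the resulting choice of item types is represented as a set of labels. Representing none as the empty set assumes that none acts as the unit of choice, so that (Type | none) is the same as Type.

```python
def prime(ty):
    """Return the item types of ty as a frozenset, read as a choice of those items."""
    tag = ty[0]
    if tag == "item":                       # prime(FormalItemType) = FormalItemType
        return frozenset([ty[1]])
    if tag in ("empty", "none"):            # prime(empty) = prime(none) = none
        return frozenset()
    if tag in ("seq", "all", "choice"):     # "," and "&" and "|" all become a choice
        return prime(ty[1]) | prime(ty[2])
    if tag in ("opt", "star", "plus"):      # occurrence indicators are dropped
        return prime(ty[1])
    raise ValueError(f"unknown type constructor: {tag}")

# Encoding of (element a | element b+), element c*
a, b, c = ("item", "element a"), ("item", "element b"), ("item", "element c")
example = ("seq", ("choice", a, ("plus", b)), ("star", c))
```

With this encoding, prime(example) is the set containing element a, element b and element c, that is, the choice element a | element b | element c.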

Semantics

The quantifier function is defined by induction as follows.

quantifier(FormalItemType)	=	1
quantifier(`empty`)	=	?
quantifier(`none`)	=	1
quantifier(Type₁ , Type₂)	=	quantifier(Type₁) , quantifier(Type₂)
quantifier(Type₁ & Type₂)	=	quantifier(Type₁) , quantifier(Type₂)
quantifier(Type₁ | Type₂)	=	quantifier(Type₁) | quantifier(Type₂)
quantifier(Type?)	=	quantifier(Type) · ?
quantifier(Type*)	=	quantifier(Type) · *
quantifier(Type+)	=	quantifier(Type) · +

This definition uses the sum (Occurrence₁ , Occurrence₂), the choice (Occurrence₁ | Occurrence₂), and the product (Occurrence₁ · Occurrence₂) of two occurrence indicators Occurrence₁, Occurrence₂, which are defined by the following tables.

,	1	?	+	*
1	+	+	+	+
?	+	*	+	*
+	+	+	+	+
*	+	*	+	*

|	1	?	+	*
1	1	?	+	*
?	?	?	*	*
+	+	*	+	*
*	*	*	*	*

·	1	?	+	*
1	1	?	+	*
?	?	?	*	*
+	+	*	+	*
*	*	*	*	*
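The quantifier function and the three tables can be transcribed as lookups. The following Python sketch is an illustration under the assumption that types are encoded as hypothetical nested tuples; the row strings copy the tables above, with rows and columns in the order 1, ?, +, *.

```python
ORDER = "1?+*"   # row and column order used by the three tables

def table(rows):
    """Build a lookup {(row indicator, column indicator): result} from four row strings."""
    return {(r, c): rows[i][j] for i, r in enumerate(ORDER) for j, c in enumerate(ORDER)}

SUM = table(["++++", "+*+*", "++++", "+*+*"])       # Occurrence1 , Occurrence2
CHOICE = table(["1?+*", "??**", "+*+*", "****"])    # Occurrence1 | Occurrence2
PRODUCT = table(["1?+*", "??**", "+*+*", "****"])   # Occurrence1 · Occurrence2

SUFFIX = {"opt": "?", "star": "*", "plus": "+"}

def quantifier(ty):
    """Approximate the number of items in ty by one of 1, ?, +, *."""
    tag = ty[0]
    if tag in ("item", "none"):
        return "1"
    if tag == "empty":
        return "?"
    if tag in ("seq", "all"):
        return SUM[quantifier(ty[1]), quantifier(ty[2])]
    if tag == "choice":
        return CHOICE[quantifier(ty[1]), quantifier(ty[2])]
    if tag in SUFFIX:
        return PRODUCT[quantifier(ty[1]), SUFFIX[tag]]
    raise ValueError(f"unknown type constructor: {tag}")
```

For the type (element a?, element b?), both operands have quantifier ?, and the sum table gives ? , ? = *, since the sequence may hold zero, one or two items.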

Examples

For example, here are the results of applying prime and quantifier on a few simple types.

  prime(element a+)                         = element a
  prime(element a | empty)                  = element a
  prime(element a?,element b?)              = element a | element b
  prime(element a | element b+, element c*) = element a | element b | element c

  quantifier(element a+)                         = +
  quantifier(element a | empty)                  = ?
  quantifier(element a?,element b?)              = *
  quantifier(element a | element b+, element c*) = +

Note that the last occurrence indicator should be '+', since the regular expression is such that there must be at least one element in the sequence (this element being an 'a' element or a 'b' element).

Note

Note that prime(Type) · quantifier(Type) is always a supertype of the original type Type, i.e., Type <: prime(Type) · quantifier(Type) always holds. Therefore, it is appropriate to use it as an approximation for the type of an expression. This property is required for the soundness of the static type analysis.

Semantics

Finally, a type Type and an occurrence indicator can be combined back together to yield a new type with the · operation, as follows.

Type · 1	=	Type
Type · ?	=	Type?
Type · +	=	Type+
Type · *	=	Type*

8.5 Judgments for function calls

Introduction

Function calls can perform type promotion between atomic types. This section introduces judgments which describe type promotion for the purpose of the dynamic and static semantics. These promotion rules include promoting xdt:untypedAtomic to any other type.

8.5.1 Type promotion

Notation

The judgment

statEnv |- Type₁ can be promoted to Type₂

holds if type Type₁ can be promoted to type Type₂.

Example

For example, the following judgments hold:

  xs:integer  can be promoted to  xs:integer
  xs:decimal  can be promoted to  xs:float
  xs:integer  can be promoted to  xs:float
  xs:float    can be promoted to  xs:double
  xdt:untypedAtomic     can be promoted to  xs:double

Semantics

This judgment is specified by the following rules.

xs:decimal can be promoted to xs:float:

statEnv |- xs:decimal can be promoted to xs:float

xs:float can be promoted to xs:double:

statEnv |- xs:float can be promoted to xs:double

xdt:untypedAtomic can be promoted to any type:

statEnv |- xdt:untypedAtomic can be promoted to Type

A type can be promoted to itself or to any type of which it is a subtype:

statEnv |- Type can be promoted to Type

statEnv |- Type <: Type₁

statEnv |- Type can be promoted to Type₁

Type promotion is transitive:

statEnv |- Type₁ can be promoted to Type₂

statEnv |- Type₂ can be promoted to Type₃

statEnv |- Type₁ can be promoted to Type₃

Finally, type promotion distributes over occurrence and union constructors.

statEnv |- prime(Type₁) can be promoted to prime(Type₂)

quantifier(Type₁) <= quantifier(Type₂)

statEnv |- Type₁ can be promoted to Type₂

statEnv |- prime(Type₁) can be promoted to prime(Type₁')

prime(Type₂) can be promoted to prime(Type₂')

statEnv |- (Type₁ | Type₂) can be promoted to (Type₁' | Type₂')

where the "<=" operator for occurrence indicators denotes set inclusion of the subsets of the allowed occurrences.
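For atomic types, the promotion judgment amounts to reachability in a small graph. The following Python sketch illustrates this under simplifying assumptions: only atomic type names are handled (occurrence indicators and unions are left out), subtyping is reduced to a toy derivation table BASE holding a single assumed edge, and the function and table names are hypothetical.

```python
# Assumed toy derivation table: xs:integer derives from xs:decimal.
BASE = {"xs:integer": "xs:decimal"}
# The two numeric promotion axioms given by the rules above.
PROMOTE = {"xs:decimal": "xs:float", "xs:float": "xs:double"}

def can_be_promoted(t1, t2):
    """True if atomic type t1 can be promoted to atomic type t2."""
    if t1 == "xdt:untypedAtomic" or t1 == t2:   # untypedAtomic promotes to any type
        return True
    seen, todo = set(), [t1]
    while todo:                                 # transitivity: follow edges until t2 is found
        t = todo.pop()
        if t == t2:
            return True
        if t in seen:
            continue
        seen.add(t)
        # A type promotes to its supertype (subtyping rule) and along the numeric axioms.
        todo.extend(n for n in (BASE.get(t), PROMOTE.get(t)) if n is not None)
    return False
```

For example, xs:integer can be promoted to xs:float because xs:integer is a subtype of xs:decimal and xs:decimal can be promoted to xs:float, whereas xs:double cannot be promoted to xs:float.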

Notation

The judgment

statEnv |- Value₁ against Type₂ promotes to Value₂

holds if value Value₁ can be promoted to the value Value₂ against the type Type₂.

Example

For example, the following judgments hold

  1     of type xs:integer  against  xs:integer  is promoted to  1     of type xs:integer
  1     of type xs:integer  against  xs:decimal  is promoted to  1     of type xs:integer
  1     of type xs:integer  against  xs:float    is promoted to  1.0e0 of type xs:float
  1.0e0 of type xs:float    against  xs:double   is promoted to  1.0e0 of type xs:double

Note that type promotion changes the value, and occurs only if the input value does not match the target type.

Semantics

This judgment is specified by the following rules.

If the value matches the target type, then it is promoted to itself.

statEnv |- Value matches Type

statEnv |- Value against Type promotes to Value

If the value does not match the target type, but matches a type which can be promoted to the target type, then the value is cast to the target type.

statEnv |- Value₁ matches Type₁

statEnv |- Type₁ can be promoted to Type₂

statEnv |- Type₁ != Type₂

Value₁ cast as Type₂ => Value₂

statEnv |- Value₁ against Type₂ promotes to Value₂
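The two rules can be sketched for numeric atomic values as follows. This Python fragment is only an illustration: a value is modelled as a hypothetical pair (payload, type name), matching is reduced to an assumed one-edge derivation table, and the cast is approximated with Python's float, which does not distinguish xs:float from xs:double precision.

```python
DERIVES = {"xs:integer": "xs:decimal"}                           # assumed derivation edge
PROMOTES = {"xs:decimal": "xs:float", "xs:float": "xs:double"}   # numeric promotion axioms

def reachable(ty, target, edges):
    """True if target can be reached from ty by following the given edge tables."""
    seen, todo = set(), [ty]
    while todo:
        t = todo.pop()
        if t == target:
            return True
        if t not in seen:
            seen.add(t)
            todo.extend(e[t] for e in edges if t in e)
    return False

def promote(value, target):
    """Promote an atomic value, given as (payload, type name), against a target type."""
    payload, ty = value
    if reachable(ty, target, [DERIVES]):             # the value matches the target type
        return value                                 # promoted to itself, unchanged
    if reachable(ty, target, [DERIVES, PROMOTES]):   # can be promoted, but does not match
        # In this toy model such a target is always xs:float or xs:double.
        return (float(payload), target)              # stands in for "cast as"
    raise TypeError(f"{ty} cannot be promoted to {target}")
```

As in the examples above, promoting the xs:integer value 1 against xs:decimal leaves it unchanged with its xs:integer annotation, while promoting it against xs:float produces a new value of type xs:float.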

8.6 Judgments for validation modes and contexts

8.6.1 Elements in validation mode

Notation

A validation mode may occur explicitly in a validate expression [4.13 Validate Expressions]. The following with mode judgment resolves an element name within a given validation mode to the type that the element name denotes. The judgment is used in the semantics of the validate expression and in sequence type.

The judgment

statEnv |- ElementNameOrWildcard with mode ValidationMode resolves to Type

holds when the possibly optional element name resolves to the given type in the given validation mode.

Semantics

We start with the rules for the global validation context.

If no element name is present, the global validation context resolves to the union of all element types that are globally declared.

statEnv |- ElementName₁ of elem/type expands to expanded-QName₁

...

statEnv |- ElementName_n of elem/type expands to expanded-QName_n

statEnv.elemDecl(expanded-QName₁) = define ElementType₁

...

statEnv.elemDecl(expanded-QName_n) = define ElementType_n

statEnv |- with mode ValidationMode resolves to (ElementType₁ | ... | ElementType_n)

If the element name is globally declared in the schema, it resolves to the element type of the corresponding global element declaration, independently of the validation mode.

statEnv |- ElementName of elem/type expands to expanded-QName

statEnv.elemDecl(expanded-QName) = define ElementType

statEnv |- ElementName with mode ValidationMode resolves to ElementType

If an element name is not globally defined and the validation mode is lax, then the element name resolves to the element type with the given element name with any content type.

statEnv |- ElementName of elem/type expands to expanded-QName

statEnv.elemDecl(expanded-QName) undefined

statEnv |- ElementName with mode lax resolves to element ElementName of type xs:anyType
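The three rules above can be sketched as a lookup. The following Python fragment is an illustration only: the dictionary elem_decl stands in for statEnv.elemDecl, element types are plain strings, the resulting union is returned as a list of alternatives, and the function name is hypothetical. Raising an error when no rule applies is an assumption of the sketch; the rules above simply do not derive a judgment in that case.

```python
def resolve(elem_decl, mode, name=None):
    """Resolve an optional element name in a validation mode to a list of
    element types, read as a union of the listed alternatives."""
    if name is None:
        # No element name: union of all globally declared element types.
        return list(elem_decl.values())
    if name in elem_decl:
        # Globally declared: the declared type, independently of the mode.
        return [elem_decl[name]]
    if mode == "lax":
        # Not declared, lax mode: element with that name and any content type.
        return [f"element {name} of type xs:anyType"]
    raise LookupError(f"no rule applies to element {name} in mode {mode}")

# Hypothetical global element declarations.
decls = {
    "purchaseOrder": "element purchaseOrder of type PurchaseOrderType",
    "comment": "element comment of type xsd:string",
}
```

For example, resolve(decls, "strict", "comment") yields the declared type of comment, while resolve(decls, "lax", "note") yields element note of type xs:anyType.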

A Normalized core grammar

This section contains the grammar of [XPath/XQuery] after it has been normalized, sometimes referred to as the "core" syntax.

A.1 Core BNF

The following grammar uses the same Basic EBNF notation as [XML], except that grammar symbols always have initial capital letters. The EBNF contains the lexemes embedded in the productions.

Named Terminals

[105 (Core)]	`IntegerLiteral`	::=	`Digits`
[106 (Core)]	`DecimalLiteral`	::=	`("." Digits) | (Digits "." [0-9]*)`
[107 (Core)]	`DoubleLiteral`	::=	`(("." Digits) | (Digits ("." [0-9]*)?)) [eE] [+-]? Digits`
[108 (Core)]	`StringLiteral`	::=	`('"' (EscapeQuot | [^"])* '"') | ("'" (EscapeApos | [^'])* "'")`
[109 (Core)]	`EscapeQuot`	::=	`'""'`
[110 (Core)]	`EscapeApos`	::=	`"''"`
[111 (Core)]	`ElementContentChar`	::=	`Char - [{}<&]`
[112 (Core)]	`QuotAttrContentChar`	::=	`Char - ["{}<&]`
[113 (Core)]	`AposAttrContentChar`	::=	`Char - ['{}<&]`
[114 (Core)]	`PITarget`	::=	`[http://www.w3.org/TR/REC-xml#NT-PITarget]^XML`
[115 (Core)]	`QName`	::=	`[http://www.w3.org/TR/REC-xml-names/#NT-QName]^Names`
[116 (Core)]	`NCName`	::=	`[http://www.w3.org/TR/REC-xml-names/#NT-NCName]^Names`
[117 (Core)]	`S`	::=	`[http://www.w3.org/TR/REC-xml#NT-S]^XML`
[118 (Core)]	`Char`	::=	`[http://www.w3.org/TR/REC-xml#NT-Char]^XML`

Non-Terminals

[1 (Core)]	`Module`	::=	`VersionDecl? (LibraryModule | MainModule)`
[2 (Core)]	`VersionDecl`	::=	`"xquery" "version" StringLiteral ("encoding" StringLiteral)? Separator`
[3 (Core)]	`MainModule`	::=	`Prolog QueryBody`
[4 (Core)]	`LibraryModule`	::=	`ModuleDecl Prolog`
[5 (Core)]	`ModuleDecl`	::=	`"module" "namespace" NCName "=" URILiteral Separator`
[6 (Core)]	`Prolog`	::=	`((DefaultNamespaceDecl \| Setter \| NamespaceDecl \| Import) Separator)* ((VarDecl \| FunctionDecl \| OptionDecl) Separator)*`
[7 (Core)]	`Setter`	::=	`DefaultCollationDecl \| BaseURIDecl \| ConstructionDecl \| OrderingModeDecl \| EmptyOrderDecl \| CopyNamespacesDecl`
[8 (Core)]	`Import`	::=	`SchemaImport \| ModuleImport`
[9 (Core)]	`Separator`	::=	`";"`
[10 (Core)]	`NamespaceDecl`	::=	`"declare" "namespace" NCName "=" URILiteral`
[11 (Core)]	`DefaultNamespaceDecl`	::=	`"declare" "default" ("element" \| "function") "namespace" URILiteral`
[12 (Core)]	`OptionDecl`	::=	`"declare" "option" QName StringLiteral`
[13 (Core)]	`OrderingModeDecl`	::=	`"declare" "ordering" ("ordered" \| "unordered")`
[14 (Core)]	`EmptyOrderDecl`	::=	`"declare" "default" "order" "empty" ("greatest" \| "least")`
[15 (Core)]	`CopyNamespacesDecl`	::=	`"declare" "copy-namespaces" PreserveMode "," InheritMode`
[16 (Core)]	`PreserveMode`	::=	`"preserve" \| "no-preserve"`
[17 (Core)]	`InheritMode`	::=	`"inherit" \| "no-inherit"`
[18 (Core)]	`DefaultCollationDecl`	::=	`"declare" "default" "collation" URILiteral`
[19 (Core)]	`BaseURIDecl`	::=	`"declare" "base-uri" URILiteral`
[20 (Core)]	`SchemaImport`	::=	`"import" "schema" SchemaPrefix? URILiteral ("at" URILiteral ("," URILiteral)*)?`
[21 (Core)]	`SchemaPrefix`	::=	`("namespace" NCName "=") \| ("default" "element" "namespace")`
[22 (Core)]	`ModuleImport`	::=	`"import" "module" ("namespace" NCName "=")? URILiteral ("at" URILiteral ("," URILiteral)*)?`
[23 (Core)]	`VarDecl`	::=	`"declare" "variable" "$" QName TypeDeclaration? ((":=" ExprSingle) \| "external")`
[24 (Core)]	`ConstructionDecl`	::=	`"declare" "construction" ("strip" \| "preserve")`
[25 (Core)]	`FunctionDecl`	::=	`"declare" "function" QName "(" ParamList? ")" ("as" SequenceType)? (EnclosedExpr \| "external")`
[26 (Core)]	`ParamList`	::=	`Param ("," Param)*`
[27 (Core)]	`Param`	::=	`"$" QName TypeDeclaration?`
[28 (Core)]	`EnclosedExpr`	::=	`"{" Expr "}"`
[29 (Core)]	`QueryBody`	::=	`Expr`
[30 (Core)]	`Expr`	::=	`ExprSingle ("," ExprSingle)*`
[31 (Core)]	`ExprSingle`	::=	`FLWORExpr \| TypeswitchExpr \| IfExpr \| OrExpr`
[32 (Core)]	`FLWORExpr`	::=	`(ForClause \| LetClause) "return" ExprSingle`
[33 (Core)]	`ForClause`	::=	`"for" "$" VarName TypeDeclaration? PositionalVar? "in" ExprSingle`
[34 (Core)]	`PositionalVar`	::=	`"at" "$" VarName`
[35 (Core)]	`LetClause`	::=	`"let" "$" VarName TypeDeclaration? ":=" ExprSingle`
[36 (Core)]	`OrderByClause`	::=	`(("order" "by") \| ("stable" "order" "by")) OrderSpecList`
[37 (Core)]	`OrderSpecList`	::=	`OrderSpec ("," OrderSpec)*`
[38 (Core)]	`OrderSpec`	::=	`ExprSingle OrderModifier`
[39 (Core)]	`OrderModifier`	::=	`("ascending" \| "descending")? ("empty" ("greatest" \| "least"))? ("collation" URILiteral)?`
[40 (Core)]	`QuantifiedExpr`	::=	`("some" \| "every") "$" VarName TypeDeclaration? "in" ExprSingle ("," "$" VarName TypeDeclaration? "in" ExprSingle)* "satisfies" ExprSingle`
[41 (Core)]	`TypeswitchExpr`	::=	`"typeswitch" "(" Expr ")" CaseClause+ "default" ("$" VarName)? "return" ExprSingle`
[42 (Core)]	`CaseClause`	::=	`"case" ("$" VarName "as")? SequenceType "return" ExprSingle`
[43 (Core)]	`IfExpr`	::=	`"if" "(" Expr ")" "then" ExprSingle "else" ExprSingle`
[44 (Core)]	`OrExpr`	::=	`AndExpr ( "or" AndExpr )*`
[45 (Core)]	`AndExpr`	::=	`CastableExpr ( "and" CastableExpr )*`
[46 (Core)]	`CastableExpr`	::=	`CastExpr ( "castable" "as" SingleType )?`
[47 (Core)]	`CastExpr`	::=	`ValueExpr ( "cast" "as" SingleType )?`
[48 (Core)]	`ValueExpr`	::=	`ValidateExpr \| StepExpr \| ExtensionExpr`
[49 (Core)]	`ValidateExpr`	::=	`"validate" ValidationMode? "{" Expr "}"`
[50 (Core)]	`ValidationMode`	::=	`"lax" \| "strict"`
[51 (Core)]	`ExtensionExpr`	::=	`Pragma+ "{" Expr? "}"`
[52 (Core)]	`Pragma`	::=	`"(#" S? QName PragmaContents "#)"`
[53 (Core)]	`PragmaContents`	::=	`(Char* - (Char* '#)' Char*))`
[54 (Core)]	`StepExpr`	::=	`PrimaryExpr \| AxisStep`
[55 (Core)]	`AxisStep`	::=	`ReverseStep \| ForwardStep`
[56 (Core)]	`ForwardStep`	::=	`ForwardAxis NodeTest`
[57 (Core)]	`ForwardAxis`	::=	`("child" "::") \| ("descendant" "::") \| ("attribute" "::") \| ("self" "::") \| ("descendant-or-self" "::") \| ("namespace" "::")`
[58 (Core)]	`ReverseStep`	::=	`ReverseAxis NodeTest`
[59 (Core)]	`ReverseAxis`	::=	`("parent" "::") \| ("ancestor" "::") \| ("ancestor-or-self" "::")`
[60 (Core)]	`NodeTest`	::=	`KindTest \| NameTest`
[61 (Core)]	`NameTest`	::=	`QName \| Wildcard`
[62 (Core)]	`Wildcard`	::=	`"*" \| (NCName ":" "*") \| ("*" ":" NCName)`
[63 (Core)]	`PrimaryExpr`	::=	`Literal \| VarRef \| ParenthesizedExpr \| FunctionCall \| Constructor`
[64 (Core)]	`Literal`	::=	`NumericLiteral \| StringLiteral`
[65 (Core)]	`NumericLiteral`	::=	`IntegerLiteral \| DecimalLiteral \| DoubleLiteral`
[66 (Core)]	`VarRef`	::=	`"$" VarName`
[67 (Core)]	`VarName`	::=	`QName`
[68 (Core)]	`ParenthesizedExpr`	::=	`"(" Expr? ")"`
[69 (Core)]	`OrderedExpr`	::=	`"ordered" "{" Expr "}"`
[70 (Core)]	`UnorderedExpr`	::=	`"unordered" "{" Expr "}"`
[71 (Core)]	`FunctionCall`	::=	`QName "(" (ExprSingle ("," ExprSingle)*)? ")"`
[72 (Core)]	`Constructor`	::=	`ComputedConstructor`
[73 (Core)]	`ComputedConstructor`	::=	`CompDocConstructor \| CompElemConstructor \| CompAttrConstructor \| CompTextConstructor \| CompCommentConstructor \| CompPIConstructor`
[74 (Core)]	`CompDocConstructor`	::=	`"document" "{" Expr "}"`
[75 (Core)]	`CompElemConstructor`	::=	`"element" (QName \| ("{" Expr "}")) "{" ContentExpr "}"`
[76 (Core)]	`ContentExpr`	::=	`Expr`
[77 (Core)]	`CompAttrConstructor`	::=	`"attribute" (QName \| ("{" Expr "}")) "{" Expr "}"`
[78 (Core)]	`CompTextConstructor`	::=	`"text" "{" Expr "}"`
[79 (Core)]	`CompCommentConstructor`	::=	`"comment" "{" Expr "}"`
[80 (Core)]	`CompPIConstructor`	::=	`"processing-instruction" (NCName \| ("{" Expr "}")) "{" Expr? "}"`
[81 (Core)]	`SingleType`	::=	`AtomicType "?"?`
[82 (Core)]	`TypeDeclaration`	::=	`"as" SequenceType`
[83 (Core)]	`SequenceType`	::=	`("empty-sequence" "(" ")") \| (ItemType OccurrenceIndicator?)`
[84 (Core)]	`OccurrenceIndicator`	::=	`"?" \| "*" \| "+"`
[85 (Core)]	`ItemType`	::=	`KindTest \| ("item" "(" ")") \| AtomicType`
[86 (Core)]	`AtomicType`	::=	`QName`
[87 (Core)]	`KindTest`	::=	`DocumentTest \| ElementTest \| AttributeTest \| SchemaElementTest \| SchemaAttributeTest \| PITest \| CommentTest \| TextTest \| AnyKindTest`
[88 (Core)]	`AnyKindTest`	::=	`"node" "(" ")"`
[89 (Core)]	`DocumentTest`	::=	`"document-node" "(" (ElementTest \| SchemaElementTest)? ")"`
[90 (Core)]	`TextTest`	::=	`"text" "(" ")"`
[91 (Core)]	`CommentTest`	::=	`"comment" "(" ")"`
[92 (Core)]	`PITest`	::=	`"processing-instruction" "(" (NCName \| StringLiteral)? ")"`
[93 (Core)]	`AttributeTest`	::=	`"attribute" "(" (AttribNameOrWildcard ("," TypeName)?)? ")"`
[94 (Core)]	`AttribNameOrWildcard`	::=	`AttributeName \| "*"`
[95 (Core)]	`SchemaAttributeTest`	::=	`"schema-attribute" "(" AttributeDeclaration ")"`
[96 (Core)]	`AttributeDeclaration`	::=	`AttributeName`
[97 (Core)]	`ElementTest`	::=	`"element" "(" (ElementNameOrWildcard ("," TypeName "?"?)?)? ")"`
[98 (Core)]	`ElementNameOrWildcard`	::=	`ElementName \| "*"`
[99 (Core)]	`SchemaElementTest`	::=	`"schema-element" "(" ElementDeclaration ")"`
[100 (Core)]	`ElementDeclaration`	::=	`ElementName`
[101 (Core)]	`AttributeName`	::=	`QName`
[102 (Core)]	`ElementName`	::=	`QName`
[103 (Core)]	`TypeName`	::=	`QName`
[104 (Core)]	`URILiteral`	::=	`StringLiteral`

B Functions and Operators

B.1 Functions and Operators used in the Formal Semantics

Here is the list of functions from the [Functions and Operators] document that are used in the [XPath/XQuery] Formal Semantics:

B.2 Mapping of Overloaded Internal Functions

This section gives the semantics specific to overloaded internal functions (with prefix fs:) that are used to define overloaded XQuery operators (with prefix op:), such as comparison expressions or arithmetic expressions. Static typing for those functions is defined over unions of (possibly optional) atomic types. The semantics is obtained in three steps. First, a rule is applied to deal with the union of those (possibly optional) atomic types. Second, a set of rules treats the cases where one of the operands of those functions is the empty type (resp. empty sequence) or optional. Finally, a rule deals with type promotion and access to an operators mapping table, which maps the overloaded internal functions to the appropriate operator functions defined in [Functions and Operators] and gives the corresponding type.

Notation

The following auxiliary grammar production describes optional atomic types.

OptAtomicType

[81 (Formal)] OptAtomicType ::= AtomicTypeName | (AtomicTypeName "?") | "empty"

Static Type Analysis

The following static typing rules apply generically to all the fs: special functions. They do not apply to any other function calls, which are treated in [4.1.5 Function Calls].

First, if the static type of one of the expressions passed as argument is a union of atomic types, the function call is type checked once separately for each atomic type in that union. The static type of the entire function call expression is then the union of the types computed in each case.

Type₁ = (OptAtomicType_{1,1}|...|OptAtomicType_{m,1})

...

Type_n = (OptAtomicType_{1,n}|...|OptAtomicType_{m,n})

statEnv |- expanded-QName(OptAtomicType_{1,1},..., OptAtomicType_{1,n}) : OptAtomicType₁'

...

statEnv |- expanded-QName(OptAtomicType_{m,1},..., OptAtomicType_{m,n}) : OptAtomicType_r'

statEnv |- expanded-QName(Type₁, ..., Type_n) : (OptAtomicType₁'|...|OptAtomicType_r')

Note

Note that this approach can be used since the type declared for a function parameter is never itself a union.
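The union rule can be pictured as a loop over the members of each argument's union. The sketch below is illustrative only and rests on several assumptions: types are modelled as plain strings, a union is a list of such strings, the combinations across arguments are taken as a Cartesian product (one reading of the rule), and `type_of_atomic_call` is a hypothetical callback standing for the remaining rules of this section.

```python
from itertools import product
from typing import Callable, List, Sequence, Tuple

# Assumption: an optional atomic type is a string such as
# "xs:integer", "xs:integer?" or "empty".
OptAtomicType = str


def type_of_call_over_unions(
    arg_types: Sequence[Sequence[OptAtomicType]],
    type_of_atomic_call: Callable[[Tuple[OptAtomicType, ...]], OptAtomicType],
) -> List[OptAtomicType]:
    """Type check the call once per combination of union members.

    The result is the union (here: a duplicate-free list) of the types
    computed for each combination.
    """
    result: List[OptAtomicType] = []
    for combination in product(*arg_types):
        computed = type_of_atomic_call(combination)
        if computed not in result:
            result.append(computed)
    return result
```

An argument whose static type is not a union is passed as a one-element list, so the same function covers both cases.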

The following rules deal with optional arguments. In the case of binary operators, if either one of the types of the operands is empty, the resulting type is empty.

statEnv |- Expr₁ : empty

statEnv |- Expr₂ : Type₂

statEnv |- expanded-QName(Expr₁,Expr₂) : empty

statEnv |- Expr₁ : Type₁

statEnv |- Expr₂ : empty

statEnv |- expanded-QName(Expr₁,Expr₂) : empty

If either one of the types of the operands is optional, the resulting type is obtained by propagating the optional occurrence indicator.

statEnv |- Expr₁ : AtomicType₁

statEnv |- Expr₂ : AtomicType₂?

statEnv |- expanded-QName(AtomicType₁,AtomicType₂) : AtomicType₃

statEnv |- expanded-QName(Expr₁,Expr₂) : AtomicType₃?

statEnv |- Expr₁ : AtomicType₁?

statEnv |- Expr₂ : AtomicType₂

statEnv |- expanded-QName(AtomicType₁,AtomicType₂) : AtomicType₃

statEnv |- expanded-QName(Expr₁,Expr₂) : AtomicType₃?

statEnv |- Expr₁ : AtomicType₁?

statEnv |- Expr₂ : AtomicType₂?

statEnv |- expanded-QName(AtomicType₁,AtomicType₂) : AtomicType₃

statEnv |- expanded-QName(Expr₁,Expr₂) : AtomicType₃?
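The rules for empty and optional operands of a binary operator can be summarized in a few lines. This is a sketch under stated assumptions, not a definition: types are modelled as strings, an optional type is marked by a trailing "?", and `atomic_result` is a hypothetical callback that gives the result type for two non-optional atomic types.

```python
from typing import Callable, Tuple


def split_optional(type_name: str) -> Tuple[str, bool]:
    """Split "T?" into ("T", True); leave "T" as ("T", False)."""
    if type_name.endswith("?"):
        return type_name[:-1], True
    return type_name, False


def type_of_binary_call(
    type1: str,
    type2: str,
    atomic_result: Callable[[str, str], str],
) -> str:
    """Apply the empty and optional rules for a binary fs: function."""
    # If either operand has the empty type, the result type is empty.
    if type1 == "empty" or type2 == "empty":
        return "empty"
    base1, optional1 = split_optional(type1)
    base2, optional2 = split_optional(type2)
    result = atomic_result(base1, base2)
    # An optional operand propagates "?" to the result type.
    return result + "?" if (optional1 or optional2) else result
```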

In the case of unary operators, if the type of the operand is empty, the resulting type is empty.

statEnv |- Expr₁ : empty

statEnv |- expanded-QName(Expr₁) : empty

Finally, the resulting type is obtained by performing type promotion and accessing the operators mapping table (using the operator type for judgment defined below).

statEnv |- AtomicType₁ can be promoted to AtomicType₁'

statEnv |- AtomicType₂ can be promoted to AtomicType₂'

statEnv |- operator type for AtomicType₁' and AtomicType₂' is AtomicType₃

statEnv |- expanded-QName(AtomicType₁,AtomicType₂) : AtomicType₃

statEnv |- AtomicType₁ can be promoted to AtomicType₁'

statEnv |- operator type for AtomicType₁' is AtomicType₃

statEnv |- expanded-QName(AtomicType₁) : AtomicType₃
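The final step, promotion followed by a lookup in the operators mapping table, might be sketched as follows. The sketch makes simplifying assumptions that are not part of this specification: numeric promotion is modelled as a linear chain, both operands are promoted to the larger of their two types, and the table holds only a few fs:plus rows copied from the binary operators table.

```python
from typing import Dict, Tuple

# Assumption: numeric types ordered so that each can be promoted
# to any type that follows it.
NUMERIC_CHAIN = ["xs:integer", "xs:decimal", "xs:float", "xs:double"]

# Excerpt of the table: (function, type1, type2) -> result type.
OPERATOR_TYPE: Dict[Tuple[str, str, str], str] = {
    ("fs:plus", "xs:integer", "xs:integer"): "xs:integer",
    ("fs:plus", "xs:decimal", "xs:decimal"): "xs:decimal",
    ("fs:plus", "xs:float", "xs:float"): "xs:float",
    ("fs:plus", "xs:double", "xs:double"): "xs:double",
    ("fs:plus", "xs:date", "xdt:dayTimeDuration"): "xs:date",
}


def promote_pair(type1: str, type2: str) -> Tuple[str, str]:
    """Promote two numeric types to the larger one; leave others as is."""
    if type1 in NUMERIC_CHAIN and type2 in NUMERIC_CHAIN:
        target = max(type1, type2, key=NUMERIC_CHAIN.index)
        return target, target
    return type1, type2


def operator_type(function: str, type1: str, type2: str) -> str:
    """Look up the result type after promoting the operand types."""
    promoted1, promoted2 = promote_pair(type1, type2)
    try:
        return OPERATOR_TYPE[(function, promoted1, promoted2)]
    except KeyError:
        raise TypeError(
            f"no entry for {function}({promoted1}, {promoted2})"
        ) from None
```

A missing table entry is reported here as a Python `TypeError`, which stands in for a static type error.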

Dynamic Evaluation

Each fs: overloaded operator maps to the corresponding equivalent overloaded op: operator, as defined in [Functions and Operators], and deals with the case where one of the operands is the empty sequence.

The dynamic semantics of the fs: operator is similar to using the following user-defined function.

declare function fs:opname($x1 as xdt:anyAtomicType?, $x2 as xdt:anyAtomicType?) as xdt:anyAtomicType? {

if (fn:empty($x1) or fn:empty($x2)) then () else [fs:opname($x1,$x2)]_OverloadedOp

};

Where [fs:opname()]_OverloadedOp maps to the corresponding op: operator in [Functions and Operators], as defined in the table below.
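The same behaviour can be modelled outside XQuery. In the following sketch, which is an illustration and not a normative definition, the empty sequence is represented by Python's `None`, and `make_fs_operator` and `op_numeric_add` are hypothetical names standing for the wrapper and for one op: function selected by the mapping.

```python
from typing import Callable, Optional


def make_fs_operator(
    overloaded_op: Callable[[object, object], object],
) -> Callable[[Optional[object], Optional[object]], Optional[object]]:
    """Wrap an op: function so that an empty operand yields empty."""

    def fs_operator(x1: Optional[object], x2: Optional[object]) -> Optional[object]:
        # Assumption: None models the empty sequence ().
        if x1 is None or x2 is None:
            return None
        return overloaded_op(x1, x2)

    return fs_operator


def op_numeric_add(a, b):
    """Stand-in for op:numeric-add on Python numbers."""
    return a + b


fs_plus = make_fs_operator(op_numeric_add)
```

With these definitions, `fs_plus(1, 2)` evaluates to 3, while `fs_plus(None, 2)` evaluates to `None`, mirroring the empty-sequence case of the function above.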

Notation

The operators mapping table is given below. The table is used to define the following auxiliary mapping rules and judgments.

The mapping rule for binary and unary operators

[fs:opname1(Expr₁,Expr₂)]_OverloadedOp == op:opname2(Expr₁,Expr₂)

and

[fs:opname1(Expr₁)]_OverloadedOp == op:opname2(Expr₁)

where the operator depends on the type of each value returned by Expr₁ and Expr₂.

The judgments for binary and unary operators

operator type for AtomicType₁ and AtomicType₂ is AtomicType₃

and

operator type for AtomicType₁ is AtomicType₃

hold when the operator table indicates the output type AtomicType₃ for the input types AtomicType₁ and AtomicType₂.

Note that in the following table, all numeric functions are applied to operands with the same type. Values are promoted to compatible types using the function call semantics given in [4.1.5 Function Calls].

Gregorian refers to the types xs:gYearMonth, xs:gYear, xs:gMonthDay, xs:gDay, and xs:gMonth. For binary operators that accept two Gregorian-type operands, both operands must have the same type (for example, if one operand is of type xs:gDay, the other operand must be of type xs:gDay.)

Binary Operators
Internal Function	AtomicType₁	AtomicType₂	Denotes	AtomicType₃
fs:`plus`(A, B)	`xs:integer`	`xs:integer`	op:numeric-add(A, B)	`xs:integer`
fs:`plus`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-add(A, B)	`xs:decimal`
fs:`plus`(A, B)	`xs:float`	`xs:float`	op:numeric-add(A, B)	`xs:float`
fs:`plus`(A, B)	`xs:double`	`xs:double`	op:numeric-add(A, B)	`xs:double`
fs:`plus`(A, B)	`xs:date`	`xdt:yearMonthDuration`	op:add-yearMonthDuration-to-date(A, B)	`xs:date`
fs:`plus`(A, B)	`xdt:yearMonthDuration`	`xs:date`	op:add-yearMonthDuration-to-date(B, A)	`xs:date`
fs:`plus`(A, B)	`xs:date`	`xdt:dayTimeDuration`	op:add-dayTimeDuration-to-date(A, B)	`xs:date`
fs:`plus`(A, B)	`xdt:dayTimeDuration`	`xs:date`	op:add-dayTimeDuration-to-date(B, A)	`xs:date`
fs:`plus`(A, B)	`xs:time`	`xdt:dayTimeDuration`	op:add-dayTimeDuration-to-time(A, B)	`xs:time`
fs:`plus`(A, B)	`xdt:dayTimeDuration`	`xs:time`	op:add-dayTimeDuration-to-time(B, A)	`xs:time`
fs:`plus`(A, B)	`xs:dateTime`	`xdt:yearMonthDuration`	op:add-yearMonthDuration-to-dateTime(A, B)	`xs:dateTime`
fs:`plus`(A, B)	`xdt:yearMonthDuration`	`xs:dateTime`	op:add-yearMonthDuration-to-dateTime(B, A)	`xs:dateTime`
fs:`plus`(A, B)	`xs:dateTime`	`xdt:dayTimeDuration`	op:add-dayTimeDuration-to-dateTime(A, B)	`xs:dateTime`
fs:`plus`(A, B)	`xdt:dayTimeDuration`	`xs:dateTime`	op:add-dayTimeDuration-to-dateTime(B, A)	`xs:dateTime`
fs:`plus`(A, B)	`xdt:yearMonthDuration`	`xdt:yearMonthDuration`	op:add-yearMonthDurations(A, B)	`xdt:yearMonthDuration`
fs:`plus`(A, B)	`xdt:dayTimeDuration`	`xdt:dayTimeDuration`	op:add-dayTimeDurations(A, B)	`xdt:dayTimeDuration`
fs:`minus`(A, B)	`xs:integer`	`xs:integer`	op:numeric-subtract(A, B)	`xs:integer`
fs:`minus`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-subtract(A, B)	`xs:decimal`
fs:`minus`(A, B)	`xs:float`	`xs:float`	op:numeric-subtract(A, B)	`xs:float`
fs:`minus`(A, B)	`xs:double`	`xs:double`	op:numeric-subtract(A, B)	`xs:double`
fs:`minus`(A, B)	`xs:date`	`xs:date`	fn:subtract-dates(A, B)	`xdt:dayTimeDuration`
fs:`minus`(A, B)	`xs:date`	`xdt:yearMonthDuration`	op:subtract-yearMonthDuration-from-date(A, B)	`xs:date`
fs:`minus`(A, B)	`xs:date`	`xdt:dayTimeDuration`	op:subtract-dayTimeDuration-from-date(A, B)	`xs:date`
fs:`minus`(A, B)	`xs:time`	`xs:time`	fn:subtract-times(A, B)	`xdt:dayTimeDuration`
fs:`minus`(A, B)	`xs:time`	`xdt:dayTimeDuration`	op:subtract-dayTimeDuration-from-time(A, B)	`xs:time`
fs:`minus`(A, B)	`xs:dateTime`	`xs:dateTime`	fn:get-dayTimeDuration-from-dateTimes(A, B)	`xdt:dayTimeDuration`
fs:`minus`(A, B)	`xs:dateTime`	`xdt:yearMonthDuration`	op:subtract-yearMonthDuration-from-dateTime(A, B)	`xs:dateTime`
fs:`minus`(A, B)	`xs:dateTime`	`xdt:dayTimeDuration`	op:subtract-dayTimeDuration-from-dateTime(A, B)	`xs:dateTime`
fs:`minus`(A, B)	`xdt:yearMonthDuration`	`xdt:yearMonthDuration`	op:subtract-yearMonthDurations(A, B)	`xdt:yearMonthDuration`
fs:`minus`(A, B)	`xdt:dayTimeDuration`	`xdt:dayTimeDuration`	op:subtract-dayTimeDurations(A, B)	`xdt:dayTimeDuration`
fs:`times`(A, B)	`xs:integer`	`xs:integer`	op:numeric-multiply(A, B)	`xs:integer`
fs:`times`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-multiply(A, B)	`xs:decimal`
fs:`times`(A, B)	`xs:float`	`xs:float`	op:numeric-multiply(A, B)	`xs:float`
fs:`times`(A, B)	`xs:double`	`xs:double`	op:numeric-multiply(A, B)	`xs:double`
fs:`times`(A, B)	`xdt:yearMonthDuration`	`xs:double`	op:multiply-yearMonthDuration(A, B)	`xdt:yearMonthDuration`
fs:`times`(A, B)	`xs:double`	`xdt:yearMonthDuration`	op:multiply-yearMonthDuration(B, A)	`xdt:yearMonthDuration`
fs:`times`(A, B)	`xdt:dayTimeDuration`	`xs:double`	op:multiply-dayTimeDuration(A, B)	`xdt:dayTimeDuration`
fs:`times`(A, B)	`xs:double`	`xdt:dayTimeDuration`	op:multiply-dayTimeDuration(B, A)	`xdt:dayTimeDuration`
fs:`idiv`(A, B)	`xs:integer`	`xs:integer`	op:integer-div(A, B)	`xs:integer`
fs:`div`(A, B)	`xs:integer`	`xs:integer`	op:numeric-divide(A, B)	`xs:decimal`
fs:`div`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-divide(A, B)	`xs:decimal`
fs:`div`(A, B)	`xs:float`	`xs:float`	op:numeric-divide(A, B)	`xs:float`
fs:`div`(A, B)	`xs:double`	`xs:double`	op:numeric-divide(A, B)	`xs:double`
fs:`div`(A, B)	`xdt:yearMonthDuration`	`xs:double`	op:divide-yearMonthDuration(A, B)	`xdt:yearMonthDuration`
fs:`div`(A, B)	`xdt:dayTimeDuration`	`xs:double`	op:divide-dayTimeDuration(A, B)	`xdt:dayTimeDuration`
fs:`div`(A, B)	`xdt:yearMonthDuration`	`xdt:yearMonthDuration`	op:divide-yearMonthDuration-by-yearMonthDuration(A, B)	`xs:decimal`
fs:`div`(A, B)	`xdt:dayTimeDuration`	`xdt:dayTimeDuration`	op:divide-dayTimeDuration-by-dayTimeDuration(A, B)	`xs:decimal`
fs:`mod`(A, B)	`xs:integer`	`xs:integer`	op:numeric-mod(A, B)	`xs:integer`
fs:`mod`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-mod(A, B)	`xs:decimal`
fs:`mod`(A, B)	`xs:float`	`xs:float`	op:numeric-mod(A, B)	`xs:float`
fs:`mod`(A, B)	`xs:double`	`xs:double`	op:numeric-mod(A, B)	`xs:double`
fs:`eq`(A, B)	`xs:integer`	`xs:integer`	op:numeric-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:float`	`xs:float`	op:numeric-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:double`	`xs:double`	op:numeric-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:boolean`	`xs:boolean`	op:boolean-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:string`	`xs:string`	op:numeric-equal(fn:compare(A, B), 0)	`xs:boolean`
fs:`eq`(A, B)	`xs:date`	`xs:date`	op:date-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:time`	`xs:time`	op:time-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:dateTime`	`xs:dateTime`	op:datetime-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xdt:yearMonthDuration`	`xdt:yearMonthDuration`	op:yearMonthDuration-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xdt:dayTimeDuration`	`xdt:dayTimeDuration`	op:dayTimeDuration-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	Gregorian	Gregorian	op:gYear-equal(A, B) etc.	`xs:boolean`
fs:`eq`(A, B)	`xs:hexBinary`	`xs:hexBinary`	op:hex-binary-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:base64Binary`	`xs:base64Binary`	op:base64-binary-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:anyURI`	`xs:anyURI`	op:anyURI-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:QName`	`xs:QName`	op:QName-equal(A, B)	`xs:boolean`
fs:`eq`(A, B)	`xs:NOTATION`	`xs:NOTATION`	op:NOTATION-equal(A, B)	`xs:boolean`
fs:`ne`(A, B)	`xs:integer`	`xs:integer`	`fn:not`(op:numeric-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:decimal`	`xs:decimal`	`fn:not`(op:numeric-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:float`	`xs:float`	`fn:not`(op:numeric-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:double`	`xs:double`	`fn:not`(op:numeric-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:boolean`	`xs:boolean`	`fn:not`(op:boolean-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:string`	`xs:string`	`fn:not`(op:numeric-equal(fn:compare(A, B), 0))	`xs:boolean`
fs:`ne`(A, B)	`xs:date`	`xs:date`	`fn:not`(op:date-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:time`	`xs:time`	`fn:not`(op:time-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:dateTime`	`xs:dateTime`	`fn:not`(op:datetime-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xdt:yearMonthDuration`	`xdt:yearMonthDuration`	`fn:not`(op:yearMonthDuration-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xdt:dayTimeDuration`	`xdt:dayTimeDuration`	`fn:not`(op:dayTimeDuration-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	Gregorian	Gregorian	`fn:not`(op:gYear-equal(A, B)) etc.	`xs:boolean`
fs:`ne`(A, B)	`xs:hexBinary`	`xs:hexBinary`	`fn:not`(op:hex-binary-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:base64Binary`	`xs:base64Binary`	`fn:not`(op:base64-binary-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:anyURI`	`xs:anyURI`	`fn:not`(op:anyURI-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:QName`	`xs:QName`	`fn:not`(op:QName-equal(A, B))	`xs:boolean`
fs:`ne`(A, B)	`xs:NOTATION`	`xs:NOTATION`	`fn:not`(op:NOTATION-equal(A, B))	`xs:boolean`
fs:`gt`(A, B)	`xs:integer`	`xs:integer`	op:numeric-greater-than(A, B)	`xs:boolean`
fs:`gt`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-greater-than(A, B)	`xs:boolean`
fs:`gt`(A, B)	`xs:float`	`xs:float`	op:numeric-greater-than(A, B)	`xs:boolean`
fs:`gt`(A, B)	`xs:double`	`xs:double`	op:numeric-greater-than(A, B)	`xs:boolean`
fs:`gt`(A, B)	`xs:boolean`	`xs:boolean`	op:boolean-greater-than(A, B)	`xs:boolean`
fs:`gt`(A, B)	`xs:string`	`xs:string`	op:numeric-greater-than(`fn:compare`(A, B), 0)	`xs:boolean`
fs:`gt`(A, B)	`xs:date`	`xs:date`	op:date-greater-than(A, B)	`xs:boolean`
fs:`gt`(A, B)	`xs:time`	`xs:time`	op:time-greater-than(A, B)	`xs:boolean`
fs:`gt`(A, B)	`xs:dateTime`	`xs:dateTime`	op:datetime-greater-than(A, B)	`xs:boolean`
fs:`gt`(A, B)	`xdt:yearMonthDuration`	`xdt:yearMonthDuration`	op:yearMonthDuration-greater-than(A, B)	`xs:boolean`
fs:`gt`(A, B)	`xdt:dayTimeDuration`	`xdt:dayTimeDuration`	op:dayTimeDuration-greater-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xs:integer`	`xs:integer`	op:numeric-less-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-less-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xs:float`	`xs:float`	op:numeric-less-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xs:double`	`xs:double`	op:numeric-less-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xs:boolean`	`xs:boolean`	op:boolean-less-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xs:string`	`xs:string`	op:numeric-less-than(`fn:compare`(A, B), 0)	`xs:boolean`
fs:`lt`(A, B)	`xs:date`	`xs:date`	op:date-less-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xs:time`	`xs:time`	op:time-less-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xs:dateTime`	`xs:dateTime`	op:datetime-less-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xdt:yearMonthDuration`	`xdt:yearMonthDuration`	op:yearMonthDuration-less-than(A, B)	`xs:boolean`
fs:`lt`(A, B)	`xdt:dayTimeDuration`	`xdt:dayTimeDuration`	op:dayTimeDuration-less-than(A, B)	`xs:boolean`
fs:`ge`(A, B)	`xs:integer`	`xs:integer`	op:numeric-greater-than(A, B) or op:numeric-equal(A,B)	`xs:boolean`
fs:`ge`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-greater-than(A, B) or op:numeric-equal(A,B)	`xs:boolean`
fs:`ge`(A, B)	`xs:float`	`xs:float`	op:numeric-greater-than(A, B) or op:numeric-equal(A,B)	`xs:boolean`
fs:`ge`(A, B)	`xs:double`	`xs:double`	op:numeric-greater-than(A, B) or op:numeric-equal(A,B)	`xs:boolean`
fs:`ge`(A, B)	`xs:boolean`	`xs:boolean`	`fn:not`(op:boolean-less-than(A, B))	`xs:boolean`
fs:`ge`(A, B)	`xs:string`	`xs:string`	op:numeric-greater-than(`fn:compare`(A, B), -1)	`xs:boolean`
fs:`ge`(A, B)	`xs:date`	`xs:date`	`fn:not`(op:date-less-than(A, B))	`xs:boolean`
fs:`ge`(A, B)	`xs:time`	`xs:time`	`fn:not`(op:time-less-than(A, B))	`xs:boolean`
fs:`ge`(A, B)	`xs:dateTime`	`xs:dateTime`	`fn:not`(op:datetime-less-than(A, B))	`xs:boolean`
fs:`ge`(A, B)	`xdt:yearMonthDuration`	`xdt:yearMonthDuration`	`fn:not`(op:yearMonthDuration-less-than(A, B))	`xs:boolean`
fs:`ge`(A, B)	`xdt:dayTimeDuration`	`xdt:dayTimeDuration`	`fn:not`(op:dayTimeDuration-less-than(A, B))	`xs:boolean`
fs:`le`(A, B)	`xs:integer`	`xs:integer`	op:numeric-less-than(A, B) or op:numeric-equal(A,B)	`xs:boolean`
fs:`le`(A, B)	`xs:decimal`	`xs:decimal`	op:numeric-less-than(A, B) or op:numeric-equal(A,B)	`xs:boolean`
fs:`le`(A, B)	`xs:float`	`xs:float`	op:numeric-less-than(A, B) or op:numeric-equal(A,B)	`xs:boolean`
fs:`le`(A, B)	`xs:double`	`xs:double`	op:numeric-less-than(A, B) or op:numeric-equal(A,B)	`xs:boolean`
fs:`le`(A, B)	`xs:boolean`	`xs:boolean`	`fn:not`(op:boolean-greater-than(A, B))	`xs:boolean`
fs:`le`(A, B)	`xs:string`	`xs:string`	op:numeric-less-than(`fn:compare`(A, B), 1)	`xs:boolean`
fs:`le`(A, B)	`xs:date`	`xs:date`	`fn:not`(op:date-greater-than(A, B))	`xs:boolean`
fs:`le`(A, B)	`xs:time`	`xs:time`	`fn:not`(op:time-greater-than(A, B))	`xs:boolean`
fs:`le`(A, B)	`xs:dateTime`	`xs:dateTime`	`fn:not`(op:datetime-greater-than(A, B))	`xs:boolean`
fs:`le`(A, B)	`xdt:yearMonthDuration`	`xdt:yearMonthDuration`	`fn:not`(op:yearMonthDuration-greater-than(A, B))	`xs:boolean`
fs:`le`(A, B)	`xdt:dayTimeDuration`	`xdt:dayTimeDuration`	`fn:not`(op:dayTimeDuration-greater-than(A, B))	`xs:boolean`
fs:`is-same-node`(A, B)	node()	node()	`op:is-same-node`	`xs:boolean`
fs:`node-before`(A, B)	node()	node()	`op:node-before`	`xs:boolean`
fs:`node-after`(A, B)	node()	node()	`op:node-after`	`xs:boolean`

Unary Operators
Internal Function	AtomicType₁	Denotes	AtomicType₃
fs:`unary-plus`(A)	`xs:integer`	op:numeric-unary-plus(A)	`xs:integer`
fs:`unary-plus`(A)	`xs:decimal`	op:numeric-unary-plus(A)	`xs:decimal`
fs:`unary-plus`(A)	`xs:float`	op:numeric-unary-plus(A)	`xs:float`
fs:`unary-plus`(A)	`xs:double`	op:numeric-unary-plus(A)	`xs:double`
fs:`unary-minus`(A)	`xs:integer`	op:numeric-unary-minus(A)	`xs:integer`
fs:`unary-minus`(A)	`xs:decimal`	op:numeric-unary-minus(A)	`xs:decimal`
fs:`unary-minus`(A)	`xs:float`	op:numeric-unary-minus(A)	`xs:float`
fs:`unary-minus`(A)	`xs:double`	op:numeric-unary-minus(A)	`xs:double`

C Importing Schemas

This section describes how XML Schema declarations, as specified by XML Schema, are imported into the [XPath/XQuery] type system.

C.1 Introduction

At compile time, the [XPath/XQuery] environment imports XML Schema declarations and loads them as declarations in the [XPath/XQuery] type system. The semantics of that loading process is defined by normalization rules that map XML Schema descriptions into the [XPath/XQuery] type system.

C.1.1 Features

The following summarizes the XML Schema features that are covered by the formal semantics and handled by the import mapping described in this section. For each feature, the following indications are used.

Handled indicates features that are relevant for [XPath/XQuery], are modeled in the [XPath/XQuery] type system, and are supported by the mapping.
Not in v1.0 indicates features that are relevant to [XPath/XQuery], but are not yet modeled in the [XPath/XQuery] type system or are not handled by the mapping in XQuery V1.0. If the [XPath/XQuery] type system provides appropriate support for those features but the mapping is incomplete, the additional annotation "mapping only" is used.
Not handled indicates features that are relevant for [XPath/XQuery], but are not modeled in the [XPath/XQuery] type system, and are not handled by the mapping. Such features are typically only related to validation, for which the formal semantics defines a partial model.
Ignored indicates features that are not relevant for [XPath/XQuery], are not modeled in the [XPath/XQuery] type system, and are not relevant for the mapping. Such features might have to do with documentation of the schema, or might affect which Schemas are legal, but do not affect which documents match which Schemas.

Here is the exhaustive list of XML Schema features and their status in this document.

Feature:	Supported
Primitive Simple types	Handled
Simple type derivation by restriction	Handled
Derivation by list and union	Handled
Facets on simple types	Not handled
ID and IDREF constraints	Ignored
Attribute Declarations
default,fixed,use	Not in v1.0
Element Declarations
default, fixed (value constraint)	Not in v1.0
nillable	Handled
substitution group affiliation	Handled
substitution group exclusions	Ignored
disallowed substitutions	Ignored
abstract	Not in v1.0
Complex Type Definitions
derivation by restriction	Handled
derivation by extension	Handled
final	Ignored
abstract	Not in v1.0
AttributeUses
required	Not in v1.0, mapping only
default, fixed (value constraint)	Not in v1.0
Attribute Group Definitions	Not in v1.0, mapping only
Model Group Definitions	Not in v1.0, mapping only
Model Groups	Handled
Particles	Handled
Wildcards
process contents strict, skip, lax	Ignored
namespace wildcards	Ignored
Identity-constraint Definitions	Ignored
Notation Declarations	Ignored
Annotations	Ignored

Note that the schema import feature specified here assumes it is given a legal schema as input. As a result, it is not necessary to check for 'block' or 'abstract' attributes.

C.1.2 Organization

The presentation of the schema mapping is done according to the following organization.

Schema component

First, each schema component is summarized using the same notation used in the XML Representation Summary sections in XML Schema. For instance, here is the XML Representation Summary for complex types.

<complexType

[ ignored ] abstract = boolean : false

[ ignored ] block = (#all | List of (extension | restriction))

[ ignored ] final = (#all | List of (extension | restriction))

[ ignored ] id = ID

mixed = boolean : false

name = NCName

[ ignored ] {any schemaAttributes with non-schema namespace ...} >

</complexType>

Attributes indicated as [ ignored ] are not mapped into the [XPath/XQuery] type system.

Attributes indicated as [ not handled ] are not currently handled by the mapping.

Note that in order to simplify the mapping, it is assumed that the default values for all attributes in the XML Representation of Schema are filled in. For instance in the above complex type, if the mixed attribute is not present, it will be treated as being present and having the value "false".
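The default-filling assumption can be pictured as a small preprocessing step. The sketch below is illustrative only: attributes are modelled as a Python dictionary, the function name `fill_defaults` is hypothetical, and the defaults shown are the two given in the representation summary above (abstract and mixed, both "false").

```python
from typing import Dict

# Defaults taken from the complexType summary above.
COMPLEX_TYPE_DEFAULTS: Dict[str, str] = {
    "abstract": "false",
    "mixed": "false",
}


def fill_defaults(
    attributes: Dict[str, str],
    defaults: Dict[str, str] = COMPLEX_TYPE_DEFAULTS,
) -> Dict[str, str]:
    """Return a copy of `attributes` with absent defaults filled in."""
    filled = dict(defaults)    # start from the default values
    filled.update(attributes)  # explicit attributes take precedence
    return filled
```

After this step, a mapping rule can read the mixed attribute unconditionally, without a separate case for its absence.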

Schema mapping

XML Schema import is specified by means of mapping rules. All mapping rules have the structure below.

[SchemaComponent]_Subscript

TypeComponent

The SchemaComponent above the horizontal rule denotes an XML Schema component before translation and the TypeComponent beneath the horizontal rule denotes an equivalent type component in the [XPath/XQuery] type system.

Notation

Whenever necessary for the mapping rules, specific grammar productions which describe fragments of XML Schema may be introduced. For instance, here are grammar productions used to describe fragments of the XML Representation Summary for the complexType Element Information Item.

Complex type content

[62 (Formal)]	`ComplexTypeContent`	::=	`"annotation"? ("simpleContent" \| "complexContent" \| (ChildrenContent AttributeContent))`
[65 (Formal)]	`AttributeContent`	::=	`("attribute" \| "attributeGroup")* "anyAttribute"?`
[63 (Formal)]	`ChildrenContent`	::=	`("group" \| "all" \| "choice" \| "sequence")?`

As in the rest of this document, some mapping rules may use fragments of the XML Representation corresponding to the syntactic categories defined by those grammar productions. For instance, the following complex type fragment uses the syntactic categories TypeName, ComplexTypeContent, AttributeContent, ChildrenContent, and MixedAttribute.

<complexType

name = TypeName

MixedAttribute >

ChildrenContent AttributeContent

</complexType>

C.1.3 Main mapping rules

Notation

The normalization rule

[Schema]_Schema

Definitions

maps a complete schema into a set of Definitions in the [XPath/XQuery] type system.

The normalization rule

[SchemaComponent]_{definition(targetNCName)}

Definition

maps a top level schema component into a Definition in the [XPath/XQuery] type system, given the target namespace targetURI.

The normalization rule

[SchemaComponent]_{content(targetNCName)}

TypeComponent

maps a schema component not directly under the schema element, into a TypeComponent in the [XPath/XQuery] type system, given the target namespace targetURI.

C.1.4 Special attributes

The XML Schema attributes: use, default, fixed, minOccurs, maxOccurs, mixed, nillable, and substitutionGroup, require specific mapping rules.

C.1.4.1 use, default, and fixed

The "use", "default", and "fixed" attributes are used to describe the occurrence and default behavior of a given attribute.

Notation

The following auxiliary grammar productions are used to describe the "use", "default", and "fixed" attributes.

Use, default, and fixed attributes

[67 (Formal)]	`UseAttribute`	::=	`"use" "=" ("optional" \| "prohibited" \| "required")`
[68 (Formal)]	`DefaultAttribute`	::=	`"default" "=" String`
[69 (Formal)]	`FixedAttribute`	::=	`"fixed" "=" String`

The normalization rule

[UseAttribute DefaultAttribute? FixedAttribute? ]_use

Occurrence

maps a combination of a use attribute UseAttribute, along with an optional default or fixed attribute in Schema into the occurrence indicator Occurrence in the [XPath/XQuery] type system.

Schema mapping

Use attributes are mapped to the type system in the following way. In case there is a default or fixed attribute, the attribute is always present in the PSVI and the use attribute is ignored.

UseAttribute DefaultAttribute_use

UseAttribute FixedAttribute_use

use = "optional"_use

use = "required"_use

Editorial note
Issue: how derivation of attribute declaration and the "prohibited" use attributes are mapped in the [XPath/XQuery] type system is still an open issue.

C.1.4.2 minOccurs, maxOccurs, minLength, maxLength, and length

Notation

The following auxiliary grammar productions are used to describe occurrence attributes and the length facets.

Occurrence attributes

[61 (Formal)]	`OccursAttributes`	::=	`maxOccurs \| minOccurs \| maxLength \| minLength \| length`
[59 (Formal)]	`maxOccurs`	::=	`"maxOccurs" "=" ("nonNegativeInteger" \| "unbounded")`
[60 (Formal)]	`minOccurs`	::=	`"minOccurs" "=" "nonNegativeInteger"`
[56 (Formal)]	`maxLength`	::=	`"maxLength" "=" "nonNegativeInteger"`
[57 (Formal)]	`minLength`	::=	`"minLength" "=" "nonNegativeInteger"`
[58 (Formal)]	`length`	::=	`"length" "=" "nonNegativeInteger"`

The normalization rule

[OccursAttributes]_occurs

Occurrence

maps the occurrence attributes and facets OccursAttributes in Schema into the occurrence indicator Occurrence in the [XPath/XQuery] type system.

Schema mapping

Occurrence attributes are mapped to the type system in the following way.

[minOccurs="0" maxOccurs="1"]_occurs

[minOccurs="1" maxOccurs="1"]_occurs

[minOccurs="0" maxOccurs="n"]_occurs

[minOccurs="1" maxOccurs="n"]_occurs

where n > 1.

[minOccurs="n" maxOccurs="m"]_occurs

where m >= n > 1

[minLength="0" maxLength="1"]_occurs

[minLength="1" maxLength="1"]_occurs

[minLength="0" maxLength="n"]_occurs

[minLength="1" maxLength="n"]_occurs

where n > 1.

[minLength="n" maxLength="m"]_occurs

where m >= n > 1

[length="1"]_occurs

[length="n"]_occurs

where n > 1
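The occurrence mapping can be sketched as a small Python function. The right-hand sides of the rules are not reproduced in the listing above, so the indicators returned here ("?", "*", "+", or no indicator) are an assumption based on the occurrence indicators of the [XPath/XQuery] type system. The names occurs and UNBOUNDED are hypothetical and do not appear in this specification.

```python
from typing import Union

UNBOUNDED = "unbounded"  # hypothetical stand-in for maxOccurs="unbounded"


def occurs(min_occurs: int, max_occurs: Union[int, str]) -> str:
    """Approximate a (minOccurs, maxOccurs) pair by an occurrence indicator.

    Assumption: the indicators are "?", "*", "+" and "" (exactly one).
    """
    if max_occurs == 1:
        if min_occurs == 0:
            return "?"   # zero or one
        if min_occurs == 1:
            return ""    # exactly one, no indicator needed
    elif max_occurs == UNBOUNDED or max_occurs > 1:
        if max_occurs != UNBOUNDED and min_occurs > max_occurs:
            raise ValueError("minOccurs exceeds maxOccurs")
        # Exact bounds above 1 are not tracked; only whether the
        # item may be absent is kept.
        return "*" if min_occurs == 0 else "+"
    raise ValueError("combination not covered by the rules above")
```

Under the same assumption, the minLength and maxLength facets could be passed to the same function, since their rules mirror those for minOccurs and maxOccurs.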

C.1.4.3 mixed

Notation

The following auxiliary grammar productions are used to describe the "mixed" attribute.

Mixed attribute

[53 (Formal)] MixedAttribute ::= "mixed" "=" Boolean

The normalization rule

[MixedAttribute]_mixed

Mixed

maps the mixed attribute MixedAttribute in Schema into a Mixed notation in the [XPath/XQuery] type system.

Schema mapping

If the mixed attribute is true it is mapped to a mixed notation in the [XPath/XQuery] type system.

[ mixed = "true" ]_mixed

mixed

If the mixed attribute is false it is mapped to empty in the [XPath/XQuery] type system.

[ mixed = "false" ]_mixed

C.1.4.4 nillable

Notation

The following auxiliary grammar productions are used to describe the "nillable" attribute.

Nillable attribute

[54 (Formal)] NillableAttribute ::= "nillable" "=" Boolean

The normalization rule

[NillableAttribute]_nillable

Nillable

maps the nillable attribute NillableAttribute in Schema into a Nillable notation in the [XPath/XQuery] type system.

Schema mapping

If the nillable attribute is true it is mapped to a nillable notation in the [XPath/XQuery] type system.

[ nillable = "true" ]_nillable

nillable

If the nillable attribute is false it is mapped to empty in the [XPath/XQuery] type system.

[ nillable = "false" ]_nillable

C.1.4.5 substitutionGroup

Notation

The substitution group declaration indicates the element that a given element can be substituted for. The following auxiliary grammar productions are used to describe the "substitutionGroup" attribute.

substitutionGroup attribute

[55 (Formal)] substitutionGroupAttribute ::= "substitutionGroup" "=" QName

The normalization rule

[substitutionGroupAttribute]_substitution

Substitution

maps the substitutionGroup attribute substitutionGroupAttribute in Schema into a Substitution notation in the [XPath/XQuery] type system.

Schema mapping

If the substitutionGroup attribute is present, it is mapped to a substitutionGroup notation in the [XPath/XQuery] type system.

[ substitutionGroup = QName ]_substitution

substitutes for QName

Otherwise, it is mapped to empty.
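As an illustration, the three rules for mixed, nillable, and substitutionGroup can be written as string-producing Python functions. This is a minimal sketch: the function names are hypothetical, and "empty" is represented here by the empty string.

```python
from typing import Optional


def map_mixed(value: str) -> str:
    """[ mixed = value ]_mixed: "mixed" when true, empty otherwise.

    Only the lexical forms "true" and "false" used in the rules are handled.
    """
    return "mixed" if value == "true" else ""


def map_nillable(value: str) -> str:
    """[ nillable = value ]_nillable: "nillable" when true, empty otherwise."""
    return "nillable" if value == "true" else ""


def map_substitution(qname: Optional[str]) -> str:
    """[ substitutionGroup = QName ]_substitution, or empty when absent."""
    return f"substitutes for {qname}" if qname else ""
```

For example, map_substitution("tns:base") yields the notation "substitutes for tns:base", and map_substitution(None) yields the empty string.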

C.1.5 Anonymous type names

Notation

As explained in [2.4 The [XPath/XQuery] Type System], the [XPath/XQuery] type system uses system-generated type names for anonymous types. For the purpose of this document, those type names are generated at XML Schema import time.

C.2 Schemas as a whole

C.2.1 Schema

Schema component

A schema is represented in XML by the following structure.

<schema

[ not handled ] attributeFormDefault = (qualified | unqualified) : unqualified

[ ignored ] blockDefault = (#all | List of (extension | restriction | substitution)) : ' '

[ not handled ] elementFormDefault = (qualified | unqualified) : unqualified

[ ignored ] finalDefault = (#all | List of (extension | restriction)) : ' '

[ ignored ] id = ID

targetNamespace = anyURI

[ ignored ] version = token

[ ignored ] xml:lang = language

[ ignored ] {any attributes with non-schema namespace ...} >

</schema>

Notation

The following auxiliary grammar productions are used.

XML Schema Pragma and Content

[51 (Formal)]	`SPragma`	::=	`("include" \| "import" \| "redefine" \| "annotation")*`
[52 (Formal)]	`Content`	::=	`(("simpleType" \| "complexType" \| "element" \| "attribute" \| "attributeGroup" \| "group" \| "notation") "annotation")`

The auxiliary normalization rule

[Pragma]_{pragma(targetNCName)}

Definitions

maps a schema pragma into a set of definitions in the [XPath/XQuery] type system.

Schema mapping

Schemas are imported by the "schema" declaration in the preamble of a query. To import a schema, the document referred to by the given URI is opened and the schema declarations contained in the document are translated into the corresponding in-line type definitions. The mechanism for finding a schema document, possibly using the optional schema location hint, is not specified formally.

[schema StringLiteral (at StringLiteral)?]_Schema

[open-schema-document(StringLiteral (at StringLiteral)?)]_Schema

[

<schema

targetNamespace = targetURI >

Pragma Content

</schema>

]_Schema

[Pragma]_{pragma(targetNCName)} [Content]_{definition(targetNCName)}

C.2.2 Include

Schema component

A schema include is represented in XML by the following structure.

<include

[ ignored ] id = ID

schemaLocation = anyURI

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?)

</include>

Schema mapping

A schema include is not specified here, and is assumed to be handled by the XML Schema processor.

C.2.3 Redefine

Schema component

A schema redefinition is represented in XML by the following structure.

<redefine

[ ignored ] id = ID

schemaLocation = anyURI

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation | (simpleType | complexType | group | attributeGroup))*

</redefine>

Schema mapping

A schema redefine is not specified here, and is assumed to be handled by the XML Schema processor.

C.2.4 Import

Schema component

A schema import is represented in XML by the following structure.

<import

[ ignored ] id = ID

namespace = anyURI

schemaLocation = anyURI

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?)

</import>

Schema mapping

A schema import is not specified here, and is assumed to be handled by the XML Schema processor.

C.3 Attribute Declarations

Schema component

The following structure describes attribute declarations in XML Schema.

<attribute

[ not handled ] default = string

[ not handled ] fixed = string

[ not handled ] form = (qualified | unqualified)

[ ignored ] id = ID

name = NCName

ref = QName

type = QName

use = (optional | prohibited | required) : optional

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, (simpleType?))

</attribute>

C.3.1 Global attribute declarations

Schema import distinguishes between global attribute declarations and local attribute declarations.

Schema mapping

Global attribute declarations are mapped like local attribute declarations, but are prefixed by a "define" keyword in the [XPath/XQuery] type system.

[AttributeDecl]_{definition(targetNCName)}

define [AttributeDecl]_{content(targetNCName)}

C.3.2 Local attribute declarations

Schema mapping

Local attributes whose type is given by a reference to a global type name are mapped in the type system as follows.

[

<attribute

name = NCName

type = QName

UseAttribute />

]_{content(targetNCName)}

( attribute targetNCName:NCName { of type QName } )[UseAttribute]_use

References to a global attribute are mapped in the type system as follows.

[

<attribute

ref = QName

UseAttribute />

]_{content(targetNCName)}

( attribute QName )[UseAttribute]_use

A local attribute with a local content is mapped to the [XPath/XQuery] type system as follows. Let fs:anon_k be a newly generated anonymous name.

[

<attribute

name = NCName

UseAttribute >

simpleType

</attribute>

]_{content(targetNCName)}

( attribute targetNCName:NCName of type fs:anon_k )[UseAttribute]_use

with

define type fs:anon_k of type xs:anySimpleType { [simpleType]_{content(targetNCName)} }
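The rule for a local attribute with local content produces two pieces: the attribute type itself and a definition for the freshly generated anonymous type. The Python sketch below illustrates this under stated assumptions: the numbering scheme for fs:anon_k, the function names, and the convention of passing the already-mapped simple type and occurrence indicator as strings are all hypothetical.

```python
import itertools
from typing import Iterator, Tuple


def anon_names() -> Iterator[str]:
    """Yield fs:anon_1, fs:anon_2, ... (the numbering is an assumption)."""
    for k in itertools.count(1):
        yield f"fs:anon_{k}"


def map_local_attribute(names: Iterator[str], target: str, name: str,
                        mapped_simple_type: str,
                        occurrence: str = "") -> Tuple[str, str]:
    """Return (attribute type, anonymous type definition).

    `mapped_simple_type` stands for [simpleType]_content and `occurrence`
    for [UseAttribute]_use; both are supplied by the caller.
    """
    anon = next(names)  # a newly generated anonymous name
    use = f"( attribute {target}:{name} of type {anon} ){occurrence}"
    definition = (f"define type {anon} of type xs:anySimpleType "
                  f"{{ {mapped_simple_type} }}")
    return use, definition
```

Sharing one name generator across the whole import keeps the anonymous names distinct within the resulting set of definitions.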

C.4 Element Declarations

Schema component

The following structure describes element declarations in XML Schema.

<element

[ ignored ] abstract = boolean : false

[ ignored ] block = (#all | List of (extension | restriction))

[ not handled ] default = string

[ ignored ] final = (#all | List of (extension | restriction))

[ not handled ] fixed = string

[ not handled ] form = (qualified | unqualified)

[ ignored ] id = ID

maxOccurs = (nonNegativeInteger | unbounded) : 1

minOccurs = nonNegativeInteger : 1

name = NCName

nillable = boolean : false

ref = QName

substitutionGroup = QName

type = QName

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, ((simpleType | complexType)?, (unique | key | keyref)*))

</element>

C.4.1 Global element declarations

Schema import distinguishes between global element declarations and local element declarations.

Schema mapping

Global element declarations are mapped like local element declarations, but are prefixed by a "define" keyword in the [XPath/XQuery] type system.

[

<element

name = NCName

NillableAttribute

substitutionGroupAttribute

type = QName />

]_{definition(targetNCName)}

define element targetNCName:NCName [substitutionGroupAttribute]_substitution [NillableAttribute]_nillable of type QName

[

<element

name = NCName

NillableAttribute

substitutionGroupAttribute >

ElementContentType

</element>

]_{definition(targetNCName)}

define element targetNCName:NCName [substitutionGroupAttribute]_substitution [NillableAttribute]_nillable [ElementContentType]_{content(targetNCName)}
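The first of the two rules above, for a global element whose type is given by reference, can be sketched in Python as follows. The function name and parameters are hypothetical; `target` stands for the prefix written targetNCName in the rule.

```python
from typing import Optional


def map_global_element(target: str, name: str, type_qname: str,
                       nillable: bool = False,
                       substitution_group: Optional[str] = None) -> str:
    """Build the definition produced for a global element with type = QName."""
    parts = ["define element", f"{target}:{name}"]
    if substitution_group:
        # [substitutionGroupAttribute]_substitution
        parts.append(f"substitutes for {substitution_group}")
    if nillable:
        # [NillableAttribute]_nillable
        parts.append("nillable")
    parts.append(f"of type {type_qname}")
    return " ".join(parts)
```

Absent attributes simply contribute nothing to the result, which matches the "mapped to empty" cases of the nillable and substitutionGroup rules.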

C.4.2 Local element declarations

Schema mapping

Local element declarations are mapped into corresponding notations in the [XPath/XQuery] type system. Note that a substitution group cannot be declared on local elements.

[

<element

OccursAttributes

name = NCName

NillableAttribute

type = QName />

]_{content(targetNCName)}

( element targetNCName:NCName [NillableAttribute]_nillable of type QName ) [OccursAttributes]_occurs

[

<element

OccursAttributes

ref = QName />

]_{content(targetNCName)}

( element QName ) [OccursAttributes]_occurs

Let fs:anon_k be a newly generated anonymous name.

[

<element

OccursAttributes

name = NCName

NillableAttribute >

ElementContentType

</element>

]_{definition(targetNCName)}

( element targetNCName:NCName [NillableAttribute]_nillable of type fs:anon_k ) [OccursAttributes]_occurs

with

define type fs:anon_k [ElementContentType]_{content(targetNCName)}

C.5 Complex Type Definitions

Schema component

A complex type definition is represented in XML by the following structure.

<complexType

[ ignored ] abstract = boolean : false

[ ignored ] block = (#all | List of (extension | restriction))

[ ignored ] final = (#all | List of (extension | restriction))

[ ignored ] id = ID

mixed = boolean : false

name = NCName

[ ignored ] {any attributes with non-schema namespace ...} >

</complexType>

Notation

The following auxiliary grammar productions are used to describe the content of a complex type definition.

Complex type content

[62 (Formal)]	`ComplexTypeContent`	::=	`"annotation"? ("simpleContent" \| "complexContent" \| (ChildrenContent AttributeContent))`
[65 (Formal)]	`AttributeContent`	::=	`("attribute" \| "attributeGroup")* "anyAttribute"?`
[63 (Formal)]	`ChildrenContent`	::=	`("group" \| "all" \| "choice" \| "sequence")?`

C.5.1 Global complex type

Schema import distinguishes between global complex types (which are mapped to sort declarations) and local complex types (which are mapped to type definitions).

Schema mapping

In the case of global complex types, the mapping rule which applies is denoted by []_{definition(targetNCName)}.

[

<complexType

MixedAttribute

name = NCName >

ComplexTypeContent

</complexType>

]_{definition(targetNCName)}

define type targetNCName:NCName [MixedAttribute ComplexTypeContent]_{mixed_content(targetNCName)}

Note that the mixed attribute is passed along in the normalization rules, in order to map it later on to the appropriate indication in the [XPath/XQuery] type system.

C.5.2 Local complex type

Schema mapping

In the case of local complex types, there must not be a name attribute and the mapping rule which applies is denoted by []_{content(targetNCName)}.

[

<complexType

MixedAttribute >

ComplexTypeContent

</complexType>

]_{content(targetNCName)}

[MixedAttribute ComplexTypeContent]_{mixed_content(targetNCName)}

Note that the mixed attribute is passed along in the normalization rules, in order to map it later on to the appropriate indication in the [XPath/XQuery] type system.

C.5.3 Complex type with simple content

Schema component

A complex type can be of simple content. A simple content is represented in XML by the following structure.

<simpleContent

[ ignored ] id = ID

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, (restriction | extension))

</simpleContent>

Derivation by restriction inside a simple content is represented in XML by the following structure.

<restriction

base = QName

[ ignored ] id = ID

[ ignored ] {any attributes with non-schema namespace ...} >

</restriction>

Derivation by extension inside a simple content is represented in XML by the following structure.

<extension

base = QName

[ ignored ] id = ID

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, ((attribute | attributeGroup)*, anyAttribute?))

</extension>

Notation

The normalization rule

[MixedAttribute ComplexTypeContent]_{mixed_content(targetNCName)}

TypeDerivation

maps a pair of mixed attribute and complex type content to a type derivation.

Schema mapping

A complex type with simple content must not have a mixed attribute set to "true".

If the simple content is derived by restriction, it is mapped into a simple type restriction in the [XPath/XQuery] type system. Only the name of the base atomic type and attributes are mapped, while the actual simple type restriction is ignored. (Remember that facets are not captured in the [XPath/XQuery] type system.)

[

mixed = "false"

<simpleContent>

<restriction

base = QName >

simpleContentRestriction AttributeContent

</restriction>

</simpleContent>

]_{mixed_content(targetNCName)}

restricts QName { [AttributeContent]_{content(targetNCName)} QName }

If the simple content is derived by extension, it is mapped into an extended type specifier in the [XPath/XQuery] type system.

[

mixed = "false"

<simpleContent>

<extension

base = QName >

AttributeContent

</extension>

</simpleContent>

]_{mixed_content(targetNCName)}

extends QName { [AttributeContent]_{content(targetNCName)} }

C.5.4 Complex type with complex content

Schema component

A complex type can be of complex content. A complex content is represented in XML by the following structure.

<complexContent

[ ignored ] id = ID

mixed = boolean : false

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, (restriction | extension))

</complexContent>

Derivation by restriction inside a complex content is represented in XML by the following structure.

<restriction

base = QName

[ ignored ] id = ID

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, (group | all | choice | sequence)?, ((attribute | attributeGroup)*, anyAttribute?))

</restriction>

Derivation by extension inside a complex content is represented in XML by the following structure.

<extension

base = QName

[ ignored ] id = ID

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, ((group | all | choice | sequence)?, ((attribute | attributeGroup)*, anyAttribute?)))

</extension>

Schema mapping

If the complex content is derived by restriction, it is mapped into a type restriction in the [XPath/XQuery] type system.

[

MixedAttribute

<complexContent>

<restriction

base = QName >

annotation? ChildrenContent AttributeContent

</restriction>

</complexContent>

]_{mixed_content(targetNCName)}

restricts QName [MixedAttribute]_mixed { [AttributeContent]_{content(targetNCName)} [ChildrenContent]_{content(targetNCName)} }

If the complex content is derived by extension, it is mapped into an extended type specifier in the [XPath/XQuery] type system.

[

MixedAttribute

<complexContent>

<extension

base = QName >

annotation? ChildrenContent AttributeContent

</extension>

</complexContent>

]_{mixed_content(targetNCName)}

extends QName [MixedAttribute]_mixed { [AttributeContent]_{content(targetNCName)} [ChildrenContent]_{content(targetNCName)} }

C.6 Attribute Uses

Mapping for attribute uses is given in [C.1.4 Special attributes].

C.7 Attribute Group Definitions

C.7.1 Attribute group definitions

Schema component

Attribute group definitions are represented in XML by the following structure.

<attributeGroup

[ ignored ] id = ID

name = NCName

ref = QName

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, ((attribute | attributeGroup)*, anyAttribute?))

</attributeGroup>

Schema mapping

Attribute group definitions are not currently handled by the mapping. See Issue 501 (FS-Issue-0158).

C.7.2 Attribute group reference

Schema mapping

Attribute group references are not currently handled by the mapping. See Issue 501 (FS-Issue-0158).

C.8 Model Group Definitions

Schema component

Model group definitions are represented in XML by the following structure.

<group

name = NCName >

Content: (annotation?, (all | choice | sequence))

</group>

Schema mapping

Model group definitions are not currently handled by the mapping. See Issue 501 (FS-Issue-0158).

C.9 Model Groups

Model groups are either "all", "sequence" or "choice". One can also refer to a model group definition.

C.9.1 All groups

Schema component

All groups are represented in XML by the following structure.

<all

[ ignored ] id = ID

maxOccurs = 1 : 1

minOccurs = (0 | 1) : 1

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, element*)

</all>

Schema mapping

All groups are mapped into the "&" operation in the [XPath/XQuery] type system.

[

<all

OccursAttributes >

Element₁ ... Element_n

</all>

]_{content(targetNCName)}

([Element₁]_{content(targetNCName)} & ... & [Element_n]_{content(targetNCName)}) [OccursAttributes]_occurs

C.9.2 Choice groups

Schema component

Choice groups are represented in XML by the following structure.

<choice

[ ignored ] id = ID

maxOccurs = (nonNegativeInteger | unbounded) : 1

minOccurs = nonNegativeInteger : 1

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, (element | group | choice | sequence | any)*)

</choice>

Notation

The following auxiliary grammar productions are used to describe group components.

Group Component

[64 (Formal)] GroupComponent ::= "element" | "group" | "choice" | "sequence" | "any"

Schema mapping

Choice groups are mapped into the "|" operation in the [XPath/XQuery] type system.

[

<choice

OccursAttributes >

GroupComponent₁ ... GroupComponent_n

</choice>

]_{content(targetNCName)}

([GroupComponent₁]_{content(targetNCName)} | ... | [GroupComponent_n]_{content(targetNCName)}) [OccursAttributes]_occurs

C.9.3 Sequence groups

Schema component

Sequence groups are represented in XML by the following structure.

<sequence

[ ignored ] id = ID

maxOccurs = (nonNegativeInteger | unbounded) : 1

minOccurs = nonNegativeInteger : 1

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, (element | group | choice | sequence | any)*)

</sequence>

Schema mapping

Sequence groups are mapped into the "," operation in the [XPath/XQuery] type system.

[

<sequence

OccursAttributes >

GroupComponent₁ ... GroupComponent_n

</sequence>

]_{content(targetNCName)}

([GroupComponent₁]_{content(targetNCName)} , ... , [GroupComponent_n]_{content(targetNCName)}) [OccursAttributes]_occurs
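The three model group rules share one shape: map each member, join the results with the group's operator, and append the occurrence indicator. The following Python sketch shows this recursion over a small, hypothetical in-memory representation; the ElementRef and Group classes and the map_particle function are illustrative names, and the occurrence indicator is assumed to have been computed already.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class ElementRef:
    qname: str  # the QName of a referenced global element


@dataclass
class Group:
    kind: str  # "all", "choice" or "sequence"
    members: List[Union["Group", ElementRef]] = field(default_factory=list)
    occurrence: str = ""  # stands for [OccursAttributes]_occurs


# Operator of the type system used for each kind of model group.
OPERATORS = {"all": " & ", "choice": " | ", "sequence": " , "}


def map_particle(particle: Union[Group, ElementRef]) -> str:
    """Map a model group (recursively) or an element reference to a type."""
    if isinstance(particle, ElementRef):
        return f"element {particle.qname}"
    inner = OPERATORS[particle.kind].join(
        map_particle(member) for member in particle.members)
    return f"({inner}){particle.occurrence}"
```

For instance, a sequence containing tns:a followed by a repeated choice between tns:b and tns:c maps to "(element tns:a , (element tns:b | element tns:c)*)".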

C.10 Particles

Particles contribute to the definition of content models.

A particle can be either an element reference, a group reference or a wildcard.

C.10.1 Element reference

Schema component

Element reference particles are represented in XML by the following structure.

<element

ref = QName

maxOccurs = (nonNegativeInteger | unbounded) : 1

minOccurs = nonNegativeInteger : 1

[ ignored ] {any attributes with non-schema namespace ...} >

Schema mapping

Element references are mapped into element references in the [XPath/XQuery] type system.

[

<element

ref = QName

OccursAttributes />

]_{content(targetNCName)}

element QName [OccursAttributes]_occurs

C.10.2 Group reference

Schema component

Group reference particles are represented in XML by the following structure.

<group

ref = QName

maxOccurs = (nonNegativeInteger | unbounded) : 1

minOccurs = nonNegativeInteger : 1

[ ignored ] {any attributes with non-schema namespace ...} >

Schema mapping

Model group references are not currently handled by the mapping.

C.11 Wildcards

C.11.1 Attribute wildcards

Schema component

Attribute wildcards are represented in XML by the following structure.

<anyAttribute

[ ignored ] id = ID

[ not handled ] namespace = ((##any | ##other) | List of (anyURI | (##targetNamespace | ##local)) ) : ##any

processContents = (lax | skip | strict) : strict

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?)

</anyAttribute>

Schema mapping

An attribute wildcard with a "skip" process content is mapped as an attribute wildcard in the [XPath/XQuery] type system.

[

<anyAttribute

processContents = "skip" >

annotation?

</anyAttribute>

]_{content(targetNCName)}

(attribute (*, xdt:untypedAtomic))*

[

<anyAttribute

processContents = "lax" >

annotation?

</anyAttribute>

]_{content(targetNCName)}

attribute *

[

<anyAttribute

processContents = "strict" >

annotation?

</anyAttribute>

]_{content(targetNCName)}

attribute *
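The three attribute wildcard rules amount to a lookup on the processContents value. A minimal Python sketch, with a hypothetical function name and table name, could look like this:

```python
# Result of the mapping for each processContents value, as given by the rules.
ATTRIBUTE_WILDCARD_TYPES = {
    "skip": "(attribute (*, xdt:untypedAtomic))*",
    "lax": "attribute *",
    "strict": "attribute *",
}


def map_any_attribute(process_contents: str = "strict") -> str:
    """Map an anyAttribute wildcard; "strict" is the schema default."""
    try:
        return ATTRIBUTE_WILDCARD_TYPES[process_contents]
    except KeyError:
        raise ValueError(
            f"unknown processContents value: {process_contents!r}") from None
```

The namespace attribute is deliberately not a parameter, since namespace wildcards are not handled by the mapping.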

Editorial note
Namespace wildcards are not handled by the mapping.

C.11.2 Element wildcards

Schema component

Element wildcards are represented in XML by the following structure.

<any

[ ignored ] id = ID

maxOccurs = (nonNegativeInteger | unbounded) : 1

minOccurs = nonNegativeInteger : 1

[ not handled ] namespace = ((##any | ##other) | List of (anyURI | (##targetNamespace | ##local)) ) : ##any

processContents = (lax | skip | strict) : strict

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?)

</any>

Schema mapping

An element wildcard with a "skip" process content is mapped as an element wildcard in the [XPath/XQuery] type system.

[

<any

OccursAttributes

processContents = "skip" >

annotation?

</any>

]_{content(targetNCName)}

( element (*, xdt:untyped) )[OccursAttributes]_occurs

[

<any

OccursAttributes

processContents = "lax" >

annotation?

</any>

]_{content(targetNCName)}

( element (*, xs:anyType) )[OccursAttributes]_occurs

Editorial note
Element wildcards with a "lax" or "strict" process content are not handled by the mapping.

Editorial note
Namespace wildcards are not handled by the mapping.

C.12 Identity-constraint Definitions

All identity-constraint definitions are ignored when mapping into the [XPath/XQuery] type system.

C.13 Notation Declarations

All notation declarations are ignored when mapping into the [XPath/XQuery] type system.

C.14 Annotation

All annotations are ignored when mapping into the [XPath/XQuery] type system.

C.15 Simple Type Definitions

Schema component

A simple type is represented in XML by the following structure.

<simpleType

[ ignored ] final = (#all | (list | union | restriction))

[ ignored ] id = ID

name = NCName

[ ignored ] {any attributes with non-schema namespace ...} >

name = NCName

</simpleType>

Derivation by restriction inside a simple type is represented in XML by the following structure.

<restriction

base = QName

[ ignored ] id = ID

[ ignored ] {any attributes with non-schema namespace ...} >

</restriction>

Derivation by list inside a simple type is represented in XML by the following structure.

<list

[ ignored ] id = ID

itemType = QName

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, (simpleType?))

</list>

Derivation by union inside a simple type is represented in XML by the following structure.

<union

[ ignored ] id = ID

memberTypes = List of QName

[ ignored ] {any attributes with non-schema namespace ...} >

Content: (annotation?, (simpleType*))

</union>

C.15.1 Global simple type definition

Schema import distinguishes between global simple types (which are mapped to sort declarations) and local simple types (which are mapped to type definitions).

Schema mapping

In the case of global simple types, the mapping rule which applies is denoted by []_{definition(targetNCName)}.

[

<simpleType

name = NCName >

SimpleTypeContent

</simpleType>

]_{definition(targetNCName)}

define type targetNCName:NCName [SimpleTypeContent]_{simple_content(targetNCName)}

C.15.2 Local simple type definition

Schema mapping

In the case of local simple types, the mapping rule which applies is denoted by []_{content(targetNCName)}.

[

<simpleType>

SimpleTypeContent

</simpleType>

]_{content(targetNCName)}

[SimpleTypeContent]_{simple_content(targetNCName)}

C.15.3 Simple type content

Notation

The normalization rule []_{simple_content(targetNCName)} maps a simple type content to a type specifier and an optional occurrence indicator.

Schema mapping

If the simple type is derived by restriction, it is mapped into a simple type restriction in the [XPath/XQuery] type system. The name of the base atomic type and attributes are mapped. Only the minLength, maxLength, and length facets in the simple type restriction are handled. All other properties of the simple-type restriction are ignored.

[

<restriction

base = QName >

simpleContentRestriction

</restriction>

]_{simple_content(targetNCName)}

restricts QName { QName } [simpleContentRestriction]_occurs

If the simple type is derived by list, and its content type does not constrain the length of the list, it is mapped into a zero-or-more repetition type in the [XPath/XQuery] type system.

[

<list>

SimpleType

</list>

]_{simple_content(targetNCName)} Type = [SimpleType]_{content(targetNCName)}

{ Type * }

If the simple type is derived by list, and its content type does constrain the length of the list, then it is mapped into a repetition type with the corresponding occurrence indicator in the [XPath/XQuery] type system.

[

<list>

SimpleType

</list>

]_{simple_content(targetNCName)} Type · Occurrence = [SimpleType]_{content(targetNCName)}

{ Type · Occurrence }

[

<list

itemType = QName />

]_{simple_content(targetNCName)}

{ QName* }

If the simple type is derived by union, it is mapped into a union type in the [XPath/XQuery] type system.

[

<union>

SimpleType₁ ... SimpleType_n

</union>

]_{simple_content(targetNCName)}

{ ([SimpleType₁]_{content(targetNCName)} | ... | [SimpleType_n]_{content(targetNCName)}) }

[

<union

memberTypes = QName₁ ... QName_n />

]_{simple_content(targetNCName)}

{ QName₁ | ... | QName_n }
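The attribute-based forms of list and union derivation (itemType and memberTypes) can be illustrated with two short Python functions. The function names are hypothetical; memberTypes is taken as the whitespace-separated list of QNames that appears in the schema document.

```python
def map_list_item_type(item_type: str) -> str:
    """<list itemType = QName /> maps to { QName* }."""
    return f"{{ {item_type}* }}"


def map_union_member_types(member_types: str) -> str:
    """<union memberTypes = QName1 ... QNamen /> maps to { QName1 | ... | QNamen }."""
    qnames = member_types.split()  # memberTypes is a whitespace-separated list
    if not qnames:
        raise ValueError("memberTypes must name at least one type")
    return "{ " + " | ".join(qnames) + " }"
```

For example, map_union_member_types("xs:integer xs:string") yields "{ xs:integer | xs:string }".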

D References

D.1 Normative References

XML: Extensible Markup Language (XML) 1.0 (Third Edition), C. M. Sperberg-McQueen, Eve Maler, Tim Bray, et al., Editors. World Wide Web Consortium, 04 Feb 2004. This version is http://www.w3.org/TR/2004/REC-xml-20040204. The latest version is available at http://www.w3.org/TR/REC-xml.
XML Names 1.1: World Wide Web Consortium. Namespaces in XML 1.1. W3C Recommendation. See http://www.w3.org/TR/xml-names11/
Schema Part 1: XML Schema Part 1: Structures Second Edition, David Beech, Noah Mendelsohn, Murray Maloney, and Henry S. Thompson, Editors. World Wide Web Consortium, 28 Oct 2004. This version is http://www.w3.org/TR/2004/REC-xmlschema-1-20041028/. The latest version is available at http://www.w3.org/TR/xmlschema-1/.
Schema Part 2: XML Schema Part 2: Datatypes Second Edition, Paul V. Biron and Ashok Malhotra, Editors. World Wide Web Consortium, 28 Oct 2004. This version is http://www.w3.org/TR/2004/REC-xmlschema-2-20041028/. The latest version is available at http://www.w3.org/TR/xmlschema-2/.
Data Model: XQuery 1.0 and XPath 2.0 Data Model (XDM), Norman Walsh, Mary Fernández, Ashok Malhotra, et al., Editors. World Wide Web Consortium, 3 Nov 2005. This version is http://www.w3.org/TR/2005/CR-xpath-datamodel-20051103/. The latest version is available at http://www.w3.org/TR/xpath-datamodel/.
Data Model Serialization: XSLT 2.0 and XQuery 1.0 Serialization, Joanne Tong, Michael Kay, Norman Walsh, et al., Editors. World Wide Web Consortium, 3 Nov 2005. This version is http://www.w3.org/TR/2005/CR-xslt-xquery-serialization-20051103/. The latest version is available at http://www.w3.org/TR/xslt-xquery-serialization/.
XQuery 1.0: An XML Query Language: XQuery 1.0: An XML Query Language, Don Chamberlin, Anders Berglund, Scott Boag, et al., Editors. World Wide Web Consortium, 3 Nov 2005. This version is http://www.w3.org/TR/2005/CR-xquery-20051103/. The latest version is available at http://www.w3.org/TR/xquery/.
XML Path Language (XPath) 2.0: XML Path Language (XPath) 2.0, Don Chamberlin, Anders Berglund, Scott Boag, et al., Editors. World Wide Web Consortium, 3 Nov 2005. This version is http://www.w3.org/TR/2005/CR-xpath20-20051103/. The latest version is available at http://www.w3.org/TR/xpath20/.
Functions and Operators: XQuery 1.0 and XPath 2.0 Functions and Operators, Ashok Malhotra, Jim Melton, and Norman Walsh, Editors. World Wide Web Consortium, 3 Nov 2005. This version is http://www.w3.org/TR/2005/CR-xpath-functions-20051103/. The latest version is available at http://www.w3.org/TR/xpath-functions/.

D.2 Non-normative References

XML Schema Part 0: XML Schema Part 0: Primer Second Edition, David C. Fallside and Priscilla Walmsley, Editors. World Wide Web Consortium, 28 Oct 2004. This version is http://www.w3.org/TR/2004/REC-xmlschema-0-20041028/. The latest version is available at http://www.w3.org/TR/xmlschema-0/.
XML Query 1.0 Requirements: XML Query (XQuery) Requirements, Don Chamberlin, Peter Fankhauser, Massimo Marchiori, and Jonathan Robie, Editors. World Wide Web Consortium, 3 Jun 2005. This version is http://www.w3.org/TR/2005/WD-xquery-requirements-20050603/. The latest version is available at http://www.w3.org/TR/xquery-requirements/.

D.3 Background References

Languages: Handbook of Formal Languages. G. Rozenberg and A. Salomaa, editors. Springer-Verlag. 1997.
TATA: Tree Automata Techniques and Applications. H. Comon and M. Dauchet and R. Gilleron and F. Jacquemard and D. Lugiez and S. Tison and M. Tommasi. See http://www.grappa.univ-lille3.fr/tata/. 1997.

E Auxiliary Judgments for Validation (Non-Normative)

E.1 Judgments for the validate expression

XQuery supports XML Schema validation using the validate expression. This section gives a non-normative formal semantics of XML Schema validation, solely for the purpose of specifying its usage in XQuery.

Specifying XML Schema validation requires a fairly large number of auxiliary judgments. Two main judgments are used to describe the semantics of validation.

The "erase" judgment takes a value and removes all type information from it. This operation is necessary since, in XQuery, validation can occur on either well-formed or already validated documents.
The "annotate" judgment takes an untyped value and a type and either fails or succeeds by returning a new, validated value.

Before defining those two judgments, we first introduce auxiliary judgments used to describe specific parts of the XML Schema's semantics.

E.1.1 Type resolution

Notation

The judgment

statEnv |- (TypeReference | TypeDerivation) resolves to TypeName { Type }

holds when a type reference or a type derivation resolves to the given type name and type content.

Semantics

This judgment is specified by the following rules.

If the type is omitted, it is resolved as the empty sequence type.

statEnv |- Derivation? Mixed? { empty } resolves to TypeName { Type }

statEnv |- Derivation? Mixed? { } resolves to TypeName { Type }

In the case of a type reference, the type name is the name of that type, and the type is obtained by resolving the type declaration of the global type.

statEnv |- TypeName of elem/type expands to expanded-QName

statEnv.typeDefn(expanded-QName) => define type TypeName TypeDerivation

statEnv |- TypeDerivation resolves to BaseTypeName { Type }

statEnv |- of type TypeName resolves to TypeName { Type }

In the above inference rule, note that BaseTypeName is the base type of the type referred to. It is therefore the original type name, TypeName, that must be returned and eventually used to annotate the corresponding element or attribute. The type itself, however, needs to be obtained through a second application of the resolves to judgment.

If the type derivation is a restriction, then the type name is the name of the base type, and the type is taken from the type derivation.

statEnv |- Mixed? Type adjusts to AdjustedType

statEnv |- restricts TypeName Mixed? { Type } resolves to TypeName { AdjustedType }

If the type derivation is an extension, then the type name is the name of the base type, and the type is the base type extended by the type in the type derivation.

statEnv |- TypeName of elem/type expands to expanded-QName

statEnv.typeDefn(expanded-QName) => define type TypeName Derivation? BaseMixed? { BaseType? }

statEnv |- BaseType? extended by Type is ExtendedType

statEnv |- Mixed? ExtendedType adjusts to AdjustedType

statEnv |- extends TypeName Mixed? { Type } resolves to TypeName { AdjustedType }

E.1.2 Interleaving

Notation

The judgment

statEnv |- Value₁ interleave Value₂ yields Value₃

holds if some interleaving of Value₁ and Value₂ yields Value₃. Interleaving is non-deterministic; it is used for processing all groups (types of the form Type₁ & Type₂).

Semantics

This judgment is specified by the following rules.

Interleaving two empty sequences yields the empty sequence.

statEnv |- () interleave () yields ()

Otherwise, pick an item from the head of one of the sequences, and recursively interleave the remainder.

statEnv |- Value₁ interleave Value₂ yields Value₃

statEnv |- Item,Value₁ interleave Value₂ yields Item,Value₃

statEnv |- Value₁ interleave Value₂ yields Value₃

statEnv |- Value₁ interleave Item,Value₂ yields Item,Value₃
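
The non-deterministic rules above can be read as a recipe for enumerating every possible interleaving. The following Python fragment is a minimal, non-normative sketch of that reading. The function names `interleavings` and `is_interleaving`, and the use of Python lists to stand for sequences, are assumptions made for illustration and do not appear in this specification.

```python
def interleavings(v1, v2):
    """Yield every sequence obtainable by interleaving v1 and v2.

    Mirrors the three rules: two empty sequences yield the empty sequence;
    otherwise take the head item of either sequence and recursively
    interleave the remainder.
    """
    if not v1 and not v2:
        yield []
        return
    if v1:  # pick the head of the first sequence
        for rest in interleavings(v1[1:], v2):
            yield [v1[0]] + rest
    if v2:  # pick the head of the second sequence
        for rest in interleavings(v1, v2[1:]):
            yield [v2[0]] + rest


def is_interleaving(v1, v2, v3):
    """True when the judgment 'v1 interleave v2 yields v3' would hold."""
    return any(candidate == v3 for candidate in interleavings(v1, v2))
```

Each input sequence keeps its internal order in every result; only the relative placement of items from the two sequences varies.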

E.1.3 Attribute filtering

Introduction

Finally, we introduce an auxiliary judgment which extracts the value of a given attribute if it exists. This judgment is not used in the semantics of step expressions, but in [8.3 Judgments for type matching], and is based on the other filter judgments.

Notation

The judgment

Value filter @QName => ()

holds if there are no occurrences of the attribute QName in Value. The judgment

Value filter @QName => SimpleValue

holds if there is one occurrence of the attribute QName in Value, and the value of that attribute is SimpleValue. The judgment

Value filter @QName => () or SimpleValue

holds if either of the previous two judgments holds.

Semantics

The filter judgments are defined as follows.

dynEnv |- Value₁ of attribute:: => Value₂

dynEnv |- Value₂ of attribute, QName => ()

Value₁ filter @QName => ()

dynEnv |- Value₁ of attribute:: => Value₂

dynEnv |- Value₂ of attribute,QName => Value₃

Value₃ = attribute QName { SimpleValue }

Value₁ filter @QName => SimpleValue

E.1.4 Erasure

E.1.4.1 Simply erases

Notation

To define erasure, an auxiliary judgment is needed. The judgment

statEnv |- SimpleValue simply erases to String

holds when SimpleValue erases to the string String.

Semantics

This judgment is specified by the following rules.

The empty sequence erases to the empty string.

statEnv |- () simply erases to ""

The concatenation of two non-empty sequences of values erases to the concatenation of their erasures with a separating space.

statEnv |- SimpleValue₁ simply erases to String₁ SimpleValue₁ != ()

statEnv |- SimpleValue₂ simply erases to String₂ SimpleValue₂ != ()

statEnv |- SimpleValue₁,SimpleValue₂ simply erases to fn:concat(String₁," ",String₂)

An atomic value erases to its string representation as an instance of xdt:untypedAtomic.

statEnv |- AtomicValue of type AtomicTypeName simply erases to dm:string-value(AtomicValue) of type xdt:untypedAtomic
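
As an informal illustration of the three rules above, simple erasure amounts to joining the string values of the atomic items with single spaces. The sketch below assumes that each atomic value is modelled as a `(lexical, type_name)` pair whose first component already holds the result of dm:string-value; that representation and the name `simply_erase` are hypothetical.

```python
def simply_erase(simple_value):
    """Erase a simple value, given as a list of (lexical, type_name) pairs.

    The empty sequence erases to the empty string; a longer sequence erases
    to the string values of its items separated by single spaces. The type
    names are discarded, since the result is untyped.
    """
    return " ".join(lexical for lexical, _type_name in simple_value)
```

For example, `simply_erase([("1", "xs:integer"), ("2", "xs:integer")])` gives `"1 2"`, which the rule for atomic values would then label with xdt:untypedAtomic.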

E.1.4.2 Erases

Notation

The erases to judgment is used in the definition of the dynamic semantics of validation. The normative dynamic semantics of validation is specified in Section 3.13 Validate Expressions^XQ. The effect of the validate expression is equivalent to:

serialization of the data model, as described in [Data Model Serialization], followed by
validation of the serialized value into a Post-Schema Validated Infoset, as described in [Schema Part 1], followed by
construction of a new data model value, as described in [Data Model].

Erasure is the formal equivalent of serialization followed by construction of a new data model value in which all element nodes are labeled with xdt:untyped and all attribute nodes with xdt:untypedAtomic.

The judgment

statEnv |- Value₁ erases to Value₂

holds when the erasure of Value₁ is Value₂.

Semantics

This judgment is specified by the following rules.

The empty sequence erases to itself.

statEnv |- () erases to ()

The erasure of the concatenation of two values is the concatenation of their erasures, so long as neither of the two original values is simple.

statEnv |- Value₁ erases to Value₁' statEnv |- Value₁ not a simple value

statEnv |- Value₂ erases to Value₂' statEnv |- Value₂ not a simple value

statEnv |- Value₁,Value₂ erases to Value₁',Value₂'

The erasure of an element is an element that has the same name and the type xdt:untyped and the erasure of the original content.

statEnv |- Value₁ erases to Value₂

statEnv |- element ElementName of type TypeName { Value₁ } erases to element ElementName of type xdt:untyped { Value₂ }

The erasure of an attribute is an attribute that has the same name and the type xdt:untypedAtomic and the simple erasure of the original content labeled with xdt:untypedAtomic.

statEnv |- Value simply erases to String

statEnv |- attribute AttributeName of type TypeName { Value } erases to attribute AttributeName of type xdt:untypedAtomic { String of type xdt:untypedAtomic }

The erasure of a document is a document with the erasure of the original content.

statEnv |- Value₁ erases to Value₂

statEnv |- document { Value₁ } erases to document { Value₂ }

The erasure of a text or comment or processing-instruction node is itself.

statEnv |- text { String } erases to text { String }

statEnv |- comment { String } erases to comment { String }

statEnv |- processing-instruction QName { String } erases to processing-instruction QName { String }

The erasure of a simple value is the corresponding text node.

statEnv |- SimpleValue simply erases to String

statEnv |- SimpleValue erases to text { String }
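
The erasure rules can be pictured as a recursive walk over a value. The Python sketch below is non-normative and rests on several assumptions: the classes `Atomic`, `Text`, `Attribute`, `Element`, and `Document` are a hypothetical stand-in for the data model, comment and processing-instruction nodes are not modelled, and each maximal run of adjacent atomic values is treated as one simple value (one way of resolving the choice that the rules leave open).

```python
from dataclasses import dataclass


@dataclass
class Atomic:
    lexical: str      # assumed to hold dm:string-value of the atomic value
    type_name: str


@dataclass
class Text:
    content: str


@dataclass
class Attribute:
    name: str
    type_name: str
    value: list       # list of Atomic


@dataclass
class Element:
    name: str
    type_name: str
    content: list     # attributes, child nodes, or atomic values


@dataclass
class Document:
    content: list


def erase(value):
    """Return the erasure of a value (a list of items)."""
    result, run = [], []

    def flush():
        # A run of adjacent atomic values is a simple value: one text node.
        if run:
            result.append(Text(" ".join(a.lexical for a in run)))
            run.clear()

    for item in value:
        if isinstance(item, Atomic):
            run.append(item)
            continue
        flush()
        if isinstance(item, Element):
            result.append(Element(item.name, "xdt:untyped", erase(item.content)))
        elif isinstance(item, Attribute):
            erased = " ".join(a.lexical for a in item.value)
            result.append(Attribute(item.name, "xdt:untypedAtomic",
                                    [Atomic(erased, "xdt:untypedAtomic")]))
        elif isinstance(item, Document):
            result.append(Document(erase(item.content)))
        else:
            result.append(item)  # text nodes erase to themselves
    flush()
    return result
```

Applying `erase` to an already erased value returns an equal value, which matches the intuition that erasure only removes type information.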

E.1.5 Annotate

The annotate as judgment is used in the definition of the dynamic semantics of validation. The normative dynamic semantics of validation is specified in Section 3.13 Validate Expressions^XQ. The effect of the validate expression is equivalent to:

serialization of the data model, as described in [Data Model Serialization], followed by
parsing of the serialized value into the Infoset, followed by
validation of the Infoset into a Post-Schema Validated Infoset, as described in [Schema Part 1], followed by
construction of a new data model value, as described in [Data Model].

Annotation is the formal equivalent of schema validation of an Infoset value into the PSVI followed by construction of a new data model value. Because the Formal Semantics is defined on data model values, not the Infoset, annotation is applied to a data model value in which all element nodes are labeled with xdt:untyped and all attribute nodes with xdt:untypedAtomic -- that is, the result of erasure.

E.1.5.1 Simply annotate

Notation

The judgment

statEnv |- simply annotate as SimpleType ( SimpleValue₁ ) => SimpleValue₂

holds if the result of casting SimpleValue₁ to SimpleType is SimpleValue₂.

Semantics

This judgment is specified by the following rules.

Simply annotating a simple value to a union type yields the result of simply annotating the simple value to either the first or second type in the union. Note that simply annotating to the second type is attempted only if simply annotating to the first type fails.

statEnv |- simply annotate as SimpleType₁ (SimpleValue₁) => SimpleValue₂

statEnv |- simply annotate as SimpleType₁|SimpleType₂ (SimpleValue₁) => SimpleValue₂

statEnv |- (simply annotate as SimpleType₁ (SimpleValue₁) => SimpleValue₂) fails

statEnv |- simply annotate as SimpleType₂ (SimpleValue₁) => SimpleValue₂

statEnv |- simply annotate as SimpleType₁|SimpleType₂ (SimpleValue₁) => SimpleValue₂

The simple annotation rules for ?, +, * are similar.

statEnv |- simply annotate as SimpleType? ( () ) => ()

statEnv |- simply annotate as SimpleType (SimpleValue₁) => SimpleValue₂

statEnv |- simply annotate as SimpleType? (SimpleValue₁) => SimpleValue₂

statEnv |- simply annotate as SimpleType* ( () ) => ()

statEnv |- simply annotate as SimpleType (SimpleValue₁) => SimpleValue₁'

statEnv |- simply annotate as SimpleType* (SimpleValue₂) => SimpleValue₂'

statEnv |- simply annotate as SimpleType* (SimpleValue₁,SimpleValue₂) => SimpleValue₁',SimpleValue₂'

statEnv |- simply annotate as SimpleType (SimpleValue₁) => SimpleValue₁'

statEnv |- simply annotate as SimpleType* (SimpleValue₂) => SimpleValue₂'

statEnv |- simply annotate as SimpleType+ (SimpleValue₁,SimpleValue₂) => SimpleValue₁',SimpleValue₂'

Simply annotating an atomic value to xs:string yields its string representation.

statEnv |- simply annotate as xs:string (AtomicValue) => dm:string-value(AtomicValue)

Simply annotating an atomic value to xs:decimal yields the decimal that results from parsing its string representation.

statEnv |- simply annotate as xs:decimal (AtomicValue) => xs:decimal(dm:string-value(AtomicValue))

Similar rules are assumed for the rest of the 19 XML Schema primitive types.
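
The ordered treatment of union types and the occurrence rules can be sketched as follows. This Python fragment is illustrative only. The tuple encoding of types (`("union", t1, t2)` and `("occurs", t, indicator)`), the `AnnotationError` exception, and the `PRIMITIVES` table are assumptions of the sketch, and only xs:string and xs:decimal are covered. Python's `Decimal` also accepts lexical forms that xs:decimal does not, so this is not a faithful implementation of the XML Schema lexical spaces. The occurrence case annotates item by item, which assumes that the item type is atomic or a union of atomic types.

```python
from decimal import Decimal, InvalidOperation


class AnnotationError(Exception):
    """Raised when a simple value cannot be annotated as the requested type."""


# Hypothetical stand-ins for two of the primitive types.
PRIMITIVES = {"xs:string": str, "xs:decimal": Decimal}


def simply_annotate(simple_type, lexicals):
    """Annotate a list of untyped lexical forms as simple_type."""
    if isinstance(simple_type, str):  # an atomic type name
        if simple_type not in PRIMITIVES or len(lexicals) != 1:
            raise AnnotationError(f"cannot annotate {lexicals!r} as {simple_type}")
        try:
            return [PRIMITIVES[simple_type](lexicals[0])]
        except InvalidOperation as exc:
            raise AnnotationError(str(exc)) from exc
    kind = simple_type[0]
    if kind == "union":
        _, first, second = simple_type
        try:
            return simply_annotate(first, lexicals)
        except AnnotationError:
            # The second member is tried only if the first one fails.
            return simply_annotate(second, lexicals)
    if kind == "occurs":
        _, item_type, indicator = simple_type
        low, high = {"?": (0, 1), "*": (0, None), "+": (1, None)}[indicator]
        if len(lexicals) < low or (high is not None and len(lexicals) > high):
            raise AnnotationError("wrong number of items")
        return [typed for lexical in lexicals
                for typed in simply_annotate(item_type, [lexical])]
    raise AnnotationError(f"unknown type {simple_type!r}")
```

With the union `("union", "xs:decimal", "xs:string")`, the lexical form "1.5" is annotated as a decimal, whereas "abc" falls through to the string member.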

E.1.5.2 Nil-annotate

Notation

The judgment

statEnv |- nil-annotate as Nillable? Type ( Value₁ ) => Value₂

holds if it is possible to annotate value Value₁ as if it had the nillable type Type and Value₂ is the corresponding annotated value.

Semantics

This judgment is specified by the following rules.

If the type is not nillable, then the xsi:nil attribute must not appear in the value, and it must be possible to annotate value Value₁ as if it had the type Type.

Value₁ filter @xsi:nil => ()

statEnv |- annotate as Type ( Value₁ ) => Value₂

statEnv |- nil-annotate as Type ( Value₁ ) => Value₂

If the type is nillable, and the xsi:nil attribute does not appear or is false, then it must be possible to annotate value Value₁ as if it had the type Type.

Value₁ filter @xsi:nil => () or false

statEnv |- annotate as Type ( Value₁ ) => Value₂

statEnv |- nil-annotate as nillable Type ( Value₁ ) => Value₂

If the type is nillable, and the xsi:nil attribute is true, then it must be possible to annotate value Value₁ as if it had a type where the attributes in the type are kept and the element content of the type is ignored.

Value₁ filter @xsi:nil => true

statEnv |- annotate as AttributeAll ( Value₁ ) => Value₂

statEnv |- nil-annotate as nillable (AttributeAll, ElementContentType) ( Value₁ ) => Value₂
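
The case analysis performed by the three rules above can be summarized in a short sketch. The Python below is non-normative. A value is modelled as a list of `(kind, name, content)` triples, and the two callables `annotate` and `annotate_attributes` are hypothetical stand-ins for annotating against the full type and against its attribute part (AttributeAll) respectively. Treating both "true" and "1" as true is an assumption about the lexical forms of xsi:nil.

```python
def xsi_nil(value):
    """Model 'Value filter @xsi:nil': None if absent, else a boolean."""
    found = [content for kind, name, content in value
             if kind == "attribute" and name == "xsi:nil"]
    if not found:
        return None
    if len(found) > 1:
        raise ValueError("xsi:nil occurs more than once")
    return found[0] in ("true", "1")


def nil_annotate(nillable, annotate, annotate_attributes, value):
    """Dispatch to the right annotation according to nillability and xsi:nil."""
    nil = xsi_nil(value)
    if not nillable:
        # First rule: xsi:nil must not appear at all.
        if nil is not None:
            raise ValueError("xsi:nil is not allowed for a non-nillable type")
        return annotate(value)
    if nil:
        # Third rule: keep the attributes, ignore the element content type.
        return annotate_attributes(value)
    # Second rule: xsi:nil is absent or false.
    return annotate(value)
```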

E.1.5.3 Annotate

Notation

The judgment

statEnv |- annotate as Type ( Value₁ ) => Value₂

holds if it is possible to annotate value Value₁ as if it had type Type and Value₂ is the corresponding annotated value.

Note

Assume an XML Infoset instance X1 is validated against an XML Schema S, yielding PSVI instance X2. Then if X1 corresponds to Value₁ and S corresponds to Type and X2 corresponds to Value₂, the following should hold: annotate as Type ( Value₁ ) => Value₂.

Semantics

This judgment is specified by the following rules.

Annotating the empty sequence as the empty type yields the empty sequence.

statEnv |- annotate as () (()) => ()

Annotating a concatenation of values as a concatenation of types yields the concatenation of the annotated values.

statEnv |- annotate as Type₁ (Value₁) => Value₁'

statEnv |- annotate as Type₂ (Value₂) => Value₂'

statEnv |- annotate as Type₁,Type₂ (Value₁,Value₂) => Value₁',Value₂'

Annotating a value as a choice type yields the result of annotating the value as either the first or second type in the choice.

statEnv |- annotate as Type₁ (Value₁) => Value₂

statEnv |- annotate as Type₁|Type₂ (Value₁) => Value₂

statEnv |- annotate as Type₂ (Value₁) => Value₂

statEnv |- annotate as Type₁|Type₂ (Value₁) => Value₂

Annotating a value as an all group uses interleaving to decompose the original value and recompose the annotated value.

Editorial note
Jerome and Phil: Note that this may reorder the original sequence. Perhaps we should disallow such reordering. Specifying that formally is not as easy as we would like.

statEnv |- annotate as Type₁ ( Value₁ ) => Value₁'

statEnv |- annotate as Type₂ ( Value₂ ) => Value₂'

statEnv |- Value₁ interleave Value₂ yields Value

statEnv |- Value₁' interleave Value₂' yields Value'

statEnv |- annotate as Type₁ & Type₂ ( Value ) => Value'
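
One way to read the all-group rule operationally is as a search over the ways of splitting Value into two subsequences. The sketch below is an assumption-laden illustration, not the specification's algorithm: `annotate1` and `annotate2` are hypothetical callables that annotate against Type₁ and Type₂ and raise `ValueError` on failure, and the result is recomposed by plain concatenation, which is only one of the interleavings that the rule permits.

```python
from itertools import product


def annotate_all_group(annotate1, annotate2, value):
    """Annotate value against Type1 & Type2 by trying every decomposition."""
    for mask in product((True, False), repeat=len(value)):
        # Split value into two subsequences, each preserving item order,
        # so that value is an interleaving of part1 and part2.
        part1 = [item for item, first in zip(value, mask) if first]
        part2 = [item for item, first in zip(value, mask) if not first]
        try:
            out1, out2 = annotate1(part1), annotate2(part2)
        except ValueError:
            continue  # this decomposition does not match; try the next one
        # Concatenation is one valid interleaving of the annotated parts.
        return out1 + out2
    raise ValueError("no decomposition of the value matches the all group")
```

Because the recomposition here always places the first part before the second, the output may be ordered differently from the input, which is the reordering issue raised in the editorial note.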

The annotation rules for ?, +, * are similar.

statEnv |- annotate as (Type | empty)(Value₁) => Value₂

statEnv |- annotate as Type? (Value₁) => Value₂

statEnv |- annotate as Type (Value₁) => Value₁' statEnv |- annotate as Type* (Value₂) => Value₂'

statEnv |- annotate as Type+ (Value₁,Value₂) => (Value₁',Value₂')

statEnv |- annotate as Type* ( () ) => ()

statEnv |- annotate as Type (Value₁) => Value₁' statEnv |- annotate as Type* (Value₂) => Value₂'

statEnv |- annotate as Type* (Value₁,Value₂) => (Value₁',Value₂')

To annotate an element with no xsi:type attribute, first look up the element type, next resolve the resulting type reference, then annotate the value against the resolved type, and finally return a new element with the name of the original element, the resolved type name, and the annotated value.

Value filter @xsi:type => ()

statEnv |- ElementName name lookup ElementType yields Nillable? TypeReference

statEnv |- TypeReference resolves to TypeName { Type }

statEnv |- nil-annotate as Nillable? Type (Value) => Value'

statEnv |- annotate as ElementType ( element ElementName of type xs:anyType { Value } ) => element ElementName of type TypeName { Value' }

To annotate an element with an xsi:type attribute, define a type reference corresponding to the xsi:type. Look up the element type, yielding a type reference, and check that the xsi:type reference derives from this type reference. Resolve the xsi:type reference, then annotate the value against the resolved type, and finally return a new element with the name of the original element, the resolved type name, and the annotated value.

Value filter @xsi:type => TypeName

statEnv |- XsiTypeReference = of type TypeName

statEnv |- ElementName name lookup ElementType yields Nillable? of type BaseTypeName

statEnv |- TypeName derives from BaseTypeName

statEnv |- XsiTypeReference resolves to TypeName { Type }

statEnv |- nil-annotate as Nillable? Type (Value) => Value'

statEnv |- annotate as ElementType ( element ElementName of type xs:anyType { Value } ) => element ElementName of type TypeName { Value' }

The rule for attributes is similar to the first rule for elements.

statEnv |- AttributeName name lookup AttributeType yields TypeReference

statEnv |- TypeReference resolves to TypeName { Type }

statEnv |- nil-annotate as Nillable? Type (Value) => Value'

statEnv |- annotate as AttributeType ( attribute AttributeName of type xs:anySimpleType { Value } ) => attribute AttributeName of type TypeName { Value' }

Annotating a document node yields a document with the annotation of its contents.

statEnv |- annotate as Type (Value) => Value'

statEnv |- annotate as document { Type } ( document { Value } ) => document { Value' }

Annotating a text node as text yields itself.

statEnv |- annotate as text (text { String }) => text { String }

Annotating a text node as a simple type is identical to casting.

statEnv |- simply annotate as SimpleType ( String as xs:anySimpleType ) => SimpleValue'

statEnv |- annotate as SimpleType ( text { String } ) => SimpleValue'

Annotating a simple value as a simple type is identical to casting.

statEnv |- simply annotate as SimpleType ( SimpleValue ) => SimpleValue'

statEnv |- annotate as SimpleType ( SimpleValue ) => SimpleValue'

F Revision Log (Non-Normative)

This log records the changes that have been made to this document since the Last Call Working Draft of 3 June 2005.

F.1 15 September 2005

Completely removed the formal specification of error propagation and of which kinds of dynamic errors are raised.
Fixed static typing rules for fn:subsequence.
Numerous fixes to static typing rules for function calls, including overloaded functions in Appendix B.2.
Fixed bugs in auxiliary functions dealing with type promotion and atomization in the semantics of function calls.
Fixed handling of namespace "unbinding" under the namespaces rules for XML 1.1.
Fixed dynamic evaluation rules for literals.
Fixed static typing rules for document constructors.
Fixed a bug in the rule implementing the 'union interpretation' for derivation by extension.
Fixed bugs in the rules for module import, now dealing with multiple modules with the same namespace properly.
Fixed terminology for some aspects of the type system, and added clarification pointers in a number of places.
Fixed numerous bugs and typos, as a result of processing last call comments.
A few minor fixes to the core grammar (e.g., Constructor production was missing).

F.2 03 November 2005 (CR Draft)

Numerous improvements and clarifications in the preliminary section which introduces the formal semantics notations.
Complete refactoring of the formal semantics of function calls.
Always provide the normalization rule even for the trivial cases.
Default values in formal notations are only used in examples, not in inference rules anymore.
Removed the use of the confusing notation NonTerminal? in inference rules.
Fixes to the formal semantics of constructors.
Fixes to the semantics of global variables and function declarations.
Fixed numerous bugs and typos, as a result of processing last call comments.
Grammar productions now have a marker indicating when they correspond to the XQuery or the XPath grammar.

XQuery 1.0 and XPath 2.0 Formal Semantics

W3C Candidate Recommendation 3 November 2005
