3 Semantics, structure, and APIs of HTML documents

Every XML and HTML document in an HTML UA is represented by a Document object. [DOM]

The document's address is the URL associated with a Document (as defined in the DOM standard). It is initially set when the Document is created, but that can change during the lifetime of the Document; for example, it changes when the user navigates to a fragment identifier on the page and when the pushState() method is called with a new URL. [DOM]

Interactive user agents typically expose the document's address in their user interface. This is the primary mechanism by which a user can tell if a site is attempting to impersonate another.

The document's referrer is an absolute URL that can be set when the Document is created. If it is not explicitly set, then its value is the empty string.

Each Document object has a reload override flag that is originally unset. The flag is set by the document.open() and document.write() methods in certain situations. When the flag is set, the Document also has a reload override buffer which is a Unicode string that is used as the source of the document when it is reloaded.

When the user agent is to perform an overridden reload, given a source browsing context, it must act as follows:

3.1.1 The Document object

The DOM specification defines a Document interface, which this specification extends significantly:

3.1.2 Resource metadata management

In the case of HTTP, the referrer IDL attribute will match the Referer (sic) header that was sent when fetching the current page.

Typically user agents are configured to not report referrers in the case where the referrer uses an encrypted protocol and the current page does not (e.g. when navigating from an https: page to an http: page).
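The typical configuration described above can be sketched as follows. This is a non-normative illustration: the helper name `referrer_to_report` is hypothetical, and it models only the https: to http: case by comparing URL schemes.

```python
from urllib.parse import urlsplit


def referrer_to_report(referrer_url: str, current_url: str) -> str:
    """Return the referrer to expose, or "" when it is withheld.

    Hypothetical helper: it models only the downgrade case mentioned in the
    text (encrypted referrer, unencrypted current page).
    """
    referrer_scheme = urlsplit(referrer_url).scheme.lower()
    current_scheme = urlsplit(current_url).scheme.lower()
    if referrer_scheme == "https" and current_scheme != "https":
        return ""  # do not report the referrer across the downgrade
    return referrer_url
```

Returning the empty string mirrors the value the document's referrer has when it is not explicitly set.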

3.1.3 DOM tree accessors

The html element of a document is the document's root element, if there is one and it's an html element, or null otherwise.

The head element of a document is the first head element that is a child of the html element, if there is one, or null otherwise.

The title element of a document is the first title element in the document (in tree order), if there is one, or null otherwise.

The body element of a document is the first child of the html element that is either a body element or a frameset element. If there is no such element, it is null.
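The four accessors defined above can be sketched over a toy tree. This is a non-normative illustration: the `Element` class is a hypothetical stand-in for a DOM element, and namespaces are ignored for brevity.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional


@dataclass
class Element:
    """Hypothetical stand-in for a DOM element; not a real DOM API."""
    name: str
    children: List["Element"] = field(default_factory=list)


def descendants(node: Element) -> Iterator[Element]:
    """Yield the node and its descendants in tree order (preorder, depth-first)."""
    yield node
    for child in node.children:
        yield from descendants(child)


def html_element(root: Optional[Element]) -> Optional[Element]:
    # The root element, but only if it exists and is an html element.
    return root if root is not None and root.name == "html" else None


def head_element(root: Optional[Element]) -> Optional[Element]:
    html = html_element(root)
    if html is None:
        return None
    return next((c for c in html.children if c.name == "head"), None)


def title_element(root: Optional[Element]) -> Optional[Element]:
    # First title element anywhere in the document, not only inside head.
    if root is None:
        return None
    return next((e for e in descendants(root) if e.name == "title"), None)


def body_element(root: Optional[Element]) -> Optional[Element]:
    html = html_element(root)
    if html is None:
        return None
    return next((c for c in html.children if c.name in ("body", "frameset")), None)
```

Note that the body element can be a frameset element when one appears before any body element among the children of the html element.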

The Document interface supports named properties. The supported property names at any moment consist of the values of the name content attributes of all the applet, exposed embed, form, iframe, img, and exposed object elements in the Document that have non-empty name content attributes, and the values of the id content attributes of all the applet and exposed object elements in the Document that have non-empty id content attributes, and the values of the id content attributes of all the img elements in the Document that have both non-empty name content attributes and non-empty id content attributes. The supported property names must be in tree order, ignoring later duplicates, with values from id attributes coming before values from name attributes when the same element contributes both.

To determine the value of a named property name when the Document object is indexed for property retrieval, the user agent must return the value obtained using the following steps:

1. Let elements be the list of named elements with the name name in the Document.
   There will be at least one such element, by definition.
2. If elements has only one element, and that element is an iframe element, then return the WindowProxy object of the nested browsing context represented by that iframe element, and abort these steps.
3. Otherwise, if elements has only one element, return that element and abort these steps.
4. Otherwise return an HTMLCollection rooted at the Document node, whose filter matches only named elements with the name name.

Named elements with the name name, for the purposes of the above algorithm, are those that are either:

applet, exposed embed, form, iframe, img, or exposed object elements that have a name content attribute whose value is name, or
applet or exposed object elements that have an id content attribute whose value is name, or
img elements that have an id content attribute whose value is name, and that have a non-empty name content attribute present also.

An embed or object element is said to be exposed if it has no exposed object ancestor, and, for object elements, is additionally either not showing its fallback content or has no object or embed descendants.
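The supported property names and the named property lookup described above can be sketched as follows. This is a non-normative illustration under stated assumptions: the `Element` class is hypothetical, the `exposed` flag is assumed to be precomputed rather than derived from fallback content, a tuple stands in for the WindowProxy object, and a plain list stands in for the HTMLCollection.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterator, List


@dataclass
class Element:
    """Hypothetical stand-in for a DOM element."""
    tag: str
    attrs: Dict[str, str] = field(default_factory=dict)
    children: List["Element"] = field(default_factory=list)
    exposed: bool = True  # assumption: precomputed; only relevant to embed and object


NAME_TAGS = {"applet", "embed", "form", "iframe", "img", "object"}


def tree_order(node: Element) -> Iterator[Element]:
    yield node
    for child in node.children:
        yield from tree_order(child)


def contributed_names(el: Element) -> List[str]:
    """Names a single element contributes, id value before name value."""
    if el.tag in ("embed", "object") and not el.exposed:
        return []
    name = el.attrs.get("name", "")
    id_ = el.attrs.get("id", "")
    names = []
    # id counts for applet and exposed object, and for img only when name is non-empty.
    if id_ and (el.tag in ("applet", "object") or (el.tag == "img" and name)):
        names.append(id_)
    if name and el.tag in NAME_TAGS:
        names.append(name)
    return names


def supported_property_names(root: Element) -> List[str]:
    """Tree order, ignoring later duplicates."""
    seen: List[str] = []
    for el in tree_order(root):
        for n in contributed_names(el):
            if n not in seen:
                seen.append(n)
    return seen


def named_property(root: Element, name: str):
    elements = [el for el in tree_order(root) if name in contributed_names(el)]
    if len(elements) == 1:
        only = elements[0]
        if only.tag == "iframe":
            return ("WindowProxy", only)  # placeholder for the nested browsing context
        return only
    return elements  # placeholder for a live HTMLCollection
```

A real HTMLCollection is live and would reflect later changes to the document; the list returned here is only a snapshot.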

The dir attribute on the Document interface is defined along with the dir content attribute.

3.2 Elements

3.2.1 Semantics

Elements, attributes, and attribute values in HTML are defined (by this specification) to have certain meanings (semantics). For example, the ol element represents an ordered list, and the lang attribute represents the language of the content.

These definitions allow HTML processors, such as Web browsers or search engines, to present and use documents and applications in a wide variety of contexts that the author might not have considered.

Authors must not use elements, attributes, or attribute values for purposes other than their appropriate intended semantic purpose, as doing so prevents software from correctly processing the page.

Authors must not use elements, attributes, or attribute values that are not permitted by this specification or other applicable specifications, as doing so makes it significantly harder for the language to be extended in the future.

Through scripting and using other mechanisms, the values of attributes, text, and indeed the entire structure of the document may change dynamically while a user agent is processing it. The semantics of a document at an instant in time are those represented by the state of the document at that instant in time, and the semantics of a document can therefore change over time. User agents must update their presentation of the document as this occurs.

HTML has a progress element that describes a progress bar. If its "value" attribute is dynamically updated by a script, the UA would update the rendering to show the progress changing.

3.2.2 Elements in the DOM

The nodes representing HTML elements in the DOM must implement, and expose to scripts, the interfaces listed for them in the relevant sections of this specification. This includes HTML elements in XML documents, even when those documents are in another context (e.g. inside an XSLT transform).

Elements in the DOM represent things; that is, they have intrinsic meaning, also known as semantics.

The basic interface, from which all the HTML elements' interfaces inherit, and which must be used by elements that have no additional requirements, is the HTMLElement interface.

The HTMLElement interface holds methods and attributes related to a number of disparate features, and the members of this interface are therefore described in various different sections of this specification.

3.2.3 Element definitions

Each element in this specification has a definition that includes the following information:

This is then followed by a description of what the element represents, along with any additional normative conformance criteria that may apply to authors and implementations. Examples are sometimes also included.

3.2.3.1 Attributes

Except where otherwise specified, attributes on HTML elements may have any string value, including the empty string. Except where explicitly stated, there is no restriction on what text can be specified in such attributes.

3.2.4 Content models

Each element defined in this specification has a content model: a description of the element's expected contents. An HTML element must have contents that match the requirements described in the element's content model. The contents of an element are its children in the DOM, except for template elements, where the children are those in the template contents (a separate DocumentFragment assigned to the element when the element is created).

The space characters are always allowed between elements. User agents represent these characters between elements in the source markup as Text nodes in the DOM. Empty Text nodes and Text nodes consisting of just sequences of those characters are considered inter-element whitespace.

Inter-element whitespace, comment nodes, and processing instruction nodes must be ignored when establishing whether an element's contents match the element's content model or not, and must be ignored when following algorithms that define document and element semantics.

Thus, an element A is said to be preceded or followed by a second element B if A and B have the same parent node and there are no other element nodes or Text nodes (other than inter-element whitespace) between them. Similarly, a node is the only child of an element if that element contains no other nodes apart from inter-element whitespace, comment nodes, and processing instruction nodes.
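The definitions of inter-element whitespace and "only child" can be sketched as follows. This is a non-normative illustration: the `Node` class is hypothetical, and the set of space characters (U+0020, U+0009, U+000A, U+000C, U+000D) is taken from the definition given elsewhere in this specification.

```python
from dataclasses import dataclass
from typing import List

SPACE_CHARACTERS = " \t\n\f\r"  # U+0020, U+0009, U+000A, U+000C, U+000D


@dataclass
class Node:
    """Hypothetical stand-in for a DOM node."""
    kind: str       # "element", "text", "comment", or "pi"
    data: str = ""  # text data, or the element name for elements


def is_inter_element_whitespace(node: Node) -> bool:
    # Empty Text nodes qualify too: all() over an empty string is True.
    return node.kind == "text" and all(ch in SPACE_CHARACTERS for ch in node.data)


def is_ignored(node: Node) -> bool:
    return node.kind in ("comment", "pi") or is_inter_element_whitespace(node)


def is_only_child(children: List[Node], node: Node) -> bool:
    """True if every sibling of `node` is ignored for content-model purposes."""
    present = any(c is node for c in children)
    return present and all(is_ignored(c) for c in children if c is not node)
```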

Authors must not use HTML elements anywhere except where they are explicitly allowed, as defined for each element, or as explicitly required by other specifications. For XML compound documents, these contexts could be inside elements from other namespaces, if those elements are defined as providing the relevant contexts.

3.2.4.1 Kinds of content

Each element in HTML falls into zero or more categories that group elements with similar characteristics together. The following broad categories are used in this specification:

Some elements also fall into other categories, which are defined in other parts of this specification.

Sectioning content, heading content, phrasing content, embedded content, and interactive content are all types of flow content. Metadata is sometimes flow content. Metadata and interactive content are sometimes phrasing content. Embedded content is also a type of phrasing content, and sometimes is interactive content.

Other categories are also used for specific purposes, e.g. form controls are specified using a number of categories to define common requirements. Some elements have unique requirements and do not fit into any particular category.

3.2.4.1.1 Metadata content

Metadata content is content that sets up the presentation or behavior of the rest of the content, or that sets up the relationship of the document with other documents, or that conveys other "out of band" information.

Elements from other namespaces whose semantics are primarily metadata-related (e.g. RDF) are also metadata content.

3.2.4.1.2 Flow content

Most elements that are used in the body of documents and applications are categorized as flow content.

3.2.4.1.3 Sectioning content

3.2.4.1.4 Heading content

Heading content defines the header of a section (whether explicitly marked up using sectioning content elements, or implied by the heading content itself).

3.2.4.1.5 Phrasing content

Phrasing content is the text of the document, as well as elements that mark up that text at the intra-paragraph level. Runs of phrasing content form paragraphs.

Most elements that are categorized as phrasing content can only contain elements that are themselves categorized as phrasing content, not any flow content.

Text nodes and attribute values must consist of Unicode characters, must not contain U+0000 characters, must not contain permanently undefined Unicode characters (noncharacters), and must not contain control characters other than space characters. This specification includes extra constraints on the exact value of Text nodes and attribute values depending on their precise context.
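The character restrictions above can be sketched as a checker. This is a non-normative illustration: the function names are hypothetical, control characters are approximated by the Unicode general category Cc, and the context-dependent extra constraints are not modelled.

```python
import unicodedata

SPACE_CHARACTERS = " \t\n\f\r"  # U+0020, U+0009, U+000A, U+000C, U+000D


def is_noncharacter(code_point: int) -> bool:
    """U+FDD0..U+FDEF, plus the last two code points of every plane."""
    return 0xFDD0 <= code_point <= 0xFDEF or (code_point & 0xFFFE) == 0xFFFE


def text_problems(value: str) -> list:
    """Return (index, reason) pairs for each disallowed character."""
    problems = []
    for index, ch in enumerate(value):
        code_point = ord(ch)
        if code_point == 0:
            problems.append((index, "U+0000"))
        elif is_noncharacter(code_point):
            problems.append((index, "noncharacter"))
        elif unicodedata.category(ch) == "Cc" and ch not in SPACE_CHARACTERS:
            problems.append((index, "control character"))
    return problems
```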

3.2.4.1.6 Embedded content

Embedded content is content that imports another resource into the document, or content from another vocabulary that is inserted into the document.

Elements that are from namespaces other than the HTML namespace and that convey content but not metadata, are embedded content for the purposes of the content models defined in this specification. (For example, MathML, or SVG.)

Some embedded content elements can have fallback content: content that is to be used when the external resource cannot be used (e.g. because it is of an unsupported format). The element definitions state what the fallback is, if any.

3.2.4.1.7 Interactive content

Interactive content is content that is specifically intended for user interaction.

Certain elements in HTML have an activation behavior, which means that the user can activate them. This triggers a sequence of events dependent on the activation mechanism, and normally culminating in a click event, as described below.

3.2.4.1.8 Palpable content

As a general rule, elements whose content model allows any flow content or phrasing content should have at least one node in their contents that is palpable content and that does not have the hidden attribute specified.

This requirement is not a hard requirement, however, as there are many cases where an element can be empty legitimately, for example when it is used as a placeholder which will later be filled in by a script, or when the element is part of a template and would on most pages be filled in but on some pages is not relevant.

Conformance checkers are encouraged to provide a mechanism for authors to find elements that fail to fulfill this requirement, as an authoring aid.

3.2.4.1.9 Script-supporting elements

Script-supporting elements are those that do not represent anything themselves (i.e. they are not rendered), but are used to support scripts, e.g. to provide functionality for the user.

3.2.4.2 Transparent content models

Some elements are described as transparent; they have "transparent" in the description of their content model. The content model of a transparent element is derived from the content model of its parent element: the elements required in the part of the content model that is "transparent" are the same elements as required in the part of the content model of the parent of the transparent element in which the transparent element finds itself.

In some cases, where transparent elements are nested in each other, the process has to be applied iteratively.

When a transparent element has no parent, then the part of its content model that is "transparent" must instead be treated as accepting any flow content.
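The iterative resolution of a transparent content model can be sketched as follows. This is a non-normative illustration: the `Element` class is hypothetical, and the `CONTENT_MODEL` table is a heavily abridged assumption that reduces each content model to a single label.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Element:
    """Hypothetical stand-in for a DOM element."""
    tag: str
    parent: Optional["Element"] = None


# Abridged, illustrative table; real content models are richer than one label.
CONTENT_MODEL = {
    "p": "phrasing",
    "div": "flow",
    "a": "transparent",
    "ins": "transparent",
    "del": "transparent",
}


def effective_content_model(el: Element) -> str:
    """Walk up through nested transparent elements to the first concrete model."""
    current = el
    while CONTENT_MODEL[current.tag] == "transparent":
        if current.parent is None:
            return "flow"  # a parentless transparent element accepts any flow content
        current = current.parent
    return CONTENT_MODEL[current.tag]
```

For example, an a element inside an ins element inside a p element resolves to the phrasing content model of the p element.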

3.2.4.3 Paragraphs

The term paragraph as defined in this section is used for more than just the definition of the p element. The paragraph concept defined here is used to describe how to interpret documents. The p element is merely one of several ways of marking up a paragraph.

A paragraph is typically a run of phrasing content that forms a block of text with one or more sentences that discuss a particular topic, as in typography, but can also be used for more general thematic grouping. For instance, an address is also a paragraph, as is a part of a form, a byline, or a stanza in a poem.

Paragraphs in flow content are defined relative to what the document looks like without the a, ins, del, and map elements complicating matters, since those elements, with their hybrid content models, can straddle paragraph boundaries, as shown in the first two examples below.

Generally, having elements straddle paragraph boundaries is best avoided. Maintaining such markup can be difficult.

The p element can be used to wrap individual paragraphs when there would otherwise not be any content other than phrasing content to separate the paragraphs from each other.

3.2.5 Global attributes

The following attributes are common to and may be specified on all HTML elements (even those not defined in this specification):

To enable assistive technology products to expose a more fine-grained interface than is otherwise possible with HTML elements and attributes, a set of annotations for assistive technology products can be specified (the ARIA role and aria-* attributes). [ARIA]

The attributes marked with an asterisk have a different meaning when specified on body elements, because those elements expose event handlers of the Window object with the same names.

While these attributes apply to all elements, they are not useful on all elements. For example, only media elements will ever receive a volumechange event fired by the user agent.

Custom data attributes (e.g. data-foldername or data-msgid) can be specified on any HTML element, to store custom data specific to the page.

In HTML documents, elements in the HTML namespace may have an xmlns attribute specified, if, and only if, it has the exact value "http://www.w3.org/1999/xhtml". This does not apply to XML documents.

In HTML, the xmlns attribute has absolutely no effect. It is basically a talisman. It is allowed merely to make migration to and from XHTML mildly easier. When parsed by an HTML parser, the attribute ends up in no namespace, not the "http://www.w3.org/2000/xmlns/" namespace like namespace declaration attributes in XML do.

In XML, an xmlns attribute is part of the namespace declaration mechanism, and an element cannot actually have an xmlns attribute in no namespace specified.

The XML specification also allows the use of the xml:space attribute in the XML namespace on any element in an XML document. This attribute has no effect on HTML elements, as the default behavior in HTML is to preserve whitespace. [XML]

There is no way to serialize the xml:space attribute on HTML elements in the text/html syntax.

3.2.5.1 The id attribute

The value must be unique amongst all the IDs in the element's home subtree and must contain at least one character. The value must not contain any space characters.

There are no other restrictions on what form an ID can take; in particular, IDs can consist of just digits, start with a digit, start with an underscore, consist of just punctuation, etc.
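The requirements on id values can be sketched as a checker. This is a non-normative illustration: the function name `id_problems` is hypothetical, and the caller is assumed to have collected all id values from a single home subtree.

```python
from collections import Counter

SPACE_CHARACTERS = " \t\n\f\r"  # U+0020, U+0009, U+000A, U+000C, U+000D


def id_problems(ids: list) -> list:
    """Return (value, reason) pairs for id values that break the rules above."""
    counts = Counter(ids)
    problems = []
    for value in counts:  # Counter keeps first-seen order
        if value == "":
            problems.append((value, "empty"))
        elif any(ch in SPACE_CHARACTERS for ch in value):
            problems.append((value, "contains a space character"))
        if counts[value] > 1:
            problems.append((value, "duplicate"))
    return problems
```

Values such as "42", "_x", and "!?" pass this check, in line with the absence of any other restriction on the form of an ID.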

An element's unique identifier can be used for a variety of purposes, most notably as a way to link to specific parts of a document using fragment identifiers, as a way to target an element when scripting, and as a way to style a specific element from CSS.

3.2.5.2 The title attribute

The title attribute represents advisory information for the element, such as would be appropriate for a tooltip. On a link, this could be the title or a description of the target resource; on an image, it could be the image credit or a description of the image; on a paragraph, it could be a footnote or commentary on the text; on a citation, it could be further information about the source; on interactive content, it could be a label for, or instructions for, use of the element; and so forth. The value is text.

Relying on the title attribute is currently discouraged as many user agents do not expose the attribute in an accessible manner as required by this specification (e.g. requiring a pointing device such as a mouse to cause a tooltip to appear, which excludes keyboard-only users and touch-only users, such as anyone with a modern phone or tablet).

If this attribute is omitted from an element, then it implies that the title attribute of the nearest ancestor HTML element with a title attribute set is also relevant to this element. Setting the attribute overrides this, explicitly stating that the advisory information of any ancestors is not relevant to this element. Setting the attribute to the empty string indicates that the element has no advisory information.

If the title attribute's value contains "LF" (U+000A) characters, the content is split into multiple lines. Each "LF" (U+000A) character represents a line break.
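The inheritance and line-splitting behavior of the title attribute can be sketched as follows. This is a non-normative illustration: the `Element` class and the function name are hypothetical, and every ancestor is assumed to be an HTML element.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Element:
    """Hypothetical stand-in for an HTML element."""
    attrs: Dict[str, str] = field(default_factory=dict)
    parent: Optional["Element"] = None


def advisory_information(el: Element) -> List[str]:
    """Return the advisory information as a list of lines (empty if none)."""
    current: Optional[Element] = el
    while current is not None:
        if "title" in current.attrs:
            value = current.attrs["title"]
            # An empty title explicitly states that there is no advisory information.
            return value.split("\n") if value else []
        current = current.parent
    return []
```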

Some elements, such as link, abbr, and input, define additional semantics for the title attribute beyond the semantics described above.

3.2.5.3 The lang and xml:lang attributes

The lang attribute (in no namespace) specifies the primary language for the element's contents and for any of the element's attributes that contain text. Its value must be a valid BCP 47 language tag, or the empty string. Setting the attribute to the empty string indicates that the primary language is unknown. [BCP47]

If these attributes are omitted from an element, then the language of this element is the same as the language of its parent element, if any.

Authors must not use the lang attribute in the XML namespace on HTML elements in HTML documents. To ease migration to and from XHTML, authors may specify an attribute in no namespace with no prefix and with the literal localname "xml:lang" on HTML elements in HTML documents, but such attributes must only be specified if a lang attribute in no namespace is also specified, and both attributes must have the same value when compared in an ASCII case-insensitive manner.

The attribute in no namespace with no prefix and with the literal localname "xml:lang" has no effect on language processing.
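The authoring requirement on the no-namespace "xml:lang" attribute can be sketched as a checker. This is a non-normative illustration: the function names are hypothetical, and `attrs` is assumed to hold the no-namespace attributes of one HTML element in an HTML document.

```python
def ascii_lower(value: str) -> str:
    """Lowercase only A-Z, as an ASCII case-insensitive comparison requires."""
    return "".join(chr(ord(c) + 32) if "A" <= c <= "Z" else c for c in value)


def xml_lang_problems(attrs: dict) -> list:
    """Return a list of problems with the xml:lang attribute, if any."""
    if "xml:lang" not in attrs:
        return []
    if "lang" not in attrs:
        return ["xml:lang specified without lang"]
    if ascii_lower(attrs["xml:lang"]) != ascii_lower(attrs["lang"]):
        return ["xml:lang and lang differ"]
    return []
```

A dedicated `ascii_lower` helper is used instead of `str.lower()` because the latter also changes non-ASCII characters.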

3.2.5.4 The translate attribute

The translate attribute is an enumerated attribute that is used to specify whether an element's attribute values and the values of its Text node children are to be translated when the page is localized, or whether to leave them unchanged.

The attribute's keywords are the empty string, yes, and no. The empty string and the yes keyword map to the yes state. The no keyword maps to the no state. In addition, there is a third state, the inherit state, which is the missing value default (and the invalid value default).

When an element is in the translate-enabled state, the element's translatable attributes and the values of its Text node children are to be translated when the page is localized. Attributes of the element that are not listed as translatable attributes should not be translated.

When an element is in the no-translate state, the element's attribute values (including the values of translatable attributes) and the values of its Text node children are to be left as-is when the page is localized, e.g. because the element contains a person's name or the name of a computer program.
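The keyword-to-state mapping of the translate attribute can be sketched as follows. This is a non-normative illustration: the `Element` class and function names are hypothetical. The rule that the inherit state takes the parent's translation mode, and that a parentless element defaults to translate-enabled, is not stated in the text above; it is an assumption based on the full definition of translation mode in the HTML specification.

```python
from dataclasses import dataclass
from typing import Optional


def ascii_lower(value: str) -> str:
    return "".join(chr(ord(c) + 32) if "A" <= c <= "Z" else c for c in value)


def translate_state(value: Optional[str]) -> str:
    """Map the attribute value (None when absent) to yes, no, or inherit."""
    if value is None:
        return "inherit"           # missing value default
    keyword = ascii_lower(value)   # enumerated keywords are ASCII case-insensitive
    if keyword in ("", "yes"):
        return "yes"
    if keyword == "no":
        return "no"
    return "inherit"               # invalid value default


@dataclass
class Element:
    """Hypothetical stand-in for an HTML element."""
    translate: Optional[str] = None
    parent: Optional["Element"] = None


def translation_mode(el: Element) -> str:
    state = translate_state(el.translate)
    if state == "yes":
        return "translate-enabled"
    if state == "no":
        return "no-translate"
    # Assumption (see lead-in): inherit from the parent, else translate-enabled.
    if el.parent is not None:
        return translation_mode(el.parent)
    return "translate-enabled"
```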

3.2.5.5 The xml:base attribute (XML only)

3.2.5.6 The dir attribute

The dir attribute specifies the element's text directionality. The attribute is an enumerated attribute with the following keywords and states:

The directionality of an element (any element, not just an HTML element) is either 'ltr' or 'rtl', and is determined as per the first appropriate set of steps from the following list:

Since the dir attribute is only defined for HTML elements, it cannot be present on elements from other namespaces. Thus, elements from other namespaces always just inherit their directionality from their parent element, or, if they don't have one, default to 'ltr'.

The directionality of an attribute of an HTML element, which is used when the text of that attribute is to be included in the rendering in some manner, is determined as per the first appropriate set of steps from the following list:

Authors are strongly encouraged to use the dir attribute to indicate text direction rather than using CSS, since that way their documents will continue to render correctly even in the absence of CSS (e.g. as interpreted by search engines).

3.2.5.7 The class attribute

The attribute, if specified, must have a value that is a set of space-separated tokens representing the various classes that the element belongs to.

Assigning classes to an element affects class matching in selectors in CSS, the getElementsByClassName() method in the DOM, and other such features.

There are no additional restrictions on the tokens authors can use in the class attribute, but authors are encouraged to use values that describe the nature of the content, rather than values that describe the desired presentation of the content.
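Splitting a class attribute value into its tokens can be sketched as follows. This is a non-normative illustration: the function name is hypothetical, and tokens are split only on the space characters (U+0020, U+0009, U+000A, U+000C, U+000D) rather than on every Unicode whitespace character.

```python
import re

# Only the five space characters separate tokens; U+00A0, for example, does not.
_SPACE_RUN = re.compile(r"[ \t\n\f\r]+")


def class_tokens(value: str) -> list:
    """Return the tokens of a class attribute value, in source order."""
    return [token for token in _SPACE_RUN.split(value) if token]
```

`str.split()` without arguments is deliberately avoided here, because it would also split on characters such as U+00A0 NO-BREAK SPACE.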

3.2.5.8 The style attribute

Documents that use style attributes on any of their elements must still be comprehensible and usable if those attributes were removed.

In particular, using the style attribute to hide and show content, or to convey meaning that is otherwise not included in the document, is non-conforming. (To hide and show content, use the hidden attribute.)

3.2.5.9 Embedding custom non-visible data with the data-* attributes

A custom data attribute is an attribute in no namespace whose name starts with the string "data-", has at least one character after the hyphen, is XML-compatible, and contains no uppercase ASCII letters.

All attribute names on HTML elements in HTML documents get ASCII-lowercased automatically, so the restriction on ASCII uppercase letters doesn't affect such documents.
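The naming rule for custom data attributes can be sketched as a predicate. This is a non-normative illustration: the function name is hypothetical, and XML-compatibility is only approximated here by rejecting colons and space characters, which is weaker than checking the XML Name production.

```python
def is_custom_data_attribute_name(name: str) -> bool:
    """Approximate check of the custom data attribute naming rule."""
    prefix = "data-"
    if not name.startswith(prefix) or len(name) == len(prefix):
        return False  # needs at least one character after the hyphen
    if any("A" <= ch <= "Z" for ch in name):
        return False  # no uppercase ASCII letters
    # Assumption: simplified stand-in for the XML-compatibility requirement.
    return ":" not in name and not any(ch in " \t\n\f\r" for ch in name)
```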

Custom data attributes are intended to store custom data private to the page or application, for which there are no more appropriate attributes or elements.

These attributes are not intended for use by software that is independent of the site that uses the attributes.

Authors should carefully design such extensions so that when the attributes are ignored and any associated CSS dropped, the page is still usable.

JavaScript libraries may use the custom data attributes, as they are considered to be part of the page on which they are used. Authors of libraries that are reused by many authors are encouraged to include their name in the attribute names, to reduce the risk of clashes. Where it makes sense, library authors are also encouraged to make the exact name used in the attribute names customizable, so that libraries whose authors unknowingly picked the same name can be used on the same page, and so that multiple versions of a particular library can be used on the same page even when those versions are not mutually compatible.

3.2.6 Requirements relating to the bidirectional algorithm

3.2.6.1 Authoring conformance criteria for bidirectional-algorithm formatting characters

Text content in HTML elements with Text nodes in their contents, and text in attributes of HTML elements that allow free-form text, may contain characters in the ranges U+202A to U+202E and U+2066 to U+2069 (the bidirectional-algorithm formatting characters). However, the use of these characters is restricted so that any embedding or overrides generated by these characters do not start and end with different parent elements, and so that all such embeddings and overrides are explicitly terminated by a U+202C POP DIRECTIONAL FORMATTING character. This helps reduce incidences of text being reused in a manner that has unforeseen effects on the bidirectional algorithm. [BIDI]

Any strings that, as described above, are bidirectional-algorithm formatting character ranges must match the string production in the following ABNF, the character set for which is Unicode. [ABNF]

While the U+2069 POP DIRECTIONAL ISOLATE character implicitly also ends open embeddings and overrides, text that relies on this implicit scope closure is not conforming to this specification. All strings of embeddings, overrides, and isolations need to be explicitly terminated to conform to this section's requirements.
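The explicit-termination requirement can be sketched as a stack-based check over a single run of text. This is a non-normative illustration based on the prose above: the function name is hypothetical, and the caller is assumed to pass the text of one parent element at a time so that embeddings cannot span elements.

```python
# Each opening character maps to the character that must explicitly close it.
PDF = "\u202C"  # POP DIRECTIONAL FORMATTING
PDI = "\u2069"  # POP DIRECTIONAL ISOLATE
OPENERS = {
    "\u202A": PDF,  # LRE
    "\u202B": PDF,  # RLE
    "\u202D": PDF,  # LRO
    "\u202E": PDF,  # RLO
    "\u2066": PDI,  # LRI
    "\u2067": PDI,  # RLI
    "\u2068": PDI,  # FSI
}
CLOSERS = {PDF, PDI}


def bidi_formatting_is_balanced(text: str) -> bool:
    """True if every embedding, override, and isolate is explicitly closed."""
    expected = []  # stack of closing characters still owed
    for ch in text:
        if ch in OPENERS:
            expected.append(OPENERS[ch])
        elif ch in CLOSERS:
            # A closer must match the innermost open scope exactly, so a PDI
            # that implicitly ends an open embedding is rejected.
            if not expected or expected.pop() != ch:
                return False
    return not expected
```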

Authors are encouraged to use the dir attribute, the bdo element, and the bdi element, rather than maintaining the bidirectional-algorithm formatting characters manually. The bidirectional-algorithm formatting characters interact poorly with CSS.

3.2.7 WAI-ARIA

Authors may use the ARIA role and aria-* attributes on HTML elements, in accordance with the requirements described in the ARIA specifications, except where these conflict with the strong native semantics described below. These exceptions are intended to prevent authors from making assistive technology products report nonsensical states that do not represent the actual state of the document. [ARIA]

3.2.7.1 ARIA Role Attribute

The attribute, if specified, must have a value that is a set of space-separated tokens representing the various WAI-ARIA roles that the element belongs to.

3.2.7.2 State and Property Attributes

These attributes, if specified, must have a value that is the ARIA value type in the "Value" field of the definition for the state or property, mapped to the appropriate HTML value type according to [ARIA] Section 10.2 Mapping WAI-ARIA Value types to languages using the HTML 5 mapping.

ARIA State and Property attributes can be used on any element. They are not always meaningful, however, and in such cases user agents might not perform any processing aside from including them in the DOM. State and property attributes are processed according to the requirements of the sections Strong Native Semantics and Implicit ARIA semantics, as well as [ARIA] and [ARIAIMPL].

3.2.7.3 Strong Native Semantics

The following table defines the strong native semantics and corresponding default implicit ARIA semantics that apply to HTML elements. Each language feature (element or attribute) in a cell in the first column implies the ARIA semantics (any role, states, and properties) given in the cell in the second column of the same row. When multiple rows apply to an element, the role from the last row to define a role must be applied, and the states and properties from all the rows must be combined.

Documents must not use any role values with elements in the following table other than the corresponding role value (if any) as listed for that element in the second column, or the role value "presentation", if the second column indicates that element's semantics can be removed by using the "presentation" role value.

In the majority of cases setting an ARIA role and/or aria-* attribute that matches the default implicit ARIA semantics is unnecessary and not recommended as these properties are already set by the browser.
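The row-combination rule and the restriction on role values can be sketched as follows. This is a non-normative illustration: the function names and the row representation are hypothetical, and a row that states no role is represented with `None`.

```python
from typing import Dict, List, Optional, Tuple

Row = Tuple[Optional[str], Dict[str, str]]  # (role or None, states and properties)


def combine_rows(rows: List[Row]) -> Row:
    """Combine all applicable rows, given in table order."""
    role: Optional[str] = None
    states: Dict[str, str] = {}
    for row_role, row_states in rows:
        if row_role is not None:
            role = row_role        # the last row to define a role wins
        states.update(row_states)  # states and properties accumulate
    return role, states


def role_is_permitted(author_role: str, implied_role: Optional[str],
                      removable: bool) -> bool:
    """Check an author-supplied role against one table entry."""
    allowed = set()
    if implied_role is not None:
        allowed.add(implied_role)
    if removable:
        allowed.add("presentation")  # only where the table says semantics may be removed
    return author_role in allowed
```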

Language feature	Strong native semantics and default implicit ARIA semantics
`area` element that creates a hyperlink	`link` role
`base` element	No role
`datalist` element	`listbox` role, with the `aria-multiselectable` property set to "false"
`details` element	`aria-expanded` state set to "true" if the element's `open` attribute is present, and set to "false" otherwise
`dialog` element without an `open` attribute	The `aria-hidden` state set to "true"
`fieldset` element	`group` role (semantics may be removed by using the `presentation` role)
`footer` element that is not a descendant of an `article` or `section` element.	`contentinfo` role (semantics may be removed by using the `presentation` role)
`head` element	No role
`header` element that is not a descendant of an `article` or `section` element.	`banner` role (semantics may be removed by using the `presentation` role)
`hr` element	`separator` role (semantics may be removed by using the `presentation` role)
`html` element	No role
`img` element whose `alt` attribute's value is empty	`presentation` role
`img` element whose `alt` attribute's value is empty and whose `usemap` attribute has a valid hash-name reference to a `map` element.	`img` role
`input` element with a `type` attribute in the Checkbox state	`aria-checked` state set to "mixed" if the element's `indeterminate` IDL attribute is true, or "true" if the element's checkedness is true, or "false" otherwise
`input` element with a `type` attribute in the Color state	No role
`input` element with a `type` attribute in the Date state	No role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the Date and Time state	No role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the Local Date and Time state	No role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the E-mail state with no suggestions source element	`textbox` role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the File Upload state	No role
`input` element with a `type` attribute in the Hidden state	No role
`input` element with a `type` attribute in the Month state	No role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the Number state	`spinbutton` role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute, the `aria-valuemax` property set to the element's maximum, the `aria-valuemin` property set to the element's minimum, and, if the result of applying the rules for parsing floating-point number values to the element's value is a number, with the `aria-valuenow` property set to that number
`input` element with a `type` attribute in the Password state	`textbox` role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the Radio Button state	`aria-checked` state set to "true" if the element's checkedness is true, or "false" otherwise
`input` element with a `type` attribute in the Range state	`slider` role, with the `aria-valuemax` property set to the element's maximum, the `aria-valuemin` property set to the element's minimum, and the `aria-valuenow` property set to the result of applying the rules for parsing floating-point number values to the element's value, if that results in a number, or the default value otherwise
`input` element with a `type` attribute in the Reset Button state	`button` role
`input` element with a `type` attribute in the Search state with no suggestions source element	`textbox` role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the Submit Button state	`button` role
`input` element with a `type` attribute in the Telephone state with no suggestions source element	`textbox` role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the Text state with no suggestions source element	`textbox` role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the Text, Search, Telephone, URL, or E-mail states with a suggestions source element	`combobox` role, with the `aria-owns` property set to the same value as the `list` attribute, and the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the Time state	No role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the URL state with no suggestions source element	`textbox` role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`input` element with a `type` attribute in the Week state	No role, with the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`keygen` element	No role
`label` element	No role
`link` element that creates a hyperlink	`link` role
`main` element	`main` role (semantics may be removed by using the `presentation` role)
`map` element	No role
`menu` element with a `type` attribute in the popup menu state	No role
`meta` element	No role
`meter` element	No role
`nav` element	`navigation` role (semantics may be removed by using the `presentation` role)
`noscript` element	No role
`optgroup` element	No role
`option` element that is in a list of options	`aria-selected` and `aria-checked` states set to "true" if the element's selectedness is true, or "false" otherwise
`option` element that represents a suggestion in a `datalist` element or that is in a list of options of a `select` element with a `multiple` attribute or a display size greater than 1	`option` role
`param` element	No role
`progress` element	`progressbar` role, with, if the progress bar is determinate, the `aria-valuemax` property set to the maximum value of the progress bar, the `aria-valuemin` property set to zero, and the `aria-valuenow` property set to the current value of the progress bar
`script` element	No role
`select` element with a `multiple` attribute	`listbox` role, with the `aria-multiselectable` property set to "true"
`select` element with no `multiple` attribute and with a display size equal to 1	`aria-multiselectable` property set to "false"
`select` element with no `multiple` attribute and with a display size greater than 1	`listbox` role, with the `aria-multiselectable` property set to "false"
`select` element with a `required` attribute	The `aria-required` state set to "true"
`source` element	No role
`style` element	No role
`summary` element	No role
`template` element	No role
`textarea` element	`textbox` role, with the `aria-multiline` property set to "true", and the `aria-readonly` property set to "true" if the element has a `readonly` attribute
`title` element	No role
`track` element	No role
Element that is disabled	The `aria-disabled` state set to "true"
Element that is inert	The `aria-disabled` state set to "true"
Element that is a candidate for constraint validation but that does not satisfy its constraints	The `aria-invalid` state set to "true"
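
As an informal illustration of how the rows above derive ARIA states from element state, the following Python sketch computes the `aria-checked` value for the Checkbox row and the `aria-valuenow` value for the Range row. The function names are hypothetical, and `parse_float` is only a simplified stand-in for the rules for parsing floating-point number values, not an implementation of them.

```python
import math
from typing import Optional


def checkbox_aria_checked(indeterminate: bool, checked: bool) -> str:
    """aria-checked for an input in the Checkbox state (hypothetical helper)."""
    if indeterminate:
        return "mixed"
    return "true" if checked else "false"


def parse_float(value: str) -> Optional[float]:
    """Simplified stand-in for the spec's floating-point parsing rules.

    Assumption: Python's float() is close enough for illustration; the real
    rules differ in detail (for example, they never yield infinity or NaN).
    """
    try:
        number = float(value.strip())
    except ValueError:
        return None
    return number if math.isfinite(number) else None


def range_aria_valuenow(value: str, default: float) -> float:
    """aria-valuenow for an input in the Range state: the parsed value if it
    is a number, or the default value otherwise."""
    parsed = parse_float(value)
    return parsed if parsed is not None else default
```

For example, `checkbox_aria_checked(True, True)` yields "mixed" because the `indeterminate` IDL attribute takes precedence over checkedness.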

3.2.7.4 Implicit ARIA Semantics

Some HTML elements have native semantics that can be overridden. The following table lists these elements and their default implicit ARIA semantics, along with the restrictions that apply to those elements. Each language feature (element or attribute) in a cell in the first column implies, unless otherwise overridden, the ARIA semantic (role, state, or property) given in the cell in the second column of the same row, but this semantic may be overridden under the conditions listed in the cell in the third column of that row.

Language feature	Default implicit ARIA semantic	Restrictions
`a` element that creates a hyperlink	`link` role	If specified, role must be one of the following: `link`, `button`, `checkbox`, `menuitem`, `menuitemcheckbox`, `menuitemradio`, `tab`, or `treeitem`
`address` element	No role	If specified, role must be `contentinfo`
`article` element	`article` role	If specified, role must be one of the following: `article`, `document`, `application`, or `main`
`aside` element	`complementary` role	If specified, role must be one of the following: `complementary`, `note`, `search` or `presentation`
`audio` element	No role	If specified, role must be `application`
`body` element	`document` role	If specified, role must be either `document` or `application`
`button` element	`button` role	If specified, role must be one of the following: `button`, `link`, `menuitem`, `menuitemcheckbox`, `menuitemradio` or `radio`
`details` element	`group` role	If specified, role must be a role that supports `aria-expanded`
`dialog` element	`dialog` role	If specified, role must be one of the following: `alert`, `alertdialog`, `application`, `contentinfo`, `dialog`, `document`, `log`, `main`, `marquee`, `region`, `search`, or `status`
`embed` element	No role	If specified, role must be one of the following: `application`, `document`, `img` or `presentation`
`h1` element	`heading` role, with the `aria-level` property set to the element's outline depth	If specified, role must be one of the following: `heading`, `tab` or `presentation`
`h2` element	`heading` role, with the `aria-level` property set to the element's outline depth	If specified, role must be one of the following: `heading`, `tab` or `presentation`
`h3` element	`heading` role, with the `aria-level` property set to the element's outline depth	If specified, role must be one of the following: `heading`, `tab` or `presentation`
`h4` element	`heading` role, with the `aria-level` property set to the element's outline depth	If specified, role must be one of the following: `heading`, `tab` or `presentation`
`h5` element	`heading` role, with the `aria-level` property set to the element's outline depth	If specified, role must be one of the following: `heading`, `tab` or `presentation`
`h6` element	`heading` role, with the `aria-level` property set to the element's outline depth	If specified, role must be one of the following: `heading`, `tab` or `presentation`
`iframe` element	No role	If specified, role must be one of the following: `application`, `document`, `img`, or `presentation`
`img` element whose `alt` attribute's value is absent	`img` role	No restrictions
`img` element whose `alt` attribute's value is present and not empty	`img` role	No restrictions
`input` element with a `type` attribute in the Button state	`button` role	If specified, role must be one of the following: `button`, `link`, `menuitem`, `menuitemcheckbox`, `menuitemradio` or `radio`
`input` element with a `type` attribute in the Checkbox state	`checkbox` role	If specified, role must be either `checkbox` or `menuitemcheckbox`
`input` element with a `type` attribute in the Image Button state	`button` role	If specified, role must be one of the following: `button`, `link`, `menuitem`, `menuitemcheckbox`, `menuitemradio` or `radio`
`input` element with a `type` attribute in the Radio Button state	`radio` role	If specified, role must be either `radio` or `menuitemradio`
`input`, `select` or `textarea` element with a `required` attribute	The `aria-required` state set to "true"	If specified, the `aria-required` state must be set to "true"
`input`, `select` or `textarea` element without a `required` attribute	The `aria-required` state set to "false"	If specified, the `aria-required` state must be set to "true" or "false"
`li` element whose parent is an `ol` or `ul` element	`listitem` role	If specified, role must be one of the following: `listitem`, `menuitem`, `menuitemcheckbox`, `menuitemradio`, `option`, `tab`, `treeitem`, or `presentation`
`menu` element with a `type` attribute in the toolbar state	`toolbar` role	If specified, role must be one of the following: `directory`, `list`, `listbox`, `menu`, `menubar`, `tablist`, `toolbar`, `tree`, or `presentation`
`object` element	No role	If specified, role must be one of the following: `application`, `document`, `img`, or `presentation`
`ol` element	`list` role	If specified, role must be one of the following: `directory`, `group`, `list`, `listbox`, `menu`, `menubar`, `tablist`, `toolbar`, `tree`, or `presentation`
`option` element that is in a list of options of a `select` element with no `multiple` attribute and with a display size equal to 1	`option` role	If specified, role must be one of the following: `option`, `menuitem`, `menuitemradio`, or `separator`
`output` element	`status` role	No restrictions
`section` element	`region` role Note: It is strongly recommended that user agents such as screen readers convey the presence of, and provide navigation for, `section` elements only when the `section` element has an accessible name.	If specified, role must be one of the following: `alert`, `alertdialog`, `application`, `contentinfo`, `dialog`, `document`, `log`, `main`, `marquee`, `region`, `search`, `status`, or `presentation`
`select` element with no `multiple` attribute and with a display size equal to 1	`listbox` role	If specified, role must be either `listbox` or `menu`
`ul` element	`list` role	If specified, role must be one of the following: `directory`, `group`, `list`, `listbox`, `menu`, `menubar`, `tablist`, `toolbar`, `tree`, or `presentation`
`video` element	No role	If specified, role must be `application`
Element with a `hidden` attribute	The `aria-hidden` state set to "true"	If specified, the `aria-hidden` state must be set to "true" or "false"
Element without a `hidden` attribute	The `aria-hidden` state set to "false"	If specified, the `aria-hidden` state must be set to "true" or "false"

The entry "No role", when used as a strong native semantic, means that no role can be used and that the user agent has no default mapping to ARIA roles. When used as a default implicit ARIA semantic, it means only that the user agent has no default mapping to ARIA roles. In both cases, the user agent could still have its own mappings to the accessibility layer.
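
The relationship between a default implicit ARIA semantic and its restrictions can be sketched as a lookup. The Python example below copies a handful of rows from the table above into a dictionary and resolves the role that applies to an element; the names `IMPLICIT_SEMANTICS` and `effective_role` are hypothetical, `None` stands for the "No role" entry, and the sketch covers only the listed rows rather than the full table.

```python
from typing import Optional

# element name -> (default implicit role, roles an author may specify).
# Rows taken from the implicit ARIA semantics table; None means "No role".
IMPLICIT_SEMANTICS = {
    "article": ("article", {"article", "document", "application", "main"}),
    "aside": ("complementary", {"complementary", "note", "search", "presentation"}),
    "body": ("document", {"document", "application"}),
    "address": (None, {"contentinfo"}),
    "video": (None, {"application"}),
}


def effective_role(element: str, specified: Optional[str] = None) -> Optional[str]:
    """Return the role that applies to `element`.

    Without a specified role the default implicit role applies. A specified
    role overrides the default only if the restrictions column permits it;
    this sketch treats anything else as an authoring error.
    """
    default_role, allowed = IMPLICIT_SEMANTICS[element]
    if specified is None:
        return default_role
    if specified not in allowed:
        raise ValueError(f"role {specified!r} is not permitted on <{element}>")
    return specified
```

Under these assumptions, `effective_role("aside")` gives `complementary`, `effective_role("aside", "note")` gives `note`, and `effective_role("body", "button")` raises an error because `button` is not among the roles permitted on `body`.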

3.2.7.5 Allowed ARIA roles, states and properties

This section is non-normative.

The following table provides an informative reference to the ARIA roles, states and properties permitted for use in HTML. All ARIA roles, states and properties are normatively defined in the [ARIA] specification. Links to ARIA roles, states and properties in the table reference the normative [ARIA] definitions.

ARIA Roles, States and Properties
Role	Description	Required Properties	Supported Properties
any	ARIA global states and properties can be used on any HTML element.	none	`aria-atomic` `aria-busy (state)` `aria-controls` `aria-describedby` `aria-disabled (state)` `aria-dropeffect` `aria-flowto` `aria-grabbed (state)` `aria-haspopup` `aria-hidden (state)` `aria-invalid (state)` `aria-label` `aria-labelledby` `aria-live` `aria-owns` `aria-relevant`
`alert`	A message with important, and usually time-sensitive, information. See related `alertdialog` and `status`.	none	`aria-expanded (state)`
`alertdialog`	A type of dialog that contains an alert message, where initial focus goes to an element within the dialog. See related `alert` and `dialog`.	none	`aria-expanded (state)`
`application`	A region declared as a web application, as opposed to a web document.	none	`aria-expanded (state)`
`article`	A section of a page that consists of a composition that forms an independent part of a document, page, or site.	none	`aria-expanded (state)`
`banner`	A region that contains mostly site-oriented content, rather than page-specific content.	none	`aria-expanded (state)`
`button`	An input that allows for user-triggered actions when clicked or pressed. See related `link`.	none	`aria-expanded (state)` `aria-pressed (state)`
`checkbox`	A checkable input that has three possible values: true, false, or mixed.	`aria-checked (state)`	none
`columnheader`	A cell containing header information for a column.	none	`aria-sort` `aria-readonly` `aria-required` `aria-selected (state)` `aria-expanded (state)`
`combobox`	A presentation of a select; usually similar to a textbox where users can type ahead to select an option, or type to enter arbitrary text as a new item in the list. See related `listbox`.	`aria-expanded (state)`	`aria-autocomplete` `aria-required` `aria-activedescendant`
`complementary`	A supporting section of the document, designed to be complementary to the main content at a similar level in the DOM hierarchy, but remains meaningful when separated from the main content.	none	`aria-expanded (state)`
`contentinfo`	A large perceivable region that contains information about the parent document.	none	`aria-expanded (state)`
`definition`	A definition of a term or concept.	none	`aria-expanded (state)`
`dialog`	A dialog is an application window that is designed to interrupt the current processing of an application in order to prompt the user to enter information or require a response. See related `alertdialog`.	none	`aria-expanded (state)`
`directory`	A list of references to members of a group, such as a static table of contents.	none	`aria-expanded (state)`
`document`	A region containing related information that is declared as document content, as opposed to a web application.	none	`aria-expanded (state)`
`form`	A landmark region that contains a collection of items and objects that, as a whole, combine to create a form. See related `search`.	none	`aria-expanded (state)`
`grid`	A grid is an interactive control which contains cells of tabular data arranged in rows and columns, like a table.	none	`aria-level` `aria-multiselectable` `aria-readonly` `aria-activedescendant` `aria-expanded (state)`
`gridcell`	A cell in a grid or treegrid.	none	`aria-readonly` `aria-required` `aria-selected (state)` `aria-expanded (state)`
`group`	A set of user interface objects which are not intended to be included in a page summary or table of contents by assistive technologies.	none	`aria-activedescendant` `aria-expanded (state)`
`heading`	A heading for a section of the page.	none	`aria-level` `aria-expanded (state)`
`img`	A container for a collection of elements that form an image.	none	`aria-expanded (state)`
`link`	An interactive reference to an internal or external resource that, when activated, causes the user agent to navigate to that resource. See related `button`.	none	`aria-expanded (state)`
`list`	A group of non-interactive list items. See related `listbox`.	none	`aria-expanded (state)`
`listbox`	A widget that allows the user to select one or more items from a list of choices. See related `combobox` and `list`.	none	`aria-multiselectable` `aria-required` `aria-activedescendant` `aria-expanded (state)`
`listitem`	A single item in a `list` or `directory`.	none	`aria-level` `aria-posinset` `aria-setsize` `aria-expanded (state)`
`log`	A type of live region where new information is added in meaningful order and old information may disappear. See related `marquee`.	none	`aria-expanded (state)`
`main`	The main content of a document.	none	`aria-expanded (state)`
`marquee`	A type of live region where non-essential information changes frequently. See related `log`.	none	`aria-expanded (state)`
`math`	Content that represents a mathematical expression.	none	`aria-expanded (state)`
`menu`	A type of widget that offers a list of choices to the user.	none	`aria-activedescendant` `aria-expanded (state)`
`menubar`	A presentation of menu that usually remains visible and is usually presented horizontally.	none	`aria-activedescendant` `aria-expanded (state)`
`menuitem`	An option in a group of choices contained by a `menu` or `menubar`.	none	none
`menuitemcheckbox`	A checkable menuitem that has three possible values: true, false, or mixed.	`aria-checked (state)`	none
`menuitemradio`	A checkable menuitem in a group of `menuitemradio` roles, only one of which can be checked at a time.	`aria-checked (state)`	`aria-posinset` `aria-selected (state)` `aria-setsize`
`navigation`	A collection of navigational elements (usually links) for navigating the document or related documents.	none	`aria-expanded (state)`
`note`	A section whose content is parenthetic or ancillary to the main content of the resource.	none	`aria-expanded (state)`
`option`	A selectable item in a select list.	none	`aria-checked (state)` `aria-posinset` `aria-selected (state)` `aria-setsize`
`presentation`	An element whose implicit native role semantics will not be mapped to the accessibility API.	none	none
`progressbar`	An element that displays the progress status for tasks that take a long time.	none	`aria-valuemax` `aria-valuemin` `aria-valuenow` `aria-valuetext`
`radio`	A checkable input in a group of radio roles, only one of which can be checked at a time.	`aria-checked (state)`	`aria-posinset` `aria-selected (state)` `aria-setsize`
`radiogroup`	A group of radio buttons.	none	`aria-required` `aria-activedescendant` `aria-expanded (state)`
`region`	A large perceivable section of a web page or document, that the author feels is important enough to be included in a page summary or table of contents, for example, an area of the page containing live sporting event statistics.	none	`aria-expanded (state)`
`row`	A row of cells in a grid.	none	`aria-level` `aria-selected (state)` `aria-activedescendant` `aria-expanded (state)`
`rowgroup`	A group containing one or more row elements in a grid.	none	`aria-activedescendant` `aria-expanded (state)`
`rowheader`	A cell containing header information for a row in a grid.	none	`aria-sort` `aria-readonly` `aria-required` `aria-selected (state)` `aria-expanded (state)`
`scrollbar`	A graphical object that controls the scrolling of content within a viewing area, regardless of whether the content is fully displayed within the viewing area.	`aria-controls` `aria-orientation` `aria-valuemax` `aria-valuemin` `aria-valuenow`	`aria-expanded (state)`
`search`	A landmark region that contains a collection of items and objects that, as a whole, combine to create a search facility. See related `form`.	none	`aria-expanded (state)` `aria-orientation`
`separator`	A divider that separates and distinguishes sections of content or groups of menuitems.	none	`aria-valuetext`
`slider`	A user input where the user selects a value from within a given range.	`aria-valuemax` `aria-valuemin` `aria-valuenow`	`aria-orientation` `aria-valuetext`
`spinbutton`	A form of range that expects the user to select from among discrete choices.	`aria-valuemax` `aria-valuemin` `aria-valuenow`	`aria-required` `aria-valuetext`
`status`	A container whose content is advisory information for the user but is not important enough to justify an alert, often but not necessarily presented as a status bar. See related `alert`.	none	`aria-expanded (state)`
`tab`	A grouping label providing a mechanism for selecting the tab content that is to be rendered to the user.	none	`aria-selected (state)` `aria-expanded (state)`
`tablist`	A list of tab elements, which are references to tabpanel elements.	none	`aria-level` `aria-activedescendant` `aria-expanded (state)`
`tabpanel`	A container for the resources associated with a `tab`, where each `tab` is contained in a `tablist`.	none	`aria-expanded (state)`
`textbox`	Input that allows free-form text as its value.	none	`aria-activedescendant` `aria-autocomplete` `aria-multiline` `aria-readonly` `aria-required`
`timer`	A type of live region containing a numerical counter which indicates an amount of elapsed time from a start point, or the time remaining until an end point.	none	`aria-expanded (state)`
`toolbar`	A collection of commonly used function buttons represented in compact visual form.	none	`aria-activedescendant` `aria-expanded (state)`
`tooltip`	A contextual popup that displays a description for an element.	none	`aria-expanded (state)`
`tree`	A type of list that may contain sub-level nested groups that can be collapsed and expanded.	none	`aria-multiselectable` `aria-required` `aria-activedescendant` `aria-expanded (state)`
`treegrid`	A grid whose rows can be expanded and collapsed in the same manner as for a tree.	none	`aria-level` `aria-multiselectable` `aria-readonly` `aria-activedescendant` `aria-expanded (state)` `aria-required`
`treeitem`	An option item of a tree. This is an element within a tree that may be expanded or collapsed if it contains a sub-level group of treeitems.	none	`aria-level` `aria-posinset` `aria-setsize` `aria-expanded (state)` `aria-checked (state)` `aria-selected (state)`
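
As an informal aid to reading the Required Properties column, the following Python sketch reports which required states and properties are missing for a given role. Only a few rows of the table are transcribed; the names `REQUIRED_PROPERTIES` and `missing_required` are hypothetical, and the [ARIA] specification remains the normative source.

```python
from typing import Iterable, List

# role -> required states and properties, transcribed from the table above.
REQUIRED_PROPERTIES = {
    "button": set(),
    "checkbox": {"aria-checked"},
    "combobox": {"aria-expanded"},
    "scrollbar": {
        "aria-controls",
        "aria-orientation",
        "aria-valuemax",
        "aria-valuemin",
        "aria-valuenow",
    },
    "slider": {"aria-valuemax", "aria-valuemin", "aria-valuenow"},
}


def missing_required(role: str, attributes: Iterable[str]) -> List[str]:
    """Return, in sorted order, the required properties of `role` that are
    absent from `attributes` (the attribute names present on the element)."""
    return sorted(REQUIRED_PROPERTIES[role] - set(attributes))
```

For instance, an element with the `slider` role that carries only `aria-valuenow` would be reported as missing `aria-valuemax` and `aria-valuemin`.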