Copyright © 2015-2024 World Wide Web Consortium. W3C® liability, trademark and permissive document license rules apply.
Some user agents have music devices, such as synthesizers, keyboard and other controllers, and drum machines connected to their host computer or device. The widely adopted Musical Instrument Digital Interface (MIDI) protocol enables electronic musical instruments, controllers and computers to communicate and synchronize with each other. MIDI does not transmit audio signals: instead, it sends event messages about musical notes, controller signals for parameters such as volume, vibrato and panning, cues and clock signals to set the tempo, and system-specific MIDI communications (e.g. to remotely store synthesizer-specific patch data). This same protocol has become a standard for non-musical uses, such as show control, lighting and special effects control.
This specification defines an API supporting the MIDI protocol, enabling web applications to enumerate and select MIDI input and output devices on the client system and send and receive MIDI messages. It is intended to enable non-music MIDI applications as well as music ones, by providing low-level access to the MIDI devices available on the users' systems. The Web MIDI API is not intended to describe music or controller inputs semantically; it is designed to expose the mechanics of MIDI input and output interfaces, and the practical aspects of sending and receiving MIDI messages, without identifying what those actions might mean semantically (e.g., in terms of "modulate the vibrato by 20Hz" or "play a G#7 chord", other than in terms of changing a controller value or sending a set of note-on messages that happen to represent a G#7 chord).
To some users, "MIDI" has become synonymous with Standard MIDI Files
and General MIDI. That is not the intent of this API; the use case of
simply playing back a .SMF file is not within the purview of this
specification (it could be considered a different format to be
supported by the HTML audio
element, for example). The Web MIDI API
is intended to enable direct access to devices that respond to MIDI -
controllers, external synthesizers or lighting systems, for example.
The Web MIDI API is also explicitly designed to enable a new class of
applications on the web that can respond to MIDI controller inputs -
using external hardware controllers with physical buttons, knobs and
sliders (as well as musical controllers like keyboard, guitar or wind
instrument controllers) to control web applications.
The Web MIDI API is also expected to be used in conjunction with other APIs and elements of the web platform, notably the Web Audio API. This API is also intended to be familiar to users of MIDI APIs on other systems, such as Apple's CoreMIDI and Microsoft's Windows MIDI API.
This section describes the status of this document at the time of its publication. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at https://www.w3.org/TR/.
This document was published by the Audio Working Group as a Working Draft using the Recommendation track.
Publication as a Working Draft does not imply endorsement by W3C and its Members.
This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
This document was produced by a group operating under the W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
This document is governed by the 03 November 2023 W3C Process Document.
This section is non-normative.
The Web MIDI API specification defines a means for web developers to enumerate, manipulate and access MIDI devices - for example, interfaces that may provide hardware MIDI ports with other devices plugged in to them and USB devices that support the USB-MIDI specification. Having a Web API for MIDI enables web applications that use existing software and hardware synthesizers, hardware music controllers and light systems and other mechanical apparatus controlled by MIDI. This API has been defined with this wide variety of use cases in mind.
The approaches taken by this API are similar to those taken in Apple's CoreMIDI API and Microsoft's Windows MIDI API; that is, the API is designed to represent the low-level software protocol of MIDI, in order to enable developers to build powerful MIDI software on top. The API enables the developer to enumerate input and output interfaces, and send and receive MIDI messages, but (similar to the aforementioned APIs) it does not attempt to semantically define or interpret MIDI messages beyond what is necessary to robustly support current devices.
The Web MIDI API is not intended to directly implement high-level concepts such as sequencing; it does not directly support Standard MIDI Files, for example, although a Standard MIDI File player can be built on top of the Web MIDI API. It is also not intended to semantically capture patches or controller assignments, as General MIDI does; such interpretation is outside the scope of the Web MIDI API (though again, General MIDI can easily be utilized through the Web MIDI API).
As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.
The key words MUST and SHOULD in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.
This specification defines conformance criteria that apply to a single product: the user agent that implements the interfaces that it contains.
Implementations that use ECMAScript to implement the APIs defined in this specification MUST implement them in a manner consistent with the ECMAScript Bindings defined in the Web IDL specification [WEBIDL], as this specification uses that specification and terminology.
The Web Audio API and its associated interfaces and concepts are defined in [webaudio].
The terms MIDI, MIDI device, MIDI input port, MIDI output port, MIDI interface, MIDI message, System Real Time and System Exclusive are defined in [MIDI].
The Web Midi API is a powerful feature that is identified by the name "midi". It integrates with Permissions by defining the following permission-related flags:
WebIDLdictionary MidiPermissionDescriptor
: PermissionDescriptor {
boolean sysex
= false;
};
{name: "midi", sysex: true}
is stronger than {name:
"midi", sysex: false}
.
The Web Midi API defines a policy-controlled feature named
"midi" which has a
default allowlist of 'self'
.
WebIDL[SecureContext, Exposed=(Window,Worker)] interface MIDIInputMap
{
readonly maplike <DOMString, MIDIInput
>;
};
The MIDIInputMap
is a maplike interface whose value is a
MIDIInput
instance and key is its ID.
This type is used to represent all the currently available MIDI input ports. This enables:
// to tell how many entries there are:
var numberOfMIDIInputs = inputs.size;
// add each of the ports to a <select> box
inputs.forEach( function( port, key ) {
var opt = document.createElement("option");
opt.text = port.name;
document.getElementById("inputportselector").add(opt);
});
// or you could express in ECMAScript 6 as:
for (let input of inputs.values()) {
var opt = document.createElement("option");
opt.text = input.name;
document.getElementById("inputportselector").add(opt);
}
WebIDL[SecureContext, Exposed=(Window,Worker)] interface MIDIOutputMap
{
readonly maplike <DOMString, MIDIOutput
>;
};
The MIDIOutputMap
is a maplike interface whose value is a
MIDIOutput
instance and key is its ID.
This type is used to represent all the currently available MIDI output ports. This enables:
// to tell how many entries there are:
var numberOfMIDIOutputs = outputs.size;
// add each of the ports to a <select> box
outputs.forEach( function( port, key ) {
var opt = document.createElement("option");
opt.text = port.name;
document.getElementById("outputportselector").add(opt);
});
// or you could express in ECMAScript 6 as:
for (let output of outputs.values()) {
var opt = document.createElement("option");
opt.text = output.name;
document.getElementById("outputportselector").add(opt);
}
This interface provides the methods to list MIDI input and output devices, and obtain access to an individual device.
WebIDL[SecureContext, Exposed=(Window,Worker), Transferable] interface MIDIAccess
: EventTarget {
readonly attribute MIDIInputMap
inputs
;
readonly attribute MIDIOutputMap
outputs
;
attribute EventHandler onstatechange
;
readonly attribute boolean sysexEnabled
;
};
inputs
outputs
onstatechange
The handler called when a new port is connected or an existing port changes the state attribute.
This event handler, of type MIDIConnectionEvent
, MUST be
supported by all objects implementing the MIDIAccess
interface.
It is important to understand that leaving an EventHandler
attached to this object will prevent it from being
garbage-collected; when finished using the MIDIAccess
, you
should remove any onstatechange
listeners.
Whenever a previously unavailable MIDI port becomes available for use, or an existing port changes the state attribute, the user agent SHOULD run the following steps:
MIDIPort
corresponding to the
newly-available, or the existing port.
MIDIAccess
,
using MIDIConnectionEvent
with port
set to port.
sysexEnabled
This interface represents a MIDI input or output port.
WebIDL[SecureContext, Exposed=(Window,Worker)] interface MIDIPort
: EventTarget {
readonly attribute DOMString id
;
readonly attribute DOMString? manufacturer
;
readonly attribute DOMString? name
;
readonly attribute MIDIPortType
type
;
readonly attribute DOMString? version
;
readonly attribute MIDIPortDeviceState
state
;
readonly attribute MIDIPortConnectionState
connection
;
attribute EventHandler onstatechange
;
Promise <MIDIPort
> open
();
Promise <MIDIPort
> close
();
};
id
A unique ID of the port. This can be used by developers to
remember ports the user has chosen for their application. The
User Agent MUST ensure that the id
is unique to only
that port. The User Agent SHOULD ensure that the id is maintained
across instances of the application - e.g., when the system is
rebooted - and when a device is removed from the system.
Applications may want to cache these ids locally to re-create a
MIDI setup. Some systems may not support completely unique
persistent identifiers; in such cases, it will be more
challenging to maintain identifiers when another interface is
added or removed from the system. (This might throw off the index
of the requested port.) It is expected that the system will do
the best it can to match a port across instances of the MIDI API:
for example, an implementation may opaquely use some form of hash
of the port interface manufacturer, name and index as the id, so
that a reference to that port id is likely to match the port when
plugged in. Applications may use the comparison of id of
MIDIPorts to test for equality.
manufacturer
The manufacturer of the port.
name
The system name of the port.
type
A descriptor property to distinguish whether the port is an input
or an output port. For MIDIOutput
, this MUST be
"output"
. For MIDIInput
, this MUST be
"input"
.
version
The version of the port.
state
connection
onstatechange
The handler called when an existing port changes its state or connection attributes.
This event handler, of type "statechange", MUST be
supported by all objects implementing MIDIPort
interface.
It is important to understand that leaving an EventHandler
attached to this object will prevent it from being
garbage-collected; when finished using the MIDIPort
, you
should remove any onstatechange
listeners.
open
Makes the MIDI device corresponding to the MIDIPort
explicitly available. Note that this call is NOT required in order
to use the MIDIPort
- calling send()
on a
MIDIOutput
or attaching a MIDIMessageEvent handler on a
MIDIInput
will cause an implicit open(). The underlying
implementation may not need to do anything in response to this
call. However, some underlying implementations may not be able to
support shared access to MIDI devices, so using explicit
open() and close() calls will enable MIDI applications to
predictably control this exclusive access to devices.
When invoked, this method returns a Promise object representing a request for access to the given MIDI port on the user's system.
If the port device has a state of "connected", when access to the port has been obtained (and the port is ready for input or output), the vended Promise is resolved.
If access to a connected port is not available (for example, the port is already in use in an exclusive-access-only platform), the Promise is rejected (if any) is invoked.
If open()
is called on a port that is "disconnected", the port's
.connection will transition
to "pending",
until the port becomes "connected" or all references
to it are dropped.
When this method is called, the user agent MUST run the algorithm to open a MIDIPort:
Let promise be a new Promise object and resolver be its associated resolver.
Return promise and run the following steps asynchronously.
Let port be the given MIDIPort
object.
If the device's connection is already "open" (e.g. open() has
already been called on this MIDIPort
, or the port has been
implicitly opened), jump to the step labeled success
below.
If the device's connection is "pending" (i.e. the connection had been opened and the device was subsequently disconnected), jump to the step labeled success below.
If the device's state is "disconnected", change
the connection
attribute of
the MIDIPort
to "pending", and enqueue
a new MIDIConnectionEvent
to the
statechange handler
of the MIDIAccess
and to the statechange handler of the
MIDIPort
and jump to the step labeled success
below.
Attempt to obtain access to the given MIDI device in the system. If the device is unavailable (e.g. is already in use by another process and cannot be opened, or is disconnected), jump to the step labeled failure below. If the device is available and access is obtained, continue the following steps.
Change the connection
attribute of the MIDIPort
to "open"
, and enqueue a new
MIDIConnectionEvent
to the statechange handler of the
MIDIAccess
and to the statechange handler of the
MIDIPort
.
If this port is an output port and has any pending data that is waiting to be sent, asynchronously begin sending that data.
success: Call resolver's
accept(value)
method with port as
value argument.
Terminate these steps.
failure: Let error be a new
DOMException
. This exception's .name should be
"InvalidAccessError"
if the port is unavailable.
Call resolver's reject(value)
method
with error as value argument.
close
Makes the MIDI device corresponding to the MIDIPort
explicitly unavailable (subsequently changing the state from
"open" to "closed"). Note that successful invocation of this
method will result in MIDI messages no longer being delivered
to MIDIMessageEvent handlers on a MIDIInput
(although setting
a new handler will cause an implicit open()).
The underlying implementation may not need to do anything in response to this call. However, some underlying implementations may not be able to support shared access to MIDI devices, and the explicit close() call enables MIDI applications to ensure other applications can gain access to devices.
When invoked, this method returns a Promise object representing a request for access to the given MIDI port on the user's system. When the port has been closed (and therefore, in exclusive access systems, the port is available to other applications), the vended Promise is resolved. If the port is disconnected, the Promise is rejected.
When the close()
method is called, the user agent
MUST run the algorithm to close a MIDIPort:
Let promise be a new Promise object and resolver be its associated resolver.
Return promise and run the following steps asynchronously.
Let port be the given MIDIPort
object.
If the port is already closed (its .connection is "closed" - e.g. the port
has not yet been implicitly or explicitly opened, or
close()
has already been
called on this MIDIPort
), jump to the step labeled
closed below.
If the port is an input port, skip to the next step. If the output port's .state is not "connected", clear all pending send data and skip to the next step. Clear any pending send data in the system with timestamps in the future, then finish sending any send messages with no timestamp or with a timestamp in the past or present, prior to proceeding to the next step.
Close access to the port in the underlying system if open, and release any blocking resources in the underlying system.
Change the connection
attribute of the MIDIPort
to "closed"
, and enqueue a new
MIDIConnectionEvent
to the statechange handler of the
MIDIAccess
and to the statechange handler of the
MIDIPort
.
closed: Call resolver's
accept(value)
method with port as
value argument.
Terminate these steps.
Whenever the MIDI port corresponding to the MIDIPort
changes the
state attribute, the user agent SHOULD run the following steps:
Let port be the MIDIPort
.
Fire an event named statechange at the MIDIPort
, and
statechange at the
MIDIAccess
, using MIDIConnectionEvent
with the port
attribute set
to port.
WebIDL[SecureContext, Exposed=(Window,Worker)] interface MIDIInput
: MIDIPort
{
attribute EventHandler onmidimessage
;
};
onmidimessage
This event handler, of type "midimessage", MUST be
supported by all objects implementing MIDIInput
interface.
If the handler is set and the state attribute is not
"opened"
, underlying implementation tries to make
the port available, and change the state attribute to
"opened"
. If succeeded, MIDIConnectionEvent
is
delivered to the corresponding MIDIPort
and
MIDIAccess
.
Whenever the MIDI port corresponding to the MIDIInput
finishes
receiving one or more MIDI messages, the user agent MUST run the
following steps:
Let port be the MIDIInput
.
If the MIDIAccess
did not enable System Exclusive
access, and the message is a System Exclusive message, abort
this process.
Fire an event named "midimessage" at port, using
MIDIMessageEvent
with the timeStamp
attribute set
to the time the message was received by the system, and with the
data
attribute set to a Uint8Array
of
MIDI data bytes representing a single MIDI message.
It is specifically noted that MIDI System Real Time messages may actually occur in the middle of other messages in the input stream; in this case, the System Real Time messages will be dispatched as they occur, while the normal messages will be buffered until they are complete (and then dispatched).
WebIDL[SecureContext, Exposed=(Window,Worker)] interface MIDIOutput
: MIDIPort
{
undefined send
(sequence<octet> data, optional DOMHighResTimeStamp timestamp = 0);
undefined clear
();
};
send
Enqueues the message to be sent to the corresponding MIDI port.
The underlying implementation will (if necessary) coerce each
member of the sequence to an unsigned 8-bit integer. The use of
sequence rather than a Uint8Array enables developers to use the
convenience of output.send( [ 0x90, 0x45, 0x7f ]
);
rather than having to create a Uint8Array, e.g.
output.send( new Uint8Array( [ 0x90, 0x45, 0x7f ] )
);
The data contains one or more valid, complete MIDI messages. Running status is not allowed in the data, as underlying systems may not support it.
If data is not a valid sequence or does not contain
a valid MIDI message, throw a TypeError
exception.
If data is a System Exclusive message, and the
MIDIAccess
did not enable System Exclusive access, throw
an InvalidAccessError
exception.
If the port is "disconnected", throw an
InvalidStateError
exception.
If the port is "connected" but the connection is "closed", asynchronously try to open the port.
DOMHighResTimeStamp
- a number of milliseconds measured
relative to the navigation start of the document). If
timestamp
is set to zero (or another time in the
past), the data is to be sent as soon as possible.
clear
Clears any pending send data that has not yet been sent from
the MIDIOutput
's queue. The implementation will
need to ensure the MIDI stream is left in a good state, so if
the output port is in the middle of a sysex message, a sysex
termination byte (0xf7) should be sent.
WebIDLenum MIDIPortType
{
"input
",
"output
",
};
input
MIDIPort
is an input port, the type member MUST be this
value.
output
MIDIPort
is an output port, the type member MUST be this
value.
WebIDLenum MIDIPortDeviceState
{
"disconnected
",
"connected
",
};
disconnected
MIDIPort
represents is disconnected from the
system. When a device is disconnected from the system, it should
not appear in the relevant map of input and output ports.
connected
MIDIPort
represents is connected, and should
appear in the map of input and output ports.
WebIDLenum MIDIPortConnectionState
{
"open
",
"closed
",
"pending
",
};
open
MIDIPort
represents has been opened (either
implicitly or explicitly) and is
available for use.
closed
MIDIPort
represents has not been opened, or
has been explicitly closed. Until a MIDIPort
has been opened
either explicitly (through MIDIPort
.open
()
) or implicitly (by
adding a midimessage
event handler on an input port, or calling MIDIOutput
.send
()
on an output port, this should be the default state of the device.
pending
MIDIPort
represents has been opened (either
implicitly or
explicitly), but the device has subsequently been disconnected
and is unavailable for use. If the device is reconnected, prior to
sending a statechange
event, the system should attempt to reopen the device (following
the algorithm to open a MIDIPort); this will result in either
the connection state transitioning to "open" or to "closed".
An event object implementing this interface is passed to a
MIDIInput
's onmidimessage handler when MIDI messages are
received. Note that the DOM Event
timeStamp
attribute is defined as a DOMHighResTimeStamp
, and represents the
high-resolution time of when the event was received or is to be sent.
WebIDL[SecureContext, Exposed=(Window,Worker)]
interface MIDIMessageEvent
: Event {
constructor
(DOMString type, optional MIDIMessageEventInit
eventInitDict = {});
readonly attribute Uint8Array? data
;
};
data
A Uint8Array containing the MIDI data bytes of a single MIDI message.
WebIDLdictionary MIDIMessageEventInit
: EventInit {
Uint8Array data
;
};
data
A Uint8Array containing the MIDI data bytes of a single MIDI message.
An event object implementing this interface is passed to a
MIDIAccess
' onstatechange
handler when a new port
becomes available (for example, when a MIDI device is first
plugged in to the computer), when a previously-available port becomes
unavailable, or becomes available again (for example, when a MIDI interface is disconnected, then reconnected) and (if present) is
also passed to the onstatechange
handlers for any
MIDIPort
s referencing the port.
When a MIDIPort
is in the "pending" state and the device
is reconnected to the host system, prior to firing a statechange event the algorithm to open a MIDIPort is run on it to attempt to reopen the port. If this
transition fails (e.g. the Port is reserved by something else in the
underlying system, and therefore unavailable for use), the connection
state moves to "closed", else it transitions back to "open". This is
done prior to the statechange event for the device state
change so that the event will reflect the final connection state as
well as the device state.
Some underlying systems may not provide notification events for device connection status; such systems may have long time delays as they poll for new devices infrequently. As such, it is suggested that heavy reliance on connection events not be used.
WebIDL[SecureContext, Exposed=(Window,Worker)]
interface MIDIConnectionEvent
: Event {
constructor
(DOMString type, optional MIDIConnectionEventInit
eventInitDict = {});
readonly attribute MIDIPort
? port
;
};
port
The port that has been connected or disconnected.
WebIDLdictionary MIDIConnectionEventInit
: EventInit {
MIDIPort
port
;
};
port
The port that has been connected or disconnected.
Allowing the enumeration of the user's MIDI interfaces is a potential target for fingerprinting (that is, uniquely identifying a user by the specific MIDI interfaces they have connected). Note that in this context, what can be enumerated is the MIDI interfaces - not, for example, an individual sampler or synthesizer plugged into a MIDI interface, as these would not be enumerated, unless those devices are connected to the host computer with USB (USB-MIDI devices typically have their own MIDI interface, and would be enumerated). The interfaces that could be fingerprinted are equivalent to MIDI "ports", and for each device the API will expose the name of the device, manufacturer, and opaque identifier of the MIDI interface (but not any attached devices).
Few systems will have significant numbers of MIDI devices attached; those systems that do will typically use hardware MIDI interfaces, not fanning out a dozen USB-MIDI connections through USB hubs. In this case, of course, enumerating the MIDI “devices” will only see the hardware MIDI interface(s), not the synthesizers, samplers, etc. plugged into it on the other side. Given the few number of devices plugged in, the amount of information exposed here is fairly symmetric with the fingerprinting concern exposed by other APIs such as the Gamepad API. The vast majority of systems have relatively few MIDI interfaces attached.
In short: the additional fingerprinting exposure of enumerating MIDI devices is directly analogous to the Gamepad API’s additional fingerprinting exposure through gamepad enumeration; typical users will only have at most a few devices connected, their configuration may change, and the information exposed is about the interface itself (i.e., no user-configured data).
Separate from the fingerprinting concerns of identifying the available ports are concerns around sending and receiving MIDI messages. Those issues are explored in more depth below.
In brief, the general categories of things you can do with MIDI ports are:
The impact of each of these is:
It's also useful to examine what scenarios are enabled by MIDI, mapped against these features:
The additional security concern for receiving short messages is also small - it’s analogous to listening to keyboard, mouse, mobile/laptop accelerometer, touch input or gamepad events; there is no additional information exposed, and all messages other than clock signals must be initiated by the user.
The additional concerns about sending short messages are analogous to any audio output - you cannot overwrite user information or expose use information, but you can make sounds happen, change patches, or (in rare configurations) toggle lights - but non-destructively, and not persistently.
System Exclusive, on the other hand, has a much less bounded potential, and it seems that distinguishing requests for SysEx separately in the API is a good idea, in order to more carefully provide user security hooks. The suggested security model explicitly allows user agents to require the user's approval before giving access to MIDI devices, although it is not currently required to prompt the user for this approval - but it also detailed that System Exclusive support must be requested as part of that request.
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in: