Instructions¶

WebAssembly code consists of sequences of instructions. Its computational model is based on a stack machine in that instructions manipulate values on an implicit operand stack, consuming (popping) argument values and producing or returning (pushing) result values.

In addition to dynamic operands from the stack, some instructions also have static immediate arguments, typically indices or type annotations, which are part of the instruction itself.

Some instructions are structured in that they bracket nested sequences of instructions.

The following sections group instructions into a number of different categories.

Numeric Instructions¶

Numeric instructions provide basic operations over numeric values of specific type. These operations closely match respective operations available in hardware.

\begin{array}{r} \begin{array}{llcl} nn, mm & ::= & 32 | 64 \\ sx & ::= & u | s \\ instr & ::= & i nn . const u nn | f nn . const f nn \\ | & i nn . iunop | f nn . funop \\ | & i nn . ibinop | f nn . fbinop \\ | & i nn . itestop \\ | & i nn . irelop | f nn . frelop \\ | & i nn . extend 8_s | i nn . extend 16_s | i 64. extend 32_s \\ | & i 32. wrap_i 64 | i 64. extend_i 32_sx | i nn . trunc_f mm_sx \\ | & i nn . trunc_sat_f mm_sx \\ | & f 32. demote_f 64 | f 64. promote_f 32 | f nn . convert_i mm_sx \\ | & i nn . reinterpret_f nn | f nn . reinterpret_i nn \\ | & \dots \\ iunop & ::= & clz | ctz | popcnt \\ ibinop & ::= & add | sub | mul | div_sx | rem_sx \\ | & and | or | xor | shl | shr_sx | rotl | rotr \\ funop & ::= & abs | neg | sqrt | ceil | floor | trunc | nearest \\ fbinop & ::= & add | sub | mul | div | \min | \max | copysign \\ itestop & ::= & eqz \\ irelop & ::= & eq | ne | lt_sx | gt_sx | le_sx | ge_sx \\ frelop & ::= & eq | ne | lt | gt | le | ge \end{array} \end{array}

Numeric instructions are divided by number type. For each type, several subcategories can be distinguished:

Constants: return a static constant.
Unary Operations: consume one operand and produce one result of the respective type.
Binary Operations: consume two operands and produce one result of the respective type.
Tests: consume one operand of the respective type and produce a Boolean integer result.
Comparisons: consume two operands of the respective type and produce a Boolean integer result.
Conversions: consume a value of one type and produce a result of another (the source type of the conversion is the one after the “ $_$ ”).

Some integer instructions come in two flavors, where a signedness annotation $sx$ distinguishes whether the operands are to be interpreted as unsigned or signed integers. For the other integer instructions, the use of two’s complement for the signed interpretation means that they behave the same regardless of signedness.

Conventions¶

Occasionally, it is convenient to group operators together according to the following grammar shorthands:

\begin{array}{r} \begin{array}{llll} unop & ::= & iunop | funop | extend N_s \\ binop & ::= & ibinop | fbinop \\ testop & ::= & itestop \\ relop & ::= & irelop | frelop \\ cvtop & ::= & wrap | extend | trunc | trunc_sat | convert | demote | promote | reinterpret \end{array} \end{array}

Vector Instructions¶

Vector instructions (also known as SIMD instructions, single instruction multiple data) provide basic operations over values of vector type.

\begin{array}{r} \begin{array}{llcl} ishape & ::= & i 8 x 16 | i 16 x 8 | i 32 x 4 | i 64 x 2 \\ fshape & ::= & f 32 x 4 | f 64 x 2 \\ shape & ::= & ishape | fshape \\ half & ::= & low | high \\ laneidx & ::= & u 8 \end{array} \end{array}

\begin{array}{r} \begin{array}{llcl} instr & ::= & \dots \\ | & v 128. const i 128 \\ | & v 128. vvunop \\ | & v 128. vvbinop \\ | & v 128. vvternop \\ | & v 128. vvtestop \\ | & i 8 x 16. shuffle {laneidx}^{16} \\ | & i 8 x 16. swizzle \\ | & shape . splat \\ | & i 8 x 16. extract_lane_sx laneidx | i 16 x 8. extract_lane_sx laneidx \\ | & i 32 x 4. extract_lane laneidx | i 64 x 2. extract_lane laneidx \\ | & fshape . extract_lane laneidx \\ | & shape . replace_lane laneidx \\ | & i 8 x 16 . virelop | i 16 x 8 . virelop | i 32 x 4 . virelop \\ | & i 64 x 2. eq | i 64 x 2. ne | i 64 x 2. lt_s | i 64 x 2. gt_s | i 64 x 2. le_s | i 64 x 2. ge_s \\ | & fshape . vfrelop \\ | & ishape . viunop | i 8 x 16. popcnt \\ | & i 16 x 8. q 15 mulr_sat_s \\ | & i 32 x 4. dot_i 16 x 8_s \\ | & fshape . vfunop \\ | & ishape . vitestop \\ | & ishape . bitmask \\ | & i 8 x 16. narrow_i 16 x 8_sx | i 16 x 8. narrow_i 32 x 4_sx \\ | & i 16 x 8. extend_half_i 8 x 16_sx | i 32 x 4. extend_half_i 16 x 8_sx \\ | & i 64 x 2. extend_half_i 32 x 4_sx \\ | & ishape . vishiftop \\ | & ishape . vibinop \\ | & i 8 x 16. viminmaxop | i 16 x 8. viminmaxop | i 32 x 4. viminmaxop \\ | & i 8 x 16. visatbinop | i 16 x 8. visatbinop \\ | & i 16 x 8. mul | i 32 x 4. mul | i 64 x 2. mul \\ | & i 8 x 16. avgr_u | i 16 x 8. avgr_u \\ | & i 16 x 8. extmul_half_i 8 x 16_sx | i 32 x 4. extmul_half_i 16 x 8_sx | i 64 x 2. extmul_half_i 32 x 4_sx \\ | & i 16 x 8. extadd_pairwise_i 8 x 16_sx | i 32 x 4. extadd_pairwise_i 16 x 8_sx \\ | & fshape . vfbinop \\ | & i 32 x 4. trunc_sat_f 32 x 4_sx | i 32 x 4. trunc_sat_f 64 x 2_sx_zero \\ | & f 32 x 4. convert_i 32 x 4_sx | f 32 x 4. demote_f 64 x 2_zero \\ | & f 64 x 2. convert_low_i 32 x 4_sx | f 64 x 2. promote_low_f 32 x 4 \\ | & \dots \end{array} \end{array}

\begin{array}{r} \begin{array}{llcl} vvunop & ::= & not \\ vvbinop & ::= & and | andnot | or | xor \\ vvternop & ::= & bitselect \\ vvtestop & ::= & any_true \\ vitestop & ::= & all_true \\ virelop & ::= & eq | ne | lt_sx | gt_sx | le_sx | ge_sx \\ vfrelop & ::= & eq | ne | lt | gt | le | ge \\ viunop & ::= & abs | neg \\ vibinop & ::= & add | sub \\ viminmaxop & ::= & \min_sx | \max_sx \\ visatbinop & ::= & add_sat_sx | sub_sat_sx \\ vishiftop & ::= & shl | shr_sx \\ vfunop & ::= & abs | neg | sqrt | ceil | floor | trunc | nearest \\ vfbinop & ::= & add | sub | mul | div | \min | \max | pmin | pmax \end{array} \end{array}

Vector instructions have a naming convention involving a prefix that determines how their operands will be interpreted. This prefix describes the shape of the operand, written $t x N$ , and consisting of a packed numeric type $t$ and the number of lanes $N$ of that type. Operations are performed point-wise on the values of each lane.

Note

For example, the shape $i 32 x 4$ interprets the operand as four $i 32$ values, packed into an $i 128$ . The bitwidth of the numeric type $t$ times $N$ always is 128.

Instructions prefixed with $v 128$ do not involve a specific interpretation, and treat the $v 128$ as an $i 128$ value or a vector of 128 individual bits.

Vector instructions can be grouped into several subcategories:

Constants: return a static constant.
Unary Operations: consume one $v 128$ operand and produce one $v 128$ result.
Binary Operations: consume two $v 128$ operands and produce one $v 128$ result.
Ternary Operations: consume three $v 128$ operands and produce one $v 128$ result.
Tests: consume one $v 128$ operand and produce a Boolean integer result.
Shifts: consume a $v 128$ operand and a $i 32$ operand, producing one $v 128$ result.
Splats: consume a value of numeric type and produce a $v 128$ result of a specified shape.
Extract lanes: consume a $v 128$ operand and return the numeric value in a given lane.
Replace lanes: consume a $v 128$ operand and a numeric value for a given lane, and produce a $v 128$ result.

Some vector instructions have a signedness annotation $sx$ which distinguishes whether the elements in the operands are to be interpreted as unsigned or signed integers. For the other vector instructions, the use of two’s complement for the signed interpretation means that they behave the same regardless of signedness.

Conventions¶

Occasionally, it is convenient to group operators together according to the following grammar shorthands:

\begin{array}{r} \begin{array}{llll} vunop & ::= & viunop | vfunop | popcnt \\ vbinop & ::= & vibinop | vfbinop \\ | & viminmaxop | visatbinop \\ | & mul | avgr_u | q 15 mulr_sat_s \\ vtestop & ::= & vitestop \\ vrelop & ::= & virelop | vfrelop \\ vcvtop & ::= & extend | trunc_sat | convert | demote | promote \end{array} \end{array}

Reference Instructions¶

Instructions in this group are concerned with accessing references.

\begin{array}{r} \begin{array}{llcl} instr & ::= & \dots \\ | & ref . null reftype \\ | & ref . is_null \\ | & ref . func funcidx \end{array} \end{array}

These instructions produce a null value, check for a null value, or produce a reference to a given function, respectively.

Parametric Instructions¶

Instructions in this group can operate on operands of any value type.

\begin{array}{r} \begin{array}{llcl} instr & ::= & \dots \\ | & drop \\ | & select ({valtype}^{*})^{?} \end{array} \end{array}

The $drop$ instruction simply throws away a single operand.

The $select$ instruction selects one of its first two operands based on whether its third operand is zero or not. It may include a value type determining the type of these operands. If missing, the operands must be of numeric type.

Note

In future versions of WebAssembly, the type annotation on $select$ may allow for more than a single value being selected at the same time.

Variable Instructions¶

Variable instructions are concerned with access to local or global variables.

\begin{array}{r} \begin{array}{llcl} instr & ::= & \dots \\ | & local . get localidx \\ | & local . set localidx \\ | & local . tee localidx \\ | & global . get globalidx \\ | & global . set globalidx \end{array} \end{array}

These instructions get or set the values of variables, respectively. The $local . tee$ instruction is like $local . set$ but also returns its argument.

Table Instructions¶

Instructions in this group are concerned with tables table.

\begin{array}{r} \begin{array}{llcl} instr & ::= & \dots \\ | & table . get tableidx \\ | & table . set tableidx \\ | & table . size tableidx \\ | & table . grow tableidx \\ | & table . fill tableidx \\ | & table . copy tableidx tableidx \\ | & table . init tableidx elemidx \\ | & elem . drop elemidx \end{array} \end{array}

The $table . get$ and $table . set$ instructions load or store an element in a table, respectively.

The $table . size$ instruction returns the current size of a table. The $table . grow$ instruction grows table by a given delta and returns the previous size, or $- 1$ if enough space cannot be allocated. It also takes an initialization value for the newly allocated entries.

The $table . fill$ instruction sets all entries in a range to a given value.

The $table . copy$ instruction copies elements from a source table region to a possibly overlapping destination region; the first index denotes the destination. The $table . init$ instruction copies elements from a passive element segment into a table. The $elem . drop$ instruction prevents further use of a passive element segment. This instruction is intended to be used as an optimization hint. After an element segment is dropped its elements can no longer be retrieved, so the memory used by this segment may be freed.

An additional instruction that accesses a table is the control instruction $call_indirect$ .

Memory Instructions¶

Instructions in this group are concerned with linear memory.

\begin{array}{r} \begin{array}{llcl} memarg & ::= & {offset u 32, align u 32} \\ ww & ::= & 8 | 16 | 32 | 64 \\ instr & ::= & \dots \\ | & i nn . load memarg | f nn . load memarg | v 128. load memarg \\ | & i nn . store memarg | f nn . store memarg | v 128. store memarg \\ | & i nn . load 8_sx memarg | i nn . load 16_sx memarg | i 64. load 32_sx memarg \\ | & i nn . store 8 memarg | i nn . store 16 memarg | i 64. store 32 memarg \\ | & v 128. load 8 x 8_sx memarg | v 128. load 16 x 4_sx memarg | v 128. load 32 x 2_sx memarg \\ | & v 128. load 32_zero memarg | v 128. load 64_zero memarg \\ | & v 128. load ww_splat memarg \\ | & v 128. load ww_lane memarg laneidx | v 128. store ww_lane memarg laneidx \\ | & memory . size \\ | & memory . grow \\ | & memory . fill \\ | & memory . copy \\ | & memory . init dataidx \\ | & data . drop dataidx \end{array} \end{array}

Memory is accessed with $load$ and $store$ instructions for the different number types. They all take a memory immediate $memarg$ that contains an address offset and the expected alignment (expressed as the exponent of a power of 2). Integer loads and stores can optionally specify a storage size that is smaller than the bit width of the respective value type. In the case of loads, a sign extension mode $sx$ is then required to select appropriate behavior.

Vector loads can specify a shape that is half the bit width of $v 128$ . Each lane is half its usual size, and the sign extension mode $sx$ then specifies how the smaller lane is extended to the larger lane. Alternatively, vector loads can perform a splat, such that only a single lane of the specified storage size is loaded, and the result is duplicated to all lanes.

The static address offset is added to the dynamic address operand, yielding a 33 bit effective address that is the zero-based index at which the memory is accessed. All values are read and written in little endian byte order. A trap results if any of the accessed memory bytes lies outside the address range implied by the memory’s current size.

Note

Future versions of WebAssembly might provide memory instructions with 64 bit address ranges.

The $memory . size$ instruction returns the current size of a memory. The $memory . grow$ instruction grows memory by a given delta and returns the previous size, or $- 1$ if enough memory cannot be allocated. Both instructions operate in units of page size.

The $memory . fill$ instruction sets all values in a region to a given byte. The $memory . copy$ instruction copies data from a source memory region to a possibly overlapping destination region. The $memory . init$ instruction copies data from a passive data segment into a memory. The $data . drop$ instruction prevents further use of a passive data segment. This instruction is intended to be used as an optimization hint. After a data segment is dropped its data can no longer be retrieved, so the memory used by this segment may be freed.

Note

In the current version of WebAssembly, all memory instructions implicitly operate on memory index $0$ . This restriction may be lifted in future versions.

Control Instructions¶

Instructions in this group affect the flow of control.

\begin{array}{r} \begin{array}{llcl} blocktype & ::= & typeidx | {valtype}^{?} \\ instr & ::= & \dots \\ | & nop \\ | & unreachable \\ | & block blocktype {instr}^{*} end \\ | & loop blocktype {instr}^{*} end \\ | & if blocktype {instr}^{*} else {instr}^{*} end \\ | & br labelidx \\ | & br_if labelidx \\ | & br_table vec (labelidx) labelidx \\ | & return \\ | & call funcidx \\ | & call_indirect tableidx typeidx \end{array} \end{array}

The $nop$ instruction does nothing.

The $unreachable$ instruction causes an unconditional trap.

The $block$ , $loop$ and $if$ instructions are structured instructions. They bracket nested sequences of instructions, called blocks, terminated with, or separated by, $end$ or $else$ pseudo-instructions. As the grammar prescribes, they must be well-nested.

A structured instruction can consume input and produce output on the operand stack according to its annotated block type. It is given either as a type index that refers to a suitable function type, or as an optional value type inline, which is a shorthand for the function type $[] \to [{valtype}^{?}]$ .

Each structured control instruction introduces an implicit label. Labels are targets for branch instructions that reference them with label indices. Unlike with other index spaces, indexing of labels is relative by nesting depth, that is, label $0$ refers to the innermost structured control instruction enclosing the referring branch instruction, while increasing indices refer to those farther out. Consequently, labels can only be referenced from within the associated structured control instruction. This also implies that branches can only be directed outwards, “breaking” from the block of the control construct they target. The exact effect depends on that control construct. In case of $block$ or $if$ it is a forward jump, resuming execution after the matching $end$ . In case of $loop$ it is a backward jump to the beginning of the loop.

Note

This enforces structured control flow. Intuitively, a branch targeting a $block$ or $if$ behaves like a $break$ statement in most C-like languages, while a branch targeting a $loop$ behaves like a $continue$ statement.

Branch instructions come in several flavors: $br$ performs an unconditional branch, $br_if$ performs a conditional branch, and $br_table$ performs an indirect branch through an operand indexing into the label vector that is an immediate to the instruction, or to a default target if the operand is out of bounds. The $return$ instruction is a shortcut for an unconditional branch to the outermost block, which implicitly is the body of the current function. Taking a branch unwinds the operand stack up to the height where the targeted structured control instruction was entered. However, branches may additionally consume operands themselves, which they push back on the operand stack after unwinding. Forward branches require operands according to the output of the targeted block’s type, i.e., represent the values produced by the terminated block. Backward branches require operands according to the input of the targeted block’s type, i.e., represent the values consumed by the restarted block.

The $call$ instruction invokes another function, consuming the necessary arguments from the stack and returning the result values of the call. The $call_indirect$ instruction calls a function indirectly through an operand indexing into a table that is denoted by a table index and must have type $funcref$ . Since it may contain functions of heterogeneous type, the callee is dynamically checked against the function type indexed by the instruction’s second immediate, and the call is aborted with a trap if it does not match.

Expressions¶

Function bodies, initialization values for globals, elements and offsets of element segments, and offsets of data segments are given as expressions, which are sequences of instructions terminated by an $end$ marker.

\begin{array}{r} \begin{array}{llll} expr & ::= & {instr}^{*} end \end{array} \end{array}

In some places, validation restricts expressions to be constant, which limits the set of allowable instructions.