Values¶
WebAssembly programs operate on primitive numeric values. Moreover, in the definition of programs, immutable sequences of values occur to represent more complex data, such as text strings or other vectors.
Bytes¶
The simplest form of value are raw uninterpreted bytes. In the abstract syntax they are represented as hexadecimal literals.
Conventions¶
- The meta variable \(b\) ranges over bytes.
- Bytes are sometimes interpreted as natural numbers \(n < 256\).
Integers¶
Different classes of integers with different value ranges are distinguished by their bit width \(N\) and by whether they are unsigned or signed.
The latter class defines uninterpreted integers, whose signedness interpretation can vary depending on context. In the abstract syntax, they are represented as unsigned values. However, some operations convert them to signed based on a two’s complement interpretation.
Note
The main integer types occurring in this specification are \(\href{../syntax/values.html#syntax-int}{\mathit{u32}}\), \(\href{../syntax/values.html#syntax-int}{\mathit{u64}}\), \(\href{../syntax/values.html#syntax-int}{\mathit{s32}}\), \(\href{../syntax/values.html#syntax-int}{\mathit{s64}}\), \(\href{../syntax/values.html#syntax-int}{\mathit{i8}}\), \(\href{../syntax/values.html#syntax-int}{\mathit{i16}}\), \(\href{../syntax/values.html#syntax-int}{\mathit{i32}}\), \(\href{../syntax/values.html#syntax-int}{\mathit{i64}}\). However, other sizes occur as auxiliary constructions, e.g., in the definition of floating-point numbers.
Conventions¶
- The meta variables \(m, n, i\) range over integers.
- Numbers may be denoted by simple arithmetics, as in the grammar above. In order to distinguish arithmetics like \(2^N\) from sequences like \((1)^N\), the latter is distinguished with parentheses.
Floating-Point¶
Floating-point data represents 32 or 64 bit values that correspond to the respective binary formats of the IEEE 754-2019 standard (Section 3.3).
Every value has a sign and a magnitude. Magnitudes can either be expressed as normal numbers of the form \(m_0.m_1m_2\dots m_M \cdot2^e\), where \(e\) is the exponent and \(m\) is the significand whose most signifcant bit \(m_0\) is \(1\), or as a subnormal number where the exponent is fixed to the smallest possible value and \(m_0\) is \(0\); among the subnormals are positive and negative zero values. Since the significands are binary values, normals are represented in the form \((1 + m\cdot 2^{-M}) \cdot 2^e\), where \(M\) is the bit width of \(m\); similarly for subnormals.
Possible magnitudes also include the special values \(\infty\) (infinity) and \(\href{../syntax/values.html#syntax-float}{\mathsf{nan}}\) (NaN, not a number). NaN values have a payload that describes the mantissa bits in the underlying binary representation. No distinction is made between signalling and quiet NaNs.
where \(M = \href{../syntax/values.html#aux-significand}{\mathrm{signif}}(N)\) and \(E = \href{../syntax/values.html#aux-exponent}{\mathrm{expon}}(N)\) with
A canonical NaN is a floating-point value \(\pm\href{../syntax/values.html#syntax-float}{\mathsf{nan}}(\href{../syntax/values.html#aux-canon}{\mathrm{canon}}_N)\) where \(\href{../syntax/values.html#aux-canon}{\mathrm{canon}}_N\) is a payload whose most significant bit is \(1\) while all others are \(0\):
An arithmetic NaN is a floating-point value \(\pm\href{../syntax/values.html#syntax-float}{\mathsf{nan}}(n)\) with \(n \geq \href{../syntax/values.html#aux-canon}{\mathrm{canon}}_N\), such that the most significant bit is \(1\) while all others are arbitrary.
Note
In the abstract syntax, subnormals are distinguished by the leading 0 of the significand. The exponent of subnormals has the same value as the smallest possible exponent of a normal number. Only in the binary representation the exponent of a subnormal is encoded differently than the exponent of any normal number.
Conventions¶
- The meta variable \(z\) ranges over floating-point values where clear from context.
Names¶
Names are sequences of characters, which are scalar values as defined by Unicode (Section 2.4).
Due to the limitations of the binary format, the length of a name is bounded by the length of its UTF-8 encoding.
Convention¶
- Characters (Unicode scalar values) are sometimes used interchangeably with natural numbers \(n < 1114112\).