Add doc pages for SIMD/small-numbers (#3936)

TheNumbat · web-flow · commit d95e9c9650b2 · 2025-04-30T14:51:48.000Z
diff --git a/jane/doc/extensions/_08-miscellaneous-extensions/simd.md b/jane/doc/extensions/_08-miscellaneous-extensions/simd.md
@@ -0,0 +1,116 @@
+---
+layout: documentation-page
+collectionName: Miscellaneous extensions
+title: SIMD
+---
+
+# SIMD
+
+The OxCaml compiler provides built-in 128-bit SIMD vector types, as well as
+intrinsics for amd64 SIMD instructions up to and including SSE4.2.
+
+<!-- CR mslater: link to simd libraries -->
+To get started with SIMD, add the `ocaml_simd_sse` library to your dependencies.
+You may also want to use `ppx_simd`, which provides convenient syntax for
+defining constants like blend and shuffle masks.
+
+## Types
+
+When SIMD is enabled, the following 128-bit SIMD vector types are available:
+
+```
+int8x16
+int8x16#
+int16x8
+int16x8#
+int32x4
+int32x4#
+int64x2
+int64x2#
+float32x4
+float32x4#
+float64x2
+float64x2#
+```
+
+The types ending with `#` are unboxed: they are passed between functions in XMM
+registers, stored in structures as flat data, and may be stored in flat arrays.
+The operations provided by `Ocaml_simd_sse` operate on unboxed vectors.  For
+more detail on unboxed types, see the [docs](../unboxed-types/intro).
+
+The types without `#` are boxed: when passed to a non-inlined function, they
+will be copied to a heap-allocated (abstract) block.  Boxed vectors are not
+necessarily 16-byte aligned, so they are accessed with unaligned loads/stores.
+
+Within a function, all SIMD vectors live in floating-point registers or 16-byte
+aligned stack slots.
+
+## Intrinsics
+
+SIMD vectors are opaque: no operations on them are built into the
+language. Instead, the compiler translates certain "builtin" externals directly
+to SIMD instructions.  Your code should use the `ocaml_simd_sse` library, which
+exposes an OxCaml API for these intrinsics.
+
+```ocaml
+module Float32x4 = Ocaml_simd_sse.Float32x4
+
+let v = Float32x4.set 1.0 2.0 3.0 4.0
+let v = Float32x4.sqrt v
+let x, y, z, w = Float32x4.splat v
+```
+
+SIMD vectors may be loaded from / stored to strings, bytes, bigstrings, and
+arrays of the corresponding unboxed type. Load and store operations are also
+provided by `ocaml_simd_sse`, rather than Base or Core.
+
+```ocaml
+module Int8x16 = Ocaml_simd_sse.Int8x16
+
+let text = "abcdefghijklmnopqrstuvwxyz"
+let floats = [| 1.0; 2.0 |]
+let ints = [| 1; 2 |]
+
+let _ = Int8x16.String.get text ~byte:0
+let _ = Ocaml_simd_sse.Float64x2.Float_array.get floats ~idx:0 (* Float array optimization required *)
+let _ = Ocaml_simd_sse.Int64x2.Immediate_array.get_tagged ints ~idx:0
+```
+
+Some operations require the user to choose a specific behavior at compile
+time. To do so, you must provide a compile-time constant generated by
+`ppx_simd`.  Refer to the `ppx_simd` documentation for more details.
+
+```ocaml
+module Int32x4 = Ocaml_simd_sse.Int32x4
+
+let x = Int32x4.set 0 2 4 6
+let y = Int32x4.set 1 3 5 7
+let z = Int32x4.blend [%blend 0, 1, 0, 1] x y
+```
+
+## C ABI
+
+Like floats, both boxed and unboxed SIMD vectors may be passed to C stubs.  The
+OxCaml runtime provides several helper functions for working with SIMD vectors.
+
+```ocaml
+external simd_stub : (int8x16[@unboxed]) -> (int8x16[@unboxed]) =
+  "boxed_integer_simd_stub" "unboxed_integer_simd_stub"
+
+(* ... *)
+```
+```c
+#include <caml/simd.h>
+
+__m128i unboxed_integer_simd_stub(__m128i v) {
+  return v;
+}
+
+value boxed_integer_simd_stub(value v) {
+  return caml_copy_vec128i(unboxed_integer_simd_stub(Vec128_vali(v)));
+}
+```
+
+## Future Work
+
+Support for wider vectors and NEON/AVX2/AVX512 intrinsics is coming soon.
diff --git a/jane/doc/extensions/_08-miscellaneous-extensions/small-numbers.md b/jane/doc/extensions/_08-miscellaneous-extensions/small-numbers.md
@@ -0,0 +1,87 @@
+---
+layout: documentation-page
+collectionName: Miscellaneous extensions
+title: Small Numbers
+---
+
+# Small Numbers
+
+The small numbers extension adds `float32`, `int16`, and `int8` types to OxCaml.
+Currently, only `float32` (single-precision IEEE float) is implemented.
+
+## Float32
+
+When small numbers are enabled, the following float32 types are available:
+
+```
+float32
+float32#
+float32 array
+float32# array
+```
+
+Literals use the `s` suffix:
+
+```
+1.0s  : float32
+#1.0s : float32#
+```
+
+Pattern matching on `float32`s is not supported.
+
+### Operations
+
+Operations on 32-bit floats are available via the `Stdlib_stable.Float32` and
+`Stdlib_stable.Float32_u` libraries, which provide `Base`-like APIs.
+
+### Representation
+
+The boxed `float32` type is encoded as a custom block with similar semantics to
+`int32`.  Similarly, `float32 array` is a typical OxCaml array containing boxed
+elements.
+
+The `float32#` type is unboxed:
+
+- Function arguments and returns of type `float32#` are passed using
+  floating-point registers.
+
+- Record fields of type `float32#` are not boxed, but each takes up one word of
+  space.  Using float32 records requires the mixed blocks extension, which is
+  also enabled by default.
+
+- Arrays of type `float32# array` contain tightly packed unboxed float32
+  elements.  The array itself is a custom block with similar semantics to
+  `int32# array`.
+
+As with floats, compiler optimizations allow boxed float32s to remain unboxed
+while being manipulated within the scope of a function.
+
+### C ABI
+
+Both boxed and unboxed float32s may be passed to C stubs.  The OxCaml runtime
+provides helper functions for working with float32s.
+
+```ocaml
+external float32_stub : (float32[@unboxed]) -> (float32[@unboxed]) =
+  "boxed_float32_stub" "unboxed_float32_stub"
+
+external float32_hash_stub : float32# -> float32# =
+  "boxed_float32_stub" "unboxed_float32_stub"
+
+(* ... *)
+```
+```c
+#include <caml/float32.h>
+
+float unboxed_float32_stub(float v) {
+  return v;
+}
+
+value boxed_float32_stub(value v) {
+  return caml_copy_float32(unboxed_float32_stub(Float32_val(v)));
+}
+```
+
+## Int8 / Int16
+
+Coming soon.