These results were collected on the Lanexio Parser self-hosted Linux runner (dev-parser). All numbers are from benchmarks/results/benchmark-results.json in the repository.
Each benchmark runs 100 warm iterations after 10 warm-up iterations. “Cold start” is the first parse after module load. Parse throughput is measured in MB/s of input bytes. Serialization throughput is measured in MB/s of output bytes.
The warm/hot distinction matters: cold-start numbers include V8 Turbofan JIT compilation. For github.html in particular, the Lanexio Parser cold-start time reflects the JIT cost of the adoption-agency and phantom-synthesis branches, which are exercised heavily by GitHub’s HTML output. Hot throughput is the steady-state number.
The Zig/WASM HTML tokenizer is included in v1.0 as an opt-in acceleration path. It provides a measurable end-to-end speedup on HTML documents that contain no character references.
Input
Size
TS parse (ms)
WASM parse (ms)
Speedup
nodejs-docs.html
107 KB
3.68
3.37
1.09×
wikipedia.html
195 KB
17.15
16.19
1.00× *
github.html
306 KB
22.41
22.39
1.00× *
* Wikipedia and GitHub HTML contain character references (&, <, ©). The Zig tokenizer currently stubs character references; a pre-scan detects & followed by a letter or # and gracefully falls back to the TypeScript tokenizer. Full character reference resolution in Zig is deferred to v1.1.