From 77521e7e714bdd388208626a7ef221a073c06e35 Mon Sep 17 00:00:00 2001
From: Alexander Whitestone
Date: Thu, 16 Apr 2026 01:27:11 +0000
Subject: [PATCH] feat: WASM demo page (#104)

---
 wasm/index.html | 144 ++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 144 insertions(+)
 create mode 100644 wasm/index.html

diff --git a/wasm/index.html b/wasm/index.html
new file mode 100644
index 00000000..5e2e63c7
--- /dev/null
+++ b/wasm/index.html
@@ -0,0 +1,144 @@
+
+
+
+
+TurboQuant WASM Inference
+
+
+
+

TurboQuant WASM Inference

+

Run quantized models directly in the browser via WebAssembly. No server. No data leaves your machine.

+ +
+

1. Initialize

+ +not started +
+ +
+

2. Load Model

+ + +waiting +
+ +
+

3. Inference

+ +
+ + + + +
+
+
+ +
+

Metrics

+
--
Load (ms)
--
Throughput (tok/s)
--
Memory (MB)
+
+ +
+

Viability Assessment

+
+Waiting for benchmark results...
+
+Target models:
+  Falcon-H1-Tiny-90M: ~45MB quantized (Q2_K)
+  Bonsai-1.7B: ~400MB quantized (Q4_0)
+
+Browser limits:
+  Chrome: 4GB WASM memory
+  Firefox: 2GB WASM memory
+  Safari: 1GB WASM memory (Bonsai-1.7B may fail to load)
+
+
+
+
+
+
\ No newline at end of file
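
The page's script is not visible in this copy of the patch, so here is a minimal sketch of how the "Metrics" and "Viability Assessment" panels could be computed. The model sizes and browser limits are the ones listed on the page. The function names and `HEADROOM_FACTOR` are hypothetical and do not come from the patch.

```javascript
// Sketch only: function names and HEADROOM_FACTOR are assumptions,
// not taken from the patch.

// Per-browser WASM memory limits in MB, as listed on the demo page.
const BROWSER_LIMITS_MB = { Chrome: 4096, Firefox: 2048, Safari: 1024 };

// Quantized model sizes in MB, as listed on the demo page.
const TARGET_MODELS_MB = { "Falcon-H1-Tiny-90M": 45, "Bonsai-1.7B": 400 };

// Hypothetical multiplier for activations, KV cache, and runtime overhead
// on top of the raw model file size.
const HEADROOM_FACTOR = 2;

// Compare an estimated memory requirement against a browser limit.
function assessViability(modelMB, limitMB, headroom = HEADROOM_FACTOR) {
  const requiredMB = modelMB * headroom;
  return { requiredMB, limitMB, fits: requiredMB <= limitMB };
}

// Throughput in tokens per second from a token count and elapsed milliseconds.
function tokensPerSecond(tokenCount, elapsedMs) {
  if (elapsedMs <= 0) return 0; // avoid division by zero before any timing exists
  return tokenCount / (elapsedMs / 1000);
}

// Build one report line per (model, browser) pair.
function viabilityReport() {
  const lines = [];
  for (const [model, sizeMB] of Object.entries(TARGET_MODELS_MB)) {
    for (const [browser, limitMB] of Object.entries(BROWSER_LIMITS_MB)) {
      const { requiredMB, fits } = assessViability(sizeMB, limitMB);
      const verdict = fits ? "fits" : "may fail";
      lines.push(`${model} on ${browser}: ~${requiredMB}MB of ${limitMB}MB, ${verdict}`);
    }
  }
  return lines;
}
```

In a page like this one, `tokensPerSecond` would typically be fed by two `performance.now()` readings taken around the generation loop. The headroom factor is a guess. The page's own Safari caveat suggests real memory needs can exceed what this simple arithmetic predicts, so a "fits" result is an estimate, not a guarantee.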