2026-03-06 - 2026-04-06
Overview
1 Release published by 1 user
GoldenRockachopa
3 Pull requests merged by 1 user
#18 PolarQuant Implementation & Phase 2 Integration Plan
#19 Benchmarking Suite: Objective Quality and Performance Testing
#20 feat: Sovereign Evolution Redistribution — turboquant
19 Issues closed from 1 user
#24 [P2-4] Run full quality comparison: turbo4 vs f16 on 10 test prompts
#1 TurboQuant — KV Cache Compression for Local Inference on M4 Max
#23 [P2-3] Fix Ollama install and build custom Ollama with TurboQuant fork
#25 [P2-5] Download qwen3.5:27b and benchmark turbo4 at 64K/128K context
#27 [TQ-2] Build TheTom/llama-cpp-turboquant for M3 Max Metal
#26 [P2-6] Production cutover: swap Timmy's llama-server to TurboQuant
#31 [TQ-1] Download Gemma 4 via Ollama on Mac
#30 [EPIC] TurboQuant + Gemma 4 Local Mac Deployment
#14 [P3] QJL residual correction — Metal port
#13 [P2.5] Per-layer quantization profiles
#10 [P2] Custom Ollama build + MacBook deployment
#9 [P2-S0] Ollama CGo API compatibility check
#6 [P1-S2] Baseline benchmarks — FP16 KV cache (no TurboQuant)
#8 [P1-S2] Peak memory profiling at each context length
#7 [P1-S2] PolarQuant benchmarks — turbo4 KV cache + asymmetric test
#5 [P1-S1] PolarQuant verification checklist
#4 [P1-S1] Build llama.cpp fork with Metal backend on M4 Max
#3 [P1-S0] Fork assessment — age, conflicts, build path estimate
#2 [P1-GATE] Metal kernel check — determines llama.cpp vs MLX path
29 Issues created by 2 users
#1 TurboQuant — KV Cache Compression for Local Inference on M4 Max
#2 [P1-GATE] Metal kernel check — determines llama.cpp vs MLX path
#3 [P1-S0] Fork assessment — age, conflicts, build path estimate
#4 [P1-S1] Build llama.cpp fork with Metal backend on M4 Max
#5 [P1-S1] PolarQuant verification checklist
#6 [P1-S2] Baseline benchmarks — FP16 KV cache (no TurboQuant)
#7 [P1-S2] PolarQuant benchmarks — turbo4 KV cache + asymmetric test
#8 [P1-S2] Peak memory profiling at each context length
#9 [P2-S0] Ollama CGo API compatibility check
#10 [P2] Custom Ollama build + MacBook deployment
#11 [P2] Full test matrix — 10 prompts + quality + performance
#12 [P2] Long-session quality test — 50-turn conversation
#13 [P2.5] Per-layer quantization profiles
#14 [P3] QJL residual correction — Metal port
#15 [P4] Upstream llama.cpp / Ollama TurboQuant watch
#16 [P1-PREP] Write 10 predefined test prompts
#17 TurboQuant Initiative Review & Contributor Feedback
#21 [P2-1] Download wikitext-2-raw and run perplexity quality gate
#22 [P2-2] Write 10 test prompts for quality comparison
#25 [P2-5] Download qwen3.5:27b and benchmark turbo4 at 64K/128K context
#24 [P2-4] Run full quality comparison: turbo4 vs f16 on 10 test prompts
#23 [P2-3] Fix Ollama install and build custom Ollama with TurboQuant fork
#26 [P2-6] Production cutover: swap Timmy's llama-server to TurboQuant
#27 [TQ-2] Build TheTom/llama-cpp-turboquant for M3 Max Metal
#28 [TQ-4] Create Hermes profile for local Gemma 4 + TurboQuant
#29 [TQ-5] Benchmark: latency, memory, quality comparison
#30 [EPIC] TurboQuant + Gemma 4 Local Mac Deployment
#31 [TQ-1] Download Gemma 4 via Ollama on Mac
#32 [TQ-3] Perplexity quality gate: turbo4 vs f16
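Several of the issues above (#6, #7, #21, #32) revolve around a perplexity quality gate comparing the turbo4 KV cache against the f16 baseline. As a rough illustration of what such a gate computes (not the project's actual implementation: the 1% threshold and the per-token log-probability inputs are assumptions for the sketch), the comparison reduces to:

```python
import math

def perplexity(logprobs):
    # Perplexity = exp of the negative mean per-token log-probability.
    return math.exp(-sum(logprobs) / len(logprobs))

def quality_gate(f16_logprobs, turbo4_logprobs, max_rel_increase=0.01):
    # Pass if the quantized-cache run's perplexity is within
    # max_rel_increase (here 1%, an assumed threshold) of the
    # f16 baseline on the same evaluation text.
    ppl_f16 = perplexity(f16_logprobs)
    ppl_turbo4 = perplexity(turbo4_logprobs)
    passed = ppl_turbo4 <= ppl_f16 * (1 + max_rel_increase)
    return passed, ppl_f16, ppl_turbo4
```

In practice the two log-probability streams would come from running the same wikitext-2-raw text through the model twice, once with the f16 KV cache and once with turbo4; the gate then blocks the cutover (#26) if the quantized run regresses past the threshold.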