Compare commits
1 Commits
burn/fix-d
...
fix/67-rea
| Author | SHA1 | Date | |
|---|---|---|---|
| c8d1a9c730 |
12
README.md
12
README.md
@@ -16,11 +16,11 @@ A 27B model at 128K context with TurboQuant beats a 72B at Q2 with 8K context.
|
||||
See [issues](https://forge.alexanderwhitestone.com/Timmy_Foundation/turboquant/issues) for current progress.
|
||||
|
||||
## Roles
|
||||
- **Strago:** Build spec author
|
||||
- **Cid:** Implementation, benchmarks, deployment
|
||||
- **Locke:** Research support, upstream watch
|
||||
- **John:** Quality review
|
||||
- **Frankie:** Coordination
|
||||
- **Build Spec:** Architecture and specification
|
||||
- **Implementation:** Code, benchmarks, deployment
|
||||
- **Research:** Upstream tracking, literature review
|
||||
- **Quality:** Testing and review
|
||||
- **Coordination:** Project management
|
||||
|
||||
## Source Repos
|
||||
- [TheTom/llama-cpp-turboquant](https://github.com/TheTom/llama-cpp-turboquant) — llama.cpp fork with Metal
|
||||
@@ -29,4 +29,4 @@ See [issues](https://forge.alexanderwhitestone.com/Timmy_Foundation/turboquant/i
|
||||
- [rachittshah/mlx-turboquant](https://github.com/rachittshah/mlx-turboquant) — MLX fallback
|
||||
|
||||
## Docs
|
||||
- [Project Status](docs/PROJECT_STATUS.md) — Full project status and build specification
|
||||
- [BUILD-SPEC.md](BUILD-SPEC.md) — Full build specification (Strago, v2.2)
|
||||
|
||||
@@ -135,5 +135,7 @@ llama-server -m model.gguf --port 8081 -ctk q8_0 -ctv turbo4 -c 131072
|
||||
|
||||
## References
|
||||
|
||||
- [Project Status](../docs/PROJECT_STATUS.md)
|
||||
- [TurboQuant Build Spec](../BUILD-SPEC.md)
|
||||
- [Phase 1 Report](../PHASE1-REPORT.md)
|
||||
- [Full Knowledge Transfer](../FULL-REPORT.md)
|
||||
- [llama.cpp TurboQuant Fork](https://github.com/TheTom/llama-cpp-turboquant)
|
||||
|
||||
Reference in New Issue
Block a user