Closes #1123. Implements all three phases of the local LLM standardization:
PHASE 1 — Deployment:
- docs/local-llm.md: full deployment guide (build, model download, health check,
model path convention /opt/models/llama/, hardware recommendations)
- systemd/llama-server.service: hardened unit with resource limits and auto-restart
- Health check: /health endpoint plus model-loaded verification (sketched below)
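
A minimal verification sketch, assuming the llama.cpp server's built-in `/health` route, its OpenAI-compatible `/v1/models` listing, and the fleet default endpoint; the shipped check may differ in detail:

```python
import sys
import requests

BASE_URL = "http://localhost:11435"  # fleet default endpoint

def server_ready(timeout: float = 5.0) -> bool:
    """True when /health answers OK and at least one model is loaded."""
    try:
        if not requests.get(f"{BASE_URL}/health", timeout=timeout).ok:
            return False
        # /v1/models lists the loaded GGUF; an empty list means nothing is ready yet.
        models = requests.get(f"{BASE_URL}/v1/models", timeout=timeout).json()
        return bool(models.get("data"))
    except requests.RequestException:
        return False

if __name__ == "__main__":
    sys.exit(0 if server_ready() else 1)
```
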
PHASE 2 — Hermes Integration:
- bin/llama_client.py: OpenAI-compatible Python client wrapping llama.cpp HTTP API
(chat completions, streaming, raw completions, health check, model listing,
benchmarking, full CLI interface; see the call sketch below)
- nexus/llama_provider.py: Hermes inference router provider adapter
- Activates when external APIs fail, when LOCAL_ONLY=true, or on an explicit local request
- Response format normalized to OpenAI-compatible chat completions
- Token usage estimated and logged
- Health-check results cached with a TTL to avoid redundant probes (see provider sketch below)
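
For orientation, a minimal sketch of the kind of call `bin/llama_client.py` wraps, assuming the server's OpenAI-compatible `/v1/chat/completions` route; the function name and defaults here are illustrative, not the client's actual API:

```python
import requests

BASE_URL = "http://localhost:11435"

def chat(messages: list[dict], model: str = "Qwen2.5-7B-Instruct-Q4_K_M.gguf",
         temperature: float = 0.7, max_tokens: int = 512) -> dict:
    """POST an OpenAI-style chat completion to the local llama.cpp server."""
    resp = requests.post(
        f"{BASE_URL}/v1/chat/completions",
        json={"model": model, "messages": messages,
              "temperature": temperature, "max_tokens": max_tokens},
        timeout=120,
    )
    resp.raise_for_status()
    return resp.json()  # OpenAI-shaped: choices[0].message.content, usage, ...

# Example:
# reply = chat([{"role": "user", "content": "ping"}])
# print(reply["choices"][0]["message"]["content"])
```
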
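And a rough sketch of the provider-side routing and health caching described above; class and method names are assumptions, only `LOCAL_ONLY` and the endpoint come from this PR:

```python
import os
import time
import requests

BASE_URL = "http://localhost:11435"
HEALTH_TTL = 30.0  # seconds; cache the /health result instead of probing every request

class LocalLlamaProvider:
    def __init__(self) -> None:
        self._healthy = False
        self._checked_at = 0.0

    def is_available(self) -> bool:
        """Probe /health, reusing the cached result inside the TTL window."""
        now = time.monotonic()
        if now - self._checked_at >= HEALTH_TTL:
            try:
                self._healthy = requests.get(f"{BASE_URL}/health", timeout=3).ok
            except requests.RequestException:
                self._healthy = False
            self._checked_at = now
        return self._healthy

    def should_route_local(self, external_failed: bool, explicit_local: bool) -> bool:
        """Route locally on external-API failure, LOCAL_ONLY=true, or explicit request."""
        local_only = os.getenv("LOCAL_ONLY", "").lower() == "true"
        return (external_failed or local_only or explicit_local) and self.is_available()
```
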
PHASE 3 — Optimization & Ops:
- Benchmarking: client.benchmark() + CLI benchmark command (throughput sketch below)
- Quantization guide: Q4_K_M recommended for the fleet, Q6_K for high-RAM hosts, Q3_K for low-RAM hosts
- Model recommendations for VPS Beta (3B), VPS Alpha (7B), Mac (7B Q6_K)
- Night watch integration: health probe script with auto-restart
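
The throughput sketch below is illustrative only (not `client.benchmark()`'s actual signature); it assumes the OpenAI-compatible `usage` block in responses and includes prompt processing in the timing, so treat the result as a rough rate:

```python
import time
import requests

BASE_URL = "http://localhost:11435"

def rough_tokens_per_second(prompt: str = "Explain GGUF quantization briefly.",
                            runs: int = 3) -> float:
    """Average completion tokens per second over a few runs."""
    rates = []
    for _ in range(runs):
        start = time.monotonic()
        resp = requests.post(
            f"{BASE_URL}/v1/chat/completions",
            json={"messages": [{"role": "user", "content": prompt}], "max_tokens": 256},
            timeout=300,
        ).json()
        elapsed = time.monotonic() - start
        rates.append(resp["usage"]["completion_tokens"] / elapsed)
    return sum(rates) / len(rates)

# print(f"{rough_tokens_per_second():.1f} tok/s")
```
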
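And a minimal night-watch probe sketch, assuming the unit name from systemd/llama-server.service; the shipped probe script may differ:

```python
import subprocess
import requests

BASE_URL = "http://localhost:11435"
UNIT = "llama-server.service"  # unit added in this PR

def probe_and_restart() -> None:
    """Restart the server via systemd when /health is unreachable or unhealthy."""
    try:
        ok = requests.get(f"{BASE_URL}/health", timeout=5).ok
    except requests.RequestException:
        ok = False
    if not ok:
        subprocess.run(["systemctl", "restart", UNIT], check=False)

if __name__ == "__main__":
    probe_and_restart()
```
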
Fleet standard model: Qwen2.5-7B-Instruct-Q4_K_M.gguf
Default endpoint: http://localhost:11435
22 tests pass.