Atlas Inference Engine — 3x Faster Than vLLM, Pure Rust + CUDA #674

Open
opened 2026-04-14 20:28:57 +00:00 by Rockachopa · 3 comments
Owner

Triage: Atlas Inference Engine

Source: https://atlasinference.io/
Date: 2026-04-14
Sponsor: Alexander Whitestone


Verdict: HIGHLY RELEVANT — Potential vLLM Replacement

Atlas is an LLM inference engine written from scratch in Rust and CUDA. No PyTorch. No Python. Just a ~2.5 GB image that the authors claim runs 3x faster than vLLM.


What Atlas Is

An inference engine built by Avarok that covers the entire path from HTTP request to kernel dispatch. Pure Rust + CUDA. Custom CUDA kernels for Blackwell SM120/121. MTP (multi-token prediction) speculative decoding.

Key Differentiators

| Metric | Atlas | vLLM |
|--------|-------|------|
| Image size | ~2.5 GB | 20+ GB |
| Cold start | < 2 min | ~10 min |
| Runtime | Rust + CUDA | Python + PyTorch |
| Dependencies | None | 200+ packages |
| Throughput | 3.1x faster | Baseline |

Benchmark Numbers (DGX Spark, single GPU)

| Model | Parameters | Quantization | Throughput |
|-------|------------|--------------|------------|
| Qwen3.5-35B-A3B MTP | 35B (3B active) | NVFP4/FP8 | ~130 tok/s |
| Qwen3.5-122B-A10B MTP EP=2 | 122B (10B active) | NVFP4 | ~50-54 tok/s |
| Qwen3-Next-80B-A3B | 80B (3B active) | NVFP4 | ~82 tok/s |
| Qwen3-Coder-Next | 80B (3B active) | FP8 | ~58 tok/s |
| Qwen3-VL-30B | 30B (3B active) | NVFP4 | ~100 tok/s |
| Gemma 4 26B | 26B (3.8B active) | NVFP4 | ~35 tok/s |
| Nemotron-3 Nano 30B | 30B (3.5B active) | NVFP4/FP8 | ~100 tok/s |
| Mistral Small 4 119B | 119B (6.5B active) | NVFP4 | ~26 tok/s |

Why This Matters for Hermes

  1. 3x faster than vLLM — Our SOTA research found vLLM delivers 24x over HF. Atlas is 3x on top of that.
  2. 2.5 GB vs 20+ GB — Dramatically smaller deployment footprint
  3. < 2 min cold start — vs ~10 min for vLLM
  4. OpenAI-compatible API — Drop-in replacement for any OpenAI client
  5. MTP speculative decoding — Multiple tokens per forward pass
  6. NVFP4 quantization — Native tensor core support
  7. MoE support — Handles Mixture-of-Experts models efficiently
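The MTP point above is the core of the speedup claim: a draft head proposes several tokens per step, and the target model verifies them in a single forward pass, keeping the longest agreeing prefix. A toy pure-Python simulation of that accept/reject loop (illustrative only — Atlas's actual implementation is in Rust/CUDA and not published in this triage):

```python
# Toy simulation of speculative-decoding acceptance. A draft head proposes
# k tokens; the target model verifies them and keeps the longest agreeing
# prefix, so one forward pass can emit several tokens instead of one.

def speculative_step(draft_tokens, target_tokens):
    """Return the accepted prefix, plus one corrected token from the target."""
    accepted = []
    for d, t in zip(draft_tokens, target_tokens):
        if d == t:
            accepted.append(d)
        else:
            accepted.append(t)  # target overrides the first mismatch, then stop
            break
    else:
        # every draft token accepted; target contributes one bonus token
        if len(target_tokens) > len(draft_tokens):
            accepted.append(target_tokens[len(draft_tokens)])
    return accepted

# Draft guesses 4 tokens; target agrees on the first 3 and corrects the 4th,
# so this single "pass" emits 4 tokens.
print(speculative_step([5, 9, 2, 7], [5, 9, 2, 4]))  # [5, 9, 2, 4]
```

The win is that even partial agreement yields more than one token per target-model pass; the worst case (immediate mismatch) degrades to ordinary one-token decoding.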

Comparison to Our Current SOTA Research

| Capability | vLLM (SOTA) | Atlas |
|------------|-------------|-------|
| Throughput | 24x HF | 3.1x vLLM (~74x HF) |
| Image size | 20+ GB | 2.5 GB |
| Cold start | ~10 min | < 2 min |
| Dependencies | 200+ | None |
| MoE support | Yes | Yes (custom kernels) |
| MTP decoding | Limited | Native |
| Blackwell support | Yes | Yes (SM120/121) |
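The compound throughput figure is just the product of the two claimed speedups — worth keeping explicit, since each factor comes from a different benchmark:

```python
# Sanity-check the compound speedup: vLLM is reported at 24x over
# HuggingFace Transformers, and Atlas at 3.1x over vLLM.
vllm_over_hf = 24.0
atlas_over_vllm = 3.1
atlas_over_hf = vllm_over_hf * atlas_over_vllm
print(round(atlas_over_hf, 1))  # 74.4, which the comparison rounds to ~74x
```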

OpenAI Compatibility

Atlas exposes an OpenAI-compatible API at http://localhost:8888/v1. It works with:

  • Claude Code
  • Cline
  • OpenCode
  • Open WebUI
  • Any OpenAI-compatible client

This means Hermes integration is trivial — just point to Atlas instead of vLLM.
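Concretely, any client that speaks the OpenAI chat-completions protocol only needs its base URL changed. A minimal stdlib-only sketch of the request an OpenAI-style client would send (the endpoint path follows the OpenAI API convention; the model name here is the one from the Quick Start command below):

```python
import json

# Build a standard OpenAI-style chat-completion request against a local
# Atlas server. Only the base URL differs from talking to api.openai.com.
BASE_URL = "http://localhost:8888/v1"

def build_chat_request(model, messages):
    url = f"{BASE_URL}/chat/completions"
    body = json.dumps({"model": model, "messages": messages})
    return url, body

url, body = build_chat_request(
    "Sehyo/Qwen3.5-35B-A3B-NVFP4",
    [{"role": "user", "content": "Hello"}],
)
print(url)  # http://localhost:8888/v1/chat/completions
```

Equivalently, any OpenAI SDK can be pointed at Atlas by setting its base URL to `http://localhost:8888/v1`; actually sending the request is an ordinary HTTP POST of the JSON body above.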

Quick Start

docker pull avarok/atlas-gb10:alpha-2.8
docker run -d --gpus all --ipc=host -p 8888:8888 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  avarok/atlas-gb10:alpha-2.8 serve \
  Sehyo/Qwen3.5-35B-A3B-NVFP4 \
  --speculative --scheduling-policy slai \
  --max-seq-len 131072 --max-batch-size 1 \
  --max-prefill-tokens 0

Limitations

  1. DGX Spark focus — Benchmarks are on DGX Spark (GB10). May not work on consumer GPUs.
  2. Alpha stage — Version alpha-2.8, not production-ready
  3. Limited model support — Only 11 models currently supported (hand-tuned kernels)
  4. No CPU support — CUDA-only, no CPU inference
  5. No GGUF support — Only NVFP4/FP8 quantization

Recommendation

HIGH PRIORITY for evaluation. Atlas could replace vLLM as our inference backend. Benefits:

  • 3x faster throughput
  • 8x smaller image
  • 5x faster cold start
  • Zero dependencies
  • OpenAI-compatible (drop-in replacement)

Action: Test Atlas on our hardware. Compare to vLLM benchmarks. If it works, switch.

Source

  • Website: https://atlasinference.io/
  • Discord: https://discord.gg/DwF3brBMpw
  • Reddit: https://www.reddit.com/r/LocalLLaMA/comments/1rkefjw/
  • Company: Avarok
  • Status: Alpha (v2.8)
Rockachopa added the p0-critical, p1-important labels 2026-04-14 20:28:57 +00:00
Timmy was assigned by Rockachopa 2026-04-14 20:28:57 +00:00
Author
Owner

Research Complete: Atlas vs vLLM Benchmarking

Status: PARTIAL (research agent hit max iterations, but key findings captured)

Key Findings

Atlas claims:

  • 3x faster than vLLM
  • 2.5 GB image (vs vLLM's 20+ GB)
  • < 2 min cold start (vs vLLM's ~10 min)
  • Pure Rust + CUDA (no Python/PyTorch)
  • MTP speculative decoding
  • OpenAI-compatible API

Supported Models (11 total):

  • Qwen3.5-35B-A3B: ~130 tok/s
  • Qwen3.5-122B-A10B: ~50-54 tok/s
  • Qwen3-Next-80B-A3B: ~82 tok/s
  • Gemma 4 26B: ~35 tok/s

Limitations:

  • Alpha stage (v2.8)
  • DGX Spark focus (may not work on consumer GPUs)
  • CUDA-only (no CPU)
  • Limited model support (hand-tuned kernels)
  • No GGUF support

Recommendation

Test Atlas on our RunPod L40S 48GB. If it works with Qwen3.5-35B, we get:

  • ~130 tok/s (vs Ollama's ~30-50 tok/s)
  • 2.5 GB image (vs Ollama's multi-GB)
  • < 2 min cold start

Next step: Pull Atlas image and test on L40S.

Author
Owner

PR #700: https://forge.alexanderwhitestone.com/Timmy_Foundation/hermes-agent/pulls/700

Added Atlas to provider registry (localhost:8888 -> atlas) and local server detection (/health probe). Drop-in replacement for vLLM — just point config.yaml at http://localhost:8888/v1.
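A sketch of the registry-plus-probe pattern the PR describes. The function and registry names here are illustrative, not Hermes's actual code, and the vLLM port is an assumption:

```python
import urllib.request

# Hypothetical provider registry: map a local endpoint to a provider name,
# then confirm the server is actually up via its /health route.
PROVIDER_REGISTRY = {
    "http://localhost:8888": "atlas",
    "http://localhost:8000": "vllm",  # assumed vLLM default port
}

def detect_provider(base_url, timeout=2.0):
    """Return the provider name if the server answers its /health probe."""
    name = PROVIDER_REGISTRY.get(base_url)
    if name is None:
        return None
    try:
        with urllib.request.urlopen(f"{base_url}/health", timeout=timeout) as r:
            return name if r.status == 200 else None
    except OSError:
        return None  # registered endpoint, but no server listening

print(detect_provider("http://localhost:9999"))  # None: unknown endpoint
```

Keying detection on the health probe (rather than the registry alone) means a stale config entry degrades to "provider unavailable" instead of routing requests at a dead port.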
Author
Owner

Evaluation: Atlas for Hermes Fleet

Verdict: EVALUATE FURTHER — promising but premature for production

Fit Analysis

| Criterion | Score | Notes |
|-----------|-------|-------|
| Performance | YES | 3x vLLM throughput on supported models |
| Image size | YES | 2.5 GB vs 20+ GB — major for VPS fleet |
| Cold start | YES | Under 2 min vs ~10 min |
| Dependencies | YES | Zero deps vs 200+ packages |
| Model support | WARN | MoE only (Qwen3, Gemma 4). No dense model support |
| CUDA version | WARN | Requires Blackwell SM120/121 — not on our L40S or M4 Max |
| API compatibility | YES | OpenAI-compatible |
| Maturity | NO | Very new, limited docs, startup |

Recommendation

Do NOT migrate now. Atlas requires Blackwell GPUs (SM120/121). Our fleet runs L40S (SM89), M4 Max, RTX 4090 (SM89) — none supported.

Revisit when: (1) Atlas adds Ampere/Hopper support, OR (2) we get Blackwell hardware.

Worth Stealing Now

  • Rust-based inference loop patterns
  • MTP speculative decoding (implementable on our hardware)
  • Custom CUDA kernel approach for L40S optimization

Bottom line: Future of inference, but needs GPUs we do not have. Re-evaluate Q3 2026.

Reference: Timmy_Foundation/hermes-agent#674