hermes-agent

Author	SHA1	Message	Date
teknium1	39ee3512cb	Merge PR #614 : fix: resolve systemd restart loop with --replace flag Authored by voidborne-d. Fixes #576. Adds --replace flag to 'hermes gateway run' that terminates any existing gateway instance (SIGTERM with SIGKILL fallback) before starting. Updated systemd unit template with --replace, ExecStop, KillMode, and TimeoutStopSec for robust service management.	2026-03-07 16:33:27 -08:00
teknium1	42673556af	Merge PR #575 : fix(setup): prevent OpenRouter model list fallback for Nous provider Authored by PercyDikec. Fixes #574. # Conflicts: # hermes_cli/setup.py	2026-03-07 16:22:13 -08:00
teknium1	faab73ad58	Merge PR #573 : fix(doctor): detect OpenAI custom endpoint env settings Authored by stablegenius49. Fixes #572.	2026-03-07 16:16:08 -08:00
d 🔹	ee5daba061	fix: resolve systemd restart loop with --replace flag (#576 ) When running under systemd, the gateway could enter restart loops in two scenarios: 1. The previous gateway process hasn't fully exited when systemd starts a new one, causing 'Gateway already running (PID ...)' → exit 1 → restart → same error → infinite loop. 2. The interactive CLI exits immediately in non-TTY mode, and systemd keeps restarting it. Changes: - Add --replace flag to 'hermes gateway run' that gracefully kills any existing gateway instance (SIGTERM → wait 10s → SIGKILL) before starting, preventing the PID-lock deadlock. - Update the generated systemd unit template to use --replace by default, add ExecStop for clean shutdown, set KillMode=mixed and TimeoutStopSec=15 for proper process management. - Existing behavior (without --replace) is unchanged: still prints the error message and exits, now also mentioning the --replace option. Fixes #576	2026-03-07 18:08:12 +00:00
teknium1	48e0dc8791	feat: implement Z.AI endpoint detection for API key validation Added functionality to detect the appropriate Z.AI endpoint based on the provided API key, accommodating different billing plans and regions. The setup process now probes available endpoints and updates the configuration accordingly, enhancing user experience and reducing potential billing errors. Updated the setup model provider function to integrate this new detection logic.	2026-03-07 09:43:37 -08:00
teknium1	064c009deb	feat: show update-available notice in CLI banner Check how many commits behind origin/main the local repo is and display a warning in the welcome banner: ⚠ 12 commits behind — run hermes update to update - git fetch cached for 6 hours (avoids repeated network calls) - Falls back gracefully if offline or not a git repo - Never breaks the banner — all errors silently caught	2026-03-07 07:35:36 -08:00
teknium1	caab1cf453	fix: update setup/config UI for local browser mode - tools_config.py: Add 'Local Browser' as first provider option (no API keys needed, same npm install for agent-browser) - setup.py: Show 'Browser Automation (local)' when agent-browser CLI is found but no Browserbase key is set - config.py: Mark BROWSERBASE_* descriptions as optional - status.py: Note that local browser works without Browserbase	2026-03-07 01:23:27 -08:00
teknium1	348936752a	fix: simplify timezone migration to use os.getenv directly The previous 'get_env_value' in dir() check always evaluated to False (dir() returns local scope, not module scope), making the left branch dead code. Simplified to just os.getenv() which was the fallback anyway.	2026-03-07 00:05:05 -08:00
teknium1	69a36a3361	Merge PR #309 : fix(timezone): timezone-aware now() for prompt, cron, and execute_code Authored by areu01or00. Adds timezone support via hermes_time.now() helper with IANA timezone resolution (HERMES_TIMEZONE env → config.yaml → server-local). Updates system prompt timestamp, cron scheduling, and execute_code sandbox TZ injection. Includes config migration (v4→v5) and comprehensive test coverage.	2026-03-07 00:04:41 -08:00
teknium1	f55f625277	chore: reorder terminal backends in setup wizard Local, Docker, Modal, SSH, Daytona, Singularity (Linux-only, last).	2026-03-06 22:21:57 -08:00
teknium1	9dac85b069	fix: uv pip install fails outside venv in setup wizard uv pip install requires a virtual environment by default. When hermes is installed system-wide or via pipx, the setup wizard's SDK installs (daytona, swe-rex[modal], tinker-atropos) fail with 'No virtual environment found'. Fix by passing --python sys.executable to uv, which targets the correct Python regardless of venv state. Also show the actual error message on install failure so users can debug.	2026-03-06 21:55:33 -08:00
teknium1	a62a137a4f	fix: handle dict-format model config in setup wizard display config['model'] can be a dict (old format: {default, base_url, provider}) or a string (new format). The setup wizard was showing the raw dict in 'Keep current' and 'Model set to' messages. Now extracts the model name from either format.	2026-03-06 21:11:40 -08:00
teknium1	82b18e8ac2	feat: unify hermes tools and hermes setup tools into single flow Both 'hermes tools' and 'hermes setup tools' now use the same unified flow in tools_config.py: 1. Select platform (CLI, Telegram, Discord, etc.) 2. Toggle all 18 toolsets on/off in checklist 3. Newly enabled tools that need API keys → provider-aware config (e.g., TTS shows Edge/OpenAI/ElevenLabs picker) 4. Already-configured tools that stay enabled → silent, no prompts 5. Menu option: 'Reconfigure an existing tool' for updating providers or API keys on tools that are already set up Key changes: - Move TOOL_CATEGORIES, provider config, and post-setup hooks from setup.py to tools_config.py - Replace flat _check_and_prompt_requirements() with provider-aware _configure_toolset() that uses TOOL_CATEGORIES - Add _reconfigure_tool() flow for updating existing configs - setup.py's setup_tools() now delegates to tools_command() - tools_command() menu adds 'Reconfigure' option alongside platforms - Only prompt for API keys on tools that are NEWLY toggled on AND don't already have keys configured No breaking changes. All 2013 tests pass.	2026-03-06 21:02:00 -08:00
teknium1	0111c9848d	fix: remove ANSI codes and em dashes from menu labels simple_term_menu miscalculates string widths when labels contain ANSI escape codes (from color()) or em dashes, causing duplicated and garbled lines on arrow key navigation. Replace color() status indicators with plain text [configured]/[active] and em dashes with regular dashes in all prompt_choice/prompt_checklist labels.	2026-03-06 21:02:00 -08:00
teknium1	ab9cadfeee	feat: modular setup wizard with section subcommands and tool-first UX Restructure the monolithic hermes setup wizard into independently-runnable sections with a category-first tool configuration experience. Changes: - Break setup into 5 sections: model, terminal, gateway, tools, agent - Each section is a standalone function, runnable individually via 'hermes setup model', 'hermes setup terminal', etc. - Returning users get a menu: Quick Setup / Full Setup / individual sections - First-time users get a guided walkthrough of all sections Tool Configuration UX overhaul: - Replace flat API key checklist with category-first approach - Show tool types (TTS, Web Search, Image Gen, etc.) as top-level items - Within each category, let users pick a provider: - TTS: Microsoft Edge (Free), OpenAI, ElevenLabs - Web: Firecrawl Cloud, Firecrawl Self-Hosted - Image Gen: FAL.ai - Browser: Browserbase - Smart Home: Home Assistant - RL Training: Tinker/Atropos - GitHub: Personal Access Token - Shows configured status on each tool and provider - Only prompts for API keys after provider selection Also: - Add section argument to setup argparse parser in main.py - Update summary to show new section commands - Add self-hosted Firecrawl and Home Assistant to tool setup - All 2013 tests pass	2026-03-06 21:02:00 -08:00
PercyDikec	8bf28e1441	fix(setup): prevent OpenRouter model list fallback for Nous provider When `fetch_nous_models()` fails silently during setup, the model selection falls through to the OpenRouter static list. Users then pick models in OpenRouter format (e.g. `anthropic/claude-opus-4.6`) which the Nous inference API rejects with a 400 "missing model" error. Add an explicit `elif selected_provider == "nous"` branch that prompts for manual model entry instead of falling through to the generic OpenRouter fallback.	2026-03-07 07:16:22 +03:00
stablegenius49	5609117882	fix(doctor): recognize OPENAI_API_KEY custom endpoint config	2026-03-06 19:47:09 -08:00
teknium1	388dd4789c	feat: add z.ai/GLM, Kimi/Moonshot, MiniMax as first-class providers Adds 4 new direct API-key providers (zai, kimi-coding, minimax, minimax-cn) to the inference provider system. All use standard OpenAI-compatible chat/completions endpoints with Bearer token auth. Core changes: - auth.py: Extended ProviderConfig with api_key_env_vars and base_url_env_var fields. Added providers to PROVIDER_REGISTRY. Added provider aliases (glm, z-ai, zhipu, kimi, moonshot). Added auto-detection of API-key providers in resolve_provider(). Added resolve_api_key_provider_credentials() and get_api_key_provider_status() helpers. - runtime_provider.py: Added generic API-key provider branch in resolve_runtime_provider() — any provider with auth_type='api_key' is automatically handled. - main.py: Added providers to hermes model menu with generic _model_flow_api_key_provider() flow. Updated _has_any_provider_configured() to check all provider env vars. Updated argparse --provider choices. - setup.py: Added providers to setup wizard with API key prompts and curated model lists. - config.py: Added env vars (GLM_API_KEY, KIMI_API_KEY, MINIMAX_API_KEY, etc.) to OPTIONAL_ENV_VARS. - status.py: Added API key display and provider status section. - doctor.py: Added connectivity checks for each provider endpoint. - cli.py: Updated provider docstrings. Docs: Updated README.md, .env.example, cli-config.yaml.example, cli-commands.md, environment-variables.md, configuration.md. Tests: 50 new tests covering registry, aliases, resolution, auto-detection, credential resolution, and runtime provider dispatch. Inspired by PR #33 (numman-ali) which proposed a provider registry approach. Credit to tars90percent (PR #473) and manuelschipper (PR #420) for related provider improvements merged earlier in this changeset.	2026-03-06 18:55:18 -08:00
teknium1	3c6c11b7c9	Merge PR #420 : fix: respect OPENAI_BASE_URL when resolving API key priority Authored by manuelschipper. Adds GLM-4.7 and GLM-5 context lengths (202752) to model_metadata.py. The key priority fix (prefer OPENAI_API_KEY for non-OpenRouter endpoints) was already applied in PR #295; merged the Z.ai mention into the comment.	2026-03-06 18:43:13 -08:00
teknium1	94053d75a6	fix: custom endpoint no longer leaks OPENROUTER_API_KEY (#560 ) API key selection is now base_url-aware: when the resolved base_url targets OpenRouter, OPENROUTER_API_KEY takes priority (preserving the #289 fix). When hitting any other endpoint (Z.ai, vLLM, custom, etc.), OPENAI_API_KEY takes priority so the OpenRouter key doesn't leak. Applied in both the runtime provider resolver (the real code path) and the CLI initial default (for consistency). Fixes #560.	2026-03-06 17:16:14 -08:00
teknium1	f2fdde5ba4	fix: show user-modified skills count in hermes update output	2026-03-06 16:14:43 -08:00
Teknium	6d3804770c	Merge pull request #552 from NousResearch/feat/insights feat: /insights command — usage analytics, cost estimation & activity patterns	2026-03-06 16:00:28 -08:00
teknium1	ab0f4126cf	fix: restore all removed bundled skills + fix skills sync system - Restored 21 skills removed in commits `757d012` and `740dd92`: accelerate, audiocraft, code-review, faiss, flash-attention, gguf, grpo-rl-training, guidance, llava, nemo-curator, obliteratus, peft, pytorch-fsdp, pytorch-lightning, simpo, slime, stable-diffusion, tensorrt-llm, torchtitan, trl-fine-tuning, whisper - Rewrote sync_skills() with proper update semantics: * New skills (not in manifest): copied to user dir * Existing skills (in manifest + on disk): updated via hash comparison * User-deleted skills (in manifest, not on disk): respected, not re-added * Stale manifest entries (removed from bundled): cleaned from manifest - Added sync_skills() to CLI startup (cmd_chat) and gateway startup (start_gateway) — previously only ran during 'hermes update' - Updated cmd_update output to show new/updated/cleaned counts - Rewrote tests: 20 tests covering manifest CRUD, dir hashing, fresh install, user deletion respect, update detection, stale cleanup, and name collision handling 75 bundled skills total. 2002 tests pass.	2026-03-06 15:57:30 -08:00
teknium1	b52b37ae64	feat: add /insights command with usage analytics and cost estimation Inspired by Claude Code's /insights, adapted for Hermes Agent's multi-platform architecture. Analyzes session history from state.db to produce comprehensive usage insights. Features: - Overview stats: sessions, messages, tokens, estimated cost, active time - Model breakdown: per-model sessions, tokens, and cost estimation - Platform breakdown: CLI vs Telegram vs Discord etc. (unique to Hermes) - Tool usage ranking: most-used tools with percentages - Activity patterns: day-of-week chart, peak hours, streaks - Notable sessions: longest, most messages, most tokens, most tool calls - Cost estimation: real pricing data for 25+ models (OpenAI, Anthropic, DeepSeek, Google, Meta) with fuzzy model name matching - Configurable time window: --days flag (default 30) - Source filtering: --source flag to filter by platform Three entry points: - /insights slash command in CLI (supports --days and --source flags) - /insights slash command in gateway (compact markdown format) - hermes insights CLI subcommand (standalone) Includes 56 tests covering pricing helpers, format helpers, empty DB, populated DB with multi-platform data, filtering, formatting, and edge cases.	2026-03-06 14:04:59 -08:00
teknium1	b4873a5de7	fix(setup): Escape skips instead of exiting, add control hints to all prompts Previously pressing Escape in any setup wizard menu called sys.exit(1), killing the entire wizard with no way to recover. Now: - prompt_choice: Escape keeps the current default and moves on (prints 'Skipped (keeping current)'). Shows '↑/↓ Navigate Enter Select Esc Skip Ctrl+C Exit' hint. - prompt_checklist: Escape returns pre-selected items instead of empty list. Shows 'SPACE Toggle ENTER Confirm ESC Skip Ctrl+C Exit'. - prompt_yes_no: now catches KeyboardInterrupt/EOFError properly. - Fallback number prompts also show control hints. Ctrl+C still exits the wizard cleanly.	2026-03-06 05:27:11 -08:00
teknium1	ecb8148a9f	Merge PR #448 : fix(cli): use correct dict key for codex auth file path in status output Authored by PercyDikec. Fixes #447. The status display used codex_status.get('auth_file') but get_codex_auth_status() in auth.py returns the path under 'auth_store' (line 1220). This one-char key mismatch silently dropped the auth file path from 'hermes status'.	2026-03-06 04:34:46 -08:00
teknium1	145f719d30	Merge PR #469 : fix(config): route API keys and tokens to .env instead of config.yaml Authored by ygd58. Fixes #465. Adds missing keys to allowlist and catch-all patterns (_API_KEY, _TOKEN suffixes) for future-proofing.	2026-03-06 04:23:49 -08:00
teknium1	8481fdcf08	docs: complete Daytona backend documentation coverage Update all remaining files that enumerate terminal backends to include Daytona. Covers security docs (bypass info, backend comparison table), environment variables reference (DAYTONA_API_KEY, TERMINAL_DAYTONA_IMAGE, container resources header), AGENTS.md (architecture tree, config keys), environments/README.md, hermes_base_env.py field description, and various module docstrings. Follow-up to PR #451 merge.	2026-03-06 03:37:05 -08:00
teknium1	39299e2de4	Merge PR #451 : feat: Add Daytona environment backend Authored by rovle. Adds Daytona as the sixth terminal execution backend with cloud sandboxes, persistent workspaces, and full CLI/gateway integration. Includes 24 unit tests and 8 integration tests.	2026-03-06 03:32:40 -08:00
teknium1	f6f3d1de9b	fix: review fixes — path traversal guard, trust_style consistency, edge cases Address code review findings: Security (Medium): - Path traversal guard in OptionalSkillSource.fetch() — resolve() and validate that the path stays within optional-skills/ before reading Bug fixes (Medium): - Add 'builtin' to trust_style dicts in do_inspect() and _resolve_short_name() — official skills now show bright_cyan 'official' label consistently across all display functions (5/5 dicts fixed) Edge cases (Low): - Clamp page_size to [1, 100] in do_browse() to prevent ZeroDivisionError - Update SkillMeta.source docstring to include 'official' - Add browse command to optional-skills/DESCRIPTION.md	2026-03-06 01:40:01 -08:00
teknium1	ec0fe3242a	feat: 'hermes skills browse' — paginated browsing of all hub skills Add a browse command that shows all available skills across all registries, paginated and sorted with official skills first. Usage: hermes skills browse # all sources, page 1 hermes skills browse --source official # only official optional skills hermes skills browse --page 2 # page 2 hermes skills browse --size 30 # 30 per page /skills browse # slash command in chat Features: - Official optional skills always appear first (★ marker, cyan styling) - Per-source limits prevent overloading (100 official/github, 50 others) - Deduplication by name preferring higher trust - Sorted: official > trusted > community, then alphabetical - Page navigation hints at bottom - Source counts summary - Works in both CLI and /skills chat interface - Added 'official' as source filter option for search command too	2026-03-06 01:29:45 -08:00
teknium1	f2e24faaca	feat: optional skills — official skills shipped but not activated by default Add 'optional-skills/' directory for official skills that ship with the repo but are not copied to ~/.hermes/skills/ during setup. They are: - NOT shown to the model in the system prompt - NOT copied during hermes setup/update - Discoverable via 'hermes skills search' labeled as 'official' - Installable via 'hermes skills install' with builtin trust (no third-party warning) - Auto-categorized on install based on directory structure Implementation: - OptionalSkillSource adapter in tools/skills_hub.py (search/fetch/inspect) - Added to create_source_router() as first source (highest priority) - Trust level 'builtin' for official skills in skills_guard.py - Friendly install message for official skills (no third-party warning) - 'official' label in cyan in search results and skill list First optional skill: Blackbox CLI (autonomous-ai-agents/blackbox) - Multi-model coding agent with built-in judge/Chairman pattern - Delegates to Claude, Codex, Gemini, and Blackbox models - Open-source CLI (GPL-3.0, TypeScript, forked from Gemini CLI) - Requires paid Blackbox AI API key Refs: #475	2026-03-06 01:24:11 -08:00
teknium1	8c80b96318	chore: update OpenRouter model list - Remove opus-4.5 and gpt-5.2 - Reorder GPT: 5.4-pro, 5.4, 5.3-codex - Add qwen/qwen3.5-plus-02-15 and qwen/qwen3.5-35b-a3b - Update z-ai/glm-4.7 → glm-5 - Update minimax/minimax-m2.1 → minimax-m2.5	2026-03-06 00:52:45 -08:00
teknium1	2387465dcc	chore: add openai/gpt-5.4-pro and stepfun/step-3.5-flash to OpenRouter models	2026-03-06 00:49:25 -08:00
ygd58	6055adbe1b	fix(config): route API keys and tokens to .env instead of config.yaml	2026-03-06 08:55:36 +01:00
teknium1	2317d115cd	fix: clipboard image paste on WSL2, Wayland, and VSCode terminal The original implementation only supported xclip (X11), which silently fails on WSL2 (can't access Windows clipboard for images), Wayland desktops (xclip is X11-only), and VSCode terminal on WSL2. Clipboard backend changes (hermes_cli/clipboard.py): - WSL2: detect via /proc/version, use powershell.exe with .NET System.Windows.Forms.Clipboard to extract images as base64 PNG - Wayland: use wl-paste with MIME type detection, auto-convert BMP to PNG for WSLg environments (via Pillow or ImageMagick) - Dispatch order: WSL → Wayland → X11 (xclip), with fallthrough - New has_clipboard_image() for lightweight clipboard checks - Cache WSL detection result per-process CLI changes (cli.py): - /paste command: explicit clipboard image check for terminals where BracketedPaste doesn't fire (image-only clipboard in VSCode/WinTerm) - Ctrl+V keybinding: fallback for Linux terminals where Ctrl+V sends raw byte instead of triggering bracketed paste Tests: 80 tests (up from 37) covering WSL, Wayland, X11 dispatch, BMP conversion, has_clipboard_image, and /paste command.	2026-03-05 20:22:44 -08:00
teknium1	399562a7d1	feat: clipboard image paste in CLI (Cmd+V / Ctrl+V) Copy an image to clipboard (screenshot, browser, etc.) and paste into the Hermes CLI. The image is saved to ~/.hermes/images/, shown as a badge above the input ([📎 Image #1]), and sent to the model as a base64-encoded OpenAI vision multimodal content block. Implementation: - hermes_cli/clipboard.py: clean module with platform-specific extraction - macOS: pngpaste (if installed) → osascript fallback (always available) - Linux: xclip (apt install xclip) - cli.py: BracketedPaste key handler checks clipboard on every paste, image bar widget shows attached images, chat() converts to multimodal content format, Ctrl+C clears attachments Inspired by @m0at's fork (https://github.com/m0at/hermes-agent) which implemented image paste support for local vision models. Reimplemented cleanly as a separate module with tests.	2026-03-05 17:55:41 -08:00
teknium1	a41ba57a7a	Merge PR #460 : feat(tools): add support for self-hosted firecrawl Authored by caentzminger. Adds optional FIRECRAWL_API_URL env var to point the Firecrawl client at a self-hosted instance instead of the cloud API.	2026-03-05 16:41:30 -08:00
teknium1	884c8ea70a	chore: add openai/gpt-5.4 to OpenRouter preferred models list	2026-03-05 16:13:45 -08:00
teknium1	c886333d32	feat: smart context length probing with persistent caching + banner display Replaces the unsafe 128K fallback for unknown models with a descending probe strategy (2M → 1M → 512K → 200K → 128K → 64K → 32K). When a context-length error occurs, the agent steps down tiers and retries. The discovered limit is cached per model+provider combo in ~/.hermes/context_length_cache.yaml so subsequent sessions skip probing. Also parses API error messages to extract the actual context limit (e.g. 'maximum context length is 32768 tokens') for instant resolution. The CLI banner now displays the context window size next to the model name (e.g. 'claude-opus-4 · 200K context · Nous Research'). Changes: - agent/model_metadata.py: CONTEXT_PROBE_TIERS, persistent cache (save/load/get), parse_context_limit_from_error(), get_next_probe_tier() - agent/context_compressor.py: accepts base_url, passes to metadata - run_agent.py: step-down logic in context error handler, caches on success - cli.py + hermes_cli/banner.py: context length in welcome banner - tests: 22 new tests for probing, parsing, and caching Addresses #132. PR #319's approach (8K default) rejected — too conservative.	2026-03-05 16:09:57 -08:00
caentzminger	d7d10b14cd	feat(tools): add support for self-hosted firecrawl Adds optional FIRECRAWL_API_URL environment variable to support self-hosted Firecrawl deployments alongside the cloud service. - Add FIRECRAWL_API_URL to optional env vars in hermes_cli/config.py - Update _get_firecrawl_client() in tools/web_tools.py to accept custom API URL - Add tests for client initialization with/without URL - Document new env var in installation and config guides	2026-03-05 16:16:18 -06:00
rovle	5279540bb4	fix(daytona): add missing config mappings in gateway, CLI defaults, and config display Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:12:50 -08:00
PercyDikec	ad57bf1e4b	fix(cli): use correct dict key for codex auth file path in status output	2026-03-05 21:27:12 +03:00
rovle	df61054a84	feat(cli): add Daytona to setup wizard, doctor, and status display Add Daytona as a backend choice in the interactive setup wizard with SDK installation and API key prompts. Show Daytona image in status output and validate API key + SDK in doctor checks. Add OPTION 6 example in cli-config.yaml.example. Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:02:22 -08:00
rovle	690b8bb563	feat(cli): add Daytona config mapping and env var sync Wire TERMINAL_DAYTONA_IMAGE through cli.py env_mappings and hermes_cli/config.py so `hermes config set` propagates correctly.	2026-03-05 10:02:21 -08:00
teknium1	fe15a2c65c	Merge PR #274 : fix(setup): handle TerminalMenu init failures with safe fallback Authored by jdblackstar. Catches runtime exceptions from TerminalMenu init (e.g. CalledProcessError from tput with unknown TERM like xterm-ghostty over SSH) and falls through to the text-based menu.	2026-03-05 01:26:58 -08:00
Dev User	3221818b6e	fix: respect OPENAI_BASE_URL when resolving API key priority When base_url points to a non-OpenRouter endpoint (e.g. Z.ai), OPENROUTER_API_KEY incorrectly takes priority over OPENAI_API_KEY, sending the wrong credentials. This causes 401 errors on the main inference path and forces users to comment out OPENROUTER_API_KEY, which then breaks auxiliary clients (compression, vision). Fix: check whether base_url contains "openrouter" and swap the key priority accordingly. Also adds GLM-4.7 and GLM-5 context lengths to DEFAULT_CONTEXT_LENGTHS.	2026-03-05 08:25:16 +00:00
teknium1	db58cfb13d	Merge PR #269 : Fix nous refresh token rotation failure on key mint failure Fixes a bug where the refresh token was not persisted when the API key mint failed (e.g., 402 insufficient credits, timeout). The rotated refresh token was lost, causing subsequent auth attempts to fail with a stale token. Changes: - Persist auth state immediately after each successful token refresh, before attempting the mint - Use latest in-memory refresh token on mint-retry paths (was using the stale original) - Atomic durable writes for auth.json (temp file + fsync + replace) - Opt-in OAuth trace logging (HERMES_OAUTH_TRACE=1, fingerprint-only) - 3 regression tests covering refresh+402, refresh+timeout, and invalid-token retry behavior Author: Robin Fernandes <rewbs>	2026-03-04 17:52:10 -08:00
teknium1	4c72329412	feat: add backend validation for required binaries in setup wizard Implemented checks to ensure that necessary binaries (Docker, Singularity, SSH) are installed for the selected backend in the setup wizard. If a required binary is missing, the user is prompted to proceed with a fallback to the local backend. This enhances user experience by preventing potential runtime errors due to missing dependencies.	2026-03-04 14:49:23 -08:00
teknium1	7f9777a0b0	feat: add container resource configuration prompts in setup wizard Introduced interactive prompts for configuring container resource settings (CPU, memory, disk, persistence) during the setup wizard. Updated the default configuration to include these settings and improved user guidance on their implications for Docker, Singularity, and Modal backends. This enhancement aims to streamline the setup process and provide users with clearer options for resource management.	2026-03-04 03:29:05 -08:00

1 2 3 4

166 Commits