hermes-agent

Author	SHA1	Message	Date
kshitijk4poor	38bcaa1e86	chore: remove langfuse doc, smoketest script, and installed-plugin test Made-with: Cursor	2026-04-05 23:31:29 -07:00
kshitijk4poor	f530ef1835	feat(plugins): pre_api_request/post_api_request with narrow payloads - Rename per-LLM-call hooks from pre_llm_request/post_llm_request for clarity vs pre_llm_call - Emit summary kwargs only (counts, usage dict from normalize_usage); keep env_var_enabled for HERMES_DUMP_REQUESTS - Add is_truthy_value/env_var_enabled to utils; wire hermes_cli.plugins._env_enabled through it - Update Langfuse local setup doc; add scripts/langfuse_smoketest.py and optional ~/.hermes plugin tests Made-with: Cursor	2026-04-05 23:31:29 -07:00
kshitijk4poor	9e820dda37	Add request-scoped plugin lifecycle hooks	2026-04-05 23:31:29 -07:00
Teknium	dce5f51c7c	feat: config structure validation — detect malformed YAML at startup (#5426 ) Add validate_config_structure() that catches common config.yaml mistakes: - custom_providers as dict instead of list (missing '-' in YAML) - fallback_model accidentally nested inside another section - custom_providers entries missing required fields (name, base_url) - Missing model section when custom_providers is configured - Root-level keys that look like misplaced custom_providers fields Surface these diagnostics at three levels: 1. Startup: print_config_warnings() runs at CLI and gateway module load, so users see issues before hitting cryptic errors 2. Error time: 'Unknown provider' errors in auth.py and model_switch.py now include config diagnostics with fix suggestions 3. Doctor: 'hermes doctor' shows a Config Structure section with all issues and fix hints Also adds a warning log in runtime_provider.py when custom_providers is a dict (previously returned None silently). Motivated by a Discord user who had malformed custom_providers YAML and got only 'Unknown Provider' with no guidance on what was wrong. 17 new tests covering all validation paths.	2026-04-05 23:31:20 -07:00
Teknium	9ca954a274	fix: mem0 API v2 compat, prefetch context fencing, secret redaction (#5423 ) Consolidated salvage from PRs #5301 (qaqcvc), #5339 (lance0), #5058 and #5098 (maymuneth). Mem0 API v2 compatibility (#5301): - All reads use filters={user_id: ...} instead of bare user_id= kwarg - All writes use filters with user_id + agent_id for attribution - Response unwrapping for v2 dict format {results: [...]} - Split _read_filters() vs _write_filters() — reads are user-scoped only for cross-session recall, writes include agent_id - Preserved 'hermes-user' default (no breaking change for existing users) - Omitted run_id scoping from #5301 — cross-session memory is Mem0's core value, session-scoping reads would defeat that purpose Memory prefetch context fencing (#5339): - Wraps prefetched memory in <memory-context> fenced blocks with system note marking content as recalled context, NOT user input - Sanitizes provider output to strip fence-escape sequences, preventing injection where memory content breaks out of the fence - API-call-time only — never persisted to session history Secret redaction (#5058, #5098): - Added prefix patterns for Groq (gsk_), Matrix (syt_), RetainDB (retaindb_), Hindsight (hsk-), Mem0 (mem0_), ByteRover (brv_)	2026-04-05 22:43:33 -07:00
Teknium	786970925e	fix(cli): add missing subprocess.run() timeouts in gateway CLI (#5424 ) All 35 subprocess.run() calls in hermes_cli/gateway.py lacked timeout parameters. If systemctl, launchctl, loginctl, wmic, or ps blocks, hermes gateway start/stop/restart/status/install/uninstall hangs indefinitely with no feedback. Timeouts tiered by operation type: - 10s: instant queries (is-active, status, list, ps, tail, journalctl) - 30s: fast lifecycle (daemon-reload, enable, start, bootstrap, kickstart) - 90s: graceful shutdown (stop, restart, bootout, kickstart -k) — exceeds our TimeoutStopSec=60 to avoid premature timeout during shutdown Special handling: _is_service_running() and launchd_status() catch TimeoutExpired and treat it as not-running/not-loaded, consistent with how non-zero return codes are already handled. Inspired by PR #3732 (dlkakbs) and issue #4057 (SHL0MS). Reimplemented on current main which has significantly changed launchctl handling (bootout/bootstrap/kickstart vs legacy load/unload/start/stop).	2026-04-05 22:41:42 -07:00
Teknium	ab086a320b	chore: remove qwen-3.6 free from nous portal model list	2026-04-05 22:40:34 -07:00
Teknium	aa56df090f	fix: allow env var overrides for Nous portal/inference URLs (#5419 ) The _login_nous() call site was pre-filling portal_base_url, inference_base_url, client_id, and scope with pconfig defaults before passing them to _nous_device_code_login(). Since pconfig defaults are always truthy, the env var checks inside the function (HERMES_PORTAL_BASE_URL, NOUS_PORTAL_BASE_URL, NOUS_INFERENCE_BASE_URL) could never take effect. Fix: pass None from the call site when no CLI flag is provided, letting the function's own priority chain handle defaults correctly: explicit CLI flag > env var > pconfig default. Addresses the issue reported in PR #5397 by jquesnelle.	2026-04-05 22:33:24 -07:00
SHL0MS	033e971140	Merge pull request #5421 from NousResearch/fix/research-paper-writing-gaps feat(research-paper-writing): fill coverage gaps, integrate AI-Scientist & GPT-Researcher patterns	2026-04-06 01:13:49 -04:00
SHL0MS	95a044a2e0	feat(research-paper-writing): fill coverage gaps and integrate patterns from AI-Scientist, GPT-Researcher Fix duplicate step numbers (5.3, 7.3) and missing 7.5. Add coverage for human evaluation, theory/survey/benchmark/position papers, ethics/broader impact, arXiv strategy, code packaging, negative results, workshop papers, multi-author coordination, compute budgeting, and post-acceptance deliverables. Integrate ensemble reviewing with meta-reviewer and negative bias, pre-compilation validation pipeline, experiment journal with tree structure, breadth/depth literature search, context management for large projects, two-pass refinement, VLM visual review, and claim verification. New references: human-evaluation.md, paper-types.md.	2026-04-06 01:12:32 -04:00
Teknium	38d8446011	feat: implement MCP OAuth 2.1 PKCE client support (#5420 ) Implement tools/mcp_oauth.py — the OAuth adapter that mcp_tool.py's existing auth: oauth hook has been waiting for. Components: - HermesTokenStorage: persists tokens + client registration to HERMES_HOME/mcp-tokens/<server>.json with 0o600 permissions - Callback handler factory: per-flow isolated HTTP handlers (safe for concurrent OAuth flows across multiple MCP servers) - OAuthClientProvider integration: wraps the MCP SDK's httpx.Auth subclass which handles discovery, DCR, PKCE, token exchange, refresh, and step-up auth (403 insufficient_scope) automatically - Non-interactive detection: warns when gateway/cron environments try to OAuth without cached tokens - Pre-registered client support: injects client_id/secret from config for servers that don't support Dynamic Client Registration (e.g. Slack) - Path traversal protection on server names - remove_oauth_tokens() for cleanup Config format: mcp_servers: sentry: url: 'https://mcp.sentry.dev/mcp' auth: oauth oauth: # all optional client_id: '...' # skip DCR client_secret: '...' # confidential client scope: 'read write' # server-provided by default Also passes oauth config dict through from mcp_tool.py (was passing only server_name and url before). E2E verified: full OAuth flow (401 → discovery → DCR → authorize → token exchange → authenticated request → tokens persisted) against local test servers. 23 unit tests + 186 MCP suite tests pass.	2026-04-05 22:08:00 -07:00
emozilla	3962bc84b7	show cache pricing as well (if supported)	2026-04-05 22:02:21 -07:00
emozilla	0365f6202c	feat: show model pricing for OpenRouter and Nous Portal providers Display live per-million-token pricing from /v1/models when listing models for OpenRouter or Nous Portal. Prices are shown in a column-aligned table with decimal points vertically aligned for easy comparison. Pricing appears in three places: - /provider slash command (table with In/Out headers) - hermes model picker (aligned columns in both TerminalMenu and numbered fallback) Implementation: - Add fetch_models_with_pricing() in models.py with per-base_url module-level cache (one network call per endpoint per session) - Add _format_price_per_mtok() with fixed 2-decimal formatting - Add format_model_pricing_table() for terminal table display - Add get_pricing_for_provider() convenience wrapper - Update _prompt_model_selection() to accept optional pricing dict - Wire pricing through _model_flow_openrouter/nous in main.py - Update test mocks for new pricing parameter	2026-04-05 22:02:21 -07:00
Teknium	0efe7dace7	feat: add GPT/Codex execution discipline guidance for tool persistence (#5414 ) Adds OPENAI_MODEL_EXECUTION_GUIDANCE — XML-tagged behavioral guidance injected for GPT and Codex models alongside the existing tool-use enforcement. Targets four specific failure modes: - <tool_persistence>: retry on empty/partial results instead of giving up - <prerequisite_checks>: do discovery/lookup before jumping to final action - <verification>: check correctness/grounding/formatting before finalizing - <missing_context>: use lookup tools instead of hallucinating Follows the same injection pattern as GOOGLE_MODEL_OPERATIONAL_GUIDANCE for Gemini/Gemma models. Inspired by OpenClaw PR #38953 and OpenAI's GPT-5.4 prompting guide patterns.	2026-04-05 21:51:07 -07:00
SHL0MS	4e196a5428	Merge pull request #5411 from SHL0MS/fix/manim-monospace-fonts fix(manim-video): recommend monospace fonts — proportional fonts have broken kerning	2026-04-06 00:36:19 -04:00
SHL0MS	b26e7fd43a	fix(manim-video): recommend monospace fonts — proportional fonts have broken kerning in Pango Manim's Pango text renderer produces broken kerning with proportional fonts (Helvetica, Inter, SF Pro, Arial) at all sizes and resolutions. Characters overlap and spacing is inconsistent. This is a fundamental Pango limitation. Changes: - Recommend Menlo (monospace) as the default font for ALL text - Proportional fonts only acceptable for large titles (>=48, short strings) - Set minimum font_size=18 for readability - Update all code examples to use MONO='Menlo' pattern - Remove Inter/Helvetica/SF Pro from recommendations	2026-04-06 00:35:43 -04:00
SHL0MS	084cd1f840	Merge pull request #5408 from SHL0MS/feat/manim-skill-improvements docs(manim-video): expand references with Manim CE API coverage and 3b1b production patterns	2026-04-06 00:09:25 -04:00
SHL0MS	447ec076a4	docs(manim-video): expand references with comprehensive Manim CE and 3b1b patterns Adds 601 lines across 6 reference files, sourced from deep review of: - Manim CE v0.20.1 full reference manual - 3b1b/manim example_scenes.py and source modules - 3b1b/videos production CLAUDE.md and workflow patterns - Manim CE thematic guides (voiceover, text, configuration) animations.md: always_redraw, TracedPath, FadeTransform, TransformFromCopy, ApplyMatrix, squish_rate_func, ShowIncreasingSubsets, ShowPassingFlash, expanded rate functions mobjects.md: SVGMobject, ImageMobject, Variable, BulletedList, DashedLine, Angle/RightAngle, boolean ops, LabeledArrow, t2c/t2f/t2s/t2w per-substring styling, backstroke for readability, apply_complex_function with prepare_for_nonlinear_transform equations.md: substrings_to_isolate, multi-line equations, TransformMatchingTex with matched_keys and key_map, set_color_by_tex graphs-and-data.md: Graph/DiGraph with layout algorithms, ArrowVectorField/StreamLines, ComplexPlane/PolarPlane camera-and-3d.md: ZoomedScene with inset zoom, LinearTransformationScene for 3b1b-style linear algebra rendering.md: manim.cfg project config, self.next_section() chapter markers, manim-voiceover plugin with ElevenLabs/GTTS integration and bookmark-based audio sync	2026-04-06 00:08:17 -04:00
Teknium	89c812d1d2	feat: shared thread sessions by default — multi-user thread support (#5391 ) Threads (Telegram forum topics, Discord threads, Slack threads) now default to shared sessions where all participants see the same conversation. This is the expected UX for threaded conversations where multiple users @mention the bot and interact collaboratively. Changes: - build_session_key(): when thread_id is present, user_id is no longer appended to the session key (threads are shared by default) - New config: thread_sessions_per_user (default: false) — opt-in to restore per-user isolation in threads if needed - Sender attribution: messages in shared threads are prefixed with [sender name] so the agent can tell participants apart - System prompt: shared threads show 'Multi-user thread' note instead of a per-turn User line (avoids busting prompt cache) - Wired through all callers: gateway/run.py, base.py, telegram.py, feishu.py - Regular group messages (no thread) remain per-user isolated (unchanged) - DM threads are unaffected (they have their own keying logic) Closes community request from demontut_ re: thread-based shared sessions.	2026-04-05 19:46:58 -07:00
Teknium	43d468cea8	docs: comprehensive documentation audit — fix stale info, expand thin pages, add depth (#5393 ) Major changes across 20 documentation pages: Staleness fixes: - Fix FAQ: wrong import path (hermes.agent → run_agent) - Fix FAQ: stale Gemini 2.0 model → Gemini 3 Flash - Fix integrations/index: missing MiniMax TTS provider - Fix integrations/index: web_crawl is not a registered tool - Fix sessions: add all 19 session sources (was only 5) - Fix cron: add all 18 delivery targets (was only telegram/discord) - Fix webhooks: add all delivery targets - Fix overview: add missing MCP, memory providers, credential pools - Fix all line-number references → use function name searches instead - Update file size estimates (run_agent ~9200, gateway ~7200, cli ~8500) Expanded thin pages (< 150 lines → substantial depth): - honcho.md: 43 → 108 lines — added feature comparison, tools, config, CLI - overview.md: 49 → 55 lines — added MCP, memory providers, credential pools - toolsets-reference.md: 57 → 175 lines — added explanations, config examples, custom toolsets, wildcards, platform differences table - optional-skills-catalog.md: 74 → 153 lines — added 25+ missing skills across communication, devops, mlops (18!), productivity, research categories - integrations/index.md: 82 → 115 lines — added messaging, HA, plugins sections - cron-internals.md: 90 → 195 lines — added job JSON example, lifecycle states, tick cycle, delivery targets, script-backed jobs, CLI interface - gateway-internals.md: 111 → 250 lines — added architecture diagram, message flow, two-level guard, platform adapters, token locks, process management - agent-loop.md: 112 → 235 lines — added entry points, API mode resolution, turn lifecycle detail, message alternation rules, tool execution flow, callback table, budget tracking, compression details - architecture.md: 152 → 295 lines — added system overview diagram, data flow diagrams, design principles table, dependency chain Other depth additions: - context-references.md: added platform availability, compression interaction, common patterns sections - slash-commands.md: added quick commands config example, alias resolution - image-generation.md: added platform delivery table - tools-reference.md: added tool counts, MCP tools note - index.md: updated platform count (5 → 14+), tool count (40+ → 47)	2026-04-05 19:45:50 -07:00
Teknium	fec58ad99e	fix(gateway): replace wall-clock agent timeout with inactivity-based timeout (#5389 ) The gateway previously used a hard wall-clock asyncio.wait_for timeout that killed agents after a fixed duration regardless of activity. This punished legitimate long-running tasks (subagent delegation, reasoning models, multi-step research). Now uses an inactivity-based polling loop that checks the agent's built-in activity tracker (get_activity_summary) every 5 seconds. The agent can run indefinitely as long as it's actively calling tools or receiving API responses. Only fires when the agent has been completely idle for the configured duration. Changes: - Replace asyncio.wait_for with asyncio.wait poll loop checking agent idle time via get_activity_summary() - Add agent.gateway_timeout config.yaml key (default 1800s, 0=unlimited) - Update stale session eviction to use agent idle time instead of pure wall-clock (prevents evicting active long-running tasks) - Preserve all existing diagnostic logging and user-facing context Inspired by PR #4864 (Mibayy) and issue #4815 (BongSuCHOI). Reimplemented on current main using existing _touch_activity() infrastructure rather than a parallel tracker.	2026-04-05 19:38:21 -07:00
Teknium	8972eb05fd	docs: add comprehensive Discord configuration reference (#5386 ) Add full Configuration Reference section to Discord docs covering all env vars (10 total) and config.yaml options with types, defaults, and detailed explanations. Previously undocumented: DISCORD_AUTO_THREAD, DISCORD_ALLOW_BOTS, DISCORD_REACTIONS, discord.auto_thread, discord.reactions, display.tool_progress, display.tool_progress_command. Cleaned up manual setup flow to show only required vars.	2026-04-05 19:17:24 -07:00
Teknium	fc15f56fc4	feat: warn users when loading non-agentic Hermes LLM models (#5378 ) Nous Research Hermes 3 & 4 models lack tool-calling capabilities and are not suitable for agent workflows. Add a warning that fires in two places: - /model switch (CLI + gateway) via model_switch.py warning_message - CLI session startup banner when the configured model contains 'hermes' Both paths suggest switching to an agentic model (Claude, GPT, Gemini, DeepSeek, etc.).	2026-04-05 18:41:03 -07:00
Dusk1e	e9ddfee4fd	fix(plugins): reject plugin names that resolve to the plugins root Reject "." as a plugin name — it resolves to the plugins directory itself, which in force-install flows causes shutil.rmtree to wipe the entire plugins tree. - reject "." early with a clear error message - explicit check for target == plugins_resolved (raise instead of allow) - switch boundary check from string-prefix to Path.relative_to() - add regression tests for sanitizer + install flow Co-authored-by: Dusk1e <yusufalweshdemir@gmail.com>	2026-04-05 18:40:45 -07:00
Teknium	2563493466	fix: improve timeout debug logging and user-facing diagnostics (#5370 ) Agent activity tracking: - Add _last_activity_ts, _last_activity_desc, _current_tool to AIAgent - Touch activity on: API call start/complete, tool start/complete, first stream chunk, streaming request start - Public get_activity_summary() method for external consumers Gateway timeout diagnostics: - Timeout message now includes what the agent was doing when killed: actively working vs stuck on a tool vs waiting on API response - Includes iteration count, last activity description, seconds since last activity — users can distinguish legitimate long tasks from genuine hangs - 'Still working' notifications now show iteration count and current tool instead of just elapsed time - Stale lock eviction logs include agent activity state for debugging Stream stale timeout: - _emit_status when stale stream is detected (was log-only) — gateway users now see 'No response from provider for Ns' with model and context size - Improved logger.warning with model name and estimated context size Error path notifications (gateway-visible via _emit_status): - Context compression attempts now use _emit_status (was _vprint only) - Non-retryable client errors emit summary before aborting - Max retry exhaustion emits error summary (was _vprint only) - Rate limit exhaustion emits specific rate-limit message These were all CLI-visible but silent to gateway users, which is why people on Telegram/Discord saw generic 'request failed' messages without explanation.	2026-04-05 18:33:33 -07:00
SHL0MS	1572956fdc	Merge pull request #4930 from SHL0MS/feat/manim-video-skill-v2 feat(skills): add manim-video skill for mathematical and technical animations	2026-04-05 16:10:30 -07:00
SHL0MS	9d885b266c	feat(skills): add manim-video skill for mathematical and technical animations Production pipeline for creating 3Blue1Brown-style animated videos using Manim Community Edition. The agent handles the full workflow: creative planning, Python code generation, rendering, scene stitching, audio muxing, and iterative refinement. Modes: concept explainers, equation derivations, algorithm visualizations, data stories, architecture diagrams, paper explainers, 3D visualizations. 9 reference files, setup verification script, README. All API references verified against ManimCommunity/manim source.	2026-04-05 19:09:37 -04:00
donrhmexe	7409715947	fix: link subagent sessions to parent and hide from session list Subagent sessions spawned by delegate_task were created with parent_session_id=NULL and source=cli, making them indistinguishable from user sessions in hermes sessions list and /resume. Changes: - delegate_tool.py: pass parent_agent.session_id to child agent - run_agent.py: accept parent_session_id param, pass to create_session - hermes_state.py list_sessions_rich: filter parent_session_id IS NULL by default (opt-in include_children=True for callers that need them) - hermes_state.py delete_session: delete child sessions first (FK) - hermes_state.py prune_sessions: delete children before parents (FK) session_search already handles parent_session_id correctly — child sessions are filtered from recent list and resolved to parent root in full-text search results. Fixes #5122	2026-04-05 12:48:50 -07:00
Teknium	efa03fc07d	docs: update honcho CLI reference + document plugin CLI registration (#5308 ) Post PR #5295 docs audit — 4 fixes: 1. cli-commands.md: Update hermes honcho subcommand table with 4 missing commands (peers, enable, disable, sync), --target-profile flag, --all on status, correct mode values (hybrid/context/tools not hybrid/honcho/local), and note that setup redirects to hermes memory setup. 2. build-a-hermes-plugin.md: Replace 'ctx.register_command() — planned but not yet implemented' with the actual implemented ctx.register_cli_command() API. Add full Register CLI commands section with code example. 3. memory-provider-plugin.md: Add 'Adding CLI Commands' section documenting the register_cli(subparser) convention for memory provider plugins, active-provider gating, and directory structure. 4. plugins.md: Add CLI command registration to the capabilities table.	2026-04-05 12:48:20 -07:00
Teknium	4494fba140	feat: OSV malware check for MCP extension packages (#5305 ) Before launching an MCP server via npx/uvx, queries the OSV (Open Source Vulnerabilities) API to check if the package has known malware advisories (MAL-* IDs). Regular CVEs are ignored — only confirmed malware is blocked. - Free, public API (Google-maintained), ~300ms per query - Runs once per MCP server launch, inside _run_stdio() before subprocess spawn - Parallel with other MCP servers (asyncio.gather already in place) - Fail-open: network errors, timeouts, unrecognized commands → allow - Parses npm (scoped @scope/pkg@version) and PyPI (name[extras]==version) Inspired by Block/goose extension malware check.	2026-04-05 12:46:07 -07:00
Teknium	b63fb03f3f	feat(browser): add JS evaluation via browser_console expression parameter (#5303 ) Add optional 'expression' parameter to browser_console that evaluates JavaScript in the page context (like DevTools console). Returns structured results with auto-JSON parsing. No new tool — extends the existing browser_console schema with ~20 tokens of overhead instead of adding a 12th browser tool. Both backends supported: - Browserbase: uses agent-browser 'eval' command via CDP - Camofox: uses /tabs/{tab_id}/eval endpoint with graceful degradation E2E verified: string eval, number eval, structured JSON, DOM manipulation, error handling, and original console-output mode all working.	2026-04-05 12:42:52 -07:00
Teknium	8d5226753f	fix: add missing ButtonStyle.grey to discord mock for test compatibility	2026-04-05 12:42:47 -07:00
Abhey	66d0fa1778	fix: avoid unnecessary Discord members intent on startup Only request the privileged members intent when DISCORD_ALLOWED_USERS includes non-numeric entries that need username resolution. Also release the Discord token lock when startup fails so retries and restarts are not blocked by a stale lock.\n\nAdds regression tests for conditional intents and startup lock cleanup.	2026-04-05 12:42:47 -07:00
Teknium	583d9f9597	fix(honcho): migration guard for observation mode default change Existing honcho.json configs without an explicit observationMode now default to 'unified' (the old default) instead of being silently switched to 'directional'. New installations get 'directional' as the new default. Detection: _explicitly_configured (host block exists or enabled=true) signals an existing config. When true and no observationMode is set anywhere in the config chain, falls back to 'unified'. When false (fresh install), uses 'directional'. Users who explicitly set observationMode or granular observation booleans are unaffected — explicit config always wins. 5 new tests covering all migration paths.	2026-04-05 12:34:11 -07:00
Teknium	0f813c422c	fix(plugins): only register CLI commands for the active memory provider discover_plugin_cli_commands() now reads memory.provider from config.yaml and only loads CLI registration for the active provider. If no memory provider is set, no plugin CLI commands appear in the CLI. Only one memory provider can be active at a time — at most one set of plugin CLI commands is registered. Users who haven't configured honcho (or any memory provider) won't see 'hermes honcho' in their help output. Adds test for inactive provider returning empty results.	2026-04-05 12:34:11 -07:00
Teknium	b074b0b13a	test: add plugin CLI registration tests 11 tests covering: - PluginContext.register_cli_command() storage and overwrite - get_plugin_cli_commands() return semantics - Memory plugin discover_plugin_cli_commands() with register_cli convention - Skipping plugins without register_cli or cli.py - Honcho register_cli() subcommand tree structure - Mode choices updated to recall modes (hybrid/context/tools) - _ProviderCollector.register_cli_command no-op safety	2026-04-05 12:34:11 -07:00
Teknium	dd8a42bf7d	feat(plugins): plugin CLI registration system — decouple plugin commands from core Add ctx.register_cli_command() to PluginContext for general plugins and discover_plugin_cli_commands() to memory plugin system. Plugins that provide a register_cli(subparser) function in their cli.py are automatically discovered during argparse setup and wired into the CLI. - Remove 95-line hardcoded honcho argparse block from main.py - Move honcho subcommand tree into plugins/memory/honcho/cli.py via register_cli() convention - hermes honcho setup now redirects to hermes memory setup (unified path) - hermes honcho (no subcommand) shows status instead of running setup - Future plugins can register CLI commands without touching core files - PluginManager stores CLI registrations in _cli_commands dict - Memory plugin discovery scans cli.py for register_cli at argparse time main.py: -102 lines of hardcoded plugin routing	2026-04-05 12:34:11 -07:00
erosika	c02c3dc723	fix(honcho): plugin drift overhaul -- observation config, chunking, setup wizard, docs, dead code cleanup Salvaged from PR #5045 by erosika. - Replace memoryMode/peer_memory_modes with granular per-peer observation config - Add message chunking for Honcho API limits (25k chars default) - Add dialectic input guard (10k chars default) - Add dialecticDynamic toggle for reasoning level auto-bump - Rewrite setup wizard with cloud/local deployment picker - Switch peer card/profile/search from session.context() to direct peer APIs - Add server-side observation sync via get_peer_configuration() - Fix base_url/baseUrl config mismatch for self-hosted setups - Fix local auth leak (cloud API keys no longer sent to local instances) - Remove dead code: memoryMode, peer_memory_modes, linkedHosts, suppress flags, SOUL.md aiPeer sync - Add post_setup hook to memory_setup.py for provider-specific setup wizards - Comprehensive README rewrite with full config reference - New optional skill: autonomous-ai-agents/honcho - Expanded memory-providers.md with multi-profile docs - 9 new tests (chunking, dialectic guard, peer lookups), 14 dead tests removed - Fix 2 pre-existing TestResolveConfigPath filesystem isolation failures	2026-04-05 12:34:11 -07:00
Teknium	12724e6295	feat: progressive subdirectory hint discovery (#5291 ) As the agent navigates into subdirectories via tool calls (read_file, terminal, search_files, etc.), automatically discover and load project context files (AGENTS.md, CLAUDE.md, .cursorrules) from those directories. Previously, context files were only loaded from the CWD at session start. If the agent moved into backend/, frontend/, or any subdirectory with its own AGENTS.md, those instructions were never seen. Now, SubdirectoryHintTracker watches tool call arguments for file paths and shell commands, resolves directories, and loads hint files on first access. Discovered hints are appended to the tool result so the model gets relevant context at the moment it starts working in a new area — without modifying the system prompt (preserving prompt caching). Features: - Extracts paths from tool args (path, workdir) and shell commands - Loads AGENTS.md, CLAUDE.md, .cursorrules (first match per directory) - Deduplicates — each directory loaded at most once per session - Ignores paths outside the working directory - Truncates large hint files at 8K chars - Works on both sequential and concurrent tool execution paths Inspired by Block/goose SubdirectoryHintTracker.	2026-04-05 12:33:47 -07:00
Teknium	567bc79948	fix: clean up cron platform allowlist — add homeassistant, fix import, improve placement Follow-up for cherry-picked #5118 commits: - Remove duplicate 'import subprocess' - Move _KNOWN_DELIVERY_PLATFORMS to module-level (after imports) - Add 'homeassistant' to allowlist (existing platform missing from original PR) - Remove trailing whitespace	2026-04-05 12:31:27 -07:00
Maymun	71a4582bf8	fix(security): hoist platform allowlist to module scope as frozenset	2026-04-05 12:31:27 -07:00
Maymun	1ebc932417	fix(security): validate cron deliver platform name to prevent env var enumeration	2026-04-05 12:31:27 -07:00
Xowiek	ef3bd3b276	security(approval): fix privilege escalation in gateway once-approval logic	2026-04-05 12:31:27 -07:00
MichaelWDanko	c6793d6fc3	fix(gateway): wrap cron helpers with staticmethod to prevent self-binding Plain functions imported as class attributes in APIServerAdapter get auto-bound as methods via Python's descriptor protocol. Every self._cron_() call injected self as the first positional argument, causing TypeError on all 8 cron API endpoints at runtime. Wrap each import with staticmethod() so self._cron_() calls dispatch correctly without modifying any call sites. Co-authored-by: teknium <teknium@nousresearch.com>	2026-04-05 12:31:10 -07:00
Mibayy	cc2b56b26a	feat(api): structured run events via /v1/runs SSE endpoint Add POST /v1/runs to start async agent runs and GET /v1/runs/{run_id}/events for SSE streaming of typed lifecycle events (tool.started, tool.completed, message.delta, reasoning.available, run.completed, run.failed). Changes the internal tool_progress_callback signature from positional (tool_name, preview, args) to event-type-first (event_type, tool_name, preview, args, **kwargs). Existing consumers filter on event_type and remain backward-compatible. Adds concurrency limit (_MAX_CONCURRENT_RUNS=10) and orphaned run sweep. Fixes logic inversion in cli.py _on_tool_progress where the original PR would have displayed internal tools instead of non-internal ones. Co-authored-by: Mibayy <mibayy@users.noreply.github.com>	2026-04-05 12:05:13 -07:00
Mibayy	e167ad8f61	feat(delegate): add acp_command/acp_args override to delegate_task Allow delegate_task to specify custom ACP transport per-task, so a parent running via CLI/Discord/Telegram can spawn child agents over ACP (e.g. claude --acp --stdio). Follows the existing override_provider pattern. Supports per-task granularity in batch mode. Co-authored-by: Mibayy <mibayy@users.noreply.github.com>	2026-04-05 12:05:13 -07:00
NexVeridian	c71b1d197f	fix(acp): advertise slash commands via ACP protocol Send AvailableCommandsUpdate on session create/load/resume/fork so ACP clients (Zed, etc.) can discover /help, /model, /tools, /compact, etc. Also rewrites /compact to use agent._compress_context() properly with token estimation and session DB isolation. Co-authored-by: NexVeridian <NexVeridian@users.noreply.github.com>	2026-04-05 12:05:13 -07:00
Git-on-my-level	fcdd5447e2	fix: keep ACP stdout protocol-clean Route AIAgent print output to stderr via _print_fn for ACP stdio sessions. Gate quiet-mode spinner startup on _should_start_quiet_spinner() so JSON-RPC on stdout isn't corrupted. Child agents inherit the redirect. Co-authored-by: Git-on-my-level <Git-on-my-level@users.noreply.github.com>	2026-04-05 12:05:13 -07:00
Teknium	914a7db448	fix(acp): rename AuthMethod to AuthMethodAgent for agent-client-protocol 0.9.0 Straight rename to match the 0.9.0 API where AuthMethod was split into AuthMethodAgent, AuthMethodEnvVar, AuthMethodTerminal. Bump pin to >=0.9.0,<1.0. Co-authored-by: Mibayy <mibayy@users.noreply.github.com>	2026-04-05 12:05:13 -07:00
Teknium	6ee90a7cf6	fix: hermes auth remove now clears env-seeded credentials permanently (#5285 ) Removing an env-seeded credential (e.g. from OPENROUTER_API_KEY) via 'hermes auth' previously had no lasting effect -- the entry was deleted from auth.json but load_pool() re-created it on the next call because the env var was still set. Now auth_remove_command detects env-sourced entries (source starts with 'env:') and calls the new remove_env_value() to strip the var from both .env and os.environ, preventing re-seeding. Changes: - hermes_cli/config.py: add remove_env_value() -- atomically removes a line from .env and pops from os.environ - hermes_cli/auth_commands.py: auth_remove_command clears env var when removing an env-seeded pool entry - 8 new tests covering remove_env_value and the full zombie-credential lifecycle (remove -> reload -> stays gone)	2026-04-05 12:00:53 -07:00

1 2 3 4 5 ...

3310 Commits