hermes-agent

Author	SHA1	Message	Date
jackx707	15561ec425	feat: add WebResearchEnv RL environment for multi-step web research	2026-03-05 14:34:36 +00:00
teknium1	ada3713e77	feat: add documentation website (Docusaurus) - 25 documentation pages covering Getting Started, User Guide, Developer Guide, and Reference - Docusaurus with custom amber/gold theme matching the landing page branding - GitHub Actions workflow to deploy landing page + docs to GitHub Pages - Landing page at root, docs at /docs/ on hermes-agent.nousresearch.com - Content extracted and restructured from existing repo docs (README, AGENTS.md, CONTRIBUTING.md, docs/) - Auto-deploy on push to main when website/ or landingpage/ changes	2026-03-05 05:24:55 -08:00
teknium1	1708dcd2b2	feat: implement edit_message() for Telegram/Discord/Slack and fix fallback regression Building on PR #288's edit_message() abstraction: - Telegram: edit_message_text() with MarkdownV2 + plain text fallback - Discord: channel.fetch_message() + msg.edit() with length capping - Slack: chat_update() via slack_bolt client Also fixes the fallback regression in send_progress_messages() where platforms that don't support editing would receive duplicated accumulated tool lines. Now uses a can_edit flag — after the first failed edit, falls back to sending individual lines (matching pre-PR behavior).	2026-03-05 03:47:51 -08:00
teknium1	5702eba93b	Merge PR #288 : feat(whatsapp): stream tool progress as a single live-updating message Authored by satelerd. Adds edit_message() to BasePlatformAdapter and implements it for WhatsApp via Baileys native editing. Progress messages accumulate into a single live-updating message instead of N separate ones. Cherry-picked from stale branch.	2026-03-05 03:44:13 -08:00
Daniel Sateler	a1767fd69c	feat(whatsapp): consolidate tool progress into single editable message Instead of sending a separate WhatsApp message for each tool call during agent execution (N+1 messages), the first tool sends a new message and subsequent tools edit it to append their line. Result: 1 growing progress message + 1 final response = 2 messages instead of N+1. Changes: - bridge.js: Add POST /edit endpoint using Baileys message editing - base.py: Add optional edit_message() to BasePlatformAdapter (no-op default, so platforms without editing support work unchanged) - whatsapp.py: Implement edit_message() calling bridge /edit - run.py: Rewrite send_progress_messages() to accumulate tool lines and edit the progress message. Falls back to sending a new message if edit fails (graceful degradation). Before (5 tools = 6 messages): ⚕ Hermes Agent ─── 🔍 web_search... "query" ⚕ Hermes Agent ─── 📄 web_extract... "url" ⚕ Hermes Agent ─── 💻 terminal... "pip install" ⚕ Hermes Agent ─── ✍️ write_file... "app.py" ⚕ Hermes Agent ─── 💻 terminal... "python app.py" ⚕ Hermes Agent ─── Done! The server is running... After (5 tools = 2 messages): ⚕ Hermes Agent ─── 🔍 web_search... "query" 📄 web_extract... "url" 💻 terminal... "pip install" ✍️ write_file... "app.py" 💻 terminal... "python app.py" ⚕ Hermes Agent ─── Done! The server is running... Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-05 03:44:08 -08:00
teknium1	b4b426c69d	test: add coverage for tee, process substitution, and full-path rm patterns Tests for the three new dangerous command patterns added in PR #280: - TestProcessSubstitutionPattern: 7 tests (bash/sh/zsh/ksh + safe commands) - TestTeePattern: 7 tests (sensitive paths + safe destinations) - TestFindExecFullPathRm: 4 tests (/bin/rm, /usr/bin/rm, bare rm, safe find)	2026-03-05 01:58:33 -08:00
teknium1	2465674fda	Merge PR #280 : fix: add missing dangerous command patterns (tee, process substitution, full-path rm) Authored by dogiladeveloper. Adds detection for tee writes to sensitive files, process substitution with curl/wget, and find -exec with full-path rm.	2026-03-05 01:56:44 -08:00
teknium1	2eca0d4af1	Merge PR #275 : fix(batch_runner): preserve traceback when batch worker fails Authored by batuhankocyigit. Adds explicit traceback logging for batch worker failures and improves tool dispatch error logging in registry.	2026-03-05 01:44:05 -08:00
teknium1	11a7c6b112	fix: update mock agent signature to accept task_id after PR #419 The _Codex401ThenSuccessAgent mock overrides run_conversation() but was missing the task_id parameter, causing a TypeError in the gateway test.	2026-03-05 01:41:50 -08:00
teknium1	50ea8adf46	Merge PR #419 : fix: pass stable task_id in CLI and gateway to preserve sandbox state across turns Authored by rovle. Passes session_id as task_id to run_conversation() in both CLI and gateway, so container backends (Docker/Modal/Singularity) reuse the same sandbox across turns. Also passes task_id through to _create_environment() in file_tools.py. Cherry-picked from original PR branch (which had unrelated divergent commits from the contributor's fork).	2026-03-05 01:40:13 -08:00
rovle	ca33372595	fix: pass task_id to _create_environment as well, to prevent cross-session state mixing Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 01:40:04 -08:00
rovle	7d47e3b776	fix: pass stable task_id in CLI and gateway to preserve sandbox state across turns Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 01:40:04 -08:00
teknium1	fe15a2c65c	Merge PR #274 : fix(setup): handle TerminalMenu init failures with safe fallback Authored by jdblackstar. Catches runtime exceptions from TerminalMenu init (e.g. CalledProcessError from tput with unknown TERM like xterm-ghostty over SSH) and falls through to the text-based menu.	2026-03-05 01:26:58 -08:00
teknium1	d400fb8b23	feat: add /update slash command for gateway platforms Adds a /update command to Telegram, Discord, and other gateway platforms that runs `hermes update` to pull the latest code, update dependencies, sync skills, and restart the gateway. Implementation: - Spawns `hermes update` in a separate systemd scope (systemd-run --user --scope) so the process survives the gateway restart that hermes update triggers at the end. Falls back to nohup if systemd-run is unavailable. - Writes a marker file (.update_pending.json) with the originating platform and chat_id before spawning the update. - On gateway startup, _send_update_notification() checks for the marker, reads the captured update output, sends the results back to the user, and cleans up. Also: - Registers /update as a Discord slash command - Updates README.md, docs/messaging.md, docs/slash-commands.md - Adds 18 tests covering handler, notification, and edge cases	2026-03-05 01:20:58 -08:00
teknium1	2af2f148ab	refactor: rewrite duckduckgo-search skill for accuracy and usability Follow-up to PR #267 merge: - Fix CLI syntax: -k is keywords, -m is max results (was reversed) - Add clear trigger condition: use only when web_search tool unavailable - Remove misleading curl fallback (DuckDuckGo Instant Answer API is not a web search endpoint) - Fix package name: ddgs (renamed from duckduckgo-search) - Add workflow section for search → web_extract pipeline - Add pitfalls and limitations sections - Fix author attribution to actual contributor - Rewrite shell script as simple ddgs wrapper with availability check	2026-03-04 22:11:09 -08:00
teknium1	d19109742e	Merge PR #267 : feat(skills): add DuckDuckGo search skill as Firecrawl fallback Authored by gamedevCloudy. Adds a free web search skill for users without FIRECRAWL_API_KEY, using the ddgs library or curl.	2026-03-04 22:09:07 -08:00
teknium1	078e2e4b19	fix(cli): Ctrl+C clears input buffer before exiting Previously, pressing Ctrl+C while text was typed in the input prompt would immediately exit Hermes. Now follows standard shell behavior: - Text in buffer → Ctrl+C clears the line (like bash) - Empty buffer → Ctrl+C exits This means accidentally hitting Ctrl+C while composing a message just clears the input instead of killing the session. A second Ctrl+C on the empty prompt still exits as expected.	2026-03-04 22:01:13 -08:00
teknium1	9aa2999388	Merge PR #393 : fix(whatsapp): initialize data variable and close log handle on error paths Authored by FarukEst. Fixes #392. 1. Initialize data={} before health-check loop to prevent NameError when resp.json() raises after http_ready is set to True. 2. Extract _close_bridge_log() helper and call on all return False paths to prevent file descriptor leaks on failed connection attempts. Refactors disconnect() to reuse the same helper.	2026-03-04 21:49:53 -08:00
teknium1	d0d9897e81	refactor: clean up transcription_tools after PR #262 merge - Fix incorrect error message (only VOICE_TOOLS_OPENAI_KEY is checked, not OPENAI_API_KEY) - Remove redundant FileNotFoundError catch (exists() check above already handles this) - Consolidate openai imports to single line - Sort SUPPORTED_FORMATS in error message for deterministic output	2026-03-04 21:35:04 -08:00
teknium1	9306a1e06a	Merge PR #262 : improve error handling and validation in transcription_tools Authored by aydnOktay. Adds file format and size validation before API calls, specific exception handling, and improved logging.	2026-03-04 21:33:03 -08:00
teknium1	141b12bd39	refactor: clean up type hints and docstrings in session_search_tool Follow-up to PR #261 merge: - Fix Optional[Any] → Union[int, float, str, None] (actually meaningful) - Fix _resolve_to_parent return type to str (never returns None in practice) - Trim verbose docstrings on internal helpers to single-line style - Correct docstring that claimed 'unknown' on failure (returns str(ts))	2026-03-04 21:25:54 -08:00
teknium1	ae3deff8d4	Merge PR #261 : improve error handling and type hints in session_search_tool Authored by aydnOktay. Adds TimeoutError handling for session summarization, better exception specificity in _format_timestamp, defensive try/except in _resolve_to_parent, and type hints.	2026-03-04 21:23:56 -08:00
teknium1	41adca4e77	fix: strip internal fields from API messages in _handle_max_iterations The flush_memories() and run_conversation() code paths already stripped finish_reason and reasoning from API messages (added in `7a0b377` via PR #253), but _handle_max_iterations() was missed. It was sending raw messages.copy() which could include finish_reason, causing 422 errors on strict APIs like Mistral when the agent hit max iterations. Now strips the same internal fields consistently across all three API call sites.	2026-03-04 21:08:20 -08:00
teknium1	8e901b31c1	Merge PR #214 : fix: align _apply_delete comment with actual behavior Authored by VolodymyrBg.	2026-03-04 20:47:47 -08:00
teknium1	11a5a64729	feat: add emojicombos.com as primary ASCII art search source emojicombos.com has a huge curated collection of ASCII art, dot art, kaomoji, and emoji combos searchable via web_extract with a simple URL pattern: https://emojicombos.com/{term}-ascii-art No API key needed. Returns modern/meme art, pop culture references, and kaomoji alongside classic ASCII art. Added as Source A (recommended first) before asciiart.eu (Source B, classic archive). Also added GitHub Octocat API as a fun easter egg and kaomoji search to the decision flow.	2026-03-04 20:23:36 -08:00
teknium1	0dba3027c1	feat: expand ascii-art skill with cowsay, boxes, toilet, image-to-ascii Adds 5 additional tools from the awesome-ascii-art ecosystem: - cowsay: 50+ characters with speech/thought bubbles - boxes: 70+ decorative border designs, composable with pyfiglet - toilet: colored text art with rainbow/metal/border filters - ascii-image-converter: modern image-to-ASCII (PNG/JPEG/GIF/WEBP) - jp2a: lightweight JPEG-to-ASCII fallback Also adds fun extras (Star Wars telnet), resource links, and an expanded decision flow covering all 7 modes. Ref: github.com/moul/awesome-ascii-art	2026-03-04 20:16:38 -08:00
teknium1	405c7e08be	feat: enhance ascii-art skill with pyfiglet and asciiart.eu search Adds two primary modes on top of the original LLM-generation approach: - Mode 1: pyfiglet (571 fonts, pip install, no API key) for text banners - Mode 2: asciiart.eu search (11,000+ pieces) via web_extract for pre-made art - Mode 3: LLM-generated art using Unicode palette (original PR, now fallback) Includes decision flow, font recommendations, and category reference.	2026-03-04 20:01:08 -08:00
teknium1	cb36930f1d	Merge PR #209 : add ascii-art skill for creative text banners and art Authored by 0xbyt4. Initial skill with Unicode character palette and style guide for LLM-generated ASCII art.	2026-03-04 19:59:13 -08:00
teknium1	90e6fa2612	Merge PR #204 : fix Telegram italic regex newline bug Authored by 0xbyt4. The italic regex [^]+ matched across newlines, corrupting bullet lists using markers (e.g. '* Item one\n* Item two' became italic garbage). Fixed by adding \n to the negated character class: [^*\n]+.	2026-03-04 19:52:03 -08:00
teknium1	fd22ae5fcb	Merge PR #203 : add unit tests for trajectory_compressor Authored by 0xbyt4. 25 tests covering CompressionConfig, TrajectoryMetrics, AggregateMetrics, protected indices, content extraction, and token counting.	2026-03-04 19:48:19 -08:00
teknium1	e1baab90f7	Merge PR #201 : fix skills hub dedup to prefer higher trust levels Authored by 0xbyt4. The dedup logic in GitHubSource.search() and unified_search() used 'r.trust_level == "trusted"' which let trusted results overwrite builtin ones. Now uses ranked comparison: builtin (2) > trusted (1) > community (0).	2026-03-04 19:40:41 -08:00
teknium1	4fcfa329ba	Merge PR #200 : fix extract_images and truncate_message bugs in platform base Authored by 0xbyt4. Two fixes: - extract_images(): only remove extracted image tags, not all markdown image tags. Previously ![doc](report.pdf) was silently dropped when real images were also present. - truncate_message(): walk chunk_body not full_chunk when tracking code block state, so the reopened fence prefix doesn't toggle in_code off and leave continuation chunks with unclosed code blocks.	2026-03-04 19:37:58 -08:00
teknium1	b336980229	Merge PR #193 : add unit tests for 5 security/logic-critical modules (batch 4) Authored by 0xbyt4. 144 new tests covering gateway/pairing.py, tools/skill_manager_tool.py, tools/skills_tool.py, honcho_integration/session.py, and agent/auxiliary_client.py.	2026-03-04 19:35:01 -08:00
teknium1	7128f95621	Merge PR #390 : fix hidden directory filter broken on Windows Authored by Farukest. Fixes #389. Replaces hardcoded forward-slash string checks ('/.git/', '/.hub/') with Path.parts membership test in _find_all_skills() and scan_skill_commands(). On Windows, str(Path) uses backslashes so the old filter never matched, causing quarantined skills to appear as installed.	2026-03-04 19:22:43 -08:00
teknium1	ffc6d767ec	Merge PR #388 : fix --force bypassing dangerous verdict in should_allow_install Authored by Farukest. Fixes #387. Removes 'and not force' from the dangerous verdict check so --force can never install skills with critical security findings (reverse shells, data exfiltration, etc). The docstring already documented this behavior but the code didn't enforce it.	2026-03-04 19:19:57 -08:00
teknium1	44a2d0c01f	Merge PR #386 : fix symlink boundary check prefix confusion in skills_guard Authored by Farukest. Fixes #385. Replaces startswith() with Path.is_relative_to() in _check_structure() symlink escape check — same fix pattern as skill_view() (PR #352). Prevents symlinks escaping to sibling directories with shared name prefixes.	2026-03-04 19:13:21 -08:00
teknium1	3e2ed18ad0	fix: fallback to main model endpoint when auxiliary summary client fails When the auxiliary client (used for context compression summaries) fails — e.g. due to a stale OpenRouter API key after switching to a local LLM — fall back to the user's active endpoint (OPENAI_BASE_URL) instead of returning a useless static summary string. This handles the common scenario where a user switches providers via 'hermes model' but the old provider's API key remains in .env. The auxiliary client picks up the stale key, fails (402/auth error), and previously compression would produce garbage. Now it gracefully retries with the working endpoint. On successful fallback, the working client is cached for future compressions in the same session so the fallback cost is paid only once. Ref: #348	2026-03-04 17:58:09 -08:00
teknium1	db58cfb13d	Merge PR #269 : Fix nous refresh token rotation failure on key mint failure Fixes a bug where the refresh token was not persisted when the API key mint failed (e.g., 402 insufficient credits, timeout). The rotated refresh token was lost, causing subsequent auth attempts to fail with a stale token. Changes: - Persist auth state immediately after each successful token refresh, before attempting the mint - Use latest in-memory refresh token on mint-retry paths (was using the stale original) - Atomic durable writes for auth.json (temp file + fsync + replace) - Opt-in OAuth trace logging (HERMES_OAUTH_TRACE=1, fingerprint-only) - 3 regression tests covering refresh+402, refresh+timeout, and invalid-token retry behavior Author: Robin Fernandes <rewbs>	2026-03-04 17:52:10 -08:00
teknium1	3220bb8aaa	Merge PR #403 : Fix context overrun crash with local LLM backends Authored by ch3ronsa. Fixes #348. Adds 'context size' (LM Studio) and 'context window' (Ollama) to context-length error detection phrases so local backend 400 errors trigger compression instead of aborting. Also removes 'error code: 400' from the non-retryable error list as defense in depth.	2026-03-04 17:48:44 -08:00
teknium1	ff3a479156	fix: coerce session_id and data to string in process tool handler Some models send session_id as an integer instead of a string, causing type errors downstream. Defensively cast session_id and write/submit data args to str to handle non-compliant model outputs.	2026-03-04 16:37:00 -08:00
teknium1	6f4941616d	fix(gateway): include history_offset in error return path The error return (no final_response) was missing history_offset, falling back to len(history) which has the same session_meta offset bug fixed in PR #395. Now both return paths include the correct filtered history length.	2026-03-04 16:26:53 -08:00
teknium1	bd3025d669	Merge PR #395 : fix(gateway): use filtered history length for transcript message extraction Authored by PercyDikec. Fixes #394. The transcript extraction used len(history) to find new messages, but history includes session_meta entries stripped before reaching the agent. This caused 1 message lost per turn from turn 2 onwards. Fix returns history_offset (filtered length) from _run_agent and uses it for the slice.	2026-03-04 16:25:09 -08:00
teknium1	4c72329412	feat: add backend validation for required binaries in setup wizard Implemented checks to ensure that necessary binaries (Docker, Singularity, SSH) are installed for the selected backend in the setup wizard. If a required binary is missing, the user is prompted to proceed with a fallback to the local backend. This enhances user experience by preventing potential runtime errors due to missing dependencies.	2026-03-04 14:49:23 -08:00
teknium1	8311e8984b	fix: preflight context compression + error handler ordering for model switches Two fixes for the case where a user switches to a model with a smaller context window while having a large existing session: 1. Preflight compression in run_conversation(): Before the main loop, estimate tokens of loaded history + system prompt. If it exceeds the model's compression threshold (85% of context), compress proactively with up to 3 passes. This naturally handles model switches because the gateway creates a fresh AIAgent per message with the current model's context length. 2. Error handler reordering: Context-length errors (400 with 'maximum context length' etc.) are now checked BEFORE the generic 4xx handler. Previously, OpenRouter's 400-status context-length errors were caught as non-retryable client errors and aborted immediately, never reaching the compression+retry logic. Reported by Sonicrida on Discord: 840-message session (2MB+) crashed after switching from a large-context model to minimax via OpenRouter.	2026-03-04 14:42:41 -08:00
teknium1	093acd72dd	fix: catch exceptions from check_fn in is_toolset_available() get_definitions() already wrapped check_fn() calls in try/except, but is_toolset_available() did not. A failing check (network error, missing import, bad config) would propagate uncaught and crash the CLI banner, agent startup, and tools-info display. Now is_toolset_available() catches all exceptions and returns False, matching the existing pattern in get_definitions(). Added 4 tests covering exception handling in is_toolset_available(), check_toolset_requirements(), get_definitions(), and check_tool_availability(). Closes #402	2026-03-04 14:22:30 -08:00
Vicaversa	e9ab711b66	Fix context overrun crash with local LLM backends (fixes #348 ) Local backends (LM Studio, Ollama, llama.cpp) return HTTP 400 with messages like "Context size has been exceeded" when the context window is full. The error phrase list did not include "context size" or "context window", so these errors fell through to the generic 4xx abort handler instead of triggering compression. Changes: - Move context-length check above generic 4xx handler so it runs first (same pattern as the existing 413 check) - Add "context size" and "context window" to the phrase list - Guard 4xx handler with `not is_context_length_error` to prevent context-related 400s from being treated as non-retryable	2026-03-05 01:12:34 +03:00
teknium1	b2a9f6beaa	feat: enable up/down arrow history navigation in CLI The TextArea uses multiline=True, so up/down arrows only moved the cursor within text — history browsing via FileHistory was attached but inaccessible. Two fixes: 1. Add up/down key bindings in normal input mode that call Buffer.auto_up()/auto_down(). These intelligently handle both: cursor movement when editing multi-line text, and history browsing when on the first/last line. 2. Pass append_to_history=True to buffer.reset() in the Enter handler so messages actually get saved to ~/.hermes_history. History persists across sessions via FileHistory. The bindings are filtered out during clarify, approval, and sudo prompts (which have their own up/down handlers).	2026-03-04 13:39:48 -08:00
PercyDikec	d3504f84af	fix(gateway): use filtered history length for transcript message extraction The transcript extraction used len(history) to find new messages, but history includes session_meta entries that are stripped before passing to the agent. This mismatch caused 1 message to be lost from the transcript on every turn after the first, because the slice offset was too high. Use the filtered history length (history_offset) returned by _run_agent instead. Also changed the else branch from returning all agent_messages to returning an empty list, so compressed/shorter agent output does not duplicate the entire history into the transcript.	2026-03-04 21:34:40 +03:00
Farukest	34badeb19c	fix(whatsapp): initialize data variable and close log handle on error paths	2026-03-04 19:11:48 +03:00
Farukest	f93b48226c	fix: use Path.parts for hidden directory filter in skill listing The hidden directory filter used hardcoded forward-slash strings like '/.git/' and '/.hub/' to exclude internal directories. On Windows, Path returns backslash-separated strings, so the filter never matched. This caused quarantined skills in .hub/quarantine/ to appear as installed skills and available slash commands on Windows. Replaced string-based checks with Path.parts membership test which works on both Windows and Unix.	2026-03-04 18:34:16 +03:00

1 2 3 4 5 ...

767 Commits