hermes-agent

Author	SHA1	Message	Date
teknium1	8f3d7dfcc0	fix: defer discord adapter annotations Prevent gateway.platforms.discord from crashing at import time when discord.py is unavailable. Python 3.11 eagerly evaluates annotations, so using discord.Interaction and similar annotations caused an AttributeError after the optional import fallback set discord=None. Add postponed annotation evaluation and a regression test covering import without discord installed.	2026-03-14 09:32:05 -07:00
Teknium	95c0bee7f8	Merge pull request #1299 from NousResearch/hermes/hermes-f5fb1d3b fix: salvage PR #327 voice mode onto current main	2026-03-14 06:45:20 -07:00
Himess	fa72f4ff55	fix: email adapter IMAP UID tracking and SMTP TLS verification - Use imap.uid() for search and fetch instead of imap.search/fetch. Sequence numbers shift when messages are deleted, causing the adapter to skip new messages or reprocess old ones. UIDs are stable. - Pass ssl.create_default_context() to starttls() so the server certificate is actually verified. Without it smtplib uses ssl._create_stdlib_context() which skips verification.	2026-03-14 06:29:00 -07:00
teknium1	523a1b6faf	merge: salvage PR #327 voice mode branch Merge contributor branch feature/voice-mode onto current main for follow-up fixes.	2026-03-14 06:03:07 -07:00
0xbyt4	eb34c0b09a	fix: voice pipeline hardening — 7 bug fixes with tests 1. Anthropic + ElevenLabs TTS silence: forward full response to TTS callback for non-streaming providers (choices first, then native content blocks fallback). 2. Subprocess timeout kill: play_audio_file now kills the process on TimeoutExpired instead of leaving zombie processes. 3. Discord disconnect cleanup: leave all voice channels before closing the client to prevent leaked state. 4. Audio stream leak: close InputStream if stream.start() fails. 5. Race condition: read/write _on_silence_stop under lock in audio callback thread. 6. _vprint force=True: show API error, retry, and truncation messages even during streaming TTS. 7. _refresh_level lock: read _voice_recording under _voice_lock.	2026-03-14 14:27:21 +03:00
0xbyt4	cc0a453476	fix: address PR review round 5 — streaming guard, VC auth, history prefix, auto-TTS control 1. Gate _streaming_api_call to chat_completions mode only — Anthropic and Codex fall back to _interruptible_api_call. Preserve Anthropic base_url across all client rebuild paths (interrupt, fallback, 401 refresh). 2. Discord VC synthetic events now use chat_type="channel" instead of defaulting to "dm" — prevents session bleed into DM context. Authorization runs before echoing transcript. Sanitize @everyone/@here in voice transcripts. 3. CLI voice prefix ("[Voice input...]") is now API-call-local only — stripped from returned history so it never persists to session DB or resumed sessions. 4. /voice off now disables base adapter auto-TTS via _auto_tts_disabled_chats set — voice input no longer triggers TTS when voice mode is off.	2026-03-14 14:27:21 +03:00
0xbyt4	35748a2fb0	fix: address PR review round 4 — remove web UI, fix audio/import/interface issues Remove web UI gateway (web.py, tests, docs, toolset, env vars, Platform.WEB enum) per maintainer request — Nous is building their own official chat UI. Fix 1: Replace sd.wait() with polling pattern in play_audio_file() to prevent indefinite hang when audio device stalls (consistent with play_beep()). Fix 2: Use importlib.util.find_spec() for faster_whisper/openai availability checks instead of module-level imports that trigger heavy native library loading (CUDA/cuDNN) at import time. Fix 3: Remove inspect.signature() hack in _send_voice_reply() — add **kwargs to Telegram send_voice() so all adapters accept metadata uniformly. Fix 4: Make session loading resilient to removed platform enum values — skip entries with unknown platforms instead of crashing the entire gateway.	2026-03-14 14:27:21 +03:00
0xbyt4	1ad5e0ed15	feat: add voice channel awareness — inject participant and speaking state into agent context	2026-03-14 14:27:21 +03:00
0xbyt4	e3126aeb40	fix: STT consistency — web.py model param, error matching, local provider key - web.py: pass stt_model from config like discord.py and run.py do - run.py: match new error messages (No STT provider / not set) - _transcribe_local: add missing "provider": "local" to return dict	2026-03-14 14:27:21 +03:00
0xbyt4	eb052b1b42	fix: add explicit metadata param to Discord send_voice signature	2026-03-14 14:27:21 +03:00
0xbyt4	c433c89d7d	fix: demote RTP debug logs to DEBUG and isolate web sessions - Change RTP packet logging from INFO to DEBUG level to reduce noise (SPEAKING events remain at INFO as they are important lifecycle events) - Use per-session chat_id (web_{session_id}) instead of shared "web" to isolate conversation context between simultaneous web users	2026-03-14 14:27:21 +03:00
0xbyt4	5b47b87c42	fix: show only reachable URLs in Web UI startup message When bound to 127.0.0.1, only show localhost URL instead of listing unreachable network interfaces. Add hint about WEB_UI_HOST=0.0.0.0 for phone/tablet access. Add VPN/multi-interface and token exposure tests (11 new tests).	2026-03-14 14:27:21 +03:00
0xbyt4	a21f518c0b	fix: hide configured token value in Web UI startup log Only print the access token when auto-generated (user needs it to log in). When set via WEB_UI_TOKEN env var, just confirm it is set without exposing the value in console output.	2026-03-14 14:27:21 +03:00
0xbyt4	44abe852fb	fix: add macOS Homebrew Opus fallback and fix shutdown dict iteration - Add Homebrew library path fallback when ctypes.util.find_library fails on macOS (Apple Silicon + Intel paths, guarded by platform check) - Fix RuntimeError in gateway stop() by iterating over dict copy - Update Opus tests to verify find_library-first + conditional fallback	2026-03-14 14:27:21 +03:00
0xbyt4	0ff1b4ade2	fix: harden web gateway security and fix error swallowing - Use hmac.compare_digest for timing-safe token comparison (3 endpoints) - Default bind to 127.0.0.1 instead of 0.0.0.0 - Sanitize upload filenames with Path.name to prevent path traversal - Add DOMPurify to sanitize marked.parse() output against XSS - Replace add_static with authenticated media handler - Hide token in group chats for /remote-control command - Use ctypes.util.find_library for Opus instead of hardcoded paths - Add force=True to 5 interrupt _vprint calls for visibility - Log Opus decode errors and voice restart failures instead of swallowing	2026-03-14 14:27:21 +03:00
0xbyt4	2c84979d77	refactor: extract get_stt_model_from_config helper to eliminate DRY violation Duplicated YAML config parsing for stt.model existed in gateway/run.py and gateway/platforms/discord.py. Moved to a single helper in transcription_tools.py and added 5 tests covering all edge cases.	2026-03-14 14:27:21 +03:00
0xbyt4	238a431545	fix: make STT config env-overridable and fix doc issues Code fixes: - STT model, Groq base URL, and OpenAI STT base URL are now configurable via env vars (STT_GROQ_MODEL, STT_OPENAI_MODEL, GROQ_BASE_URL, STT_OPENAI_BASE_URL) instead of hardcoded - Gateway and Discord VC now read stt.model from config.yaml (previously only CLI did this — gateway always used defaults) Doc fixes: - voice-mode.md: move Web UI troubleshooting to web.md (was duplicated) - voice-mode.md: simplify "How It Works" for end users (remove NaCl, DAVE, RTP internals) - voice-mode.md: clarify STT priority (OpenAI used first if both keys set, Groq recommended for free tier) - voice-mode.md: document new STT env overrides in config reference - web.md: remove duplicate Quick Start / Step 1-3 sections - web.md: add mobile HTTPS mic workarounds (moved from voice-mode.md) - web.md: clarify STT fallback order	2026-03-14 14:27:20 +03:00
0xbyt4	9722bd8be0	fix: 8 voice pipeline bugs with tests proving each fix 1. VoiceReceiver.stop() now acquires _lock before clearing shared state to prevent race with _on_packet on the socket reader thread 2. _packet_debug_count moved from class-level to instance-level to avoid cross-instance race condition in multi-guild setups 3. play_in_voice_channel uses asyncio.get_running_loop() instead of deprecated asyncio.get_event_loop() 4. _send_voice_reply uses uuid for filenames instead of time-based names that can collide when two replies happen in the same second 5. Voice timeout now notifies runner via _on_voice_disconnect callback so runner cleans up _voice_mode state (prevents orphaned TTS replies) 6. play_in_voice_channel adds PLAYBACK_TIMEOUT (120s) to prevent infinite blocking when FFmpeg callback is never called 7. _send_voice_reply moves temp file cleanup to finally block so files are always cleaned up even when send_voice/play raises 8. Base adapter auto-TTS wraps play_tts in try/finally with os.remove to clean up generated audio files after playback 18 new tests (120 total voice tests)	2026-03-14 14:27:20 +03:00
0xbyt4	c925d2ee76	fix: voice pipeline thread safety and error handling bugs - Add lock protection around VoiceReceiver buffer writes in _on_packet to prevent race condition with check_silence on different threads - Wire _voice_input_callback BEFORE join_voice_channel to avoid losing voice input during the join window - Add try/except around leave_voice_channel to ensure state cleanup (voice_mode, callback) even if leave raises an exception - Guard against empty text after markdown stripping in base.py auto-TTS - Add 11 tests proving each bug and verifying the fix	2026-03-14 14:27:20 +03:00
0xbyt4	86ddaaee9c	fix: extract voice reply logic and add comprehensive tests - Fix tempfile.mktemp() TOCTOU race in Discord voice input (use NamedTemporaryFile) - Extract voice reply decision from _handle_message into _should_send_voice_reply() - Rewrite TestAutoVoiceReply to call real method instead of testing a copy - Add 59 new tests: VoiceReceiver, VC commands, adapter methods, streaming TTS	2026-03-14 14:27:20 +03:00
0xbyt4	3431f73c96	fix: show mic button on mobile Web UI with HTTPS warning Mobile browsers require HTTPS for navigator.mediaDevices API. Instead of hiding the mic button (confusing UX), show it as dimmed and display an informative message when tapped explaining the HTTPS requirement.	2026-03-14 14:27:20 +03:00
0xbyt4	62e75cd158	fix: skip duplicate TTS file attachment when bot is in Discord voice channel Override play_tts in DiscordAdapter to no-op when connected to a voice channel for the same guild. The gateway runner already plays TTS audio in the VC via play_in_voice_channel, so the base adapter's fallback to send_voice (file attachment) was causing double audio output.	2026-03-14 14:27:20 +03:00
0xbyt4	1b10c3711d	fix: accept kwargs in send_voice for Discord and Slack adapters play_tts base class forwards metadata via kwargs to send_voice, but Discord and Slack adapters did not accept extra keyword arguments, causing TypeError and silent message handling failure. Also fix test_web_defaults to patch correct env var (WEB_UI_TOKEN).	2026-03-14 14:27:20 +03:00
0xbyt4	c477f660da	feat: add continuous voice mode with VAD silence detection - Voice mode: press mic once to enter, press again to exit - VAD (Voice Activity Detection) auto-stops recording after 1.5s silence - Continuous loop: speak → transcribe → agent responds → TTS plays → auto-listen - Voice mode UI: input bar hides, large mic button centered - Auto-restart listening when TTS playback finishes - Fallback: restart listening on text response if no TTS arrives	2026-03-14 14:27:20 +03:00
0xbyt4	d3e09df01a	feat: add voice conversation support and futuristic UI redesign - Auto-TTS: voice messages get spoken response (audio first, then text) - STT: Groq Whisper fallback when VOICE_TOOLS_OPENAI_KEY not set - Futuristic UI: glassmorphism, centered container, purple theme, glow effects - Voice bubble: custom waveform player with seek and progress - Invisible TTS playback via play_tts() method (no audio file in chat) - Add hermes-web toolset with full tool access - Register Platform.WEB in toolset/config maps - Update docs for voice conversation feature	2026-03-14 14:27:20 +03:00
0xbyt4	536be3e0f6	fix: show correct LAN IP when VPN is active Detect all network interfaces instead of relying on UDP trick which returns VPN IP. Prefers 192.168.x.x/10.x.x.x over VPN ranges. Shows all available IPs in console output.	2026-03-14 14:27:20 +03:00
0xbyt4	a3905ef289	feat: add web gateway — browser-based chat UI over WebSocket New platform adapter that serves a full-featured chat interface via HTTP. Enables access from any device on the network (phone, tablet, desktop). Features: - aiohttp server with WebSocket real-time messaging - Token-based authentication - Markdown rendering (marked.js) + code highlighting (highlight.js) - Voice recording via MediaRecorder API + STT transcription - Image, voice, and document display - Typing indicator + message editing (streaming support) - Mobile responsive dark theme - Auto-reconnect on disconnect - Media file cleanup (24h TTL) Config: WEB_UI_ENABLED=true, WEB_UI_PORT=8765, WEB_UI_TOKEN=<token> No new dependencies — uses aiohttp already in [messaging] extra.	2026-03-14 14:27:20 +03:00
0xbyt4	c0c358d051	feat: add Discord voice channel listening — STT transcription and agent response pipeline Phase 2 of voice channel support: bot listens to users speaking in VC, transcribes speech via Groq Whisper, and processes through the agent pipeline. - Add VoiceReceiver class for RTP packet capture, NaCl/DAVE decryption, Opus decode - Add silence detection and per-user PCM buffering - Wire voice input callback from adapter to GatewayRunner - Fix adapter dict key: use Platform.DISCORD enum instead of string - Fix guild_id extraction for synthetic voice events via SimpleNamespace raw_message - Pause/resume receiver during TTS playback to prevent echo	2026-03-14 14:27:20 +03:00
0xbyt4	cc974904f8	feat: Discord voice channel support — bot joins VC and speaks replies - /voice channel: bot joins user's voice channel, speaks TTS replies - /voice leave: disconnect from voice channel - Auto-disconnect after 5 min inactivity - _get_guild_id() helper extracts guild from raw_message - Load opus codec for voice playback - discord.py[voice] in pyproject.toml (pulls PyNaCl + davey)	2026-03-14 14:27:20 +03:00
0xbyt4	cbe4c23efa	fix: Discord voice bubble + edge-tts mp3/ogg format mismatch - Send Discord voice messages with flags=8192 and waveform metadata so they render as native voice bubbles instead of file attachments - Use .mp3 output path for TTS so edge-tts opus conversion works correctly (edge always outputs mp3, convert was skipped for .ogg) - Use actual file_path from TTS result after potential opus conversion	2026-03-14 14:27:20 +03:00
0xbyt4	f6cf4ca826	feat: add /voice slash command to Discord + fix cross-platform send_voice - Register /voice as Discord slash command with mode choices - Fix _send_voice_reply to handle adapters that don't accept metadata parameter (Discord) by inspecting the method signature at runtime	2026-03-14 14:27:20 +03:00
0xbyt4	d80da5ddd8	feat: add /voice command for auto voice reply in Telegram gateway - /voice on: reply with voice when user sends voice messages - /voice tts: reply with voice to all messages - /voice off: disable, text-only replies - /voice status: show current mode - Per-chat state persisted to gateway_voice_mode.json - Dedup: skips auto-reply if agent already called text_to_speech tool - drop_pending_updates=True to ignore stale Telegram messages on restart - 25 tests covering command handler, reply logic, and edge cases	2026-03-14 14:27:20 +03:00
teknium1	5c9a84219d	fix: complete send_message MEDIA delivery salvage - prevent raw MEDIA tag leakage outside the gateway pipeline - make extract_media handle quoted/backticked paths and optional whitespace - send Telegram media natively with explicit error/warning handling - add regression tests for Telegram media dispatch and MEDIA parsing	2026-03-14 04:02:03 -07:00
quabug	50d6659392	fix: handle MEDIA tags in send_message tool for native file delivery The send_message tool's _send_telegram() sent MEDIA:<path> tags as literal text instead of delivering actual files. This fixes it by extracting MEDIA tags via BasePlatformAdapter.extract_media() and routing files to the appropriate Telegram Bot API method by extension. Changes: - send_message_tool: extract MEDIA tags and send files natively as photo/video/voice/audio/document based on file extension - send_message_tool: add per-file error handling and missing-file logging - send_message_tool: use cleaned text in fallback to avoid leaking tags - base.py extract_media: handle optional space after MEDIA: colon - base.py extract_media: strip surrounding backticks/quotes from paths Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-14 04:02:03 -07:00
kshitij	0bb7ed1d95	refactor: salvage adapter and CLI cleanup from PR #939 Salvaged from PR #939 by kshitij. - deduplicate Discord slash command dispatch and local file send helpers - deduplicate Slack file uploads while preserving thread metadata - extract shared CLI session relative-time formatting - hoist browser PATH cleanup constants and throttle screenshot pruning - tidy small type and import cleanups	2026-03-14 03:07:11 -07:00
Verne	52ba940c9b	feat(gateway): add reasoning hot reload Add a /reasoning command across gateway adapters so users can inspect or change reasoning effort without editing config by hand. Reload reasoning settings from config.yaml before each agent run, including background tasks, so the next message picks up the new value consistently.	2026-03-14 02:42:47 -07:00
0xIbra	437ec17125	fix(cli): respect HERMES_HOME in all remaining hardcoded ~/.hermes paths Several files resolved paths via Path.home() / ".hermes" or os.path.expanduser("~/.hermes/..."), bypassing the HERMES_HOME environment variable. This broke isolation when running multiple Hermes instances with distinct HERMES_HOME directories. Replace all hardcoded paths with calls to get_hermes_home() from hermes_cli.config, consistent with the rest of the codebase. Files fixed: - tools/process_registry.py (processes.json) - gateway/pairing.py (pairing/) - gateway/sticker_cache.py (sticker_cache.json) - gateway/channel_directory.py (channel_directory.json, sessions.json) - gateway/config.py (gateway.json, config.yaml, sessions_dir) - gateway/mirror.py (sessions/) - gateway/hooks.py (hooks/) - gateway/platforms/base.py (image_cache/, audio_cache/, document_cache/) - gateway/platforms/whatsapp.py (whatsapp/session) - gateway/delivery.py (cron/output) - agent/auxiliary_client.py (auth.json) - agent/prompt_builder.py (SOUL.md) - cli.py (config.yaml, images/, pastes/, history) - run_agent.py (logs/) - tools/environments/base.py (sandboxes/) - tools/environments/modal.py (modal_snapshots.json) - tools/environments/singularity.py (singularity_snapshots.json) - tools/tts_tool.py (audio_cache) - hermes_cli/status.py (cron/jobs.json, sessions.json) - hermes_cli/gateway.py (logs/, whatsapp session) - hermes_cli/main.py (whatsapp/session) Tests updated to use HERMES_HOME env var instead of patching Path.home(). Closes #892 (cherry picked from commit 78ac1bba43b8b74a934c6172f2c29bb4d03164b9)	2026-03-13 21:32:53 -07:00
teknium1	153ccbfd61	fix: strip user: prefix from Discord allowed user IDs in onboarding Users sometimes paste Discord IDs with prefixes like 'user:123456', '<@123456>', or '<@!123456>' from Discord's UI or third-party tools. This caused auth failures since the allowlist contained 'user:123' but the actual user_id from messages was just '123'. Fixes: - Added _clean_discord_id() helper in discord.py to strip common prefixes - Applied sanitization at runtime when parsing DISCORD_ALLOWED_USERS env var - Applied sanitization in hermes setup and hermes gateway setup input flows - Handles user:, <@>, and <@!> prefix formats	2026-03-13 09:35:46 -07:00
Teknium	b8b45bfb77	feat(discord): add /thread command, auto_thread config, and media metadata fix (#1178) - Add /thread slash command that creates a Discord thread and starts a new Hermes session in it. The starter message (if provided) becomes the first user input in the new session. - Add discord.auto_thread config option (DISCORD_AUTO_THREAD env var): when enabled, every message in a text channel automatically creates a thread, allowing parallel isolated sessions. - Fix Discord media method signatures to accept metadata kwarg (send_voice, send_image_file, send_image) — prevents TypeError when the base adapter passes platform metadata. - Fix test mock isolation: add app_commands and ForumChannel to discord mocks so tests pass in full-suite runs. Based on PRs #866 and #1109 by insecurejezza, modified per review: removed /channel command (unsafe), added auto_thread feature, made /thread dispatch new sessions. Co-authored-by: insecurejezza <insecurejezza@users.noreply.github.com>	2026-03-13 08:52:54 -07:00
Teknium	61531396a0	fix: Home Assistant event filtering now closed by default (#1169) Previously, when no watch_domains or watch_entities were configured, ALL state_changed events passed through to the agent, causing users to be flooded with notifications for every HA entity change. Now events are dropped by default unless the user explicitly configures: - watch_domains: list of domains to monitor (e.g. climate, light) - watch_entities: list of specific entity IDs to monitor - watch_all: true (new option — opt-in to receive all events) A warning is logged at connect time if no filters are configured, guiding users to set up their HA platform config. All 49 gateway HA tests + 52 HA tool tests pass.	2026-03-13 07:40:38 -07:00
teknium1	06a5cc484c	fix: improve gateway secret capture guidance message The old message referenced 'hermes setup' which doesn't handle skill-specific env vars. Updated to direct users to load the skill in the local CLI (which triggers the secure prompt) or add the key to ~/.hermes/.env manually.	2026-03-13 04:10:22 -07:00
kshitijk4poor	ccfbf42844	feat: secure skill env setup on load (core #688) When a skill declares required_environment_variables in its YAML frontmatter, missing env vars trigger a secure TUI prompt (identical to the sudo password widget) when the skill is loaded. Secrets flow directly to ~/.hermes/.env, never entering LLM context. Key changes: - New required_environment_variables frontmatter field for skills - Secure TUI widget (masked input, 120s timeout) - Gateway safety: messaging platforms show local setup guidance - Legacy prerequisites.env_vars normalized into new format - Remote backend handling: conservative setup_needed=True - Env var name validation, file permissions hardened to 0o600 - Redact patterns extended for secret-related JSON fields - 12 existing skills updated with prerequisites declarations - ~48 new tests covering skip, timeout, gateway, remote backends - Dynamic panel widget sizing (fixes hardcoded width from original PR) Cherry-picked from PR #723 by kshitijk4poor, rebased onto current main with conflict resolution. Fixes #688 Co-authored-by: kshitijk4poor <kshitijk4poor@users.noreply.github.com>	2026-03-13 03:14:04 -07:00
0xbyt4	064c66df8c	fix: slack file upload fallback loses thread context Fallback paths in send_image_file, send_video, and send_document called super() without metadata, causing replies to appear outside the thread when file upload fails. Use self.send() with metadata instead to preserve thread_ts context.	2026-03-13 04:26:27 +03:00
teknium1	319e6615c3	fix: Slack MAX_MESSAGE_LENGTH + typing indicator via assistant.threads.setStatus - Increase MAX_MESSAGE_LENGTH from 3,900 to 39,000 (Slack API allows 40k) - Implement real typing indicator using assistant.threads.setStatus API - Shows 'BotName is thinking...' next to the bot name in threads - Auto-clears when the bot sends a reply - Requires assistant:write or chat:write scope - Falls back silently if scope unavailable (reactions still work) - 4 new tests for typing indicator	2026-03-12 17:46:53 -07:00
teknium1	978e1356c0	feat: Slack adapter improvements — formatting, reactions, user resolution, commands 1. Markdown → mrkdwn conversion (format_message override): - **bold** → *bold*, *italic* → _italic_ - ## Headers → *Headers* (bold) - [link](url) → <url|link> - ~~strike~~ → ~strike~ - Code blocks and inline code preserved unchanged - Placeholder-based approach (same pattern as Telegram) 2. Message length splitting: - send() now calls format_message() + truncate_message() - Long responses split at natural boundaries (newlines, spaces) - Code blocks properly closed/reopened across chunks - Chunk indicators (1/N) appended for multi-part messages 3. Reaction-based acknowledgment: - 👀 (eyes) reaction added on message receipt - Replaced with ✅ (white_check_mark) when response is complete - Graceful error handling (missing scopes, already-reacted) - Serves as visual feedback since Slack has no bot typing API 4. User identity resolution: - Resolves Slack user IDs to display names via users.info API - LRU-style in-memory cache (one API call per user) - Fallback chain: display_name → real_name → user_id - user_name now included in MessageEvent source 5. Expanded slash commands (/hermes <subcommand>): - Added: compact, compress, resume, background, usage, insights, title, reasoning, provider, rollback - Arguments preserved (e.g. /hermes resume my session) 6. reply_broadcast config option: - When gateway.slack.reply_broadcast is true, first response in a thread also appears in the main channel - Disabled by default — thread = session stays clean 30 new tests covering all features.	2026-03-12 16:22:39 -07:00
teknium1	987410fff3	fix: Slack thread handling — progress messages, responses, and session isolation Three bugs fixed in the Slack adapter: 1. Tool progress messages leaked to main channel instead of thread. Root cause: metadata key mismatch — gateway uses 'thread_id' but Slack adapter checked for 'thread_ts'. Added _resolve_thread_ts() helper that checks both keys with correct precedence. 2. Bot responses could escape threads for replies. Root cause: reply_to was set to the child message's ts, but Slack API needs the parent message's ts for thread_ts. Now metadata thread_id (always the parent ts) takes priority over reply_to. 3. All Slack DMs shared one session key ('agent:main:slack:dm'), so a long-running task blocked all other DM conversations. Fix: DMs with thread_id now get per-thread session keys. Top-level DMs still share one session for conversation continuity. Additional fix: All Slack media methods (send_image, send_voice, send_video, send_document, send_image_file) now accept metadata parameter for thread routing. Previously they only accepted reply_to, which caused media to silently fail to post in threads. Session key behavior after this change: - Slack channel @mention: creates thread, thread = session - Slack thread reply: stays in thread, same session - Slack DM (top-level): one continuous session - Slack DM (threaded): per-thread session - Other platforms: unchanged	2026-03-12 16:05:45 -07:00
insecurejezza	11825ccefa	feat(gateway): thread-aware free-response routing for Discord - Forum parent channel IDs now match free-response list (add a forum channel ID and all its threads respond without mention) - Better thread chat names: 'Guild / forum / thread' for forum threads - Add discord.require_mention and discord.free_response_channels to config.yaml (bridged to env vars, env vars still override) - Keep require_mention defaulting to true (safe for shared servers) Cherry-picked from PR #867 by insecurejezza with default fix and config.yaml integration. Co-authored-by: insecurejezza <insecurejezza@users.noreply.github.com>	2026-03-11 09:15:31 -07:00
0xbyt4	bdcf247efe	feat: add email gateway platform (IMAP/SMTP) Allow users to interact with Hermes by sending and receiving emails. Uses IMAP polling for incoming messages and SMTP for replies with proper threading (In-Reply-To, References headers). Integrates with all 14 gateway extension points: config, adapter factory, authorization, send_message tool, cron delivery, toolsets, prompt hints, channel directory, setup wizard, status display, and env example. 65 tests covering config, parsing, dispatch, threading, IMAP fetch, SMTP send, attachments, and all integration points.	2026-03-11 06:32:01 -07:00
aydnOktay	9149c34a26	refactor(slack): replace print statements with structured logging Replaces all ad-hoc print() calls in the Slack gateway adapter with proper logging.getLogger(__name__) calls, matching the pattern already used by every other platform adapter (telegram, discord, whatsapp, signal, homeassistant). Changes: - Add import logging + module-level logger - Use logger.error for failures, logger.warning for non-critical fallbacks, logger.info for status, logger.debug for routine ops - Add exc_info=True for full stack traces on all error/warning paths - Use %s format strings (lazy evaluation) instead of f-strings - Wrap disconnect() in try/except for safety - Add structured context (file paths, channel IDs, URLs) to log messages - Convert document handling prints added after the original PR Cherry-picked from PR #778 by aydnOktay, rebased onto current main with conflict resolution and extended to cover document/video methods added since the PR was created. Co-authored-by: aydnOktay <xaydinoktay@gmail.com>	2026-03-11 05:34:43 -07:00
teknium1	69090d6da1	fix: add kwargs to base/telegram media send methods for metadata routing The MEDIA routing in _process_message_background passes metadata=_thread_metadata to send_video, send_document, and send_image_file — but none accepted it, causing TypeError silently caught by the except handler. Files just failed to send. Fix: add kwargs to all four base class media methods and their Telegram overrides.	2026-03-11 03:24:39 -07:00
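
Commit fa72f4ff55 above describes two email adapter changes: searching by IMAP UID instead of sequence number, and passing a verifying TLS context to starttls(). The following is a minimal sketch of both ideas, not the adapter's actual code. The function names new_message_uids and verified_tls_context are hypothetical; only imaplib's uid() method and ssl.create_default_context() are real standard-library calls.

```python
import ssl


def new_message_uids(imap, last_seen_uid: int) -> list[int]:
    """Return UIDs above last_seen_uid (hypothetical helper, for illustration).

    `imap` is expected to behave like imaplib.IMAP4: uid(command, *args)
    issues the UID variant of a command and returns (status, data).
    """
    status, data = imap.uid("SEARCH", None, f"UID {last_seen_uid + 1}:*")
    if status != "OK" or not data or not data[0]:
        return []
    uids = [int(token) for token in data[0].split()]
    # A range of the form "N:*" always matches the highest-UID message, even
    # when that UID is below N, so filter explicitly.
    return sorted(uid for uid in uids if uid > last_seen_uid)


def verified_tls_context() -> ssl.SSLContext:
    """Context intended for smtplib.SMTP.starttls(context=...).

    create_default_context() enables certificate and hostname verification.
    """
    return ssl.create_default_context()
```

Because UIDs do not shift when messages are deleted, persisting the largest UID returned here should be enough to resume polling without skipping or reprocessing mail, which is the behaviour the commit message describes.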

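Commit 0ff1b4ade2 in the table above mentions timing-safe token comparison with hmac.compare_digest and upload filename sanitization with Path.name. A rough sketch of those two techniques follows. The helper names token_matches and safe_upload_name are assumptions made for illustration, and the extra rejection of empty or dot-only names is an addition that the commit message does not mention.

```python
import hmac
from pathlib import Path


def token_matches(supplied: str, expected: str) -> bool:
    """Compare tokens in constant time (hypothetical helper).

    Encoding to bytes lets compare_digest handle non-ASCII input.
    """
    return hmac.compare_digest(supplied.encode("utf-8"), expected.encode("utf-8"))


def safe_upload_name(filename: str) -> str:
    """Keep only the final path component of a client-supplied filename."""
    name = Path(filename).name
    # Assumption: reject names that would still refer to a directory.
    if name in ("", ".", ".."):
        raise ValueError("unusable upload filename")
    return name
```

For example, safe_upload_name("../../etc/passwd") yields "passwd", so a write into the upload directory cannot escape it through parent-directory segments.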

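Commit 153ccbfd61 in the table above describes stripping 'user:', '<@...>' and '<@!...>' wrappers from Discord IDs before they reach the allowlist. The sketch below shows one way such cleaning could look. It is not the repository's _clean_discord_id(). The names clean_discord_id and parse_allowed_users are hypothetical, and the comma-separated format of the env var is an assumption.

```python
import re

# Matches Discord mention syntax: <@123> or <@!123>.
_MENTION_RE = re.compile(r"^<@!?(\d+)>$")


def clean_discord_id(raw: str) -> str:
    """Strip common prefixes and wrappers from a pasted Discord user ID."""
    value = raw.strip()
    match = _MENTION_RE.match(value)
    if match:
        return match.group(1)
    if value.lower().startswith("user:"):
        return value[len("user:"):].strip()
    return value


def parse_allowed_users(env_value: str) -> set[str]:
    """Parse an allowlist string (assumed comma-separated) into clean IDs."""
    return {clean_discord_id(part) for part in env_value.split(",") if part.strip()}
```

With this, parse_allowed_users("user:1, <@2>,<@!3>, 4") produces the set of plain IDs "1", "2", "3" and "4", which can then be compared directly against the user_id carried by incoming messages.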
128 Commits