hermes-agent

Author	SHA1	Message	Date
teknium1	69090d6da1	fix: add **kwargs to base/telegram media send methods for metadata routing The MEDIA routing in _process_message_background passes metadata=_thread_metadata to send_video, send_document, and send_image_file, but none accepted it, causing TypeError silently caught by the except handler. Files just failed to send. Fix: add **kwargs to all four base class media methods and their Telegram overrides.	2026-03-11 03:24:39 -07:00
teknium1	322ffbed61	Merge PR #779: feat: Telegram native file attachment support (send_document + send_video) Adds send_document() and send_video() overrides to TelegramAdapter. Requested by TigerHix.	2026-03-11 03:23:11 -07:00
teknium1	aead9c8ead	chore: remove unnecessary pragma comments from Telegram adapter Strip 18 '# pragma: no cover - defensive logging' annotations — these are real code paths, not worth excluding from coverage.	2026-03-11 00:37:45 -07:00
teknium1	93230af7bd	Merge PR #763: improve Telegram gateway error handling and logging Authored by aydnOktay. Replaces print() statements with structured logging calls (error/warning/info/debug) throughout the Telegram adapter. Adds exc_info=True for stack traces on failures.	2026-03-11 00:37:28 -07:00
teknium1	928bb16da1	fix: forward thread_id to Telegram adapter + update send_typing signatures Part 2 of thread_id forum topic fix: add metadata param to send_voice, send_image, send_animation, send_typing in Telegram adapter and pass message_thread_id to all Bot API calls. Update send_typing signature in Discord, Slack, WhatsApp, HomeAssistant for compatibility. Based on the fix proposed by @Bitstreamono in PR #656.	2026-03-10 06:26:32 -07:00
teknium1	cbca0225f6	Merge PR #599: fix: strip MarkdownV2 italic markers in Telegram plaintext fallback Authored by 0xbyt4.	2026-03-10 04:09:33 -07:00
teknium1	5eaf4a3f32	feat: Telegram send_document and send_video for native file attachments Implement send_document() and send_video() overrides in TelegramAdapter so the agent can deliver files (PDFs, CSVs, docs, etc.) and videos as native Telegram attachments instead of just printing the file path as text. The base adapter already routes MEDIA:<path> tags by extension — audio goes to send_voice(), images to send_image_file(), and everything else falls through to send_document(). But TelegramAdapter didn't override send_document() or send_video(), so those fell back to plain text. Now when the agent includes MEDIA:/path/to/report.pdf in its response, users get a proper downloadable file attachment in Telegram. Features: - send_document: sends files via bot.send_document with display name, caption (truncated to 1024), and reply_to support - send_video: sends videos via bot.send_video with inline playback - Both fall back to base class text if the Telegram API call fails - 10 new tests covering success, custom filename, file-not-found, not-connected, caption truncation, API error fallback, and reply_to Requested by @TigerHixTang on Twitter.	2026-03-09 13:07:10 -07:00
aydnOktay	46a7d6aeb2	Improve Telegram gateway error handling and logging	2026-03-09 15:58:01 +03:00
teknium1	c6b75baad0	feat: find-nearby skill and Telegram location support Adds a 'find-nearby' skill for discovering nearby places using OpenStreetMap (Overpass + Nominatim). No API keys needed. Works with: - Coordinates (from Telegram location pins) - Addresses, cities, zip codes, landmarks (auto-geocoded) - Multiple place types (restaurant, cafe, bar, pharmacy, etc.) Returns names, distances, cuisine, hours, addresses, and Google Maps links (pin + directions). 184-line stdlib-only script. Also adds Telegram location message handling: - New MessageType.LOCATION in gateway base - Telegram adapter handles LOCATION and VENUE messages - Injects lat/lon coordinates into conversation context - Prompts agent to ask what the user wants nearby Inspired by PR #422 (reimplemented with simpler script and broader skill scope — addresses/cities/zips, not just Telegram coordinates).	2026-03-09 05:31:10 -07:00
teknium1	a7f9721785	feat: register remaining commands with platform menus Telegram: add /insights, /update, /reload_mcp (underscore variant since Telegram BotCommand names don't allow hyphens). Discord: add /insights (with days parameter), /reload-mcp. Also add reload_mcp as an alias for reload-mcp in the gateway command dispatcher so Telegram's underscore form works, and add resume/provider to the _known_commands set for hook emission.	2026-03-08 17:13:45 -07:00
teknium1	a5461e07bf	feat: register title, resume, and other missing commands with platform menus Add /title, /resume, /compress, /provider, /usage to Telegram's set_my_commands so they appear in the / autocomplete menu. Add /title, /resume, /compress, /provider, /usage, /help as Discord slash commands so they appear in Discord's native command picker. These commands were functional via text but not registered with the platform-native command menus, so users couldn't discover them.	2026-03-08 17:11:49 -07:00
teknium1	b8c3bc7841	feat: browser screenshot sharing via MEDIA: on all messaging platforms browser_vision now saves screenshots persistently to ~/.hermes/browser_screenshots/ and returns the screenshot_path in its JSON response. The model can include MEDIA:<path> in its response to share screenshots as native photos. Changes: - browser_tool.py: Save screenshots persistently, return screenshot_path, auto-cleanup files older than 24 hours, mkdir moved inside try/except - telegram.py: Add send_image_file() — sends local images via bot.send_photo() - discord.py: Add send_image_file() — sends local images via discord.File - slack.py: Add send_image_file() — sends local images via files_upload_v2() (WhatsApp already had send_image_file — no changes needed) - prompt_builder.py: Updated Telegram hint to list image extensions, added Discord and Slack MEDIA: platform hints - browser.md: Document screenshot sharing and 24h cleanup - send_file_integration_map.md: Updated to reflect send_image_file is now implemented on Telegram/Discord/Slack - test_send_image_file.py: 19 tests covering MEDIA: .png extraction, send_image_file on all platforms, and screenshot cleanup Partially addresses #466 (Phase 0: platform adapter gaps for send_image_file).	2026-03-07 22:57:05 -08:00
teknium1	542faf225f	Fix Telegram image delivery for large (>5MB) images Telegram's send_photo via URL has a ~5MB limit. Upscaled images from fal.ai's Clarity Upscaler often exceed this, causing 'Wrong type of web page content' or 'Failed to get http url content' errors. Fix: Add download-and-upload fallback in Telegram's send_image(). When URL-based send_photo fails, download the image via httpx and re-upload as bytes (supports up to 10MB file uploads). Also: convert print() to logger.warning/error in image sending path for proper log visibility (print goes to socket, invisible in logs).	2026-03-07 21:29:45 -08:00
0xbyt4	5cdcb9e26f	fix: strip MarkdownV2 italic markers in Telegram plaintext fallback When MarkdownV2 parsing fails, _strip_mdv2() removes escape backslashes and bold markers (*text*) but missed italic markers (_text_). Users saw raw underscores around italic text in the plaintext fallback. - Add regex to strip _text_ italic markers in _strip_mdv2() - Use word boundary lookaround to preserve snake_case identifiers - Add tests for _strip_mdv2 covering italic, bold, snake_case, and edge cases	2026-03-07 18:55:25 +03:00
teknium1	55c70f3508	fix: strip MarkdownV2 escapes from Telegram plaintext fallback When Telegram's MarkdownV2 parser rejects a message, the send() fallback was sending the already-escaped text as plain text. This caused users to see raw backslashes before every special character (periods, dashes, parentheses, etc.) — e.g. 'sentence\.' or '\-\-auto\-approve'. Changes: - Add _strip_mdv2() to reverse MarkdownV2 escaping for clean plaintext - Use stripped text in the send() fallback path instead of raw escaped chunk - Add logging when the MDV2 fallback is triggered for diagnostics - Add logger to telegram.py (was missing) The edit_message() fallback already correctly used the original content; this brings send() in line with that behavior.	2026-03-07 01:23:18 -08:00
teknium1	1708dcd2b2	feat: implement edit_message() for Telegram/Discord/Slack and fix fallback regression Building on PR #288's edit_message() abstraction: - Telegram: edit_message_text() with MarkdownV2 + plain text fallback - Discord: channel.fetch_message() + msg.edit() with length capping - Slack: chat_update() via slack_bolt client Also fixes the fallback regression in send_progress_messages() where platforms that don't support editing would receive duplicated accumulated tool lines. Now uses a can_edit flag — after the first failed edit, falls back to sending individual lines (matching pre-PR behavior).	2026-03-05 03:47:51 -08:00
teknium1	90e6fa2612	Merge PR #204: fix Telegram italic regex newline bug Authored by 0xbyt4. The italic regex [^*]+ matched across newlines, corrupting bullet lists using * markers (e.g. '* Item one\n* Item two' became italic garbage). Fixed by adding \n to the negated character class: [^*\n]+.	2026-03-04 19:52:03 -08:00
teknium1	daedec6957	fix: Telegram adapter crash on Windows when library not installed (#304) The ImportError fallback set ContextTypes = Any, but then ContextTypes.DEFAULT_TYPE was used as a type annotation at class definition time; Any doesn't have .DEFAULT_TYPE, causing AttributeError. Fix: create a _MockContextTypes class with DEFAULT_TYPE = Any. Also stub CommandHandler, TelegramMessageHandler, filters, ParseMode, and ChatType to prevent potential NameErrors. Fixes #304.	2026-03-02 22:03:36 -08:00
teknium1	7b23dbfe68	feat(animation): add support for sending animated GIFs in BasePlatformAdapter and TelegramAdapter	2026-02-28 11:25:44 -08:00
0xbyt4	b759602483	fix: prevent italic regex from spanning newlines in Telegram formatter The italic regex \*([^*]+)\* used [^*] which matches newlines, causing bullet lists with * markers to be incorrectly converted to italic text. Changed to [^*\n]+ to prevent cross-line matching. Adds 43 tests for _escape_mdv2 and format_message covering code blocks, bold/italic, headers, links, mixed formatting, and the regression case.	2026-02-28 22:01:48 +03:00
teknium1	19f28a633a	fix(agent): enhance 413 error handling and improve conversation history management in tests	2026-02-27 23:04:32 -08:00
tekelala	fbb1923fad	fix(security): patch path traversal, size bypass, and prompt injection in document processing - Sanitize filenames in cache_document_from_bytes to prevent path traversal (strip directory components, null bytes, resolve check) - Reject documents with None file_size instead of silently allowing download - Cap text file injection at 100 KB to prevent oversized prompt payloads - Sanitize display_name in run.py context notes to block prompt injection via filenames - Add 35 unit tests covering document cache utilities and Telegram document handling Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-27 11:53:46 -05:00
tekelala	b2172c4b2e	feat(telegram): add document file processing for PDF, text, and Office files Download, cache, and enrich document files sent via Telegram. Supports .pdf, .md, .txt, .docx, .xlsx, .pptx with size validation, unsupported type rejection, text content injection for .md/.txt, and hourly cache cleanup. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-27 11:44:57 -05:00
teknium1	674a6f96d3	feat: unify set-home command naming across platforms - Updated the command name from `/set-home` to `/sethome` in the GatewayRunner class for consistency. - Added a new slash command `/sethome` in the Discord adapter to set the home channel. - Registered the `/sethome` command in the Telegram adapter to align with the updated naming convention.	2026-02-23 15:01:22 -08:00
teknium1	ededaaa874	Hermes Agent UX Improvements	2026-02-22 02:16:11 -08:00
teknium1	ecb430effe	refactor: enhance API interaction and message handling in AIAgent - Introduced new methods in run_agent.py for building API keyword arguments and normalizing assistant messages from API responses. - Added functionality for compressing conversation context and managing session state in SQLite. - Improved tool call execution handling, including enhanced logging and error management. - Updated path handling in multiple platform files to utilize pathlib for better compatibility and readability.	2026-02-21 04:17:27 -08:00
teknium1	3191a9ba11	feat: add new conversation command and enhance command handling - Introduced the `/new` command to start a new conversation, resetting the history. - Updated command handling in the CLI and various platform adapters (Discord, Slack, Telegram) to support the new command. - Added help command functionality to list available commands, improving user guidance. - Enhanced command mapping for better integration across platforms, ensuring consistent command behavior.	2026-02-19 14:31:53 -08:00
teknium1	69aa35a51c	Add messaging platform enhancements: STT, stickers, Discord UX, Slack, pairing, hooks Major feature additions inspired by OpenClaw/ClawdBot integration analysis: Voice Message Transcription (STT): - Auto-transcribe voice/audio messages via OpenAI Whisper API - Download voice to ~/.hermes/audio_cache/ on Telegram/Discord/WhatsApp - Inject transcript as text so all models can understand voice input - Configurable model (whisper-1, gpt-4o-mini-transcribe, gpt-4o-transcribe) Telegram Sticker Understanding: - Describe static stickers via vision tool with JSON-backed cache - Cache keyed by file_unique_id avoids redundant API calls - Animated/video stickers get emoji-based fallback description Discord Rich UX: - Native slash commands (/ask, /reset, /status, /stop) via app_commands - Button-based exec approvals (Allow Once / Always Allow / Deny) - ExecApprovalView with user authorization and timeout handling Slack Integration: - Full SlackAdapter using slack-bolt with Socket Mode - DMs, channel messages (mention-gated), /hermes slash command - File attachment handling with bot-token-authenticated downloads DM Pairing System: - Code-based user authorization as alternative to static allowlists - 8-char codes from unambiguous alphabet, 1-hour expiry - Rate limiting, lockout after failed attempts, chmod 0600 on data - CLI: hermes pairing list/approve/revoke/clear-pending Event Hook System: - File-based hook discovery from ~/.hermes/hooks/ - HOOK.yaml + handler.py per hook, sync/async handler support - Events: gateway:startup, session:start/reset, agent:start/step/end - Wildcard matching (command:* catches all command events) Cross-Channel Messaging: - send_message agent tool for delivering to any connected platform - Enables cron job delivery and cross-platform notifications Human-Like Response Pacing: - Configurable delays between message chunks (off/natural/custom) - HERMES_HUMAN_DELAY_MODE env var with min/max ms settings Warm Injection Message Style: - Retrofitted image vision messages with friendly kawaii-consistent tone - All new injection messages (STT, stickers, errors) use warm style Also: updated config migration to prompt for optional keys interactively, bumped config version, updated README, AGENTS.md, .env.example, cli-config.yaml.example, install scripts, pyproject.toml, and toolsets.	2026-02-15 21:38:59 -08:00
teknium1	5404a8fcd8	Enhance image handling and analysis capabilities across platforms - Updated the vision tool to accept both HTTP/HTTPS URLs and local file paths for image analysis. - Implemented caching of user-uploaded images in local directories to ensure reliable access for the vision tool, addressing issues with ephemeral URLs. - Enhanced platform adapters (Discord, Telegram, WhatsApp) to download and cache images, allowing for immediate analysis and enriched message context. - Added a new method to auto-analyze images attached by users, enriching the conversation with detailed descriptions. - Improved documentation for image handling processes and updated related functions for clarity and efficiency.	2026-02-15 16:10:50 -08:00
teknium1	f5be6177b2	Add Text-to-Speech (TTS) functionality with multiple providers Add tool previews Add AGENTS and SOUL.md support Add Exec Approval	2026-02-12 10:05:08 -08:00
teknium1	ada0b4f131	Enhance image handling in platform adapters - Updated the image generation function description to clarify usage with markdown. - Added `send_image` method to `BasePlatformAdapter` for native image sending across platforms. - Implemented `send_image` in `DiscordAdapter` and `TelegramAdapter` to handle image attachments directly. - Introduced `extract_images` method to extract image URLs from markdown and HTML, improving content processing. - Enhanced message handling to support sending images as attachments while maintaining text content.	2026-02-10 21:02:40 -08:00
teknium1	beeb7896e0	Refactor message handling and error logging in agent and gateway - Updated the AIAgent class to extract the first user message for trajectory formatting, improving the accuracy of user queries in the trajectory format. - Enhanced the GatewayRunner to convert transcript history into the agent format, ensuring proper handling of message roles and content. - Adjusted the typing indicator refresh rate to every 2 seconds for better responsiveness. - Improved error handling in the message sending process for the Telegram adapter, implementing a fallback mechanism for Markdown parsing failures, and logging send failures for better debugging.	2026-02-03 15:42:54 -08:00
teknium1	619c72e566	Enhance CLI with multi-platform messaging integration and configuration management - Updated CLI to load configuration from user-specific and project-specific YAML files, prioritizing user settings. - Introduced a new command `/platforms` to display the status of connected messaging platforms (Telegram, Discord, WhatsApp). - Implemented a gateway system for handling messaging interactions, including session management and delivery routing for cron job outputs. - Added support for environment variable configuration and a dedicated gateway configuration file for advanced settings. - Enhanced documentation in README.md and added a new messaging.md file to guide users on platform integrations and setup. - Updated toolsets to include platform-specific capabilities for Telegram, Discord, and WhatsApp, ensuring secure and tailored interactions.	2026-02-02 19:01:51 -08:00
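
The failure mode described in commit 69090d6da1 (a caller passes metadata=... to a media method that does not accept it, and a broad except handler hides the resulting TypeError) can be reproduced in a few lines. This is only a sketch under stated assumptions: OldAdapter, FixedAdapter, route_media, and the thread_id key are hypothetical stand-ins, not the project's actual classes or signatures, which this log does not show.

```python
class OldAdapter:
    """Hypothetical stand-in for a media method before the fix (no **kwargs)."""

    def send_document(self, chat_id: str, path: str) -> str:
        return f"sent {path} to {chat_id}"


class FixedAdapter:
    """Hypothetical stand-in after the fix: extra keyword arguments are accepted."""

    def send_document(self, chat_id: str, path: str, **kwargs) -> str:
        # 'metadata' and its 'thread_id' key are assumed names for illustration.
        thread = (kwargs.get("metadata") or {}).get("thread_id")
        suffix = f" (thread {thread})" if thread is not None else ""
        return f"sent {path} to {chat_id}{suffix}"


def route_media(adapter, chat_id: str, path: str, metadata: dict):
    """Mimics a router that forwards metadata and swallows any exception."""
    try:
        return adapter.send_document(chat_id, path, metadata=metadata)
    except Exception:
        # The TypeError from the unexpected keyword argument ends up here,
        # so the caller only sees that nothing was sent.
        return None


meta = {"thread_id": 7}
old_result = route_media(OldAdapter(), "chat-1", "/tmp/report.pdf", meta)
new_result = route_media(FixedAdapter(), "chat-1", "/tmp/report.pdf", meta)
```

With the old signature the call fails silently (old_result is None); with **kwargs the same call succeeds and the metadata is available to the method.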

33 Commits
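
The italic-regex regression fixed in b759602483 (merged as PR #204) is easy to see in isolation. The two patterns below are taken from the commit message; the to_italic helper and the choice to rewrite *text* as _text_ are assumptions made for illustration, since the formatter's actual code is not part of this log.

```python
import re

# Patterns as quoted in the commit message; everything else is illustrative.
BUGGY_ITALIC = re.compile(r"\*([^*]+)\*")    # [^*] also matches newlines
FIXED_ITALIC = re.compile(r"\*([^*\n]+)\*")  # a match can no longer span lines


def to_italic(text: str, pattern: re.Pattern) -> str:
    """Hypothetical helper: rewrite *text* as _text_ using the given pattern."""
    return pattern.sub(r"_\1_", text)


bullets = "* Item one\n* Item two"
buggy = to_italic(bullets, BUGGY_ITALIC)   # the two bullet markers get paired up
fixed = to_italic(bullets, FIXED_ITALIC)   # bullet list is left untouched
inline = to_italic("an *emphasised* word", FIXED_ITALIC)
```

With the buggy pattern the first bullet marker pairs with the second across the line break, so the list is mangled; with \n excluded from the character class, only single-line spans such as *emphasised* are converted.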