hermes-agent

Author	SHA1	Message	Date
teknium1	aaf8f2d2d2	feat: expand secret redaction patterns Added 14 new redaction patterns, all with distinctive prefixes that have near-zero false positive risk: Prefix patterns: - AWS Access Key ID (AKIA...) - Stripe keys (sk_live_, sk_test_, rk_live_) - SendGrid (SG....) - HuggingFace (hf_...) - Replicate (r8_...) - npm tokens (npm_...) - PyPI tokens (pypi-...) - DigitalOcean PATs (dop_v1_, doo_v1_) - AgentMail (am_...) Structural patterns: - Private key blocks (-----BEGIN...PRIVATE KEY-----) - Database connection string passwords (postgres://user:PASS@host)	2026-03-09 01:28:27 -07:00
teknium1	12f4800631	docs: add security.redact_secrets as commented config section Moved redact_secrets out of DEFAULT_CONFIG (it's on by default when unset) and into the commented sections at the bottom of config.yaml, alongside fallback_model. Users can see the option and uncomment to disable.	2026-03-09 01:12:49 -07:00
teknium1	57b48a81ca	feat: add config toggle to disable secret redaction New config option: security: redact_secrets: false # default: true When set to false, API keys, tokens, and passwords are shown in full in read_file, search_files, and terminal output. Useful for debugging auth issues where you need to verify the actual key value. Bridged to both CLI and gateway via HERMES_REDACT_SECRETS env var. The check is in redact_sensitive_text() itself, so all call sites (terminal, file tools, log formatter) respect it.	2026-03-09 01:04:33 -07:00
teknium1	7af33accf1	fix: apply secret redaction to file tool outputs Terminal output was already redacted via redact_sensitive_text() but read_file and search_files returned raw content. Now both tools redact secrets before returning results to the LLM. Based on PR #372 by @teyrebaz33 (closes #363) — applied manually due to branch conflicts with the current codebase.	2026-03-09 00:49:46 -07:00
teknium1	3214c05e82	Merge PR #369 : fix(gateway): add missing UTF-8 encoding to file I/O Authored by @ch3ronsa. Fixes UnicodeEncodeError/UnicodeDecodeError on Windows with non-UTF-8 system locales (e.g. Turkish cp1254). Adds encoding='utf-8' to 10 open() calls across gateway/session.py, gateway/channel_directory.py, and gateway/mirror.py.	2026-03-09 00:36:38 -07:00
teknium1	4608a7fe4e	fix: make skills manifest writes atomic Uses temp file + fsync + os.replace() to avoid corruption if the process crashes mid-write. Cleans up temp file on failure, logs errors at debug level. Based on PR #335 by @aydnOktay — adapted for the current v2 manifest format (name:hash).	2026-03-08 23:53:57 -07:00
teknium1	af67ea8800	fix: setup wizard overwrites platform_toolsets saved by tools_command	2026-03-08 23:39:04 -07:00
teknium1	37c3dcf551	fix: setup wizard overwrites platform_toolsets saved by tools_command The wizard and tools_command each loaded their own config dict. When tools_command saved platform_toolsets (with MoA/HA disabled), the wizard's final save_config() overwrote it with its own dict that lacked platform_toolsets entirely — resetting everything to defaults. Fix: pass the wizard's config dict into tools_command so they share the same object. Now platform_toolsets survives the wizard's final save.	2026-03-08 23:39:00 -07:00
teknium1	6a49fbb7da	fix: correct agentmail skill — API key goes in config.yaml env block MCP server subprocess env is filtered through _build_safe_env() which only passes safe baseline vars (PATH, HOME, XDG_*) plus whatever is explicitly in the config's env: block. Env vars from ~/.hermes/.env are NOT inherited by MCP subprocesses. The key must go directly in the config.yaml mcp_servers.agentmail.env section.	2026-03-08 23:34:50 -07:00
teknium1	eb0b01de7b	chore: move agentmail skill to optional-skills, add API key docs AgentMail requires a third-party API key (free tier available, paid plans from $20/mo) — not appropriate for bundled skills that show up in every user's system prompt. Added a Requirements section at the top with clear instructions to add AGENTMAIL_API_KEY to ~/.hermes/.env. Streamlined setup steps to avoid duplicating the key in both .env and config.yaml.	2026-03-08 23:33:05 -07:00
teknium1	5b1528519c	Merge PR #330 : feat: add AgentMail skill for agent-owned email inboxes Authored by teyrebaz33. Closes #329.	2026-03-08 23:32:26 -07:00
teknium1	52f92eb689	fix: first-install tool setup shows all providers + skip options	2026-03-08 23:15:20 -07:00
teknium1	7f9dd60c15	fix: first-install tool setup shows all providers + skip options Three fixes: 1. Web search provider menu now says 'Select Search Provider' and notes that a free DuckDuckGo search skill is included if Firecrawl isn't desired. Supports custom setup_title/setup_note per TOOL_CATEGORIES. 2. All multi-provider menus (web, browser, TTS) now include a 'Skip — keep defaults / configure later' option so users can move on. 3. First-install flow now walks through ALL tools with provider options (browser, TTS, web, image_gen, etc.), not just ones missing API keys. Previously, tools with a free provider (browser/Local, TTS/Edge) were silently skipped — users never got to choose between Local vs Browserbase or Edge vs ElevenLabs.	2026-03-08 23:15:14 -07:00
teknium1	77da3bbc95	fix: use correct role for summary message in context compressor The summary message was always injected as 'user' role, which causes consecutive user messages when the last preserved head message is also 'user'. Some APIs reject this (400 error), and it produces malformed training data. Fix: check the role of the last head message and pick the opposite role for the summary — 'user' after assistant/tool, 'assistant' after user. Based on PR #328 by johnh4098. Closes #328.	2026-03-08 23:09:04 -07:00
teknium1	bb489a3903	fix: add first_install flag to tools setup for reliable API key prompting	2026-03-08 23:06:35 -07:00
teknium1	167eb824cb	fix: add first_install flag to tools setup for reliable API key prompting On fresh installs, the multi-level curses menu flow (platform menu → checklist → loop back → Done) was unreliable — users could end up skipping API key configuration entirely. Now the setup wizard passes first_install=True to tools_command(), which: - Skips the platform selection menu entirely - Goes straight to the tool checklist - Prompts for API keys on ALL selected tools that need them - Linear flow, no loop — impossible to accidentally skip Returning users (hermes tools / hermes setup tools) get the existing platform menu loop as before.	2026-03-08 23:06:31 -07:00
teknium1	efb64aee5a	fix: default MoA, Home Assistant, RL Training to off for new installs	2026-03-08 22:54:15 -07:00
teknium1	3045e29232	fix: default MoA, Home Assistant, and RL Training to off for new installs New users shouldn't have these pre-checked in the tool configurator: - MoA requires OpenRouter API key and is a niche feature - Home Assistant requires HASS_TOKEN and most users don't have one - RL Training requires Tinker + WandB keys They're still available in the checklist to enable, just not pre-selected. Existing users with saved platform_toolsets are unaffected.	2026-03-08 22:54:11 -07:00
teknium1	5d7d76025a	fix: setup wizard default max iterations 60 → 90	2026-03-08 22:51:02 -07:00
teknium1	e6c829384e	fix: setup wizard shows 60 as default max iterations, should be 90 AIAgent.__init__ defaults to max_iterations=90 but setup_agent_settings() fell back to '60' when HERMES_MAX_ITERATIONS wasn't set.	2026-03-08 22:50:58 -07:00
teknium1	5c658a416c	Merge PR #748 : fix: first-time setup skips API key prompts + install.sh echo Link2them00n. \| sudo -S -p '' on WSL	2026-03-08 22:03:12 -07:00
teknium1	a130aa8165	fix: first-time setup skips API key prompts + install.sh sudo on WSL Two issues fixed: 1. (Critical) hermes setup tools / hermes tools: On first-time setup, the tool checklist showed all tools as pre-selected (from the default hermes-cli toolset), but after confirming the selection, NO API key prompts appeared. This is because the code only prompted for 'newly added' tools (added = new_enabled - current_enabled), but since all tools were already in the default set, 'added' was always empty. Fix: Detect first-time configuration (no platform_toolsets entry in config) and check ALL enabled tools for missing API keys, not just newly added ones. Returning users still only get prompted for newly added tools (preserving skip behavior). 2. install.sh: When run via curl\|bash on WSL2/Ubuntu, ripgrep and ffmpeg install was silently skipped with a confusing 'Non-interactive mode' message. The script already uses /dev/tty for the setup wizard, but the system package section didn't. Fix: Try reading from /dev/tty when available (same pattern as the build-tools section and setup wizard). Only truly skip when no terminal is available at all (Docker build, CI).	2026-03-08 21:59:39 -07:00
teknium1	35d57ed752	refactor: unified OAuth/API-key credential resolution for fallback Split fallback provider handling into two clean registries: _FALLBACK_API_KEY_PROVIDERS — env-var-based (openrouter, zai, kimi, minimax) _FALLBACK_OAUTH_PROVIDERS — OAuth-based (openai-codex, nous) New _resolve_fallback_credentials() method handles all three cases (OAuth, API key, custom endpoint) and returns a uniform (key, url, mode) tuple. _try_activate_fallback() is now just validation + client build. Adds Nous Portal as a fallback provider — uses the same OAuth flow as the primary provider (hermes login), returns chat_completions mode. OAuth providers get credential refresh for free: the existing 401 retry handlers (_try_refresh_codex/nous_client_credentials) check self.provider, which is set correctly after fallback activation. 4 new tests (nous activation, nous no-login, codex retained). 27 total fallback tests passing, 2548 full suite.	2026-03-08 21:44:48 -07:00
teknium1	5785bd3272	feat: add openai-codex as fallback provider Codex OAuth uses a different auth flow (OAuth tokens, not env vars) and a different API mode (codex_responses, not chat_completions). The fallback now handles this specially: - Resolves credentials via resolve_codex_runtime_credentials() - Sets api_mode to codex_responses - Fails gracefully if no Codex OAuth session exists Also added to the commented-out config.yaml example. 2 new tests (codex activation + graceful failure).	2026-03-08 21:34:15 -07:00
teknium1	cf9482984e	docs: condense AGENTS.md from 927 to 242 lines AGENTS.md is read by AI agents in their context window. Every line costs tokens. The previous version had grown to 927 lines with user-facing documentation that duplicates website/docs/: Removed (belongs in website/docs/, not agent context): - Full CLI commands table (50 lines) - Full gateway slash commands list (20 lines) - Messaging gateway setup, config examples, security details - DM pairing system details - Event hooks format and examples - Tool progress notification details - Full environment variables reference - Auxiliary model configuration section (60 lines) - Background process management details - Trajectory format details - Batch processing CLI usage - Skills system directory tree and hub details - Dangerous command approval flow details - Platform toolsets listing Kept (essential for agents modifying code): - Project structure (condensed to key files only) - File dependency chain - AIAgent class signature and loop mechanics - How to add tools (3 files, full pattern) - How to add config (config.yaml + .env patterns) - How to add CLI commands - Config loader table (two separate systems) - Prompt caching policy (critical constraint) - All known pitfalls - Test commands	2026-03-08 21:33:10 -07:00
teknium1	67275641f8	fix: unify gateway session hygiene with agent compression config The gateway had a SEPARATE compression system ('session hygiene') with hardcoded thresholds (100k tokens / 200 messages) that were completely disconnected from the model's context length and the user's compression config in config.yaml. This caused premature auto-compression on Telegram/Discord — triggering at ~60k tokens (from the 200-message threshold) or inconsistent token counts. Changes: - Gateway hygiene now reads model name from config.yaml and uses get_model_context_length() to derive the actual context limit - Compression threshold comes from compression.threshold in config.yaml (default 0.85), same as the agent's ContextCompressor - Removed the message-count-based trigger (was redundant and caused false positives in tool-heavy sessions) - Removed the undocumented session_hygiene config section — the standard compression.* config now controls everything - Env var overrides (CONTEXT_COMPRESSION_THRESHOLD, CONTEXT_COMPRESSION_ENABLED) are respected - Warn threshold is now 95% of model context (was hardcoded 200k) - Updated tests to verify model-aware thresholds, scaling across models, and that message count alone no longer triggers compression For claude-opus-4.6 (200k context) at 85% threshold: gateway hygiene now triggers at 170k tokens instead of the old 100k.	2026-03-08 21:30:48 -07:00
teknium1	3ffaac00dd	feat: bell_on_complete — terminal bell when agent finishes Adds a simple config option to play the terminal bell (\a) when the agent finishes a response. Useful for long-running tasks — switch to another window and your terminal will ding when done. Works over SSH since the bell character propagates through the connection. Most terminal emulators can be configured to flash the taskbar, play a sound, or show a visual indicator on bell. Config (default: off): display: bell_on_complete: true Closes #318	2026-03-08 21:30:48 -07:00
Teknium	816a3ef6f1	Merge pull request #745 from NousResearch/hermes/hermes-f8d56335 feat: browser console tool, annotated screenshots, auto-recording, and dogfood QA skill	2026-03-08 21:29:52 -07:00
teknium1	a8bf414f4a	feat: browser console/errors tool, annotated screenshots, auto-recording, and dogfood QA skill New browser capabilities and a built-in skill for agent-driven web QA. ## New tool: browser_console Returns console messages (log/warn/error/info) AND uncaught JavaScript exceptions in a single call. Uses agent-browser's 'console' and 'errors' commands through the existing session plumbing. Supports --clear to reset buffers. Verified working in both local and Browserbase cloud modes. ## Enhanced tool: browser_vision(annotate=True) New boolean parameter on browser_vision. When true, agent-browser overlays numbered [N] labels on interactive elements — each [N] maps to ref @eN. Annotation data (element name, role, bounding box) returned alongside the vision analysis. Useful for QA reports and spatial reasoning. ## Config: browser.record_sessions Auto-record browser sessions as WebM video files when enabled: - Starts recording on first browser_navigate - Stops and saves on browser_close - Saves to ~/.hermes/browser_recordings/ - Works in both local and cloud modes (verified) - Disabled by default ## Built-in skill: dogfood Systematic exploratory QA testing for web applications. Teaches the agent a 5-phase workflow: 1. Plan — accept URL, create output dirs, set scope 2. Explore — systematic crawl with annotated screenshots 3. Collect Evidence — screenshots, console errors, JS exceptions 4. Categorize — severity (Critical/High/Medium/Low) and category (Functional/Visual/Accessibility/Console/UX/Content) 5. Report — structured markdown with per-issue evidence Includes: - skills/dogfood/SKILL.md — full workflow instructions - skills/dogfood/references/issue-taxonomy.md — severity/category defs - skills/dogfood/templates/dogfood-report-template.md — report template ## Tests 21 new tests covering: - browser_console message/error parsing, clear flag, empty/failed states - browser_console schema registration - browser_vision annotate schema and flag passing - record_sessions config defaults and recording lifecycle - Dogfood skill file existence and content validation Addresses #315.	2026-03-08 21:28:12 -07:00
teknium1	3b312d45c5	fix: show fallback_model as commented-out YAML example in config Remove fallback_model from DEFAULT_CONFIG (empty strings were useless noise). Instead, save_config() appends a commented-out section at the bottom of config.yaml showing the available providers and example usage. When the user actually configures fallback_model, it appears as normal YAML and the comment block is omitted.	2026-03-08 21:25:58 -07:00
teknium1	fcd899f888	docs: add platform integration checklist for new gateway adapters Comprehensive 16-point checklist covering every integration point needed when adding a new messaging platform to the gateway. Built from the Signal integration experience where 7 integration points were initially missed. Covers: adapter, config enum, factory, auth maps, session source, prompt hints, toolsets, cron delivery, send_message tool, cronjob tool schema, channel directory, status display, setup wizard, redaction, documentation, and tests.	2026-03-08 21:20:06 -07:00
Teknium	315f3ea429	Merge pull request #740 from NousResearch/hermes/hermes-3cd7c62d feat: simple fallback model for provider resilience (#737)	2026-03-08 21:16:58 -07:00
teknium1	b7d6eae64c	fix: Signal adapter parity pass — integration gaps, clawdbot features, env var simplification Integration gaps fixed (7 files missing Signal): - cron/scheduler.py: Signal in platform_map (cron delivery was broken) - agent/prompt_builder.py: PLATFORM_HINTS for Signal (agent knows it's on Signal) - toolsets.py: hermes-signal toolset + added to hermes-gateway composite - hermes_cli/status.py: Signal + Slack in platform status display - tools/send_message_tool.py: Signal example in target description - tools/cronjob_tools.py: Signal in delivery option docs + schema - gateway/channel_directory.py: Signal in session-based channel discovery Clawdbot parity features added to signal.py: - Self-message filtering: prevents reply loops by checking sender != account - SyncMessage filtering: ignores sync envelopes (sent transcripts, read receipts) - Edit message support: reads dataMessage from editMessage envelope - Mention rendering: replaces \uFFFC placeholders with @identifier text - Jitter in SSE reconnection backoff (20% randomization, prevents thundering herd) Env var simplification (7 → 4): - Removed SIGNAL_DM_POLICY (DM auth follows standard platform pattern via SIGNAL_ALLOWED_USERS + DM pairing, same as Telegram/Discord) - Removed SIGNAL_GROUP_POLICY (derived from SIGNAL_GROUP_ALLOWED_USERS: not set = disabled, set with IDs = allowlist, set with * = open) - Removed SIGNAL_DEBUG (was setting root logger, removed entirely) - Remaining: SIGNAL_HTTP_URL, SIGNAL_ACCOUNT (required), SIGNAL_ALLOWED_USERS, SIGNAL_GROUP_ALLOWED_USERS (optional) Updated all docs (website, AGENTS.md, signal.md) to match.	2026-03-08 21:00:21 -07:00
teknium1	b3765c28d0	fix: restrict fallback providers to actual hermes providers Remove hallucinated providers (openai, deepseek, together, groq, fireworks, mistral, gemini, nous) from the fallback provider map. These don't exist in hermes-agent's provider system. The real supported providers for fallback are: openrouter (OPENROUTER_API_KEY) zai (ZAI_API_KEY) kimi-coding (KIMI_API_KEY) minimax (MINIMAX_API_KEY) minimax-cn (MINIMAX_CN_API_KEY) For any other OpenAI-compatible endpoint, users can use the base_url + api_key_env overrides in the config. Also adds Kimi User-Agent header for kimi fallback (matching the main provider system).	2026-03-08 20:49:55 -07:00
teknium1	4cfb66bac2	docs: list all supported fallback providers with env var names The config comment now shows the complete list of built-in providers that the fallback system supports, each with the env var it reads for the API key. Also clarifies that custom OpenAI-compatible endpoints work via base_url + api_key_env.	2026-03-08 20:42:54 -07:00
teknium1	0c4cff352a	docs: add Signal messenger documentation across all doc surfaces - website/docs/user-guide/messaging/signal.md: Full setup guide with prerequisites, step-by-step instructions, access policies, features, troubleshooting, security notes, and env var reference - website/docs/user-guide/messaging/index.md: Added Signal to architecture diagram, platform toolset table, security examples, and Next Steps links - website/docs/reference/environment-variables.md: All 7 SIGNAL_* env vars - README.md: Signal in feature table and documentation table - AGENTS.md: Signal in gateway description and env var config section	2026-03-08 20:42:04 -07:00
teknium1	503269b85a	chore: remove stale docs/ directory All documentation migrated to website/docs/ (Docusaurus). The docs/ directory only contained: - README.md: redirect saying 'docs moved to website' (redundant) - send_file_integration_map.md: internal engineering notes, unreferenced by any file in the codebase The landing page at landingpage/ is still actively used by the deploy-site.yml GitHub Actions workflow.	2026-03-08 20:41:47 -07:00
teknium1	161436cfdd	feat: simple fallback model for provider resilience When the primary model/provider fails after retries (rate limit, overload, auth errors, connection failures), Hermes automatically switches to a configured fallback model for the remainder of the session. Config (in ~/.hermes/config.yaml): fallback_model: provider: openrouter model: anthropic/claude-sonnet-4 Supports all major providers: OpenRouter, OpenAI, Nous, DeepSeek, Together, Groq, Fireworks, Mistral, Gemini — plus custom endpoints via base_url and api_key_env overrides. Design principles: - Dead simple: one fallback model, not a chain - One-shot: switches once, doesn't ping-pong back - Zero new dependencies: uses existing OpenAI client - Minimal code: ~100 lines in run_agent.py, ~5 lines in cli.py/gateway - Three trigger points: max retries exhausted, non-retryable client errors, and invalid response exhaustion Does NOT trigger on context overflow or payload-too-large errors (those are handled by the existing compression system). Addresses #737. 25 new tests, 2492 total passing.	2026-03-08 20:22:33 -07:00
teknium1	24f549a692	feat: add Signal messenger gateway platform (#405 ) Complete Signal adapter using signal-cli daemon HTTP API. Based on PR #268 by ibhagwan, rebuilt on current main with bug fixes. Architecture: - SSE streaming for inbound messages with exponential backoff (2s→60s) - JSON-RPC 2.0 for outbound (send, typing, attachments, contacts) - Health monitor detects stale SSE connections (120s threshold) - Phone number redaction in all logs and global redact.py Features: - DM and group message support with separate access policies - DM policies: pairing (default), allowlist, open - Group policies: disabled (default), allowlist, open - Attachment download with magic-byte type detection - Typing indicators (8s refresh interval) - 100MB attachment size limit, 8000 char message limit - E.164 phone + UUID allowlist support Integration: - Platform.SIGNAL enum in gateway/config.py - Signal in _is_user_authorized() allowlist maps (gateway/run.py) - Adapter factory in _create_adapter() (gateway/run.py) - user_id_alt/chat_id_alt fields in SessionSource for UUIDs - send_message tool support via httpx JSON-RPC (not aiohttp) - Interactive setup wizard in 'hermes gateway setup' - Connectivity testing during setup (pings /api/v1/check) - signal-cli detection and install guidance Bug fixes from PR #268: - Timestamp reads from envelope_data (not outer wrapper) - Uses httpx consistently (not aiohttp in send_message tool) - SIGNAL_DEBUG scoped to signal logger (not root) - extract_images regex NOT modified (preserves group numbering) - pairing.py NOT modified (no cross-platform side effects) - No dual authorization (adapter defers to run.py for user auth) - Wildcard uses set membership ('*' in set, not list equality) - .zip default for PK magic bytes (not .docx) No new Python dependencies — uses httpx (already core). External requirement: signal-cli daemon (user-installed). Tests: 30 new tests covering config, init, helpers, session source, phone redaction, authorization, and send_message integration. Co-authored-by: ibhagwan <ibhagwan@users.noreply.github.com>	2026-03-08 20:20:35 -07:00
Teknium	7a8778ac73	Merge pull request #732 from NousResearch/hermes/hermes-2cb83eed docs: comprehensive AGENTS.md audit and corrections	2026-03-08 20:10:32 -07:00
teknium1	4d7d9d9715	fix: add diagnostic logging to browser tool for errors.log All failure paths in _run_browser_command now log at WARNING level, which means they automatically land in ~/.hermes/logs/errors.log (the persistent error log captures WARNING+). What's now logged: - agent-browser CLI not found (warning) - Session creation failure with task ID (warning) - Command entry with socket_dir path and length (debug) - Non-zero return code with stderr (warning) - Non-JSON output from agent-browser (warning — version mismatch/crash) - Command timeout with task ID and socket path (warning) - Unexpected exceptions with full traceback (warning + exc_info) - browser_vision: which model is used and screenshot size (debug) - browser_vision: LLM analysis failure with full traceback (warning) Also fixed: _get_vision_model() was called twice in browser_vision — now called once and reused.	2026-03-08 19:54:41 -07:00
teknium1	a9c35f9175	docs: comprehensive rewrite of all messaging platform setup guides All four platform guides rewritten from thin ~60-line summaries to comprehensive step-by-step setup guides with current (2025-2026) info: telegram.md (74 → 196 lines): - Full BotFather walkthrough with customization commands - Privacy mode section with critical group chat gotcha - Multiple user ID discovery methods - Voice message setup (Whisper STT + TTS bubbles + ffmpeg) - Group chat usage patterns and admin mode - Recent Bot API features (privacy policy requirement, streaming) - Troubleshooting table (6 issues) discord.md (57 → 260 lines): - Complete Developer Portal walkthrough (application, bot, intents) - Detailed Privileged Gateway Intents section with warning about Message Content Intent being #1 failure cause - Invite URL generation via Installation tab (new 2024) and manual - Permission integer calculation (274878286912 recommended) - Developer Mode user ID discovery - Bot behavior documentation (DMs, channels, no-prefix) - Troubleshooting table (6 issues) slack.md (57 → 214 lines): - Warning about classic Slack apps deprecated since March 2025 - Full scope tables (required + optional) with purposes - Socket Mode setup with App-Level Token (xapp-) - Event Subscriptions configuration - User ID discovery via profile - Two-token architecture explained (xoxb- + xapp-) - Troubleshooting table whatsapp.md (77 → 193 lines): - Clarified whatsapp-web.js (not Business API) with ban risk warnings - Linux Chromium dependencies (Debian + Fedora) - Setup wizard QR code scanning workflow - Session persistence with LocalAuth - Second phone number options with cost table - WhatsApp Web protocol update warnings - Troubleshooting table (7 issues) Docusaurus build verified clean.	2026-03-08 19:51:42 -07:00
teknium1	31b84213e4	docs: add Guides & Tutorials section, restructure sidebar New documentation pages (1,823 lines): - getting-started/learning-path.md: 3-tier learning path table (beginner/intermediate/advanced) + use-case-based navigation - guides/tips.md: Tips & Best Practices quick-wins collection covering prompting, CLI power user tips, context files, memory, performance/cost, messaging, and security - guides/daily-briefing-bot.md: End-to-end tutorial building an automated daily news briefing with cron + web search + messaging - guides/team-telegram-assistant.md: Full walkthrough setting up a team Telegram bot with BotFather, gateway, DM pairing, and production deployment - guides/python-library.md: Guide to using AIAgent as a Python library — basic usage, multi-turn conversations, toolset config, trajectories, custom prompts, and integration examples (FastAPI, Discord bot, CI/CD) - reference/faq.md: Centralized FAQ (8 questions) + troubleshooting guide (6 categories, 18 specific issues) with problem/cause/solution format Sidebar restructure: - Added 'Guides & Tutorials' as new top-level section - Reorganized flat Features list (17 items) into 5 subcategories: Core Features, Automation, Web & Media, Integrations, Advanced - Added FAQ to Reference section - Updated index.md quick links table Docusaurus build verified clean.	2026-03-08 19:37:34 -07:00
teknium1	2036c22f88	fix: macOS browser/code-exec socket path exceeds Unix limit (#374 ) macOS sets TMPDIR to /var/folders/xx/.../T/ (~51 chars). Combined with agent-browser session names, socket paths reach 121 chars — exceeding the 104-byte macOS AF_UNIX limit. This causes 'Screenshot file was not created' errors and silent browser_vision failures on macOS. Fix: use /tmp/ on macOS (symlink to /private/tmp, sticky-bit protected). On Linux, tempfile.gettempdir() already returns /tmp — no behavior change. Changes in browser_tool.py: - Add _socket_safe_tmpdir() helper — returns /tmp on macOS, gettempdir() elsewhere - Replace all 3 tempfile.gettempdir() calls for socket dirs - Set mode=0o700 on socket dirs for privacy (was using default umask) - Guard vision/text client init with try/except — a broken auxiliary config no longer prevents the entire browser_tool module from importing (which would disable all 10 browser tools, not just vision) - Improve screenshot error messages with mode info and diagnostic hints - Don't delete screenshots when LLM analysis fails — the capture was valid, only the vision API call failed. Screenshots are still cleaned up by the existing 24-hour _cleanup_old_screenshots mechanism. Changes in code_execution_tool.py: - Same /tmp fix for RPC socket path (was 103 chars on macOS — one char from the 104-byte limit)	2026-03-08 19:31:23 -07:00
teknium1	7185a66b96	feat: enhance Solana skill with USD pricing, token names, smart wallet output Enhancements to the Solana blockchain skill (PR #212 by gizdusum): - CoinGecko price integration (free, no API key) - Wallet shows tokens with USD values, sorted by value - Token info includes price and market cap - Transaction details show USD amounts for balance changes - Whale detector shows USD alongside SOL amounts - Stats includes SOL price and market cap - New `price` command for quick lookups by symbol or mint - Smart wallet output - Tokens sorted by USD value (highest first) - Default limit of 20 tokens (--limit N to adjust) - Dust filtering (< $0.01 tokens hidden, count shown) - --all flag to see everything - --no-prices flag for fast RPC-only mode - NFT summary (count + first 10) - Portfolio total in USD - Token name resolution - 25+ well-known tokens mapped (SOL, USDC, BONK, JUP, etc.) - CoinGecko fallback for unknown tokens - Abbreviated mint addresses for unlabeled tokens - Reliability - Retry with exponential backoff on 429 rate-limit (RPC + CoinGecko) - Graceful degradation when price data unavailable - Capped API calls to respect CoinGecko free-tier limits - Updated SKILL.md with all new capabilities and flags	2026-03-08 19:15:11 -07:00
teknium1	2394e18729	fix: add context to interruption messages for model awareness When the agent is interrupted, the model now receives descriptive context instead of a generic 'Operation interrupted.' string: - Tool skip messages include the tool name: '[Tool execution cancelled — terminal was skipped due to user interrupt]' '[Tool execution skipped — web_search was not started. User sent a new message]' - API call interrupts include timing: 'Operation interrupted: waiting for model response (4.2s elapsed).' - Retry/error interrupts include retry context: 'Operation interrupted: retrying API call after rate limit (retry 2/5).' 'Operation interrupted: handling API error (Timeout: connection timed out).' This helps the model understand what was happening when it was interrupted, reducing wasted iterations spent re-discovering state.	2026-03-08 18:58:23 -07:00
teknium1	99f7582175	chore: move Solana skill to optional-skills/ Solana blockchain queries are a niche use case — not needed by every user. Moved from skills/ (bundled) to optional-skills/ (installable via Skills Hub).	2026-03-08 18:52:02 -07:00
teknium1	93c5997290	Merge PR #212 : feat(skills): add Solana blockchain skill Authored by Deniz Alagoz (gizdusum). Closes #164. Will be moved to optional-skills/ and enhanced post-merge.	2026-03-08 18:51:33 -07:00
teknium1	2d1a1c1c47	refactor: remove redundant 'openai' auxiliary provider, clean up docs The 'openai' provider was redundant — using OPENAI_BASE_URL + OPENAI_API_KEY with provider: 'main' already covers direct OpenAI API. Provider options are now: auto, openrouter, nous, codex, main. - Removed _try_openai(), _OPENAI_AUX_MODEL, _OPENAI_BASE_URL - Replaced openai tests with codex provider tests - Updated all docs to remove 'openai' option and clarify 'main' - 'main' description now explicitly mentions it works with OpenAI API, local models, and any OpenAI-compatible endpoint Tests: 2467 passed.	2026-03-08 18:50:26 -07:00
teknium1	71e81728ac	feat: Codex OAuth vision support + multimodal content adapter The Codex Responses API (chatgpt.com/backend-api/codex) supports vision via gpt-5.3-codex. This was verified with real API calls using image analysis. Changes to _CodexCompletionsAdapter: - Added _convert_content_for_responses() to translate chat.completions multimodal format to Responses API format: - {type: 'text'} → {type: 'input_text'} - {type: 'image_url', image_url: {url: '...'}} → {type: 'input_image', image_url: '...'} - Fixed: removed 'stream' from resp_kwargs (responses.stream() handles it) - Fixed: removed max_output_tokens and temperature (Codex endpoint rejects them) Provider changes: - Added 'codex' as explicit auxiliary provider option - Vision auto-fallback now includes Codex (OpenRouter → Nous → Codex) since gpt-5.3-codex supports multimodal input - Updated docs with Codex OAuth examples Tested with real Codex OAuth token + ~/.hermes/image2.png — confirmed working end-to-end through the full adapter pipeline. Tests: 2459 passed.	2026-03-08 18:44:33 -07:00

1 2 3 4 5 ...

1069 Commits