hermes-agent

Author	SHA1	Message	Date
Zainan Victor Zhou	190471fdc0	docs: use HERMES_HOME in google-workspace skill examples - avoid hard-coded ~/.hermes paths in the setup and API shorthands - prefer HERMES_HOME with a sane default to ~/.hermes - keep the examples aligned with profile-aware Hermes installs	2026-04-06 16:50:07 -07:00
Zainan Victor Zhou	83df001d01	fix: allow google-workspace skill scripts to run directly - fall back to adding the repo root to sys.path when hermes_constants is not importable - fixes direct execution of setup.py and google_api.py from the repo checkout - keeps the upstream PR scoped to the google-workspace compatibility fix	2026-04-06 16:50:07 -07:00
Teknium	539629923c	docs(llm-wiki): add Obsidian Headless setup for servers (#5660) Adds obsidian-headless (npm) setup guide to the Obsidian Integration section: Node 22+, ob login, sync-create-remote, sync-setup, systemd service for continuous background sync. Covers the full headless workflow for agents running on servers syncing to Obsidian desktop on other devices.	2026-04-06 16:37:14 -07:00
Teknium	150f70f821	feat(skills): add skill config interface + llm-wiki skill (#5635 ) Skills can now declare config.yaml settings via metadata.hermes.config in their SKILL.md frontmatter. Values are stored under skills.config.* namespace, prompted during hermes config migrate, shown in hermes config show, and injected into the skill context at load time. Also adds the llm-wiki skill (Karpathy's LLM Wiki pattern) as the first skill to use the new config interface, declaring wiki.path. Skill config interface (new): - agent/skill_utils.py: extract_skill_config_vars(), discover_all_skill_config_vars(), resolve_skill_config_values(), SKILL_CONFIG_PREFIX - agent/skill_commands.py: _inject_skill_config() injects resolved values into skill messages as [Skill config: ...] block - hermes_cli/config.py: get_missing_skill_config_vars(), skill config prompting in migrate_config(), Skill Settings in show_config() LLM Wiki skill (skills/research/llm-wiki/SKILL.md): - Three-layer architecture (raw sources, wiki pages, schema) - Three operations (ingest, query, lint) - Session orientation, page thresholds, tag taxonomy, update policy, scaling guidance, log rotation, archiving workflow Docs: creating-skills.md, configuration.md, skills.md, skills-catalog.md Closes #5100	2026-04-06 13:49:13 -07:00
SHL0MS	17e2a27c51	feat(skills): add p5js creative coding skill Production pipeline for interactive and generative visual art using p5.js. Covers 7 modes: generative art, data visualization, interactive experiences, animation/motion graphics, 3D scenes, image processing, and audio-reactive. Includes: - SKILL.md with creative standard, pipeline, and critical implementation notes - 10 reference files covering core API, shapes, visual effects (noise, flow fields, particles, domain warp, attractors, L-systems, circle packing, bloom, reaction-diffusion), animation (easing, springs, state machines, scene transitions), typography, color systems, WebGL/3D/shaders, interaction, and comprehensive export pipeline - Deterministic headless frame capture via Puppeteer (noLoop + redraw) - ffmpeg render pipeline for MP4 video export - Per-clip architecture for multi-scene video production - Interactive viewer template with seed navigation and parameter controls - Performance guidance: FES disable, Math.* hot loops, per-pixel budgets - Addon library coverage: p5.brush, p5.grain, CCapture.js, p5.js-svg - fxhash/Art Blocks generative platform conventions - p5.js 2.0 migration guide (async setup, OKLCH, splineVertex, shader.modify) - 13 documented common mistakes and troubleshooting patterns 17 files, ~5,900 lines.	2026-04-06 14:39:00 -04:00
SHL0MS	a912cd4568	docs(manim-video): add 5 new reference files — design thinking, updaters, paper explainer, decorations, production quality Five new reference files expanding the skill from rendering knowledge into production methodology: animation-design-thinking.md (161 lines): When to animate vs show static, concept decomposition into visual beats, pacing rules, narration sync, equation reveal strategies, architecture diagram patterns, common design mistakes. updaters-and-trackers.md (260 lines): Deep ValueTracker mental model, lambda/time-based/always_redraw updaters, DecimalNumber and Variable live displays, animation-based updaters, 4 complete practical patterns (dot tracing, live area, connected diagram, parameter exploration). paper-explainer.md (255 lines): Full workflow for turning research papers into animations. Audience selection, 5-minute template, pre-code gates (narration, scene list, style contract), equation reveal strategies, architecture diagram building, results animation, domain-specific patterns for ML/physics/ biomedical papers. decorations.md (202 lines): SurroundingRectangle, BackgroundRectangle, Brace, arrows (straight, curved, labeled), DashedLine, Angle/RightAngle, Cross, Underline, color highlighting workflows, annotation lifecycle pattern. production-quality.md (190 lines): Pre-code, pre-render, post-render checklists. Text overlap prevention, spatial layout coordinate budget, max simultaneous elements, animation variety audit, tempo curve, color consistency, data viz minimums. Total skill now: 14 reference files, 2614 lines.	2026-04-06 13:51:36 -04:00
SHL0MS	95a044a2e0	feat(research-paper-writing): fill coverage gaps and integrate patterns from AI-Scientist, GPT-Researcher Fix duplicate step numbers (5.3, 7.3) and missing 7.5. Add coverage for human evaluation, theory/survey/benchmark/position papers, ethics/broader impact, arXiv strategy, code packaging, negative results, workshop papers, multi-author coordination, compute budgeting, and post-acceptance deliverables. Integrate ensemble reviewing with meta-reviewer and negative bias, pre-compilation validation pipeline, experiment journal with tree structure, breadth/depth literature search, context management for large projects, two-pass refinement, VLM visual review, and claim verification. New references: human-evaluation.md, paper-types.md.	2026-04-06 01:12:32 -04:00
SHL0MS	b26e7fd43a	fix(manim-video): recommend monospace fonts — proportional fonts have broken kerning in Pango Manim's Pango text renderer produces broken kerning with proportional fonts (Helvetica, Inter, SF Pro, Arial) at all sizes and resolutions. Characters overlap and spacing is inconsistent. This is a fundamental Pango limitation. Changes: - Recommend Menlo (monospace) as the default font for ALL text - Proportional fonts only acceptable for large titles (>=48, short strings) - Set minimum font_size=18 for readability - Update all code examples to use MONO='Menlo' pattern - Remove Inter/Helvetica/SF Pro from recommendations	2026-04-06 00:35:43 -04:00
SHL0MS	447ec076a4	docs(manim-video): expand references with comprehensive Manim CE and 3b1b patterns Adds 601 lines across 6 reference files, sourced from deep review of: - Manim CE v0.20.1 full reference manual - 3b1b/manim example_scenes.py and source modules - 3b1b/videos production CLAUDE.md and workflow patterns - Manim CE thematic guides (voiceover, text, configuration) animations.md: always_redraw, TracedPath, FadeTransform, TransformFromCopy, ApplyMatrix, squish_rate_func, ShowIncreasingSubsets, ShowPassingFlash, expanded rate functions mobjects.md: SVGMobject, ImageMobject, Variable, BulletedList, DashedLine, Angle/RightAngle, boolean ops, LabeledArrow, t2c/t2f/t2s/t2w per-substring styling, backstroke for readability, apply_complex_function with prepare_for_nonlinear_transform equations.md: substrings_to_isolate, multi-line equations, TransformMatchingTex with matched_keys and key_map, set_color_by_tex graphs-and-data.md: Graph/DiGraph with layout algorithms, ArrowVectorField/StreamLines, ComplexPlane/PolarPlane camera-and-3d.md: ZoomedScene with inset zoom, LinearTransformationScene for 3b1b-style linear algebra rendering.md: manim.cfg project config, self.next_section() chapter markers, manim-voiceover plugin with ElevenLabs/GTTS integration and bookmark-based audio sync	2026-04-06 00:08:17 -04:00
SHL0MS	1572956fdc	Merge pull request #4930 from SHL0MS/feat/manim-video-skill-v2 feat(skills): add manim-video skill for mathematical and technical animations	2026-04-05 16:10:30 -07:00
SHL0MS	9d885b266c	feat(skills): add manim-video skill for mathematical and technical animations Production pipeline for creating 3Blue1Brown-style animated videos using Manim Community Edition. The agent handles the full workflow: creative planning, Python code generation, rendering, scene stitching, audio muxing, and iterative refinement. Modes: concept explainers, equation derivations, algorithm visualizations, data stories, architecture diagrams, paper explainers, 3D visualizations. 9 reference files, setup verification script, README. All API references verified against ManimCommunity/manim source.	2026-04-05 19:09:37 -04:00
Teknium	cb63b5f381	feat(skills): add popular-web-designs skill with 54 website design systems (#5194 ) Curated collection of production-quality design system specifications extracted from real websites (sourced from VoltAgent/awesome-design-md). Each template captures a site's complete visual language: colors, typography, components, layout, shadows, responsive behavior, and agent-ready CSS values. Hermes-specific adaptations in every template: - Google Fonts CDN link tags for proprietary font substitutes - CSS font-family stacks with proper fallbacks - Integration notes for write_file + generative-widgets workflow - browser_vision verification reminders SKILL.md includes categorized catalog, font substitution reference table, HTML generation pattern, and design-to-use-case matching guide. Sites: Airbnb, Airtable, Apple, BMW, Cal.com, Claude, Clay, ClickHouse, Cohere, Coinbase, Composio, Cursor, ElevenLabs, Expo, Figma, Framer, HashiCorp, IBM, Intercom, Kraken, Linear, Lovable, Minimax, Mintlify, Miro, Mistral AI, MongoDB, Notion, NVIDIA, Ollama, OpenCode, Pinterest, PostHog, Raycast, Replicate, Resend, Revolut, RunwayML, Sanity, Sentry, SpaceX, Spotify, Stripe, Supabase, Superhuman, Together AI, Uber, Vercel, VoltAgent, Warp, Webflow, Wise, xAI, Zapier	2026-04-05 00:42:55 -07:00
Teknium	0fd3de2674	docs(skill): claude-code v2.2 - add cheat sheet commands, env vars, rules, advanced features (#5158) Expands the claude-code skill with content from official docs and community cheat sheets that was missing from v2.0: Slash commands: /cost, /btw, /plan, /loop, /batch, /security-review, /resume, /effort (with auto level), /mcp, /release-notes, /voice details Keyboard shortcuts: Alt+P (model), Alt+T (thinking), Alt+O (fast mode), Ctrl+V (paste image), Ctrl+O (transcript), Ctrl+G (external editor) Ultrathink keyword for max reasoning on a specific turn Rules directory: .claude/rules/*.md and ~/.claude/rules/*.md Auto-memory: ~/.claude/projects/<proj>/memory/ (25KB/200 lines limit) Environment variables: CLAUDE_CODE_EFFORT_LEVEL, MAX_THINKING_TOKENS, CLAUDE_CODE_NO_FLICKER, CLAUDE_CODE_SUBPROCESS_ENV_SCRUB MCP limits: 2KB tool desc cap, maxResultSizeChars 500K, transport types Reorganized slash commands into Session/Development/Configuration groups Reorganized keyboard shortcuts into Controls/Toggles/Multiline groups	2026-04-04 19:15:57 -07:00
Teknium	ff544526cd	docs(skill): comprehensive claude-code skill rewrite v2.0 (#5155 ) Major rewrite of the claude-code orchestration skill from 94 to 460 lines. Based on official docs research, community guides, and live experimentation. Key additions: - Two orchestration modes: Print mode (-p) vs Interactive PTY via tmux - Detailed PTY dialog handling (trust + permissions bypass patterns) - Print mode deep dive: JSON output, piped input, session resumption, --json-schema, --bare mode for CI - Complete flag reference (20+ flags organized by category) - Interactive session patterns with tmux send-keys/capture-pane - Claude's slash commands and keyboard shortcuts reference - CLAUDE.md, hooks, custom subagents, MCP, custom commands docs - Cost/performance tips (effort levels, budget caps, context mgmt) - 10 specific pitfalls discovered through live testing - 10 rules for Hermes agents orchestrating Claude Code	2026-04-04 19:00:50 -07:00
Teknium	34308e4de9	docs: improve youtube-content skill structure and workflow Clearer workflow with validation/chunking steps, expanded description with trigger terms for better agent matching, tightened error handling. Fixed stray pipe character in original PR diff. Based on PR #4778 by fernandezbaptiste. Co-authored-by: fernandezbaptiste <fernandezbaptiste@users.noreply.github.com>	2026-04-03 22:18:00 -07:00
kshitijk4poor	5359921199	refactor: simplify scope validation helpers in google workspace scripts Fix double file read bug in google_api.py _missing_scopes(), consolidate redundant _normalize_scope_values into callers, merge duplicate except blocks.	2026-04-03 17:49:18 -07:00
kshitijk4poor	37e2ef6c3f	fix: protect profile-scoped google workspace oauth tokens	2026-04-03 17:49:18 -07:00
Teknium	52ddd6bc64	refactor(skills): consolidate code verification skills into one (#4854 ) * chore: release v0.7.0 (2026.4.3) 168 merged PRs, 223 commits, 46 resolved issues, 40+ contributors. Highlights: pluggable memory providers, credential pools, Camofox browser, inline diff previews, API server session continuity, ACP MCP registration, gateway hardening, secret exfiltration blocking. * refactor(skills): consolidate code-review + verify-code-changes into requesting-code-review Merge the passive code-review checklist and the automated verification pipeline (from PR #4459 by @MorAlekss) into a single requesting-code-review skill. This eliminates model confusion between three overlapping skills. Now includes: - Static security scan (grep on diff lines) - Baseline-aware quality gates (only flag NEW failures) - Multi-language tool detection (Python, Node, Rust, Go) - Independent reviewer subagent with fail-closed JSON verdict - Auto-fix loop with separate fixer agent (max 2 attempts) - Git checkpoint and [verified] commit convention Deletes: skills/software-development/code-review/ (absorbed) Closes: #406 (independent code verification)	2026-04-03 14:13:27 -07:00
SHL0MS	b86647c295	Replace ml-paper-writing with research-paper-writing: full research pipeline skill Replaces the writing-focused ml-paper-writing skill (940 lines) with a complete end-to-end research paper pipeline (1,599 lines SKILL.md + 3,184 lines across 7 reference files). New content: - Full 8-phase pipeline: project setup, literature review, experiment design, execution/monitoring, analysis, paper drafting, review/revision, submission preparation - Iterative refinement strategy guide from autoreason research (when to use autoreason vs critique-and-revise vs single-pass, model selection) - Hermes agent integration: delegate_task parallel drafting, cronjob monitoring, memory/todo state management, skill composition - Professional LaTeX tooling: microtype, siunitx, TikZ diagram patterns, algorithm2e, subcaption, latexdiff, SciencePlots - Human evaluation design: annotation protocols, inter-annotator agreement, crowdsourcing platforms - Title, Figure 1, conclusion, appendix strategy, page budget management - Anonymization checklist, rebuttal writing, camera-ready preparation - AAAI and COLM venue coverage (checklists, reviewer guidelines) Preserved from ml-paper-writing: - All writing philosophy (Nanda, Farquhar, Gopen & Swan, Lipton, Perez) - Citation verification workflow (5-step mandatory process) - All 6 conference templates (NeurIPS, ICML, ICLR, ACL, AAAI, COLM) - Conference requirements, format conversion workflow - Proactivity/collaboration guidance Bug fixes in inherited reference files: - BibLaTeX recommendation now correctly says natbib for conferences - Bare except clauses fixed to except Exception - Jinja2 template tags removed from citation-workflow.md - Stale date caveats added to reviewer-guidelines.md	2026-04-02 16:13:26 -04:00
analista	3400098481	fix: update fetch_transcript.py for youtube-transcript-api v1.x The library removed the static get_transcript() method in v1.0. Migrate to the new instance-based fetch() API and normalize FetchedTranscriptSnippet objects back to dicts for compatibility with the rest of the script.	2026-04-01 10:49:24 -07:00
Teknium	85e96a4638	fix(skills): move unified hermes-agent skill into autonomous-ai-agents category (#4435) The unified skill from PR #4332 was placed at a top-level skills/hermes-agent/ directory, creating a redundant standalone category. Move it to skills/autonomous-ai-agents/hermes-agent/ alongside claude-code, codex, and opencode where it belongs.	2026-04-01 03:39:25 -07:00
Teknium	b118f607b2	feat(skills): unify hermes-agent and hermes-agent-setup into single skill (#4332 ) Merges the hermes-agent-spawning skill (autonomous-ai-agents/) and hermes-agent-setup skill (dogfood/) into a single comprehensive skills/hermes-agent/ skill. The unified skill covers: - What Hermes Agent is and how it compares to Claude Code/Codex/OpenClaw - Complete CLI reference (all subcommands and flags) - Slash command reference - Configuration guide (providers, toolsets, config sections) - Voice/STT/TTS setup - Spawning additional agent instances (one-shot and interactive PTY) - Multi-agent coordination patterns - Troubleshooting guide - Where-to-find-things lookup table with docs links - Concise contributor quick reference Removes: - skills/autonomous-ai-agents/hermes-agent/ (hermes-agent-spawning) - skills/dogfood/hermes-agent-setup/	2026-03-31 14:49:20 -07:00
SHL0MS	ab62614a89	ascii-video: add text readability techniques and external layout oracle pattern - composition.md: add text backdrop (gaussian dark mask behind glyphs) and external layout oracle pattern (browser-based text layout → JSON → Python renderer pipeline for obstacle-aware text reflow) - shaders.md: add reverse vignette shader (center-darkening for text readability) - troubleshooting.md: add diagnostic entries for text-over-busy-background readability and kaleidoscope-destroys-text pitfall	2026-03-30 18:48:22 -04:00
Teknium	5ceed021dc	feat(gateway): skill-aware slash commands, paginated /commands, Telegram 100-cap (#3934) * feat(gateway): skill-aware slash commands, paginated /commands, Telegram 100-cap Map active skills to Telegram's slash command menu so users can discover and invoke skills directly. Three changes: 1. Telegram menu now includes active skill commands alongside built-in commands, capped at 100 entries (Telegram Bot API limit). Overflow commands remain callable but hidden from the picker. Logged at startup when cap is hit. 2. New /commands [page] gateway command for paginated browsing of all commands + skills. /help now shows first 10 skill commands and points to /commands for the full list. 3. When a user types a slash command that matches a disabled or uninstalled skill, they get actionable guidance: - Disabled: 'Enable it with: hermes skills config' - Optional (not installed): 'Install with: hermes skills install official/<path>' Built on ideas from PR #3921 by @kshitijk4poor. * chore: move 21 niche skills to optional-skills Move specialized/niche skills from built-in (skills/) to optional (optional-skills/) to reduce the default skill count. Users can install them with: hermes skills install official/<category>/<name> Moved skills (21): - mlops: accelerate, chroma, faiss, flash-attention, hermes-atropos-environments, huggingface-tokenizers, instructor, lambda-labs, llava, nemo-curator, pinecone, pytorch-lightning, qdrant, saelens, simpo, slime, tensorrt-llm, torchtitan - research: domain-intel, duckduckgo-search - devops: inference-sh cli Built-in skills: 96 → 75 Optional skills: 22 → 43 * fix: only include repo built-in skills in Telegram menu, not user-installed User-installed skills (from hub or manually added) stay accessible via /skills and by typing the command directly, but don't get registered in the Telegram slash command picker. Only skills whose SKILL.md is under the repo's skills/ directory are included in the menu. This keeps the Telegram menu focused on the curated built-in set while user-installed skills remain discoverable through /skills and /commands.	2026-03-30 10:57:30 -07:00
Teknium	17b6000e90	feat(skills): add songwriting-and-ai-music creative skill (salvage #1901 ) (#3834 ) Adds a songwriting craft and AI music prompt engineering skill covering song structure, rhyme/meter, emotional arcs, Suno metatag reference, phonetic tricks for AI singers, parody adaptation, and production workflow. Complements existing music skills (heartmula, audiocraft, songsee) which cover model setup/usage — this one covers the creative process itself. Also removes the empty skills/music-creation/ category (only had a DESCRIPTION.md, no actual skills). Co-authored-by: 123mikeyd <123mikeyd@users.noreply.github.com>	2026-03-29 17:49:19 -07:00
Teknium	253a9adc72	docs(skills): clarify DuckDuckGo runtime requirements (#3680) Co-authored-by: kshitij <82637225+kshitijk4poor@users.noreply.github.com>	2026-03-29 00:17:57 -07:00
Teknium	7a3682ac3f	feat: mount skill credential files + fix env passthrough for remote backends (#3671 ) Two related fixes for remote terminal backends (Modal/Docker): 1. NEW: Credential file mounting system Skills declare required_credential_files in frontmatter. Files are mounted into Docker (read-only bind mounts) and Modal (mounts at creation + sync via exec on each command for mid-session changes). Google Workspace skill updated with the new field. 2. FIX: Docker backend now includes env_passthrough vars Skills that declare required_environment_variables (e.g. Notion with NOTION_API_KEY) register vars in the env_passthrough system. The local backend checked this, but Docker's forward_env was a separate disconnected list. Now Docker exec merges both sources, so skill-declared env vars are forwarded into containers automatically. This fixes the reported issue where NOTION_API_KEY in ~/.hermes/.env wasn't reaching the Docker container despite being registered via the Notion skill's prerequisites. Closes #3665	2026-03-28 23:53:40 -07:00
kshitij	a99c0478d0	fix(skills): move parallel-cli to optional-skills (#3673) parallel-cli is a paid third-party vendor skill that requires PARALLEL_API_KEY, but it was shipped in the default skills/ directory with no env-var gate. This caused it to appear in every user's system prompt even when they have no Parallel account or API key. Move it to optional-skills/ so it is only visible through the Skills Hub and must be explicitly installed. Also remove it from the default skills catalog docs.	2026-03-28 23:45:05 -07:00
Mibay	a6bc13ce13	fix(github-auth): check ~/.hermes/.env before ~/.git-credentials for token extraction (#3466 ) * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials for token extraction Users who configured their token via `hermes setup` have it stored in ~/.hermes/.env (GITHUB_TOKEN=...), not in ~/.git-credentials. On macOS with osxkeychain as the default git credential helper, ~/.git-credentials may not exist at all, causing silent 401 failures in all GitHub skills. Add ~/.hermes/.env as the first fallback in the auth detection block and the inline "Extracting the Token from Git Credentials" example. Priority order: env var → ~/.hermes/.env → ~/.git-credentials → none Part of fix for NousResearch/hermes-agent#3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464 * fix(github-auth): check ~/.hermes/.env before ~/.git-credentials Fixes #3464	2026-03-28 14:46:49 -07:00
Teknium	dabe3c34cc	feat(webhook): hermes webhook CLI + skill for event-driven subscriptions (#3578 ) Adds 'hermes webhook' CLI subcommand and a skill — zero new model tools. CLI commands (require webhook platform to be enabled): hermes webhook subscribe <name> [--events, --prompt, --deliver, ...] hermes webhook list hermes webhook remove <name> hermes webhook test <name> All commands gate on webhook platform being enabled in config. If not configured, prints setup instructions (gateway setup wizard, manual config.yaml, or env vars). The agent uses these via terminal tool, guided by the webhook-subscriptions skill which documents setup, common patterns (GitHub, Stripe, CI/CD, monitoring), prompt template syntax, security, and troubleshooting. Adapter enhancement: webhook.py hot-reloads dynamic subscriptions from ~/.hermes/webhook_subscriptions.json on each incoming request (mtime-gated). Static config.yaml routes always take precedence. Docs: updated webhooks.md with Dynamic Subscriptions section, added hermes webhook to cli-commands.md reference. No new model tools. No toolset changes. 24 new tests for CLI CRUD, persistence, enabled-gate, and adapter dynamic route loading.	2026-03-28 14:33:35 -07:00
Teknium	26bfdc22b4	feat: add godmode jailbreaking skill + docs (#3157)	2026-03-26 01:37:18 -07:00
Teknium	f84230527c	docs(skill): add split, merge, search examples to ocr-and-documents skill (#2461 ) * fix: respect DashScope v1 runtime mode for alibaba Remove the hardcoded Alibaba branch from resolve_runtime_provider() that forced api_mode='anthropic_messages' regardless of the base URL. Alibaba now goes through the generic API-key provider path, which auto-detects the protocol from the URL: - /apps/anthropic → anthropic_messages (via endswith check) - /v1 → chat_completions (default) This fixes Alibaba setup with OpenAI-compatible DashScope endpoints (e.g. coding-intl.dashscope.aliyuncs.com/v1) that were broken because runtime always forced Anthropic mode even when setup saved a /v1 URL. Based on PR #2024 by @kshitijk4poor. * docs(skill): add split, merge, search examples to ocr-and-documents skill Adds pymupdf examples for PDF splitting, merging, and text search to the existing ocr-and-documents skill. No new dependencies — pymupdf already covers all three operations natively. --------- Co-authored-by: kshitijk4poor <kshitijk4poor@users.noreply.github.com>	2026-03-22 04:31:22 -07:00
Test	672e9752a0	docs: align venv path to match installer (venv/ not .venv/) The install script creates venv/ but several docs referenced .venv/, causing agents to fail with 'No such file or directory' when following AGENTS.md instructions. Fixes #2066	2026-03-19 18:16:26 -07:00
Test	7e30e97a59	chore: trim redundant trigger sentence from huggingface-hub description	2026-03-18 04:18:13 -07:00
Test	adf188c439	chore: add search to huggingface-hub skill description	2026-03-18 04:15:03 -07:00
Test	947827bba0	chore: tighten huggingface-hub skill description	2026-03-18 04:11:33 -07:00
Test	56ca84f243	feat: add huggingface-hub bundled skill Adds the Hugging Face CLI (hf) reference as a built-in skill under mlops/. Covers downloading/uploading models and datasets, repo management, SQL queries on datasets, inference endpoints, Spaces, buckets, and more. Based on the official HF skill from huggingface/skills.	2026-03-18 04:07:41 -07:00
Test	764825bbff	feat: expand hermes-agent-setup skill + tell agent about it in STT notes Skill now covers full CLI usage (hermes setup, hermes skills, hermes tools, hermes config, session management, etc.), config file reference, and expanded gateway commands. Agent context notes for STT failure now mention the hermes-agent-setup skill is available to help users configure Hermes features.	2026-03-18 03:05:17 -07:00
Test	9c0f346258	fix: direct user message on STT failure + hermes-agent-setup skill When a user sends a voice message and STT isn't configured, the gateway now sends a clear message directly to the user explaining how to set up voice transcription, rather than relying on the agent to relay an injected context note (which often gets misinterpreted). Also adds a hermes-agent-setup bundled skill covering STT/TTS setup, tool configuration, dependency installation, and troubleshooting.	2026-03-18 03:01:41 -07:00
Teknium	d132a3dfbb	feat(skills): add inference.sh skill (terminal-based, no custom tools) (#1686 ) Add inference.sh as a built-in skill that uses the terminal tool to run infsh CLI commands. No custom tools or tool registration — the skill teaches the agent how to use the infsh binary via terminal. Covers 150+ AI apps: image gen (FLUX, Reve, Seedream), video (Veo, Wan, Seedance), LLMs, search (Tavily, Exa), 3D, avatars, and more. Includes reference docs for authentication, app discovery, running apps, and CLI command reference. Based on PR #1021 by @okaris, reworked as a skill-only integration. Co-authored-by: okaris <okaris@users.noreply.github.com>	2026-03-17 03:06:53 -07:00
Teknium	c3d626eb07	Revert "feat: add inference.sh integration (infsh tool + skill) (#1682)" (#1684) This reverts commit `6020db0243`.	2026-03-17 03:01:30 -07:00
Teknium	6020db0243	feat: add inference.sh integration (infsh tool + skill) (#1682 ) Add inference.sh CLI (infsh) as a tool integration, giving agents access to 150+ AI apps through a single CLI — image gen (FLUX, Reve, Seedream), video (Veo, Wan, Seedance), LLMs, search (Tavily, Exa), 3D, avatar/lipsync, and more. One API key manages all services. Tools: - infsh: run any infsh CLI command (app list, app run, etc.) - infsh_install: install the CLI if not present Registered as an 'inference' toolset (opt-in, not in core tools). Includes comprehensive skill docs with examples for all app categories. Changes from original PR: - NOT added to _HERMES_CORE_TOOLS (available via --toolsets inference) - Added 12 tests covering tool registration, command execution, error handling, timeout, JSON parsing, and install flow Inspired by PR #1021 by @okaris. Co-authored-by: okaris <okaris@users.noreply.github.com>	2026-03-17 02:59:21 -07:00
SHL0MS	63635744bf	Refactor ascii-video skill: creative-first SKILL.md, consolidate reference files	2026-03-16 20:11:12 -04:00
Teknium	dd7921d514	fix(honcho): isolate session routing for multi-user gateway (#1500 ) Salvaged from PR #1470 by adavyas. Core fix: Honcho tool calls in a multi-session gateway could route to the wrong session because honcho_tools.py relied on process-global state. Now threads session context through the call chain: AIAgent._invoke_tool() → handle_function_call() → registry.dispatch() → handler **kw → _resolve_session_context() Changes: - Add _resolve_session_context() to prefer per-call context over globals - Plumb honcho_manager + honcho_session_key through handle_function_call - Add sync_honcho=False to run_conversation() for synthetic flush turns - Pass honcho_session_key through gateway memory flush lifecycle - Harden gateway PID detection when /proc cmdline is unreadable - Make interrupt test scripts import-safe for pytest-xdist - Wrap BibTeX examples in Jekyll raw blocks for docs build - Fix thread-order-dependent assertion in client lifecycle test - Expand Honcho docs: session isolation, lifecycle, routing internals Dropped from original PR: - Indentation change in _create_request_openai_client that would move client creation inside the lock (causes unnecessary contention) Co-authored-by: adavyas <adavyas@users.noreply.github.com>	2026-03-16 00:23:47 -07:00
teknium1	4524cddc72	fix: persist google oauth pkce for headless auth Store the pending OAuth state and code verifier between --auth-url and --auth-code so the manual headless flow can reuse Flow.fetch_token() without disabling PKCE.	2026-03-14 22:11:34 -07:00
Teknium	b14a07315b	fix: save /plan output in workspace (#1381)	2026-03-14 21:28:51 -07:00
Teknium	ff3473a37c	feat: add /plan command (#1372) * feat: add /plan command * refactor: back /plan with bundled skill * docs: document /plan skill	2026-03-14 21:18:17 -07:00
teknium1	3229e434b8	Merge origin/main into hermes/hermes-5d160594	2026-03-14 19:34:05 -07:00
teknium1	a6dc73fa07	docs: finish cron terminology cleanup	2026-03-14 19:20:58 -07:00
Teknium	6d2cfc24e9	Merge pull request #953 from JackTheGit/fix/docs-typos-batch4 Fix several documentation typos across training references	2026-03-14 10:26:15 -07:00


131 Commits