When a skill declares required_environment_variables in its YAML frontmatter, missing env vars trigger a secure TUI prompt (identical to the sudo password widget) when the skill is loaded. Secrets flow directly to ~/.hermes/.env, never entering LLM context. Key changes: - New required_environment_variables frontmatter field for skills - Secure TUI widget (masked input, 120s timeout) - Gateway safety: messaging platforms show local setup guidance - Legacy prerequisites.env_vars normalized into new format - Remote backend handling: conservative setup_needed=True - Env var name validation, file permissions hardened to 0o600 - Redact patterns extended for secret-related JSON fields - 12 existing skills updated with prerequisites declarations - ~48 new tests covering skip, timeout, gateway, remote backends - Dynamic panel widget sizing (fixes hardcoded width from original PR) Cherry-picked from PR #723 by kshitijk4poor, rebased onto current main with conflict resolution. Fixes #688 Co-authored-by: kshitijk4poor <kshitijk4poor@users.noreply.github.com>
177 lines
5.8 KiB
Markdown
177 lines
5.8 KiB
Markdown
---
|
|
sidebar_position: 3
|
|
title: "Creating Skills"
|
|
description: "How to create skills for Hermes Agent — SKILL.md format, guidelines, and publishing"
|
|
---
|
|
|
|
# Creating Skills
|
|
|
|
Skills are the preferred way to add new capabilities to Hermes Agent. They're easier to create than tools, require no code changes to the agent, and can be shared with the community.
|
|
|
|
## Should it be a Skill or a Tool?
|
|
|
|
Make it a **Skill** when:
|
|
- The capability can be expressed as instructions + shell commands + existing tools
|
|
- It wraps an external CLI or API that the agent can call via `terminal` or `web_extract`
|
|
- It doesn't need custom Python integration or API key management baked into the agent
|
|
- Examples: arXiv search, git workflows, Docker management, PDF processing, email via CLI tools
|
|
|
|
Make it a **Tool** when:
|
|
- It requires end-to-end integration with API keys, auth flows, or multi-component configuration
|
|
- It needs custom processing logic that must execute precisely every time
|
|
- It handles binary data, streaming, or real-time events
|
|
- Examples: browser automation, TTS, vision analysis
|
|
|
|
## Skill Directory Structure
|
|
|
|
Bundled skills live in `skills/` organized by category. Official optional skills use the same structure in `optional-skills/`:
|
|
|
|
```
|
|
skills/
|
|
├── research/
|
|
│ └── arxiv/
|
|
│ ├── SKILL.md # Required: main instructions
|
|
│ └── scripts/ # Optional: helper scripts
|
|
│ └── search_arxiv.py
|
|
├── productivity/
|
|
│ └── ocr-and-documents/
|
|
│ ├── SKILL.md
|
|
│ ├── scripts/
|
|
│ └── references/
|
|
└── ...
|
|
```
|
|
|
|
## SKILL.md Format
|
|
|
|
```markdown
|
|
---
|
|
name: my-skill
|
|
description: Brief description (shown in skill search results)
|
|
version: 1.0.0
|
|
author: Your Name
|
|
license: MIT
|
|
platforms: [macos, linux] # Optional — restrict to specific OS platforms
|
|
# Valid: macos, linux, windows
|
|
# Omit to load on all platforms (default)
|
|
metadata:
|
|
hermes:
|
|
tags: [Category, Subcategory, Keywords]
|
|
related_skills: [other-skill-name]
|
|
---
|
|
|
|
# Skill Title
|
|
|
|
Brief intro.
|
|
|
|
## When to Use
|
|
Trigger conditions — when should the agent load this skill?
|
|
|
|
## Quick Reference
|
|
Table of common commands or API calls.
|
|
|
|
## Procedure
|
|
Step-by-step instructions the agent follows.
|
|
|
|
## Pitfalls
|
|
Known failure modes and how to handle them.
|
|
|
|
## Verification
|
|
How the agent confirms it worked.
|
|
```
|
|
|
|
### Platform-Specific Skills
|
|
|
|
Skills can restrict themselves to specific operating systems using the `platforms` field:
|
|
|
|
```yaml
|
|
platforms: [macos] # macOS only (e.g., iMessage, Apple Reminders)
|
|
platforms: [macos, linux] # macOS and Linux
|
|
platforms: [windows] # Windows only
|
|
```
|
|
|
|
When set, the skill is automatically hidden from the system prompt, `skills_list()`, and slash commands on incompatible platforms. If omitted or empty, the skill loads on all platforms (backward compatible).
|
|
|
|
See `skills/apple/` for examples of macOS-only skills.
|
|
|
|
## Secure Setup on Load
|
|
|
|
Use `required_environment_variables` when a skill needs an API key or token. Missing values do **not** hide the skill from discovery. Instead, Hermes prompts for them securely when the skill is loaded in the local CLI.
|
|
|
|
```yaml
|
|
required_environment_variables:
|
|
- name: TENOR_API_KEY
|
|
prompt: Tenor API key
|
|
help: Get a key from https://developers.google.com/tenor
|
|
required_for: full functionality
|
|
```
|
|
|
|
The user can skip setup and keep loading the skill. Hermes never exposes the raw secret value to the model. Gateway and messaging sessions show local setup guidance instead of collecting secrets in-band.
|
|
|
|
Legacy `prerequisites.env_vars` remains supported as a backward-compatible alias.
|
|
|
|
## Skill Guidelines
|
|
|
|
### No External Dependencies
|
|
|
|
Prefer stdlib Python, curl, and existing Hermes tools (`web_extract`, `terminal`, `read_file`). If a dependency is needed, document installation steps in the skill.
|
|
|
|
### Progressive Disclosure
|
|
|
|
Put the most common workflow first. Edge cases and advanced usage go at the bottom. This keeps token usage low for common tasks.
|
|
|
|
### Include Helper Scripts
|
|
|
|
For XML/JSON parsing or complex logic, include helper scripts in `scripts/` — don't expect the LLM to write parsers inline every time.
|
|
|
|
### Test It
|
|
|
|
Run the skill and verify the agent follows the instructions correctly:
|
|
|
|
```bash
|
|
hermes chat --toolsets skills -q "Use the X skill to do Y"
|
|
```
|
|
|
|
## Where Should the Skill Live?
|
|
|
|
Bundled skills (in `skills/`) ship with every Hermes install. They should be **broadly useful to most users**:
|
|
|
|
- Document handling, web research, common dev workflows, system administration
|
|
- Used regularly by a wide range of people
|
|
|
|
If your skill is official and useful but not universally needed (e.g., a paid service integration, a heavyweight dependency), put it in **`optional-skills/`** — it ships with the repo, is discoverable via `hermes skills browse` (labeled "official"), and installs with builtin trust.
|
|
|
|
If your skill is specialized, community-contributed, or niche, it's better suited for a **Skills Hub** — upload it to a registry and share it via `hermes skills install`.
|
|
|
|
## Publishing Skills
|
|
|
|
### To the Skills Hub
|
|
|
|
```bash
|
|
hermes skills publish skills/my-skill --to github --repo owner/repo
|
|
```
|
|
|
|
### To a Custom Repository
|
|
|
|
Add your repo as a tap:
|
|
|
|
```bash
|
|
hermes skills tap add owner/repo
|
|
```
|
|
|
|
Users can then search and install from your repository.
|
|
|
|
## Security Scanning
|
|
|
|
All hub-installed skills go through a security scanner that checks for:
|
|
|
|
- Data exfiltration patterns
|
|
- Prompt injection attempts
|
|
- Destructive commands
|
|
- Shell injection
|
|
|
|
Trust levels:
|
|
- `builtin` — ships with Hermes (always trusted)
|
|
- `official` — from `optional-skills/` in the repo (builtin trust, no third-party warning)
|
|
- `trusted` — from openai/skills, anthropics/skills
|
|
- `community` — any findings = blocked unless `--force`
|