Files

Alexander Payne 14072f9bb5 feat: MCP tools integration for swarm agents

ToolExecutor:
- Persona-specific toolkit selection (forge gets code tools, echo gets search)
- Tool inference from task keywords (search→web_search, code→python)
- LLM-powered reasoning about tool selection
- Graceful degradation when Agno unavailable

PersonaNode Updates:
- Subscribe to swarm:events for task assignments
- Execute tasks using ToolExecutor when assigned
- Complete tasks via comms.complete_task()
- Track current_task for status monitoring

Tests:
- 19 new tests for tool execution
- All 6 personas covered
- Tool inference verification
- Edge cases (no toolkit, unknown tasks)

Total: 491 tests passing

2026-02-22 20:33:26 -05:00

4.6 KiB

Raw Blame History

Kimi Checkpoint - Updated 2026-02-22 22:45 EST

Session Info

Duration: ~2.5 hours
Commits: 1 (c5df954 + this session)
Assignment: Option A - MCP Tools Integration

Current State

Branch

kimi/sprint-v2-swarm-tools-serve → origin/kimi/sprint-v2-swarm-tools-serve

Test Status

491 passed, 0 warnings

What Was Done

Option A: MCP Tools Integration ✅ COMPLETE

Problem: Tools existed (src/timmy/tools.py) but weren't wired into the agent execution loop. Agents could bid on tasks but not actually execute them.

Solution: Built tool execution layer connecting personas to their specialized tools.

1. ToolExecutor (`src/swarm/tool_executor.py`)

Manages tool execution for persona agents:

executor = ToolExecutor.for_persona("forge", "forge-001")
result = executor.execute_task("Write a fibonacci function")
# Returns: {success, result, tools_used, persona_id, agent_id}

Features:

Persona-specific toolkit selection
Tool inference from task keywords
LLM-powered reasoning about tool use
Graceful degradation when Agno unavailable

Tool Mapping:

Persona	Tools
Echo	web_search, read_file, list_files
Forge	shell, python, read_file, write_file, list_files
Seer	python, read_file, list_files, web_search
Quill	read_file, write_file, list_files
Mace	shell, web_search, read_file, list_files
Helm	shell, read_file, write_file, list_files

2. PersonaNode Task Execution

Updated src/swarm/persona_node.py:

Subscribes to swarm:events channel
When task_assigned event received → executes task
Uses ToolExecutor to process task with appropriate tools
Calls comms.complete_task() with result
Tracks current_task for status monitoring

Execution Flow:

Task Assigned → PersonaNode._handle_task_assignment()
    ↓
Fetch task description
    ↓
ToolExecutor.execute_task()
    ↓
Infer tools from keywords
    ↓
LLM reasoning (when available)
    ↓
Return formatted result
    ↓
Mark task complete

3. Tests (`tests/test_tool_executor.py`)

19 new tests covering:

ToolExecutor initialization for all personas
Tool inference from task descriptions
Task execution with/without tools available
PersonaNode integration
Edge cases (unknown tasks, no toolkit, etc.)

Files Changed

src/swarm/tool_executor.py        (new, 282 lines)
src/swarm/persona_node.py         (modified)
tests/test_tool_executor.py       (new, 19 tests)

How It Works Now

Task Posted → Coordinator creates task, opens auction
Bidding → PersonaNodes bid based on keyword matching
Auction Close → Winner selected
Assignment → Coordinator publishes task_assigned event
Execution → Winning PersonaNode:
- Receives assignment via comms
- Fetches task description
- Uses ToolExecutor to process
- Returns result via complete_task()
Completion → Task marked complete, agent returns to idle

Graceful Degradation

When Agno tools unavailable (tests, missing deps):

ToolExecutor initializes with toolkit=None
Task execution still works (simulated mode)
Tool inference works for logging/analysis
No crashes, clear logging

Integration with Previous Work

This builds on:

✅ Lightning interface (c5df954)
✅ Swarm routing with capability manifests
✅ Persona definitions with preferred_keywords
✅ Auction and bidding system

Test Results

$ make test
491 passed in 1.10s

$ pytest tests/test_tool_executor.py -v
19 passed

Next Steps

From the 7-hour task list, remaining items:

Hour 4 — Scary path tests:

Concurrent swarm load test (10 simultaneous tasks)
Memory persistence under restart
L402 macaroon expiry
WebSocket reconnection
Voice NLU edge cases

Hour 6 — Mission Control UX:

Real-time swarm feed via WebSocket
Heartbeat daemon visible in UI
Chat history persistence

Hour 7 — Handoff & docs:

QUALITY_ANALYSIS.md update
Revelation planning

Quick Commands

# Test tool execution
pytest tests/test_tool_executor.py -v

# Check tool mapping for a persona
python -c "from swarm.tool_executor import ToolExecutor; e = ToolExecutor.for_persona('forge', 'test'); print(e.get_capabilities())"

# Simulate task execution
python -c "
from swarm.tool_executor import ToolExecutor
e = ToolExecutor.for_persona('echo', 'echo-001')
r = e.execute_task('Search for Python tutorials')
print(f'Tools: {r[\"tools_used\"]}')
print(f'Result: {r[\"result\"][:100]}...')
"

491 tests passing. MCP Tools Option A complete.

4.6 KiB Raw Blame History