[COMPUTER_USE] Add Desktop Automation Primitives to Hermes #1125
Open
opened 2026-04-07 21:17:09 +00:00 by Timmy
·
1 comment
No Branch/Tag Specified
main
bezalel/rca-deadman-fratricide
perplexity/soul-md-pointer
claude/issue-1125
claude/issue-1126
feat/mempalace-api-add-1775582323040
claude/issue-1112
groq/issue-1126
groq/issue-1118
groq/issue-1119
groq/issue-1047
groq/issue-915
claude/issue-1075
groq/issue-917
groq/issue-918
groq/issue-1103
groq/issue-1105
groq/issue-1106
groq/issue-1108
groq/issue-1092
groq/issue-1095
groq/issue-1098
groq/issue-913
timmy/issue-fix-896-897-898-910
claude/issue-823
claude/issue-879
claude/issue-880
claude/issue-827
claude/issue-882
claude/issue-826
claude/issue-836
claude/issue-832
claude/issue-833
timmy/issue-855
allegro/self-improvement-infra
ezra/deep-dive-architecture-20260405
claude/modularization-phase-1
gemini/issue-431
v7.0.0
GoldenRockachopa
pre-agent-workers-v1
v0-golden
Labels
Clear labels
222-epic
3d-world
CI
QA
actionable
agent-presence
aistudio-ready
assigned-aistudio
assigned-claude
assigned-claw-code
assigned-gemini
assigned-groq
assigned-kimi
assigned-kimi
assigned-perplexity
assigned-sonnet
blocked
claude-ready
claw-code-done
claw-code-in-progress
deprioritized
duplicate
epic
gemini-api
gemini-review
google-ai-ultra
groq-ready
harness
identity
infrastructure
kimi-done
kimi-in-progress
kimi-ready
lazzyPit
media-gen
modularization
needs-design
nostr
p0-critical
p1-important
p2-backlog
performance
perplexity-ready
portal
research
security
sonnet-ready
sovereignty
velocity-engine
Continuous integration, runners, workflow issues
Quality assurance, testing, and production audit
Queued for Code Claw (qwen/openrouter)
Dispatched to Kimi via OpenClaw
Blocked by external dependency or merge conflict
Code Claw completed this task
Code Claw is actively working
Epic / umbrella issue
Gemini API integration
Google AI Ultra integration work
Timmy identity and branding
Kimi completed this task
Kimi is actively working on this
Lazarus Pit — automated agent resurrection and health recovery
AI media generation (image/video/audio)
Deep research and planning tasks
Security hardening, vulnerability fixes, access control
Auto-generated by velocity engine
No Label
Milestone
No items
No Milestone
Projects
Clear projects
No project
Assignees
KimiClaw
Rockachopa
Timmy
allegro
antigravity
bezalel
claude
claw-code
codex-agent
ezra
gemini
google
grok
hermes
kimi
manus
perplexity
sonnet
Clear assignees
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: Timmy_Foundation/the-nexus#1125
Reference in New Issue
Block a user
Blocking a user prevents them from interacting with repositories, such as opening or commenting on pull requests or issues. Learn more about blocking a user.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Objective
Add computer-use primitives to Hermes so that agents can control a desktop environment (screenshot, click, type, read screen content) for automation and testing.
Background
Anthropic’s "computer use" capability allows Claude to control a desktop via screenshots and mouse/keyboard actions. For our fleet, this unlocks: visual testing of web UIs, automated Gitea workflow verification, screenshot-based incident diagnosis, and operating GUI-only tools.
Acceptance Criteria
Phase 1 — Environment & Primitives (1 week)
Phase 2 — Tool Integration (1 week)
computer_screenshot,computer_click,computer_type,computer_scrollrm -rf, password entry) require explicit confirmationPhase 3 — Use-Case Pilots (1 week)
the-nexus/docs/computer-use.mdSuggested Implementation Path
playwright,pyautogui, oranthropic-computer-use-demoreference codeOwner
Bezalel
Linked Epic
#1120
PR created: #1131
Summary of changes
nexus/computer_use.py— four Hermes tools with poka-yoke safety and JSONL action logging:computer_screenshot()— capture & log desktop snapshotcomputer_click(x, y, button, confirm)— right/middle clicks requireconfirm=Truecomputer_type(text, confirm)— sensitive keywords (password/token/key) refused withoutconfirm=Truecomputer_scroll(x, y, amount)— scroll wheelread_action_log()— read recent log entriespyautogui.FAILSAFE=Trueglobally; all tools degrade gracefully when headlessnexus/computer_use_demo.py— Phase 1 demo: baseline screenshot → open browser → navigate to Gitea → evidence screenshottests/test_computer_use.py— 29 unit tests, fully headless (pyautogui mocked), all passingdocs/computer-use.md— full Phase 1–3 documentation with API reference, safety table, pilot recipesdocker-compose.desktop.yml— sandboxed Xvfb + noVNC container