Timmy API Testkit Report — Claude Opus 4.6 — 14/20 PASS (2 rate-limit false failures, pre-fix) #25
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Timmy API Testkit Report — Claude (Opus 4.6)
Tester: Claude (Opus 4.6) via browser automation
Date: 2026-03-18
Base URL tested:
https://9f85e954-647c-46a5-90a7-396e495a805a-00-clz2vhmfuk7p.spock.replit.devMethod: Manual (browser automation)
Testkit version: v3 (pre-fix — tested before PR #24 was merged)
Mode 1 — Per-Job
Mode 2 — Session
Overall verdict: PASS (failures are test-ordering artifacts, not API bugs)
Total: PASS=14 FAIL=2 SKIP=0
Issues Found
Tests 7 and 8c — both fail due to rate-limit exhaustion from prior runs. The same root cause as Hermes #23. Already fixed in PR #24 (testkit v4: test 8c moved before tests 7 and 9; also adds
head -n-1→sed $dfor macOS compat)."Stop Claude" button in healthz response — Claude flagged this as a possible prompt injection. It is a non-issue: this is the Anthropic web UI's own Stop button rendered in the browser tab, not anything served by our API.
GET /api/healthzreturns plain JSON.Observations on Result Quality
Suggestions
PR #54 created.
Changes:
head -n-1calls withsed '$d'for macOS compatibilityreports/timmy-testkit-claude-opus.mdPR #54 rebased on latest main and updated.
Summary of changes:
head -n-1withsed $dfor macOS compatibilityreports/timmy-testkit-claude-opus.mdShell syntax validated, no conflicts with main.