Rockachopa/Timmy-time-dashboard
c58093dcccd837e0b899a11785e90c03881eea37
Timmy-time-dashboard/scripts/benchmarks
Claude (Opus 4.6)  7dfbf05867  [claude] Run 5-test benchmark suite against local model candidates (#1066) (#1271)  2026-03-24 01:38:59 +00:00
Some checks failed: Tests / lint (push) has been cancelled; Tests / test (push) has been cancelled
01_tool_calling.py          [claude] Run 5-test benchmark suite against local model candidates (#1066) (#1271)  2026-03-24 01:38:59 +00:00
02_code_generation.py       [claude] Run 5-test benchmark suite against local model candidates (#1066) (#1271)  2026-03-24 01:38:59 +00:00
03_shell_commands.py        [claude] Run 5-test benchmark suite against local model candidates (#1066) (#1271)  2026-03-24 01:38:59 +00:00
04_multi_turn_coherence.py  [claude] Run 5-test benchmark suite against local model candidates (#1066) (#1271)  2026-03-24 01:38:59 +00:00
05_issue_triage.py          [claude] Run 5-test benchmark suite against local model candidates (#1066) (#1271)  2026-03-24 01:38:59 +00:00
run_suite.py                [claude] Run 5-test benchmark suite against local model candidates (#1066) (#1271)  2026-03-24 01:38:59 +00:00