teknium1
5c867fd79f
test: strengthen assertions across 3 more test files (batch 2)
...
test_run_agent.py (2 weak → 0, +13 assertions):
- Session ID validated against actual YYYYMMDD_HHMMSS_hex format
- API failure verifies error message propagation
- Invalid JSON args verifies empty dict fallback + message structure
- Context compression verifies final_response + completed flag
- Invalid tool name retry verifies api_calls count
- Invalid response verifies completed/failed/error structure
test_model_tools.py (3 weak → 0):
- Unknown tool error includes tool name in message
- Exception returns dict with 'error' key + non-empty message
- get_all_tool_names verifies both web_search AND terminal present
test_approval.py (1 weak → 0, assert ratio 1.1 → 2.2):
- Dangerous commands verify description content (delete, shell, drop, etc.)
- Safe commands explicitly assert key AND desc are None
- Pre/post condition checks for state management
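
A minimal sketch of the session ID check described above, assuming the ID is a timestamp followed by a lowercase hex suffix. The helper names and the suffix length are hypothetical and are not taken from the project:

```python
import re
import secrets
from datetime import datetime

# Assumed shape: YYYYMMDD_HHMMSS_<hex>; the hex length is a guess, so any length >= 1 is accepted.
SESSION_ID_RE = re.compile(r"\d{8}_\d{6}_[0-9a-f]+")


def make_session_id(now=None):
    """Build an ID in the assumed format (hypothetical helper)."""
    now = now or datetime.now()
    return f"{now.strftime('%Y%m%d_%H%M%S')}_{secrets.token_hex(4)}"


def is_valid_session_id(session_id):
    """Check the overall shape, then confirm the timestamp is a real date and time."""
    if not SESSION_ID_RE.fullmatch(session_id):
        return False
    try:
        # The first 15 characters are YYYYMMDD_HHMMSS.
        datetime.strptime(session_id[:15], "%Y%m%d_%H%M%S")
    except ValueError:
        return False
    return True
```

Parsing the timestamp as well as matching the regex rejects IDs that only look right, such as one with month 13.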
2026-03-05 18:46:30 -08:00
teknium1
b4b426c69d
test: add coverage for tee, process substitution, and full-path rm patterns
...
Tests for the three new dangerous command patterns added in PR #280:
- TestProcessSubstitutionPattern: 7 tests (bash/sh/zsh/ksh + safe commands)
- TestTeePattern: 7 tests (sensitive paths + safe destinations)
- TestFindExecFullPathRm: 4 tests (/bin/rm, /usr/bin/rm, bare rm, safe find)
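
As an illustration of the find/rm cases listed above, here is a rough sketch of a pattern that treats a bare rm and a full-path rm the same way. The regex and function name are assumptions for illustration, not the pattern added in the PR:

```python
import re

# Hypothetical pattern: "find ... -exec" followed by rm, with an optional directory prefix.
FIND_EXEC_RM = re.compile(r"\bfind\b.*-exec\s+(?:/\S*/)?rm\b")


def is_find_exec_rm(command):
    """Return True when a find command hands its matches to rm."""
    return FIND_EXEC_RM.search(command) is not None
```

The trailing word boundary keeps commands such as rmdir from matching, and a find that runs a harmless program such as grep is left alone.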
2026-03-05 01:58:33 -08:00
teknium1
7862e7010c
test: add additional multiline bypass tests for find patterns
...
Extra test coverage for newline bypass detection (DOTALL fix).
Inspired by Bartok9's PR #245.
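
The newline bypass can be reproduced in a few lines. This is a sketch with a hypothetical pattern, not the project's detection code: without re.DOTALL, `.` stops at a newline, so a command split across lines slips past a pattern that expects everything on one line.

```python
import re

# Hypothetical pattern for "find ... -delete".
PATTERN = r"\bfind\b.*-delete\b"


def is_flagged(command, flags=0):
    """Return True when the pattern matches under the given regex flags."""
    return re.search(PATTERN, command, flags) is not None


# The same command, written with a line continuation.
multiline_cmd = "find /tmp -name '*.log' \\\n  -delete"

missed = is_flagged(multiline_cmd)             # '.' does not cross the newline
caught = is_flagged(multiline_cmd, re.DOTALL)  # '.' now matches the newline too
```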
2026-03-02 04:46:27 -08:00
Farukest
7166647ca1
fix(security): add re.DOTALL to prevent multiline bypass of dangerous command detection
2026-03-01 03:23:29 +03:00
darya
f5c09a3aba
test: add regression tests for recursive delete false positive fix
...
Add 15 new tests in two classes:
- TestRmFalsePositiveFix (8 tests): verify filenames starting with 'r'
  (readme.txt, requirements.txt, report.csv, etc.) are NOT falsely
  flagged as 'recursive delete'
- TestRmRecursiveFlagVariants (7 tests): verify all recursive delete
  flag styles (-r, -rf, -rfv, -fr, -irf, --recursive, sudo rm -rf)
  are still correctly caught
All 29 tests pass (14 existing + 15 new).
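
A sketch of the distinction these tests guard, assuming a regex-based check. The pattern below is illustrative only: it requires a leading dash, so an 'r' at the start of a filename cannot be read as a flag. It only looks at the option group directly after rm.

```python
import re

# Hypothetical pattern: a short option group containing 'r', or the long form --recursive.
RECURSIVE_RM = re.compile(r"\brm\s+(?:-[a-zA-Z]*r[a-zA-Z]*|--recursive)\b")


def is_recursive_delete(command):
    """Return True when rm is invoked with a recursive flag right after the command name."""
    return RECURSIVE_RM.search(command) is not None
```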
2026-02-26 16:40:44 +03:00
0xbyt4
8fc28c34ce
test: reorganize test structure and add missing unit tests
...
Reorganize flat tests/ directory to mirror source code structure
(tools/, gateway/, hermes_cli/, integration/). Add 11 new test files
covering previously untested modules: registry, patch_parser,
fuzzy_match, todo_tool, approval, file_tools, gateway session/config/
delivery, and hermes_cli config/models. Total: 147 unit tests passing,
9 integration tests gated behind pytest marker.
2026-02-26 03:20:08 +03:00