hermes-agent

Author	SHA1	Message	Date
teknium1	5c867fd79f	test: strengthen assertions across 3 more test files (batch 2) test_run_agent.py (2 weak → 0, +13 assertions): - Session ID validated against actual YYYYMMDD_HHMMSS_hex format - API failure verifies error message propagation - Invalid JSON args verifies empty dict fallback + message structure - Context compression verifies final_response + completed flag - Invalid tool name retry verifies api_calls count - Invalid response verifies completed/failed/error structure test_model_tools.py (3 weak → 0): - Unknown tool error includes tool name in message - Exception returns dict with 'error' key + non-empty message - get_all_tool_names verifies both web_search AND terminal present test_approval.py (1 weak → 0, assert ratio 1.1 → 2.2): - Dangerous commands verify description content (delete, shell, drop, etc.) - Safe commands explicitly assert key AND desc are None - Pre/post condition checks for state management	2026-03-05 18:46:30 -08:00
Teknium	21d61bdd71	Merge pull request #307 from batuhankocyigit/patch-1 fix: correct typo 'Grup' -> 'Group' in test section headers	2026-03-05 08:54:05 -08:00
0xbyt4	aefc330b8f	merge: resolve conflict with main (add mcp + homeassistant extras)	2026-03-03 14:52:22 +03:00
BathreeNode	f08ad94d4d	fix: correct typo 'Grup' -> 'Group' in test section headers Three section header comments in tests/test_run_agent.py used 'Grup' instead of 'Group': - Line 124: # Grup 1: Pure Functions - Line 276: # Grup 2: State / Structure Methods - Line 572: # Grup 3: Conversation Loop Pieces (OpenAI mock)	2026-03-03 09:10:35 +03:00
teknium1	56b53bff6e	Merge PR #229 : fix(agent): copy conversation_history to avoid mutating caller's list Authored by Farukest. Fixes #228. # Conflicts: # tests/test_run_agent.py	2026-03-02 04:21:39 -08:00
teknium1	c4ea996612	fix: repair flush sentinel test — mock auxiliary client and add guard The TestFlushSentinelNotLeaked test from PR #227 had two issues: 1. flush_memories() uses get_text_auxiliary_client() which could bypass agent.client entirely — mock it to return (None, None) 2. No assertion that the API was actually called — added guard assert Without these fixes the test passed vacuously (API never called).	2026-03-02 03:21:08 -08:00
teknium1	234b67f5fd	fix: mock time in retry exhaustion tests to prevent backoff sleep The TestRetryExhaustion tests from PR #223 didn't mock time.sleep/time.time, causing the retry backoff loops (275s+ total) to run in real time. Tests would time out instead of running quickly. Added _make_fast_time_mock() helper that creates a mock time module where time.time() advances 500s per call (so sleep_end is always in the past) and time.sleep() is a no-op. Both tests now complete in <1s.	2026-03-02 02:59:41 -08:00
Farukest	e87859e82c	fix(agent): copy conversation_history to avoid mutating caller's list	2026-03-01 03:06:13 +03:00
Farukest	de101a8202	fix(agent): strip _flush_sentinel from API messages	2026-03-01 02:51:31 +03:00
Farukest	c33f8d381b	fix: correct off-by-one in retry exhaustion checks The retry exhaustion checks used > instead of >= to compare retry_count against max_retries. Since the while loop condition is retry_count < max_retries, the check retry_count > max_retries can never be true inside the loop. When retries are exhausted, the loop exits and falls through to response.choices[0] on an invalid response, crashing with IndexError instead of returning a proper error.	2026-03-01 02:27:26 +03:00
0xbyt4	dfd50ceccd	fix: preserve Gemini thought_signature in tool call messages Gemini 3 thinking models attach extra_content with thought_signature to function call responses. This must be echoed back on subsequent API calls or the server rejects with a 400 error. The assistant message builder was dropping this field, causing all Gemini 3 Flash/Pro tool-calling flows to fail after the first function call.	2026-02-28 18:10:05 +03:00
teknium1	50cb4d5fc7	fix(agent): update error message for unsupported Anthropic API endpoints to clarify usage of OpenRouter	2026-02-27 23:23:31 -08:00
Teknium	2bc9508b7c	Merge pull request #173 from adavyas/fix/anthropic-base-url-guard fix(agent): fail fast on Anthropic native base URLs	2026-02-27 23:22:01 -08:00
teknium1	19f28a633a	fix(agent): enhance 413 error handling and improve conversation history management in tests	2026-02-27 23:04:32 -08:00
adavyas	0c0a2eb0a2	fix(agent): fail fast on Anthropic native base URLs	2026-02-27 21:19:29 -08:00
0xbyt4	90ca2ae16b	test: add unit tests for run_agent.py (AIAgent) 71 tests covering pure functions, state/structure methods, and conversation loop pieces. OpenAI client and tool loading are mocked.	2026-02-26 16:15:04 +03:00

16 Commits