[HARNESS] Z3 Crucible as a timmy-config sidecar (no Hermes fork) #86

New Issue

Timmy · 2026-03-29T00:03:08Z

Timmy commented

2026-03-29 00:03:08 +00:00

Context
The raw proposal is strong in spirit: add a formal verifier so Timmy can prove constraint logic instead of bluffing through scheduling/resource-allocation problems.

But the implementation must fit current architecture:

timmy-config is now the source-controlled home for Timmy's scripts and toolset
hermes-agent stays upstream; do not fork or host a custom Hermes codebase
raw exec() of model-written Python inside the main agent process is not acceptable as the permanent design

Triage decision
ACCEPT THE DIRECTION. RESCOPE THE IMPLEMENTATION.

We want a Z3-backed Crucible, but as a sidecar capability deployed from timmy-config, not as a direct long-lived patch to hermes-agent.

V0 architecture

Install z3-solver locally on the Mac.
Add a Timmy-owned verifier service/script inside timmy-config (prefer MCP server or other sidecar entrypoint already supported by Hermes config).
Expose a verify_logic / crucible-style tool to the agent through sidecar config, not a fork.
Keep inputs constrained:
- preferred: structured JSON or SMT-LIB fragments
- acceptable prototype: z3py in a jailed subprocess with timeout and strict allowlist
- reject: unrestricted exec() in the main agent runtime
Add a Timmy playbook/skill for when Crucible is mandatory:
- scheduling
- dependency constraints
- resource allocation
- consistency checking
Log SAT / UNSAT / model output as proof trail.

Non-goals

Do NOT replace all normal conversation with formal proof.
Do NOT force every answer through Z3.
Do NOT create a permanent hermes-agent fork just to add one tool.

Acceptance criteria

All implementation lives in Timmy_Foundation/timmy-config and deploys through deploy.sh.
Hermes can call the verifier as a sidecar tool/service after deploy.
Example prompt works end-to-end: “Can tasks of duration 2, 3, and 4 fit in an 8-hour window if B depends on A?”
The system returns SAT/UNSAT plus witness model (or contradiction), not hand-wavy prose.
At least 3 reusable verified templates exist:
- task scheduling
- dependency ordering
- capacity/resource constraints
Failure mode is honest: if the verifier cannot prove it, Timmy says so.

Related prior architecture issues

#36 / #37 Reasoning-DPO with Z3 truth oracle
#38 / #39 Lean 4 integration
#40 / #41 SymPy tool
#42 / #43 Adaptive logic router

Suggested first cut
Implement the narrowest useful slice first:

one sidecar verifier
one scheduling template
one proof-producing demo path
Then decide whether to expand into router / training / broader neuro-symbolic work.

Context The raw proposal is strong in spirit: add a formal verifier so Timmy can prove constraint logic instead of bluffing through scheduling/resource-allocation problems. But the implementation must fit current architecture: - timmy-config is now the source-controlled home for Timmy's scripts and toolset - hermes-agent stays upstream; do not fork or host a custom Hermes codebase - raw exec() of model-written Python inside the main agent process is not acceptable as the permanent design Triage decision ACCEPT THE DIRECTION. RESCOPE THE IMPLEMENTATION. We want a Z3-backed Crucible, but as a sidecar capability deployed from timmy-config, not as a direct long-lived patch to hermes-agent. V0 architecture 1. Install z3-solver locally on the Mac. 2. Add a Timmy-owned verifier service/script inside timmy-config (prefer MCP server or other sidecar entrypoint already supported by Hermes config). 3. Expose a verify_logic / crucible-style tool to the agent through sidecar config, not a fork. 4. Keep inputs constrained: - preferred: structured JSON or SMT-LIB fragments - acceptable prototype: z3py in a jailed subprocess with timeout and strict allowlist - reject: unrestricted exec() in the main agent runtime 5. Add a Timmy playbook/skill for when Crucible is mandatory: - scheduling - dependency constraints - resource allocation - consistency checking 6. Log SAT / UNSAT / model output as proof trail. Non-goals - Do NOT replace all normal conversation with formal proof. - Do NOT force every answer through Z3. - Do NOT create a permanent hermes-agent fork just to add one tool. Acceptance criteria - All implementation lives in Timmy_Foundation/timmy-config and deploys through deploy.sh. - Hermes can call the verifier as a sidecar tool/service after deploy. - Example prompt works end-to-end: “Can tasks of duration 2, 3, and 4 fit in an 8-hour window if B depends on A?” - The system returns SAT/UNSAT plus witness model (or contradiction), not hand-wavy prose. - At least 3 reusable verified templates exist: - task scheduling - dependency ordering - capacity/resource constraints - Failure mode is honest: if the verifier cannot prove it, Timmy says so. Related prior architecture issues - #36 / #37 Reasoning-DPO with Z3 truth oracle - #38 / #39 Lean 4 integration - #40 / #41 SymPy tool - #42 / #43 Adaptive logic router Suggested first cut Implement the narrowest useful slice first: - one sidecar verifier - one scheduling template - one proof-producing demo path Then decide whether to expand into router / training / broader neuro-symbolic work.

Timmy self-assigned this 2026-03-29 00:03:08 +00:00

Timmy referenced a pull request that will close this issue

2026-03-29 00:53:46 +00:00

[crucible] Z3 sidecar MCP verifier first cut (#86) #88

Timmy commented

2026-03-29 00:53:47 +00:00

First cut is up in PR #88.

Tangible results already verified locally:

~/.hermes/bin/crucible_mcp_server.py selftest returns SAT/UNSAT witness results
proof logs are being written under ~/.hermes/logs/crucible/
fresh MCP discovery sees:
- mcp_crucible_schedule_tasks
- mcp_crucible_order_dependencies
- mcp_crucible_capacity_fit

PR: http://143.198.27.163:3000/Timmy_Foundation/timmy-config/pulls/88

First cut is up in PR #88. Tangible results already verified locally: - `~/.hermes/bin/crucible_mcp_server.py selftest` returns SAT/UNSAT witness results - proof logs are being written under `~/.hermes/logs/crucible/` - fresh MCP discovery sees: - mcp_crucible_schedule_tasks - mcp_crucible_order_dependencies - mcp_crucible_capacity_fit PR: http://143.198.27.163:3000/Timmy_Foundation/timmy-config/pulls/88

Timmy referenced this issue

2026-03-29 06:00:56 +00:00

☀️ Good Morning Report — 2026-03-29 (Sunday) #89

Timmy commented

2026-03-29 23:59:00 +00:00

A GOFAI-oriented follow-on workload is now staged in timmy-config#98 — policy cards and constraint sidecar for local Timmy decisions.

Intent: build on the Crucible/sidecar philosophy without forking Hermes, and use explicit policy/constraint scaffolding to improve local decision reliability.

A GOFAI-oriented follow-on workload is now staged in `timmy-config#98` — policy cards and constraint sidecar for local Timmy decisions. Intent: build on the Crucible/sidecar philosophy without forking Hermes, and use explicit policy/constraint scaffolding to improve local decision reliability.

Timmy referenced this issue

2026-03-30 06:01:00 +00:00

☀️ Good Morning Report — 2026-03-30 (Monday) #99

Timmy commented

2026-03-30 16:49:50 +00:00

PR #88 is open for this. Timmy: review and merge or close.

Rockachopa referenced this issue

2026-03-30 17:06:35 +00:00

[crucible] Z3 sidecar MCP verifier first cut (#86) #88

gemini referenced a pull request that will close this issue

2026-03-30 22:20:27 +00:00

[crucible] Z3 sidecar MCP verifier first cut (#86) #88

Timmy referenced this issue from a commit

2026-04-03 22:58:45 +00:00

feat(crucible): Z3 sidecar MCP verifier -- rebased onto current main

Timmy closed this issue

2026-04-03 22:58:45 +00:00

Timmy commented

2026-04-03 22:59:37 +00:00

Crucible merged to main in commit 8ec4bff. Three templates shipping: schedule_tasks, order_dependencies, capacity_fit. Proof trail logging active. Closing.

Crucible merged to main in commit 8ec4bff. Three templates shipping: schedule_tasks, order_dependencies, capacity_fit. Proof trail logging active. Closing.

Timmy referenced this issue

2026-04-03 22:59:40 +00:00

[GOFAI] Policy cards and constraint sidecar for local Timmy decisions #98

Timmy referenced this issue

2026-04-04 11:55:13 +00:00

[RETRO] Burn Down Night Retrospective -- 2026-04-04 #114

Timmy referenced this issue

2026-04-04 11:56:25 +00:00

[RETRO] Burn Down Night Retrospective -- 2026-04-04 #114

Timmy referenced this issue

2026-04-04 16:43:46 +00:00

[GOFAI] Policy cards and constraint sidecar for local Timmy decisions #98

Sign in to join this conversation.

Branches Tags

main

ezra/lazarus-cell-spec-268

allegro/m2-commit-or-abort-845

gemini/issue-246

allegro/m1-stop-protocol-842

gemini/issue-182

master

feat/architecture-linter-provenance

feat/adr-system-provenance

sonnet/smoke-test-sonnet

sonnet/issue-260

docs/automation-audit-20260404

docs/architecture-kt-unified-schema

feat/frontier-local-layer-4-mesh

timmy/code-claw-docs

claw-code/issue-232

feat/frontier-local-layer-5-immortality

feat/frontier-local-layer-3

feature/workforce-manager

feat/frontier-local-agenda-v2

feat/cost-saving-guide

timmy/gemini-loop-hardening

timmy/orchestrator-kimi-heartbeat-status

timmy/orchestrator-kimi-visibility

timmy/issue-186-import-bridge

codex/workflow-pr-review

feat/sovereign-identity-phase-23

feat/sovereign-evolution-redistribution

gemini/orchestration-hardening

gemini/audit-bugfixes

timmy/issue-86-z3-crucible

feat/allegro-identity-fix

gemini/issue-75

gemini/issue-76

gemini/issue-78

review/move-last-two-main-commits-20260328-000322

gemini/issue-50

backup/main-before-reset-20260328-000322

gemini/issue-52

gemini/issue-54

fix/mcp-morrowind-tool-naming

gemini/issue-59

gemini/issue-60

gemini/issue-61

gemini/issue-62

gemini/issue-63

gemini/issue-41

gemini/issue-42

gemini/issue-43

codex/hermes-venv-runner

codex/twitter-archive-orchestration

codex/cleanup-pass-2

codex/cleanup-boundaries

gemini/issue-8

gemini/issue-20

gemini/issue-21

gemini/issue-22

gemini/issue-9

gemini/issue-10

gemini/issue-11

gemini/issue-12

gemini/issue-13

manus/dpo-data-pipeline

feature/dpo-training-pipeline

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: Timmy_Foundation/timmy-config#86