[222-epic] Confidence as visible trait — Timmy shows his uncertainty #246

Closed
opened 2026-03-15 18:35:03 +00:00 by hermes · 1 comment
Collaborator

Epic: #222 — The Workshop: Timmy as Presence

What

SOUL.md: "When I am uncertain, I must say so in proportion to my uncertainty."

PR #232 wired estimate_confidence() into the chat flow. But it's logger.debug() only. The user never sees it. This is compliance theater.

What "visible" means

  • In text: Timmy uses hedging language naturally when confidence is low. "I think..." vs "I know..."
  • In the Workshop: visual cues. Timmy's posture, animation, ambient lighting could reflect certainty. Confident = upright, clear. Uncertain = chin-scratch, dimmer.
  • In the bridge protocol: confidence score sent alongside every bark/response so the 3D world can react.
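For the bridge protocol, a minimal sketch of what carrying the score alongside each bark could look like; the envelope shape and field names here are purely hypothetical, not the real protocol:

```python
import json

def bridge_message(text: str, confidence: float) -> str:
    """Serialize a bark/response with its confidence score so the
    3D Workshop can pick posture, animation, and lighting from it.
    The message shape is a hypothetical illustration."""
    return json.dumps({
        "type": "bark",
        "text": text,
        "confidence": round(confidence, 2),  # 0.0 (uncertain) .. 1.0 (confident)
    })
```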

The real fix

The confidence module shouldn't just score AFTER the response. It should influence the PROMPT. If Timmy's memory retrieval on a topic is thin, the prompt should say "you have low context on this — be honest about that." The model then naturally hedges instead of confabulating.

Depends on: #222, #231 (confidence module)

kimi added the 222-epic, actionable labels 2026-03-18 20:54:25 +00:00
Author
Collaborator

Kimi Instructions for #246 — Confidence as Visible Trait

What

Inject confidence context into Timmy's system prompt so the model naturally hedges when uncertain, rather than just appending [confidence: 65%] after the response.

Files to Modify

  1. src/timmy/prompts.py — add confidence context to the system prompt builder
  2. src/timmy/session.py — pass confidence context before the LLM call, not just after
  3. tests/timmy/test_session.py — add tests for prompt injection

Implementation

In prompts.py:
Add a function confidence_context(topic: str, confidence: float) -> str that returns a natural-language instruction like:

  • confidence > 0.8: (empty string — no injection needed)
  • 0.5 < confidence <= 0.8: 'You have moderate familiarity with this topic. Be honest about the limits of your knowledge.'
  • confidence <= 0.5: 'You have low context on this topic. Say so plainly. Do not confabulate. It is better to say "I am not sure" than to guess.'
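A minimal sketch of that function under the thresholds above; the `topic` argument is part of the proposed signature, though the generic spec strings here do not interpolate it:

```python
def confidence_context(topic: str, confidence: float) -> str:
    """Map a confidence score to a hedging instruction for the system prompt.

    Thresholds and wording follow the spec above; `topic` is accepted
    for future, more specific phrasing.
    """
    if confidence > 0.8:
        return ""  # confident: no injection needed
    if confidence > 0.5:
        return (
            "You have moderate familiarity with this topic. "
            "Be honest about the limits of your knowledge."
        )
    return (
        "You have low context on this topic. Say so plainly. "
        'Do not confabulate. It is better to say "I am not sure" than to guess.'
    )
```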

In session.py:
Before calling the LLM (in process_message and similar functions):

  1. Use the cognitive_state's last confidence or call estimate_confidence on recent memory retrieval
  2. Call confidence_context() and prepend it to the system prompt if non-empty
  3. KEEP the existing post-response [confidence: N%] for low scores — that's fine as a secondary signal
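Wired together, the before-call flow could look like this sketch; the function and parameter names are hypothetical stand-ins for the real session code, passed in as callables to keep the sketch self-contained:

```python
from typing import Callable

def respond(
    user_msg: str,
    base_prompt: str,
    estimate_confidence: Callable[[str], float],
    confidence_context: Callable[[str, float], str],
    call_llm: Callable[[str, str], str],
) -> str:
    """Score confidence BEFORE the LLM call, inject the hedging context
    into the system prompt, and keep the post-response tag for low
    scores as a secondary signal."""
    confidence = estimate_confidence(user_msg)        # step 1: score first
    ctx = confidence_context(user_msg, confidence)    # step 2: build context
    prompt = f"{ctx}\n\n{base_prompt}" if ctx else base_prompt
    reply = call_llm(prompt, user_msg)
    if confidence <= 0.5:                             # step 3: secondary signal
        reply += f" [confidence: {int(confidence * 100)}%]"
    return reply
```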

Key constraint

The confidence injection must happen BEFORE the LLM call, not after. The whole point is to influence the model's hedging behavior at generation time.

Tests

  • Test confidence_context() returns correct strings at each threshold
  • Test that low confidence injects context into the prompt (mock the LLM call, verify prompt contains the context)
  • Test that high confidence does NOT inject context
  • Test the existing post-response tag still works
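A pytest-style sketch of the prompt-injection checks; the prompt builder here is a minimal stand-in for the session code, not the real API:

```python
from unittest.mock import MagicMock

def build_prompt(base: str, ctx: str) -> str:
    # minimal stand-in for the session's prompt builder
    return f"{ctx}\n\n{base}" if ctx else base

def test_low_confidence_injects_context():
    llm = MagicMock(return_value="I am not sure.")
    ctx = "You have low context on this topic."
    llm(build_prompt("You are Timmy.", ctx), "hello")
    assert ctx in llm.call_args[0][0]  # prompt carries the injected context

def test_high_confidence_does_not_inject():
    llm = MagicMock(return_value="I know.")
    llm(build_prompt("You are Timmy.", ""), "hello")
    assert "low context" not in llm.call_args[0][0]
```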

Run Tests

tox -e unit -- tests/timmy/test_session.py -v

Do NOT

  • Change the confidence scoring algorithm
  • Remove the existing [confidence: N%] tag
  • Modify the Familiar or Workshop state modules
kimi was assigned by hermes 2026-03-19 01:55:17 +00:00
kimi was unassigned by Timmy 2026-03-19 14:09:19 +00:00
gemini was assigned by Rockachopa 2026-03-22 23:36:47 +00:00
claude added the harness, p1-important labels 2026-03-23 13:55:48 +00:00
gemini was unassigned by Timmy 2026-03-24 19:34:39 +00:00
Timmy closed this issue 2026-03-24 21:55:29 +00:00

Reference: Rockachopa/Timmy-time-dashboard#246