docs: document planned live reasoning token display as future enhancement

The streaming infrastructure already fires reasoning deltas via
_fire_reasoning_delta() during streaming. The remaining work is the
CLI display layer: a dim reasoning box that opens on first reasoning
token, streams live, then transitions to the response box.

Reference: PR #1214 (raulvidis) for gateway reasoning visibility.
This commit is contained in:
teknium1
2026-03-16 10:22:44 -07:00
parent 23b9d88a76
commit d3687d3e81

9
cli.py
View File

@@ -1413,6 +1413,15 @@ class HermesCLI:
self._invalidate()
# ── Streaming display ────────────────────────────────────────────────
#
# Future: When display.show_reasoning is also enabled, stream reasoning
# tokens into a dim box above the response (like the existing static
# reasoning display, but live). The infrastructure exists — reasoning
# deltas fire via _fire_reasoning_delta() during streaming. The display
# layer needs: a dim reasoning box that opens on first reasoning token,
# accumulates live, then transitions to the response box when content
# tokens start arriving. See PR #1214 (raulvidis) for gateway-side
# reasoning visibility modes as a reference implementation.
def _stream_delta(self, text: str) -> None:
"""Line-buffered streaming callback for real-time token rendering.