docs: document planned live reasoning token display as future enhancement

The streaming infrastructure already fires reasoning deltas via _fire_reasoning_delta(). The remaining work is the CLI display layer: a dim reasoning box that opens on the first reasoning token, streams live, then transitions to the response box once content tokens start arriving. Reference: PR #1214 (raulvidis) for gateway-side reasoning visibility modes.
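
The display layer described above could be sketched as a small three-phase state machine. This is a minimal illustration under stated assumptions, not the HermesCLI implementation: the class LiveReasoningDisplay, its on_reasoning_delta() / on_content_delta() methods, the "[reasoning]" header, and the render() helper are all hypothetical names. Only the dim attribute (ANSI SGR 2) and the open / stream / transition behavior come from the description. The sketch also assumes that reasoning deltas arriving after content has started are dropped, which the real design may handle differently.

```python
import io
from enum import Enum, auto

DIM = "\033[2m"    # ANSI SGR 2: faint/dim text
RESET = "\033[0m"  # ANSI SGR 0: reset attributes


class Phase(Enum):
    IDLE = auto()       # nothing rendered yet
    REASONING = auto()  # dim reasoning box is open
    RESPONSE = auto()   # response box is active


class LiveReasoningDisplay:
    """Hypothetical display layer; all names here are illustrative."""

    def __init__(self, out, show_reasoning: bool = True) -> None:
        self.out = out
        self.show_reasoning = show_reasoning
        self.phase = Phase.IDLE

    def on_reasoning_delta(self, text: str) -> None:
        # Assumption: reasoning that arrives after content started is dropped.
        if not self.show_reasoning or not text or self.phase is Phase.RESPONSE:
            return
        if self.phase is Phase.IDLE:
            # Open the dim box on the first reasoning token.
            self.out.write(DIM + "[reasoning]\n")
            self.phase = Phase.REASONING
        self.out.write(text)

    def on_content_delta(self, text: str) -> None:
        if not text:
            return
        if self.phase is Phase.REASONING:
            # Close the dim box before the first content token.
            self.out.write("\n" + RESET)
        self.phase = Phase.RESPONSE
        self.out.write(text)


def render(events, show_reasoning: bool = True) -> str:
    """Replay (kind, text) events and return everything written."""
    buf = io.StringIO()
    display = LiveReasoningDisplay(buf, show_reasoning)
    for kind, text in events:
        if kind == "reasoning":
            display.on_reasoning_delta(text)
        else:
            display.on_content_delta(text)
    return buf.getvalue()
```

In this sketch, a real integration would presumably call on_reasoning_delta() from whatever callback _fire_reasoning_delta() invokes, and on_content_delta() from the existing content streaming path; how those hooks are wired is not specified in this commit.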
2026-03-16 10:22:44 -07:00
parent 23b9d88a76
commit d3687d3e81
1 changed file with 9 additions and 0 deletions
--- a/cli.py
+++ b/cli.py
@@ -1413,6 +1413,15 @@ class HermesCLI:
        self._invalidate()

    # ── Streaming display ────────────────────────────────────────────────
+    #
+    # Future: when display.show_reasoning is enabled alongside streaming,
+    # stream reasoning tokens into a dim box above the response (like the
+    # existing static reasoning display, but live). The infrastructure
+    # exists: reasoning deltas fire via _fire_reasoning_delta() during
+    # streaming. The display layer needs a dim reasoning box that opens on
+    # the first reasoning token, accumulates live, then transitions to the
+    # response box when content tokens arrive. See PR #1214 (raulvidis) for
+    # gateway-side reasoning visibility modes as a reference implementation.

    def _stream_delta(self, text: str) -> None:
        """Line-buffered streaming callback for real-time token rendering.