Token-by-token streaming makes tool-call detection fragile in the client
When streaming, the client tries to detect whether the model is producing a tool call vs. a regular text response by watching for the tool-call marker. Sometimes the marker arrives split across two tokens and the client's regex misses it, rendering a broken UI state.