Send text over a WebSocket; the server streams the voice back over WebRTC, chunk by chunk.
Server calls the RAG RPC, streams the answer through TTS + LipSync over WebRTC.
Server calls OpenAI directly, streams the answer through TTS + LipSync over WebRTC.
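The three flows above differ only in where the spoken text comes from, so they can be pictured as one pipeline with a pluggable text source. The sketch below is illustrative only: the mode names (`text`, `rag`, `openai`), the function names, and the `send_chunk` callback are hypothetical, the RAG RPC and OpenAI calls are replaced by canned stubs, TTS + LipSync is reduced to a byte encoding, and the first mode is assumed to speak the client's text as sent. No real WebSocket or WebRTC library is used.

```python
import asyncio
from typing import AsyncIterator, Awaitable, Callable, Dict, List


async def passthrough(text: str) -> AsyncIterator[str]:
    # Mode 1 (assumption): the client's text is spoken as-is.
    yield text


async def rag_rpc(text: str) -> AsyncIterator[str]:
    # Mode 2 stub: stands in for the RAG RPC streaming an answer.
    for piece in ("rag", "answer"):
        yield piece


async def openai_direct(text: str) -> AsyncIterator[str]:
    # Mode 3 stub: stands in for a direct OpenAI call streaming an answer.
    for piece in ("llm", "answer"):
        yield piece


# Hypothetical mode names mapped to their text sources.
SOURCES: Dict[str, Callable[[str], AsyncIterator[str]]] = {
    "text": passthrough,
    "rag": rag_rpc,
    "openai": openai_direct,
}


def tts_lipsync(piece: str) -> bytes:
    # Stub for TTS + LipSync: a real server would emit audio and viseme data.
    return piece.encode("utf-8")


async def handle_text(
    mode: str,
    text: str,
    send_chunk: Callable[[bytes], Awaitable[None]],
) -> int:
    """Push each piece of text through the stubbed TTS and send it at once."""
    sent = 0
    async for piece in SOURCES[mode](text):
        # Each chunk is sent as soon as it is ready, not after the full answer.
        await send_chunk(tts_lipsync(piece))
        sent += 1
    return sent


def run_demo(mode: str, text: str) -> List[bytes]:
    """Collect the chunks that would have gone out over the WebRTC track."""
    out: List[bytes] = []

    async def send_chunk(chunk: bytes) -> None:
        out.append(chunk)

    asyncio.run(handle_text(mode, text, send_chunk))
    return out
```

Calling `run_demo("rag", "What is WebRTC?")` collects one chunk per piece produced by the stubbed source. In a real server, `send_chunk` would write to the WebRTC media track and `text` would arrive on the WebSocket.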