Open
Description
With the implementation of multimodal capabilities (#71), it might also be worthwhile to consider integrating multimodal real-time capabilities, such as those offered by existing APIs like Gemini 2.0's Multimodal Live API or OpenAI's Realtime API.