You may be interested in gemini-2.5-flash-preview-ttsText in, audio out, so you can merge in a single step LLM+TTS (streamable)https://ai.google.dev/gemini-api/docs/models/gemini-2.5-flas...