Gemini 3.1 Flash Live: Google's latest audio model with improved benchmarks and watermarking

✍️ OpenClawRadar📅 Published: March 26, 2026🔗 Source
Gemini 3.1 Flash Live: Google's latest audio model with improved benchmarks and watermarking
Ad

What's new in Gemini 3.1 Flash Live

Google has released Gemini 3.1 Flash Live, their highest-quality audio and voice model designed for real-time dialogue. The model delivers improved speed and natural rhythm for voice-first AI applications.

Key technical details

  • Benchmark scores: 90.8% on ComplexFuncBench Audio (multi-step function calling with constraints) and 36.1% on Scale AI's Audio MultiChallenge (complex instruction following with "thinking" on)
  • Improved capabilities: Better tonal understanding, recognition of acoustic nuances like pitch and pace, and dynamic adjustment to user frustration or confusion
  • Watermarking: All audio generated includes SynthID watermark for AI content detection
  • Multilingual support: Available in over 200 countries and territories
Ad

Availability and access

  • For developers: Available in preview via Gemini Live API in Google AI Studio
  • For enterprises: Included in Gemini Enterprise for Customer Experience
  • For general users: Accessible via Search Live and Gemini Live

The model enables building voice-ready agents that handle complex tasks in noisy environments and supports longer conversation threads during extended interactions.

📖 Read the full source: HN AI Agents

Ad

👀 See Also