Compare gemini-3.1-flash-live-preview with other models

Select another model to compare pricing, limits, and capabilities with gemini-3.1-flash-live-preview.

Models

gemini-3.1-flash-live-preview

gemini

Context Length

131K

Max Output

66K

Input Cost

$0.75/M

Output Cost

$4.50/M

Mode

Chat

Max Input Tokens

131K

Max Tokens

66K

Supported Endpoints

/v1/realtime

Provider

Google Gemini

[ WE'RE OPEN SOURCE ]

Scale with the Fastest LLM Gateway

Built for enterprise-grade reliability, governance, and scale. Deploy in seconds.

or, Get started here

Comparison Insights

Comprehensive analysis based on the latest model metadata from the comparison table above.

What should I know about gemini-3.1-flash-live-preview?

Overview

gemini-3.1-flash-live-preview is a chat model provided by Google Gemini.
With a context window of 131K tokens, this model can handle substantial inputs such as detailed documents or extended conversation histories.

Pricing

Input processing costs $0.75 per million tokens.
Output generation costs $4.50 per million tokens.

Output Capabilities

The model can generate up to 66K tokens in a single response.

Availability

Available through the following endpoints: /v1/realtime.

What capabilities does gemini-3.1-flash-live-preview support?

Supports function calling, enabling integration with external tools and APIs for extended functionality.
Includes vision capabilities to process and analyze images alongside text inputs.
Provides web search integration for accessing real-time information and current data.
Accepts audio input, allowing for voice-based interactions and audio processing.
Generates audio output for text-to-speech and voice response applications.

gemini-3.1-flash-live-preview Pricing Overview

At $0.75 per 1M input tokens and $4.50 per 1M output tokens, gemini-3.1-flash-live-preview ranks 1561 out of 2637 chat models by input cost. It is more expensive compared to the median of $0.50 for chat models, and is cheaper than 41% of models in this category.