Gemini 3.1 Flash-Lite
Google's most cost-efficient Gemini 3 model, with a 1M-token context window, multimodal input, and 2.5x faster time-to-first-token than Gemini 2.5 Flash
Google • March 2026
Training Data: Up to early 2025
Technical Specifications
Parameters: Not disclosed
Training Method: Multimodal pre-training with RLHF
Context Window: 1,000,000 tokens
Knowledge Cutoff: January 2025
Key Features
1M Context Window • Multimodal Input • Extended Thinking
Capabilities
Speed: Outstanding
Cost Efficiency: Outstanding
Multimodal: Good
What's New in This Version
2.5x faster time-to-first-token and 45% faster output generation than Gemini 2.5 Flash, at the lowest cost in the Gemini 3 family
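To make the two speed figures concrete, the sketch below estimates end-to-end response latency under stated assumptions. It reads "2.5x faster time-to-first-token" as dividing the baseline TTFT by 2.5, and "45% faster output generation" as multiplying baseline throughput by 1.45. Both readings are assumptions, since the page does not define them. The baseline numbers (1.0 s TTFT, 100 tokens/s, 500 output tokens) are hypothetical placeholders, not measurements of Gemini 2.5 Flash, and the function names are illustrative only.

```python
def estimate_latency(ttft_s: float, tokens_per_s: float, n_tokens: int) -> float:
    """Total response time: wait for the first token, then stream the rest."""
    return ttft_s + n_tokens / tokens_per_s


def apply_claimed_speedups(
    baseline_ttft_s: float,
    baseline_tokens_per_s: float,
    ttft_speedup: float = 2.5,     # assumed reading of "2.5x faster TTFT"
    output_speedup: float = 1.45,  # assumed reading of "45% faster output"
) -> tuple[float, float]:
    """Return (new_ttft_s, new_tokens_per_s) under the assumed interpretation."""
    return baseline_ttft_s / ttft_speedup, baseline_tokens_per_s * output_speedup


if __name__ == "__main__":
    # Hypothetical baseline values, chosen only to illustrate the arithmetic.
    base_ttft, base_tps, n_tokens = 1.0, 100.0, 500

    new_ttft, new_tps = apply_claimed_speedups(base_ttft, base_tps)
    before = estimate_latency(base_ttft, base_tps, n_tokens)
    after = estimate_latency(new_ttft, new_tps, n_tokens)

    print(f"baseline: {before:.2f} s, with claimed speedups: {after:.2f} s")
```

Under this reading, the TTFT improvement dominates for short responses, while the throughput improvement dominates as the output grows longer.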
Other Google Models
Gemini 3.5 Flash
Google's frontier Flash model built for agentic workflows, coding, and long-horizon tasks at high speed
Gemini 3.1 Pro
Google's newest flagship with doubled reasoning performance and 1M token context for agentic workflows
Gemini 3 Pro
Google's latest flagship model with advanced multimodal capabilities and PhD-level reasoning