LATEST MODEL

Gemini 3.1 Flash-Lite

Google ⚡ The Runner Released March 2026

Google's most cost-efficient Gemini 3 model with 1M context, multimodal input, and 2.5x faster time-to-first-token than Gemini 2.5 Flash

Gemini 3.1 Flash-Lite

Google • March 2026

Latest

Training Data

Up to early 2025

Gemini 3.1 Flash-Lite

March 2026

Parameters

Not disclosed

Training Method

Multimodal pre-training with RLHF

Context Window

1,000,000 tokens

Knowledge Cutoff

January 2025

Key Features

1M Context Window • Multimodal Input • Extended Thinking

Capabilities

Speed: Outstanding

Cost Efficiency: Outstanding

Multimodal: Good

What's New in This Version

2.5x faster time-to-first-token and 45% faster output generation than Gemini 2.5 Flash at the lowest cost in the Gemini 3 family

Google's most cost-efficient Gemini 3 model with 1M context, multimodal input, and 2.5x faster time-to-first-token than Gemini 2.5 Flash

What's New in This Version

2.5x faster time-to-first-token and 45% faster output generation than Gemini 2.5 Flash at the lowest cost in the Gemini 3 family

Technical Specifications

Parameters Not disclosed

Context Window 1,000,000 tokens

Training Method Multimodal pre-training with RLHF

Knowledge Cutoff January 2025

Training Data Up to early 2025

Key Features

1M Context Window Multimodal Input Extended Thinking

Capabilities

Speed: Outstanding

Cost Efficiency: Outstanding

Multimodal: Good

Other Google Models

Explore more models from Google

Official Documentation Compare with Other Models View Timeline All Google Models

Gemini 3.1 Flash-Lite

Gemini 3.1 Flash-Lite

Training Data

Parameters

Training Method

Context Window

Knowledge Cutoff

Key Features

Capabilities

What's New in This Version

What's New in This Version

Technical Specifications

Key Features

Capabilities

Other Google Models

Gemini 3.5 Flash

Gemini 3.1 Pro

Gemini 3 Pro