What are the key points?

Google introduces Gemini 3.1 Flash-Lite, an ultra-affordable addition to its efficient model family. New pricing set at $0.25 per million input tokens, significantly lower than Pro version costs. Model features four distinct "thinking levels" to balance computational depth with response speed.

Google Launches Ultra-Low-Cost Gemini 3.1 Flash-Lite

•Google introduces Gemini 3.1 Flash-Lite, an ultra-affordable addition to its efficient model family.
•New pricing set at $0.25 per million input tokens, significantly lower than Pro version costs.
•Model features four distinct "thinking levels" to balance computational depth with response speed.

In a detailed overview provided by technologist Simon Willison, Google has expanded its ecosystem with the release of Gemini 3.1 Flash-Lite, a model designed to prioritize cost-efficiency. This latest iteration is the most economical entry in the Gemini lineup, boasting a price point of just $0.25 per million input tokens and $1.50 per million output tokens.

The Flash-Lite model is approximately one-eighth the cost of the Gemini 3.1 Pro. This dramatic reduction in overhead makes it an ideal candidate for high-volume tasks where budget constraints previously limited AI integration. One of the standout features is the introduction of selectable "thinking levels" that range from minimal to high.

Users can now choose how much computational effort the model exerts on a specific prompt. For simple queries, a "minimal" setting preserves resources, while complex reasoning tasks can be escalated to the "high" level to ensure more accurate outputs.

This control represents a shift toward customizable AI interactions, where the balance between speed, cost, and intelligence is determined by user needs. By lowering the barrier to entry, Google is making advanced language processing more accessible for diverse applications.

In a detailed overview provided by technologist Simon Willison, Google has expanded its ecosystem with the release of Gemini 3.1 Flash-Lite, a model designed to prioritize cost-efficiency. This latest iteration is the most economical entry in the Gemini lineup, boasting a price point of just $0.25 per million input tokens and $1.50 per million output tokens.

The Flash-Lite model is approximately one-eighth the cost of the Gemini 3.1 Pro. This dramatic reduction in overhead makes it an ideal candidate for high-volume tasks where budget constraints previously limited AI integration. One of the standout features is the introduction of selectable "thinking levels" that range from minimal to high.

Users can now choose how much computational effort the model exerts on a specific prompt. For simple queries, a "minimal" setting preserves resources, while complex reasoning tasks can be escalated to the "high" level to ensure more accurate outputs.

This control represents a shift toward customizable AI interactions, where the balance between speed, cost, and intelligence is determined by user needs. By lowering the barrier to entry, Google is making advanced language processing more accessible for diverse applications.

Google Launches Ultra-Low-Cost Gemini 3.1 Flash-Lite

Tags