Introducing Gemini 3.1 Flash-Lite for Scalable Intelligence
Google has unveiled the new Gemini 3.1 Flash-Lite model, which is the fastest and most cost-effective in the Gemini 3 series. It is aimed at developers working with high volumes of data.
Starting today, Gemini 3.1 Flash-Lite is available for developers in preview mode through the Gemini API in Google AI Studio, as well as for enterprises via Vertex AI.
The model offers an excellent price-to-performance ratio, costing just $0.25 per 1 million input tokens and $1.50 per 1 million output tokens. Gemini 3.1 Flash-Lite demonstrates enhanced performance at significantly lower costs compared to larger models.
According to Artificial Analysis, the new model is 2.5 times faster in providing the first response and increases output speed by 45% compared to the 2.5 Flash model, while maintaining comparable or even better quality.
The low latency makes Gemini 3.1 Flash-Lite ideal for high-frequency workflows, enabling developers to create responsive and realistic user interfaces.
Celebrating 10 Years of AlphaGo's Impact on Artificial Intelligence
Accelerating Discoveries in India with AI in Science and Education
Похожие статьи
Compare Image Editing Models for Optimal Choice
Compare various image editing models and choose the best one for your needs.
Create Music with Lyria 3, Our Newest Generation Model
Discover the new music generation model Lyria 3 from Google, available for developers.
LL COOL J and James Manyika Discuss AI and Music
LL COOL J and James Manyika discuss how AI impacts music and creativity.