Google AI Launches Veo 3.1 Lite for Low-Cost High-Speed Video

Google has released Veo 3.1 Lite, a new tier in its generative video lineup aimed at the main obstacle to production-scale deployment: price. Visual fidelity in generative video has improved quickly, but the cost per second of generated footage has stayed high, often prohibitively so for developers building high-volume applications. Veo 3.1 Lite is available now through the Gemini API and Google AI Studio for paid-tier users. It matches the generation speed of the existing Veo 3.1 Fast model at roughly half the cost, and Google is positioning it as the default choice for programmatic video generation and iterative prototyping.

On the technical side, the Veo 3.1 family is built on a Diffusion Transformer (DiT) architecture. Earlier generative video models often relied on U-Net-based diffusion, which can struggle with high-dimensional data and long-range temporal dependencies. Veo 3.1 Lite instead uses a transformer backbone that operates on spatio-temporal patches: video frames are treated not as independent 2D images but as a continuous sequence of tokens in a latent space. Self-attention across these patches lets the model relate any region of any frame to any other, which improves temporal consistency. Objects, lighting, and textures stay coherent for the length of the clip, and the artifacts common in earlier models are reduced.
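
To make the idea of spatio-temporal patches concrete, here is a minimal sketch of how a latent video is divided into tokens and why full self-attention becomes expensive as clips grow. The latent shape and patch sizes are hypothetical values chosen for illustration; Google has not published the corresponding figures for Veo 3.1 Lite.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LatentVideo:
    """Shape of a video after encoding into latent space (hypothetical values)."""
    frames: int
    height: int
    width: int


def count_patches(video: LatentVideo, patch_t: int, patch_h: int, patch_w: int) -> int:
    """Count non-overlapping spatio-temporal patches; each patch becomes one token."""
    dims = ((video.frames, patch_t), (video.height, patch_h), (video.width, patch_w))
    for size, patch in dims:
        if size % patch != 0:
            raise ValueError(f"dimension {size} is not divisible by patch size {patch}")
    return (video.frames // patch_t) * (video.height // patch_h) * (video.width // patch_w)


def attention_pairs(num_tokens: int) -> int:
    """Full self-attention scores every token against every token: n * n pairs."""
    return num_tokens * num_tokens


# Assumed latent shape and patch sizes, for illustration only.
latent = LatentVideo(frames=16, height=32, width=32)
tokens = count_patches(latent, patch_t=2, patch_h=2, patch_w=2)
print(tokens, attention_pairs(tokens))  # 2048 4194304
```

Because each patch spans several frames, a single attention layer can connect a region early in the clip to the same region later on, which is the mechanism behind the temporal consistency described above. Doubling the clip length doubles the token count and quadruples the number of attention pairs.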

Veo 3.1 Lite exposes explicit parameters for resolution and duration, which makes it straightforward to integrate into structured workflows. Unlike the flagship Veo 3.1 model, which supports 4K, the Lite version is optimized for high-definition (HD) output. It supports 720p and 1080p, with native support for both landscape (16:9) and portrait (9:16) orientations, and developers can request clips of 4, 6, or 8 seconds. The model is also optimized for what Google calls 'Cinematic Control': it recognizes technical directives in the prompt such as 'pan' and 'tilt,' as well as specific lighting instructions.
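
Since the supported values form a small fixed set, a pipeline can reject invalid requests before spending money on an API call. The sketch below encodes only the constraints listed above; the `VideoRequest` class and its field names are hypothetical and are not part of the Gemini API.

```python
from dataclasses import dataclass

# Values taken from the limits described in the article.
SUPPORTED_RESOLUTIONS = ("720p", "1080p")
SUPPORTED_ASPECT_RATIOS = ("16:9", "9:16")
SUPPORTED_DURATIONS_SECONDS = (4, 6, 8)


@dataclass(frozen=True)
class VideoRequest:
    """Hypothetical local request object, validated on construction."""
    prompt: str
    resolution: str = "720p"
    aspect_ratio: str = "16:9"
    duration_seconds: int = 8

    def __post_init__(self) -> None:
        if not self.prompt.strip():
            raise ValueError("prompt must not be empty")
        if self.resolution not in SUPPORTED_RESOLUTIONS:
            raise ValueError(f"unsupported resolution: {self.resolution}")
        if self.aspect_ratio not in SUPPORTED_ASPECT_RATIOS:
            raise ValueError(f"unsupported aspect ratio: {self.aspect_ratio}")
        if self.duration_seconds not in SUPPORTED_DURATIONS_SECONDS:
            raise ValueError(f"unsupported duration: {self.duration_seconds}")


# A portrait clip with camera and lighting directives written into the prompt.
request = VideoRequest(
    prompt="Slow pan across a rain-soaked street at dusk, warm backlighting",
    resolution="1080p",
    aspect_ratio="9:16",
    duration_seconds=6,
)
print(request)
```

Constructing `VideoRequest(prompt="test", resolution="4K")` raises `ValueError`, reflecting that 4K is reserved for the flagship model.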

The core value proposition of Veo 3.1 Lite is its cost structure. High-quality video inference currently tends to cost several dollars per minute of footage, which is hard to justify for applications such as dynamic ad generation or social media automation. Veo 3.1 Lite is priced at $0.05 per second for 720p and $0.08 per second for 1080p. The model is accessed through the Gemini API, so video generation can be added to existing Python or Node.js applications using standard REST or gRPC calls.
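
Per-second pricing makes batch budgets easy to estimate up front. The following sketch uses the two rates quoted above; the `estimate_cost` helper is illustrative, and the rates should be checked against Google's current pricing page before being relied on.

```python
from decimal import Decimal

# Per-second rates in USD as quoted in the article; verify before use.
PRICE_PER_SECOND_USD = {
    "720p": Decimal("0.05"),
    "1080p": Decimal("0.08"),
}


def estimate_cost(resolution: str, duration_seconds: int, clips: int = 1) -> Decimal:
    """Return the estimated USD cost of generating `clips` videos of equal length."""
    if resolution not in PRICE_PER_SECOND_USD:
        raise ValueError(f"no price listed for resolution: {resolution}")
    if duration_seconds <= 0 or clips <= 0:
        raise ValueError("duration_seconds and clips must be positive")
    return PRICE_PER_SECOND_USD[resolution] * duration_seconds * clips


# One 8-second clip at each resolution, then a 1,000-clip batch at 1080p.
print(estimate_cost("720p", 8))          # 0.40
print(estimate_cost("1080p", 8))         # 0.64
print(estimate_cost("1080p", 8, 1000))   # 640.00
```

`Decimal` is used instead of `float` so that totals for large batches do not accumulate binary rounding error.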

For enterprise developers, a key technical feature is the inclusion of SynthID, a tool from Google DeepMind for watermarking and identifying AI-generated content. It embeds a digital watermark directly into the pixels of the video; the mark is imperceptible to the human eye but detectable by specialized software. Because the watermark is built into the model's output, it serves developers who need to address safety and compliance requirements and to distinguish synthetic media from captured footage.
