Gemini Robotics-ER 1.6 enhances robotic perception capabilities
For robots to be truly helpful in our daily lives and industries, they must do more than follow instructions; they must reason about the physical world. A robot's ability for 'embodied reasoning' allows it to bridge the gap between digital intelligence and physical action. Today, we are introducing Gemini Robotics-ER 1.6, a significant upgrade to our reasoning-first model that enables robots to understand their environments with unprecedented precision.
By enhancing spatial reasoning and multi-view understanding, we are bringing a new level of autonomy to the next generation of physical agents. This model specializes in reasoning capabilities critical for robotics, including visual and spatial understanding, task planning, and success detection. It acts as the high-level reasoning model for a robot, capable of executing tasks by natively calling tools like Google Search to find information, vision-language-action models (VLAs), or any other third-party user-defined functions.
Gemini Robotics-ER 1.6 shows significant improvement over both Gemini Robotics-ER 1.5 and Gemini 3.0 Flash, specifically enhancing spatial and physical reasoning capabilities such as pointing, counting, and success detection. We are also unlocking a new capability: instrument reading, enabling robots to read complex gauges and sight glasses — a use case we discovered through close collaboration with our partner, Boston Dynamics.
Starting today, Gemini Robotics-ER 1.6 is available to developers via the Gemini API and Google AI Studio. To help you get started, we are sharing a developer Colab containing examples of how to configure the model and prompt it for embodied reasoning tasks.
NVIDIA Ising Launches AI Models for Low-Error Quantum Systems
Microsoft launches MAI-Image-2-Efficient, a new AI image model
Related articles
Drones become smarter for large agricultural holdings
GEODASH Aerosystems develops smart drones for agriculture.
Google DeepMind Introduces Gemini Robotics-ER 1.6 with Enhanced Reasoning
Google DeepMind announces the update of Gemini Robotics-ER 1.6 with new features.
Max Hodak prepares for first human trials of brain-computer interface
Science Corporation is gearing up for the first human trials of its brain-computer interface.