Google DeepMind is now available Gemini Robotics On-Device, a compact, local version of its powerful vision-language-action (VLA) model, bringing advanced robotic intelligence directly onto devices. The Gemini family of models represents a significant step forward for embodied AI. This is because it does not require continuous cloud connectivity and maintains flexibility, generality and precision.
Local AI to Realize Robotic Dexterity
Due to memory and computational constraints, VLAs with high capacity have traditionally relied on the cloud for processing. With Gemini Robotics On-Device, DeepMind introduces an architecture that operates entirely on local GPUs embedded within robots, supporting latency-sensitive and bandwidth-constrained scenarios like homes, hospitals, and manufacturing floors.
On-device models retain the strengths of Gemini Robotics, including the ability to perceive and understand multimodal (visual-textual) input and perform real-time actions. It also has a high sample efficiency, only requiring 50 to 100 demos in order to generalize new abilities.
Gemini Robotics On the Device Features
- Fully local execution: Model runs on robot GPU onboard, providing closed loop control without internet dependence.
- Two-Handed Dexterity: Thanks to the ALOHA dataset, it can perform complex bimanual tasks.
- Multi-Embodiment Compatibility: Although the model was trained using specific robots it can be used on other platforms, including industrial dual arm manipulators or humanoids.
- Few-Shot Adaptation: Model supports learning novel tasks quickly from just a few demonstrations. This dramatically reduces development time.

Capabilities of Real World and Application
For dexterous tasks like folding clothing, opening jars, and assembling parts, fine motor control is required. Real-time feedback must also be integrated. Gemini Robotics On Device provides these capabilities, while also reducing communication delays and increasing responsiveness. This is critical, especially for edge deployments when connectivity can be unpredictable or where data privacy may be a problem.
Possible applications include
- Automated home assistance capable of doing daily chores.
- Health robots to assist in eldercare or rehabilitation.
- Automation of industrial processes requires adaptable assembly-line workers.
SDK Integration and MuJoCo for Developers
DeepMind also released an a Gemini Robotics SDK The SDK includes tools to integrate, test and fine-tune the model on device into custom workflows. SDK includes:
- Pipelines for training specific to tasks.
- Compatible with most robots and cameras
- Assessment within the MuJoCo The physics simulation has new benchmarks that are specifically for bimanual tasks.
Gemini Robotics On Device offers a flexible, modular solution to robotics developers and researchers.
Gemini Robotics, the Future of On-Device Embodied AI
Gemini Robotics has focused its broader initiative on unifying perception and reasoning in physical environments. The release of this on-device software bridges the divide between AI foundational research and autonomous systems in the real world.
Although large VLA models such as Gemini 1.5 are capable of generalizing across multiple modalities with impressive accuracy, the inference latencies and dependency on cloud computing have restricted their application in robotics. This on-device release addresses these issues with optimized computation graphs, model compressing, and task specific architectures that are tailored to embedded GPUs.
Wider Implications of Robotics and Artificial Intelligence Deployment
Gemini Robotics On Device paves way for robotics that is scalable and privacy-preserving by decoupling AI-powered models from cloud. This is in line with the growing trend of edge AI where computing workloads are moved closer to data sources. The robot agents are able to work in environments where latency and privacy is a concern. This improves their safety, responsiveness and overall performance.
As DeepMind continues to broaden access to its robotics stack—including opening up its simulation platform and releasing benchmarks—researchers worldwide are now better equipped to experiment, iterate, and build reliable, real-time robotic systems.
Click here to find out more Paper The following are some examples of how to get started: Technical details. This research is the work of researchers. Also, feel free to follow us on Twitter Don’t forget about our 100k+ ML SubReddit Subscribe Now our Newsletter.
Asif Razzaq serves as the CEO at Marktechpost Media Inc. As an entrepreneur, Asif has a passion for harnessing Artificial Intelligence’s potential to benefit society. Marktechpost was his most recent venture. This platform, which specializes in covering machine learning and deep-learning news, is both technically solid and understandable to a broad audience. Over 2 million views per month are a testament to the platform’s popularity.


