News
Introducing computer use in Gemini 3. 5 Flash
1+ hour, 53+ min ago (172+ words) Computer use is now a built-in tool in Gemini 3. 5 Flash to build agents that can interact across platforms. Developers and enterprises can start using computer use in 3. 5 Flash via the Gemini API and Gemini Enterprise Agent Platform. 3. 5 Flash uses computer…...
Interactions API: our primary interface for Gemini models and agents
2+ day, 38+ min ago (364+ words) A single unified endpoint for Gemini models and agents with server-side state, background execution, tool combination and multimodal generation. Today we're announcing that the Interactions API has reached general availability and is now our primary API for interacting with Gemini…...
Diffusion Gemma: 4x faster text generation
1+ week, 6+ day ago (324+ words) Our newest open experimental model delivers up to 4x faster inference on dedicated GPUs and opens the door to exploring speed-critical, interactive local workflows. You can improve Diffusion Gemma's performance on specific tasks through fine-tuning. In the example below, Unsloth fine-tuned…...
See what 3 builders are making with Gemma 4
2+ week, 21+ hour ago (137+ words) Search freely using keywords, or by asking a question After 150 million downloads of Gemma 4, a few creations caught our eye. Here's how three builders are using Gemma 4 to push creative boundaries and build new apps. The team at the app…...
Fluid, natural voice translation with Gemini 3. 5 Live Translate
2+ week, 1+ day ago (393+ words) Gemini 3. 5 Live Translate is our latest audio model, delivering near real-time speech-to-speech translation in over 70 languages. Twenty years ago, translation at Google began as one of our pioneering machine learning experiments to turn the science of language into the magic…...
Bringing the latest Gemini models to Apple developers
2+ week, 1+ day ago (209+ words) Search freely using keywords, or by asking a question Apple developers are getting seamless access to Gemini models to build dynamic experiences and smarter apps, faster. Additionally, Gemini in Xcode helps you accelerate multi-step coding tasks without switching windows. If…...
Kaggle is making AI benchmark creation effortless
2+ week, 6+ day ago (412+ words) Now you can build Kaggle Benchmarks in your local development environment, with your coding agents. Developers can write, push, run and download tasks directly from their local environment using the Kaggle CLI and AI coding agents to measure model capabilities…...
Introducing Gemma 4 12 B: a unified, encoder-free multimodal model
3+ week, 1+ hour ago (234+ words) Gemma 4 12 B is designed to bring high-performance multimodal intelligence directly to your laptop, combining mobile-first efficiency with advanced reasoning. Today, we are introducing Gemma 4 12 B, our latest model designed to bring agentic multimodal intelligence directly to laptops. Bridging the gap…...
Take our I/O 2026 quiz, vibe coded in Google AI Studio.
3+ week, 4+ day ago (17+ words) We used Google AI Studio to vibe code a quiz about our top I/O 2026 announcements....
11 demos of Gemini Omni and Gemini 3. 5 in action
3+ week, 4+ day ago (642+ words) With Gemini Omni, Gemini's ability to reason meets the ability to create, while Gemini 3. 5 is built to help you execute complex, agentic workflows. At Google I/O 2026, we announced our latest models: Gemini Omni and the Gemini 3. 5 family of models....