News
How the community trained Gemma to "Think" with Tunix and TPUs
1+ hour, 3+ min ago (208+ words) G-Ra R trains Gemma models to produce structured reasoning by combining Supervised Fine-Tuning (SFT) with GRPO, driven by a novel rubric-based LLM-as-judge reward system. Evolving a 1 B parameter model into a structured reasoning engine ("Pinocchio") via a highly efficient, 9-hour…...
Enhancing Android Checkout with Dynamic Callbacks in Google Pay
2+ day, 18+ min ago (184+ words) Why use Dynamic Callbacks? Dynamic callbacks enable a true "Express Checkout" experience. By moving the Google Pay button upstream to your Product Detail or Cart pages, you can provide the user's shipping address, payment credentials, and contact details all within…...
Empowering Service Providers and Hardware Partners with Gemini for Home - Google Developers Blog
6+ day, 23+ hour ago (373+ words) At last year's Google I/O, we introduced the Gemini era for Google Home, moving beyond basic device control toward a home that acts proactively on your behalf. Today, we're excited to take the next step: making Gemini for Home…...
Announcing ADK for Kotlin and ADK for Android 0. 1. 0: Building AI Agents on Android and Beyond- Google Developers Blog
1+ week, 12+ min ago (342+ words) ADK for Kotlin brings agentic workflows to your backend projects, while ADK for Android provides specialized on-device optimizations With just a few lines of Kotlin, you can: During our I/O session, we showcased how ADK for Kotlin powers an…...
A Smarter Google AI Edge Gallery: MCP integration, notifications, and session continuity
1+ week, 1+ day ago (497+ words) Google AI Edge Gallery now supports the Model Context Protocol (MCP), notifications reminders, and persistent chat history'providing developers with a showcase to build connected, automated, on-device agentic experiences. This architecture allows the mobile device to coordinate complex tasks across various…...
Blazing fast on-device Gen AI with Lite RT-LM
1+ week, 1+ day ago (403+ words) One of the most significant performance milestones in the Lite RT-LM pipeline is our native support for the Multi-Token Prediction (MTP) drafters recently launched with the Gemma 4 model family. By integrating this specialized speculative decoding architecture, Lite RT-LM bypasses traditional…...
One Year of Innovation: Celebrating 100k Members in the Google Cloud x NVIDIA Developer Community
1+ week, 1+ day ago (239+ words) Last year at Google I/O "25, we launched the Google Cloud and NVIDIA developer community to bridge the gap between cutting-edge AI infrastructure and the builders shaping the future. Today, at Google I/O "26, we're celebrating 100, 000 members and we want…...
Announcing Genkit Middleware: Intercept, extend, and harden your agentic apps
1+ week, 6+ day ago (480+ words) Genkit solves this with middleware: composable hooks that intercept generation calls, including the tool execution loop, and inject custom behaviors. The middleware system is available today in Type Script, Go, and Dart, with Python support coming soon. Automatically retries failed…...
Accelerating on-device AI: A look at Arm and Google AI Edge optimization
1+ week, 6+ day ago (764+ words) Arm Scalable Matrix Extension 2 (SME2) eliminates this tradeoff by integrating a dedicated matrix-compute unit directly into the CPU cluster. This architecture enables the CPU to function as a high-performance AI accelerator, delivering up to 5x faster inference for the matrix-heavy workloads at…...
Build Long-running AI agents that pause, resume, and never lose context with ADK - Google Developers Blog
2+ week, 1+ day ago (1285+ words) Most agent tutorials end at a stateless chatbot " a conversational loop that forgets everything the moment the container restarts. Real enterprise workflows don't wrap up in a single API call. HR onboarding spans two weeks. Invoice disputes stall for days…...