News
NVIDIA HORIZON: A Hands-Free Agent that Evolves Git Worktrees and Hits 100% RTL Benchmark Completion
2+ hour, 7+ min ago (715+ words) The research team reports 100% completion across every evaluated RTL benchmark suite. It also states plainly that agentic hardware design is not solved. Single-turn code generation has a clear limit on executable design tasks. Plausible Verilog is not enough for real…...
NVIDIA AI Introduces ASPIRE: A Self-Improving Robotics Framework Reaching 31% Zero-Shot on LIBERO-Pro Long Tasks
11+ hour, 31+ min ago (429+ words) A team of researchers from NVIDIA, University of Michigan, UIUC, UC Berkeley, and CMU introduces ASPIRE (Agentic Skill Programming through Iterative Robot Exploration). It is a continual learning system that writes and refines robot control programs. It also distills validated…...
Mistral AI Releases Leanstral 1.5: An Apache-2.0 Lean 4 Code Agent Model Solving 587 of 672 PutnamBench Problems
19+ hour, 51+ min ago (697+ words) Today, Mistral AI released Leanstral 1.5. It is a code agent model built for Lean 4. The release targets automated theorem proving and proof engineering. Weights are open under Apache 2.0. A free API endpoint, leanstral-1-5, is now live. Leanstral 1.5 updates the earlier…...
Meet Web Brain: An Open-Source, Local-First AI Browser Agent That Reads Pages and Automates Tasks in Chrome and Firefox
1+ day, 12+ hour ago (637+ words) Web Brain is a free, open-source browser agent for Chrome and Firefox. It reads pages, extracts data, and automates multi-step tasks. Unlike most browser AI plugins, it can also run entirely on a local model. It is built by Emre…...
Meet Alibaba's Page Agent: A JavaScript In-Page GUI Agent That Controls Web Interfaces With Natural Language Through the DOM
1+ day, 21+ hour ago (525+ words) Most browser automation runs from the outside. Playwright, Puppeteer, Selenium, and browser-use all drive a browser from an external process. They read the page through screenshots or the Chrome DevTools Protocol. Alibaba's Page Agent takes the opposite path. The…...
Google AI Introduces TabFM: A Hybrid-Attention Tabular Foundation Model for Zero-Shot Classification and Regression
3+ day, 10+ hour ago (511+ words) Google Research introduced TabFM, a foundation model built for tabular data. TabFM performs classification and regression without dataset-specific training. Every prediction comes from a single forward pass. The model reframes tabular prediction as an in-context learning problem. It…...
Linq's iMessage Apps Bring Payments, Tickets, Flights, and Games Into the iMessage Bubble Through the imessage_app Part
3+ day, 19+ hour ago (509+ words) The new imessage_app part renders tappable, updatable cards in the bubble, removing the link-and-handoff step. Linq developers can now build iMessage Apps. These are interactive mini-apps that run inside an iMessage conversation. A user can shop, play a game,…...
Anthropic Claude Sonnet 5 vs Sonnet 4.6 vs Opus 4.8: Agentic Coding Benchmarks, API Pricing, and Cost-Performance Tradeoffs Compared
3+ day, 20+ hour ago (503+ words) Anthropic just shipped Claude Sonnet 5. The company calls it its most agentic Sonnet model yet. It plans, drives browsers and terminals, and runs autonomously across long tasks. Sonnet 5 is the default model for Free and Pro plans today. Max, Team, and…...
OpenClaw Releases iOS and Android Companion Node Apps That Connect a Phone to a Self-Hosted AI Agent Gateway
4+ day, 18+ hour ago (215+ words) The open-source assistant pairs phones to a self-hosted Gateway, adding camera, location, voice, and Canvas to its agent. OpenClaw just released native companion apps for iOS and Android. The iOS app is listed as "OpenClaw " AI…...
NVIDIA BioNeMo Agent Toolkit Turns Biomolecular Models Into Callable Skills for AI Agents in Drug Discovery
4+ day, 23+ hour ago (463+ words) AI scientists are becoming a new interface for scientific computing. These agents read papers, write code, generate hypotheses, call APIs, and inspect files. But science is not software engineering. No test suite turns green when a hypothesis is correct. Discovery…...