News

MarkTechPost
marktechpost.com > 07/04/2026 > nvidia-horizon-a-hands-free-agent-that-evolves-git-worktrees-and-hits-100-rtl-benchmark-completion

NVIDIA HORIZON: A Hands-Free Agent that Evolves Git Worktrees and Hits 100% RTL Benchmark Completion

2+ hour, 7+ min ago  (715+ words) The research team reports 100% completion across every evaluated RTL benchmark suite. It also states plainly that agentic hardware design is not solved. Single-turn code generation has a clear limit on executable design tasks. Plausible Verilog is not enough for real…...

MarkTechPost
marktechpost.com > 07/03/2026 > nvidia-ai-introduces-aspire-a-self-improving-robotics-framework-reaching-31-zero-shot-on-libero-pro-long-tasks

NVIDIA AI Introduces ASPIRE: A Self-Improving Robotics Framework Reaching 31% Zero-Shot on LIBERO-Pro Long Tasks

11+ hour, 31+ min ago  (429+ words) A team of researchers from NVIDIA, University of Michigan, UIUC, UC Berkeley, and CMU introduces ASPIRE (Agentic Skill Programming through Iterative Robot Exploration). It is a continual learning system that writes and refines robot control programs. It also distills validated…...

MarkTechPost
marktechpost.com > 07/03/2026 > mistral-ai-releases-leanstral-1-5-an-apache-2-0-lean-4-code-agent-model-solving-587-of-672-putnambench-problems

Mistral AI Releases Leanstral 1.5: An Apache-2.0 Lean 4 Code Agent Model Solving 587 of 672 PutnamBench Problems

19+ hour, 51+ min ago  (697+ words) Today, Mistral AI released Leanstral 1.5. It is a code agent model built for Lean 4. The release targets automated theorem proving and proof engineering. Weights are open under Apache 2.0. A free API endpoint, leanstral-1-5, is now live. Leanstral 1.5 updates the earlier…...

MarkTechPost
marktechpost.com > 07/02/2026 > meet-webbrain-an-open-source-local-first-ai-browser-agent-that-reads-pages-and-automates-tasks-in-chrome-and-firefox

Meet WebBrain: An Open-Source, Local-First AI Browser Agent That Reads Pages and Automates Tasks in Chrome and Firefox

1+ day, 12+ hour ago  (637+ words) WebBrain is a free, open-source browser agent for Chrome and Firefox. It reads pages, extracts data, and automates multi-step tasks. Unlike most browser AI plugins, it can also run entirely on a local model. It is built by Emre…...

MarkTechPost
marktechpost.com > 07/02/2026 > meet-alibabas-page-agent-a-javascript-in-page-gui-agent-that-controls-web-interfaces-with-natural-language-through-the-dom

Meet Alibaba's Page Agent: A JavaScript In-Page GUI Agent That Controls Web Interfaces With Natural Language Through the DOM

1+ day, 21+ hour ago  (525+ words) Most browser automation runs from the outside. Playwright, Puppeteer, Selenium, and browser-use all drive a browser from an external process. They read the page through screenshots or the Chrome DevTools Protocol. Alibaba's Page Agent takes the opposite path. The…...

MarkTechPost
marktechpost.com > 07/01/2026 > google-ai-introduces-tabfm-a-hybrid-attention-tabular-foundation-model-for-zero-shot-classification-and-regression

Google AI Introduces TabFM: A Hybrid-Attention Tabular Foundation Model for Zero-Shot Classification and Regression

3+ day, 10+ hour ago  (511+ words) Google Research introduced TabFM, a foundation model built for tabular data. TabFM performs classification and regression without dataset-specific training. Every prediction comes from a single forward pass. The model reframes tabular prediction as an in-context learning problem. It…...

MarkTechPost
marktechpost.com > 06/30/2026 > linqs-imessage-apps

Linq's iMessage Apps Bring Payments, Tickets, Flights, and Games Into the iMessage Bubble Through the imessage_app Part

3+ day, 19+ hour ago  (509+ words) The new imessage_app part renders tappable, updatable cards in the bubble, removing the link-and-handoff step. Linq developers can now build iMessage Apps. These are interactive mini-apps that run inside an iMessage conversation. A user can shop, play a game,…...

MarkTechPost
marktechpost.com > 06/30/2026 > anthropic-claude-sonnet-5-vs-sonnet-4-6-vs-opus-4-8-agentic-coding-benchmarks-api-pricing-and-cost-performance-tradeoffs-compared

Anthropic Claude Sonnet 5 vs Sonnet 4.6 vs Opus 4.8: Agentic Coding Benchmarks, API Pricing, and Cost-Performance Tradeoffs Compared

3+ day, 20+ hour ago  (503+ words) Anthropic just shipped Claude Sonnet 5. The company calls it its most agentic Sonnet model yet. It plans, drives browsers and terminals, and runs autonomously across long tasks. Sonnet 5 is the default model for Free and Pro plans today. Max, Team, and…...

MarkTechPost
marktechpost.com > 06/29/2026 > openclaw-releases-ios-and-android-companion-node-apps-that-connect-a-phone-to-a-self-hosted-ai-agent-gateway

OpenClaw Releases iOS and Android Companion Node Apps That Connect a Phone to a Self-Hosted AI Agent Gateway

4+ day, 18+ hour ago  (215+ words) The open-source assistant pairs phones to a self-hosted Gateway, adding camera, location, voice, and Canvas to its agent. OpenClaw just released native companion apps for iOS and Android. The iOS app is listed as "OpenClaw " AI…...

MarkTechPost
marktechpost.com > 06/29/2026 > nvidia-bionemo-agent-toolkit-turns-biomolecular-models-into-callable-skills-for-ai-agents-in-drug-discovery

NVIDIA BioNeMo Agent Toolkit Turns Biomolecular Models Into Callable Skills for AI Agents in Drug Discovery

4+ day, 23+ hour ago  (463+ words) AI scientists are becoming a new interface for scientific computing. These agents read papers, write code, generate hypotheses, call APIs, and inspect files. But science is not software engineering. No test suite turns green when a hypothesis is correct. Discovery…...
