agent
-
AI2’s MolmoWeb Outscores GPT-4o on Web Tasks — With Just 8 Billion Parameters
The web agent race has a new open-source contender, and the benchmarks are hard to ignore. On March 24, the Allen Institute for AI (AI2) released MolmoWeb, a fully open-source visual web agent that navigates browsers by looking at screenshots, the same way a human would. The kicker: its 8B-parameter model outperforms agents built…
-
A 20-Year-Old Dropout Built Supermemory — Now It Has 18K GitHub Stars and Google’s Jeff Dean as an Investor
Every AI agent today has the same problem: amnesia. End the conversation, and the context vanishes. Start a new session, and you’re re-explaining everything from scratch. Supermemory is a bet that persistent, time-aware memory will become as essential to AI infrastructure as databases are to web apps, and the bet is attracting serious attention.…
-
Hypura Runs a 31GB Model on a 32GB Mac at 2.2 tok/s — llama.cpp Just OOMs
There’s a frustrating ceiling that every Apple Silicon user running local LLMs hits eventually: your model is slightly too big for your RAM, and everything falls apart. llama.cpp crashes. MLX refuses to load it. The OS starts swapping so aggressively that your entire machine grinds to a halt. You either buy a more expensive Mac…
-
Google TurboQuant Squeezes LLM Cache to 3 Bits — 6x Less Memory, 8x Faster, Zero Accuracy Loss
Every large language model running today has the same dirty secret: the longer the conversation goes, the more memory the Key-Value cache eats. For models like Gemini handling 100k+ token contexts, the KV cache can balloon to consume more memory than the model weights themselves. Google Research just published a direct answer to this problem.…
-
Databricks’ $134B Data Empire Now Wants to Own Your Security Stack — Lakewatch by Databricks Takes on Splunk and Microsoft
The SIEM market has operated under the same basic economics for two decades: the more data you store, the more you pay. Databricks thinks that model is broken, and it’s betting two acquisitions and a new product called Lakewatch on proving it. On March 24, 2026, Databricks officially entered the cybersecurity market with Lakewatch…
-
83K GitHub Stars and $17M in Funding: How browser-use Became the Default Framework for AI Browser Agents
Two ETH Zurich grad students built a demo in five weeks. Twelve months later, browser-use sits at 83,500 GitHub stars, has taken $17 million in seed funding led by Felicis, and counts over 20 Y Combinator W25 startups as users. In a space crowded with browser automation tools, this open-source Python framework has pulled away…
-
SentrySearch Turns Hours of Video Into a Searchable Index for $2.50 — Using Google’s New Multimodal Embeddings
Searching through video footage has always been painful. Whether it’s dashcam recordings, security cameras, or raw production clips, the standard approach involves either scrubbing through hours of footage manually or relying on transcription-based pipelines that miss everything visual. SentrySearch, an open-source CLI tool that appeared on Hacker News this week, takes a fundamentally different approach:…
-
3 Months, $1 Billion, 200 Characters: The OpenAI Sora Shutdown & Disney Deal Collapse That Reshapes AI Video
On December 11, 2025, Disney and OpenAI stood on stage together to announce what looked like the future of entertainment: a $1 billion investment, a three-year licensing deal covering 200+ iconic characters, and a vision where fans could generate short videos starring Darth Vader, Iron Man, and Mickey Mouse using Sora. It was the single…
-
Arm AGI CPU: 136 Cores, 3nm, 2x Performance Per Rack — and 9 Companies Already Signed Up
For 35 years, Arm has been the company that designs chips but never builds them. That changed on March 24, 2026. Arm Holdings unveiled the Arm AGI CPU (its first in-house data center processor), a 136-core beast fabricated on TSMC’s 3nm process, built from the ground up for agentic AI workloads. Meta is…
-
95 Million Monthly Downloads Compromised: Inside the LiteLLM Supply Chain Attack
On March 24, 2026, two poisoned versions of LiteLLM, the Python library that routes LLM calls for nearly every major AI agent framework, landed on PyPI. Versions 1.82.7 and 1.82.8 carried a three-stage credential stealer that harvests SSH keys, cloud tokens, Kubernetes secrets, crypto wallets, and more, then encrypts everything with RSA-4096 and…
