AI Hardware & Infrastructure
-
Generalist GEN-1 Scores 99% Success Rate on Robot Tasks — With Just 1 Hour of Robot Data
Half a million hours of humans grabbing, folding, and stacking things. That’s what Generalist AI fed into its new foundation model before it ever touched a robot. The result is GEN-1, and the numbers are hard to argue with: 99% success rate on tasks where previous models managed 64%, roughly 3x faster execution than the… Continue reading
-
Lightning V3 (Smallest.ai) Scores 3.89 MOS and Beats OpenAI, ElevenLabs — From a 16-Person Team in Pune
The voice AI race right now looks like a bar fight in a crowded room. ElevenLabs has the brand. OpenAI has the distribution. Cartesia has the latency story. Microsoft just shipped MAI-Voice-1. Mistral open-sourced Voxtral TTS. And somehow, a 16-person startup from Pune, India, with $8 million in total funding, just posted the highest conversational… Continue reading
-
700 GitHub Stars in a Week: Apfel Exposes the Free LLM Apple Locked Behind Siri
Every Mac with Apple Silicon has a large language model built into the operating system. Not downloaded. Not sideloaded. Baked in. Apple ships it as part of Apple Intelligence — a 3-billion-parameter model that runs entirely on your Neural Engine and GPU. Zero cloud. Zero cost. Zero API keys. You just can’t use it. Not… Continue reading
-
Gemma 4 Scores 89% on AIME With Just 4B Active Parameters — Google’s Open Model Bet Gets Real
Google has been playing defense in the open model race for months. Llama 4 grabbed headlines. Qwen 3.5 dominated coding benchmarks. Gemma 3, despite solid performance, kept losing enterprise deals over one thing that had nothing to do with intelligence: its license. That changed on April 2. Gemma 4 dropped with four model sizes, vision… Continue reading
-
OpenRouter Is Raising $120M at a $1.3B Valuation — and It’s Processing More Tokens Than Most AI Companies Make
Alex Atallah has impeccable timing. In January 2022, Forbes pegged his stake in OpenSea at $2.2 billion. Six months later, he walked away from the NFT marketplace he co-founded. A few months after that, OpenSea’s daily trading volume collapsed 99% — from $2.7 billion to $9 million. The crypto crowd called him crazy for leaving.… Continue reading
-
Liquid AI LFM2.5-350M: How 350 Million Parameters Trained on 28 Trillion Tokens Outrun Models Twice Its Size
There’s a number that should make every AI engineer stop and think: 80,000 to 1. That’s the token-to-parameter ratio of Liquid AI’s new LFM2.5-350M — a model with just 350 million parameters that was trained on 28 trillion tokens. For context, most models see maybe 20 to 100 tokens per parameter during training. Liquid AI… Continue reading
-
PrismML Exits Stealth With $16M and a 1-Bit Model That Rivals Llama 3 at 1/16th the Memory
An 8-billion-parameter model that fits in 1 GB of memory. Not a quantized approximation of a bigger model. Not a research paper that’ll never ship. A production-ready LLM, trained from scratch with 1-bit weights, running at 368 tokens per second on an RTX 4090 and 44 tokens per second on an iPhone. PrismML came out… Continue reading
-
Ollama MLX on Apple Silicon: 1,810 Tokens/Sec Prefill and the End of llama.cpp on Mac
Ollama just ripped out its entire inference backend on Mac and replaced it with Apple’s MLX framework. That sentence alone would have been unthinkable a year ago, when Ollama was synonymous with llama.cpp and GGUF quantization. But the numbers from their March 30 blog post make the decision look obvious in hindsight: prefill jumped from… Continue reading
-
Rebellions Raises $400M at $2.34B Valuation — Korea’s Answer to NVIDIA in AI Inference
Every major AI company on the planet is spending billions on NVIDIA GPUs. Training clusters, inference farms, the whole stack runs on Jensen Huang’s hardware. NVIDIA owns north of 80% of the AI accelerator market. And yet, a five-year-old Korean startup just raised $400 million to bet that inference — the part of AI that… Continue reading
-
Starcloud Put an NVIDIA H100 in Orbit — 17 Months Later, It’s Worth $1.1B
A startup that launched its first satellite five months ago just raised $170 million at a $1.1 billion valuation. Starcloud, a Y Combinator company building data centers in low Earth orbit, closed its Series A today led by Benchmark and EQT Ventures. That makes it the fastest company in YC history to hit unicorn status… Continue reading
