Top AI Product

Every day, hundreds of new AI tools launch across Product Hunt, Hacker News, and GitHub. We dig through the noise so you don't have to — surfacing only the ones worth your attention with honest, no-fluff reviews. Explore our latest picks, deep dives, and curated collections to find your next favorite AI tool.


Apple AFM 3 runs a 20B model on your iPhone — and hands it to developers

At WWDC 2026 Apple shipped AFM 3, its third-generation Foundation Models: five models split across on-device and cloud. The headline is AFM 3 Core Advanced, a 20-billion-parameter sparse model that actually runs on an iPhone.

The flash-memory trick

A 20B model won’t fit in phone RAM, so Apple doesn’t try. The full model lives in flash (NAND), and a lightweight router picks a fixed set of experts per prompt — activating just 1–4B parameters at a time. They call it Instruction-Following Pruning. Result: 20B-class quality on hardware that can only hold a few billion active weights in memory. Alongside it sit the 3B AFM 3 Core, server-side AFM 3 Cloud, an image model, and AFM 3 Cloud Pro, Apple’s most capable model — extended to NVIDIA GPUs in Google Cloud while keeping Private Cloud Compute’s privacy guarantees.

What developers actually get

Through the Foundation Models framework, you call AFM 3 directly inside your app — on-device or cloud, your choice. Good for agentic tool use, on-device dictation, summarization, and structured generation with no API key and no per-token bill. That last part is the real shift.


You Might Also Like


Discover more from Top AI Product

Subscribe to get the latest posts sent to your email.



Leave a comment