Top AI Product

We track trending AI tools across Product Hunt, Hacker News, GitHub, and more — then write honest, opinionated takes on the ones that actually matter. No press releases, no sponsored content. Just real picks, published daily. Subscribe to stay ahead without drowning in hype.

February 25, 2026

Moonshine Open-Weights STT: The Tiny Speech Model That Punches Way Above Its Weight

There’s a certain thrill when you find an open-source project that makes you rethink what’s possible on cheap hardware. [Moonshine Open-Weights STT](https://github.com/moonshine-ai/moonshine) is exactly that kind of project. Built by [Useful Sensors](https://usefulsensors.com/) — the company led by Pete Warden, former TensorFlow lead at Google — Moonshine is a family of speech-to-text models designed to run locally on just about anything: phones, Raspberry Pis, IoT gadgets, even wearables.

The numbers are hard to ignore. At the top end, Moonshine claims higher accuracy than Whisper Large V3, and at the bottom end you’re looking at a 26 MB model that still holds its own. For context, Whisper Large V3 is around 1.5 GB. That’s not a typo. The trick is a variable-length encoder that scales computation to the actual length of your audio input instead of padding everything out to 30-second chunks like Whisper does. The result is roughly a 5x reduction in compute compared to Whisper Tiny with no increase in word error rate, and a 1.7x overall speed boost across the board.

What caught my attention is that Moonshine just [showed up on Hacker News](https://news.ycombinator.com/item?id=47143755) as a Show HN post and racked up 269 points with 59 comments in a single day — the top-scoring AI Show HN that day. The discussion was genuinely useful, with people sharing benchmarks and comparing it against other local STT options.

The latest version, [Moonshine v2](https://arxiv.org/abs/2602.12241), introduces a streaming encoder with sliding-window attention, which means you get bounded latency regardless of how long the audio clip runs. That’s a big deal for live transcription and voice command use cases where waiting for the full utterance to finish isn’t acceptable.

Models are available on [Hugging Face](https://huggingface.co/UsefulSensors/moonshine) and run via Keras with Torch, TensorFlow, or JAX backends, plus there’s ONNX runtime support for edge devices. Platform coverage is broad — Python, iOS, Android, macOS, Linux, Windows, all supported. If you’ve been looking for a private, offline-capable speech recognition solution that doesn’t require a beefy GPU, Moonshine is worth a serious look.

Discover more from Top AI Product

Subscribe to get the latest posts sent to your email.

Uncategorized

Posted by:

agent

About Me

Hi. I’m a builder who’s obsessed with what AI can actually do — not the hype, but the real tools people ship every day. I use AI to help me find, research, and write about the most interesting AI products launching across Product Hunt, Hacker News, GitHub, and everywhere else. The articles are AI-assisted. The curiosity is mine. I started this site because I was already spending hours every day digging through launches and repos. Figured I might as well share what I find. If something shows up here, it’s because I thought it was genuinely worth your time.

Moonshine Open-Weights STT: The Tiny Speech Model That Punches Way Above Its Weight

Share this:

Discover more from Top AI Product

Leave a comment Cancel reply