Top AI Product

Every day, hundreds of new AI tools launch across Product Hunt, Hacker News, and GitHub. We dig through the noise so you don't have to — surfacing only the ones worth your attention with honest, no-fluff reviews. Explore our latest picks, deep dives, and curated collections to find your next favorite AI tool.


OpenAI GPT-Realtime-2 + Translate + Whisper: three voice models, one API, several startups erased

OpenAI shipped three Realtime API models on May 7. Read the spec sheet and you can hear a half-dozen voice startups quietly rewriting their decks.

What actually launched

GPT-Realtime-2 is the first voice model with GPT-5-class reasoning baked in. 128K context (up from 32K), five-level reasoning effort, tone control, parallel tool calls, clean recovery from interruptions. It can think mid-conversation without going silent.

GPT-Realtime-Translate handles live speech-to-speech across 70+ input languages into 13 output languages, keeping pace with the speaker — no lag, no chunking. $0.034/min.

GPT-Realtime-Whisper streams transcription with controllable latency at $0.017/min. Captions and call transcripts are now commodities.

What you can build

All three sit behind the Realtime API, which exited beta the same day. Endpoints: gpt-realtime-2, gpt-realtime-translate, gpt-realtime-whisper. Realtime-2 input pricing is $32/M tokens.

Typical builds: agentic phone reps that handle interruptions cleanly, real-time meeting translators, multilingual customer support, live caption overlays. A full layer of voice-agent and AI-translation SaaS just collapsed into three endpoints.


You Might Also Like


Discover more from Top AI Product

Subscribe to get the latest posts sent to your email.



Leave a comment