Google shipped Google Gemini Omni Flash on June 30 — the cheap, fast tier of its Omni family, built for one thing: making video by talking to it. Describe a scene in plain English, get a clip. Then keep talking to fix it: “darken the sky,” “add a dog,” “make it slower.” No timeline, no keyframes, no editing app. The model treats a conversation as the edit history.
What it actually is
A production-grade text/image/video-to-video model that does conversational editing. It generates clips up to 10 seconds at 720p, and every output carries an invisible SynthID watermark. It’s live now in the Gemini app, Google Flow, and free on YouTube Shorts and Create.
The API angle
At $0.10 per second, a ten-second clip runs about a dollar — an order of magnitude under the flagship Omni, and matching Veo 3.1 Fast. The developer and enterprise API opens in the coming weeks. The obvious use case: ad variants, product demos, and short-form social at volume, where you iterate by prompt instead of re-rendering a timeline.
Why it matters: Google is betting that “chat is the editor” scales, and it just made that bet cheap enough to test.
You Might Also Like
- Google Health Swaps the Fitbit app for a 9 99 Gemini Coach
- Google Ships Gemini api File Search Multimodal rag Page Level Citations and Embedding 2 Baked in
- Google Nano Banana 2 Lite Gemini 3 1 Flash Lite Image a Picture in 4 Seconds for 0 034 per 1000
- Google Lyria 3 Just Turned Gemini Into a Music Studio and im Weirdly Into it
- Gemini 3 1 Flash Lite Googles Cheapest Model Just got Surprisingly Good

Leave a comment