Google Gemini 3.5 Pro (GA) is Google’s new frontier LLM, reachable through the Gemini API and Vertex AI. It solves one problem better than anything shipping today: fitting an absurd amount of context into a single call.
The 2M number is the whole story
Two million input tokens. Double what Gemini 3.5 Flash handles, and the largest window of any production frontier model right now. That’s a full codebase, a year of support tickets, or a whole contract library dropped into one prompt — no chunking, no retrieval hacks. The other headline is Deep Think, a multi-step reasoning mode for hard math, science, and coding. But it’s locked behind Google’s Ultra subscription, which starts around $100/month, so most people won’t touch it.
What you can build with it
It’s API-interactive: hit it via the Gemini API or Vertex AI, feed it giant contexts, and let it reason across the whole thing. Expected pricing lands near $15 in / $60 out per million tokens — roughly 10x Flash, so this is for jobs where context beats cost.
Why the noise? It slipped from June to July right as DeepMind bleeds talent. Google needed a win. The window delivers; the price won’t.
You Might Also Like
- Gemini 3 Deep Think Scores 84 6 on arc agi 2 Googles Reasoning bet is Paying off
- Google Nano Banana 2 Lite Gemini 3 1 Flash Lite Image a Picture in 4 Seconds for 0 034 per 1000
- Google Gemini Omni Flash Drops Video Generation to 0 10 a Second
- Google Deepmind Aletheia Just Solved Math Problems Nobody Could Heres why That Matters
- Google Lyria 3 Just Turned Gemini Into a Music Studio and im Weirdly Into it

Leave a comment