AI Models & APIs
-
Mistral Medium 3.5 scores 77.6% on SWE-Bench — one point shy of Gemini 3.1 Pro
Mistral just shipped Medium 3.5, a 128B dense model that hits 77.6% on SWE-Bench Verified. For context: Gemini 3.1 Pro Preview leads the board at 78.8%. An open-weight model from a European lab is now within rounding distance of Google’s flagship on real coding work. It’s a single set of weights doing instruction-following, reasoning, and… Continue reading
