AI Models & APIs
-
Claude Opus 4.8 Runs Up to 1,000 Subagents in a Single Session
Anthropic released Claude Opus 4.8 on May 28, just 41 days after 4.7, and the headline isn’t a benchmark bump: it’s a new way of working called dynamic workflows. In Claude Code, the model can now plan a large task, write a JavaScript orchestration script, and spin up as many as 1,000 subagents running…
-
dots.tts Is a New Open-Source TTS Model With No Discrete Tokens
Most text-to-speech systems convert audio into discrete tokens at some stage of the pipeline. dots.tts, a new open-source TTS model from RedNote’s Hilab team, throws that step out entirely. It’s a 2B-parameter, fully continuous, end-to-end autoregressive system: a semantic encoder, an LLM, and a flow-matching acoustic head running over a 48kHz AudioVAE, with no…
-
turbovec is a Rust vector index that fits 10M docs in 4GB and beats FAISS on speed
turbovec is an open-source vector index (the part a RAG system uses to find relevant chunks) written in Rust with Python bindings, and it’s trending hard this week. The pitch is brutal efficiency: a 10-million-document corpus that eats 31GB of RAM in a typical setup fits in about 4GB here, roughly 8x compression,…
-
MiniMax M3 is an open-weight model that beats GPT-5.5 on SWE-Bench Pro with a 1M-token context
MiniMax M3, out June 1 from the Shanghai lab, makes a loud claim: it is the first open-weight model to put frontier coding, a 1-million-token context window, and native multimodal understanding in one architecture. On SWE-Bench Pro it scores 59.0%, edging out OpenAI’s GPT-5.5 and Google’s Gemini 3.1 Pro.

## The benchmark sweep

It’s not just SWE-Bench.…
-
ArcANE tests whether role-playing AI characters evolve with the story, not just stay in character
Role-playing AI is usually judged on consistency: does it remember the character’s facts and stay on persona? ArcANE, a new arXiv benchmark, argues that’s the wrong bar. A good character isn’t fixed: its values and behavior should evolve as the story moves, and the real question is whether the AI tracks that arc at…
-
Brandfetch’s Brand Context API gives AI agents structured data on 50M brands in one call
LLMs are confidently wrong about brands they barely know: the logo, the positioning, the competitors. Brandfetch’s Brand Context API is built to ground that: one API call returns structured data on 50M+ brands, including voice, mission, positioning, audience, and competitors.

## What it’s for

The pitch is grounding AI features so they stop hallucinating…
-
Grok Imagine 1.5 turns a still image into 720p video with synced audio and tops the image-to-video arena
xAI’s Grok Imagine 1.5 takes a still image (or a text prompt) and animates it into a clip with native, synchronized audio baked in: music, sound effects, even lip-synced dialogue. It shipped as an API preview on June 3.

## What’s new in 1.5

The headline is audio in every generation, no separate…
-
NVIDIA Cosmos Reason 2 is an open vision-language model built to reason about the physical world
NVIDIA Cosmos Reason 2 is an open reasoning vision-language model with a narrow but hard job: let machines see a physical scene, understand what’s happening, and decide how to act. It’s the reasoning brain in NVIDIA’s broader Cosmos physical-AI stack.

## Reasoning, not just captioning

Most vision-language models describe an image. Cosmos Reason 2 is…
-
NVIDIA’s GR00T N1.6 gives humanoid robots full-body control from one vision-language-action model
NVIDIA’s Isaac GR00T N1.6 is the latest open foundation model for humanoid robots: a vision-language-action (VLA) model that turns camera streams, robot state, and plain-language instructions into one unified control policy.

## What changed from N1.5

N1.6 isn’t a rewrite; it’s a sharpening. Architecture, data, and modeling improvements let it beat the previous N1.5…
-
Ideogram 4.0 is the first open-weight text-to-image model to top the DesignArena leaderboard
Ideogram 4.0, released June 3, is Ideogram’s first open-weight text-to-image model, and it landed at #1 among all open-weight models on the DesignArena leaderboard the day it shipped. It’s a 9.3-billion-parameter Diffusion Transformer trained from scratch, not a fine-tune of someone else’s base.

## Built for design, not just pretty pictures

The headline feature…
