ByteDance’s UI-TARS-desktop pulled 656 stars yesterday and now sits at 31.9k on GitHub, where it is widely being called the open-source answer to OpenAI’s Operator.
What it actually is: two pieces. Agent TARS, a multimodal agent stack you can run in a terminal or browser, or embed in your product. And UI-TARS Desktop, a native app that hands your computer to the agent: it looks at the screen, moves the mouse, and types on the keyboard. No accessibility APIs. Pure vision plus action.
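To make "pure vision plus action" concrete, here is a minimal sketch of the observe-decide-act loop such an agent runs. It is not the UI-TARS SDK: `Action`, `FakeDesktop`, `stub_policy`, and `run_agent` are hypothetical names invented for illustration, the "screenshot" is a text description rather than pixels, and the policy is a hard-coded stand-in for the vision-language model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    kind: str       # "click", "type", or "done"
    x: int = 0
    y: int = 0
    text: str = ""


class FakeDesktop:
    """Hypothetical stand-in for a real screen plus mouse and keyboard."""

    def __init__(self) -> None:
        self.focused = False
        self.typed = ""
        self.log: List[Action] = []

    def screenshot(self) -> str:
        # A real agent would capture pixels; here the "image" is a description.
        if not self.focused:
            return "search box at (100, 40), not focused"
        if not self.typed:
            return "search box focused, empty"
        return f"search box contains '{self.typed}'"

    def apply(self, action: Action) -> None:
        # Record the action and update the simulated UI state.
        self.log.append(action)
        if action.kind == "click":
            self.focused = True
        elif action.kind == "type" and self.focused:
            self.typed += action.text


def stub_policy(goal: str, screen: str) -> Action:
    """Placeholder for the model: maps (goal, screenshot) to one action."""
    if "not focused" in screen:
        return Action("click", x=100, y=40)
    if "empty" in screen:
        return Action("type", text=goal)
    return Action("done")


def run_agent(
    goal: str,
    desktop: FakeDesktop,
    policy: Callable[[str, str], Action],
    max_steps: int = 10,
) -> int:
    """Observe, decide, act; returns the step at which the policy said 'done'."""
    for step in range(1, max_steps + 1):
        action = policy(goal, desktop.screenshot())
        if action.kind == "done":
            return step
        desktop.apply(action)
    return max_steps


desktop = FakeDesktop()
steps = run_agent("weather", desktop, stub_policy)
print(steps, desktop.typed, [a.kind for a in desktop.log])
```

The point of the shape is that the policy only ever sees a screenshot and a goal, never a DOM or accessibility tree, so swapping the stub for a real model and the fake desktop for real screen capture and input leaves the loop itself unchanged.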
The numbers it beats
The underlying model, UI-TARS-1.5-7B, hits 61.6% on ScreenSpot Pro. Claude 3 gets 27.7%. GPT-4o gets 41.2%. On OSWorld it scores 24.6 to Claude’s 22.0; on AndroidWorld, 46.6 to GPT-4o’s 34.5. A 7B model running locally is outperforming frontier closed models on the GUI tasks they were supposed to own.
Apache 2.0 and an SDK
This is the part that matters. Self-host, swap models, build the agent loop into your product via the SDK. No rate limit, no ToS, no per-action billing. ByteDance also ships a free Remote Computer Operator — click and the model drives a sandboxed machine.
A frontier GUI agent, open weights, from China. The “open-source Operator” is no longer just a meme.