UI-TARS Desktop
Control your computer using natural language
2025-01-23

A GUI agent application from ByteDance, based on https://github.com/bytedance/UI-TARS, that lets you control your computer using natural language.
UI-TARS Desktop is an application that lets users control their computer using natural language, powered by a Vision-Language Model. Developed by ByteDance, this GUI agent works with web browsers, command lines, and file systems, and offers precise mouse and keyboard control, real-time feedback, and cross-platform support for Windows and macOS. Processing is fully local, for privacy and security. The app also features screenshot and visual recognition capabilities, making it a versatile tool for automating tasks. With the recent release of the UI-TARS SDK, developers can now build GUI automation agents more efficiently. UI-TARS Desktop suits anyone who wants a natural-language way to interact with their computer.
Open Source
Artificial Intelligence
GitHub