Post
852
The era of local Computer Use AI Agents is here.
Meet UI-TARS-1.5-7B-6bit, now running natively on Apple Silicon via MLX.
The video is of UI-TARS-1.5-7B-6bit completing the prompt "draw a line from the red circle to the green circle, then open reddit in a new tab" running entirely on MacBook. The video is just a replay, during actual usage it took between 15s to 50s per turn with 720p screenshots (on avg its ~30s per turn), this was also with many apps open so it had to fight for memory at times.
Built using c/ua : https://github.com/trycua/cua
Join us making them here: https://discord.gg/4fuebBsAUj
Kudos to the MLX community here on huggingface :
mlx-community
Meet UI-TARS-1.5-7B-6bit, now running natively on Apple Silicon via MLX.
The video is of UI-TARS-1.5-7B-6bit completing the prompt "draw a line from the red circle to the green circle, then open reddit in a new tab" running entirely on MacBook. The video is just a replay, during actual usage it took between 15s to 50s per turn with 720p screenshots (on avg its ~30s per turn), this was also with many apps open so it had to fight for memory at times.
Built using c/ua : https://github.com/trycua/cua
Join us making them here: https://discord.gg/4fuebBsAUj
Kudos to the MLX community here on huggingface :
