Gemini 2.5 Computer Use: AI That Can Operate Browsers
Google's Gemini 2.5 Computer Use AI agent is here. It can directly control your browser to click, type, and automate tasks
“AI Disruption” Publication 7900 Subscriptions 20% Discount Offer Link.
Google’s Computer Use Model is Here!
Today, Google DeepMind made a major announcement with the release of Gemini 2.5 Computer Use, a computer-using model based on Gemini 2.5.
Considering that Google just released Chrome DevTools (MCP) a few days ago, the arrival of Gemini 2.5 Computer Use isn’t particularly surprising.
Simply put, similar to OpenAI’s Computer-Using Agent (CUA), DeepMind’s model allows AI to directly control users’ browsers — building on visual understanding and reasoning capabilities, the model can help users perform actions like clicking, scrolling, and typing within the browser.
Let’s first look at two official demonstrations.