AI Disruption

Gemini 2.5 Computer Use: AI That Can Operate Browsers

Google's Gemini 2.5 Computer Use AI agent is here. It can directly control your browser to click, type, and automate tasks.

Meng Li
Oct 08, 2025


Google’s Computer Use Model is Here!

Today, Google DeepMind announced the release of Gemini 2.5 Computer Use, a computer-use model built on Gemini 2.5.

Considering that Google released the Chrome DevTools MCP server just a few days ago, the arrival of Gemini 2.5 Computer Use isn’t particularly surprising.

Simply put, like OpenAI’s Computer-Using Agent (CUA), DeepMind’s model lets AI directly control a user’s browser. Building on visual understanding and reasoning capabilities, it can perform actions such as clicking, scrolling, and typing within the browser.
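
To make the idea concrete, here is a minimal sketch of the observe-and-act loop that browser agents of this kind generally follow: look at the page, ask the model for the next action, carry it out, and repeat. It does not use Google's actual API. `FakeBrowser`, `stub_model`, `Action`, and `run_agent` are all hypothetical names invented for illustration, and the "model" here is just a fixed script rather than a real Gemini call.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    """One UI action (hypothetical shape, not Google's schema)."""
    kind: str       # "click", "type", "scroll", or "done"
    x: int = 0      # click coordinates
    y: int = 0
    text: str = ""  # text to type
    dy: int = 0     # pixels to scroll


@dataclass
class FakeBrowser:
    """Stand-in for a real browser; it only records what was done to it."""
    typed: str = ""
    scroll_y: int = 0
    log: list = field(default_factory=list)

    def screenshot(self) -> dict:
        # A real agent would capture pixels; here the "screenshot" is plain state.
        return {"typed": self.typed, "scroll_y": self.scroll_y}

    def apply(self, action: Action) -> None:
        if action.kind == "type":
            self.typed += action.text
        elif action.kind == "scroll":
            self.scroll_y += action.dy
        elif action.kind != "click":
            raise ValueError(f"unknown action: {action.kind}")
        self.log.append(action.kind)


def stub_model(goal: str, observation: dict, history: list) -> Action:
    """Placeholder for the model call (assumption): replays a fixed script."""
    script = [
        Action("click", x=120, y=40),  # focus a search box
        Action("type", text=goal),     # type the goal text
        Action("scroll", dy=300),      # scroll down the page
    ]
    step = len(history)
    return script[step] if step < len(script) else Action("done")


def run_agent(goal: str, browser: FakeBrowser, max_steps: int = 10) -> list:
    """Observe, ask the model for an action, execute it, repeat until done."""
    history = []
    for _ in range(max_steps):
        observation = browser.screenshot()
        action = stub_model(goal, observation, history)
        if action.kind == "done":
            break
        browser.apply(action)
        history.append(action)
    return history
```

In a real setup, `stub_model` would be replaced by a call to the model with an actual screenshot, and `FakeBrowser` by a browser automation layer. The `max_steps` cap is a design choice in this sketch so that a confused agent cannot loop forever.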

Let’s first look at two official demonstrations.

© 2025 Meng Li