Google now lets developers preview the Gemini 2.5 Computer Use model behind Project Mariner and the agentic features in AI Mode.
This “specialized model” can interact with graphical user interfaces, specifically browsers and websites. It works through several steps that repeat in a loop “until the task is over”.
Other user interface actions supported by the model include going back and forward, web search, navigating to a specific URL, cursor hover, keyboard combinations, scrolling, and drag and drop.
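The loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Google's implementation or the Gemini API: `FakeBrowser`, `Action`, `run_agent`, and `scripted_model` are hypothetical names, the "model" is a stub that replays a fixed plan, and the browser is an in-memory stand-in that supports just three of the actions listed.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class Action:
    """A UI action proposed by the model (hypothetical shape)."""
    name: str
    args: dict = field(default_factory=dict)


@dataclass
class FakeBrowser:
    """In-memory stand-in for a real browser; tracks only URL and scroll."""
    url: str = "about:blank"
    scroll_y: int = 0
    history: list = field(default_factory=list)

    def observe(self) -> dict:
        # A real client would capture a screenshot here.
        return {"url": self.url, "scroll_y": self.scroll_y}

    def execute(self, action: Action) -> None:
        if action.name == "navigate":
            self.history.append(self.url)
            self.url = action.args["url"]
            self.scroll_y = 0
        elif action.name == "go_back":
            if self.history:
                self.url = self.history.pop()
                self.scroll_y = 0
        elif action.name == "scroll":
            self.scroll_y = max(0, self.scroll_y + action.args.get("dy", 0))
        else:
            raise ValueError(f"unsupported action: {action.name}")


# Assumed signature: (goal, observation, actions so far) -> next action or None.
ModelFn = Callable[[str, dict, list], Optional[Action]]


def run_agent(goal: str, browser: FakeBrowser, propose_action: ModelFn,
              max_steps: int = 10) -> list:
    """Observe, ask for an action, execute it; repeat until the task is over."""
    trace = []
    for _ in range(max_steps):
        observation = browser.observe()
        action = propose_action(goal, observation, trace)
        if action is None:  # the model signals that the task is over
            break
        browser.execute(action)
        trace.append(action)
    return trace


def scripted_model(goal: str, observation: dict, trace: list) -> Optional[Action]:
    """Stub that replays a fixed plan instead of calling a real model."""
    plan = [
        Action("navigate", {"url": "https://example.com/board"}),
        Action("scroll", {"dy": 400}),
    ]
    return plan[len(trace)] if len(trace) < len(plan) else None


if __name__ == "__main__":
    browser = FakeBrowser()
    steps = run_agent("organize the board", browser, scripted_model)
    print([a.name for a in steps], browser.observe())
```

The `max_steps` cap is a design choice of this sketch: it keeps a misbehaving model from looping forever, which matters once the stub is swapped for a real model call.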
Google shared two demos (at 3X speed) with the following prompts:
“From https://tinyurl.com/pet-care-signup, get all the details for any pet with a residence in California and add them as a guest in my spa CRM at https://pet-luxe-spa.web.app/. Then organize the reason for the visit to the animal specialist.”
“My art club has brainstormed tasks ahead of our fair. The board is chaotic and I need your help to organize the tasks into certain categories I have created. Go to Sticky-note-jam.web.app and make sure that the notes are clearly in the right sections. Drag them there if not.”
Gemini 2.5 Computer Use is “mainly optimized for web browsers”. However, Google cites an “AndroidWorld” benchmark that “demonstrates strong promise for mobile user interface control tasks”, while the model is “not yet optimized for desktop OS-level control”.
Google reports strong performance on web and mobile control benchmarks compared with the offerings from Claude and OpenAI, as well as “the top quality for browser control at the lowest latency”.
This model is built on the visual understanding and reasoning capabilities of Gemini 2.5 Pro. Google says that “versions of this model” power Project Mariner and the agentic capabilities of AI Mode. It has been used internally for user interface testing to speed up software development, and Google has an early access program for third-party developers and workflow automation tools.
Gemini 2.5 Computer Use is available in public preview today via the Gemini API in Google AI Studio and Vertex AI.
Try it now: in a demo environment hosted by Browserbase.