Ai2 releases open-source visual AI agent that can take control of web browsers

Allen Institute for AI, a prominent Seattle-based nonprofit research organization working on advancing artificial intelligence models and systems, launched a new open-source AI agent that can take control of web browsers on a user’s behalf and automate tasks. Web agents represent the next step of what is called vision-language models, which move large language models … Read more

Google upgrades its Stitch AI interface development tool

Google LLC today released a new version of Stitch, an artificial intelligence tool that can generate user interfaces for websites and mobile apps. Shares of graphic design software maker Figma Inc. declined more than 4% on the news. The company’s namesake platform is the go-to choice for UI development projects. Building an interface involves more … Read more

Google expands the availability of its Personal Intelligence tool

Google LLC today significantly expanded the availability of the Personal Intelligence tool in its Gemini assistant and search engine. The technology customizes artificial intelligence responses based on information in the user’s Google account. Personal Intelligence made its original debut in mid-January. Initially, the service was only included in Google’s AI Pro and Ultra plans, which … Read more

Anthropic’s Claude gets interactive visuals to enhance learning

Last year, Anthropic PBC released a temporary experience called “Imagine with Claude” that enabled its chatbot to create interactive visuals in real-time without any code, and now the same capability is coming to Claude’s chat conversations. In a blog post today, the company said Claude’s new visualization capabilities are an “expansion” of Imagine with Claude that … Read more

Google enhances Docs, Sheets, Slides and Drive with deeper Gemini integration

Google LLC announced today that it’s making “prep work” for collaboration and creation easier for users of its cloud productivity tools in Workspace across Docs, Sheets, Slides and Drive through a deeper integration with Gemini as an artificial intelligence assistant. Gemini, Google’s flagship large language model, received its most recent update major update in February … Read more

Sauce Labs launches ‘programmable infrastructure’ for mobile testing with Real Device Access API

Sauce Labs Inc., a continuous testing solutions provider, today announced the general availability of its Real Device Access API, changing how the company delivers mobile testing infrastructure and making it easier for developers to test software directly on real devices. The new Real Device Access application programming interface removes the need to work through traditional … Read more

Google launches Lyria 3 music generation model

Google LLC today introduced an artificial intelligence model called Lyria 3 that consumers can use to generate short tracks. The algorithm is rolling out to the company’s Gemini app and Dream Track, a music generation feature in YouTube’s creator toolkit. Files generated with Lyria 3 will contain an imperceptible watermark generated by a Google technology … Read more

OpenAI quietly launches ChatGPT Translate with support for 25 languages

OpenAI Group PBC today launched ChatGPT Translate, a free translation service hosted on a standalone web page. The rollout wasn’t accompanied by an announcement, which hints that the service may be a prototype. In July 2024, OpenAI launched a search engine called SearchGPT with the goal of collecting user feedback about its information retrieval features. … Read more

Google revamps Gemini 2.5 Pro again, claiming superiority in coding and math

Google LLC has launched another, even more capable preview of its powerful Gemini 2.5 Pro model, proclaiming it to be the “most intelligent” large language model it has released so far. It’s the latest update to Gemini 2.5 Pro, which first debuted in March and was then upgraded as a preview in May, and it … Read more

Cloud-empowered AI becomes reality in Google’s ecosystem

The cloud isn’t just changing — it’s accelerating. Artificial intelligence sits at the heart of this shift, sparking a transformation spanning entire ecosystems. Cloud-empowered AI is the new normal. Google Cloud’s Jim Anderson discusses shifting AI dynamics. At Google Cloud, this new wave is creating complexity, opportunity and fierce momentum for partners and customers alike, … Read more