ChatGPT becomes the central knowledge portal for companies. OpenAI launches ChatGPT Connectors, an important extension that connects the AI assistant directly with enterprise applications such as Google Drive and Slack.
News
LaVague: Open source framework for automated web agents
The open source landscape for AI web agents has been enriched by the introduction of a new platform: LaVague – a framework that redefines the future of automated web interaction. With a focus on flexibility, ease of use and high-level automation capabilities, LaVague offers exciting prospects for developers and businesses alike.
Baidu ERNIE 4.5 & X1: Multimodal AI meets logical thinking
The release of Baidu’s ERNIE 4.5, a multimodal AI model, and ERNIE X1, which specializes in deep reasoning, marks an extraordinary advance in the global AI competition. Both models combine advanced technology with impressive cost-efficiency and have become more accessible to both individuals and organizations.
OpenAI Agents Python SDK: Develop multi-agent systems easily
The launch of the OpenAI Agents Python SDK provides developers with a unique platform to seamlessly create and manage complex multi-agent systems. As the AI sector continues to innovate, OpenAI aims to lower the barriers to developing modern AI workflows with this release. With a focus on flexibility, security and interoperability, the SDK stands out as a comprehensive tool in the AI landscape.
Google AI update: Gemini 2.0 Flash expands possibilities for image generation
The availability of Gemini 2.0 Flash for image generation marks a new step in Google’s ambitious AI strategy. Previously only available to limited testers, the feature has now been rolled out globally via an experimental version in Google AI Studio and the Gemini API. With unique features such as storytelling capabilities, the option for conversational image editing and improved text rendering quality, Google is setting new standards in interactive image generation.
Google Gemma 3: Multimodal language models with extended context
Google has announced the release of Gemma 3, the latest version of its Open Model family – introducing far-reaching innovations for the AI industry. With an impressive combination of multimodality, a huge context window and enhanced language support, this development marks a significant step in the evolution of Large Language Models (LLMs).
Cartesia Sonic: Fast, realistic and flexible text-to-speech technology
Cartesia brings a new generation in text-to-speech (TTS) technology with Sonic – with amazing speed, outstanding realism and ultimate adaptability. This innovation sets new standards in AI speech synthesis.
Browserbase: Automated web interactions with AI
In the rapid development of artificial intelligence, Browserbase is emerging as a key player bridging the gap between AI and browser-based automation. With a highly specialized infrastructure for Computer Use Agents (CUAs), the company enables AI systems to seamlessly interact with web browsers and perform complex tasks – from data extraction to web searches to controlling entire workflows – fully automatically. Now also with OpenAI’s new Computer Use Model.
OpenAI releases new tools for agent-based applications
The release of OpenAI’s latest tools marks a significant step forward in the development of agent-based artificial intelligence. With new APIs, integrated functions and an open developer kit, the company aims to make it much easier to create powerful, autonomously acting systems. This step comes amid growing competition from rivals such as Google and Anthropic and underpins the trend towards agent-based AI platforms.