AI research keeps expanding what is possible in visual content creation. With the launch of Wan AI, a powerful visual generation model from Alibaba Group’s Tongyi Lab, the industry is entering a new phase of innovation. This versatile tool combines advanced functions such as text-to-video generation and image editing, and it brings opportunities and challenges that extend far beyond the creative sector.
Comet: A new browser by Perplexity AI
Perplexity AI, an emerging artificial intelligence company with a $9 billion valuation and over 100 million weekly searches, has announced that it is entering the browser market with the AI-powered Comet browser. Comet could not only change the browsing experience, but also profoundly alter the way people interact with the digital space.
Claude 3.7 Sonnet & Claude Code: A new chapter in AI development
With Claude 3.7 Sonnet, Anthropic is setting a new benchmark in the AI industry. Improved reasoning and strong coding capabilities open up exciting opportunities for companies and developers alike.
FlashMLA: New standards for AI decoding on NVIDIA Hopper GPUs
DeepSeek AI’s introduction of FlashMLA, a decoding kernel for Multi-head Latent Attention (MLA), is a significant step in the continuous optimization of AI models. The openly released kernel was developed specifically for the NVIDIA Hopper architecture and aims to dramatically improve the processing of variable-length sequences in AI models.
Google Vertex AI now offers flexible pricing with low entry costs
Pricing has long been seen as key to advancing AI adoption, and Google is now taking a bold step forward with Vertex AI. The new pricing structure lets companies of all sizes use generative AI models without a large initial investment.
Compact models, big impact: SmolVLM2 sets new standards for video AI
The introduction of the compact SmolVLM2 video language models from Hugging Face marks a significant step towards more efficient and accessible AI technology. By focusing on smaller models without sacrificing performance, this development underlines the growing importance of AI for a wide range of applications.
Microsoft & OpenAI: Shaping the future of AI with new language models!
Microsoft is strengthening its collaboration with OpenAI and bringing the new GPT-4.5 and GPT-5 language models to the Azure platform. This development could have a lasting impact on the AI landscape and pave the way for greater integration of generative AI.
DeepSeek with Ollama: How to use DeepSeek securely on your own computer
DeepSeek is a powerful open-source language model that is suitable for research, code generation and many other AI applications. With Ollama, you can easily run DeepSeek locally on your computer, without having to rely on a cloud connection. In this article, you will learn how to install, configure and securely use DeepSeek with Ollama.
AutoAgent Framework: How zero-code technologies accelerate the development of autonomous AI agents
The development of AI-powered autonomous systems is reaching a new level: with AutoAgent, a GitHub-based platform, customizable AI assistants can now be built without any programming knowledge. The combination of a fully automated process structure and an extremely resource-efficient design gives both companies and individuals new ways to use personalized and powerful tools for their work.
Helix: The new vision-language-action technology for controlling humanoid robots
The development of humanoid robots has reached a significant milestone. Figure AI has presented Helix, a new vision-language-action (VLA) model platform for real-time control of humanoid robots via voice input. Helix combines visual and voice-based data processing with motion control and promises to fundamentally redefine the interaction between humans and machines.