QwQ-Max preview: Alibaba’s AI revolution overtakes Claude 3.5 and DeepSeek V3

Alibaba is setting new standards in the AI space with QwQ-Max-Preview, challenging established models such as Claude 3.5 and GPT-4o. The new model achieves an impressive 60% success rate on the first attempt for challenging AIME 2025 math problems and outperforms both DeepSeek V3 (85.5) and Claude 3.5 Sonnet (85.2) with 89.4 points in the Arena Hard benchmark.

Read more