DeepSeek R1 Distill Qwen 32B distils Qwen 2.5 and DeepSeek R1 expertise into a powerful 32B model. Outperforms OpenAI's o1-mini with impressive math capabilities (AIME: 72.6%, MATH-500: 94.3%) and coding skills (CodeForces: 1691).
anthropic