Continue • ppio/deepseek-r1-distill

public

Published on 4/21/2025

deepseek-r1-distill-qwen-32b

基于 Qwen 2.5 32B 的蒸馏大语言模型，通过使用 DeepSeek R1 的输出进行训练而得。该模型在多个基准测试中超越了 OpenAI 的 o1-mini，取得了密集模型（dense models）的最新技术领先成果（state-of-the-art）。

Models

PPIO