Continue • ppio/deepseek-r1-distill-qwen-14b

public

Published on 4/21/2025

deepseek-r1-distill-qwen-14b

基于 Qwen 2.5 14B 的蒸馏大语言模型，通过使用 DeepSeek R1 的输出进行训练而得。该模型在多个基准测试中超越了 OpenAI 的 o1-mini，取得了密集模型（dense models）的最新技术领先成果（state-of-the-art）。该模型通过从 DeepSeek R1 的输出中进行微调，展现了与更大规模的前沿模型相当的竞争性能。

Models

deepseek-r1-distill-qwen-14b

PPIO

chat

edit

apply