student - News search

1 day ago

New direction at the start of 2026: context as the Teacher, three papers explain the new Self-Distillation paradigm

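These three works break with the convention that knowledge distillation must rely on a stronger external Teacher (such as GPT-4), and together point to a new paradigm, On-Policy Self-Distillation. In mathematical reasoning tasks, SFT suffers from a distribution shift between training and inference. OPSD (On-Policy Self-Distillation) focuses on how to exploit the privileged information implicit in the training data, namely the ground-truth answer.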

China.org.cn

Roundup: Approaching Chinese Spring Festival brings festive joy to Sudanese students

The event aims to introduce the local community to the traditions of the Spring Festival while encouraging students to deepen their interest in learning Chinese, Babikir said. He also noted that the ...
