Skip to primary navigation
Skip to content
Skip to footer

首页
必读
快捷键
分类
标签
关于

推理 LLM 技术内幕 - DeepSeek-R1/o1

2025-03-08 less than 1 minute read

本文目录

Understanding Reasoning LLMs
Sebastian Raschka：关于DeepSeek R1和推理模型，我有几点看法
Large Language Models are Zero-Shot Reasoners
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
04 论文 Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
论文笔记：Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Updated: 2025-03-08

Previous Next

Enter your search term...

GitHub

© 2026 军舰日志. Powered by Jekyll & Minimal Mistakes.