Inside Reasoning LLMs - DeepSeek-R1/o1
Category: DeepSeek-R1 Reasoning Models
Tags: DeepSeek-R1, OpenAI-o1, Reasoning Models, LLM
Contents
- Understanding Reasoning LLMs
- Sebastian Raschka: A Few Thoughts on DeepSeek R1 and Reasoning Models
- Large Language Models are Zero-Shot Reasoners
- Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
- 04 Paper: Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
- Paper Notes: Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning