• Skip to primary navigation
  • Skip to content
  • Skip to footer
军舰日志 军舰日志
  • 首页
  • 快捷键
  • 分类
  • 标签
  • 关于

    推理 LLM 技术内幕 - DeepSeek-R1/o1

    2025-03-08 less than 1 minute read

    本文目录

    • Understanding Reasoning LLMs
    • Sebastian Raschka:关于DeepSeek R1和推理模型,我有几点看法
    • Large Language Models are Zero-Shot Reasoners
    • Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
    • 04 论文 Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
    • 论文笔记:Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling
    • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Updated: 2025-03-08

    Previous Next
    • GitHub
    © 2025 军舰日志. Powered by Jekyll & Minimal Mistakes.