AI Disruption

AI Disruption

Share this post

AI Disruption
AI Disruption
DeepSeek-R1: Pure RL Cuts Costs by 90%, Milestone in AI Learning

DeepSeek-R1: Pure RL Cuts Costs by 90%, Milestone in AI Learning

Discover DeepSeek-R1: A game-changing AI model with OpenAI-level capabilities, open-sourced for free, sharing training secrets to pave the way for AGI.

Meng Li's avatar
Meng Li
Jan 21, 2025
∙ Paid
6

Share this post

AI Disruption
AI Disruption
DeepSeek-R1: Pure RL Cuts Costs by 90%, Milestone in AI Learning
5
Share

"AI Disruption" publication New Year 30% discount link.


OpenAI's Original Vision, Ultimately Realized by a Startup?

On January 19, DeepSeek began releasing the R1 preview version.

DeepSeek-R1-Preview Tops Rankings, Comparable to OpenAI o1, Confirmed Open-Source

DeepSeek-R1-Preview Tops Rankings, Comparable to OpenAI o1, Confirmed Open-Source

Meng Li
·
Jan 19
Read full story

Last night, DeepSeek officially launched DeepSeek-R1, which, like OpenAI's O1, excels in tasks like mathematics, code, and natural language reasoning.

Image

The open-source large model DeepSeek-V3, released in December last year, had already stirred up a buzz by achieving many "impossibilities."

This time, the open-source R1 large model has stunned many AI researchers from the very beginning, with people speculating how it was accomplished.

Casper Hansen, author of AutoAWQ, stated that DeepSeek-R1 uses a multi-stage cyclic training method: Foundation → RL → Fine-tuning → RL → Fine-tuning → RL.

Professor Alex Dimakis from UC Berkeley believes that DeepSeek is now in the lead, and U.S. companies may need to catch up.

Currently, DeepSeek has fully launched R1 on the web, app, and API platforms. The following image shows the web interface, where selecting DeepSeek-R1 provides immediate access.

Experience it here: https://www.deepseek.com/

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 Meng Li
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share