AI Disruption
Subscribe
Sign in
Home
Podcast
Chat
Chip
Meta
Paper
Qwen
Agent
Robot
Google
OpenAI
AI Code
AI Video
AI Weekly
Elon Musk
AI Writing
AI Painting
AI Business
🎈 Guest Posts
AI Open Source
Machine Learning
Chinese Outbound
Foundation Model
Archive
About
Chinese Outbound
Latest
Top
Discussions
New SOTA 32B Model: Open, Free, 1/20 Size of DeepSeek-R1
Skywork-OR1: Powerful 32B open model, free for commercial use. 1/20th size of DeepSeek-R1, beats Qwen-32B. Weights, code & data fully open!
Apr 14
Â
•
Â
Meng Li
2
Share this post
AI Disruption
New SOTA 32B Model: Open, Free, 1/20 Size of DeepSeek-R1
Copy link
Facebook
Email
Notes
More
200B Model Beats DeepSeek-R1 – ByteDance Launches Seed-Thinking-v1.5
ByteDance's Seed-Thinking-v1.5: 200B MoE model beats DeepSeek-R1 in reasoning & STEM tasks. Outperforms in benchmarks like AIME & Codeforces.
Apr 11
Â
•
Â
Meng Li
2
Share this post
AI Disruption
200B Model Beats DeepSeek-R1 – ByteDance Launches Seed-Thinking-v1.5
Copy link
Facebook
Email
Notes
More
DeepSeek Unveils New Paper on Inference-Time Scaling, Is R2 Coming?
DeepSeek's new Self-Principled Critique Tuning (SPCT) boosts AI reward models. Is R2 coming? Read the arXiv paper now!
Apr 4
Â
•
Â
Meng Li
4
Share this post
AI Disruption
DeepSeek Unveils New Paper on Inference-Time Scaling, Is R2 Coming?
Copy link
Facebook
Email
Notes
More
DeepSeek V3 Upgrade: Code Breakthrough Rivals Claude 3.5/3.7
DeepSeek-V3-0324: MIT-licensed, MoE-powered AI with 685B params, 20+ tokens/sec speed. Outperforms Claude 3.5—free on Hugging Face!
Mar 25
Â
•
Â
Meng Li
9
Share this post
AI Disruption
DeepSeek V3 Upgrade: Code Breakthrough Rivals Claude 3.5/3.7
Copy link
Facebook
Email
Notes
More
Tencent Launches Hunyuan-T1: First Transformer-Mamba AI Model
Tencent Releases Hunyuan-T1: A New AI Model That Outperforms GPT-4.5 and DeepSeek R1
Mar 22
Â
•
Â
Meng Li
7
Share this post
AI Disruption
Tencent Launches Hunyuan-T1: First Transformer-Mamba AI Model
Copy link
Facebook
Email
Notes
More
OpenAI Report: DeepSeek Closes U.S.-China AI Competition Gap
OpenAI urges U.S. government to boost AI adoption, streamline regulations, and counter competition like DeepSeek. Explore the ambitious strategy and its…
Mar 15
Â
•
Â
Meng Li
2
Share this post
AI Disruption
OpenAI Report: DeepSeek Closes U.S.-China AI Competition Gap
Copy link
Facebook
Email
Notes
More
Manus Ignites the Intelligent Agent Reproduction Craze! DeepSeek Has Been Integrated
Manus ignites an intelligent agent revolution—sparking rapid open-source reproductions, DeepSeek integration, and fresh GAIA benchmark insights.
Mar 9
Â
•
Â
Meng Li
3
Share this post
AI Disruption
Manus Ignites the Intelligent Agent Reproduction Craze! DeepSeek Has Been Integrated
Copy link
Facebook
Email
Notes
More
DeepSeek R1 Technology Successfully Migrates to the Multimodal Domain, Fully Open Sourced
Discover Visual-RFT—an open-source breakthrough that extends DeepSeek-R1’s rule-based reinforcement learning to vision-language models for efficient…
Mar 4
Â
•
Â
Meng Li
4
Share this post
AI Disruption
DeepSeek R1 Technology Successfully Migrates to the Multimodal Domain, Fully Open Sourced
Copy link
Facebook
Email
Notes
More
DeepSeek's GRPO: Complete From-Scratch Implementation
Discover how to implement GRPO from scratch using Qwen2.5-1.5B-Instruct in this comprehensive distributed RL tutorial to boost model performance and…
Mar 2
Â
•
Â
Meng Li
6
Share this post
AI Disruption
DeepSeek's GRPO: Complete From-Scratch Implementation
Copy link
Facebook
Email
Notes
More
DeepSeek Unveils V3/R1 Inference System with 545% Cost-Profit Ratio and Full Transparency!
Discover DeepSeek's V3/R1 Inference System, with optimized batch scaling, load balancing, and full cost transparency for maximum performance and…
Mar 1
Â
•
Â
Meng Li
2
Share this post
AI Disruption
DeepSeek Unveils V3/R1 Inference System with 545% Cost-Profit Ratio and Full Transparency!
Copy link
Facebook
Email
Notes
More
DeepSeek Open Source: 3FS & Smallpond for Effortless PB-Level Data Processing
DeepSeek’s 3FS & Smallpond are revolutionizing AI data processing. With impressive speed, scalability, and performance, these open-source tools handle…
Feb 28
Â
•
Â
Meng Li
2
Share this post
AI Disruption
DeepSeek Open Source: 3FS & Smallpond for Effortless PB-Level Data Processing
Copy link
Facebook
Email
Notes
More
DeepSeek Release: Boost AI Performance with DualPipe, EPLB, and Profile-Data
Discover DeepSeek's latest releases, including DualPipe, EPLB, and profile-data, designed to enhance AI performance and optimize…
Feb 27
Â
•
Â
Meng Li
1
Share this post
AI Disruption
DeepSeek Release: Boost AI Performance with DualPipe, EPLB, and Profile-Data
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts