thread/1883686162709295541 HTML

DeepSeek has revolutionized AI model training by drastically reducing costs from $100M to $5M and hardware requirements from 100,000 GPUs to just 2,000, while maintaining competitive performance. Their innovative approach includes using less memory and specialized expert systems that activate only when needed, making AI development more accessible. This disruption poses a significant threat to Nvidia's business model, as it allows smaller players to compete without the need for expensive data centers.

Uploaded by

andrewhedgehogging

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views2 pages

thread/1883686162709295541 HTML

Uploaded by

andrewhedgehogging

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

threadreaderapp.com /thread/1883686162709295541.

html

🧵 Finally had a chance to dig into DeepSeek’s r1…

Let me break down why DeepSeek's AI innovations are blowing people's minds (and possibly threatening
Nvidia's $2T market cap) in simple terms...
0/ first off, shout out to @doodlestein who wrote the must-read on this here:
youtubetranscriptoptimizer.com/blog/05_the_sh…

1/ First, some context: Right now, training top AI models is INSANELY expensive. OpenAI, Anthropic, etc.
spend $100M+ just on compute. They need massive data centers with thousands of $40K GPUs. It's like
needing a whole power plant to run a factory.

2/ DeepSeek just showed up and said "LOL what if we did this for $5M instead?" And they didn't just talk
- they actually DID it. Their models match or beat GPT-4 and Claude on many tasks. The AI world is (as
my teenagers say) shook.

3/ How? They rethought everything from the ground up. Traditional AI is like writing every number with 32
decimal places. DeepSeek was like "what if we just used 8? It's still accurate enough!" Boom - 75% less
memory needed.

4/ Then there's their "multi-token" system. Normal AI reads like a first-grader: "The... cat... sat..."
DeepSeek reads in whole phrases at once. 2x faster, 90% as accurate. When you're processing billions
of words, this MATTERS.

5/ But here's the really clever bit: They built an "expert system." Instead of one massive AI trying to know
everything (like having one person be a doctor, lawyer, AND engineer), they have specialized experts that
only wake up when needed.

6/ Traditional models? All 1.8 trillion parameters active ALL THE TIME. DeepSeek? 671B total but only
37B active at once. It's like having a huge team but only calling in the experts you actually need for each
task.

7/ The results are mind-blowing:

- Training cost: $100M → $5M
- GPUs needed: 100,000 → 2,000
- API costs: 95% cheaper
- Can run on gaming GPUs instead of data center hardware

8/ "But wait," you might say, "there must be a catch!" That's the wild part - it's all open source. Anyone
can check their work. The code is public. The technical papers explain everything. It's not magic, just
incredibly clever engineering.

9/ Why does this matter? Because it breaks the model of "only huge tech companies can play in AI." You
don't need a billion-dollar data center anymore. A few good GPUs might do it.

1/2
10/ For Nvidia, this is scary. Their entire business model is built on selling super expensive GPUs with
90% margins. If everyone can suddenly do AI with regular gaming GPUs... well, you see the problem.

11/ And here's the kicker: DeepSeek did this with a team of <200 people. Meanwhile, Meta has teams
where the compensation alone exceeds DeepSeek's entire training budget... and their models aren't as
good.

12/ This is a classic disruption story: Incumbents optimize existing processes, while disruptors rethink the
fundamental approach. DeepSeek asked "what if we just did this smarter instead of throwing more
hardware at it?"

13/ The implications are huge:

- AI development becomes more accessible
- Competition increases dramatically
- The "moats" of big tech companies look more like puddles
- Hardware requirements (and costs) plummet

14/ Of course, giants like OpenAI and Anthropic won't stand still. They're probably already implementing
these innovations. But the efficiency genie is out of the bottle - there's no going back to the "just throw
more GPUs at it" approach.

15/ Final thought: This feels like one of those moments we'll look back on as an inflection point. Like
when PCs made mainframes less relevant, or when cloud computing changed everything.

AI is about to become a lot more accessible, and a lot less expensive. The question isn't if this will disrupt
the current players, but how fast.

/end

P.S. And yes, all this is available open source. You can literally try their models right now. We're living in
wild times!🚀
•••

Missing some Tweet in this thread? You can try to force a refresh

2/2

The Effortless Money Formula ChatGPT DeepSeek Edition (Esmam Khan Babu) (Z-Library)
No ratings yet
The Effortless Money Formula ChatGPT DeepSeek Edition (Esmam Khan Babu) (Z-Library)
83 pages
Chinas DeepSeek and The Criminal World of American AI
No ratings yet
Chinas DeepSeek and The Criminal World of American AI
53 pages
DeepSeek Artificial Intelligence
No ratings yet
DeepSeek Artificial Intelligence
6 pages
DeepSeek Master AI in 2025 - The Ultimate Guide To Outperform ChatGPT, Boost Productivity Future-Proof Your Skills Automate... (Sanchez, Cesar)
No ratings yet
DeepSeek Master AI in 2025 - The Ultimate Guide To Outperform ChatGPT, Boost Productivity Future-Proof Your Skills Automate... (Sanchez, Cesar)
50 pages
重庆市第一中学校2024 2025学年高三下学期3月月考英语试题
No ratings yet
重庆市第一中学校2024 2025学年高三下学期3月月考英语试题
13 pages
Run DeepSeek Models Locally in 5 Minutes
No ratings yet
Run DeepSeek Models Locally in 5 Minutes
10 pages
Deepseek Fin
No ratings yet
Deepseek Fin
13 pages
Cyberhaven Labs - 2025 AI Adoption & Risk Report
No ratings yet
Cyberhaven Labs - 2025 AI Adoption & Risk Report
17 pages
Deep Learning Cookbook
No ratings yet
Deep Learning Cookbook
24 pages
Why DeepSeek Is Great For AI and HPC and Maybe No Big Deal For Data Centers
No ratings yet
Why DeepSeek Is Great For AI and HPC and Maybe No Big Deal For Data Centers
7 pages
DeepSeek Ai Research
No ratings yet
DeepSeek Ai Research
3 pages
DeepSeek Unlocked - Tavian F Draven
No ratings yet
DeepSeek Unlocked - Tavian F Draven
131 pages
Introduction To AI
No ratings yet
Introduction To AI
27 pages
Anthropic Response To BIS-2025-0001 (Apr. 29 2025)
No ratings yet
Anthropic Response To BIS-2025-0001 (Apr. 29 2025)
13 pages
Artificial Intelligence FINAL LINKED
No ratings yet
Artificial Intelligence FINAL LINKED
112 pages
Guardian Weekly - 7 February 2025
No ratings yet
Guardian Weekly - 7 February 2025
64 pages
DeepSeek pzx2nv
No ratings yet
DeepSeek pzx2nv
56 pages
Sentence Rearrangement - Parajumbles No Annotation 17th Feb
No ratings yet
Sentence Rearrangement - Parajumbles No Annotation 17th Feb
71 pages
Nvidia Unfolds GPU, Interconnect Roadmaps Out To 2027
No ratings yet
Nvidia Unfolds GPU, Interconnect Roadmaps Out To 2027
9 pages
Global Data Center Market Trends in 2025 and The Impact of DeepSeek On Data Centers
No ratings yet
Global Data Center Market Trends in 2025 and The Impact of DeepSeek On Data Centers
4 pages
Full Stack - China's Evolving Industrial Policy For AI - RAND
No ratings yet
Full Stack - China's Evolving Industrial Policy For AI - RAND
28 pages
Manthan Volume 3
No ratings yet
Manthan Volume 3
49 pages
DeepSeek, TikTok, Temu - How China Is Taking The Lead in Tech - BBC World Service
No ratings yet
DeepSeek, TikTok, Temu - How China Is Taking The Lead in Tech - BBC World Service
4 pages
Monthly Magazine, Institute of Competitive Studies
No ratings yet
Monthly Magazine, Institute of Competitive Studies
38 pages
Top 20 Must Use AI Tools For Digital Marketing in 2025
No ratings yet
Top 20 Must Use AI Tools For Digital Marketing in 2025
29 pages
Deep Seek
No ratings yet
Deep Seek
2 pages
Affan 1
No ratings yet
Affan 1
24 pages
Preprints202503 1887 v1
No ratings yet
Preprints202503 1887 v1
30 pages
Deepseek-Ai - awesome-Deepseek-Integration - Integrate The DeepSeek API Into Popular Softwares
No ratings yet
Deepseek-Ai - awesome-Deepseek-Integration - Integrate The DeepSeek API Into Popular Softwares
10 pages
DeepSeek - Wikipedia
No ratings yet
DeepSeek - Wikipedia
23 pages
(English) Grok AI Is Exposing Everyone! - Is It Biased - Elon Musk - Dhruv Rathee (DownSub - Com)
No ratings yet
(English) Grok AI Is Exposing Everyone! - Is It Biased - Elon Musk - Dhruv Rathee (DownSub - Com)
19 pages
How China's New AI Model DeepSeek Is Threatening U S Dominance
No ratings yet
How China's New AI Model DeepSeek Is Threatening U S Dominance
26 pages
Mint Delhi 14-03
No ratings yet
Mint Delhi 14-03
17 pages
Build Robust RAG Systems Using DeepSeek-R1 & LangChain! - by Pavan Belagatti - Feb, 2025 - Level Up Coding
No ratings yet
Build Robust RAG Systems Using DeepSeek-R1 & LangChain! - by Pavan Belagatti - Feb, 2025 - Level Up Coding
20 pages
How To Prompt DeepSeek The ChatGPT Killer 1738331990
No ratings yet
How To Prompt DeepSeek The ChatGPT Killer 1738331990
16 pages
A Technical Primer On Deepseek
No ratings yet
A Technical Primer On Deepseek
18 pages
Deepseek AI How This Remarkable Technology Changed The World
No ratings yet
Deepseek AI How This Remarkable Technology Changed The World
14 pages
How I Built My Own AI Web Agent (And Saved Hundreds A Month!) - by Algo Insights - Coding Nexus - Apr, 2025 - Medium
No ratings yet
How I Built My Own AI Web Agent (And Saved Hundreds A Month!) - by Algo Insights - Coding Nexus - Apr, 2025 - Medium
15 pages
The DeepSeek Series A Technical Overview
No ratings yet
The DeepSeek Series A Technical Overview
11 pages
How DeepSeek-R1 Was Built - Architecture and Training Explained
No ratings yet
How DeepSeek-R1 Was Built - Architecture and Training Explained
12 pages
DeepSeek-Coder-v2 - The BEST Opensource Coding LLM! (Beats GPT-4o and Claude 3.5 Sonnet) (DownSub - Com)
No ratings yet
DeepSeek-Coder-v2 - The BEST Opensource Coding LLM! (Beats GPT-4o and Claude 3.5 Sonnet) (DownSub - Com)
14 pages
AIfB 2503 Web
No ratings yet
AIfB 2503 Web
9 pages
A Comparative Study of DeepSeek and Other Ai Tools
No ratings yet
A Comparative Study of DeepSeek and Other Ai Tools
8 pages
DeepSeek Models Explained
No ratings yet
DeepSeek Models Explained
11 pages
Big Tech in Panic Mode, Because of Deepseek
No ratings yet
Big Tech in Panic Mode, Because of Deepseek
11 pages
Deepseek Meeting Points
No ratings yet
Deepseek Meeting Points
12 pages
The New King of AI Coding
No ratings yet
The New King of AI Coding
8 pages
Meta Vs DeepSeek
No ratings yet
Meta Vs DeepSeek
10 pages
Đề Thực Chiến Dự Đoán Số 4 - Khóa Cấp Tốc
No ratings yet
Đề Thực Chiến Dự Đoán Số 4 - Khóa Cấp Tốc
6 pages
DeepSeek pdf-1
No ratings yet
DeepSeek pdf-1
7 pages
DEEPSEEK
No ratings yet
DEEPSEEK
10 pages
02.25.25 Adv Egan Deepseek Ai Blog Post
No ratings yet
02.25.25 Adv Egan Deepseek Ai Blog Post
6 pages
DeepSeek AI: The Emerging Force in The AI Revolution
No ratings yet
DeepSeek AI: The Emerging Force in The AI Revolution
3 pages
GPT4架构揭秘
No ratings yet
GPT4架构揭秘
12 pages
Chart of The Week - EuroBull
No ratings yet
Chart of The Week - EuroBull
4 pages
DeepSeek Pioneers A New Way For AI To Reason'
No ratings yet
DeepSeek Pioneers A New Way For AI To Reason'
5 pages
OpenAI O3 Tries To Curb Stomp DeepSeek... (English (Auto-Generated) ) (DownloadYoutubeSubtitles - Com)
No ratings yet
OpenAI O3 Tries To Curb Stomp DeepSeek... (English (Auto-Generated) ) (DownloadYoutubeSubtitles - Com)
5 pages
Obsolescence in AI
No ratings yet
Obsolescence in AI
5 pages
Deep Seek
No ratings yet
Deep Seek
6 pages
NITI Aayog Discussion Paper - Latest 6 Feb 25
No ratings yet
NITI Aayog Discussion Paper - Latest 6 Feb 25
5 pages
Deep Learning Research Paper
No ratings yet
Deep Learning Research Paper
4 pages
31.DL - Post-Proofreading 1-5
No ratings yet
31.DL - Post-Proofreading 1-5
5 pages
A Survey of DeepSeek Models
No ratings yet
A Survey of DeepSeek Models
6 pages
Deepseek Amv
No ratings yet
Deepseek Amv
6 pages
How She Turned 5000 Into 22 Million and How You Might Too
No ratings yet
How She Turned 5000 Into 22 Million and How You Might Too
4 pages
JPM Tech HW Semis Thi 2025-01-25 4895303
No ratings yet
JPM Tech HW Semis Thi 2025-01-25 4895303
4 pages
What Is DeepSeek - and How Is It Upending A.I. - The New York Times
No ratings yet
What Is DeepSeek - and How Is It Upending A.I. - The New York Times
6 pages
GPT-4 Architecture, Infrastructure, Training Dataset, Costs, Vision, MoE
No ratings yet
GPT-4 Architecture, Infrastructure, Training Dataset, Costs, Vision, MoE
4 pages
Origin of The Surname He
No ratings yet
Origin of The Surname He
3 pages
Research China - 280125
No ratings yet
Research China - 280125
6 pages
AI Report
No ratings yet
AI Report
3 pages
AshniSingh DeepSeek-R1+FP8Training 02-16-25
No ratings yet
AshniSingh DeepSeek-R1+FP8Training 02-16-25
3 pages
The Real Meaning of The DeepSeek Drama - The Economist
No ratings yet
The Real Meaning of The DeepSeek Drama - The Economist
4 pages
What DeepSeek Means For Chinas AI
No ratings yet
What DeepSeek Means For Chinas AI
4 pages
Deepsek 1
No ratings yet
Deepsek 1
2 pages
The AI Revolution Triggered by DeepSeek AI
No ratings yet
The AI Revolution Triggered by DeepSeek AI
2 pages
Deep Seek
No ratings yet
Deep Seek
2 pages
Report Deepseek r1
No ratings yet
Report Deepseek r1
2 pages
First Newsletter Cglug
No ratings yet
First Newsletter Cglug
3 pages
DeepSeek AI
No ratings yet
DeepSeek AI
2 pages
DeepSeeks Success Will Undermine The US-China Tech War
No ratings yet
DeepSeeks Success Will Undermine The US-China Tech War
2 pages
DeepSeek R1
No ratings yet
DeepSeek R1
2 pages
DeepSeek's Impact Is Huge - But It's Not Game Over For US Rivals
No ratings yet
DeepSeek's Impact Is Huge - But It's Not Game Over For US Rivals
2 pages
Investing Lessons by Chuck Akre
No ratings yet
Investing Lessons by Chuck Akre
1 page
Investing Lessons by John Bogle
No ratings yet
Investing Lessons by John Bogle
1 page
Investing Lessons by Benjamin Graham
No ratings yet
Investing Lessons by Benjamin Graham
1 page
Investing Lessons by Ray Dalio
No ratings yet
Investing Lessons by Ray Dalio
1 page
DeepSeek's Revolutionary AI Architecture Overview
No ratings yet
DeepSeek's Revolutionary AI Architecture Overview
1 page
What Is DeepSeek China's AI Breakthrough, and Why It's Hammering Tech Stocks Explained - WSJ
No ratings yet
What Is DeepSeek China's AI Breakthrough, and Why It's Hammering Tech Stocks Explained - WSJ
1 page

thread/1883686162709295541 HTML

Uploaded by

thread/1883686162709295541 HTML

Uploaded by

threadreaderapp.com /thread/1883686162709295541.

🧵 Finally had a chance to dig into DeepSeek’s r1…

7/ The results are mind-blowing:

13/ The implications are huge:

You might also like