How DeepSeek Changes the LLM Story

  11,210 views

Sasha Rush

1 day ago

Quick-turnaround survey of DeepSeek v3 and DeepSeek R1, the two technical papers behind the recent open-source LLM news. Presented at the Simons Institute, Feb 3, 2025.
Slides: docs.google.co...
