GPT-1 Paper Explained

  Рет қаралды 9,996

Aladdin Persson

Aladdin Persson

Күн бұрын

Пікірлер: 13
@AladdinPersson
@AladdinPersson 3 жыл бұрын
I saw some people wanted from scratch implementations & particularly on previous BERT video. Implementing these (correctly) takes a lot of time so it's not always worth the time investment so we'll see if I'll do it.. Right now I'm focusing more on Kaggle and will have more videos on that soon :)
@jeffw991
@jeffw991 3 жыл бұрын
Great video! If you're able to do the GPT-2 and GPT-3 papers as well that would be great, but this is a solid start to understanding how GPT differs from transformers. I tend to bounce off of the dry academic papers; sometimes it feels like some of them don't *want* to be understood. But your explainer videos have really helped me get a grip on the concepts so I can go back to the paper and get more out of the details.
@tejasvix
@tejasvix 3 жыл бұрын
I like your paper explanation videos very much, they helped me a lot in reading paper as i am a newbie , Would love to see more videos , Thanx a lot
@Fr4nk4000
@Fr4nk4000 Жыл бұрын
I remember watching this video around when it came out. It's insane to see how far we've come in language models
@halilhelvaci
@halilhelvaci 2 жыл бұрын
Excellent video! Hope to see it for the other language models as well
@stephennfernandes
@stephennfernandes 3 жыл бұрын
Hey BERT and GPT1 use sentencepice representation of sentences while training , but how do they fine tune and work with POS tagging task as they require to classify a word based on the enter word per POS token ?? any idea how its done ???
@user-sg4uq4nj1n
@user-sg4uq4nj1n 2 жыл бұрын
Great explanation! I got a dumb question regarding section 3.1 equation 2 in their paper. It says "h_l = transformer_block(h_{l-1} \forall i \in [1,n])". Is it supposed to be "l" instead of "i"?
@garikhakobyan3013
@garikhakobyan3013 3 жыл бұрын
Good work. Waiting for new videos
@rog0079
@rog0079 3 жыл бұрын
So are you planning to code GPT from scratch ?
@shambhaviaggarwal9977
@shambhaviaggarwal9977 3 жыл бұрын
Hey! Can you make a video on pytorch-lightning? I think it is super useful.
@abhishek_maity
@abhishek_maity 3 жыл бұрын
So are you going to put more video on solving the Kaggle problem Use cases??(ML or NLP using DL) ?
@AladdinPersson
@AladdinPersson 3 жыл бұрын
Yes, my plan is to do solutions to previous Kaggle competitions and my goal is to show how to get Top 1% solution on the leaderboard and do it intuitively, step by step of how I approached the problem. Right now I want to focus on computer vision tasks on Kaggle but I think I'm going to do NLP & Tabular data competitions etc
@abhishek_maity
@abhishek_maity 3 жыл бұрын
@@AladdinPersson Great!! will be eagerly waiting for your amazing content on these !! ❤️😃😃
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4,2 МЛН
GPT-1 | Paper Explained & PyTorch Implementation
18:46
Maciej Balawejder
Рет қаралды 6 М.
VIP ACCESS
00:47
Natan por Aí
Рет қаралды 30 МЛН
Леон киллер и Оля Полякова 😹
00:42
Канал Смеха
Рет қаралды 4,7 МЛН
СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️
01:01
DO$HIK
Рет қаралды 3,3 МЛН
“Don’t stop the chances.”
00:44
ISSEI / いっせい
Рет қаралды 62 МЛН
BERT Paper Explained
14:39
Aladdin Persson
Рет қаралды 16 М.
Chat GPT Rewards Model Explained!
17:56
CodeEmporium
Рет қаралды 18 М.
NeRFs: Neural Radiance Fields - Paper Explained
20:14
Aladdin Persson
Рет қаралды 38 М.
ChatGPT: от новичка до PRO за полчаса
38:21
DiazBarnz
Рет қаралды 538 М.
Attention in transformers, visually explained | DL6
26:10
3Blue1Brown
Рет қаралды 2 МЛН
What It's Like To be a Computer: An Interview with GPT-3
16:17
Eric Elliott
Рет қаралды 4,3 МЛН
Two AIs Discuss Their Loneliness and Immortality. (GPT-3)
7:07
GPT3: An Even Bigger Language Model - Computerphile
25:57
Computerphile
Рет қаралды 435 М.
30 Year History of ChatGPT
26:55
Art of the Problem
Рет қаралды 1,1 МЛН
VIP ACCESS
00:47
Natan por Aí
Рет қаралды 30 МЛН