Just binged the entire playlist, helped me understand the intuitions behind the math. I hope you make more videos :)
@goelnikhils 2 years ago
Thanks a lot, Lennart. What a crisp and clear explanation of BERT.
@rickyebay 2 years ago
This is the best explanation of Transformers I have found on the web. Can you do another set of videos for T5?
@jeremyyd1258 a year ago
Excellent video! Thank you!
@JsaintUK 2 years ago
Great video. Are the original word embeddings simple static embeddings? Where do they come from?
@lennartsvensson7636 2 years ago
They are "simple static embeddings". It is common to train them along with the other parameters.
@JsaintUK 2 years ago
@@lennartsvensson7636 Okay, so they could be something such as Word2vec embeddings? These are then passed into the encoder, where they are contextualised?
@lennartsvensson7636 2 years ago
@@JsaintUK They could be, but more commonly they are trained along with all the other network parameters.
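The exchange above can be sketched in code. This is a minimal illustration (not from the video) of the two options discussed: a static embedding table that is a trainable parameter learned jointly with the rest of the network, versus one initialized from pretrained vectors such as Word2vec. The vocabulary size, embedding dimension, and token ids are arbitrary stand-ins.

```python
import torch
import torch.nn as nn

# Hypothetical sizes chosen for illustration.
vocab_size, embed_dim = 1000, 16

# Option 1: a static embedding table, trained jointly with the network.
# The lookup itself is static: the same token id always maps to the
# same vector; contextualisation happens later, inside the encoder.
embedding = nn.Embedding(vocab_size, embed_dim)

token_ids = torch.tensor([[1, 42, 7]])     # a toy "sentence" of 3 tokens
static_vectors = embedding(token_ids)      # shape: (1, 3, 16)

# Option 2: initialize the table from pretrained vectors (e.g. Word2vec)
# and optionally fine-tune it. Random tensor used here as a stand-in.
pretrained = torch.randn(vocab_size, embed_dim)
embedding_w2v = nn.Embedding.from_pretrained(pretrained, freeze=False)

print(static_vectors.shape)
```

In either case the encoder receives one fixed vector per token id and produces contextualised representations from them.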