Transformers are the state-of-the-art models nowadays, but how do they work?
This video explains and demystifies this novel neural-network architecture in an intuitive manner, with step-by-step explanations and illustrations of how transformers work.
It also explains the intuition behind the QUERY, KEY, and VALUE terminology in the attention mechanism.
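As a companion to the attention discussion, here is a minimal NumPy sketch of single-head scaled dot-product self-attention with explicit Query, Key, and Value projections. The matrix names (`Wq`, `Wk`, `Wv`) and sizes are illustrative assumptions, not values from the video.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention for one head.

    X: (seq_len, d_model) token embeddings.
    Wq, Wk, Wv: (d_model, d_k) projection matrices (hypothetical names).
    """
    Q = X @ Wq  # queries: what each token is looking for
    K = X @ Wk  # keys: what each token offers for matching
    V = X @ Wv  # values: the information actually mixed together
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # query-key similarity, scaled
    # softmax over the key axis so each row's weights sum to 1
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ V  # attention-weighted sum of values

# Toy usage: 4 tokens with model dimension 8
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
print(out.shape)  # one output vector per input token
```

Each output row is a mixture of the value vectors, weighted by how well that token's query matches every token's key.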
Chapters
0:00 Introduction
0:28 High Level Working Overview of Encoder & Decoder
1:34 Encoder - Decoder Flow
3:03 Can we have ONLY encoder or ONLY decoder based architectures?
5:22 The ENCODER Components
6:06 Why Self-Attention?
6:48 How to compute a Self-Attention Mechanism?
9:31 Intuition behind Query, Key and Value Terminology
11:04 Feed-Forward Layer
11:53 Layer Normalization
13:13 Positional Embeddings
14:48 Classification Head
15:41 The Decoder
#transformers #datascience #machinelearning #encoder #decoder #neuralnetwork