Introduction to Mixture-of-Experts (MoE)

AI Papers Academy

In this video we go back to the highly influential Google paper that introduced the sparsely-gated Mixture-of-Experts (MoE) layer, with authors including Geoffrey Hinton.
The paper is titled "Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer." MoE is widely used today in various top Large Language Models, and interestingly, the paper was published at the beginning of 2017, while the "Attention Is All You Need" paper that introduced Transformers was published later that year, also by Google. The purpose of this video is to understand why the Mixture-of-Experts method is important and how it works.
Paper page - arxiv.org/abs/1701.06538
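For a concrete picture of the sparsely-gated MoE layer the video covers, here is a minimal PyTorch sketch of top-k routing over a set of expert networks: a gating network scores the experts per token, only the k highest-scoring experts run, and their outputs are combined with softmax weights (the paper applies the softmax after the top-k selection). All names and sizes here (SparseMoE, n_experts, d_model, etc.) are illustrative assumptions, not taken from the paper's code, and the sketch omits the paper's noisy gating and load-balancing loss.

```python
# Minimal sketch of a sparsely-gated MoE layer with top-k routing.
# Hypothetical names and sizes; not the paper's implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, d_model=64, d_hidden=256, n_experts=8, k=2):
        super().__init__()
        self.k = k
        # Each expert is a small feed-forward network.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.ReLU(),
                          nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        )
        # The gating network produces one score per expert for each token.
        self.gate = nn.Linear(d_model, n_experts)

    def forward(self, x):                      # x: (tokens, d_model)
        logits = self.gate(x)                  # (tokens, n_experts)
        # Keep only the top-k experts per token, then softmax their scores.
        topv, topi = logits.topk(self.k, dim=-1)
        weights = F.softmax(topv, dim=-1)      # (tokens, k)
        out = torch.zeros_like(x)
        for slot in range(self.k):
            idx = topi[:, slot]                # chosen expert per token
            w = weights[:, slot].unsqueeze(-1)
            for e, expert in enumerate(self.experts):
                mask = idx == e                # tokens routed to expert e
                if mask.any():
                    out[mask] += w[mask] * expert(x[mask])
        return out

x = torch.randn(10, 64)
print(SparseMoE()(x).shape)  # torch.Size([10, 64])
```

Because only k of the experts run per token, total parameters can grow with the number of experts while the compute per token stays roughly constant; this is the key idea behind the "outrageously large" networks in the title.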
-----------------------------------------------------------------------------------------------
✉️ Join the newsletter - aipapersacademy.com/newsletter/
👍 Please like & subscribe if you enjoy this content
-----------------------------------------------------------------------------------------------
Chapters:
0:00 Why is MoE needed?
1:33 Sparse MoE Layer
3:41 MoE Paper's Figure
