Training Large Language Models: Practices and Research Questions

On large language models and transformers: perspectives from physics, neuroscience, and theory

RLHF: How to Learn from Human Feedback with Reinforcement Learning

Seja Gentil com os Pequenos Animais 😿

Who’s the Real Dad Doll Squid? Can You Guess in 60 Seconds? | Roblox 3D

didn't manage to catch the ball #tiktok

How I Turned a Lolipop Into A New One 🤯🍭

Training Large Language Models: Practices and Research Questions

Рет қаралды 376

Simons Institute

Simons Institute

Күн бұрын

Danqi Chen (Princeton University)
simons.berkele...
Special Year on Large Language Models and Transformers: Part 1 Boot Camp
In this tutorial, I will provide a comprehensive walk-through of the pipeline for training large language models, covering both pre-training and post-training phases. My goal is to discuss the best practices at each stage of training as known today-drawing from open models and public research papers-including data curation, training algorithms, and safety mitigations. The tutorial aims to serve as a starting point to facilitate discussions on the open research questions in training the next generation of large language models.

Пікірлер

On large language models and transformers: perspectives from physics, neuroscience, and theory

1:32:41

On large language models and transformers: perspectives from physics, neuroscience, and theory

Simons Institute

Рет қаралды 1,2 М.

RLHF: How to Learn from Human Feedback with Reinforcement Learning

59:17

RLHF: How to Learn from Human Feedback with Reinforcement Learning

Cooperative AI Foundation

Рет қаралды 6 М.

Seja Gentil com os Pequenos Animais 😿

00:20

Seja Gentil com os Pequenos Animais 😿

Los Wagners

Рет қаралды 31 МЛН

Who’s the Real Dad Doll Squid? Can You Guess in 60 Seconds? | Roblox 3D

00:34

Who’s the Real Dad Doll Squid? Can You Guess in 60 Seconds? | Roblox 3D

Minec Music Short

Рет қаралды 27 МЛН

didn't manage to catch the ball #tiktok

00:19

didn't manage to catch the ball #tiktok

Анастасия Тарасова

Рет қаралды 33 МЛН

How I Turned a Lolipop Into A New One 🤯🍭

00:19

How I Turned a Lolipop Into A New One 🤯🍭

Wian

Рет қаралды 11 МЛН

Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained

8:55

Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained

AI Coffee Break with Letitia

Рет қаралды 23 М.

Harvard Professor Explains Algorithms in 5 Levels of Difficulty | WIRED

25:47

Harvard Professor Explains Algorithms in 5 Levels of Difficulty | WIRED

WIRED

Рет қаралды 3 МЛН

CS224V Fall 2024 Lecture 1: Introduction on 9 23 2024 Mon

1:29:00

CS224V Fall 2024 Lecture 1: Introduction on 9 23 2024 Mon

StanfordCSVideos

Рет қаралды 893

What is generative AI and how does it work? - The Turing Lectures with Mirella Lapata

46:02

What is generative AI and how does it work? - The Turing Lectures with Mirella Lapata

The Royal Institution

Рет қаралды 1 МЛН

AI, Machine Learning, Deep Learning and Generative AI Explained

10:01

AI, Machine Learning, Deep Learning and Generative AI Explained

IBM Technology

Рет қаралды 370 М.

Toward Understanding In-context Learning

1:29:35

Toward Understanding In-context Learning

Simons Institute

Рет қаралды 586

What are AI Agents?

12:29

What are AI Agents?

IBM Technology

Рет қаралды 542 М.

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

15:21

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

Entry Point AI

Рет қаралды 92 М.

ICML 2024 Tutorial: Physics of Language Models

1:53:43

ICML 2024 Tutorial: Physics of Language Models

Zeyuan Allen-Zhu

Рет қаралды 28 М.

LLM vs NLP | Kevin Johnson

10:36

LLM vs NLP | Kevin Johnson

dscout

Рет қаралды 17 М.

Seja Gentil com os Pequenos Animais 😿

00:20

Seja Gentil com os Pequenos Animais 😿

Los Wagners

Рет қаралды 31 МЛН