How Gen AI Models Are Trained

31 views

The AI Engineering

A day ago

In this video, we'll explore the fascinating process of training generative AI models like GPT. There are three main stages:
1. Generative Pre-Training:
Goal: Knowledge Creation
Process: Training on vast amounts of internet text to learn language patterns.
Resources: 12 days of training, ~10 TB of data, ~6,000 GPUs.
Outcome: Generates text, summarizes, and performs sentiment analysis.
2. Supervised Fine-Tuning (SFT):
Goal: Becoming a Helpful Assistant
Process: Fine-tuning with curated human conversation data.
Resources: 1 day, ~100K request-response pairs.
Outcome: Generates appropriate and socially acceptable responses.
3. Reinforcement Learning from Human Feedback (RLHF):
Goal: Aligning with Human Preferences
Process: Learning from human feedback on model responses, such as comparisons between alternative answers.
Resources: Weeks to months, ~100K to 1M comparison labels, 10K to 100K prompts.
Outcome: Aligns responses closely with human preferences.
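The three stages can be made a little more concrete by looking at the loss each one typically minimizes. The sketch below is a toy illustration in plain Python, not the method shown in the video: it assumes pre-training and SFT both use next-token cross-entropy (with SFT usually scoring only the response tokens), and that the comparison labels in RLHF train a reward model with a pairwise loss. The function names `next_token_loss` and `preference_loss` and all the numbers are made up for the example.

```python
import math


def softmax(logits):
    """Turn raw scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def next_token_loss(logits_per_position, target_ids, loss_mask=None):
    """Mean cross-entropy of the correct next token.

    loss_mask is an optional list of 0/1 flags; positions flagged 0 are
    skipped. Pre-training scores every position, while SFT commonly
    masks out the prompt so only response tokens are scored.
    """
    if loss_mask is None:
        loss_mask = [1] * len(target_ids)
    total, count = 0.0, 0
    for logits, target, keep in zip(logits_per_position, target_ids, loss_mask):
        if keep:
            total += -math.log(softmax(logits)[target])
            count += 1
    if count == 0:
        raise ValueError("loss_mask removed every position")
    return total / count


def preference_loss(reward_chosen, reward_rejected):
    """Pairwise loss for one comparison label.

    Equals -log(sigmoid(reward_chosen - reward_rejected)): it is small
    when the reward model scores the human-preferred answer higher.
    """
    return math.log(1.0 + math.exp(-(reward_chosen - reward_rejected)))


# Toy data: 3 positions, vocabulary of 3 tokens (hypothetical values).
logits = [[2.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 2.0]]
targets = [0, 1, 2]

pretrain_loss = next_token_loss(logits, targets)               # all tokens
sft_loss = next_token_loss(logits, targets, loss_mask=[0, 0, 1])  # response only
good_ranking = preference_loss(2.0, 0.0)   # preferred answer scored higher
bad_ranking = preference_loss(0.0, 2.0)    # preferred answer scored lower

print(pretrain_loss, sft_loss, good_ranking, bad_ranking)
```

In a full RLHF pipeline the trained reward model is then used to optimize the assistant with a reinforcement learning algorithm; that step is left out here to keep the sketch short.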
Join us as we dive deep into how these stages transform a basic language model into a sophisticated conversational agent!
Newsletter:
Subscribe now to my newsletter "The Ai Discovery" (www.newsletter...) and get the free eBook "Ultimate Resources for Generative AI".
