Writer CEO May Habib talks utilizing synthetic data to train AI models

  Рет қаралды 8,852

CNBC Television

CNBC Television

Күн бұрын

May Habib, Writer CEO, joins 'Closing Bell Overtime' to talk the companies new AI model and how they are innovating AI training.

Пікірлер: 15
@TaskSwitcherify
@TaskSwitcherify 2 күн бұрын
How do you avoid Garbage In - Garbage Out? When training on synthetic data, some of which is already hallucinated, flawed, and misinformed, don't you get even more hallucinated and "machined" outputs and a form of data poisoning?
@Maioubi
@Maioubi Күн бұрын
Like humans, it's easier for AI to know something is good than produce the good thing. AI is very good at "labeling" stuff (classification) and the tech for that is much more mature from the early days of deep learning. Synthetic data allows the AI to convert its classifying ability into greater intelligence by generating tons of examples and discarding the bad output. The next model will then be slightly smarter and better at generating and labeling. It's not perfect but it can go very far if you have vast computing power, and we're not close to the ceiling.
@KK-pm7ud
@KK-pm7ud 4 күн бұрын
Sounds too good to be true
@joe_hoeller_chicago
@joe_hoeller_chicago 3 күн бұрын
Synthetic data doesn’t work as good as you think for real world tasks, esp within domains that require you understand a context within a context.
@alshiferaw925
@alshiferaw925 4 күн бұрын
The entire talk the lady said was a bunch of air.
@sim-racer
@sim-racer 2 күн бұрын
Not really. She is right, smaller models perform much better when trained with high quality synthetic data generated from LLMs.
@Cellardoor187
@Cellardoor187 Күн бұрын
No she did not, she is on point and this is a very smart venture. that "bunch of air" she produced got her a 2B dollar valuation. So perhaps get off your high horse.
@mymusicpublisher
@mymusicpublisher Күн бұрын
Not really. Her voice throws me off though.
@bluesque9687
@bluesque9687 Күн бұрын
I like your blonde hairstyle and the big ring earrings! A blast from the past!
@MrDonald911
@MrDonald911 Күн бұрын
Research already showed it doesnt work unfortunately.
@DanielKwan-b7g
@DanielKwan-b7g 2 күн бұрын
Training on synthetic data gets the illusion that a model works but bc it’s trained on fake data it’s less accurate lol. Lady, there is a reason why ppl dont want to go this route 😅
@Maioubi
@Maioubi Күн бұрын
Any synthetic data is reviewed by AI as well, and AI is better at knowing good from bad than making good, kinda like us humans. Bad output is discarded. This cycle isn't perfect but it definitely grows more accurate over time, not less so. Look at o1 by OpenAI, mostly trained by synthetic data.
@briandouglas7375
@briandouglas7375 Күн бұрын
Synthetic data will be flawed.
@Maioubi
@Maioubi Күн бұрын
It's easier for AI to know something is good than produce the good thing, just like humans. AI is very good at labeling stuff (classification) and the tech for that is much more mature from before deep learning. Synthetic data allows the AI to generalize its classification skills into greater intelligence by generating tons of examples and discarding the bad output. The next model will then be slightly smarter and better at generating and labeling. If you have enough computing power, this cycle seems to have no upper bound.
Why Elon Musk’s Robotaxi Is Such a Risky Bet for Tesla
9:24
Bloomberg Originals
Рет қаралды 366 М.
Where Are Laid Off Tech Employees Going? | CNBC Marathon
41:28
Don't look down on anyone#devil  #lilith  #funny  #shorts
00:12
Devil Lilith
Рет қаралды 46 МЛН
Officer Rabbit is so bad. He made Luffy deaf. #funny #supersiblings #comedy
00:18
Funny superhero siblings
Рет қаралды 19 МЛН
إخفاء الطعام سرًا تحت الطاولة للتناول لاحقًا 😏🍽️
00:28
حرف إبداعية للمنزل في 5 دقائق
Рет қаралды 78 МЛН
Good teacher wows kids with practical examples #shorts
00:32
I migliori trucchetti di Fabiosa
Рет қаралды 12 МЛН
This $10M U.S. Army Laser Melts Drones With $3 Beams | WSJ Equipped
6:12
The Wall Street Journal
Рет қаралды 663 М.
The AI already in your phone | BBC News
7:33
BBC News
Рет қаралды 138 М.
The David Rubenstein Show: Sundar Pichai
24:07
David Rubenstein
Рет қаралды 115 М.
How T.J. Maxx Disrupted The Retail Industry
9:39
CNBC
Рет қаралды 368 М.
Don't look down on anyone#devil  #lilith  #funny  #shorts
00:12
Devil Lilith
Рет қаралды 46 МЛН