Reinforcement Fine-Tuning-12 Days of OpenAI: Day 2

  221,100 views

OpenAI

1 day ago

Comments: 745
@lavaboots1337 6 days ago
Millions of dollars for staff. Billions for Nvidia chips. Sound recorded with a potato - priceless.
@staffanlundberg 6 days ago
Sorry, I guess you're right: I was too busy following their storyline to notice. But I WILL make an effort to check this on their next video. 😏
@fegyenc 6 days ago
@staffanlundberg please make it better next time; the sound and lights are so amateur. It is not your level...
@jadioj 6 days ago
Lmfao
@Cyber_Jan78 6 days ago
This is good, their focus is the product not bla bla marketing
@Cyber_Jan78 6 days ago
But I like your potato 🥔 reference; even o1 wouldn’t come up with that … or would it …🧐
@HelamanGile 6 days ago
Do you guys need to hire a sound guy? If so, I am available and have lots of experience.
@upgradeu2 6 days ago
30 sec pitch go
@sammy45654565 6 days ago
@upgradeu2 they need a better sound guy to hear it
@Lighto222 6 days ago
Hello fellow sound guy
@A123-d8o 6 days ago
make sure to charge them $200 a month and watch them cancel roflmao
@WayOfTheZombie 6 days ago
At least just boost it by about 2 to 4 dB
@theoz1 6 days ago
Recap
→ Reinforcement Fine-Tuning (RFT) preview announced, allowing fine-tuning of o1 models on custom datasets using reinforcement learning (launching publicly next year).
→ Applications of RFT include creating expert models for domains like law, finance, healthcare, and engineering (e.g., partnership with Thomson Reuters for a legal assistant).
→ o1 Mini + RFT shown to outperform the full o1 model on specific tasks while being smaller, faster, and cheaper.
→ Alpha program for RFT expansion now open to select organizations working on complex tasks.
RFT allows models to "think and learn" in new ways with just a few examples, offering potential for domain-specific AI advancements. If you're in research, enterprise level, or need custom AI, this may be useful. However for most users this is advanced so most won't likely use RFT or need to.
@ridrugo182 6 days ago
Doin' the lord's work right here.
@garette8672 6 days ago
appreciate this
@callmebiz 6 days ago
🫡
@staffanlundberg 6 days ago
"However for most users this is advanced so most won't likely use RFT or need to." I believe these procedures will be simplified over time so that most people will be able to use them. It is a way to develop better knowledge in essentially any subject that you are "specialized" in. Hey, ChatGPT is trying to turn us all into researchers! Love it! 🤗
@theoz1 6 days ago
@staffanlundberg so true! Over time all of these features become easier and easier to use. "Turning us all into researchers" - I like that take!
@ChrizzeeB 6 days ago
Can they fine-tune their microphones?
@TheHawkeyede 6 days ago
that was a good one ;D
@A-Hit 6 days ago
Sound recorded by a newbie. They need a professional sound guy. Pick me.
@xevil21 6 days ago
@A-Hit OpenAI picks AI, not you.
@jmg9509 5 days ago
Lmao
@qijia4769 3 days ago
I suppose o1 pro could improve the sound quality with one prompt.
@robertniyazoff2591 6 days ago
The guy on the very left is a genius. Bro has worked at Jane Street as a quant, at Microsoft as a SWE, and has been at OpenAI for 6 years and going.
@o1-preview 6 days ago
pretty smart lad indeed
@farisitum9594 6 days ago
Our left or their left
@combatninjaturtle 6 days ago
@farisitum9594 the guy on the other end is a computational biologist. So it’s obvious.
@Ragingwasabi9000 6 days ago
Which end is the computational biologist at?
@combatninjaturtle 6 days ago
He was an intern at Microsoft and Jane Street. He is sharp nonetheless.
@Luxcium 6 days ago
next time they should have a better sound don’t you guys think??? 🤔
@CaioBrazVS 6 days ago
Yeah, plsss!
@sanchitkaul5084 6 days ago
Turn off "stable volume" in settings
@orterves 6 days ago
Billions of dollars, dollar store microphones
@emilyd4385 6 days ago
100% the static noise and muffled mics are distracting to the incredible work they're showcasing
@akhilj307 6 days ago
Yes - it’s muffled
@npc-drew 6 days ago
Day 3: Audio o1 -- Delivers movie-studio-like audio quality.
@o1-preview 6 days ago
that would be nice
@thepresistence5935 5 days ago
brhhhhhh
@Mastertingus 3 days ago
You almost guessed, except it's video and not audio..
@JohnDoe-fh9kn 6 days ago
so cool to watch this all unfold. the work you guys do is incredible
@sunainakhedekar6802 1 day ago
all these people complaining about voice quality. honestly the people who are 100% invested in the actual content don’t bother to be bugged by something like this. I honestly didn’t realise something was off until I read the comments. It’s rather a clever filtration tactic among the audience watching!
@AbhilashKorraprolu 6 days ago
It's cute and adorable how you guys have the script in front of you, but react like you are hearing things for the first time :P Cheers guys
@TheRealTommyR 5 days ago
that is exactly what actors are paid to do.
@McDonaldsCalifornia 6 days ago
Lol I love how the expert didn't sound excited at all. "Yeah might be cool in the future. Getting better, can't really be compared to existing tools"
@kairi4640 6 days ago
And the woman is just like 😃 to make up for his lack of excitement. 😂
@martymarl4602 6 days ago
Basically they are saying "help train our models to replace you" with a smile. Not so that you can get more done (short term), but so that OpenAI can use your data to compete in your area of expertise (long term, 1 year from now)
@o1-preview 6 days ago
@martymarl4602 nah, horse drivers were scared to lose their jobs to cars. It's not about the tool, it's about who is driving the tool.
@o1-preview 6 days ago
reminds me of gpt2 days, it will be cool in the future
@martymarl4602 6 days ago
@o1-preview If you don't realise that the "tool" will be superior to the "Drivers" shortly, you're on the wrong page mate
@Dan-e6r7s 6 days ago
OpenAI's $157B valuation suggests that there is no money left for good sound quality.
@SurrealDistractions 6 days ago
They should replace their current sound engineer with AI
@o1-preview 6 days ago
lol, I feel bad for whoever it is, the comment section is mostly about this, I bet it'll be his/her hero arc moment
@zurgmuckerberg 6 days ago
There's probably no sound engineer, just their internal IT guys.
@OrofinX 6 days ago
Yes, I'm in shock 😮. Just put a mic on their clothes.
@A-Hit 6 days ago
Pick me
@GenAIWithNandakishor 6 days ago
Fix the fkknn sound
@BrianMosleyUK 5 days ago
Amazing the groundbreaking functionality being presented here, and so many people are thinking about the sound quality. Tumbleweed.
@naft3R 6 days ago
Please update so o1 has access to project files. It makes it very difficult to work on medium-size projects when you can't provide enough content due to the number of files
@BruceHartford3 6 days ago
Looks like that feature is included in the latest update. :)
@naft3R 6 days ago
@BruceHartford3 it only supports images
@jr-hp7er 6 days ago
You can only upload images, not files, e.g. a PDF can't be attached @BruceHartford3
@101RealTalker 6 days ago
What are your metrics for a "medium-sized" project?... I have over 3.5 million words worth of research representing a single project, what size would that qualify as in comparison?
@naft3R 6 days ago
@101RealTalker a model needs full context of a web app in order to make changes and not break the code. Right now all you can do is copy paste code, but other languages support project files.
@ilyasalcantara6450 6 days ago
Start using AI to give you guys tips on how to get proper audio
@Iwantalloftheinformation 6 days ago
It makes them look like they're still working out of a garage. They're definitely not anymore.
@watchingvideos9871 6 days ago
@Iwantalloftheinformation what lol
@seanmikejaffer423 5 days ago
Thanks a lot for the valuable feedback
@b2brish 6 days ago
OpenAudioAI might be the next billion-dollar industry!
@cjgoeson 6 days ago
Nice sip noise 0:08
@jasondisney 6 days ago
Followed by *gulp*
@TheCosmy2012 6 days ago
Good soup
@alejandromedina1019 6 days ago
@TheCosmy2012 mmtsssaaaaahhhh
@allenlawson9872 6 days ago
Literally made me stop the video LOL
@GeoMeridium 6 days ago
We need some Vic Berger style edits of these releases
@千修-n8x 4 days ago
My understanding of OpenAI’s reinforcement learning is: it establishes a foundational cognitive framework (understanding the essence of the world) through the scoring model and leverages reinforcement learning and experience accumulation to continuously optimize the AI’s reasoning and decision-making capabilities. In other words, it learns the logical thinking of experts in each field.
@KaizenKaizen-pro 6 days ago
Good morning, My name is Mohamed Abdallah, and I am a young person passionate about artificial intelligence. Since discovering OpenAI, I have been captivated by your vision and projects. You inspired me to take a deep interest in this field and develop my own AI, which I called "Kaizen". My biggest dream is to one day work at OpenAI. Your commitment to innovation and positive impact has motivated me to push myself and imagine solutions that could, in turn, transform people's lives. Thank you for showing enthusiasts like me what is possible. I hope to have the opportunity, in the future, to contribute to your incredible projects. Sincerely, Mohamed Abdallah
@KaiPlayz7689 5 days ago
Good morning, The comment section is not an email section 💀
@bluestar1234able 5 days ago
@KaiPlayz7689 he doesn't know we're all being replaced soon 😔
@ozanaydn4705 5 days ago
Blud even made AI write this 😅
@KaiPlayz7689 5 days ago
@bluestar1234able I do, and I know this is written by ChatGPT
@jaketron.seattle 6 days ago
"our models will bring a new breakthrough in healthcare, physics and mathematics" me using o1 to write standup comedy bits about my workplace, while it simultaneously teaches me Spanish phrases "Mis colegas están como cabras"😂😂😂
@IceMetalPunk 6 days ago
4o is definitely a much better model for creative tasks like that. And cheaper, too.
@PatrickBulteel 6 days ago
Your colleagues are like goats?
@jaketron.seattle 6 days ago
@PatrickBulteel 🤣😂😂😂 that joke was made by the most advanced model in the world, Mr!
@R.E-O 6 days ago
@PatrickBulteel It means they are crazy.
@IceMetalPunk 6 days ago
@R.E-O You sure it doesn't mean they eat anything they find in the trash?
@markmatzke 5 days ago
Impressive results from fine-tuning! The performance improvements in Top-1, Top-5, and Top-at-Max accuracy are exciting, especially in the context of such complex bioinformatics tasks. The clear visualization really helps to showcase the impact of reinforcement learning. Looking forward to seeing how this approach evolves and what new applications it could be used for in healthcare and other fields. Great work!
@madeline-onassis 6 days ago
awesome new developments big brains delivering big prizes to humanity
@halbarba 6 days ago
At OpenAI's conferences, especially during their '12 Days of OpenAI', they seem to have developed not just advanced AI, but also mysteriously bottomless coffee cups
@Silus1008 6 days ago
I hadn't realized that OpenAI had come so far in AI video generation! These guys struck me basically as human, all four of them!
@micbab-vg2mu 6 days ago
I work for a pharmaceutical company in the medical department, and I cannot wait to test the Reinforcement Fine-Tuning. Thank you!
@FrancisZhou-thecrocodilekeeper 6 days ago
When will the video call and screen sharing features come out? Have been waiting for a looooong time...
@fiddlestix26 6 days ago
Literally like has that been scrapped or what?! Also I feel like I’m always the only one that is ever asking where tf that feature is.
@epicboy330 6 days ago
@FrancisZhou-thecrocodilekeeper you have to realize how much computing power that would take to do on a global server. It’s probably years out before it’ll release and be a reasonable price. That’s simply a hardware limitation. The biggest problem in tech rn is that our software capabilities far exceed our hardware
@fiddlestix26 6 days ago
@epicboy330 ummm they literally demoed it in a keynote presentation like a year ago and said it would be available in the “coming weeks” and then just never mentioned it again.
@modalmixture 3 days ago
What's quite interesting to me is that they're doing bioinformatics ML, the kind of stuff you would do with traditional deep learning methods or maybe a reinforcement learning model like AlphaFold, but they're using a pre-trained *language* model, and not only does it do pretty well, it can explain its reasoning! Shows that these frontier LLMs really can generalize their intelligence over many domains. I find it deeply, deeply profound that language turned out to be the key to intelligence.
@pachvandio 1 day ago
Amazing how audio quality will make a massive impact on what people remember from a video…
@deadsippy 6 days ago
Is there a more detailed benchmark in the works to compare the performance of RFT against o1-mini? The val-accuracy improvement and the top-k tests for the specific task seem promising but I'd love to learn more specifically about what the model is learning through RFT.
@Red-rummy 3 days ago
Guys. I love what y'all have done. Just playing around with GPT-4o I am able to use it as my paint brush in ways I never thought possible. Thank you for everything y'all do. Please stay ethical. I see what we can do with this technology and it’s amazing and a little scary. But in my eyes OpenAI is the gold standard hands down.
@wolpumba4099 4 days ago
OpenAI Introduces Reinforcement Fine-Tuning (RFT) for Customizing AI Models
* 0:00 Introduction: OpenAI announces Reinforcement Fine-Tuning (RFT), a new method for customizing its large language models, specifically the o1 series. RFT uses reinforcement learning, moving beyond standard fine-tuning to allow models to reason in new ways over custom data. Public launch is planned for next year, but an alpha program is available for researchers and enterprises.
* 1:02 RFT Use Cases: RFT is suited for domains requiring deep expertise, such as law, finance, engineering, and insurance. A partnership with Thomson Reuters demonstrates RFT's use in creating a legal assistant AI.
* 2:22 RFT vs. Supervised Fine-Tuning: Unlike supervised fine-tuning, which focuses on replicating patterns in input data, RFT teaches models to reason differently. It involves grading model responses and reinforcing successful lines of thinking, requiring only a few dozen examples for effective learning.
* 3:50 RFT in Scientific Research: Justin Reese, a computational biologist, highlights the potential of RFT in rare genetic disease research. RFT can help identify causative genes based on patient symptoms, potentially shortening the diagnostic odyssey for millions. A collaborative dataset from scientific publications was used, containing patient symptoms, absent symptoms, and the causative gene.
* 6:24 RFT Demo: A demonstration shows how RFT improves o1 mini's performance to surpass even the full o1 model in identifying causative genes based on symptoms.
* 7:17 Training Data and Graders: The demonstration explains training datasets (JSONL files), validation datasets, and graders (functions scoring model outputs against correct answers). Graders provide feedback for the reinforcement learning process.
* 13:29 Evaluation Results: RFT significantly improved o1 mini's performance. Evaluations measured top-1, top-5, and top-at-max accuracy, showing substantial gains after fine-tuning.
* 15:03 Model Output Analysis: The fine-tuned model's output now includes reasoning explanations, ranking potential causative genes and significantly boosting accuracy.
* 17:38 Broader Applications: RFT has potential beyond scientific research and has shown promise in various fields, including biochem, AI safety, legal, and healthcare.
* 18:58 Alpha Program Expansion: OpenAI expands its alpha program through the "Reinforcement Fine-Tuning Research Program," offering limited spots for organizations working on complex tasks with expert teams. A link to the application is provided in the description.
I used gemini-exp-1121 on rocketrecap dot com to summarize the transcript. Input tokens: 21622 Output tokens: 561
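To make the "graders" part of the summary above concrete, here is a minimal Python sketch of what a grader could look like. It is an illustration only: the reciprocal-rank scoring rule, the function name `grade_ranked_genes`, the JSONL field names, and the placeholder gene names are all assumptions, not OpenAI's actual schema or grader.

```python
import json


def grade_ranked_genes(ranked_genes, correct_gene):
    """Hypothetical grader: 1.0 if the correct gene is ranked first,
    1/rank if it appears further down the list, 0.0 if it is missing."""
    for rank, gene in enumerate(ranked_genes, start=1):
        if gene == correct_gene:
            return 1.0 / rank
    return 0.0


# One line of a hypothetical JSONL training file (made-up placeholder values).
line = json.dumps({
    "symptoms": ["symptom A", "symptom B"],
    "absent_symptoms": ["symptom C"],
    "correct_answer": "GENE_2",
})

example = json.loads(line)
model_output = ["GENE_7", "GENE_2", "GENE_5"]  # pretend ranked answer from the model
score = grade_ranked_genes(model_output, example["correct_answer"])
print(score)  # 0.5, because the correct gene is ranked second
```

Partial credit for a correct gene that is ranked lower gives the reinforcement step a smoother signal than a plain right/wrong score would; whether the real graders work this way is not stated in the summary.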
@aomameditation3497 6 days ago
Thanks for the new video. As an active user of ChatGPT-4o, I'm looking forward to the next video.
@DarkandTwisted 6 days ago
The bad audio makes it look as if you guys used PlaySkool mics.
@o1-preview 6 days ago
small room
@mz8755 6 days ago
When she smiles my heart melted
@GinaRealTalk 6 days ago
Did I hear this right? An alpha program for creative and functional AI offerings?! OpenAI is leading the way in modeling what intergenerational collaboration can look like in the Age of AI, and I’m here for it! Could this mean I can leverage my 15 years of expertise as a Bilingual Relationship & Family Therapist to design a GPT guide that coaches wellness and resilience skills? This feels like the perfect opportunity to reimagine intentional self-care: teaching the skills to thrive before therapy becomes necessary. My inner MCU nerd is geeking out at the possibilities!
@jtfxjhggvkuhhvktgu 6 days ago
Therapy should NOT be done by AI at this point in time. An AI can do its best to predict the perfect response, but a therapist should be focused on providing actual help that will fit best for the client, as I'm sure you're well aware. With an AI you'll at this point in time have edge cases with bad responses which can harm the client instead...
@augustuslxiii 6 days ago
@jtfxjhggvkuhhvktgu Right, because human therapists could never give bad responses or harm their clients. Oh wait. Some can, and do, and for hundreds - if not thousands - of dollars.
@TC-jo2vj 6 days ago
🤮
@TC-jo2vj 6 days ago
@jtfxjhggvkuhhvktgu cause therapy is a scam, correct.
@homeyworkey 6 days ago
@jtfxjhggvkuhhvktgu not everyone can afford therapy. It's a good jumping-off point for the many (mostly men) who are doubtful of it as well
@gridvid 6 days ago
That's actually a really cool and important feature 👍
@shubhamcweb 6 days ago
People talking about the sound here are probably better off not watching the video at all. I can’t think of a better way to introduce your product in a friendly and concise way than this. I heard everything in the video correctly. They showed a whole new functionality of their product and all that the Karens noticed was sound.
@AriaAlessandra 6 days ago
I love this! I just wish they allowed the 128k context window for Plus users. Even if it’s in the “4o mini” version, so it can be cheaper for OpenAI, but please remove the 8k cap
@Patrick-gm3fb 6 days ago
They should have an option for a mechanism to offboard some of the system weight onto your own resources. It might increase latency but it would be worth it for people who have a ton of RAM and need the larger context window. And of course, it would be a win-win because it offloads some of their resource needs that could be better utilized for people who don't have their own local resources.
@AriaAlessandra 6 days ago
@ lol I’d be one of those cause I have 8gb of unified ram 😂
@Patrick-gm3fb 6 days ago
@AriaAlessandra I've got 32 GB of RAM and when I run a local LLM the context window is absolutely _massive_. My prediction: RAM is going to start to be in very high demand pretty soon. Start buying now if you can afford upgrades.
@AriaAlessandra 5 days ago
@Patrick-gm3fb Mac doesn’t have that option unless it was the highest model. 😖 I do intend soon-ish to upgrade to 32 GB of RAM, but other than AI, Mac has done magic with 8, so that’s why I got this one. We are also waiting on the image generation and image editing of GPT-4o. In the demo it works way way better than any other image generator, but they never released it; they keep using DALL-E.
@Patrick-gm3fb 5 days ago
@AriaAlessandra Yet another reason why I loathe 🍏 and their proprietary systems. There are always other lightweight options in the FOSS world. I have an entry-level AMD mobo and I can shove up to 128GB of RAM on there.
@grandgigs 6 days ago
All really cool, but Claude 3.5 Sonnet is currently still a better user experience and workflow, and it has separate project memory. Without the memory element it's hard to iterate on large projects.
@naft3R 6 days ago
@grandgigs hey, I would like to know how you handle Claude with big project files; I'm kinda struggling there
@handsanitizer2457 6 days ago
What kind of files and what kind of project? @naft3R
@John-p7y7b 5 days ago
When is the next video? Is it every day or 12 random days before Christmas? Edit: I found out, it's every weekday for 12 days.
@ecasado7 2 days ago
Love the color palette, warm colors, looks gorgeous
@TomMooneyUE4 6 days ago
Turn on Ambient Mode in YouTube settings
@culoacido420 6 days ago
why?
@TheZEN2011 6 days ago
Reasoning, I guess, tested that method several times. There are lots of methods to improve the output. They will eventually change the transformer completely to give it a boost. Reasoning will be kind of built into the neuron layers as well as other useful features. But yeah, OpenAI has made some incredible advancements. They should be proud.
@MomentsInTrading 6 days ago
Your sound quality would vastly improve if you hang some things off camera to absorb the echo. Moving blankets work well.
@OpenAITutor 6 days ago
We have come a long way, but we still have a long journey ahead of us.
@dell_rew 6 days ago
Another great video!!! Thank you guys!
@j_hull 5 days ago
So many comments about audio quality, so many who don’t get that it doesn’t matter anymore. They don’t have a thumbnail with their mouth open pointing, they don’t have jump cuts every 10 seconds, and yes, one of the mics was low, but who cares? That’s surface-level phony presentation meant to manipulate you to stay longer on valueless information. The work is all that matters now.
@DaniGieseler 4 days ago
I like the sound effect - it has a vintage/lofi quality to it
@rustomshroff417 5 days ago
There should be a test data set as well in the tool because you are indirectly training your model on the validation set as well by using graders.
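The point above about a held-out test set can be sketched in a few lines of Python. This is only an illustration under assumptions: the function name `three_way_split`, the 10% fractions, and the fixed seed are hypothetical choices, not part of the tool shown in the video. The example size of 1100 matches the dataset size mentioned elsewhere in this thread.

```python
import random


def three_way_split(examples, val_fraction=0.1, test_fraction=0.1, seed=0):
    """Shuffle once, then carve out a test set that is never used for
    training or for choosing checkpoints, plus a validation set."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)  # seeded for a reproducible split
    n_test = int(len(shuffled) * test_fraction)
    n_val = int(len(shuffled) * val_fraction)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test


train, val, test = three_way_split(range(1100))
print(len(train), len(val), len(test))  # 880 110 110
```

The test portion is scored exactly once, after all tuning decisions have been made on the validation portion, so its number is not inflated by repeated peeking.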
@yurijmikhassiak7342 6 days ago
31% of correct answers doesn't look close to a tool that you could reliably use at work. Am I missing anything?
@ckq 6 days ago
The context is that it's incredibly difficult so 31% is pretty impressive and apparently near SoTA along with specialized models
@grmancool 6 days ago
ok that's great for researchers but doesn't sound like a production ready tool
@yurijmikhassiak7342 6 days ago
I guess they should state that in research on some substances it will give you, let's say, a list of 100 candidates of which 31 will be correct. It's better than going through millions of options yourself. But in healthcare, having an incorrect diagnosis 70% of the time..
@TheRealTommyR 5 days ago
I like that they are being honest about it instead of cherry-picking a case with better accuracy. Plus, the guy said this experiment was run with missing information, so hopefully with that information included, the results would be better than the current results without AI, when comparing apples to apples. (Apple products on the table)
@aryanluharuwala6407 5 days ago
maybe more than 1100 examples would help
@hundredfireify 6 days ago
Everyone is complaining about audio.. who cares? We can understand them, and even the automated captions work well on their audio. Look at their work instead of focusing on useless nitpicks about video production
@timrod94 5 days ago
Where is Day 3? 😅
@bluestar1234able 5 days ago
Business days only bro
@elprox1290 4 days ago
@bluestar1234able wait, actually?
@MrSchweppes 2 days ago
I wonder when other companies will offer reinforcement fine-tuning! Looks amazing!👍
@BlayneOliver 6 days ago
So we need to contact OpenAI for Reinforcement Fine-Tuning this year (if applicable), but it will be publicly available next year, right?
@markmatzke 4 days ago
Thank you for the insightful presentation! I have a question regarding GPT's learning and evaluation process. In cases where the model receives a failing grade despite being told the correct answer, is this reflective of how it processes information? Could it be similar to how humans sometimes internalize incorrect patterns or concepts, even when given the correct information? Alternatively, could it be due to how humans teach or evaluate the model, where what we believe to be correct might not always be accurate? Did GPT possibly find a better answer in these cases, or is it more a matter of the model becoming confused?
@MrSchweppes 6 days ago
The most important thing about this reinforcement fine-tuning feature is that it will be available for the next generation of o models, like o2, o3, and so on.
@SergioBarreracoding 6 days ago
Lots of great things, amazing job guys! I am sure Santa will RFTe's his sleds with that joke to fix his wheels:)
@georgeserban007 4 days ago
Day 3 anyone???? 😢😢😢
@FifthDistrictStudio 6 days ago
When will Sora be available?😢
@hasanak4896 3 days ago
So let me get this straight: OpenAI’s now letting the rich kids on the block teach the AI to think better while the rest of us get to play with the toy version. ‘Reinforcement fine-tuning’ sounds cool and all, but I’m just sitting here wondering if the $200-a-month Pro tier gets me access to these gene-guessing shenanigans or if I need a research grant for that too. Also, props for the dad jokes; nothing says cutting-edge AI like cringeworthy holiday puns.
@GalaxyHomeA9 6 days ago
Santa didn't pine tune his model because he was busy watching this video to do it.🙃
@farbodeg2 6 days ago
When will the third day be? Tomorrow or on Monday?
@jiamingz 6 days ago
Monday
@Rohan-rb5le 6 days ago
Will we ever get o1 for custom GPTs?
@NishitChokhawala 6 days ago
After reindeerforcement learning, AI did improve and it's now working piner than ever.
@JesseFleming1990 5 days ago
@6:15 "Training ollama... The uh.. oh... O1 model's to reason for effectively" Nice. Nailed it lol
@renecouture3719 4 days ago
Looking forward to o1. Great work!
@billbond2682 6 days ago
I am all here for her smile
@greatermoose 6 days ago
For better audio, turn on the consistent volume setting in YouTube
@philadams9254 5 days ago
No day 3?
@LoKSET 6 days ago
Should have included the results of a fine-tuned 4o.
@giovannibrunoro1055 6 days ago
when it comes to creative writing, claude 3.5 sonnet still excels... I can't believe that OpenAI still hasn't closed that gap...
@shiccup 6 days ago
it seems openai is more focused on high profit areas
@MRAMAZRBALLZZ 6 days ago
Have you considered making this audible?
@wholeness 6 days ago
I guess the sound quality is a way to make it feel more natural and real. I get it 😂
@neodenjin 6 days ago
Please upload this into ChatGPT and ask it how to fix your audio quality
@eealliance5997 6 days ago
The sound is always so low.
@venkatgopidas9788 5 days ago
Love the format.
@dnbdsj 6 days ago
I love the raw sound ❤
@MOHAMED_DEEB84 6 days ago
The room echo is a problem; I couldn't continue watching. It is very noisy
@NileshPotdar-m9m 6 days ago
Cool way to employ experts to finetune your data... :) do I get copyright on the model which I fine tuned?
@nomaditsu 6 days ago
You'd think they'd know they can use AI to improve the audio lol
@BennuBirdPred 6 days ago
Can definitely be applied to GP diagnostics too...
@NovaAmor-b1k 6 days ago
Can I buy official merch eg - OpenAI t-shirt or cap? If so, where? 🙏
@SumedhKadoo 6 days ago
Turn off Stable Volume in Settings
@o1-preview 6 days ago
also, needs a bigger room
@mikestaub 6 days ago
Very nice, keep crushing it!
@sammito_ 6 days ago
How different would it be to approach a problem by fine-tuning using this approach instead of just having very good system prompts with few-shot examples embedded within the system prompt?
@adaparavikiran 5 days ago
If I purchase access to the o1 pro model, will my custom GPTs automatically utilize the o1 pro model, or will I need to configure them manually to switch to it?
@tuckersnow609 6 days ago
AGI but we have no sound guy
@mrepic8263 6 days ago
The dataset must not be limited to JSON... It should be flexible for other docs like PDF or PowerPoint or documents
@andersberg756 6 days ago
That's how you know it's not for you, but will be built into a product you use instead. Curating a dataset for the fine tuning task is at its core. You're probably thinking about RAG, where info can be supplied to the model at inference time, when you ask it stuff.
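The RAG idea mentioned above (supplying information to the model at inference time rather than training on it) can be sketched roughly as follows. This is a toy illustration under assumptions: real systems typically retrieve with embeddings rather than word overlap, and the function names, documents, and prompt layout here are made up.

```python
def retrieve(query, documents, top_n=1):
    """Toy retriever: rank documents by how many words they share with the query."""
    query_words = set(query.lower().split())
    ranked = sorted(
        documents,
        key=lambda doc: len(query_words & set(doc.lower().split())),
        reverse=True,
    )
    return ranked[:top_n]


def build_prompt(query, documents):
    """Paste the retrieved text into the prompt; the model itself is not retrained."""
    context = "\n".join(retrieve(query, documents))
    return f"Context:\n{context}\n\nQuestion: {query}"


docs = [
    "The quarterly report covers revenue and costs.",
    "The onboarding slides explain the vacation policy.",
]
print(build_prompt("What is the vacation policy?", docs))
```

With this approach the PDFs or slides only need to be converted to text and searched; fine-tuning, by contrast, needs a curated set of graded examples.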
@sky-productions
@sky-productions Күн бұрын
Summary
The video presents OpenAI's latest advancements in their model customization program, focusing on the introduction of reinforcement fine-tuning (RFT) for the o1 series. RFT allows users to train AI models on their datasets using reinforcement learning techniques, enhancing the model's ability to reason and respond effectively to specific tasks in various fields such as healthcare, finance, and legal systems. The video features practical applications, particularly in the context of rare genetic diseases, demonstrating how AI can assist researchers in identifying genetic causes based on symptoms.

Key Points

Introduction to Reinforcement Fine-Tuning
00:00 OpenAI launches o1 series in ChatGPT and discusses upcoming API integration, emphasizing its capability for improved reasoning.
01:03 Users can customize o1 with their datasets using reinforcement fine-tuning, which differs from traditional fine-tuning by employing reinforcement learning.

Applications and Advantages
02:49 RFT allows developers from various domains, such as legal and finance, to create expert models tailored to specific tasks, highlighting recent collaboration with Thomson Reuters.
04:11 Justin Ree discusses the potential of RFT in aiding scientific research, particularly for understanding rare genetic diseases which affect a significant population.

Technical Overview
07:00 A demonstration of RFT in training a model to predict genetic diseases from symptoms, stressing that the process only requires a few dozen examples for the model to learn effectively.
12:45 The validation process is explained, with graders scoring the model's outputs based on their accuracy against known correct answers, illustrating the training's effectiveness.

Results and Evaluation
15:03 The results show that the fine-tuned o1 mini model outperformed its predecessor, indicating the model's improved reasoning and generalization capabilities in identifying genetic conditions.
17:22 Evaluation metrics show how the model ranks genes based on symptoms, emphasizing the importance of both correct answers and reasoning in the outputs provided by the AI.

Conclusion and Future Perspectives
19:00 OpenAI invites organizations to participate in their reinforcement fine-tuning research program, aiming to enhance AI's applicability in complex tasks across various fields.
19:53 The video closes with a light-hearted joke, reinforcing community engagement while reiterating excitement about the advancements in AI model training.
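To make the grader idea from the summary above concrete, here is a minimal sketch of how a grader could score a ranked list of candidate genes against a known correct answer. The reciprocal-rank scoring rule, the function names, and the placeholder gene labels are all assumptions for illustration; this is not OpenAI's actual grader or API.

```python
from typing import List, Sequence, Tuple


def grade_ranked_genes(predicted: Sequence[str], correct: str) -> float:
    """Score one model output (hypothetical rule, not OpenAI's grader).

    Returns 1.0 if the correct gene is ranked first, 1/rank if it
    appears lower in the list, and 0.0 if it is missing entirely.
    """
    for rank, gene in enumerate(predicted, start=1):
        if gene.upper() == correct.upper():
            return 1.0 / rank
    return 0.0


def top_k_accuracy(examples: List[Tuple[Sequence[str], str]], k: int) -> float:
    """Fraction of examples whose correct gene is within the first k predictions."""
    if not examples:
        return 0.0
    hits = 0
    for predicted, correct in examples:
        top_k = [gene.upper() for gene in predicted[:k]]
        if correct.upper() in top_k:
            hits += 1
    return hits / len(examples)


# Placeholder validation set: (model's ranked list, known correct gene).
validation = [
    (["GENE_A", "GENE_B", "GENE_C"], "GENE_A"),  # correct answer ranked first
    (["GENE_B", "GENE_A", "GENE_C"], "GENE_A"),  # correct answer ranked second
    (["GENE_B", "GENE_C", "GENE_D"], "GENE_A"),  # correct answer missing
]

scores = [grade_ranked_genes(pred, gold) for pred, gold in validation]
mean_score = sum(scores) / len(scores)
```

A graded score like this gives partial credit for near misses, which is one plausible way a training signal could reward a model for moving the right answer up its list, while top-k accuracy on a held-out validation set checks whether that improvement generalizes.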
@Just2Fast24 6 days ago
Are these videos made by Sora?
@jesseburstrom5920 5 days ago
I can finally apply my master's in tech mathematical statistics in development. Hurray!!!
@odrammurks1497 6 days ago
Why can't we also use normal voice mode or at least speech-to-text in the PC app? 😕
@DefinitlyAPerson 6 days ago
0:08 How soon is it? Is it regular soon or OpenAI soon? Because if it is OpenAI soon, then it's the same soon they said for their real-time vision.
@Alden1320 6 days ago
Maybe "this time next year" soon
@Alexander01998 6 days ago
In typical OpenAI fashion, they haven't actually shipped anything on the second day of shipmas. An impossible to get into waitlist does not count!
@tomich20 6 days ago
Awesome job. I'd rather have real engineers with crappy audio than media people spending money where they shouldn't. Keep it up =)
@brian-jv1nw 6 days ago
Kudos for leaving the comments enabled
@ShahafSegev1 4 days ago
Will Sora be released to the public during this 12 days thing?
@ZhensongRen 6 days ago
Took us from advanced high school level to expert PhD level? You need to show benchmarks for this example. How would medical doctors do on these genetic disease tasks?
@IsxaaqAcademy 1 day ago
I heard new things in this video and understood them: validation datasets and graders.