Reinforcement Fine-Tuning-12 Days of OpenAI: Day 2

  Рет қаралды 254,888

OpenAI

OpenAI

Күн бұрын

Watch Justin Reese and members of the OpenAI team introduce and demo Reinforcement Fine-Tuning.
If you’re interested in the Reinforcement Fine-Tuning Research Program, visit openai.com/for...
Speakers (from left to right): Mark Chen, John Allard, Julie Wang, Justin Reese (Berkeley Lab)

Пікірлер: 744
@lavaboots1337
@lavaboots1337 Ай бұрын
Millions of dollars for staff. Billions for Nvidia chips. Sound recorded with a potato - priceless.
@staffanlundberg
@staffanlundberg Ай бұрын
Sorry, I guess You´re right: I was too busy following their storyline to notice. But I WILL make an effort to check this on their next video. 😏
@fegyenc
@fegyenc Ай бұрын
@staffanlundberg please make it better next time the sound and lights are so amateur. It is not your level...
@jadioj
@jadioj Ай бұрын
Lmfao
@RuneX_ai
@RuneX_ai Ай бұрын
This is good, their focus is the product not bla bla marketing
@RuneX_ai
@RuneX_ai Ай бұрын
But i like you potato 🥔 referral, even o1 wouldn’t come on that … or would he …🧐
@HelamanGile
@HelamanGile Ай бұрын
Do you guys need to hire a sound guy if so I am available have lots of experience
@upgradeu2
@upgradeu2 Ай бұрын
30 sec pitch go
@sammy45654565
@sammy45654565 Ай бұрын
@@upgradeu2 they need a better sound guy to hear it
@Lighto222
@Lighto222 Ай бұрын
Hello fellow sound guy
@A123-d8o
@A123-d8o Ай бұрын
make sure to charge them $200 a month and watch them cancel roflmao
@WayOfTheZombie
@WayOfTheZombie Ай бұрын
At least just boost about 2 to 4 db
@theoz1
@theoz1 Ай бұрын
Recap → Reinforcement Fine-Tuning (RFT) preview announced, allowing fine-tuning of o1 models on custom datasets using reinforcement learning (launching publicly next year). → Applications of RFT include creating expert models for domains like law, finance, healthcare, and engineering (e.g., partnership with Thomson Reuters for a legal assistant). → o1 Mini + RFT shown to outperform the full o1 model for specific tasks with smaller, faster, and cheaper models. → Alpha program for RFT expansion now open to select organizations working on complex tasks. RFT allows models to "think and learn" in new ways with just a few examples, offering potential for domain-specific AI advancements. If you're in research, enterprise level, or need custom AI, this may be useful. However for most users this is advanced so most won't likely use RFT or need to.
@ridrugo182
@ridrugo182 Ай бұрын
Doin' the lord's work right here.
@garette8672
@garette8672 Ай бұрын
appreciate this
@staffanlundberg
@staffanlundberg Ай бұрын
"However for most users this is advanced so most won't likely use RFT or need to." I believe these procedures will be simplified by time such that most people may be able to use them. It is a way to develope better knowledge in essentially any subject that You are "specialized" in. Hey, Chat GPT is trying to turn us all into researchers ! Love it !🤗
@theoz1
@theoz1 Ай бұрын
@@staffanlundberg so true! Over time all of these features become easier and easier to use. "Turning us all into researchers" - I like that take!
@godfather9443
@godfather9443 Ай бұрын
​@@ridrugo182can do this in 5 secondds using ai
@ChrizzeeB
@ChrizzeeB Ай бұрын
Can they fine-tune their microphones?
@TheHawkeyede
@TheHawkeyede Ай бұрын
that was a good one ;D
@A-Hit
@A-Hit Ай бұрын
Sound recorded by a newbie. They need a professional sound guy. Pick me.
@xevil21
@xevil21 Ай бұрын
@@A-Hit Open a.i pick a.i. not You.
@jmg9509
@jmg9509 Ай бұрын
Lmao
@qijia4769
@qijia4769 Ай бұрын
I suppose o1 pro could improve the sound quality with one prompt.
@robertniyazoff2591
@robertniyazoff2591 Ай бұрын
the guy on the very left is a genius. Bro has worked at jane street as a quant, Microsoft as a swe, and been at openai for 6 years and going
@o1-preview
@o1-preview Ай бұрын
pretty smart lad indeed
@farisitum9594
@farisitum9594 Ай бұрын
Our left or their left
@combatninjaturtle
@combatninjaturtle Ай бұрын
@@farisitum9594the guy on the other end is computational biologist. So it’s obvious.
@Ragingwasabi9000
@Ragingwasabi9000 Ай бұрын
Which end is the computational biologist at?
@combatninjaturtle
@combatninjaturtle Ай бұрын
He was an intern at microsoft and jane street. He is sharp nonetheless.
@sunainakhedekar6802
@sunainakhedekar6802 Ай бұрын
all these people complaining about voice quality. honestly the people who are 100% invested in the actual content don’t bother to be bugged by something like this. I honestly didn’t realise something was off until I read the comments. It’s rather a clever filtration tactic among the audience watching!
@npc-drew
@npc-drew Ай бұрын
Day 3: Audio o1 -- Delivers movie studios audio-like quality.
@o1-preview
@o1-preview Ай бұрын
that would be nice
@thepresistence5935
@thepresistence5935 Ай бұрын
brhhhhhh
@Mastertingus
@Mastertingus Ай бұрын
You almost guessed , except it's Video and not audio..
@Luxcium
@Luxcium Ай бұрын
next time they should have a better sound don’t you guys think??? 🤔
@CaioBrazVS
@CaioBrazVS Ай бұрын
Yeah, plsss!
@sanchitkaul5084
@sanchitkaul5084 Ай бұрын
Turn off "stable volume " in settings
@orterves
@orterves Ай бұрын
Billions of dollars, dollar store microphones
@emilyd4385
@emilyd4385 Ай бұрын
100% the static noise and muffled mics are distracting to the incredible work they're showcasing
@akhilj307
@akhilj307 Ай бұрын
Yes - it’s muffled
@AbhilashKorraprolu
@AbhilashKorraprolu Ай бұрын
Its cute and adorable how you guys have the script in front of you, but react like you are hearing things for the first time :P Cheers guys
@TheRealTommyR
@TheRealTommyR Ай бұрын
that is exactly what actors are paid to do.
@BrianMosleyUK
@BrianMosleyUK Ай бұрын
Amazing the groundbreaking functionality being presented here, and so many people are thinking about the sound quality. Tumbleweed.
@JohnDoe-fh9kn
@JohnDoe-fh9kn Ай бұрын
so cool to watch this all unfold. the work you guys do is incredible
@McDonaldsCalifornia
@McDonaldsCalifornia Ай бұрын
Lol i love how the expert didn't sound excited at all. "Yeah might be cool in the future. Getting better, cant really be compared to existing tools"
@kairi4640
@kairi4640 Ай бұрын
And the woman is just like 😃 to make up for his lack of excitement. 😂
@martymarl4602
@martymarl4602 Ай бұрын
Basically they are saying"help train our models to replace you" with a smile. Not so that you can get more done (short term), but that open AI can use your data to compete in your area of expertise (long term, 1 year from now)
@o1-preview
@o1-preview Ай бұрын
@@martymarl4602 nah, horse drivers were scared to lose their jobs to cars. its not about the tool, its about who is driving the tool.
@o1-preview
@o1-preview Ай бұрын
reminds me of gpt2 days, it will be cool in the future
@martymarl4602
@martymarl4602 Ай бұрын
@@o1-preview If you don't realise that the "tool" will be superior to the "Drivers" shortly, you're on the wrong page mate
@naft3R
@naft3R Ай бұрын
Please update so o1 has access to project files. It makes it very difficult to work on medium size projects when you cant provide enough content due to the amount of files
@BruceHartford3
@BruceHartford3 Ай бұрын
Looks like that feature is included in the latest update. :)
@naft3R
@naft3R Ай бұрын
@BruceHartford3 it only supports images
@jr-hp7er
@jr-hp7er Ай бұрын
You can only upload the images, not files ex. Pdf can't be attached ​@@BruceHartford3
@101RealTalker
@101RealTalker Ай бұрын
What are your metrics for a "medium-sized" project?... I have over 3.5 million words worth of research representing a single project, what size would that qualify as in comparison?
@naft3R
@naft3R Ай бұрын
@@101RealTalker a model needs full context of a web app in order to make changes and not break the code. Right now all you can do is copy paste code, but other languages support project files.
@KaizenKaizen-pro
@KaizenKaizen-pro Ай бұрын
Good morning, My name is Mohamed Abdallah, and I am a young person passionate about artificial intelligence. Since discovering OpenAI, I have been captivated by your vision and projects. You inspired me to take a deep interest in this field and develop my own AI, which I called "Kaizen". My biggest dream is to one day work at OpenAI. Your commitment to innovation and positive impact has motivated me to push myself and imagine solutions that could, in turn, transform people's lives. Thank you for showing enthusiasts like me what is possible. I hope to have the opportunity, in the future, to contribute to your incredible projects. Sincerely, Mohamed Abdallah
@KaiPlayz7689
@KaiPlayz7689 Ай бұрын
Good morning, The comment section is not a email section 💀
@bluestar1234able
@bluestar1234able Ай бұрын
​@@KaiPlayz7689he doesn't know we're all being replaced soon 😔
@ozanaydn4705
@ozanaydn4705 Ай бұрын
Blud even made ai wrote this 😅
@KaiPlayz7689
@KaiPlayz7689 Ай бұрын
@@bluestar1234able i do and I know this is written by ChatGPT
@SurrealDistractions
@SurrealDistractions Ай бұрын
They should replace thier current sound engineer with AI
@o1-preview
@o1-preview Ай бұрын
lol, I feel bad for who ever it is, the comment section is mostly about this, i bet it'll be his/her hero arc moment
@zurgmuckerberg
@zurgmuckerberg Ай бұрын
There's probably no sound engineer, just their internal IT guys.
@OrofinX
@OrofinX Ай бұрын
Yes, I'm in shock 😮. Just put a mic on their clothes.
@A-Hit
@A-Hit Ай бұрын
Pick me
@GenAIWithNandakishor
@GenAIWithNandakishor Ай бұрын
Fix the fkknn sound
@cjgoeson
@cjgoeson Ай бұрын
Nice sip noise 0:08
@jasondisney
@jasondisney Ай бұрын
Followed by *gulp*
@TheCosmy2012
@TheCosmy2012 Ай бұрын
Good soup
@alejandromedina1019
@alejandromedina1019 Ай бұрын
@@TheCosmy2012 mmtsssaaaaahhhh
@allenlawson9872
@allenlawson9872 Ай бұрын
Literally made me stop the video LOL
@GeoMeridium
@GeoMeridium Ай бұрын
We need some Vic Berger style edits of these releases
@b2brish
@b2brish Ай бұрын
OpenAudioAI might be the next billion-dollar industry!
@jaketron.seattle
@jaketron.seattle Ай бұрын
"our models will bring a new breakthrough in healthcare, physics and mathematics" me using o1 to write standup comedy bids about my workplace, while simultaneously teaching me spanish phrases "Mis colegas están como cabras"😂😂😂
@IceMetalPunk
@IceMetalPunk Ай бұрын
4o is definitely a much better model for creative tasks like that. And cheaper, too.
@PatrickBulteel
@PatrickBulteel Ай бұрын
Your colleagues are like goats?
@jaketron.seattle
@jaketron.seattle Ай бұрын
@@PatrickBulteel 🤣😂😂😂 that joke was made by the most advanced model in the world, Mr!
@R.E-O
@R.E-O Ай бұрын
@@PatrickBulteel It means they are crazy.
@IceMetalPunk
@IceMetalPunk Ай бұрын
@@R.E-O You sure it doesn't mean they eat anything they find in the trash?
@micbab-vg2mu
@micbab-vg2mu Ай бұрын
I work for a pharmaceutical company in the medical department, and I cannot wait to test the Reinforcement Fine-Tuning. Thank you!
@FrancisZhou-thecrocodilekeeper
@FrancisZhou-thecrocodilekeeper Ай бұрын
When will the video call and screen sharing features come out? Have been waiting for a looooong time...
@fiddlestix26
@fiddlestix26 Ай бұрын
Literally like has that been scrapped or what?! Also I feel like I’m always the only one that is ever asking where tf that feature is.
@epicboy330
@epicboy330 Ай бұрын
@FrancisZhou-thecrocodilekeeper you have to realize how much computing power that would take to do on a global server. It’s probably years out before it’ll release and be a reasonable price. That’s simply a hardware limitation. The biggest problem in tech rn is that our software capabilities far exceed our hardware
@fiddlestix26
@fiddlestix26 Ай бұрын
@@epicboy330 ummm they literally demoed it in a keynote presentation like a year ago and said it would be available in the “coming weeks” and then just never mentioned it again.
@ilyasalcantara6450
@ilyasalcantara6450 Ай бұрын
start using ai for giving u guys tips on how to have a proper audio
@Iwantalloftheinformation
@Iwantalloftheinformation Ай бұрын
It makes them look out of the garage like. They're definitely not anymore.
@watchingvideos9871
@watchingvideos9871 Ай бұрын
@@Iwantalloftheinformationwhat lol
@seanmikejaffer423
@seanmikejaffer423 Ай бұрын
Thanks a lot for the valuable feedback
@DarkandTwisted
@DarkandTwisted Ай бұрын
The bad audio makes it look as if you guys used PlaySkool mics.
@o1-preview
@o1-preview Ай бұрын
small room
@markmatzke
@markmatzke Ай бұрын
Impressive results from fine-tuning! The performance improvements in Top-1, Top-5, and Top-at-Max accuracy are exciting, especially in the context of such complex bioinformatics tasks. The clear visualization really helps to showcase the impact of reinforcement learning. Looking forward to seeing how this approach evolves and what new applications it could be used for in healthcare and other fields. Great work!
@Dan-e6r7s
@Dan-e6r7s Ай бұрын
OpenAI's $157B valuation suggests that there is no money left for good sound quality.
@TomMooneyUE4
@TomMooneyUE4 Ай бұрын
Turn on Ambient Mode in youtube settings
@culoacido420
@culoacido420 Ай бұрын
why?
@pachvandio
@pachvandio Ай бұрын
Amazing how audio quality will make a massive impact on what people remember from a video…
@deadsippy
@deadsippy Ай бұрын
Is there a more detailed benchmark in the works to compare the performance of RFT against o1-mini? The val-accuracy improvement and the top-k tests for the specific task seem promising but I'd love to learn more specifically about what the model is learning through RFT.
@madeline-onassis
@madeline-onassis Ай бұрын
awesome new developments big brains delivering big prizes to humanity
@千修-n8x
@千修-n8x Ай бұрын
My understanding of OpenAI’s reinforcement learning is: it establishes a foundational cognitive framework (understanding the essence of the world) through the scoring model and leverages reinforcement learning and experience accumulation to continuously optimize the AI’s reasoning and decision-making capabilities. In other words, it learns the logical thinking of experts in each field.
@Silus1008
@Silus1008 Ай бұрын
I hadnt realized that Open AI had come so far in AI video generation! These guys struck me basically as human, all four of them!
@aomameditation3497
@aomameditation3497 Ай бұрын
Thanks for new video as an active user of ChatGP-4o. I'm looking forward the next video.
@timrod94
@timrod94 Ай бұрын
Where is Day 3? 😅
@bluestar1234able
@bluestar1234able Ай бұрын
Business days only bro
@elprox1290
@elprox1290 Ай бұрын
@@bluestar1234ablewait actually?
@modalmixture
@modalmixture Ай бұрын
What's quite interesting to me is that they're doing bioinformatics ML, the kind of stuff you would do with traditional deep learning methods or maybe a reinforcement learning model like AlphaFold, but they're using a pre-trained *language* model, and not only does it do pretty well, it can explain its reasoning! Shows that these frontier LLMs really can generalize their intelligence over many domains. I find it deeply, deeply profound that language turned out to be the key to intelligence.
@AriaAlessandra
@AriaAlessandra Ай бұрын
I love this! I just wish they allowed the 128k context window for plus users. Even if it’s in the “4o mini” version, so it can be cheaper for open ai, but please remove the 8k cap
@Patrick-gm3fb
@Patrick-gm3fb Ай бұрын
They should have an option for a mechanism to offboard some of the system weight onto your own resources. It might increase latency but it would be worth it for people who have a ton of RAM and need the larger context window. And of course, it would be a win-win because it offloads some of their resource needs that could be better utilized for people who don't have thier own local resources.
@AriaAlessandra
@AriaAlessandra Ай бұрын
@ lol I’d be one of those cause I have 8gb of unified ram 😂
@Patrick-gm3fb
@Patrick-gm3fb Ай бұрын
@@AriaAlessandra I've got 32 GB of RAM and when I run a locl LLM the context window is absolutely _massive_. My prediction: RAM is going to start to be in very high demand pretty soon. Start buying now if you can afford upgrades.
@AriaAlessandra
@AriaAlessandra Ай бұрын
@@Patrick-gm3fbMac doesn’t have that option unless it was the highest model. 😖 I do intend soon-ish to upgrade to 32 gb of ram, but other than ai, Mac has done magic with 8, so that’s what I got this one. We are also waiting on the image generation and image editing of gpt4o In the demo it works way way better than any other image generator but they never released it, they keep using dalle
@Patrick-gm3fb
@Patrick-gm3fb Ай бұрын
@@AriaAlessandra Yet another reason why I'm loathe of 🍏 and their proprietary systems. There's always other lightweight options in the FOSS world. I have an entry level AMD MoBo and I can shove up to 128GB of RAM on there.
@John-p7y7b
@John-p7y7b Ай бұрын
When is the next video? Is it every day or 12 random days before christmas? Edit: I found out, its every weekday for 12 days.
@philadams9254
@philadams9254 Ай бұрын
No day 3?
@FifthDistrictStudio
@FifthDistrictStudio Ай бұрын
When will Sora be available?😢
@Red-rummy
@Red-rummy Ай бұрын
Guys. I love what yall have done. Just playing around with GPT4o I am able to use it as my paint brush in ways I never thought possible. Thank you for everything yall do. Please stay ethical. I see what we can do with this technology and it’s amazing and a little scary. But in my eyes openAI is the gold standard hands down.
@wolpumba4099
@wolpumba4099 Ай бұрын
*OpenAI Introduces Reinforcement Fine-Tuning (RFT) for Customizing AI Models* * *0:00** Introduction:* OpenAI announces Reinforcement Fine-Tuning (RFT), a new method for customizing its large language models, specifically the O1 series. RFT uses reinforcement learning, moving beyond standard fine-tuning to allow models to reason in new ways over custom data. Public launch is planned for next year, but an alpha program is available for researchers and enterprises. * *1:02** RFT Use Cases:* RFT is suited for domains requiring deep expertise, such as law, finance, engineering, and insurance. A partnership with Thomson Reuters demonstrates RFT's use in creating a legal assistant AI. * *2:22** RFT vs. Supervised Fine-Tuning:* Unlike supervised fine-tuning, which focuses on replicating patterns in input data, RFT teaches models to reason differently. It involves grading model responses and reinforcing successful lines of thinking, requiring only a few dozen examples for effective learning. * *3:50** RFT in Scientific Research:* Justin Reese, a computational biologist, highlights the potential of RFT in rare genetic disease research. RFT can help identify causative genes based on patient symptoms, potentially shortening the diagnostic odyssey for millions. A collaborative dataset from scientific publications was used, containing patient symptoms, absent symptoms, and the causative gene. * *6:24** RFT Demo:* A demonstration shows how RFT improves O1 mini's performance to surpass even the full O1 model in identifying causative genes based on symptoms. * *7:17** Training Data and Graders:* The demonstration explains training datasets (JSONL files), validation datasets, and graders (functions scoring model outputs against correct answers). Graders provide feedback for the reinforcement learning process. * *13:29** Evaluation Results:* RFT significantly improved O1 mini's performance. Evaluations measured top-1, top-5, and top-at-max accuracy, showing substantial gains after fine-tuning. * *15:03** Model Output Analysis:* The fine-tuned model's output now includes reasoning explanations, ranking potential causative genes and significantly boosting accuracy. * *17:38** Broader Applications:* RFT has potential beyond scientific research and has shown promise in various fields, including biochem, AI safety, legal, and healthcare. * *18:58** Alpha Program Expansion:* OpenAI expands its alpha program through the "Reinforcement Fine-Tuning Research Program," offering limited spots for organizations working on complex tasks with expert teams. A link to the application is provided in the description. I used gemini-exp-1121 on rocketrecap dot com to summarize the transcript. Input tokens: 21622 Output tokens: 561
@greatermoose
@greatermoose Ай бұрын
For better audio, turn on consistent volume setting in KZbin
@farbodeg2
@farbodeg2 Ай бұрын
When it would be the third day? Tomorrow or on Monday?
@jiamingz
@jiamingz Ай бұрын
Monday
@TheZEN2011
@TheZEN2011 Ай бұрын
Reasoning, I guess, tested that method several times. There are lots of methods to improve the output. They will eventually change the transformer completely to give it a boost. Reasoning will be kind of built into the neuron layers as well as other useful features. But yeah, OpenAI has made some incredible advancements. They should be proud.
@GinaRealTalk
@GinaRealTalk Ай бұрын
Did I hear this right? An alpha program for creative and functional AI offerings?! OpenAI is leading the way in modeling what intergenerational collaboration can look like in the Age of AI, and I’m here for it! Could this mean I can leverage my 15 years of expertise as a Bilingual Relationship & Family Therapist to design a GPT guide that coaches wellness and resilience skills? This feels like the perfect opportunity to reimagine intentional self-care-teaching the skills to thrive before therapy becomes necessary. My inner MCU nerd is geeking out at the possibilities!
@jtfxjhggvkuhhvktgu
@jtfxjhggvkuhhvktgu Ай бұрын
Therapy should NOT be done by AI at this point in time. An AI can do its best to predict the perfect response but a therapist should be focused on providing actual help that will fit best for the client as im sure you're well aware, with an AI you'll at this point in time have edge cases with bad responses whch can harm the client instead...
@augustuslxiii
@augustuslxiii Ай бұрын
@@jtfxjhggvkuhhvktgu Right, because human therapists could never give bad responses or harm their clients. Oh wait. Some can, and do, and for hundreds - if not thousands - of dollars.
@TC-jo2vj
@TC-jo2vj Ай бұрын
🤮
@TC-jo2vj
@TC-jo2vj Ай бұрын
@@jtfxjhggvkuhhvktgucause therapy is a scam, correct.
@homeyworkey
@homeyworkey Ай бұрын
​@@jtfxjhggvkuhhvktgunot everyone can afford therapy. It's a good jumping off point for the many (mostly men) who are doubtful of it aswell
@ecasado7
@ecasado7 Ай бұрын
Love the color pallette, warm colors, looks gorgeous
@rustomshroff417
@rustomshroff417 Ай бұрын
There should be a test data set as well in the tool because you are indirectly training your model on the validation set as well by using graders.
@Rohan-rb5le
@Rohan-rb5le Ай бұрын
Will we ever get o1 for custom GPTs?
@MrSchweppes
@MrSchweppes Ай бұрын
I wonder when other companies will offer reinforcement fine-tuning! Looks amazing!👍
@yurijmikhassiak7342
@yurijmikhassiak7342 Ай бұрын
31% of correct answers doesn't look close to a tool that you could reliably use at work. Am I missing anything?
@ckq
@ckq Ай бұрын
The context is that it's incredibly difficult so 31% is pretty impressive and apparently near SoTA along with specialized models
@grmancool
@grmancool Ай бұрын
ok that's great for researchers but doesn't sound like a production ready tool
@yurijmikhassiak7342
@yurijmikhassiak7342 Ай бұрын
I guess that they should state that in research of some substanes it will give you let's say a list of 100 candidates from which 31 will be correct. It's better than going yourself and checking millions of options. But in healthcare having 70% of time incorrect diagnosis..
@TheRealTommyR
@TheRealTommyR Ай бұрын
I like that they are being honest about it instead of cherry picking a case with better accuracy. Plus, the guy said this experiment was ran with missing information, so hopefully with that information included, the results would be better than the current results without AI, when comparing apples to apples.(apple products on the table)
@aryanluharuwala6407
@aryanluharuwala6407 Ай бұрын
maybe more than 1100 examples would help
@DaniGieseler
@DaniGieseler Ай бұрын
I like the sound effect - it has a vintage/lofi quality to it
@gridvid
@gridvid Ай бұрын
That's actually a really cool and important feature 👍
@eliaspereirah
@eliaspereirah Ай бұрын
The link to next KZbin live?
@halbarba
@halbarba Ай бұрын
At OpenAI's conferences, especially during their '12 Days of OpenAI', they seem to have developed not just advanced AI, but also mysteriously bottomless coffee cups
@ahmedabdulrahman8567
@ahmedabdulrahman8567 24 күн бұрын
Is there a way to define the domain(s) that I want the model to reason about?
@grandgigs
@grandgigs Ай бұрын
All really cool but Claude 3.5 sonnet is currently still a better user experience and work flow and has separate project memory,Without the memory element it's hard to iterate on large projects,
@naft3R
@naft3R Ай бұрын
@@grandgigs hey i would like to know how you handle claude w big project files im kinda struggling there
@handsanitizer2457
@handsanitizer2457 Ай бұрын
What kind of files and what kind of project ? ​@@naft3R
@OpenAITutor
@OpenAITutor Ай бұрын
We have come a long way, but we still have a long journey ahead of us.
@MrSchweppes
@MrSchweppes Ай бұрын
The most important thing about this reinforcement fine-tuning feature is that it will be available for the next generation of o models, like o2, o3, and so on.
@eealliance5997
@eealliance5997 Ай бұрын
The sound is always so low.
@MomentsInTrading
@MomentsInTrading Ай бұрын
Your sound quality would vastly improve if you hang some things off camera to absorb the echo. Moving blankets work well.
@markmatzke
@markmatzke Ай бұрын
Thank you for the insightful presentation! I have a question regarding GPT's learning and evaluation process. In cases where the model receives a failing grade despite being told the correct answer, is this reflective of how it processes information? Could it be similar to how humans sometimes internalize incorrect patterns or concepts, even when given the correct information? Alternatively, could it be due to how humans teach or evaluate the model-where what we believe to be correct might not always be accurate? Did GPT possibly find a better answer in these cases, or is it more a matter of the model becoming confused?
@TirtaMilkita
@TirtaMilkita Ай бұрын
When the O1 model will be released?
@mz8755
@mz8755 Ай бұрын
When she smiles my heart melted
@BlayneOliver
@BlayneOliver Ай бұрын
Do we need to contact OpenAI for Reinforcement Finetuning this year (if applicable) but will have access publicly next year right?
@dell_rew
@dell_rew Ай бұрын
Another great video!!! Thank you guys!
@SergioBarreracoding
@SergioBarreracoding Ай бұрын
Lots of great things, amazing job guys! I am sure Santa will RFTe's his sleds with that joke to fix his wheels:)
@odrammurks1497
@odrammurks1497 Ай бұрын
why can´t we also use normal voice mode or at least speech to text in the PC App? 😕
@NileshPotdar-m9m
@NileshPotdar-m9m Ай бұрын
Cool way to employ experts to finetune your data... :) do I get copyright on the model which I fine tuned?
@behnamkhorsandian
@behnamkhorsandian Ай бұрын
Where is day 3 Altman?
@Just2Fast24
@Just2Fast24 Ай бұрын
are these videos made by sora?
@MRAMAZRBALLZZ
@MRAMAZRBALLZZ Ай бұрын
Have you considered making this audible?
@DefinitlyAPerson
@DefinitlyAPerson Ай бұрын
0:08 How soon is it? Is it regular soon or OpenAI soon, because if it is OpenAI soon then its the same soon they said for their real time vision.
@Alden1320
@Alden1320 Ай бұрын
Maybe "this time next year" soon
@NovaAmor-b1k
@NovaAmor-b1k Ай бұрын
Can I buy official merch eg - OpenAI t-shirt or cap? If so, where? 🙏
@shahrukhswati
@shahrukhswati Ай бұрын
Fine. We know you guys are against the status quo but what has that anything to do with Sound?
@renecouture3719
@renecouture3719 Ай бұрын
Looking forward to o1. Great work!
@Seriouslydave
@Seriouslydave Ай бұрын
What are they sellin'?
@georgeserban007
@georgeserban007 Ай бұрын
Day 3 anyone???? 😢😢😢
@adaparavikiran
@adaparavikiran Ай бұрын
If I purchase access to the o1 pro model, will my custom GPTs automatically utilize the o1 pro model, or will I need to configure them manually to switch to it?
@therealsmz
@therealsmz Ай бұрын
does this videos's audio record in an iPhone or other phones?
@14mrb67
@14mrb67 Ай бұрын
where can I send you some dji mini mics and I can become an investor this way
@entrepreneerit4490
@entrepreneerit4490 Ай бұрын
Turn up the volume
@neodenjin
@neodenjin Ай бұрын
Please upload this into chatgpt and ask it how to fix your audio quality
@LoKSET
@LoKSET Ай бұрын
Should have included the results of a fine-tuned 4o.
@tuckersnow609
@tuckersnow609 Ай бұрын
AGI but we have no sound guy
@shahaf1234
@shahaf1234 Ай бұрын
Will Sora be allowed to the public in this 12 days thing?
@JuanPabloMontoyaUrdaneta
@JuanPabloMontoyaUrdaneta Ай бұрын
Does anyone know the brand of the red jacket? I love it!
@sammito_
@sammito_ Ай бұрын
How different would it be to approach a problem by fine-tuning using this approach instead of just having very good system prompts with few-shot examples embedded within the system prompt?
@plutostube
@plutostube Ай бұрын
A lot of talks but no date when this will be available for users, anybody knows this?
@plutostube
@plutostube Ай бұрын
never mind, I checked the link, this will be not available for individual users, just for big projects
@TheRealDanNguyen
@TheRealDanNguyen Ай бұрын
When will full o1 be available for API?
@o1-preview
@o1-preview Ай бұрын
as soon as it is ready to release
@SumedhKadoo
@SumedhKadoo Ай бұрын
Turn off Stable Volume in Settings
@o1-preview
@o1-preview Ай бұрын
also, needs a bigger room
@hoangtran0901
@hoangtran0901 Ай бұрын
can we replace the sound/camera crew with AI please?
@udaykumargalidevara4489
@udaykumargalidevara4489 Ай бұрын
Where is day3 video myan
@MOHAMED_DEEB84
@MOHAMED_DEEB84 Ай бұрын
Room voice Eco is trouble; I couldn't continue watching. it is very noisy
@mrepic8263
@mrepic8263 Ай бұрын
Data set must not be limited to json... It should be flexible for other docs like pdf or power point or documents
@andersberg756
@andersberg756 Ай бұрын
That's how you know it's not for you, but will be built into a product you use instead. Curating a dataset for the fine tuning task is at its core. You're probably thinking about RAG, where info can be supplied to the model at inference time, when you ask it stuff.
@ladonteprince
@ladonteprince Ай бұрын
Interesting, but how to get it to 100% accuracy? Is it because it was o1-mini? I wish you would have used o1 or o1 pro with this method since that is the momentum that you started on.
@o1-preview
@o1-preview Ай бұрын
because of how long it trained and how much high quality data it has.. if the data has something that is missing that important to solve it will never reach 99.999%
@harshithvaddiparthy
@harshithvaddiparthy Ай бұрын
what interface is being used here?
@TanakaWatanabi
@TanakaWatanabi Ай бұрын
Why not use NDCG for ranking grading?
@LordDeadSpider
@LordDeadSpider Ай бұрын
What if the predefined correct answer was actually wrong?
@BennuBirdPred
@BennuBirdPred Ай бұрын
Can definitely be applied to GP diagnostics too...
@Ziad.G
@Ziad.G Ай бұрын
Interesting stuff, but the audio is unbearable, you guys were recording on your ipad or what?hopefully tomorrow is better lol
@Alden1320
@Alden1320 Ай бұрын
Sounds like the mic on the ipad or the laptop
@delightfulThoughs
@delightfulThoughs Ай бұрын
self driving is next?
@GalaxyHomeA9
@GalaxyHomeA9 Ай бұрын
Santa didn't pine tune his model because he was busy watching this video to do it.🙃
Sora-12 Days of OpenAI: Day 3
20:27
OpenAI
Рет қаралды 354 М.
OpenAI o3 and o3-mini-12 Days of OpenAI: Day 12
22:05
OpenAI
Рет қаралды 532 М.
Andro, ELMAN, TONI, MONA - Зари (Official Audio)
2:53
RAAVA MUSIC
Рет қаралды 8 МЛН
AI can't cross this line and we don't know why.
24:07
Welch Labs
Рет қаралды 1,5 МЛН
Best of CES 2025
14:50
The Verge
Рет қаралды 590 М.
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4,3 МЛН
Building OpenAI o1 (Extended Cut)
22:14
OpenAI
Рет қаралды 262 М.
Canvas-12 Days of OpenAI: Day 4
20:01
OpenAI
Рет қаралды 380 М.
Visualizing transformers and attention | Talk for TNG Big Tech Day '24
57:45
Vertical AI Agents Could Be 10X Bigger Than SaaS
42:13
Y Combinator
Рет қаралды 559 М.