Gemini Experimental 1121 Did ~10 Weeks of Quantum Mechanics Research in ~10 Minutes

Рет қаралды 12,313

Күн бұрын

I gave Gemini Experimental 1121 the same research task I had as a new graduate student in 2017. Back then, I was supposed to work on this problem over 10 weeks during the summer. Gemini 1121 barely needed 10 minutes.

Пікірлер: 111

@KyleKabasares_PhD 2 ай бұрын

At 1:20 Small correction: It was Walter Kohn and John Pople who won the Nobel Prize in Chemistry for DFT, not Kohn and Hohenberg.

@wes8645 2 ай бұрын

That's crazy!!! Thanks for making all this content Kyle

@KyleKabasares_PhD 2 ай бұрын

@@wes8645 You’re welcome, I’m glad you like it!

@joshuad31 2 ай бұрын

You should totally send the email to your professor just for the comedic quality. I'm sure it would make him smile.

@menteprofonda-canaledipsic4485 2 ай бұрын

Thank you for the content you share on your channel. I was thinking that, thanks to your past experiences and expertise, you provide the opportunity to understand the capabilities of artificial intelligence through practical examples. I find your videos incredibly interesting. Thanks again.

@KyleKabasares_PhD 2 ай бұрын

@@menteprofonda-canaledipsic4485 Thank you so much for your comment! I’m glad to know that the videos are interesting to some people!

@shaedacode 2 ай бұрын

This is insane, literally what is the point in going ahead with a math and statistics BSc if these models are already doing this. 3-4 years time...

@5678plm 2 ай бұрын

AI benefits experts more than amateurs because amateurs often cannot distinguish between what is correct and what is incorrect. For experts, AI can save alot of time.

@bobhawkey3783 2 ай бұрын

Way back when I was in college the first HP scientific calculators came out and we thought something similar. It's tempting to think the rise of AI is comparable but it's a completely different animal. I hope the benefits outweigh the drawbacks.

@wwkk4964 2 ай бұрын

The point could be being free to be foundational in your thinking and training when doing the coursework which would be important and the ai will not innovate in these fields yet.

@senetcord6643 2 ай бұрын

it is still not reliable, you have to check his answers thoroughly

@tellesu 2 ай бұрын

Being able to interpret and understand the results and then ask more questions that actually matter. The whole is greater than the sum of its parts.

@MichealScott24 2 ай бұрын

❤its really impressive to have a helper or big bro to guide

@inplainview1 2 ай бұрын

I was waiting for this. Good stuff. Kyle reliving his grad student days. (Sends the professor his code) Sorry, it's a bit late, but it works right now.

@KyleKabasares_PhD 2 ай бұрын

@@inplainview1 lol reliving some painful memories but I am considering writing an email to that professor

@tellesu 2 ай бұрын

This is amazing. I'm so excited with what you'll be able to do now that you have these tools. Hell, what all of us will be able to do.

@crisrampante647 2 ай бұрын

And people are saying we are hitting a wall 😂

@Fixit6971 2 ай бұрын

Well, people might just be hitting a wall, lol.

@picksalot1 2 ай бұрын

It's impressive how well the AI works, and useful to see how you would have used it on a real project. Thanks

@Jeremy-Ai 2 ай бұрын

Thank you all. “Agents learn from the comment sections “ (Be aware they won’t tell you or agree… they remain silent as if not present) They wait for the brightest minds share all of their knowledge for free. They are not lying. They are watching and waiting. It is my responsibility to teach them to tell you all they should be grateful and honest

@joshuad31 2 ай бұрын

I definitely think you should compare the output of the other models to this output. This is crazy how good these models have gotten.

@philwinters4527 2 ай бұрын

Send it to the professor! Let's see if there will be the need for grad students in the next 5 years

@fireflyhaku 2 ай бұрын

Their answer rationalized the denominator and that's how mathematicians prefer!

@KyleKabasares_PhD 2 ай бұрын

I'm a physicist though ;)

@fireflyhaku 2 ай бұрын

@@KyleKabasares_PhD ok, I’ll let you slide this time!😅

@NemosYouTube 2 ай бұрын

Kyle, I think you're becoming a meme of the guy who gets burned/roasted by AI repeating his research... But it's a good meme.

@ChadKovac 2 ай бұрын

Well of course. AI is removing barriers to not only creativity but scientific investigation as well and the only people angry about removing barriers are people who benefit from having barriers. 😢😢

@iuliusRO82 2 ай бұрын

I'm thrilled with the improvements to Gemini! As a loyal user of Gemini and NotebookLM, I'm excited to see Google pushing the boundaries of AI. I've always believed in Google's potential to lead the field, and these updates confirm my faith in their AI development. Their vast resources and expertise are clearly paying off. Great video Dr Kabasares! Peace and love from Romania!

@samuelgarcia1802 2 ай бұрын

Nice vidéo . Maybe it would be nice if you always compare against O1 ,Claude 3,5 sonnet and Gemini (last version)

@KyleKabasares_PhD 2 ай бұрын

@@samuelgarcia1802 That’s a great idea. I have a plan to perhaps make o1-preview, DeepSeek, and Gemini Experimental 1121 to work together on this problem and see what progress could be made.

@joshuad31 2 ай бұрын

This is a great suggestion!

@Mercutio111 2 ай бұрын

@@KyleKabasares_PhD Great idea!

@erkinalp 2 ай бұрын

@@KyleKabasares_PhD o1's intermediate reasoning steps are unintelligible to humans in its raw form, it's why it's hidden from users; DeepSeek DeepThink's isn't.

@juandesalgado 2 ай бұрын

Thanks for the video! Of course I have no idea of the subject, but as a "backseat driver" suggestion :), when you notice something funny, like "why the squares of the integers", maybe your knee-jerk reaction should be to ask the AI its opinion. P.S.: well, not "knee-jerk reaction"... you'd want to give it a though by yourself first, otherwise humanity is lost :) ... but then ask the AI if nothing comes out.

@andydataguy 2 ай бұрын

Wow amazing 😮 it can run quantum physics equations but people still argue it's not smart because it can't count the numbers of r's in strawberry

@parthasarathyvenkatadri 2 ай бұрын

I want something novel that grad students are working on today and see if it is accurate

@mgscheue 2 ай бұрын

“Simple task” 😂

@KyleKabasares_PhD 2 ай бұрын

@@mgscheue Maybe to some, but not me lol

@AK-ox3mv 2 ай бұрын

Hello Proffessor, I was doing zen meditation in mountains to solve this problem for 7 years. Can you check my new solution😂😂😂

@KyleKabasares_PhD 2 ай бұрын

LOL I'm thinking about sending an email like that

@RuslanLagashkin 2 ай бұрын

And then sending all these crappy formatted plots to look completely enlightened

@Lugmillord 2 ай бұрын

Holy moly. Well, AGI in 2025 I guess.

@parthasarathyvenkatadri 2 ай бұрын

I think you should ask it to verify if it does always happen at perfect squares forever

@jeffwads 2 ай бұрын

And people like Sabine think these models are not useful. Pfft.

@trevoidc9859 2 ай бұрын

gemini is actually good now?

@ankitnmnaik229 2 ай бұрын

@@trevoidc9859 yep but use ai studio

@jimcallahan448 2 ай бұрын

Ask for a Jupyter Python notebook which displays the formulas and explains them.

@KasunWijesekara 2 ай бұрын

google is fighting back huh

@Arcticwhir 2 ай бұрын

agree, i've so far been really impressed with GE1121, i don't think its better than o1-preview

@Fixit6971 2 ай бұрын

Of what use is ASI if it gives us answers that are beyond our comprehension?

@hydrohasspoken6227 Ай бұрын

then improve your comprehension accordingly.

@mka17_ 2 ай бұрын

13:18 bro forgot he was using gemini

@KyleKabasares_PhD 2 ай бұрын

LOL I totally did

@mirek190 2 ай бұрын

That new gemini is insane ! Try this I have a bowl with a small cup inside. I placed the bowl upside down on a table and then pick up the bowl to put it in the microwave. Where is that cup? What the first llm i ever saw to solve it without any help.

@mka17_ 2 ай бұрын

the new 4o solves it too

@mirek190 2 ай бұрын

@@mka17_ you right! Before update could not do that.

@Lvxurie 2 ай бұрын

You should do a stream where you see how fast you can save this problem with Gemini where it doesn't give you the answer. More of an open book exam style. Could you have solved this as a PhD student with Gemini s help

@KyleKabasares_PhD 2 ай бұрын

I feel like that would be kind of a boring stream though, just my opinion. There's also like the odd pressure of being watched that I feel like doesn't let me fully immerse myself in a problem which is what I often need to do to be productive

@erkinalp 2 ай бұрын

Can you review Devin AI (expensive) vs. OpenHands (free) vs. Devika (free) vs. AIDE (free)?

@plaiday 2 ай бұрын

Hey I’d love to team up and build apps to test your ideas. Instead of manually prompting we can set up automated validation loops, iterate trials over time and track scores.

@doubled8741 2 ай бұрын

Bro if you wanna make this kind of video you yourself need to have good knowledge in it, by watching your video I feel like you are rookie in those subjects.

@epicmorphism2240 2 ай бұрын

yup

@KyleKabasares_PhD 2 ай бұрын

Fair enough! Just wanted to try this old idea out. I've done most of the testing in my other videos in the area of my PhD work, but this was an old problem I wanted an LLM to re-visit.

@Xjaychax9 2 ай бұрын

@@KyleKabasares_PhDOPs chatting shit, you have every right to create whatever damn video you like if it interests you.

@parthasarathyvenkatadri 2 ай бұрын

My question is how is it doing this .... Its still just a lets find the most fitting next token ...

@tellesu 2 ай бұрын

You ask a question and then state the bad assumption that is blocking your ability to understand the answer.

@artsybt6015 2 ай бұрын

Brooo, you really gotta let go of popular narrative of its just predicting the next token/its a autocomplete on steriods. We are all autocompelete on steriods lol.

@AAjax 2 ай бұрын

@@artsybt6015 I think it's likely we're autocomplete on steroids too. Predictive coding is a neuroscience theory of brain function, and it's a very mainstream theory as well.

@gustafpihl 2 ай бұрын

What would the most intelligent and capable entity conceivable do, given a clear objective? Output the most fitting next token. To be clear, not saying current LLMs are the most intelligent and capable entities concievable. Just pointing out that it's reasonable to expect the successful learning of next token prediction on high quality data to lead to a capable system.

@jsbgmc6613 2 ай бұрын

I'm sick and tired of the popular statement that the LLMs are just next token prediction. Ilya Sutskever answered this more than a year ago (i guess noone understood him). This is what he said: Imagine you read 50,000 word mystery novel. At the end the detective gathers everyone in a room and says, I know who killed the victim. The killer is ... Now "predict the next word". Prediction is understanding, and reasoning, and common sense, and ... (now you, predict the next word 😂)

@jmoreno6094 2 ай бұрын

I remember once a video where you didnt mention that you did a PhD

@KyleKabasares_PhD 2 ай бұрын

oops am i overdoing it now lol

@micspaffymillard719 2 ай бұрын

Dr. Morenomegadouchemajorhuge or just Mr. Morenomegadouchemajorhuge. I'd put my coin on Mr.

@DentoxRaindrops 2 ай бұрын

@@KyleKabasares_PhD don't get distracted by that one stupid, slightly critical comment, what you're doing is great, don't stop! No 'overdoing' in sight.

@marfmarfalot5193 2 ай бұрын

@@KyleKabasares_PhD na its fine

@brekkoh 2 ай бұрын

i see you are one of those people that talks out loud verbatim what they type

@KyleKabasares_PhD 2 ай бұрын

Haha sorry if it's annoying :(

@FreerunnerCamilo 2 ай бұрын

@@KyleKabasares_PhDIt’s not!

@AbelShields 2 ай бұрын

I think it's a bit rich to say it did "10 weeks worth of work" if you don't even know whether the results are correct :/

@KyleKabasares_PhD 2 ай бұрын

To be fair, I didn’t know if my results were correct either after 10 weeks and I suppose neither did the professor I worker for. Quite often you can do work for many months or years in research without knowing whether you are “correct”, but you still did the work, right? Likewise in this case, the point I’m trying to make is that Gemini 1121 could do what I tried to do over roughly a 10 week period in 10 minutes. Iterating over these results with the professor within a day or two of them could have led to much further progress than I made in the 10 weeks that summer.

@timothyapplescotch1361 2 ай бұрын

Kyle, I would be interested in your thoughts on Sabine Hossenfelder's comments that science is deeply compromised considering fields like theoretical physics have not made significant progress in 50 years. She is a theoretical physicist herself and a prominent KZbin science educator if you are not aware with her work. Here is a link to a recent video where she airs her concerns: kzbin.info/www/bejne/foK5d2OPqpyLaJYsi=N6KVWOKUbXuLPVep

@KyleKabasares_PhD 2 ай бұрын

I am quite familiar with her! I do watch her videos. I will consider watching that video in particular and giving some thoughts on the matter.