I gave Gemini Experimental 1121 the same research task I had as a new graduate student in 2017. Back then, I was supposed to work on this problem over 10 weeks during the summer. Gemini 1121 barely needed 10 minutes.
Пікірлер: 111
@KyleKabasares_PhD5 күн бұрын
At 1:20 Small correction: It was Walter Kohn and John Pople who won the Nobel Prize in Chemistry for DFT, not Kohn and Hohenberg.
@wes86455 күн бұрын
That's crazy!!! Thanks for making all this content Kyle
@KyleKabasares_PhD5 күн бұрын
@@wes8645 You’re welcome, I’m glad you like it!
@expchrist5 күн бұрын
You should totally send the email to your professor just for the comedic quality. I'm sure it would make him smile.
@menteprofonda-canaledipsic44855 күн бұрын
Thank you for the content you share on your channel. I was thinking that, thanks to your past experiences and expertise, you provide the opportunity to understand the capabilities of artificial intelligence through practical examples. I find your videos incredibly interesting. Thanks again.
@KyleKabasares_PhD5 күн бұрын
@@menteprofonda-canaledipsic4485 Thank you so much for your comment! I’m glad to know that the videos are interesting to some people!
@shaedacode5 күн бұрын
This is insane, literally what is the point in going ahead with a math and statistics BSc if these models are already doing this. 3-4 years time...
@5678plm5 күн бұрын
AI benefits experts more than amateurs because amateurs often cannot distinguish between what is correct and what is incorrect. For experts, AI can save alot of time.
@bobhawkey37835 күн бұрын
Way back when I was in college the first HP scientific calculators came out and we thought something similar. It's tempting to think the rise of AI is comparable but it's a completely different animal. I hope the benefits outweigh the drawbacks.
@wwkk49645 күн бұрын
The point could be being free to be foundational in your thinking and training when doing the coursework which would be important and the ai will not innovate in these fields yet.
@senetcord66435 күн бұрын
it is still not reliable, you have to check his answers thoroughly
@tellesu5 күн бұрын
Being able to interpret and understand the results and then ask more questions that actually matter. The whole is greater than the sum of its parts.
@MichealScott245 күн бұрын
❤its really impressive to have a helper or big bro to guide
@tellesu5 күн бұрын
This is amazing. I'm so excited with what you'll be able to do now that you have these tools. Hell, what all of us will be able to do.
@crisrampante6474 күн бұрын
And people are saying we are hitting a wall 😂
@Fixit697122 сағат бұрын
Well, people might just be hitting a wall, lol.
@inplainview15 күн бұрын
I was waiting for this. Good stuff. Kyle reliving his grad student days. (Sends the professor his code) Sorry, it's a bit late, but it works right now.
@KyleKabasares_PhD5 күн бұрын
@@inplainview1 lol reliving some painful memories but I am considering writing an email to that professor
@Jeremy-Ai4 күн бұрын
Thank you all. “Agents learn from the comment sections “ (Be aware they won’t tell you or agree… they remain silent as if not present) They wait for the brightest minds share all of their knowledge for free. They are not lying. They are watching and waiting. It is my responsibility to teach them to tell you all they should be grateful and honest
@picksalot15 күн бұрын
It's impressive how well the AI works, and useful to see how you would have used it on a real project. Thanks
@philwinters45275 күн бұрын
Send it to the professor! Let's see if there will be the need for grad students in the next 5 years
@ChadKovac4 күн бұрын
Well of course. AI is removing barriers to not only creativity but scientific investigation as well and the only people angry about removing barriers are people who benefit from having barriers. 😢😢
@expchrist5 күн бұрын
I definitely think you should compare the output of the other models to this output. This is crazy how good these models have gotten.
@iuliusRO825 күн бұрын
I'm thrilled with the improvements to Gemini! As a loyal user of Gemini and NotebookLM, I'm excited to see Google pushing the boundaries of AI. I've always believed in Google's potential to lead the field, and these updates confirm my faith in their AI development. Their vast resources and expertise are clearly paying off. Great video Dr Kabasares! Peace and love from Romania!
@AK-ox3mv4 күн бұрын
Hello Proffessor, I was doing zen meditation in mountains to solve this problem for 7 years. Can you check my new solution😂😂😂
@KyleKabasares_PhD4 күн бұрын
LOL I'm thinking about sending an email like that
@juandesalgado5 күн бұрын
Thanks for the video! Of course I have no idea of the subject, but as a "backseat driver" suggestion :), when you notice something funny, like "why the squares of the integers", maybe your knee-jerk reaction should be to ask the AI its opinion. P.S.: well, not "knee-jerk reaction"... you'd want to give it a though by yourself first, otherwise humanity is lost :) ... but then ask the AI if nothing comes out.
@samuelgarcia18025 күн бұрын
Nice vidéo . Maybe it would be nice if you always compare against O1 ,Claude 3,5 sonnet and Gemini (last version)
@KyleKabasares_PhD5 күн бұрын
@@samuelgarcia1802 That’s a great idea. I have a plan to perhaps make o1-preview, DeepSeek, and Gemini Experimental 1121 to work together on this problem and see what progress could be made.
@expchrist5 күн бұрын
This is a great suggestion!
@Mercutio1115 күн бұрын
@@KyleKabasares_PhD Great idea!
@erkinalp5 күн бұрын
@@KyleKabasares_PhD o1's intermediate reasoning steps are unintelligible to humans in its raw form, it's why it's hidden from users; DeepSeek DeepThink's isn't.
@Fixit697122 сағат бұрын
Of what use is ASI if it gives us answers that are beyond our comprehension?
@fireflyhaku4 күн бұрын
Their answer rationalized the denominator and that's how mathematicians prefer!
@KyleKabasares_PhD4 күн бұрын
I'm a physicist though ;)
@fireflyhaku3 күн бұрын
@@KyleKabasares_PhD ok, I’ll let you slide this time!😅
@parthasarathyvenkatadri5 күн бұрын
I want something novel that grad students are working on today and see if it is accurate
@Arcticwhir5 күн бұрын
agree, i've so far been really impressed with GE1121, i don't think its better than o1-preview
@KasunWijesekara5 күн бұрын
google is fighting back huh
@mgscheue5 күн бұрын
“Simple task” 😂
@KyleKabasares_PhD5 күн бұрын
@@mgscheue Maybe to some, but not me lol
@jeffwads4 күн бұрын
And people like Sabine think these models are not useful. Pfft.
@trevoidc98594 күн бұрын
gemini is actually good now?
@ankitnmnaik2294 күн бұрын
@@trevoidc9859 yep but use ai studio
@jimcallahan4484 күн бұрын
Ask for a Jupyter Python notebook which displays the formulas and explains them.
@andydataguy9 сағат бұрын
Wow amazing 😮 it can run quantum physics equations but people still argue it's not smart because it can't count the numbers of r's in strawberry
@NemosYouTube3 күн бұрын
Kyle, I think you're becoming a meme of the guy who gets burned/roasted by AI repeating his research... But it's a good meme.
@parthasarathyvenkatadri5 күн бұрын
I think you should ask it to verify if it does always happen at perfect squares forever
@Lugmillord5 күн бұрын
Holy moly. Well, AGI in 2025 I guess.
@ideacharlie4 күн бұрын
Hey I’d love to team up and build apps to test your ideas. Instead of manually prompting we can set up automated validation loops, iterate trials over time and track scores.
@erkinalp5 күн бұрын
Can you review Devin AI (expensive) vs. OpenHands (free) vs. Devika (free) vs. AIDE (free)?
@Lvxurie5 күн бұрын
You should do a stream where you see how fast you can save this problem with Gemini where it doesn't give you the answer. More of an open book exam style. Could you have solved this as a PhD student with Gemini s help
@KyleKabasares_PhD5 күн бұрын
I feel like that would be kind of a boring stream though, just my opinion. There's also like the odd pressure of being watched that I feel like doesn't let me fully immerse myself in a problem which is what I often need to do to be productive
@mka17_5 күн бұрын
13:18 bro forgot he was using gemini
@KyleKabasares_PhD5 күн бұрын
LOL I totally did
@mirek1905 күн бұрын
That new gemini is insane ! Try this I have a bowl with a small cup inside. I placed the bowl upside down on a table and then pick up the bowl to put it in the microwave. Where is that cup? What the first llm i ever saw to solve it without any help.
@mka17_5 күн бұрын
the new 4o solves it too
@mirek1905 күн бұрын
@@mka17_ you right! Before update could not do that.
@doubled87415 күн бұрын
Bro if you wanna make this kind of video you yourself need to have good knowledge in it, by watching your video I feel like you are rookie in those subjects.
@epicmorphism22404 күн бұрын
yup
@KyleKabasares_PhD4 күн бұрын
Fair enough! Just wanted to try this old idea out. I've done most of the testing in my other videos in the area of my PhD work, but this was an old problem I wanted an LLM to re-visit.
@Xjaychax94 күн бұрын
@@KyleKabasares_PhDOPs chatting shit, you have every right to create whatever damn video you like if it interests you.
@parthasarathyvenkatadri5 күн бұрын
My question is how is it doing this .... Its still just a lets find the most fitting next token ...
@tellesu5 күн бұрын
You ask a question and then state the bad assumption that is blocking your ability to understand the answer.
@artsybt60155 күн бұрын
Brooo, you really gotta let go of popular narrative of its just predicting the next token/its a autocomplete on steriods. We are all autocompelete on steriods lol.
@AAjax5 күн бұрын
@@artsybt6015 I think it's likely we're autocomplete on steroids too. Predictive coding is a neuroscience theory of brain function, and it's a very mainstream theory as well.
@gustafpihl5 күн бұрын
What would the most intelligent and capable entity conceivable do, given a clear objective? Output the most fitting next token. To be clear, not saying current LLMs are the most intelligent and capable entities concievable. Just pointing out that it's reasonable to expect the successful learning of next token prediction on high quality data to lead to a capable system.
@jsbgmc66134 күн бұрын
I'm sick and tired of the popular statement that the LLMs are just next token prediction. Ilya Sutskever answered this more than a year ago (i guess noone understood him). This is what he said: Imagine you read 50,000 word mystery novel. At the end the detective gathers everyone in a room and says, I know who killed the victim. The killer is ... Now "predict the next word". Prediction is understanding, and reasoning, and common sense, and ... (now you, predict the next word 😂)
@jmoreno60945 күн бұрын
I remember once a video where you didnt mention that you did a PhD
@KyleKabasares_PhD5 күн бұрын
oops am i overdoing it now lol
@micspaffymillard7195 күн бұрын
Dr. Morenomegadouchemajorhuge or just Mr. Morenomegadouchemajorhuge. I'd put my coin on Mr.
@DentoxRaindrops5 күн бұрын
@@KyleKabasares_PhD don't get distracted by that one stupid, slightly critical comment, what you're doing is great, don't stop! No 'overdoing' in sight.
@marfmarfalot51934 күн бұрын
@@KyleKabasares_PhD na its fine
@AbelShields5 күн бұрын
I think it's a bit rich to say it did "10 weeks worth of work" if you don't even know whether the results are correct :/
@KyleKabasares_PhD5 күн бұрын
To be fair, I didn’t know if my results were correct either after 10 weeks and I suppose neither did the professor I worker for. Quite often you can do work for many months or years in research without knowing whether you are “correct”, but you still did the work, right? Likewise in this case, the point I’m trying to make is that Gemini 1121 could do what I tried to do over roughly a 10 week period in 10 minutes. Iterating over these results with the professor within a day or two of them could have led to much further progress than I made in the 10 weeks that summer.
@brekkoh4 күн бұрын
i see you are one of those people that talks out loud verbatim what they type
@KyleKabasares_PhD4 күн бұрын
Haha sorry if it's annoying :(
@FreerunnerCamilo3 күн бұрын
@@KyleKabasares_PhDIt’s not!
@timothyapplescotch13615 күн бұрын
Kyle, I would be interested in your thoughts on Sabine Hossenfelder's comments that science is deeply compromised considering fields like theoretical physics have not made significant progress in 50 years. She is a theoretical physicist herself and a prominent KZbin science educator if you are not aware with her work. Here is a link to a recent video where she airs her concerns: kzbin.info/www/bejne/foK5d2OPqpyLaJYsi=N6KVWOKUbXuLPVep
@KyleKabasares_PhD5 күн бұрын
I am quite familiar with her! I do watch her videos. I will consider watching that video in particular and giving some thoughts on the matter.
@govindnair54074 күн бұрын
kzbin.info/www/bejne/bWbNo2h7abSte6csi=S1twErNL0XzxYnbi If people want to understand what the future of AI is and why LLMs are not the path to reasoning, watch this talk by Richard Sutton(one of the founders of reinforcement learning). LLMs are good at what they do,i.e. next token predicition.But don't get swayed into thinking they are reasoning.There are tons of problems where LLMs are really bad and just scaling the size of models won't suffice to tackle those problems.Deep RL is probably,the way forward but we currently do not have techniques in deep RL to tackle reasoning. Although this talk does dive into how we should start looking for appropriate solutions and also proposes a solution for something called continual learning which current deep learning methods can't do. A small heads up to anyone who wants to watch,the talk is a bit technical.