Multimodal prompting with a 44-minute movie | Gemini 1.5 Pro Demo

  Рет қаралды 247,947

Google

Google

Күн бұрын

Пікірлер: 238
@HappyBirthdayGreetings
@HappyBirthdayGreetings 10 ай бұрын
I like that this was done in a honest way
@CK-kd5pn
@CK-kd5pn 10 ай бұрын
Can you elaborate on what you mean by done in an honest way?
@ShouryanNikam
@ShouryanNikam 10 ай бұрын
@@CK-kd5pn They faked the last demo
@leeleo9615
@leeleo9615 10 ай бұрын
they released a demo video a while back showcasing Gemini Ultra. The video was carefully edited in a way and sped up to give you an illusion that the model can give really high quality respond consistently and quickly, which is not the truth. To be precise the demo is half accurate, and Google sugarcoated it.@@CK-kd5pn
@LudwikTrammer
@LudwikTrammer 10 ай бұрын
@@CK-kd5pn They pointed out the sped-up waiting times and showed how long it actually took to generate a response. During the original Gemini launch, they released a famously misleading demo video - it portrayed real-time interactions with the model, which appeared to be based on a live video stream. In reality, the responses were pre-generated from carefully curated static images. It looks like they've learned their lesson, though.
@TamVuongvan-hj3hw
@TamVuongvan-hj3hw 10 ай бұрын
@@CK-kd5pn 0:37
@laStar972chuck
@laStar972chuck 10 ай бұрын
1. This is amazing ! 2. They clearly learned their lesson about last "edited" demo. Always good to see perpetual improvement in an organisation
@KinqNick
@KinqNick 9 ай бұрын
absolutely. this time they have a clear advantage against their competitors (with their token length). Last time with gemini ultra it was "just another llm who cares". I hope that it is as impressive as it looks so far.
@laStar972chuck
@laStar972chuck 9 ай бұрын
@@KinqNick I truly doubt they can maintain this cost structure for long because that must be expansive to run They'll prolly educate the customer into being patient (long response time) or some sort of job queuing/wait But now they need to catch-up and it's all to users benefits. Gonna be a wild ride while it lasts !
@KinqNick
@KinqNick 9 ай бұрын
@@laStar972chuck ya could be. Even in the demo some responses were above 60 sec, what I would accept but how would it change, if thousands if not millions use the service.
@HassanHashemiFarahani
@HassanHashemiFarahani 10 ай бұрын
This is how you create a demo. Simply amazing 👌
@raaajit1
@raaajit1 10 ай бұрын
This is insane , totally insane 😮usage : you don't have to watch your whole CCTV footage , Instead, you can just inquire what needs to be identified.
@poxer1
@poxer1 10 ай бұрын
Great example. Did not think of that use case. Take a document which is 100 pages like a law, insert it and ask it to find a specific part. Or to tell you if the documents of a person allow that person to be accepted in university. Or huge medical file of a patient which needs analysis
@siloporcen
@siloporcen 10 ай бұрын
@user-vx9tn3iz4n you already know the answer to that question lil bro.
@apache937
@apache937 10 ай бұрын
Yes. and it can be runnning in near real time scanning the footage of all the cameras you have for suspicious activities
@Whiztlex
@Whiztlex 10 ай бұрын
​@@apache937With government having access to all the cameras😈 Big brother is about to become all seeing
@Danuxsy
@Danuxsy 10 ай бұрын
Exactly, the more data the better, EYES WILL BE EVERYWHERE.
@strikewave1918
@strikewave1918 10 ай бұрын
With OpenAI's video generation and Google's video comprehension it seems like almost every aspect of digital media is now processable and synthesizable via AI. Absolutley incredible.
@paulinaolender6034
@paulinaolender6034 3 ай бұрын
@RichsrSanch-d3y
@RichsrSanch-d3y Ай бұрын
@@paulinaolender6034
@Hobby_Technology
@Hobby_Technology 10 ай бұрын
The most impressive thing is that when you asked for the time code it responded with simply with "15:34" and not "You have shown me an image of what appears to be a crude drawing of a stick figure below a water tower, with a spout coming from the tower showering water onto the stick figure character ...[3 more paragraphs later]... This is similar to a moment in the film Sherlock Jr. by Buster Keaton in which a character from the movie gets covered in water from a water tower, this happens at 15 minutes and 34 seconds." Sometimes I get really sick of how long-winded ChatGPT is.
@CK-kd5pn
@CK-kd5pn 10 ай бұрын
Can't you just ask for a brief response?
@apache937
@apache937 10 ай бұрын
Yes but if the context window is long it ignores that @@CK-kd5pn
@noahwalsh8426
@noahwalsh8426 10 ай бұрын
Gemini advanced has so many guardrails you spend half the time training the model smh
@TuxraGamer
@TuxraGamer 10 ай бұрын
I really appreciated seeing that too, it really feels like something that would be compatible with the Unix philosophy (the computer must do *explicitly* what it was asked for).
@MendigoLouco
@MendigoLouco 10 ай бұрын
@@CK-kd5pn "Of course, I will try to keep it brief. The time code for the scene depicted in the image you provided is 15:34. Do you need anything else?"
@julienhovan5725
@julienhovan5725 10 ай бұрын
I would definitely like to see an example of asking it a sketch or description of an event that didn’t happen in the Movie, as its one thing to find it, its another to recognize that something doesn’t exist.
@srinivasjakkamsetti2635
@srinivasjakkamsetti2635 9 ай бұрын
..... 😅😊
@ИгорьМясников-т5п
@ИгорьМясников-т5п 3 ай бұрын
@Tracesoftradition
@Tracesoftradition 10 ай бұрын
That’s amazing! Can’t wait to see what else Gemini 1.5 Pro is capable of doing.
@joelhulsey8387
@joelhulsey8387 10 ай бұрын
This is crazy impressive. Wow.
@zaiologyy
@zaiologyy 9 ай бұрын
So neeat! The possibilities are truly endless
@zergar5671
@zergar5671 10 ай бұрын
This will be amazing for going through 24 hour surveillance videos for police investigators to find the exact crime window.
@utomo8
@utomo8 10 ай бұрын
I hope we can have it on consumer CCTV also to better surveillance and alerts. when somebody enter some areas. it can greet or warns by time and known people also
@mistycloud4455
@mistycloud4455 9 ай бұрын
that is dystopian
@DeveloperJS314
@DeveloperJS314 10 ай бұрын
Can't wait to try out 1.5 pro! I am looking forward for 1.5 ultra!
@Romathefirst
@Romathefirst 10 ай бұрын
is the pro the free one ???
@Ricolaaaaaaaaaaaaaaaaa
@Ricolaaaaaaaaaaaaaaaaa 10 ай бұрын
@@Romathefirst yes
@FartuunMohamed-ke5ki
@FartuunMohamed-ke5ki 9 ай бұрын
1:41
@Moh1mmed
@Moh1mmed 10 ай бұрын
Wow, this is really impressive
@bob3160
@bob3160 10 ай бұрын
Fantastic. Can't wait to put it to work. Thanks
@shrikarmadhu2776
@shrikarmadhu2776 10 ай бұрын
damn!! really impressive!! cant wait to try it out ....
@erichwschneider
@erichwschneider 10 ай бұрын
I have to close my mouth, amazing!!!❤️
@crosbja360
@crosbja360 10 ай бұрын
That seems very impressive! AI keeps getting better every few months. Good job Google!
@finbeats
@finbeats 10 ай бұрын
Every day!
@CodingAqyanoos
@CodingAqyanoos 9 ай бұрын
Wow very interesting. Now you can search in videos. That is amazing.
@josephhuelbig
@josephhuelbig 9 ай бұрын
There is an indexer setup you can use in combination with quantized images to find missing people, evidence of crimes, etc. You can draw up quantized images from off sequence outside the square. Needs data mining of quantized images, database, application to Google Maps and index
@sorryaboutyourcats
@sorryaboutyourcats 10 ай бұрын
Great to see massive improvements already! When Bard was originally released, it felt rushed out and was confusingly poor considering how long Google has been in the generative AI space for. Looking forward to trying it out. 😸👍
@Someone7R7
@Someone7R7 10 ай бұрын
Openai is in serious danger 😂
@Tracesoftradition
@Tracesoftradition 10 ай бұрын
I don’t it is. They’re just going to step it up a notch.😊
@finbeats
@finbeats 10 ай бұрын
@@Tracesoftraditionsora vs this. Just another race but this time to agi. Exciting stuff
@ilyass-alami
@ilyass-alami 10 ай бұрын
Gemini pro Evolved and became stronger ❤❤
@SanaagSomaliland
@SanaagSomaliland 9 ай бұрын
Minority Report moment is coming sooner than I thought.
@SanaagSomaliland
@SanaagSomaliland 9 ай бұрын
If this is what the private entreprises are now capable of, imagine what the Gov't has been capable of secretly.
@3ndriu999
@3ndriu999 10 ай бұрын
I don't understand why I've always seen the demos perform better than the official versions...
@finbeats
@finbeats 10 ай бұрын
Your Prompt issue
@DevolperOperation
@DevolperOperation 10 ай бұрын
Stunning.
@dishcleaner2
@dishcleaner2 10 ай бұрын
This is bewildering.
@gabrielfernandez3782
@gabrielfernandez3782 10 ай бұрын
Amazing, simply amazing.
@raphetsonatelier
@raphetsonatelier 10 ай бұрын
Simply incredible 😮 I can't believe it can parse that much information in so little time
@mohamedkarim-p7j
@mohamedkarim-p7j 10 ай бұрын
Thank for sharing👍
@chlee4256
@chlee4256 10 ай бұрын
As impressive as this one is, let's hope Google doesn't get called out like the Gemini demo.
@anav587
@anav587 10 ай бұрын
Doesn't even matter, I doubt they're lying about the 10m window
@nawabifaissal9625
@nawabifaissal9625 10 ай бұрын
@@anav587 yeah same i doubt they would lie about that especially since there's not really a big marketing project to introduce a new model or else, it's "just" an improvement
@pebre79
@pebre79 10 ай бұрын
Also their Google Assistant making phone calls demo :/
@AkameAGKxv
@AkameAGKxv 4 ай бұрын
Cool😮❤️‍🔥
@muru603
@muru603 10 ай бұрын
Freaking amazing!
@_ahrorjon
@_ahrorjon 10 ай бұрын
It’s absolutely insane 😍😍😍😍
@Илья-е4э8ю
@Илья-е4э8ю 10 ай бұрын
I remember 2 years ago, I wanted to find something like these models, But I didn't find them, it was like a closed door with no doorknobs. Thank you for making my dreams come true, This is a dream come true, thank you.
@_abdul
@_abdul 10 ай бұрын
Yup, We're There.
@ayukay4538
@ayukay4538 10 ай бұрын
Sorry google, i didn't have 2 minutes to spare watching this demo video, so i asked an AI to watch it in my stead, and to produce this comment as a result.
@mansoor8228
@mansoor8228 10 ай бұрын
😂😂
@sumansaha295
@sumansaha295 10 ай бұрын
Now this is impressive
@ilyass-alami
@ilyass-alami 10 ай бұрын
We know that Gemini is still constantly developing, because it does not currently compete, and Google is a giant company It should be the queen of artificial intelligence, and provided to users for free
@amjadhussainamjad786
@amjadhussainamjad786 10 ай бұрын
Good luck Gemini
@darkphoenix2
@darkphoenix2 9 ай бұрын
Ok, I don't want people like artists to lose their jobs to AI, but this kind of thing is really cool. Don't know how useful it will be, but it's cool.
@wrcz
@wrcz 10 ай бұрын
thanks, just bought some Google stock
@vikramjeetsingh4627
@vikramjeetsingh4627 9 ай бұрын
Now this was cool!!
@abhiram978-
@abhiram978- 10 ай бұрын
this is soooo good
@ShpanMan
@ShpanMan 10 ай бұрын
Awesome work. Competition will bring the AI gods faster.
@florianfeldhaus6478
@florianfeldhaus6478 10 ай бұрын
Mindblowing
@HiddenExp
@HiddenExp 10 ай бұрын
This means we could generate long and consistent storytelling, damn! And longer AI NPCs memory!
@web3global
@web3global 10 ай бұрын
Just WOW! 🤩
@gptty
@gptty 10 ай бұрын
Good job
@mistycloud4455
@mistycloud4455 9 ай бұрын
The speed of ai progress is insane
@kjb8449
@kjb8449 8 ай бұрын
Super!
@jeanchindeko5477
@jeanchindeko5477 10 ай бұрын
Gemini Pro 1.0 was released in December and 2 months after we already have a huge upgrade with version 1.5. What will this world look like in December 2024?
@wanarchives
@wanarchives 9 ай бұрын
When the real dev makes their own video demo instead of marketing team
@peztopher7297
@peztopher7297 9 ай бұрын
I'd think the simpler the drawing, the better. You have to decide what the important features are. Detail would be noise. Years back, I realized this was how cartoons (images, not video) work. What is the minimum representation, for example, of a mouse?
@MichealHawrylyshen
@MichealHawrylyshen 9 ай бұрын
Where was Base 211 located? Wasn't Feugo Patagonia or the Falklands...
@muhammadasiffarooqi7672
@muhammadasiffarooqi7672 10 ай бұрын
Mind blowing
@alexissvetrev
@alexissvetrev 9 ай бұрын
I. Amazed with all these, how come no one is building home server robots. All the elements are there...
@compareFirepower
@compareFirepower 10 ай бұрын
This is nice.
@bensoos
@bensoos 10 ай бұрын
Are these voices artificial?
@FazriIhsan
@FazriIhsan 10 ай бұрын
When will the Gemini app be released in Indonesia
@dimii27
@dimii27 10 ай бұрын
Am I the only one who finds it funnny that this is filmed on a mac?
@ZweSim
@ZweSim 9 ай бұрын
Amazing
@johnflux1
@johnflux1 9 ай бұрын
Very very nicely done. Absolutely amazing, and so much better video than the last one.
@eeuno-v2r
@eeuno-v2r 10 ай бұрын
the stock price reflect everything
@aigriffin42604
@aigriffin42604 6 ай бұрын
Why does Gemini require tokens?
@r5LgxTbQ
@r5LgxTbQ 10 ай бұрын
just speechless
@danielmartinmonge4054
@danielmartinmonge4054 10 ай бұрын
This is 1.5 but PRO is It? Not ultra. Just giving a crazy amount of context to the more basic model right?
@eyescreamcake
@eyescreamcake 10 ай бұрын
How much did that one minute of compute cost?
@jd5787
@jd5787 10 ай бұрын
Great, i will know how to use gemini next time I need to get info on a movie and associated time code. Now: how do I leverage gemini for my business? Pre-sales, sales, finance operations for instance... How about being able to get proper analysis of legal documents, suggestions and clauses drafting (which chatgpt can do pretty well) before sending to a lawyer for vetting/approval? How about using Gemini in Google meet to get call report/summary that can be inserted in a CRM? How about being able to get relevant mockups, designs and images that can be used and exported in usable format (eos, ai etc). Right now, it looks like Gemini is a powerful model with limited use cases.
@deejay2023i
@deejay2023i 10 ай бұрын
This app not available in France
@hellodavidryan
@hellodavidryan 10 ай бұрын
Dear future overlords, be merciful. Signed, squishy humans.
@eliporter3980
@eliporter3980 9 ай бұрын
Man this stuff is accelerating fast.
@wellingtonmnf_ti
@wellingtonmnf_ti 8 ай бұрын
I can't find the correct words for explain what I saw right now...
@poxer1
@poxer1 10 ай бұрын
Ok, so inteligence and knowledge is now owned by companies like google. Time to learn carpentry
@mansoor8228
@mansoor8228 10 ай бұрын
Nice
@Yewbzee
@Yewbzee 9 ай бұрын
It baffles me that so many “AI experts” are still predicting AGI is many years away. I can only assume they are doing this in bad faith in an attempt to fend off heavy handed regulation.
@christoschri9193
@christoschri9193 10 ай бұрын
Why was Gemini created? you can just improve Google Assistant .... you can just merge the two apps into one ...
@pablonolberto6557
@pablonolberto6557 10 ай бұрын
Openai already feels cornered, how is it possible that Google will present Gemini Pro 1.5 some after Openai takes its ace on generating videos.
@eclectice
@eclectice 9 ай бұрын
Now I can ask Gemini Pro 1.5 to identify any potential shoplifters.
@MohammadFarman-dr9hg
@MohammadFarman-dr9hg 6 ай бұрын
Samsung and Google system fuchsia os 🎉😮😊❤
@eukaryote-prime
@eukaryote-prime 10 ай бұрын
Its a good thing that liquid was blue
@RUMPshit
@RUMPshit 9 ай бұрын
Is google just using RAG calling it 1m token context?.... I dont think the LLM actually reads 1m tokens...
@hooooman.
@hooooman. 10 ай бұрын
Whats that token actually means? Why google raised to a new standard of 1 million token?
@FirasMohamed96
@FirasMohamed96 10 ай бұрын
Can you Imagine Gemini 10.5 what would be? 😳
@theeternalnow6506
@theeternalnow6506 10 ай бұрын
I think we'll have an AI overlord before then at this pace.
@killstreak4767
@killstreak4767 10 ай бұрын
This needs a separate app
@na-gi1xb
@na-gi1xb 9 ай бұрын
🤫🧏‍♂️
@user-ji1ko2vs5g
@user-ji1ko2vs5g 9 ай бұрын
❤ 0:32
@TwoTeaTee
@TwoTeaTee 10 ай бұрын
Holy sheets!
@JazevoAudiosurf
@JazevoAudiosurf 10 ай бұрын
i do believe this demo
@maylahsyari
@maylahsyari Ай бұрын
why google sign in😶
@PeggyBoggs-lr7xc
@PeggyBoggs-lr7xc 9 ай бұрын
0:50 1:01 10 🎉
@tony.cortez
@tony.cortez 10 ай бұрын
Application for this is, use in investigation.
@lowlufi
@lowlufi 7 ай бұрын
Nex gen: Jarvis
@vietlongpham
@vietlongpham 10 ай бұрын
Now who could say GPT is better. Google beat them up
@lomfulanolwazi7241
@lomfulanolwazi7241 7 ай бұрын
amen
@JOHN.Z999
@JOHN.Z999 10 ай бұрын
An efficient multimodal system with a significantly expanded context has impressed me! This is precisely what I need, Google: efficiency and improved reasoning from these models. ❤
@rahneshin752
@rahneshin752 10 ай бұрын
❤❤❤
@ApatheticPerson
@ApatheticPerson 10 ай бұрын
I really hope this is not fake just like the last time.
@Mackcolak-xf5bk
@Mackcolak-xf5bk 10 ай бұрын
Here are some key takeaways from the video: 1. Gemini 1.5 Pro is an experimental new model from google that can understand very long context - up to 1 million tokens. 2. They demonstrated this capability by using a 44-minute Buster Keaton film (over 600,000 tokens) as the context. 3. The model was able to find a specific moment in the film when prompted - it located the exact timecode when a pawn ticket was removed from a person's pocket, and extracted accurate text details from the ticket. 4. The model was also able to match a simple drawing to the correct scene and timecode, showing an understanding of multimodal input combining text and images. 5. Long context understanding opens up new possibilities for complex and detailed questions about books, films, etc that weren't possible with previous models. 6. Responses will vary and may not always be perfect, but the examples show promising capabilities in extracting details from extremely long context. 7. Simple drawings can be an effective way to probe the model's understanding without having to describe the full context again. In summary, Gemini 1.5 Pro demonstrates an early but meaningful capability in long-form reasoning and understanding from both textual and multimodal context.
@CK-kd5pn
@CK-kd5pn 10 ай бұрын
Anthropic?? Don't just copy paste from ai bro read it over
@apache937
@apache937 10 ай бұрын
bot
@Mackcolak-xf5bk
@Mackcolak-xf5bk 10 ай бұрын
@@CK-kd5pn were these takeaways wrong?
@CK-kd5pn
@CK-kd5pn 10 ай бұрын
@@Mackcolak-xf5bk Just look at the first point...
@Mackcolak-xf5bk
@Mackcolak-xf5bk 10 ай бұрын
@@CK-kd5pn i see now, thank you.
@omkarchavan5940
@omkarchavan5940 10 ай бұрын
Google using mac os and not chrome os?
@ianlewin8888
@ianlewin8888 10 ай бұрын
Next step would be making the AI can tell what's real and not.
@BalveerSingh4956-t4t
@BalveerSingh4956-t4t 9 ай бұрын
और सुनो जी आप सत्य मार्ग सबसे बडा धर्म है अलवर राजस्थान भारत से बलवीरसिंह ❤
Introducing Gemini 2.0 | Our most capable AI model yet
2:53
She made herself an ear of corn from his marmalade candies🌽🌽🌽
00:38
Valja & Maxim Family
Рет қаралды 16 МЛН
Sigma Kid Mistake #funny #sigma
00:17
CRAZY GREAPA
Рет қаралды 27 МЛН
VIP ACCESS
00:47
Natan por Aí
Рет қаралды 21 МЛН
Google - 25 Years in Search: The Most Searched
3:49
Google
Рет қаралды 354 МЛН
AI makes a Mark Rober video | Bard with Gemini Pro
6:19
Google
Рет қаралды 29 МЛН
How Google Search Works (in 5 minutes)
5:16
Google
Рет қаралды 20 МЛН
Google - Year in Search 2024
3:57
Google
Рет қаралды 5 МЛН
Using AI to solve complex problems | Gemini
5:01
Google
Рет қаралды 372 М.
Project Astra: Our vision for the future of AI assistants
2:17