The capabilities of multimodal AI

The capabilities of multimodal AI | Gemini Demo

Рет қаралды 3,156,538

Күн бұрын

Our natively multimodal AI model Gemini is capable of reasoning across text, images, audio, video and code. Here are favorite moments with Gemini Learn more and try the model: deepmind.google/gemini
Explore Gemini: goo.gle/how-its-made-gemini
For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity.
Subscribe to our Channel: / google
Tweet with us on X: / google
Follow us on Instagram: / google
Join us on Facebook: / google
0:00 Intro
0:19 Multimodal Dialogue
1:32 Multilinguality
2:04 Game Creation
2:31 Visual Puzzles
3:17 Making Connections
3:39 Image & Text Generation
4:06 Logic & Spatial Reasoning
4:55 Translating Visuals
5:27 Cultural Understanding

Пікірлер: 3 900

@dpsdps01 7 ай бұрын

Absolutely mindblowing. The amount of understanding the model exhibits here is way way beyond anything else.

@NeuroScientician 7 ай бұрын

It's staged.

@gerardojg 7 ай бұрын

I agree but I wouldn't describe it as "understanding". Identification and cognitively identify possibilities with given data. It is very impressive!

@cajbajthewhite4889 7 ай бұрын

@@NeuroScientician I've gotten GPT-4 V to play tabletop wargames with me and it had decent strategy, and to read my poor quality sketches. If Gemini Ultra succeeds at the benchmarks they claim it does and is built with native multimodality, there's no reason to believe that the video is staged beyond the fact that they've sped up the responses a bit (which is shown in text at the beginning).

@goturmatau 7 ай бұрын

@@NeuroScientician It's surely rehearsed, but don't underestimate the power of the LLM.

@Google 7 ай бұрын

Thrilled to hear you think so! Enjoy using Bard with Gemini Pro ✨

@degenplanet 7 ай бұрын

Just one problem: the video isn’t real. “We created the demo by capturing footage in order to test Gemini’s capabilities on a wide range of challenges. Then we prompted Gemini using still image frames from the footage, and prompting via text.” (Parmy Olsen at Bloomberg was the first to report the discrepancy.)

@buttofthejoke Ай бұрын

They changed the title. Previously it was called "Hands on with Gemini".

@Kudagraz Ай бұрын

it says in the intro "showing it a series of images"

@s3tifpv941 Ай бұрын

@@Kudagraz "There was no voice interaction, nor was the demo happening in real time."

@joshuaryde9028 7 ай бұрын

Google has admitted in a blog post that this video isn’t accurate- the AI “was not responding to the voice or video at all”, but in fact had written prompts to respond to and still images rather than the live drawing/conversation which are not shown in the video.

@NoMercy.62 4 ай бұрын

where did they say that?

@blacknoir2404 Ай бұрын

we'd all have to wait 5 more months for something like this haha

@FamimFred Ай бұрын

Wait, what?

@OmegaGlops Ай бұрын

@@FamimFred The video was faked. If you just search for "Google's best Gemini demo was faked" you can find an article about it, and there's plenty of videos on KZbin delving into the issue. Fortunately, OpenAI's latest ChatGPT 4o model is actually capable of doing a lot of the stuff in this video that Google faked.

@vikingnusantara 18 күн бұрын

Well it is still mindblowing. Because nothing was added in the a.i response.

@ChrisBrooksbank 7 ай бұрын

Im glad to see Google back in the game, this looks next level.

@MikeKleinsteuber 7 ай бұрын

No they ain't. This will never see the light of day in the public arena

@anuragparmar8155 7 ай бұрын

@@MikeKleinsteuberwhy so

@jman 7 ай бұрын

@@MikeKleinsteuber it's already accessible for the public

@reconquista1911 7 ай бұрын

Yeah, evil company is in the game. What bad could happen?

@dexio85 7 ай бұрын

They are trying to look this way for sure. But this is a gimmick and a toy, maybe useful for vision impared, but that's it. Google is not capable of creating working product for the public for years now.

@TimeBucks 7 ай бұрын

The real-time element is by far the most impressive.

@somthingz3928 7 ай бұрын

Don't get your hopes up. It's not real time.

@MdsweetSweet-ox6jp 7 ай бұрын

Nice

@TalwinderDhillonTravels 7 ай бұрын

Lol this is just an edited video Nothing real time

@appletree6741 6 ай бұрын

It’s fake apparently

@beayn 5 ай бұрын

This is their favorite interactions with the AI, so they edit out the ones where it performed poorly which was probably the majority of them. Once they polish it up over the next few years I'm sure it will be able to do this in almost-real-time as in it will probably take several seconds to react to what you're doing... and of course, you'll be able to subscribe for $29.99 per month for faster responses.

@tusharparkhe3245 7 ай бұрын

This is really fascinating! I was waiting for the Gemini and it's finally here! I hope this Gemini is as capable as the video is showcasing it. but I noticed that this video is edited especially when the person rotates the phone while showing the cat's demo at 5:36 that video has clearly been added later...

@familymultiplayergames1226 7 ай бұрын

When did Google lose their way and think it’s ok to fake videos to raise stock prices.

@99.googolplex.percent 4 ай бұрын

There's a chance this exists, but sharing such information publicly might not be feasible in the near future.

@Jonahlyn-uh2uv Ай бұрын

0:06

@YTV-Hoddeok 7 ай бұрын

Such an interesting work!! Hope to see more incredible things in the near future

@masija23 6 ай бұрын

😊😊😊

@klx6265 7 ай бұрын

Absolutely mind blown by the scale of context awareness here. G for Gemini.

@christinestpierre3462 6 ай бұрын

Fascinating 😮 I can’t wait to see what we’ve accomplished in another 5 years

@Maharid 7 ай бұрын

Ok, this was really good to see, this is surely the right direction.

@TicTockBrandShop 7 ай бұрын

I really cannot quite believe what my eyes have just shown me For me, this is the most incredible piece of A.I advancement the world has seen.Period. Mind blown, when I try to just imagine what the A.I world will could become in just a few years from now. Amazing and every other superlative I could throw at you.

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex demo from a few years ago. Don't believe anything from these people until you see it for yourself.

@swatmaster492 7 ай бұрын

it's incredibly misleading and not actually real-time.

@TicTockBrandShop 7 ай бұрын

Ah didn't know that.Thanks my friend.

@Cockroach_underwear 6 ай бұрын

Wow! Great job google,I hope it lives up to everyone’s expectation! Seems like our utopia might not be too far off in the future

@Minimalrevolt-m83 6 ай бұрын

Superb fantastic creative invention from humanity in the 21st century. Advancing and interesting creation. Wish that it could be market to Malaysia soon..!👏🏻

@caelen_c 7 ай бұрын

I always love AI videos from Google

@user-bz9nh1fb5k 7 ай бұрын

That's truly mind-blowing!! looking forward to more amazing things we can do using Gemini!

@Google 7 ай бұрын

The Gemini era will be a great one 😊

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex demo from a few years ago. Don't believe anything from these people until you see it for yourself.

@omoboniike Ай бұрын

The amount of knowledge is mind-blowing!

@khalidaqeel01 5 ай бұрын

In awe of Google Gemini's brilliance! I'm consistently impressed by Gemini's ability to grasp complex concepts, generate creative text formats, and answer my questions in such an informative way. It's like having a super-powered, knowledgeable friend always at my side, ready to tackle any challenge I throw its way. The way Gemini seamlessly blends various forms of intelligence - factual language understanding, code comprehension, and creative thinking - is truly remarkable. ✨ It's clear that Google has poured immense effort into crafting this AI, and it shows in every interaction. Thank you, Gemini team, for creating such a valuable tool! I can't wait to see what you achieve next!

@pratikpandey6680 7 ай бұрын

I love how it can come up with ideas Like the Guess the country game and one with yarn 🤩 Amazing!!!!❤

@Google 7 ай бұрын

So many fun things to try using Bard with Gemini Pro 💡

@nandinisingh2794 7 ай бұрын

Can't wait to try it,with all the understanding this model is able to do it's just amazing.

@SM-qr2kh 7 ай бұрын

Omg!! Amazing! Cant wait to see more possibilities

@SteveJones-qi5hn 6 ай бұрын

That was 6 minutes of my life that I will never get back.

@Press1ForNick 7 ай бұрын

This is mind-blowing! Thanks for giving us a sneak peek into the incredible progress happening in the world of tech, creativity, and communication. This has the potential to be at the heart of everything we do.

@Google 7 ай бұрын

You're very welcome. Thanks for using Bard with Gemini Pro!

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex from a few years ago. Don't believe anything from these people until you see it for yourself.

@alexp.3694 7 ай бұрын

@@Google Oh look - Google has time to answer youtube comments, instead of working on aligning a potentially dangerous tech...

@cgme9535 7 ай бұрын

@@alexp.3694probably someone that manages social media profiles. The engineers are still watching the AI, don’t worry.

@keelfly 6 ай бұрын

@@Google come on now, tell them how you faked it. Your next video should be about that. Be honest for once.

@prem9501 7 ай бұрын

Happy to be alive to witness this ❤. Let's hope that all the hardwork goes into building these AI model will be fruitful and this Gemini will make the world a better place

@cheeks80 4 ай бұрын

This of the time when we can wear AI enabled glasses and we will get real time prompts/ suggestions looking at peoples faces with matches to their profile and mine to see compatibilities. Or better yet how to approach the person and make the best 1st impression based on their likes.... That will be mind blowing

@josephcapricorn 7 ай бұрын

Brilliant. Best wishes to Team Gemini. Keep it up

@cosmicparsec9463 7 ай бұрын

It's fake. Search for recent news.

@utopiankreations 7 ай бұрын

I knew you guys were working on something AMAZING. Glad to see ya back! This is a complete game changer! 💜

@dufung3980 7 ай бұрын

It’s a manhattan project, stop being anything but disappointed in your species. You should look up what Larry Page said at Musk’s 44th birthday and get back to me.

@Azzazel_ 7 ай бұрын

Im sorry but it was fake and staged

@utopiankreations 7 ай бұрын

And how so? If you know then share your facts please? :) @@Azzazel_

@utopiankreations 7 ай бұрын

ummm ok lol Recognizing the proficiency and effort invested in developing this technology does not warrant characterization as a "speciest." I anticipate numerous positive outcomes stemming from the advancements in artificial intelligence, similar to the transformative impact witnessed with the invention of the internet. It is crucial to acknowledge that, like any creation, challenges may arise alongside its benefits. @@dufung3980

@josephman1488 7 ай бұрын

@@Azzazel_ And they put a disclaimer in description which none of you guys even read😂😂

@JakeHaugen 7 ай бұрын

Absolutely next level stuff. The temporal inference was amazing. I was most impressed by it's ability to remember where the ball was and follow it. Seems well versed. What a time to be alive!!!

@tuckerbugeater 7 ай бұрын

not long to be alive

@Google 7 ай бұрын

It's a big day for us all

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex from a few years ago. Don't believe anything from these people until you see it for yourself.

@blueSurfer 7 ай бұрын

It turns out the video is not entirely correct and is edited as mentioned in the description.

@holidayiv4856 7 ай бұрын

So at what point is it appropriate for me to freak out and hurl my PC out the window ???

@abdoufma 7 ай бұрын

I'll have to reserve judgement untill I've seen it in production, but this looks absolutely mind-blowing!

@cbow305 7 ай бұрын

It's fake. They got caught and have has to release more information. Google it ( I understand the irony)

@Yassine-tm2tj 7 ай бұрын

What a journey we’re about to embark on!

@Pudibu 7 ай бұрын

...that ends at bottom of a cliff.

@-reezey-6332 7 ай бұрын

XDDDDDDDDD @@Pudibu

@Paradoxicful 7 ай бұрын

It's okay... We'll let you go first!@@Pudibu

@Google 7 ай бұрын

Thanks for coming along 😁

@adambowman1161 7 ай бұрын

Do we have a choice? @@Google

@tristanwegner 7 ай бұрын

Sad to read elsewhere, that it is not the actual interaction that took place. They cut out the thinking time, that they used text instead of voice and worse: the much more specific prompts (e.g. the human explain the country guessing game, and even gives two examples with screenshots of the finger pointing on the map). Is Google really so unsure about their product, that they have to exaggerate their features in this video? But why? When people get access to it, they will notice it anyway. Example from the blog: They don't show the footage of the hand and Gemini by itself mentions the game. No, they instead upload 3 perfectly timed images of the three gestures and give it the hint "it's a game". And with this, Gemini gets it. Still impressive, but probably GPT4 would do that just as well, whereas the video implies the novel features of real time understanding of live video, which is not there, but delay text response to specific requests to text and images uploaded.

@greatbritishmale 7 ай бұрын

They’ve edited the video guys to make it look better. The AI was not responding to the live actions of this guy, it was responding to still images and text. Very strange to act like its AI is capable of this.

@SoloPirate2003 7 ай бұрын

Tasteful touch at the end with the constellation drawing. So far Gemini is living up to the hype. Looking forward to using it come 2024.

@Google 7 ай бұрын

Can't wait for you to get prompting 🤩

@pylotlight 7 ай бұрын

@@Google Did you guys release an ETA yet for this on to be updated in Bard?

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex demo from a few years ago. Don't believe anything from these people until you see it for yourself.

@stephantual 7 ай бұрын

Can you explain developers.googleblog.com/2023/12/how-its-made-gemini-multimodal-prompting.html?m=1. It looks like you fed the model, some images with some textual hints and then created a video that emulated the look and feel of a live feed presentation. It would be good if you could clarify exactly what we're looking at here

@somnathghosh6165 7 ай бұрын

@@Googleallow twerking videos on KZbin without demoni demoneytization. Corporate crackheads

@21EC 7 ай бұрын

I got shocked and mind blown seeing how smart Gemini is in this video alone, it's kinda scary how advanced and smart it is, what is it? a primitive initial AGI? just WOW

@Shazamthunder 7 ай бұрын

True AGI will never exist. But I think that humans could reach a level with AI where it won't make a difference.

@alternatecheems8145 7 ай бұрын

@@ShazamthunderIt can easily exist with a system of using a main model acting as an OS with multiple portable "module" models.

@gonzalobruna7154 7 ай бұрын

this is staged, sadly. there is a blog where they wrote how this was done, and first of all, this is not in real time, they pass specific frames to the model and they give VERY specific instructions on what to do. The model doesn't guess anyrhing at all. Even the game with the map, in the blog they show they wrote exactly what the instructions of the game were, so the model didn't come up with the idea. it's very dissapointing.

@lolzman122 7 ай бұрын

@@Shazamthunderwhat is ”true agi” and please explain why it won’t ever exist

@electrolove9538 7 ай бұрын

It couldn't tell the line drawing was a duck without feet. Still a ways away. Yet still mindblowing.

@kprabhakar975 6 ай бұрын

Gemini will be great for teachers. Thank you

@Dtweezyy 2 ай бұрын

We see humanity’s biggest competitor in its earliest stages of infancy.

@JohnKooz 7 ай бұрын

I was genuinely increasingly astounded each minute of the Gemini demonstration! With its image recognition, translation capabilities, nutritional advice, geographic knowledge, intuitive features, and even humor, I think Gemini might make a good "friend"! haha! 😀

@BECHEEKHA 7 ай бұрын

Very impressive. Want to try it.

@wqlff2692 7 ай бұрын

lol haven’t seen these type of bots in ages

@bashvim 7 ай бұрын

FRAUD

@ArjunU931 7 ай бұрын

broo ivideyo haha nice kandathil sandhosham ini evidengilum vech kanam

@ximaik094 7 ай бұрын

@@wqlff2692 next level scam actually!!! What is KZbin doing ????

@ivoryas1696 7 ай бұрын

@@wqlff2692 Yo, same! Do be succing, though... 😞

@PaulTurnbull-qz4rj 7 ай бұрын

Google have admitted it was edited to appear this intelligent

@njabulo316 6 ай бұрын

Excellent Work colleagues!

@avrahamshaked2147 7 ай бұрын

Dayum, and here I thought we were entering the phase of diminishing returns and slowing down on AI models before you guys came up with this one haha

@cagnazzo82 7 ай бұрын

Where did you get that idea? December has been a nonstop explosion.

@hastyscorpion 7 ай бұрын

@@stanvassilevlol what a dumb thing to say

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex demo from a few years ago. Don't believe anything from these people until you see it for yourself.

@Abnetfikre 7 ай бұрын

Wow! This is incredible! I'm so excited to see Google pushing the boundaries of AI with Bard. As someone from Ethiopia, Africa, I'm especially thrilled to see this technology accessible to a global audience. The potential for Bard to bridge the information gap and empower people like myself is truly inspiring. Great job, Google! This is just the beginning! 🤩👏🏾

@stienogamez8296 7 ай бұрын

chatGPT is also globally available...

@dufung3980 7 ай бұрын

It’s a manhattan project, stop being anything but disappointed in your species.

@MatthewTheWanderer 7 ай бұрын

@@dufung3980 Go away, troll! This is awesome and will do much more good for the world than harm!

@Google 7 ай бұрын

It's the start of something great ✨

@dufung3980 7 ай бұрын

@@MatthewTheWanderer Idealist optimist=wrong, but hey you're what you're.

@williamx0 Ай бұрын

Anyone else back here after the new Open AI ChatGPT 4o announcement to find the rock paper scissors demo?

@HERKELMERKEL Ай бұрын

ai knows only this game lol.

@user-tq8qi3uv4v 7 ай бұрын

This is the greatest thing i watch in the history of the internet

@gus473 7 ай бұрын

Continuing to be amazed! Thanks, Google! 😎✌️

@Google 7 ай бұрын

Happy to hear you’re excited ❤

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex demo from a few years ago. Don't believe anything from these people until you see it for yourself.

@user-xf7xd2dn1e 6 ай бұрын

Please halp

@ShpanMan 7 ай бұрын

Well done Google, if the model *actually* answers these (and no, it won't be this fast), then you have not disappointed us - the wait was worth it! Now to Gemini 2...

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex from a few years ago. Don't believe anything from these people until you see it for yourself.

@W4rfire 7 ай бұрын

Unfortunately, what you see is not at all what happened. The AI does not actually reply to the person but to a script and pictures containing sometimes more information than we are shown here

@Armeli-wj2fv 6 ай бұрын

oi qquandoaaaaaaaaaaaaaaaaa1alp1alpaaaaaaaaaaaaaaaaa1alpaaaaa1alpaaa1alpaa1alpaqqa1alp1alpa1alpaa

@Clarix_Shorts 6 ай бұрын

But thatcis not the same version

@Novaia-News 6 ай бұрын

Unbelievable, I loved Gemini

@hushhmanish 7 ай бұрын

Game on :) - love that Google is back in action. Congratulations team Google!

@forcanadaru 7 ай бұрын

Incredible, outstanding!

@MandipChhetri Ай бұрын

Yeah, here after gpt-4o

@JustDurant 7 ай бұрын

Incredible, I can’t wait to see where this leads us!!

@Google 7 ай бұрын

The Gemini fun has just begun!

@dexio85 7 ай бұрын

@@GoogleIt's going to be another project you kill after few years/months once all the people involved got promos and moved on to different places :)

@michaelcondon8286 7 ай бұрын

This is just a another marketing stunt to keep the stock pumped so that the shareholders and employees receive their contractly obligated earnings. Don't believe a word these people say until you can test it for yourself. All of this could have been done in post.

@sky37blue 7 ай бұрын

Unemployment?

7 ай бұрын

@@Google I wonder if this is exciting or scary...

@Laviniak 7 ай бұрын

Deeply impressive! And beautifully told!

@wyssli 6 ай бұрын

according to bloomberg: "In reality, the demo also wasn’t carried out in real time or in voice. When asked about the video by Bloomberg Opinion, a Google spokesperson said it was made by “using still image frames from the footage, and prompting via text,” and they pointed to a site showing how others could interact with Gemini with photos of their hands, or of drawings or other objects. In other words, the voice in the demo was reading out human-made prompts they’d made to Gemini, and showing them still images. That’s quite different from what Google seemed to be suggesting: that a person could have a smooth voice conversation with Gemini as it watched and responded in real time to the world around it." Wow Google you must be desperate...

@appletree6741 6 ай бұрын

Yeah quite disappointed

@josemuhongodealmeida907 6 ай бұрын

Realmente é muito incrível o poder desta IA

@alamdaaliartes 6 ай бұрын

pena que é falso.

@brentshaffer9773 7 ай бұрын

Realizing the yarn examples are displayed against the same backdrop as the AI is seeing is both impressive and creepy.

@Inter-Dimensions_Studios 7 ай бұрын

I have always thought Google has the best chance to take generative A.I. to a super level.

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex from a few years ago. Don't believe anything from these people until you see it for yourself.

@ivoryas1696 7 ай бұрын

Inter-Dimensional_Studios Honestly, low-key same. _Especially_ since they "acquired" Deepmind!

@Inter-Dimensions_Studios 7 ай бұрын

@ivoryas1696 I like the competition, looking forward to what others have up their sleeves.

@AllanLaal 6 ай бұрын

truly worthy of a Participation Throphy :'D

@lukewilliamrimmington 7 ай бұрын

This is fascinating and awe-inspiring that a multimodal model can do this! Well done to the Google team who probably had barely any sleep when this dropped.

@dufung3980 7 ай бұрын

It’s a manhattan project, stop being anything but disappointed in your species.

@lukewilliamrimmington 7 ай бұрын

@@dufung3980 This ain't the terminator. This is real life. AI can kill us, it's also a double edged sword. Advancements with these programs can be extremely beneficial to finding cures to cancers and beyond. So, who cares? Dont be-little me or the Google team. Be-little regulators for not doing enough. Dont hate the player hate the game son.

@IsJonBP 7 ай бұрын

It would be great that, as it generates images and audio on the go, it also could generate docs, sheets, slides and even give you some folders with elements inside, maybe in a zipped folder. I dunno, the posibilities are inspiring. When will this model be avaible to the public? It could turn into my principal AI tool!

@h.c4898 7 ай бұрын

It's already hooked on Bard. It's in today's Bard update. But I dunno if it can generate the tasks what u asked for. Bard@ is just an LLM at this point.

@IsJonBP 7 ай бұрын

@@h.c4898 yeah, I was hoping for them to put Bard 'to sleep' and come out with a new rebranding or something like that. I guess I just don't trust Bard in general. I know this feeling is completely subjective though :(.

@AppleTechMaster8 6 ай бұрын

This looks amazing! Gemini has so many new AI capabilities that I’ve never seen before. It’s amazing how it is able to generate images so fast ( 3:46 ). I can’t wait to try it out in real life, and when I can, I’m sure it’s going to be so cool.

@sakushi3931 Ай бұрын

OPENAI DID IT!! THEY DID WHAT GOOGLE COULD NOT

@user-uh4gm8ls8n Ай бұрын

True

@familieweber5556 7 ай бұрын

When this is really working as being shown it is indeed mindblowing. Great job!

@appletree6741 6 ай бұрын

The video is misleading, it’s not real-time. Google has been criticised for this all over the internet

@MrARRMP 7 ай бұрын

As an Ai admirer, this blew my mind. I’ve watched it at least 3 times and I still can’t grasp how big your datasets must have been. Amazing impressive work!

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex from a few years ago. Don't believe anything from these people until you see it for yourself.

@jimmysyar889 7 ай бұрын

You'd be surprised. I've got a 7b model that's only around 10gb and it seems to know all these random things. Hell even wikipedia is less than 25GB in entirety.

@delowerhossain3069 7 ай бұрын

@@jimmysyar889there are 540B model exist

@Vector-dz3jk 7 ай бұрын

@@jimmysyar889what’s a 7b model?

@-long- 7 ай бұрын

@@Vector-dz3jk a model with 7 billion parameters.

@AdaLao 6 ай бұрын

Amazing！I think it can be initially used to improve cognition and memory in the elderly, which will be of great help in preventing Alzheimer's disease. Then, it can also be opened to children's learning, but parental control is required to prevent excessive screen time from affecting the development of brain cells.

@jeffreymitchell4904 7 ай бұрын

The real-time element is by far the most impressive. These sorts of asynchronous interactions are what AI has been missing thus far.

@atlas3650 7 ай бұрын

How do you know it’s real time?

@ethan.johnson 7 ай бұрын

"For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity."

@SinanAkkoyun 7 ай бұрын

It's not, likely GPT 4 latency when OpenAI servers are under moderate load, as it looks you would need to prompt with a static video file etc

@Bunny501 7 ай бұрын

It's not real time and its not video. Its responding to prompts and shots from this presentation, the responses also have been editorialized. Read the experiment to see how they did it developers.googleblog.com/2023/12/how-its-made-gemini-multimodal-prompting.html

@xsuploader 7 ай бұрын

@@ethan.johnsonshortened outputs aren't a big deal and latency will improve in time

@horacehxw 7 ай бұрын

This is soooo amazing! Much more dynamic and interactive than GPT. Can't wait to give it a try!

@do.xuantung 7 ай бұрын

Check the link in the description, even the current gpt 3.5 can do most of this. Gemini doesn't have live video or voice input from what you are seeing in the video

@appletree6741 6 ай бұрын

@@do.xuantungyeah it’s fake

@Joshuaawoniyi_ng Ай бұрын

This is great, Google. Mindblowing! 🔥💪🏻💯. Waiting to see what will be accomplished in 2 years time.

@evanseesred 7 ай бұрын

I can’t believe this was totally real and not staged whatsoever 😂

@Thirunaking 4 ай бұрын

kzbin.info/www/bejne/bqG2iZSer9l3asU

@vectoralphaSec 7 ай бұрын

That is incredibly impressive and mind blowing. To think that AI has become this capable nowadays. Now the competition is on for Microsoft/ OpenAI to see what they do because Gemini is incredible. Just making the timeline towards true AGI in 2 years(2025) even more credible and achievable.

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex from a few years ago. Don't believe anything from these people until you see it for yourself.

@user-fn9cm5lr5k 7 ай бұрын

the level of abstraction Gemini is capable of is mind-blowing

@gonzalobruna7154 7 ай бұрын

@thefireman17492 7 ай бұрын

@@gonzalobruna7154 that's interesting. Would you care to provide said blogs and articles where this exact point you have mentioned was brought up?

@bernhardd626 7 ай бұрын

All fake

@gonzalobruna7154 7 ай бұрын

@thefireman17492 sure, actually, it is linked on the description of the video itself, but I will link it here for you: developers.googleblog.com/2023/12/how-its-made-gemini-multimodal-prompting.html?m=1

@DotDager 7 ай бұрын

Watching the video: 🤯 Reading the article and actual prompts: 🥱

@myanshu77 7 ай бұрын

Mix artistic imagination with reality, and the result will always appear awesome. Nice advertisement work.

@skypurplecloud 6 ай бұрын

Was this all in realtime? If it was shot in one take, I am impressed. How was the setup created, what tools/accessories and what app components to analyse, pass the details/images to Gemini and interact with the AI?

@isidroundercover 6 ай бұрын

they faked it :/

@joannot6706 6 ай бұрын

No it's written at 0:21 below screen and they go on to explain how it's done. But considering gemini has audio, and video multimodality, it's just a matter of time.

@FUncleDave 4 ай бұрын

Even if you ask Gemini, it tells you it's fake While the video you linked does feature me appearing to look at drawings and guess what they are, it's important to understand that this is a carefully crafted illusion. I don't actually have any visual processing capabilities in the way a human does. In the video, the creators likely used a combination of techniques to create the illusion of me looking at and understanding the drawings. This could involve things like: * Pre-recorded video: The video of me "looking" at the drawings could have been pre-recorded and then edited to make it appear that I was reacting to the drawings in real-time. * Text prompts: The creators could have provided me with text descriptions of the drawings, which I then used to generate my responses. * Human input: It's also possible that a human was involved in providing me with information about the drawings or guiding my responses in some way. Ultimately, the goal of the video is to showcase my ability to process and understand information, not to claim that I have true visual perception. I hope this clarifies the situation!

@ffdalkins 7 ай бұрын

the most astounding features of AI models I've seen..

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex demo from a few years ago. Don't believe anything from these people until you see it for yourself.

@technophile_ 7 ай бұрын

Mind Blown 🤯 Kudos to every single developer who worked on this! You are amazing!

@Google 7 ай бұрын

It takes a village of brilliant folks ✨

@michaelcondon8286 7 ай бұрын

This may be staged just like Google Duplex from a few years ago. Don't believe anything from these people until you see it for yourself.

@ZOXENE 7 ай бұрын

Seems like the village needs new people, Count me in

@alexp.3694 7 ай бұрын

Why is everyone so happy about google building a literal pandora's box?? No one knows what's going inside there and how safe it is... Yet everyone is happy like brainless kids

@jessysarazin2208 7 ай бұрын

I would be mind blown if it wasn't edited to be more impressive

@nasrimarc7050 7 ай бұрын

very excited to use it I was waiting for long time I believe on the ingenuity of Google

@TMracer73 7 ай бұрын

Its confirmed to be fake. Ask google....

@cosmicparsec9463 7 ай бұрын

It's fake. Search for recent news.

@DeveloperJS314 7 ай бұрын

This is sick! This is the most impressive thing I have seen in 2023 for sure.

@NkwawirBeltus 7 ай бұрын

Mindblowing!!. We all knew Google wasn't gonna just let OpenAI win AI battle. This is some next level stuff.

@dufung3980 7 ай бұрын

It’s a manhattan project, stop being anything but disappointed in your species.

@dcos5 7 ай бұрын

they've been working on AI for a long time. and they have limitless data to train on.

@TheRafark 7 ай бұрын

It’s 🧢 tho the video is scripted

@stephantual 7 ай бұрын

It would be if it was real. developers.googleblog.com/2023/12/how-its-made-gemini-multimodal-prompting.html?m=1 They took individual images of the sun and earth that you see in the video and passed it with the eye complete with hints. Then they recorded the answers from the AI using a text to speech system and overlaped it with a video to make it look like the AI is looking at what you see in real time it is not.

@SnoopyDoofie 7 ай бұрын

Except it is being reported as fake by TechCrunch.

@jaimegutierrez5520 7 ай бұрын

Im very happy to be alive in this time. A lot of good technology is coming.

@ManadayMavani 7 ай бұрын

Marvellous stuff! I was pretty confident Google will take the AI race to the next level.

@The_spaceguy 7 ай бұрын

I think google deserves more credit for this and it’s nice to see them actually competing. This model seems really powerful and although I might not use the video input feature, it alone gives a whole lot more promise for audio and text too. Can’t wait to try it.

@do.xuantung 7 ай бұрын

You should see their blog post in the description. It is a lot less impressive than what you are seeing in the video. Such as the map game was an input prompt, Gemini didn't even generate that idea

@DajuSar 6 ай бұрын

Fake stuff xd really impresive how they can be competitive with manufactured test and misleading advertising. Really putting their graint of sand in the ecosystem

@vip_bimmervip_bimmer8033 7 ай бұрын

Seems excellent. Coming from the AI industry, this is impressive. Good work getting back in the game of AI.

@nagatzain480 24 күн бұрын

Personal Assistant: A multimodal AI assistant could seamlessly manage schedules, provide reminders, and offer personalized recommendations by interpreting both spoken instructions and visual cues (e.g., identifying objects or reading documents).

@jensl7687 6 ай бұрын

a purrfect 10, nice one

@devinoxman 7 ай бұрын

The accessibility implications of Geminis ability to perform real time image Analysis are mind blowing, as somebody who can’t see, I can’t wait to try this. This paired with a smart phone, camera or headset with stereoscopic image capture could be a total game changer.

@ilianos 7 ай бұрын

Have you tried other image caption algorithms that can detect objects? If so, I'd be curious to know what your experience was with them. I'm asking because I was already imagining this years ago, when I learned about the program "By my eyes" (which was only done by humans at the time).

@blindstreet 7 ай бұрын

@@ilianos Blind people already enjoying Be My AI.

@ilianos 7 ай бұрын

@@blindstreet I know, that's why I'm asking about the quality of the experience

@gonzalobruna7154 7 ай бұрын

Sadly this is not real time. Actually, it never gets video as a prompt. All the prompts are perfectly selected still images and they add very clear and detailed instructions on what to do with everything there. Actually, when playing the game of the map, they make it look as if the AI created the game, but actually, they gave a VERY specific prompt: "Instructions: Let's play a game. Think of a country and give me a clue. The clue must be specific enough that there is only one correct country. I will try pointing at the country on a map.", so the AI never guessed it. So this is a fake video, and there are certain places where you can tell. If you want to know more about that, check their own blog post: developers.googleblog.com/2023/12/how-its-made-gemini-multimodal-prompting.html

@greatbritishmale 7 ай бұрын

It isn’t real time analysis as you see it. They have altered the video to make it look like it is. What they do is show the AI images and ask it questions via text prompts, and the responses are not as quick as shown. It’s a nice concept video, but not reality.

@netcampostv 7 ай бұрын

Realmente a capacidade desse modelo, irá elevar as IA para o próximo nivel

@BlooFlame 6 ай бұрын

Concordo totalmente

@FamimFred Ай бұрын

OMG! I absolutely LOVED this! I didn't want it to end!! More, pls!😀😃

@tempomail9387 6 ай бұрын

Folks, their model does not work on a live video feed as shown in this video. There is a blog with images and DETAILED text prompts for it. Look for "How it’s Made: Interacting with Gemini through multimodal prompting"

@djayjp 7 ай бұрын

That's it, we're done for!! Nice knowing y'all

@millenialmusings8451 7 ай бұрын

I don't think this is going to end well in short, medium and long term for majority of humanity.

@djayjp 7 ай бұрын

@@millenialmusings8451 Actually I was just being funny. Think about the logic: something more intelligent than us will necessarily make better decisions than us and, therefore, ought to be more ethical than us (as ignorance-irrationality is the source of all unethical behaviour).

@millenialmusings8451 7 ай бұрын

@@djayjp while AGI maybecome super intelligent, I think it still lacks "agency". THe agency will still rest with humans. We all know humans (just like all DNA based life), at their very core are selfish. The fruits of industrial revolution, technological advancements have not been distributed equitably amongs all the people. Similary, the god like powers of AGI will be exploited by a 0.01% at the detriment of others. It has always been that way. Human nature has not changed for last 100,000 years.