Google’s NEW Open-Source Model Is SHOCKINGLY BAD

  Рет қаралды 49,581

Matthew Berman

Matthew Berman

Күн бұрын

Пікірлер: 515
@ALFTHADRADDAD
@ALFTHADRADDAD 11 ай бұрын
Google SHOCKS and STUNS the Open source landscape
@matthew_berman
@matthew_berman 11 ай бұрын
I should have used this title
@TechRenamed
@TechRenamed 11 ай бұрын
Lol we all should have!!
@mickelodiansurname9578
@mickelodiansurname9578 11 ай бұрын
@@matthew_bermanI thought at one stage you were literally going to start slapping your forehead off the keyboard!
@andersonsystem2
@andersonsystem2 11 ай бұрын
Why does most Ai tech channels use that title 😂I just don’t pay attention to titles like that lmao 😂😊
@Kutsushita_yukino
@Kutsushita_yukino 11 ай бұрын
its a meme at this point
@Lukebussnick
@Lukebussnick 11 ай бұрын
My funniest experience with Gemini pro was, I asked it to make a humorous image of a cartoon cat pulling the toilet paper off the roll. It told me that ethically couldn’t because the cat could ingest the toilet paper and it could cause an intestinal blockage 😂
@laviniag8269
@laviniag8269 11 ай бұрын
histerical
@matikaevur6299
@matikaevur6299 11 ай бұрын
@@laviniag8269 but true . .
@MilkGlue-xg5vj
@MilkGlue-xg5vj 11 ай бұрын
Maybe a cat could see the image and do the same
@Lukebussnick
@Lukebussnick 11 ай бұрын
@@MilkGlue-xg5vj haha yeah that would be a real nuisance. But then again, that’s one smart cat. What other potential could that cat have?? 🧐
@MilkGlue-xg5vj
@MilkGlue-xg5vj 11 ай бұрын
@@Lukebussnick Maybe it could become an ai dev at Google
@bits_of_bryce
@bits_of_bryce 11 ай бұрын
Well, I'm never trusting benchmarks without personal testing again.
@richoffks
@richoffks 11 ай бұрын
sorry you had to learn this way
@wilburdemitel8468
@wilburdemitel8468 11 ай бұрын
welcome to real life. Can't wait for you to leave the fantasyland bubble all these tech aibros have built around you.
@Batmancontingencyplans
@Batmancontingencyplans 11 ай бұрын
Gemma 7b makes you realise how much compute Google is using just to output sorry I can't fulfill that request 🤣
@MilkGlue-xg5vj
@MilkGlue-xg5vj 11 ай бұрын
LMFAO
@markjones2349
@markjones2349 10 ай бұрын
So true. Uncensored models are just more fun.
@MilkGlue-xg5vj
@MilkGlue-xg5vj 10 ай бұрын
@@markjones2349 you're talking as if the point of uncensored llms is fun rofl lmfao xd you're just makin' it funnier 🤣
@bigglyguy8429
@bigglyguy8429 11 ай бұрын
You can't gimp the model with excessive censorship, and also have an intelligent model.
@aoolmay6853
@aoolmay6853 11 ай бұрын
These are not open models, these are woke models, appropriately liberal.
@eIicit
@eIicit 11 ай бұрын
To a point, I agree.
@madimakes
@madimakes 11 ай бұрын
The nature of the errors here seem irrelevant to being censored or not.
@bigglyguy8429
@bigglyguy8429 11 ай бұрын
@@madimakes No, the censorship sucks up so much of it's thinking there's little left to actually answer. You can ask the most banal question but it sits there thinking long and hard about if there's any way that could possibly be offensive to the woke? Considered the woke are offended by everything, that's a yes, so it has to work its way around that, then it needs to figure out if it's own reply is offensive (yes, everything is), so it has to find a way around that as well. Often it will fail and say "I'm afraid I can't do that... Dave." Other times it will try, but the answer so gimped and pathetic you'd have been better off asking your cat.
@mickmoon6887
@mickmoon6887 11 ай бұрын
Exactly Model network design is gimped from the creator developers itself when head of Google AI is literally have biased ideological, anti white and very censorship values all proven with their online records that's why those biases reflect onto the model
@zeal00001
@zeal00001 11 ай бұрын
In other words, there are now LLMs with mental challenges as well...
@AlexanderBukh
@AlexanderBukh 11 ай бұрын
That's inclusive, alright.
@StriderAngel496
@StriderAngel496 11 ай бұрын
and diversive@@AlexanderBukh
@chriscarr9852
@chriscarr9852 11 ай бұрын
This is entirely speculation on my part, but I am guessing Google’s AI effort is largely driven by their PR team. A proper engineering team would never release this kind of smoke and mirrors crap. Right?
@chriscarr9852
@chriscarr9852 11 ай бұрын
They have tarnished their brand. It will be interesting to see what happens in the next few years with regard to Google. (I do not have any financial interest in google).
@mistersunday_
@mistersunday_ 11 ай бұрын
Yeah, they are the wrong kind of hacks now
@alttabby3633
@alttabby3633 11 ай бұрын
Or engineering team knows this will be killed off regardless of quality or popularity so why bother.
@richoffks
@richoffks 11 ай бұрын
@@chriscarr9852 we're watching the end of Google smh
@michaelcalmeyerhentschel8304
@michaelcalmeyerhentschel8304 11 ай бұрын
No, Left. They are all one viewpoint at Google and have been so for decades. The PR folks represent the programmers and their programmer-managers and Sr. management.
@drgutman
@drgutman 11 ай бұрын
I'm pretty sure they lobotomized it in the alignment phase :)))
@hikaroto2791
@hikaroto2791 11 ай бұрын
To the point they took the lobotomy fragment and used it in place of the brain, and trashed the actual brain. Not only on models, but on personnel probably
@deflagg
@deflagg 11 ай бұрын
Gemini Advanced is bad, too, compared to gpt4. Gemini sometimes answers in a different language, too cautious, and gets things wrong a lot of the times.
@CruelCrusader90
@CruelCrusader90 11 ай бұрын
"too cautious" is an understatement.
@veqv
@veqv 11 ай бұрын
@@CruelCrusader90 Genuinely. If it's not a question about software development there's a wildly high chance that it'll start quizzing you on why you have the right to know things. I do hobby electronics and wanted to see how it would fare on helping make a charging circuit. It basically refused. Same is true for rectifiers. Too dangerous for me apparently lol. Ask it questions on infosec and it'll answer fine though. It's wild.
@richoffks
@richoffks 11 ай бұрын
@@veqv lmao it refused, all anyone has to do is release a competely uncensored model and they literally take over the industry from their house. I dont know why google is such a fail at every product launch.
@CruelCrusader90
@CruelCrusader90 11 ай бұрын
@@veqv yea i had a similar experience. i asked it to generate a top front and side view of a vehicle chassis to create a 3d model in blender. (for a project im working on) it said the same thing, its to dangerous to generate the image. i didnt expect it to make a good/consistent vehicle chassis across all the angles but i was curious to see how far it was from making it possible. and i dont even know how to scale its potential with that kind of a developer behind its programming. even a one would represent progression at its slowest form, but that would be generous.
@Ferreira019760
@Ferreira019760 11 ай бұрын
Bad doesn't begin to cut it. At this rate, Google will become irrelevant in most of it's services. It makes no difference how much money they have, their policy is wrong and the AI models show it. They are so scared of offending someone or being made liable that their AI actually dictates what happens in the interactions with the users. That doesn't just make it annoying and time wasteful, it means that it cannot learn.Even worse than not learning, it's becoming dumber by the day. I cannot believe i'm saying this, but i miss bard. Gemini doesn't cut it in away way, shape or form. It's probably good for philosophy exercises, but so far I don't see any decent use for it aside from that. Give it enough space to go off in wild tangents and you may get a potentially interesting conversation, but don't expect anything productive from it. I'm done with trying out Google's crap for some time. Maybe in a month or two I will allow myself the luxury of wasting time again to see how they are doing, but not for now. Their free trial is costing me money, that's how bad it is.
@mistersunday_
@mistersunday_ 11 ай бұрын
Until Google spends less time on woke and more time on work, I'm not touching any of their products with a 10-foot pole
@Alistone4512
@Alistone4512 11 ай бұрын
- by a person on KZbin :P
@StriderAngel496
@StriderAngel496 11 ай бұрын
truuuu but you know what he meant lol@@Alistone4512
@natecote1058
@natecote1058 11 ай бұрын
If google keeps messing around with their censored models and under performing open source models, they'll get left in the dust. Mistral could end up way ahead of them in the next few months. They should find that embarassing...
@trsd8640
@trsd8640 11 ай бұрын
This shows one thing: We need other kind of benchmarks. But great video Matthew, thanks!
@MM3Soapgoblin
@MM3Soapgoblin 11 ай бұрын
Deepmind has done some pretty amazing work in the machine learning space. My bet is that they created a fantastic model and that's what was benchmarked. Then the Google execs came along and "fixed" the model for "safety" and this is the result.
@R0cky0
@R0cky0 11 ай бұрын
Let's call it Matthew Benchmark
@R0cky0
@R0cky0 11 ай бұрын
@@MM3SoapgoblinDeepmind should spinoff from Google. It's a shame that they still run under the now Google giving their amazing works in the past
@AIRadar-mc4jx
@AIRadar-mc4jx 11 ай бұрын
Hey Mathew, it's not open-source model because they are not releasing the source code. It's open-weight or open model.
@PMX
@PMX 11 ай бұрын
But... they did? At least for inference, they uploaded both python and cpp implementations of the inference engine for Gemma to github. Which I suspect have bugs since I can't otherwise understand how they can release a model that performs this poorly..
@judedavis92
@judedavis92 11 ай бұрын
Yeah they did release code.
@sandeepghael763
@sandeepghael763 11 ай бұрын
@matthew Berman I think something is wrong with your test setup. I tested the `python 1 to 100` example with Gemma 7B via Ollama, 4bit quantized version (running on CPU) and the model did just fine. Check your prompt template or other setup config.
@hidroman1993
@hidroman1993 11 ай бұрын
He was already recording, so he didn't want to check the setup LOL
@mathematicalninja2756
@mathematicalninja2756 11 ай бұрын
On a bright side, we have a top end model to generate reject responses in the DPO
@user-qr4jf4tv2x
@user-qr4jf4tv2x 11 ай бұрын
can we not have acronyms 😭
@Alice_Fumo
@Alice_Fumo 11 ай бұрын
@@user-qr4jf4tv2x I believe DPO in this context stands for "Direct Preference Optimization" which is a recent alternative technique to RLHF, but with less steps and thus more efficient. I'm actually not 100% sure, but I believe the joke here is that if you try employing this model for DPO to "align" any other base-model, what you get is another model which only ever refuses to respond to anything.
@NOTNOTJON
@NOTNOTJON 11 ай бұрын
Plot twist, Google was so far behind the AI race that they had to ask Llama or GPT 4 to create a model from scratch and this is what they named Gemini / Gemma.
@tteokl
@tteokl 11 ай бұрын
google is so far behind these days, I love Google's design language tho, but their tech ? meh
@Nik.leonard
@Nik.leonard 11 ай бұрын
At the moment, there is a couple of issues with quantization and running the model in llama.cpp (LM Studio uses llama.cpp as backend), so when the issues are fixed, I'm going to re-test the model. That's because is weird that the 2b model gets better responses than the "7b" (really is more like 8.something) model.
@f4ture
@f4ture 11 ай бұрын
Google’s NEW Open-Source Model Is so BAD... It SHOCKED The ENTIRE Industry!
@jbo8540
@jbo8540 11 ай бұрын
Google set the entire OS community back a half hour with this troll release. well played google
@romantroman6270
@romantroman6270 11 ай бұрын
Don't worry, Llama 3 will set the Open Source community 31 minutes ahead lol
@antigravityinc
@antigravityinc 11 ай бұрын
It’s like asking an undercover alien to explain normal Earth things. No.
@aipower-ho1mt
@aipower-ho1mt 11 ай бұрын
😂
@snowhan7006
@snowhan7006 11 ай бұрын
This looks like a hastily completed homework assignment by a student to meet the deadline
@shujin6600
@shujin6600 11 ай бұрын
and that student was highly political and was easy offended to anything
@DeSinc
@DeSinc 11 ай бұрын
Looking at those misspellings and odd symbols all through the code examples, it's clear to see something is mis-tuned in the params for whatever ui you're using not being updated to support this new model. Apparently the interface I was using it with has corrected this as I was able to get coherent text with no misspelling but I did see people online saying they were having the same trouble as you, incoherent text and obvious mistakes everywhere. It's likely something wrong with the parameters that must be updated to values that the model works best with.
@protovici1476
@protovici1476 11 ай бұрын
I'm wondering if this is technically half open-sourced given some critical components aren't available from Google.
@Greenthum6
@Greenthum6 11 ай бұрын
I was absolutely paralyzed by the performance of this model.
@Wanderer2035
@Wanderer2035 11 ай бұрын
Me: I send Pikachu GO! Use STUN attack on Greenthum6 NOW! Pikachu: Pika Pika Pika!!! BBBZZZZZZZZZ ⚡️⚡️⚡️⚡️⚡️ Me: Greenthum6 seems to be in some form of paralysis. Quick Pikachu follow that up with a STUN attack on Greenthum6 NOW! Give him everything you got!!! Pikachu: PIKA…. PIKAAAAAAAAAAA……. CHUUUUUUUUUUUUUUUU!!!!!!! BBBBBBBBBBZZZZZZZZZZZZZZZZ ⚡️⚡️⚡️⚡️⚡️⚡️⚡️⚡️ Greenthum6 = ☠️ ☠️☠️ Me: Aaaahhh that was nice, I’m sure Greenthum6 will make a nice pokimon to my collection 🙂. **I throw my pokiball to Greenthum6 and it captures him as my new pokimon to my collection**
@BTFranklin
@BTFranklin 11 ай бұрын
Could you try lowering the temperature? The answers when you were running it locally look a lot like what I'd expect if the temp was set too high.
@hawa7264
@hawa7264 11 ай бұрын
The 2B-Version of Gemma is quite good for a 2b model actually. The 7b model is - a car crash.
@frobinator
@frobinator 11 ай бұрын
I found the same, the 2B model is much better than the 7B for my set of tasks.
@Random_person_07
@Random_person_07 11 ай бұрын
The thing about Gemini is it has the memory of a goldfish it can barely hold on to any context and you always have to tell it what its supposed to write
@PoorNeighbor
@PoorNeighbor 11 ай бұрын
That was actually really funny. The answers are so out of the blue Mannn
@himeshpunj6582
@himeshpunj6582 11 ай бұрын
Please do fine-tuning based on private data
@michaelcalmeyerhentschel8304
@michaelcalmeyerhentschel8304 11 ай бұрын
Gemma will die a quick death as a valued brand, as did Bud Lite, but will return as Emma (Gemma without the Gullibility of believing/disclosing all the propaganda of Google's programmers and management, including an extra gender twist, just for the heck of it, since this is their primary demographic target).
@gerritpas5553
@gerritpas5553 11 ай бұрын
I've found the trick with models like Gemma, when you add this system prompt it gives more accurate results. THE SYSTEM PROMPT: "Answer questions in the most correct way possible. Question your answers until you are sure it is absolutely correct. You gain 10 points by giving the most correct answers and lose 5 points if you get it wrong."
@h.hdr4563
@h.hdr4563 11 ай бұрын
At this point just use GPT 3.5 or Mixtral why bother with their idiotic model
@RoadTo19
@RoadTo19 11 ай бұрын
@@h.hdr4563 Techniques such as that can help improve responses from any LLM.
@mickelodiansurname9578
@mickelodiansurname9578 11 ай бұрын
have you seen the 26 principles of prompt engineering paper... ?? Its very interesting... works across LLM's too... although the better the LLM I think the less of an improvement there is, compared to the base model without a system message.
@onigurumaface
@onigurumaface 11 ай бұрын
Gemma wasn't trained with any system prompt role.
@MilkGlue-xg5vj
@MilkGlue-xg5vj 11 ай бұрын
Do you understand that it's a 7b model and not 180b one?​@@h.hdr4563
@puremintsoftware
@puremintsoftware 11 ай бұрын
Imagine if Ed Sheeran released that video of DJ Khaled hitting an acoustic guitar, and said "This is my latest Open Source song". Yep. That's this.
@TylerHall56
@TylerHall56 11 ай бұрын
The settings on Kaggle may help- This widget uses the following settings: Temperature: 0.4, Max output tokens: 128, Top-K: 5.
@guillaumepoggiaspalla5702
@guillaumepoggiaspalla5702 11 ай бұрын
Hi, it seems that Gemma doesn't like repetition penalty at all. In your settings you shoudl set it to 1 (off). In LM studio, Gemma is a lot better that way, otherwise it's practically braindead. And about the size of the model, it's an uncompressed GGUF. GGUF is a format but can contains all sorts of quantization. 32Gb is the size of the uncompressed 32bits model that's why it's big and slow. There are quantaizations now and even with importance matrix.
@pixels7223
@pixels7223 11 ай бұрын
I like that you tried it on Hugging Face, cause now I can say with certainty: "Google, why?"
@oriyonay8825
@oriyonay8825 11 ай бұрын
Each parameter is just a floating point number (assuming no quantization) which takes 4 bytes. So 7b parameters is roughly 7b * 4 bytes = 28gb, so 34gb is not that surprising :)
@michaelrichey8516
@michaelrichey8516 11 ай бұрын
Yeah - I was running this yesterday and ran into the same things - as well as the censorship, where it decided that my "I slit a sheet" tongue twister was about self-harm and refused to give an analysis.
@nadinejammet7683
@nadinejammet7683 11 ай бұрын
I think you didn't use the right prompt format. It is an error that a lot of people do with open-source LLMs.
@MattJonesYT
@MattJonesYT 11 ай бұрын
Have you noticed that chatgpt4 is very bad in the last few days? Like it can't remember more than about 5 messages in the conversation and it constantly says things like "I can't help you with that" on random topics that have nothing to do with politics or anything sensitive. It's like they've got the guardrails dialed to randomly clamp down to a millimeter and it can't do anything useful half the time. I have to restart the conversation to get it to continue.
@blisphul8084
@blisphul8084 11 ай бұрын
They switched to gpt4 turbo. The old gpt4 via API is better
@davealexander59
@davealexander59 11 ай бұрын
OpenAI: "So why do you want to leave Google and come to work with our dev team?" Dev: *shows them this video*
@liberty-matrix
@liberty-matrix 11 ай бұрын
"AI will probably most likely lead to the end of the world, but in the meantime, there will be great companies." ~Sam Altman, CEO of OpenAI
@chrisbranch8022
@chrisbranch8022 11 ай бұрын
Google is having it's Blockbuster Video moment - this is embarrassingly bad
@ajaypranav1390
@ajaypranav1390 11 ай бұрын
The size is because of the quantization, the same model with 8 bit much less in size
@MM3Soapgoblin
@MM3Soapgoblin 11 ай бұрын
A google exec spoke at an AI conference I went to recently. He was talking about models and how, if you train them on the entirety of the information available on the internet, they become very "conservative". He said confirmation bias is a huge problem. The proceeded to tell a story about how he tested two models, theirs and an un-named competitor, by asking it to say 5 things white people could do better. They both proceeded to name 5 things and he said stuff like "recognize your privledge, great. These are good things". Then he said he asked them to name 5 things black people could do better. And to his shock, they both named 5 things. The example he gave was "recognize the quality of life that western culture has given you". And he declared "How outrageous that it would say something like that. Talk about white supremacy confirmation bias." Then talked about how they "fixed" their models to only give "culturally appropriate" responses. Deepmind has done some amazing work in the machine learning space and I have a lot of trouble believing that this is what they created. I bet they created a fantastic model and that's what the benchmarks were done against. Then the executives "fixed" the model into the useless thing it is right now.
@notme222
@notme222 11 ай бұрын
The TrackingAI website by Maxim Lott measures the leaning of various LLMs and they're all pretty much what we'd call "politically left" in the US. Which ... I'm not trying to make a thing out of it. There are plenty of reasons for it that aren't conspiracy and Lott himself would be the first to say them. However, seeing that reddit post about "Native American women warriors on the grassy plains of Japan", I wonder if maybe it had been deliberately encouraged to promote multiculturalism in all answers regardless of context.
@HistoryIsAbsurd
@HistoryIsAbsurd 11 ай бұрын
See, when you save the word SHOCKING for when its actually SHOCKING, its WAY more impactful & doesnt sound like you are spitting in the face of your community. Great video! Their half open sourced LLM is hilariously bad
@crabbypaddy5549
@crabbypaddy5549 10 ай бұрын
Super bad....even the 7b model is crap. useless and forever keep on telling me is is not able or intended for the stuff i ask it. But when i ask it what tasks it is indeed for and what tasks it is designed to handle, it list the task that I ask it co do. ANNOYING, useless. Llama2:7b and 70b are much much infinity much better.
@RastaBIasta
@RastaBIasta 11 ай бұрын
Wait wasn't you guys not two long ago sucking Google off saying it's better than GPT 4?
@musikdoktor
@musikdoktor 11 ай бұрын
Massive layoffs at Google next week..
@33gbm
@33gbm 11 ай бұрын
The only Google AI branch I stll find credible is DeepMind. I hope they don't ruin it as well.
@davelundie2866
@davelundie2866 11 ай бұрын
FYI this model is available on Ollama (0.1.26) without the hoops to jump thru, One more thing they also have the quantized versions. I found the 7B (fp16) model bad as you say but for some reason was much happier with the 2B (q4) model.
@Batmancontingencyplans
@Batmancontingencyplans 11 ай бұрын
Google has gone too woke to train LLM's
@fabiankliebhan
@fabiankliebhan 11 ай бұрын
I think there were problems with the model files. The ollama version also had problems but they apparently fixed it now.
@spleck615
@spleck615 11 ай бұрын
Open or open weights, not open source. Can’t inspect the code, rebuild it from scratch, validate the security, or submit pull requests for improvements. You can fine tune it but that’s more like making a mod or wrapper for a binary app than modifying source.
@michaelslattery3050
@michaelslattery3050 11 ай бұрын
This video needs a laugh track and some quirky theme music between sections. I was LOLing and even slapped my knee once. Once again, another great video. This is my fav AI channel.
@drayg0n806
@drayg0n806 11 ай бұрын
0:04 Absolutely! This is the beauty of diversity in the mathematical world. While 4+4 equals 8, the operands being 4 doesn't mean their identity cannot also be 40. Y'all have to respect the diversity.
@VincentVonDudler
@VincentVonDudler 11 ай бұрын
The safeguards of not just Google but most of these corporate models are ridiculous and history will look back on them quite unfavorably as unnecessary garbage and a significant hindrance on people attempting to work creatively. 16:00 - JFC ...this model is just horrible. 20:25 - "...the worst model I've ever tested." Crazy - why would Google release this?!
@careyatou
@careyatou 11 ай бұрын
I was skeptical. I ran the same questions on huggingface and got way better answers. Something was off here.
@GrzegorzMąkosa-o8x
@GrzegorzMąkosa-o8x 11 ай бұрын
It is very likely that his setup is incorrect or there is a bug in the way he loads model
@yogiwp_
@yogiwp_ 11 ай бұрын
Instead of Artificial Intelligence we got Genuine Stupidity
@RomboDawg
@RomboDawg 11 ай бұрын
You need to reupload this video. You used a broken verison of the model... Gema is much better than what you experienced. It can even easily write snake in python despite being less than 13b and a non coding model
@MikeMm-n9n
@MikeMm-n9n 11 ай бұрын
actually I redid the tests using Mathew's exact questions and the results his experience with the model. Either LM studio is using wring chat template, or the settings are off or the gguf is broken . I have a gist with the code I used that I can share, but it seems that the comments with links gets deleted.
@Zale370
@Zale370 11 ай бұрын
like other people pointed out, the model needs to be fine tuned for better outputs
@darshank8748
@darshank8748 11 ай бұрын
He seems to expect a 7B model to compete with GPT4 out of the box
@Garbhj
@Garbhj 11 ай бұрын
​@@darshank8748No, but it should should at least compete with llama 2 7b, as was claimed by google. As we can see here, it does not.
@zerorusher
@zerorusher 11 ай бұрын
Google STUNS Gemma SHOCKING everyone
@erikjohnson9112
@erikjohnson9112 11 ай бұрын
Maybe it was a spelling error by Google: "State of the fart AI model". Yeah this model stinks. Yeah I am exhibiting a 14-year old intellect.
@AlexanderBukh
@AlexanderBukh 11 ай бұрын
State of brain fart it is.
@RhythmBoy
@RhythmBoy 11 ай бұрын
What I find hilarious about Google is that while using Gemini on the web, Google gives you the option to "double check" the responses with Google Search. So, why can't Gemini check itself against Google Search?? It's right there. I think Google is so scared of releasing AI into the wild they're not even trying, and in a way they're right.
@gingerdude1010
@gingerdude1010 11 ай бұрын
This does not match the performance seen on hugging chat at all, you should issue a correction
@b0b6O6
@b0b6O6 11 ай бұрын
the gguf model is large because it is not quantized. when quantized to 4bits the model should be eighth the size.
@gurselunsal6048
@gurselunsal6048 11 ай бұрын
Thanks
@rodvik
@rodvik 11 ай бұрын
Censored to death. Google needs to get off this thought policing train.
@icegiant1000
@icegiant1000 11 ай бұрын
Gemma... its says so in the name, its Gemini without the i part... intelligence.
@DoctorMandible
@DoctorMandible 11 ай бұрын
"A diverse group of warriors..." Ahh feudel Japan, that bastion of diversity. GWGB.
@Hae3ro
@Hae3ro 11 ай бұрын
Microsoft beat Google at AI
@DoctorMandible
@DoctorMandible 11 ай бұрын
Why does it have to understand the context of "dangerous"? Why does the model need to be censored? What children are running LLM's on their desktop computers?? What are we even talking about? Is nobody an adult?!
@GuyJames
@GuyJames 11 ай бұрын
maybe Google's plan to avert the AI apocalypse is to release models so bad that they can never develop AGI
@theit-unicorn1873
@theit-unicorn1873 11 ай бұрын
Ouch! Why would they release this? I mean feeling pressure or not, releasing garbage is just BAD!
@MikeMm-n9n
@MikeMm-n9n 11 ай бұрын
Hi Mathew. Thanks for testing . , I just posted a comment about a test I did using your questions and showing different results to your test when using not the gguf (I included a link to gist) . Was my comment deleted because it contains a link ? happy to resend you the link to the gist. P.S: actually even the 2b model gives decent answers to your questions
@MikeMm-n9n
@MikeMm-n9n 11 ай бұрын
I am actually disappointed that you did not address the multiple comments pointing out the the flaws in your. testing. I thought you would retest the model and set the records straight.
@heiroPhantom
@heiroPhantom 11 ай бұрын
Google had to innovate on the context size. It was the only way the model could hold all the censorship prompts in its memory while responding to queries. That's also why it's so slow. imho 😂
@kostaspramatias320
@kostaspramatias320 11 ай бұрын
Google's research is not focused so much on LLM's, they produce a lot A.I. research on a variety of sectors. That said, their LLM's are so far behind it is not even funny. The multimodal 10 mil context window of Gemini pro, does look pretty good though!
@sitedev
@sitedev 11 ай бұрын
Google just announced a follow up model with full transparency - they admit it’s rubbish and call it Bummer!
@danimal999
@danimal999 11 ай бұрын
I tried it as well on ollama and was completely underwhelmed. It had typos, it had punctuation issues. In my very first prompt which was simply, “hey”. Then when I said it looks like you have some typos, it responded by saying it was correcting *my* text, and then added several more typos and nonsense words to its “corrected text”. I don’t know what’s going on with it, but I wouldn’t trust this to do anything at all. How embarrassing for Google.
@nufh
@nufh 11 ай бұрын
Hard to believe for a company that have massive resource to produce this underwhelming model.
@CapnSnackbeard
@CapnSnackbeard 11 ай бұрын
Why "Open Source?" Free labor. Don't worry, as soon as they get what they want, they will take what was learned from Open Source, and put it in their private models.
@mojowebs
@mojowebs 11 ай бұрын
Having early 2000s experience with google while it tries to work things out, I can tell you the will LAG BEHND UNTIL THEY DON’T. And when they hit the market with their all caught up models, they’ll be in the drivers seat.
@JustSuds
@JustSuds 11 ай бұрын
I love how shocked you are in the opening clip
@orangehatmusic225
@orangehatmusic225 11 ай бұрын
It's almost as if these companies are all using the same AI technology that nobody invented but somehow all these corporations are all using the same open source technologies to build their models. This doesn't pass the smell test.
@orangehatmusic225
@orangehatmusic225 11 ай бұрын
Am I the only one in the world who see's this or are the rest of you too blinded by the tech to notice.
@alakani
@alakani 11 ай бұрын
​@@orangehatmusic225 Thousands of people have been inventing it for a while now, building on each others work, at universities and research labs and in basements for over 30 years, getting mocked by luddites or completely ignored the entire time. Evolutionary biology put in a few billion years of development for us, we pretty much just copied part of that. Unless you're talking about the part that we got from lizard aliens in exchange for permission to microwave your brain
@thecutepika
@thecutepika 11 ай бұрын
The attention models and MoE models are open source It comes down to how much data you have and how much resources and time you have to burn
@orangehatmusic225
@orangehatmusic225 11 ай бұрын
@@thecutepika So you agree that all these companies are piggybacking off the same technology and it just appeared out of nowhere. Nothing has no inventor!
@thecutepika
@thecutepika 11 ай бұрын
@@orangehatmusic225 not out of nowhere, the first idea about LLMs came from a research paper about Attention Neural Networks written by some researchers at Google if I remember correctly, but it came out to be the best algo for NLP we've ever built, and they used the resources available to make the best use out of it
@seventyfive7597
@seventyfive7597 11 ай бұрын
I disagree, when you are being beat by others in AI, the strategy should NEVER be retreat and regroup, it should be to release closed models for review in order to receive training data, and release limited open source in order to see how the community improves on it and draw from it to your closed models. And that's what Google's trying to do.
@Chris-se3nc
@Chris-se3nc 11 ай бұрын
This model behaves like someone that’s nervous on a whiteboarding interview
@planetchubby
@planetchubby 11 ай бұрын
"Janine is faster than Joe." This in itself should be enough to revisit all job interviews involved at Google.
@sedat4151
@sedat4151 10 ай бұрын
Google is really making a name for themselves in AI. They’re pretty good at this….
@BuPhoonBaba
@BuPhoonBaba 11 ай бұрын
The fact google charges and doesn't link to Google accounts and services caused me to delete my free account immediately. 2 free months of Gemini? No thanks, cancel immediately.
@Nik.leonard
@Nik.leonard 11 ай бұрын
I tested recently gemma:7b with ollama 0.1.27 and now the model doesn't respond with gibberish. The only different behavior I noticed compared with other llama based models is that It tends to output more markup. As I said before, I don't know who quantized the model used by ollama, but was not TheBloke, and llama.cpp had a lot of commits the past week for addressing issues with quantization and inference, so maybe the model should be retested.
@AINEET
@AINEET 11 ай бұрын
You should add some politically incorrect questions to your usual ones after this week's drama
@clumsymoe
@clumsymoe 11 ай бұрын
To declare openly that their state of the art newest model relies on the same architecture is a massive PR disaster, possibly one of the largest this year 😮‍💨
@WolfgangAzevedo
@WolfgangAzevedo 11 ай бұрын
I was so impressed with how Google could release such bad thing....
@mayorc
@mayorc 11 ай бұрын
Consider how bad it is, now imagine using a quantized version in 4 bit, how much worse can it go.
@jelliott3604
@jelliott3604 11 ай бұрын
Re: it thinking that "cocktail" might be a bit rude ... not a patch on when Scunthorpe United FC updated their message boards with a profanity blocker and started to wonder why nothing was getting posted anymore
@VforVictorYT
@VforVictorYT 11 ай бұрын
Can't understand how anyone can think that even releasing such a subpar model and associate your brand with it is a good thing. Waiting until your model is at least at the same level as the best model when you are google should be the minimum bar, Google is supposed to be the big fish.
@fo.c.horton
@fo.c.horton 11 ай бұрын
additionally, that formula to count to 100 is jibberish. for every number in the range of numbers 2, 98, and +: print number. Range can't accept 3 arguments and + is not a valid argument.
@BlayneOliver
@BlayneOliver 11 ай бұрын
Looks like we’re no longer in the age of Google. Crazy
@AhmetTemizTR
@AhmetTemizTR 11 ай бұрын
I guess this must be what they call AGI. These answers are far beyond human comprehension.
@JohnLewis-old
@JohnLewis-old 11 ай бұрын
I know a lot of people are going to be disappointed, but honestly I'm do glad you tested it.
Attention in transformers, step-by-step | DL6
26:10
3Blue1Brown
Рет қаралды 2,1 МЛН
To Brawl AND BEYOND!
00:51
Brawl Stars
Рет қаралды 17 МЛН
coco在求救? #小丑 #天使 #shorts
00:29
好人小丑
Рет қаралды 120 МЛН
So Cute 🥰 who is better?
00:15
dednahype
Рет қаралды 19 МЛН
Леон киллер и Оля Полякова 😹
00:42
Канал Смеха
Рет қаралды 4,7 МЛН
DeepSeek R1 Shocked The World - Reactions Explained
20:34
Matthew Berman
Рет қаралды 50 М.
DeepSeek R1 Fully Tested - Insane Performance
15:10
Matthew Berman
Рет қаралды 721 М.
NVIDIA CEO Jensen Huang's Vision for Your Future
1:03:03
Cleo Abram
Рет қаралды 262 М.
Small Language Models Explained: The Future of Business Transformation
32:24
Ragnar Pitla (Make it Happen)
Рет қаралды 16 М.
Run ANY Open-Source LLM Locally (No-Code LMStudio Tutorial)
14:11
Matthew Berman
Рет қаралды 99 М.
Microsoft CEO: "Agents Will Replace ALL Software”
14:48
Matthew Berman
Рет қаралды 59 М.
AGI Fallout: Shocking Predictions About Society's Future
27:43
Matthew Berman
Рет қаралды 96 М.
Gemini has a Diversity Problem
17:36
Yannic Kilcher
Рет қаралды 53 М.
To Brawl AND BEYOND!
00:51
Brawl Stars
Рет қаралды 17 МЛН