DeepSeek-V3 (Fully Tested) : RIP 3.5 Sonnet & O1! This Opensource Model Beats Claude 3.5 Sonnet!

  Рет қаралды 15,360

AICodeKing

AICodeKing

Күн бұрын

Пікірлер: 104
@wedding_photography
@wedding_photography 12 сағат бұрын
We know that when question 4 is answered correctly, the AGI has been achieved.
@fabiankliebhan
@fabiankliebhan 10 сағат бұрын
o1 pro gets it correct
@markantscott
@markantscott 9 сағат бұрын
Q4 is bigger than mere AGI. It is the ability to answer obscure English Pub Quiz Night questions.
@tescOne
@tescOne 8 сағат бұрын
@@fabiankliebhan sonnet too
@kafkaesqued
@kafkaesqued 7 сағат бұрын
What is so special about question 4?
@seanlbrennan
@seanlbrennan 7 сағат бұрын
Deepthink option gets Q4 right but took two turns. First turn it ran out of tokens testing words and used a 10 letter word. asked it to keep going and it gives Sententious. The next turn it came up with Transparent right away.
@theorderofz
@theorderofz 12 сағат бұрын
Thanks for always putting us on, mate
@EditUMedia
@EditUMedia 12 сағат бұрын
Thank you so much for these videos covering new models. Merry Christmas
@samuelsilveira9709
@samuelsilveira9709 12 сағат бұрын
Merry Christmas, codeking
@sinapxiagency
@sinapxiagency 12 сағат бұрын
King, i dont know how you get this reviews so fast even in holidays, thank you so much
@Rumble2024injungle
@Rumble2024injungle 9 минут бұрын
He is panda Claude not santa Claus
@HedleyPugh
@HedleyPugh 9 сағат бұрын
The "preview" of DeepSeek's new V3 model takes 2nd place on the aider polyglot leaderboard. 1: 62% o1 2: 48% DeepSeek V3 Preview 3: 45% Sonnet 4: 38% Gemini-exp-1206 5: 33% o1-mini
@ElvinHoney707
@ElvinHoney707 10 сағат бұрын
o1 passes question 4: "A suitable answer is "SENTENTIOUS." It is an English adjective (from Latin "sententiosus"), it has 11 letters, begins and ends with S, and its vowels (e, e, i, o, u) appear in strictly alphabetical (non‐decreasing) order."
@notshekhar4738
@notshekhar4738 12 сағат бұрын
Even with a slightly altered prompt like ' 'what mode are you using?' (with a single quote at the beginning), the model still responds with 'GPT-4'. This raises questions about its underlying architecture.
@gui1236100
@gui1236100 12 сағат бұрын
Maybe they trained on data generated by gpt-4
@BACA01
@BACA01 11 сағат бұрын
@@gui1236100 They stole it as always 😁
@wwkk4964
@wwkk4964 11 сағат бұрын
They tend to tend to say that, even Gemini would say it last year. everyone trained on Chatgpt.
@GRVTY3
@GRVTY3 10 сағат бұрын
i'm using it in cline with openrouter deepseek chat api, and it keeps saying it's claude and acting like claude. something really sus going on here
@boynet2
@boynet2 10 сағат бұрын
@@GRVTY3 maybe cline prompt has something like "you are Claude..." ?
@Kevencebazile
@Kevencebazile 12 сағат бұрын
Merry Christmas Brother Love your content
@jacobfloyd6929
@jacobfloyd6929 12 сағат бұрын
Brother you really have extremely valuable content. Have you ever thought about running a community/course? I’m sure there’s a lot of people looking to collaborate with like minded people, especially since AI is so tough to stay on top of.
@AICodeKing
@AICodeKing 12 сағат бұрын
I already have a membership on my channel where I post in-depth tutorials for niche topics.
@jacobfloyd6929
@jacobfloyd6929 12 сағат бұрын
@ thank you I’m gonna look into that. Are you opposed to creating a discord or Skool community? That way everyone can collaborate on new stuff they’re finding, we all know networking is everything but it’s tough to find a valuable community.
@theorderofz
@theorderofz 12 сағат бұрын
@@jacobfloyd6929true. That would work pretty well.
@jmg9509
@jmg9509 10 сағат бұрын
This guy in the vid sounds like ai lol
@SipChai
@SipChai 12 сағат бұрын
Panda has replaced Santa. Poor old man.
@chyldstudios
@chyldstudios 12 сағат бұрын
Love to see this
@santypk5
@santypk5 Сағат бұрын
Why Australia and not Mongolia ?
@notshekhar4738
@notshekhar4738 12 сағат бұрын
I tried prompting the model with 'what model are you using to response to this chat?' and it said 'GPT-4'. When I followed up with 'who developed you?', it answered 'OpenAI'. This makes me wonder if the system is actually utilizing OpenAI APIs.
@gui1236100
@gui1236100 12 сағат бұрын
Maybe just training data generated by gpt-4
@notshekhar4738
@notshekhar4738 11 сағат бұрын
@@gui1236100 maybe yess
@BACA01
@BACA01 11 сағат бұрын
When it was deepseek v2 it was saying that it's a gpt3.5 and now it says it's gpt4
@Nomadnotepad
@Nomadnotepad 5 сағат бұрын
Tell me you don’t understand how training data works without telling me you don’t know how training data works.
@ram49967
@ram49967 12 сағат бұрын
Super questions for the LLM! It's ok with me to give it a pass on Question 3, even though it used the first letter and not the second letter to make the Haiku.
@displayname7t4
@displayname7t4 11 сағат бұрын
Most useful channel in youtube right now
@d.d.z.
@d.d.z. 11 сағат бұрын
With Qwen and Deepseek China strikes back. So amazing to live in 2025.
@salimalsenani2614
@salimalsenani2614 11 сағат бұрын
Everyone wants to take sonnet down.. but no one could! It remains the king of coding.
@thanartchamnanyantarakij9950
@thanartchamnanyantarakij9950 9 сағат бұрын
Not at this time. You can check by yourself
@salimalsenani2614
@salimalsenani2614 9 сағат бұрын
@thanartchamnanyantarakij9950 I just tested Deepseek V3, Gemini 2.0 Flash, and Sonnet, asked them to create amazing landing page for coffee brand. Sonnet won by far in terms of design and following correct prompts. Second is so close Deepseek V3 and Gemini 2.0 Flash, but I preferred the Deepseek it's really amazing 🤩.
@Osys91
@Osys91 6 сағат бұрын
​@@thanartchamnanyantarakij9950 did it outperformed sonnet? I quickly tested some code and sonnet was still performing better
@TitoSadek
@TitoSadek 11 сағат бұрын
Merry Christmas , I love your content , thanks
@Reverse-sg5rn
@Reverse-sg5rn 12 сағат бұрын
Merry Christmas. Can you do code testing with cline and aider on it?
@flutterflowexpert
@flutterflowexpert 10 сағат бұрын
New questions! Finally! 🎉❤
@alexjensen990
@alexjensen990 6 сағат бұрын
Well, color me surprised. I look forward to using it. Especially the lite model.
@andrinSky
@andrinSky 12 сағат бұрын
Hello Is it possible to work with Deepseek perhaps with Cline or RooCline. If yes how can i do this. Because this would be very Great! I could be very good for Coding.
@sinapxiagency
@sinapxiagency 12 сағат бұрын
Use in cline the Open ai compatible api
@andrinSky
@andrinSky 11 сағат бұрын
@@sinapxiagency And how are the Settings under "Open AI compatible AI"? I Mean the "Base URL"? and The "Model ID"?
@finnpoitier
@finnpoitier 11 сағат бұрын
@@sinapxiagency Do you know, which provider? Openrouter?
@Bangs_Theory
@Bangs_Theory 11 сағат бұрын
Merry X-mas King!
@Wesley58481
@Wesley58481 8 сағат бұрын
Tks for sharing!! u so incredible!
@mrinalraj4801
@mrinalraj4801 10 сағат бұрын
Thanks for the video 😊
@SudeeptoDutta
@SudeeptoDutta 10 сағат бұрын
So, If I'm already paying for the 2.5 API using Continue extension, it should automatically start using v3 right? No need to configure any new API key right?
@AICodeKing
@AICodeKing Сағат бұрын
Yes, it should automatically switch
@JohnLewis-old
@JohnLewis-old 12 сағат бұрын
How fast are inference speeds? Did it do well in fine?
@jeffwads
@jeffwads 6 сағат бұрын
I asked QwQ 32b the 4th question and it refused on the grounds that it may be part of a competition test and it wouldn't be fair, etc. It can be stubborn at times but I hope this isn't a sign of things to come.
@fabiankliebhan
@fabiankliebhan 10 сағат бұрын
Will deepseek v3 be available for cursor?
@Teetanthegamer
@Teetanthegamer 11 сағат бұрын
Can you please make a tutorial on how to use it with cline locally through ollama or through paid api ?
@maddoxthorne2297
@maddoxthorne2297 7 сағат бұрын
Christmas gift galore.🎁❤️
@DouhaveaBugatti
@DouhaveaBugatti 2 сағат бұрын
Um can you also add questions for coding in other frameworks like svelte etc. This will tell how much useful this model can be for building real applications
@greenpulp.
@greenpulp. 12 сағат бұрын
Nice! How do we use it with Cline in VS Code?
@karamjittech
@karamjittech Сағат бұрын
Use openai compatible from Cline settings.
@dfasfa6657
@dfasfa6657 11 сағат бұрын
DeepSeek told me it was GPT 4 after asking "what AI model are you"
@skpassegna
@skpassegna 10 сағат бұрын
Maybe it is already fixed, since it is replying correctly.
@UsmanAli-ve6tq
@UsmanAli-ve6tq 12 сағат бұрын
Is there any model which was able to answer question 4 and achieved 100% score.
@voltax4435
@voltax4435 9 сағат бұрын
Finally a real sonnet alternative, and way cheaper!
@nolannosike
@nolannosike 7 сағат бұрын
is question 5 correct? you should get a decimal no? 20% of 48 is 9.6 so shouldnt the answer be 38, 38.4 to be exact? the way it did it also seems correct but we're getting two diff answers.
@AquaAstronaut23
@AquaAstronaut23 6 сағат бұрын
That’s 20% of the inflated number 48 not the original number 40. You need to divide by the percentage as a decimal (1.2) to work it back.
@uniq6318
@uniq6318 8 сағат бұрын
Without using deep thinking That's amazing
@TawnyE
@TawnyE 12 сағат бұрын
E Merry Christmas 🎄🎅
@collinsk8754
@collinsk8754 11 сағат бұрын
Great work. And it's finally up to date with NextJS 14! 🙌🙌
@aislanarislou
@aislanarislou 6 сағат бұрын
What about programming skills ?
@DemocracyDecoded
@DemocracyDecoded 11 сағат бұрын
I need to see a direct comparison to sonnet 3.5 new in all your videos buddy.
@Opeyemi.sanusi
@Opeyemi.sanusi 10 сағат бұрын
I am amazed
@ProkopHapala
@ProkopHapala 7 сағат бұрын
The biggest problem I always have with using DeepSeek for programming is the speed (14 token/s, its like 5times slower than Sonet and 10 slower than GPT), I hope they adress also that.
@VincentLannoo-Bourton
@VincentLannoo-Bourton 6 сағат бұрын
I know, there is twoo Rounds... But... first: After careful research, there is no known English adjective of Latin origin of 11 letters that has the first and last letter identical and whose vowels appear in alphabetical order. second was jusr "are you sure?" and: The adjective that fits all the specified criteria is "sententious." 11 letters: S-E-N-T-E-N-T-I-O-U-S. Adjective: It describes someone who is given to moralizing or using maxims. Latin origin: The word derives from the Latin "sententia," meaning opinion or meaning. First and last letters are the same: Both are 'S.' Vowels in alphabetical order: The vowels are E, E, I, O, U, which follow the sequence A, E, I, O, U (with A not present). Thus, "sententious" satisfies all the conditions.
@misterleo885
@misterleo885 11 сағат бұрын
QvQ 72B Test please
@paulyflynn
@paulyflynn 2 сағат бұрын
amazing
@fun8711
@fun8711 7 сағат бұрын
Question number 4 stand on ten toes
@varunaeeriyaulla
@varunaeeriyaulla 12 сағат бұрын
Bro, I just asked, "What is the AI model I’m chatting with?" (using the Deepseek API via OpenWebUI). The answer is "You're currently chatting with OpenAI's GPT-4". I asked the same question from the chat and the code model. Are they reselling the OpenAPI GPT4???? Crazy. Please run a test.
@UsmanAli-ve6tq
@UsmanAli-ve6tq 12 сағат бұрын
I got the same answer :)
@gui1236100
@gui1236100 12 сағат бұрын
Maybe they used training data generated by gpt-4
@varunaeeriyaulla
@varunaeeriyaulla 12 сағат бұрын
@@UsmanAli-ve6tq Yes, I asked the same question from the Deepseek chat interface, and it says "You're currently chatting with DeepSeek-V3". Very strange.
@varunaeeriyaulla
@varunaeeriyaulla 11 сағат бұрын
@@gui1236100 then why it's only one API but not on deepseek chat interface?
@dfasfa6657
@dfasfa6657 10 сағат бұрын
@@gui1236100 where are u from?
@pranjalsuthar9476
@pranjalsuthar9476 8 сағат бұрын
hey...You are making amazing videos. Please make video on organised files by AI
@JoraMacKornev
@JoraMacKornev 3 сағат бұрын
Rip 3.5 Sonnet and O3 😅
@limjuroy7078
@limjuroy7078 10 сағат бұрын
Interesting!!!
@perfectartiste6332
@perfectartiste6332 12 сағат бұрын
merry Christmas, first here
@miselgpt
@miselgpt 12 сағат бұрын
Why not Mongolia? 😉
@formixcode
@formixcode 9 сағат бұрын
yes I can sense their suddenly better in coding solving here is why
@aleksanderspiridonov7251
@aleksanderspiridonov7251 8 сағат бұрын
Finally🎉🎉🎉🎉🎉🎉🎉❤
@rashad6459
@rashad6459 8 сағат бұрын
I cant keep up😂😂😂
@sontieudev
@sontieudev 12 сағат бұрын
In v2.5 its slow, and impossible to use in my usecases.
@Luca-xr7bs
@Luca-xr7bs 7 сағат бұрын
Uhmm I dunno
@aculz
@aculz 12 сағат бұрын
wow, i have been waiting for this. i use deepseek as my main LLM since its the cheapest. great job to cover this model it seems we get our open-source model king this end of the year. Marry Christmas and Happy new year everyone 🎄🎄
I Scraped the Entire Steam Catalog, Here’s the Data
11:29
Newbie Indie Game Dev
Рет қаралды 598 М.
Quando eu quero Sushi (sem desperdiçar) 🍣
00:26
Los Wagners
Рет қаралды 15 МЛН
Tuna 🍣 ​⁠@patrickzeinali ​⁠@ChefRush
00:48
albert_cancook
Рет қаралды 148 МЛН
go is great i hate it
14:44
SST
Рет қаралды 21 М.
RooCline: This Cline Fork is FAST & BETTER and Beats CLINE?
10:24
Aider + Gemini 2 (Exp) versus Claude 3.5 Sonnet (AI Coding King!)
25:44
Marvijo Software
Рет қаралды 3,4 М.
Should software developers quit after 40?
6:48
Tom Gregory Tech
Рет қаралды 49 М.
Playing Music on the Oldest Running Computer in America!
27:06
Usagi Electric
Рет қаралды 143 М.
I Redesigned the ENTIRE YouTube UI from Scratch
19:10
Juxtopposed
Рет қаралды 728 М.
Devin review: is it a better AI coding agent than Cursor?
9:18
Steve (Builder.io)
Рет қаралды 103 М.
Quando eu quero Sushi (sem desperdiçar) 🍣
00:26
Los Wagners
Рет қаралды 15 МЛН