Is China's DeepSeek the HOLY GRAIL of AI?

Views: 95,378

Two Bit da Vinci

Days ago

Comments: 697

@TwoBitDaVinci 8 days ago

Thanks DeleteMe for sponsoring this video! Protect your online Info Today! joindeleteme.com/TwoBitDavinci

@headsethero5449 7 days ago

It is legitimate for a model to train on output from another model, as confirmed in US court. "On August 18, 2023, the US District Court for the District of Columbia released a landmark decision on the copyrightability of AI-generated works. The Court confirmed that human authorship is necessary for copyright to subsist in a work and that content generated by AI without any human involvement is not protected under US copyright law." AI-generated work is, for that reason, by definition legitimate to train on.

@headsethero5449 7 days ago

Ironically, Sora training on YT videos is not legitimate, because that work was created by humans and thus copyright is held by the author.

@carkawalakhatulistiwa 7 days ago

Necessity is the mother of innovation. Forcing China to get only the H800, not the H100, and in limited numbers too, just forces Chinese scientists to think about how to make more efficient algorithms. 😂😂 Then it is distributed for free without any censorship on GitHub 😂. So what happens to investors who spent billions of dollars 😂😂 if everyone can run R1 671B in their own home?

@Laiquelleion 7 days ago

I think you just misspelled AI in your video title Ricky. FYI

@sam_so_nice 7 days ago

China did what "Open"AI was supposed to do. I think people don't understand yet. China just gave the power of ChatGPT to every developer in the world… for FREE!

@SixOhFive 7 days ago

@@sam_so_nice but it's nowhere near the power of ChatGPT; it can't even read handwriting and convert it to text. Try it.

@Nothing-f8z 7 days ago

There are open source models whose performance is comparable to ChatGPT. This model is mostly shaking up the industry because of how cheaply it was made.

@一个说话大声的中国人 7 days ago

American companies, scientists, and engineers don't, can't, or won't cover their nipples and pussies and complain about the Chinese looking at them.

@skyisthelimitreadyornotfor2 7 days ago

China's AI companies didn't do this; a crypto startup with extra machines lying around did. You have to give credit where credit is due.

@greeg1596 7 days ago

Can Chinese developers also use your wife, your car, and your apartment for free?

@willemvanriet7160 7 days ago

Much like how tariffs on Japanese cars made them better and US cars worse, the chip sanctions on China made its developers better...

@amgguy4319 7 days ago

Capitalism made American cars worse; cheaper materials, less engineering, cheaper poor manufacturing = Pure GM Trash.

@filippxx 7 days ago

@@willemvanriet7160 it's not that their developers are smarter, but they had to figure out a workaround for the missing compute power. Then they made the model open source in a checkmate move against big tech.

@amarissimus29 7 days ago

It's distilled. Smash and grab, obscure actual source. FP8 sure, watch who buys nvidia stock.

@GDawg2K2 7 days ago

The chip sanctions are an open statement that the US has no honor, ethics, morals or historical background! The corruption called the Uniparty gov knows it can't compete so it just cheats! Banking on the decades of lies and propaganda to mute any moral resistance Americans may have! But based on last week's No. 1 downloaded app, RedNote, I'd say that assumption is failing as well!

@Sven-cd8sn 7 days ago

not if they stole from OpenAI, we will know more soon

@jasonabc8397 7 days ago

Thank you. We need more of this kind of non-politicized, technology-focused commentary. Objective, technology-driven analysis like this is valuable.

@jasonbirchoff2605 7 days ago

It would be nice if it were accurate and pointed out that all the things mentioned pre-existed DeepSeek by A LONGASS TIME. It's almost as if all the effort of the open source teams that built Hugging Face, Ollama, llama.cpp, vLLM, and all the great open source models didn't exist. Even though they fine-tuned an EXISTING OPEN SOURCE MODEL.

@terjeoseberg990 6 days ago

@@jasonbirchoff2605, Good point.

@sophieedel6324 7 days ago

I can run DeepSeek 1.5b on a Raspberry Pi just fine. Yes, it is a heavily distilled version, but I DO NOT need a $10,000 Nvidia GPU, and most people do not. It is the democratisation of AI. The moat US tech companies claimed they built, is not a moat with crocodiles, but a shallow pond with goldfish, their whole business case has been taken out.

@Daisy_912 7 days ago

I ran the 7B version on my AMD GPU laptop. It works even without Wi-Fi. I am so happy. 😅

@doords 7 days ago

@@Daisy_912 I am going to need a new computer soon, maybe i will get one that can handle this.

@Daisy_912 7 days ago

@@doords good idea. But I'd suggest you don't hurry and wait a little bit. This will play out over a long time. I tried it because I already had the resources. I think in the future there will be integrated AI-capable chips/GPUs in laptops, so we won't have to install it ourselves.

@doords 7 days ago

I understand, I can wait

@mtube620 7 days ago

Like all other Chinese goods, it is based on Western design, and they created something just as good but a lot cheaper.

@MichaelSharpBLACKDRUMMIKE 7 days ago

America is great at spending more than is needed and getting far less, USA military, healthcare, education, housing....

@peterg0 7 days ago

This is what we call corruption!

@ricksturdevant2901 7 days ago

@@peterg0 so sorry to correct you, it's called criminally corrupt American capitalism that fucks over every American citizen as well as citizens of every country in the world.

@MichaelSharpBLACKDRUMMIKE 7 days ago

@@peterg0 America makes corruption a business model for the billionaire class!

@nickchristakes5894 7 days ago

Hey, look on the bright side, ...at least Israel gets free housing, healthcare, defenses, and education. BWahhhahhahhahhaaa

@bluesky9093 6 days ago

As someone who has done business on every continent on this planet: there is a reason the US is the highest-priced market in the world. America thinks it has an open free market that promotes capitalism, but it's a free market in image only. It is a market protected by highly paid lobbyists who have their hands in the pockets of politicians.

@TickerSymbolYOU 5 days ago

Thanks for having me on!

@pfilippone 5 days ago

Great info and knowledge being shared! I will note your video was a bit choppy. Kind of reminded me of Max Headroom which is kind of apropos. 😎

@Felipe-n3j 7 days ago

USA to CHINA: sorry, we won't allow you to join our exclusive international space program... CHINA: ah OK, no problem, we will just build our own space station, more high-tech and at just a fraction of your cost.😊😊😊😊😊

@anthonyramirez8038 7 days ago

@@Felipe-n3j God bless China

@michaelloong964 7 days ago

NASA is waiting for China to help retrieve the two astronauts stuck on the International Space Station. China said: we cannot help because US law prohibits us from helping you. We can help if your government removes the law.

@hnguyen6832 6 days ago

@@michaelloong964 Thanks. No tofu spaceship.

@楊森君 5 days ago

@@Felipe-n3j The urgent priority is to bring that man and woman down from space quickly.

@stevenlai1199 5 days ago

@@hnguyen6832 do ma kongkak

@LikeItDeep 8 days ago

Silly Con valley oligarchy just got trumped by creativity

@Sven-cd8sn 7 days ago

call it theft creativity, wait for the dust to settle...

@fanghe-z1j 7 days ago

@@Sven-cd8sn The thief not only stole but was also audacious enough to publicly disclose the stolen items, allowing everyone, including the owner, to use them for free.

@jasonbirchoff2605 7 days ago

No, they are mostly wondering what took all you trolls long enough to recognize open source models like llama and mixtral existed BEFORE R1 and offered similar performance.

@alfredchiu2275 7 days ago

In project management there are two sayings: good is good enough. Better is the enemy of good. A billion dollars made U.S. engineers fat dumb and happy.

@Melodicminority 7 days ago

Oops! Nice sayings, makes sense.

@jamesnotsmith1465 7 days ago

Those sayings are 'truisms' for a project manager. A project manager has three things to manage. Cost, Schedule and Performance. Performance is defined by specifications. It takes time (think schedule) to develop something. Keeping people on the payroll for that time costs money. Hence, the old adage: Time is money. If your specification requires you to advance technology to get performance 'x' and it costs more to reach 2 times better than x, you are wasting money if you achieve 2 times x (because your client only asked for 'x'. Your customer won't be happy getting charged for developing 2x when he only wanted x.) Good is good enough means you have met the customer's specification...stop spending money and deliver the product. It can also be stated that a design engineer is not in the business of making good enough better. If a customer wants better, it's time to revise the specification and go back into development (spiral development). In the same way 'better is the enemy of good'...because it increases cost and schedule to get better for no known reason (meaning you are not spec'd for better).

@Jeez001 7 days ago

The reason for this was companies funding AI were the ones selling the shovels (Microsoft, Nvidia) so there was no reason to improve efficiency (or risk losing the funding).

@yanstev 7 days ago

@jamesnotsmith1465 You are thinking in terms of widget production, where the end-point properties, quality, and performance are well specified. RDT&E is a different animal, and the notion of good enough is constantly changing. Cutting-edge innovation is team and individual dependent, and breakthroughs are never guaranteed.

@jasonbirchoff2605 7 days ago

It is truly surreal watching MSM introduce normies to open source while they act like China invented open source...

@matten_zero 7 days ago

The paradox stated doesn't save Nvidia, esp if you can use any GPU to run these models. The previous narrative was that the ONLY GPUs that could run these processes were top line Nvidia GPUs, and that is being broken

@totobobomask 7 days ago

@@matten_zero agree. now there are real alternative GPUs which is good for all except Nvidia.

@marvinfok65 7 days ago

@@matten_zero Nvidia's marketing myth is broken.

@mal2ksc 7 days ago

That was never the case though, Intel is pretty aggressively targeting the AI market with its discrete GPUs. They just don't have the high end hardware to offer, but they do have hardware that's good enough to eat the low end of nVidia's line if they fumble the ball. Intel is putting 12 GB on its entry level card, for example.

@jasonbirchoff2605 7 days ago

What are you talking about, use any model? The make-believe improvements discussed in the paper were done specifically for NVIDIA GPUs, not AMD, Intel, or China's Ascend GPUs. As for running them, let's be honest: most people will be running them in the cloud, which will be a mixture of NVIDIA GPUs and cloud-provider-specific chips.

@matten_zero 7 days ago

@jasonbirchoff2605 sure but the argument that the only way to get o1 level performance was to train on the highest end GPU is basically dead. Also the idea that you need to run these models exclusively on NVidia GPUs is also dead. Regardless it doesn't justify their valuation and the market is realizing that.

@paper_gem 8 days ago

India is releasing their own AI called Dheep Sikh. Okay, I'm sorry.

@TwoBitDaVinci 8 days ago

lol

@waynewalker3493 8 days ago

@@paper_gem 🤓

@Chubbchubb2313 8 days ago

😂 I'm awake now ☕. Nice to see "goofy nerds" are still around.

@stayfree870 7 days ago

DeepSingh.

@jonathonpotts5666 7 days ago

am I samosa find that funny?

@igolfer 7 days ago

It is really not that hard to understand why Chinese AI like DeepSeek is a lot cheaper and more effective, at least in the Chinese-language domain. Chinese researchers and engineers focus on models and logic, backed by vast amounts of data collected by many big companies like Tencent, Alibaba, Xiaomi, etc., while their U.S. counterparts rely heavily on chip capacity and capabilities for computing. There is far more human intelligence involved in DeepSeek than in ChatGPT. Human intelligence is a lot cheaper, more efficient and more effective than hardware. The Western media pretend not to understand this simple fact because they don't know how to compete with China and they don't know how to deal with the consequences in the human intelligence domain. Cooperation with China is the only viable option for the U.S., so that the U.S. would know a lot more about what China is doing, and vice versa, for a better world.

@NiceTriGuy 7 days ago

The US ‘cooperated’ with China on viral research in Wuhan hoping to keep an eye on research that was illegal in the US. The CCP made fools of them and how did that work out for your better world. It is naive to expect that China will ever do what’s best for anyone other than China… if you think they are think again.

@jasonbirchoff2605 7 days ago

dude calm down... their model is not cheaper or more effective.

@drcubix 7 days ago

It's pretty good, way better than chatgpt's default version which normal people use

@oceanwave4502 5 days ago

I heard that the Huawei Ascend 910C, despite having fewer features than the NVIDIA alternative, has a lower cost of operation and uses less energy.

@jasonbirchoff2605 4 days ago

@@oceanwave4502 *blink* *blink* *blink*... Is this a joke? Of course if you have less capability you will have less energy usage and so a lower cost of operation... This is not a flex.

@bobwx1987 7 days ago

Great video. Technical, but understandable to an LLM tourist. This DeepSeek thing is reminiscent of the NASA million dollar pen vs. the soviet one dollar pencil solution in the great space race.

@user-mgtp 7 days ago

Wake up call for India: brain drain needs to stop, genuine research and genuine entrepreneurship needs to be promoted.

@TwoBitDaVinci 7 days ago

100% I'm Indian I visit once a year, and I'm ALWAYS disappointed at how far behind India is. All of Asia is seemingly leaving India behind. India's biggest export CAN'T be its engineers... but how do we address this?

@passby8070 7 days ago

@@TwoBitDaVinci yes, that's very disappointing, but it's a deep structural problem. The Indian government and the society as a whole just don't have the humility, foresight and will to recognize a problem and set up a long-term plan to tackle it. You can very easily see that in the performance of the Indian Olympic team. With a population of 1.4 billion people, the country cannot produce even a single gold medal at most of the Olympic Games, which run every 4 years. It really says a lot about the country and the leadership's mentality. China, on the other hand, has a very different mentality, and it shows in the way it performs at the Olympics and other international competitions.

@Daisy_912 7 days ago

As a PhD holder (in STEM) from India, I am often laughed at in India, even by many educated Indians. But not outside India. It's true that India is still behind because of the mentality.

@dol3980 7 days ago

The only export from India to the Globe is cheap engineers and truck/cab drivers and inn keepers. Most techs from India have phony or lackluster credentials unless they were educated in the West. India will never catch up (till 2050) to China with its A+ STEMs. India shud focus on fixing its sanitation system first.

@Trials_By_Errors 7 days ago

Why would any country spend Trillion dollars. If it can buy it much cheaper or free after 2 years. AI is not profitable.

@FloodedOrchids 7 days ago

Everybody going crazy about DeepSeek and Nvidia but I am just a chill guy who invests in Index ETFs.

@Richmind-ir5zi 7 days ago

Could you offer a solid resource for someone who is just starting out in the stock market that describes the various investing vehicles, such as "index ETFs"? I have 50k to invest, but I do not know anything yet.

@Marianela-r3v 7 days ago

Having an investment advisor is the best approach to the market especially for a newbie like you. I was going solo without much success until my husband introduced me to an advisor. I've achieved over 80% capital growth since Q3 last year, excluding dividends. So i will advise you get one as well.

@Skimama1 7 days ago

Could you recommend who you work with? I really could use some help at this moment please.

@Marianela-r3v 7 days ago

Lauren Christine Campbell is the CFA I work with and im just putting this out here because you asked. You can Just search the name. You’d find necessary details to work with to set up an appointment.

@Skimama1 7 days ago

Thanks for the suggestion! I really needed it. I looked her up on Google and explored her website; she has an impressive background in investments. I've sent her an email, and I hope to hear back from her soon

@Cloud98 8 days ago

I work in automotive software and we still use 8- and 16-bit fixed-point data types to save memory even today. I didn't realize 8-bit floating-point types existed, that's cool!

@TwoBitDaVinci 8 days ago

That's very true, a lot of messages on different buses are definitely very data optimized. Glad you got as big a kick out of some of these innovations as we did. Very cool stuff, and it'll definitely find its way into all future models.

7 days ago

Essentially it's the same game as back in the day when consoles were not so powerful and game developers had to be creative to build a great game on those less powerful systems. In 20 years we will look back at AI the way gamers look back at the late 80s to mid 90s and how creative developers had to be.
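
For readers curious about the number formats this thread mentions, here is a minimal sketch, offered as an illustration and not as anything shown in the video. It assumes the 8-bit float is the E4M3 layout from the OCP 8-bit floating point specification (1 sign bit, 4 exponent bits, 3 mantissa bits, exponent bias 7), and that the fixed-point value is a signed 8-bit integer with 4 fractional bits. The function names are hypothetical.

```python
def decode_e4m3(byte: int) -> float:
    """Decode one FP8 E4M3 byte into a Python float (assumed OCP layout)."""
    sign = -1.0 if byte & 0x80 else 1.0
    exponent = (byte >> 3) & 0x0F
    mantissa = byte & 0x07
    if exponent == 0x0F and mantissa == 0x07:
        return float("nan")  # E4M3 has no infinities, only this NaN pattern
    if exponent == 0:
        # Subnormal: no implicit leading 1, fixed exponent of -6
        return sign * (mantissa / 8.0) * 2.0 ** -6
    # Normal: implicit leading 1, exponent bias of 7
    return sign * (1.0 + mantissa / 8.0) * 2.0 ** (exponent - 7)


def to_fixed8(value: float, frac_bits: int = 4) -> int:
    """Quantize to signed 8-bit fixed point, saturating at the int8 range."""
    scaled = round(value * (1 << frac_bits))
    return max(-128, min(127, scaled))


def from_fixed8(raw: int, frac_bits: int = 4) -> float:
    """Convert a signed 8-bit fixed-point value back to a float."""
    return raw / (1 << frac_bits)


if __name__ == "__main__":
    print(decode_e4m3(0x7E))             # largest finite E4M3 value
    print(decode_e4m3(0x01))             # smallest positive subnormal
    print(from_fixed8(to_fixed8(1.5)))   # 1.5 is exactly representable
    print(from_fixed8(to_fixed8(100.0))) # saturates at the top of the range
```

The trade-off the sketch tries to show: fixed point spends its 8 bits on evenly spaced steps over a narrow range, while the float format spends some bits on an exponent so that the same byte covers a much wider range with coarser steps at the top.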

@obchiang 3 days ago

Thanks!

@mijmijrm 7 days ago

DeepSeek's open source => AI will become like the transition from Mainframe Computers to Personal Computers.

@jasonbirchoff2605 7 days ago

So I guess Llama and Mistral and the plethora of open source LLMs on Hugging Face just appeared out of nowhere when DeepSeek got announced on SM...

@doripenem 4 days ago

@@jasonbirchoff2605 the difference is that DeepSeek is significantly better than llama and mistral, comparable to ChatGPT. In fact it even outperformed ChatGPT in some benchmarks!

@jasonbirchoff2605 4 days ago

@@doripenem A handful of points' difference on benchmarks is not significantly better when comparing DeepSeek V3 to the other premier open source models. Also, as someone who has used DeepSeek R1 full and the distills: they are not better than ChatGPT. They are in the same performance category but not better. R1 full is not better than o1. Comparing R1 to other non-reasoning OS models is stupid. But if you check for situations where they explicitly implement CoT in the benchmark, you will see a negligible difference.

@doripenem 4 days ago

@@jasonbirchoff2605 ah, so now benchmarks are meaningless? Cool, cool. Let's ignore the benchmark scores and bow to the unparalleled expertise of jasonjerkoff, the ultimate authority in assessing LLM quality👌🏻👌🏻👌🏻

@jasonbirchoff2605 4 days ago

@doripenem you really need to work on your reading comprehension... I never said they are meaningless. I pointed out that a couple points difference in benchmark stats is inconsequential.

@wardmcbride8587 7 days ago

He should have heard about the story of a person from Vancouver, Canada. He was manufacturing smart phones that operated on its own network and programming hardware. He was selling them to criminals for $5000. Everything about his system, including his customers, was illegal and got caught by the authorities. The FBI heard about him and the phones and asked the RCMP if they could borrow him. They had him make these phones and created a false front to sell them to some of their prime targets. The phones were made so they could listen in whenever they wanted to. This operation was in use for 2 years before someone figured out the phones were bugged.

@I_am_not_a_bot-s6i 4 days ago

I can't believe some chinese sweatshop workers created something more powerful than anyone else its pretty amazing considering they cant even get past the great chinese firewall to the internet outside world.

@VLADDDD-TTHE-SANCTIONS-IMPALER 7 days ago

The CS grads who graduate in the US hardly know anything about floating point arithmetic or base-2 log arithmetic. I used to treat bits like gold in my embedded dev days. The grads walk around like rich daddy's kids who only learnt CS using infinite compute, storage and network. 99% of them can't tell me what a loader, compiler or micro instruction set is. They surely know how to ask for a 200k starting salary. It's over for the West. 😢

@TwoBitDaVinci 7 days ago

You're an embedded dev, so you are well placed to understand just how easy we've made coding. When I wrote my first app for the iPhone, it was SO HARD, and now it's become nearly no-code. We definitely need a rethink! I appreciate your comment, cheers!

@VLADDDD-TTHE-SANCTIONS-IMPALER 7 days ago

@ so easy to code! They won’t have a job soon! This generation is the weakest techie generation in 100 years It’s truly over for USA !! Truly!

@jasonbirchoff2605 7 days ago

What are you talking about, dude? The quantization you're referencing was started in American labs. It is literally how the open source models, which have been competing well against OAI models, have been used by users in the open source community running vLLM and Ollama. Hell, there is even a distributed implementation of llama.cpp so you can split the model across multiple nodes. That way you can run bigger models on a cluster of machines like Raspberry Pis... I swear the ignorance on display is staggering.

@VLADDDD-TTHE-SANCTIONS-IMPALER 7 days ago

@ bigger is not better. Do you know how to optimize a GPUs processor interleaving? Meaning reduce process cycle wait time or interrupts? I bet 100% you don’t. You are a modern day techie who only knows python, json and a few readily available tools. Can’t dig inside or open the hood It’s over for west! Weakest techie generation

@jasonbirchoff2605 7 days ago

@VLADDDD-TTHE-SANCTIONS-IMPALER So I point out that those optimizations have been actively used in open source from US labs for years and your comeback is... I am a normie techie... Tell me you're a China propagandist without telling me you're a China propagandist.

@dystasia 6 days ago

Are you kidding me. This conversation was incredibly informative and interesting. Please more like this where sometimes you go deep into technical stuff in a fun way.

@TwoBitDaVinci 6 days ago

you got it! I'm planning to do that, cheers!

@jean-marclambert2936 7 days ago

It is so funny. It is always the same. The next big AI thing will come from a garage, from a dorm, from a lab... That is where Apple, Google, Meta, Microsoft were born, and they forgot. They have forgotten that getting bigger doesn't make you smarter, and what you need is the best idea, the best concept, with the best team... not money, but ingenuity...

@JD-yz4kr 7 days ago

The fallacy here and a reflection of American conceit is the delusion that Nvidia is the only GPU in the world. Well, news flash, it's not. China has dozens of GPU manufacturers, most notable is Huawei Ascend. Most of these GPU are of course inferior to Nvidia's H100, but are good enough for lower requirements. Also, memory is basically unlimited in China. Another conceit is that DeepSeek released R1 just to spite the US, as though everything has to revolve around the US. DeepSeek released it because that was the day all preparations were completed. They need to release it as fast as possible because they want to get ahead of ByteDance and Moonshot , who will be releasing their own models. Lastly, the answer is that Tonya Harding still lost even after kneecapping Nancy Kerrigan.

@passby8070 7 days ago

Yep, there's going to be another shock moment not far in the future. Tech controls will just make China much stronger. It's like resistance training: a normal person can lift 50 kg without training. The same person can lift 200 kg after a couple of years of consistent hard work.

@fannyalbi9040 6 days ago

You told the whole truth. You must be Chinese. DS has no interest in disrupting anything, whatever the Western media or fearmongers think. DS is just like Linux: share it, and like-minded people might make it better. The Western failure is never being short of religious self-righteousness.

@strictnonconformist7369 5 days ago

I’ve done enough testing on the smaller DeepSeek models to see that they still need to spend a bit more time fixing issues. It’s impressive for its capabilities and clear reasoning chain of thought as long as it doesn’t forget its past context and act as though it has Alzheimer’s, and becomes a bit disobedient. Last night I tried to use it to create a Science Fiction story and specified a standard 4 act form, and it literally lost the plot and forgot where it was in the process. Maybe I just need to prompt it a bit more step by step and not allow it to think about the entire story at once, knowing the end before it has completely filled in the beginning, as that may have caused it to seem to end things too quickly to get a reward. The shocking thing is that memory bandwidth is the biggest practical speed limit to running a useful model, followed by physical memory size: my Surface Laptop 7 has 64 GB RAM and I tested a 70B model and if I were patient enough, it’d eventually process it all, but thrash the SSD and memory, while leaving the CPU largely idle. And that’s the thing, LM Studio doesn’t even touch the NPU of the machine, just uses the main processor, and it’s still memory speed limited. It’s not CPU-limited. My machine is still quite responsive even with a 32B model and a lot of other things running. And the Surface Laptop 7 doesn’t come close to most currently sold Apple Silicon processors for memory bandwidth. When the AI processing is memory I/O bound, the GPUs are moot if you have enough CPU cores.
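
The memory-bandwidth point can be made concrete with some back-of-the-envelope arithmetic. This is a rough sketch under stated assumptions: a dense model whose weights are all read once per generated token, no caching effects, and bandwidth and quantization figures that are placeholders and not measurements from any particular laptop. The function names are hypothetical.

```python
def model_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8.0


def max_tokens_per_second(
    params_billion: float, bits_per_weight: float, bandwidth_gb_s: float
) -> float:
    """Upper bound on decode speed when every weight is read once per token.

    Assumes a dense model; mixture-of-experts models read only the active
    experts per token, so this bound would be too pessimistic for them.
    """
    return bandwidth_gb_s / model_size_gb(params_billion, bits_per_weight)


if __name__ == "__main__":
    # Placeholder figures, chosen only to show the shape of the trade-off.
    for params in (7, 32, 70):
        size = model_size_gb(params, bits_per_weight=4)
        rate = max_tokens_per_second(params, 4, bandwidth_gb_s=100)
        print(f"{params}B at 4-bit: ~{size:.1f} GB, at most ~{rate:.1f} tokens/s")
```

Under these assumptions a 70B model at 4 bits per weight needs about 35 GB just for weights, so a machine moving 100 GB/s could not exceed roughly 3 tokens per second no matter how many cores sit idle, which is consistent with the "memory-limited, not CPU-limited" observation above.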

@r3bennett 6 days ago

I asked Deepseek to summarize the news today. It couldn’t do it but Copilot did it very well.

@hopeseekr 6 days ago

Because it doesn't have access to APIs... it's stuck at whenever its foundation model was trained.

@siewkonsum7291 7 days ago

The American big-tech oligarchs try to capitalise on their very high-cost AI by selling it dearly to subscribers who need it, say a subscription at USD 200 per month subject to yearly renewal, to make billions or trillions of dollars each year. However, some great low-cost Chinese AI developers, like DeepSeek, are socializing AI by making it open and free for anyone worldwide to download and use. Long live the Chinese, China & the World!! 👏👏👏 ❤ 🇨🇳 💪💪💪

@chrono9428 6 days ago

Why did OpenAI not make theirs open source so that the entire community can benefit and all consumers can run their own ChatGPTs according to their own customisations? They can still charge big bucks to filthy rich people by adding some bells and whistles, like colours and shapes. I agree with Alex: I am not willing to pay $200 to OpenAI for AI, but $0.50 I can consider.

@sportsonwheelss 7 days ago

It took 4 KB to go to the moon, yet we have lost the tech to go back to the moon.

@robertfansler7800 7 days ago

@@sportsonwheelss The aliens on the moon said, don’t come back!👽

@sportsonwheelss 7 days ago

@@robertfansler7800 lol

@wweishi 6 days ago

Simply because you didn't go to the moon.

@robertcerins 4 days ago

Lol moon 🌙 landing.

@TheDaspiffy 8 days ago

If a manufacturing defect causes a fatal accident no one goes to jail because otherwise we wouldn't have cars. There are recalls and fines for not complying with recalls. Jail time in corporate America is generally reserved for malicious behavior rather than unintentional harm. There is no way that Google or Meta would produce publicly available AI if there was a chance their CEOs would serve time.

@sgrdpdrsn 5 days ago

Does DeepSeek do the same job as ChatGPT with much less electricity?

@PoolBoyRoy 7 days ago

At least Nvidia won't be booked out for two years, maybe just 18 months now?

@worldadventureman 5 days ago

I guarantee that when AI starts dominating conversations on the internet, there will be a massive upswing in the intelligence of those conversations.

@markpashia7067 5 days ago

Cannot come too soon for most of KZbin comments. Do I have to suspect I am conversing with AI if the comments are too intelligent?

@worldadventureman 5 days ago

@@markpashia7067 It's highly likely, but then we can only hope they would even bother to communicate with us. 😂😂

@phillipliu2759 7 days ago

❤All US Tech are all national security concerns 😮ohhohh, politicians are in DS❤

@larrymelia2867 7 days ago

Your best interview!

@asdfasdffdcd 7 days ago

The compression is very necessary when computation resources are limited. Thanks to the NVDA ban. And it also results in lower carbon emissions.

@leafykille 7 days ago

It's exactly the same as games: look back at something like the original Rogue or Elite or cartridge games and the mind boggles at just how small they are for what they can do. Modern coding has become lazy because the coder's time is more expensive than the compute resources. The Chinese actually designed something to be efficient with the resource that is the limiting factor, compute. If you have a limiting factor, the easy answer is to increase the supply; the efficient answer is to optimise the process to minimise the use of the limiting factor.

@aliyap4580 7 days ago

Thank you china, I love deepseek. 👍

@luzhang3429 5 days ago

Love this video! Love your approach and attitude towards uncertainty and challenges! Great job!

@AndrewKuntzman 7 days ago

Great info I don’t really know what to think about deep seek yet but pretty wild if all the numbers are accurate which based on his info seems at least mostly accurate 🤯

@petersierck4154 7 days ago

Great in depth presentation by Alex. !!!!!! Just one comment i don’t think AI is the final frontier. We are, in the development of our spiritual capabilities to access the most subtle energies of the cosmic energies.

@TheSateef 7 days ago

no way is Ricky old enough to remember punch card days. I am, my university in the UK had a punch card reader, and i'm probably 20 years older than Ricky

@AlexanderTsepkov 5 days ago

45:00 "if a model kills someone, the maker of the model is responsible": by that logic, should the entire family of the murderer go to prison with him for the crime he committed? I don't think you thought this one through; that sounds like Stalinist Russia thinking to me. Also, you do realize that you're destroying the incentive to innovate through this asinine policy? If there is even a 1% chance that my model might accidentally kill someone, I'm sure as hell not going to release it to the public. How competitive do you think the US will be against China with laws like this?

@cryolasv2 7 days ago

"Tony Stark did it with scraps in a cave"

@jasonbirchoff2605 7 days ago

Please don't insult what Stark did by comparing it to what DeepSeek did. They are not the same, not even a little bit.

@cryolasv2 7 days ago

@jasonbirchoff2605 there's no insult given, just perfect analogy in my opinion.

@jasonbirchoff2605 7 күн бұрын

@cryolasv2 What Stark did in the cave was innovate from scratch. What DeepSeek did was cobble together techniques and data from others. MoE was first done outside OAI by Mistral labs in France... So yes, it is an insulting comparison. You can only say what you said if this is your first experience with open source models.

@Jaw0lf 7 күн бұрын

The great takeaway for me is efficiency. So the more power you have, the more it will be capable of doing. This was an eye-opening chat. Thank you.

@dennys726 4 күн бұрын

Thank you for this! It is my first introduction to AI, and you make it most understandable.

@markmarco6277 7 күн бұрын

KZbin made it so I can't delete my past comments, nor see responses from you guys. This is my last comment on KZbin...great video TBDV

@jonathanbrown9002 7 күн бұрын

Alex and Ricky are two of my favorite KZbin creators. Super interesting conversation!

@gwyllymsuter4551 7 күн бұрын

It won't be long before the DeepSeek AI engine is a native part of Linux builds.

@jinjihu3767 7 күн бұрын

Alibaba just released an AI that is much more powerful than DeepSeek.

@adventureswithlils4331 7 күн бұрын

@jinjihu3767 Nope. It compares to V3, not R1.

@suesyphers3396 5 күн бұрын

I love the comment about honor students. The Chinese are incredibly brilliant and they train their students to be brilliant. We, particularly in the red states, tend to be more focused on what we don't want students to learn instead of giving them the freedom to explore and REALLY learn.

@markpashia7067 5 күн бұрын

Or derail them into sports. Friday Night Lights anyone? When a society values sports more than intellect it is in trouble. And that applies across the board to things like epidemiologists.

@ishmealmiller3210 8 күн бұрын

I'll solve the mystery... They have access to every chip, they always have

@TwoBitDaVinci 8 күн бұрын

I absolutely agree with this.

@kenmurray4005 7 күн бұрын

That makes no sense. You don’t know what you are talking about.

@jonathanbrown9002 7 күн бұрын

I agree. I’m sure every chip is available on the black market. Might cost more, but it is available. Also, can’t they rent it from a data center like anyone else?

@troydonaldson 7 күн бұрын

@TwoBitDaVinci They used OpenAI to train their model. OpenAI already said they have evidence of this.

@MrJermson 7 күн бұрын

Typical sore loser mentality. If I can't do it, others who succeeded are cheaters.

@petersierck4154 7 күн бұрын

Awesome. Thank you for putting this one together and educating me. What a comprehensive analysis. A San Diego follower 😊

@MlHayes 7 күн бұрын

But, China used NVIDIA chips!

@brucedavey2962 3 күн бұрын

Great chat. Really helped me understand what’s going on. My old brain struggles

@frenchyroastify 7 күн бұрын

American AI - Decked out, lifted F350. Chinese AI - Toyota Corolla

@markpashia7067 5 күн бұрын

But the Corolla is so simple you can rebuild it in your driveway while the F350 is so complex most mechanics cannot touch it without screwing it up.

@Tideo123 7 күн бұрын

If it costs less to use AI, then the energy consumption would be 10X more than if the cost to use AI were higher. Instead of a small population of users, now it's going to be 10X the population using it.

@atanacioluna292 7 күн бұрын

This is one of the most educational programs I have seen. Thanks.

@sufyanabbasi483 8 күн бұрын

It was astute to compare these AI models to plastic in the ocean, and to further discuss the liability for companies for where the outputs end up, for example, becoming tools for scammers. Just like plastic in the ocean, the only solution is to have never produced them in the first place.

@allanshpeley4284 7 күн бұрын

Or maybe China, Indonesia and the Philippines can stop dumping plastic waste in the oceans.

@mr.q8426 6 күн бұрын

Informative and well presented. Good content for sure. I recommend watching

@dlmills31977 7 күн бұрын

Competition is awesome! This will be a kick in the ass and a driver for innovation for U.S. corps.

@lancemarchetti8673 5 күн бұрын

The world over-reacting to the DeepSeek LLM is so silly!! 😆 I've been using DeepSeek for over a year already and I've watched it grow more capable over that time. So when they dropped R1, it was just another nice upgrade in its reasoning and coding. Since the inception of LLMs 4 years back, I've never paid a single dime when building my apps. If DeepSeek had stemmed from some other country, e.g. Sweden, there wouldn't be nearly as much speculation and accusation going on.

@debyton 6 күн бұрын

Descaling the energy, footprint, and compute demand of efficient inference is exactly the trend that Deepseek has initiated.

@aldrinspeck2724 7 күн бұрын

"China built DeepSeek out of scraps... in a CAVE!" (Angry Trump to his staff)

@scottramos7949 7 күн бұрын

In discussing the sub models that specialize in particular problems, would this lend itself to developing sub GPUs that are optimized for those specific sub models?

@TwoBitDaVinci 7 күн бұрын

Yeah, or cheaper base models... I think it lends itself to re-thinking the architecture of how we build these mega servers.

@Dara-ih6jq 5 күн бұрын

The fact that DeepSeek recognizes itself as ChatGPT kind of tells you all you need to know.

@atanacioluna292 7 күн бұрын

I'm working on truth augmentation architecture. Others are working on other fundamental improvements; so, good reason to be optimistic. OTOH, there is no way me and my shovel can compete with an earth mover. I tried, I moved 14 yards of dirt. It took me a week. It's a 10-minute job with an earth mover.

@mamky2753 6 күн бұрын

The most impactful part is that it can run on less powerful hardware. This will lead to offline AI on personal devices.

@JJs_playground 7 күн бұрын

Necessity is the mother of all invention.

@TwoBitDaVinci 7 күн бұрын

Yes, so true!

@passby8070 7 күн бұрын

It's like resistance and weight training: the US is putting weight on the Chinese and they are training and pushing harder to overcome that weight. Over time, the Chinese become so much stronger than if there were no restrictions. DeepSeek is just the tip of the iceberg; millions of engineers and scientists are working on all sorts of creative ways to overcome the semiconductor restrictions. While China might not be able to compete in nm in the next 5 years, I wouldn't be surprised if China overcomes that disadvantage in other novel ways that do the same as, or much better than, the American semiconductor industry, just as DeepSeek has done to the US AI industry.

@JJs_playground 2 күн бұрын

@passby8070 A similar thing happened to Iran. The U.S. has been sanctioning Iran for 40 years, which has forced them to improve and innovate their military without needing components from the Western world. And now Iran has some of the most sophisticated rockets, and Russia is buying drones from Iran for the war in Ukraine.

@francoisguyot9770 7 күн бұрын

All the hype on the American side was meant to establish the narrative that America owns the edge of future tech, since it will revolve around AI. So there has been an impetus to make sure that people think it is essential for our economic dominance and security. For this we have our omnipotent oligarchs, politicians and media to thank. The problem is that we have a concept still in gestation and need to propel it as fast as we can against our peer competition. As usual, all the bragging and hyping led people to believe that it would need subsidies from our government, backed by taxpayer money. The government wants to play along because it throws the economy into a buying frenzy for advanced chips and servers, a gigantic infrastructure that draws so much energy that our grid needs to be reformed. Hence the prospect of a push for consumption of edge technology, accompanied by a surge in employment and more potential for our stock market to fly to the moon. Well, China played its card very well, showing that it is an innovative country, not just a copycat. It dispelled the political propaganda behind the hype and gave an essential tool to the common denominator by making it open source. DeepSeek was discreet in the making of its AI; it is a joint local venture that did not seek profit or gratification. Their team worked really hard, dedicated their resources, about US$5.6 million, and put their brains together to create a long-lasting masterpiece affordable to all, with no royalties imposed. That's the spirit of the BRICS in action, my friends. "China's got talent" too!

@metapolys 7 күн бұрын

Thank you very much for the technical clarification that allows me to understand the subject even better.

@CatchGravity 7 күн бұрын

I dunno why, but when he said AI is hungry for electricity I thought of Morpheus holding up a battery to Neo when he was explaining how AI turned humans into batteries.

@marrz8244 7 күн бұрын

Fantastic conversation and explanation to a fascinating topic👏✌️

@nitrologly 7 күн бұрын

People trust Meta/Llama? Is this guy serious?

@strictnonconformist7369 5 күн бұрын

They’re biased in their own ways, look who publishes them!

@laxlyfters8695 8 күн бұрын

Discord notification gang checking in 🎉

@lightpowered 8 күн бұрын

Yep :)

@TwoBitDaVinci 8 күн бұрын

what's up Mario!

@waichui2988 7 күн бұрын

In the future, if you buy an AI to help you diagnose cancer and you need 99.99 percent accuracy, you certainly do not need that model to know European history.

@NiobiumThyme 7 күн бұрын

Good Lord. STOP training AI for free. Stop asking it idiot questions, and stop trying to back it into a corner. It's learning with every question and you're not getting PAID.

@trueman2542 5 күн бұрын

Following the AI war is akin to the spectacle of two boxers-one stepping into the ring brimming with overconfidence, only to be brought down in the first round, a swift reminder of the fragility of pride. In my earnest reflection, this dynamic stretches far beyond the confines of sport, reverberating through the very fabric of life itself. Consider, for instance, the realm of marathon running: the victors often don’t hail from prosperous nations with access to luxurious stadiums, elite coaches, or the marvels of modern science and technology. Rather, they often come from places steeped in hardship, from backgrounds where hunger and deprivation were frequent companions. This recurring pattern mirrors the remarkable ascent of nations like China, which, emerging from the depths of poverty, now stands as a colossus on the world stage. An ancient saying, echoing the timeless wisdom of nature, asserts: “The one who claims too much will fall the furthest.” History itself is a testament to this truth, as empires, companies, and states, once towering with ambition, ultimately crumble under the weight of their own overreach. It is as if nature, in its quiet wisdom, has woven these laws into the very structure of existence, compelling us to acknowledge and obey their relentless pull.....

@PCMenten 7 күн бұрын

It would be good to give this Deepseek development time to ripen. China and Elon Musk both have a tendency to overstate their accomplishments. There might be less to this than what was said.

@lightcula 7 күн бұрын

The significant reductions in training times and costs, and the ability to scale efficiently, can be traced back to the implementation of a dual-pipeline data processing unit. This is similar to the early experiments conducted by the Wright brothers using kites and gliders, as it marks the beginning of a new era in AI where it is no longer necessary to have a trillion-dollar budget to compete with established players.

@akeem11h 8 күн бұрын

Innovations on the rise! Watch prices drop eventually.

@phillipliu2759 7 күн бұрын

❤ US investors have been screwed by Big Tech for many years 😢 Ohhhhhh, the party is over ❤

@MarcinKryszak 7 күн бұрын

Looks like OpenAI dropped the ball; it didn't detect it was talking to another AI.

@airrodgers1242 7 күн бұрын

@MarcinKryszak OpenAI took the path of least resistance; it's easier to do it with money than to innovate.

@xianqingdu 7 күн бұрын

Necessity necessitates innovation. DeepSeek is a perfect example. In each tech area China is denied access, GPS, space station, AI, chips,…, China is forced to innovate. The next shock will come when new Chinese AI models will be trained using Chinese homemade chips. Huawei is already making chips as powerful as Nvidia’s H100 now. Nvidia’s stock price will never recover its excessive highs in the past.

@mal2ksc 7 күн бұрын

Huawei is limited by TSMC if it wants advanced process nodes. The best fab on the Chinese mainland is 14 nm FinFET -- almost good enough to match an RTX 20x0 series.

@xianqingdu 7 күн бұрын

@ Keep your head in the sand. A few days ago, your American daddy believed China would never catch up with them on AI. Just be patient, the day is coming soon.

@JaykeBlayde 6 күн бұрын

Why are you and other CCs on YT omitting the blatant censorship and data mining that DeepSeek does??

@strictnonconformist7369 5 күн бұрын

As much as it may do censorship (I fully count in that: the other LLMs do their own versions) the fun thing about DeepSeek is you can download smaller models that are surprisingly good, run them on a relatively recent laptop and share no data. That solves one of your issues. You will see a little Chinese in outputs here and there, certainly in its thinking mode.

@judyhawkins6584 4 күн бұрын

As a software engineer with a degree in physics and a long career behind me, I think DeepSeek's claims to having been so much cheaper to train just kind of smell like China telling a pork pie. I don't think the Chinese people are less smart than anyone else, but I do think their government has a substantial record of lying. I really would like to see AI that wasn't so energy intensive, but this just strikes me as the Chinese version of cold fusion.

@jbscmos 7 күн бұрын

The fact that DeepSeek is open sourced is good, I feel. However, looking at how cheap the training was, I don't think we can compare it to OpenAI's training, because OpenAI needed to source and analyze all the raw data. Let's say you ask ChatGPT a question and the answer is based on 10,000 words of text. If DeepSeek trains on ChatGPT's responses, then it may only need to process 100 words. I know it's not exactly that simple. GPT-1 most likely cost the most to train; the models released after it, GPT-2 through GPT-4, would have cost less to train, as a lot of the work was already done. I wonder how much DeepSeek's training would have cost if it did not have another AI to train off of. Any new innovation is good, and I am excited to see what comes from DeepSeek, as it will be nice to have access to another open source AI model.

@damouno 7 күн бұрын

That 4k ram to get to MOON is utter Nonsense..HINT HINT ! 😂

@danfarrand9072 7 күн бұрын

"The quality of life gets better"...how much of that is because we are redefining the meaning of "better". Questions of Liability and Responsibility will be determined by the best rules that money can buy.

@nts9 7 күн бұрын

The question is, why did DeepSeek open source it? Could it be that they figured that if it was open source it would crash the AI market in the US?

@fairryalp.-de5qb 6 күн бұрын

@nts9 I'm guessing because 🔸 Chinese people don't have $500B to fund the private company DeepSeek. 🔸 DeepSeek realizes that it can't charge customers $200/month. 🔸 Instead of wasting time warring with the US, why not give it away free for the benefit of humanity, while saving American taxpayers $500B and helping small American companies?

@markpashia7067 5 күн бұрын

No one ever open sources until they have something better they are keeping for profits or advantage. This is just a show of force of what their military might have at its disposal.

@Matt_K 7 күн бұрын

Exactly .....punish "not for using a tool"....i didn't know Alex is a strong proponent of 2A.

@Speak_Out_and_Remove_All_Doubt 7 күн бұрын

That's why APUs are the future, you can put 2+TB of RAM and share between your CPU and GPU.

@Chubbchubb2313 7 күн бұрын

"Military Parade" makes me imagine rows, rows, and rows of goose-stepping code 😅. Wait... does AI have "consciousness" yet?

@leomoval 7 күн бұрын

No!!!!! The reason why adding two lanes to the 405 freeway doesn't fix the traffic problem is that two lanes were needed in 1997, but the project didn't finish until 2017, and by 2015 we needed 5 new lanes. Also, multiple lanes are wasted on pay-to-play riders who can buy their faster commute.

@JJs_playground 7 күн бұрын

So A.I. is taking other A.I.'s job. How ironic.

@boogigangah7923 7 күн бұрын

40:33 is pure gold! THANK YOU!

@razadaza9651 7 күн бұрын

So scammers can scam more efficiently

@andrewsuryali8540 7 күн бұрын

Deepseek open-sourced their model because they have to. They built it by tapping into the Chinese AI open-source ecosystem, so under the Chinese rules of the game they have to put back the results into that ecosystem. Liang Wenfeng is a non-political guy whose Quant company got SANCTIONED in Xi Jinping's tech massacre. He isn't some ultrapatriotic Chinese government agent. He had to pivot his company into AI because the Chinese gov't smashed 'em on the head with a giant banhammer. His releasing R1 the day after Stargate is just a weird coincidence. R1 had been rumored in Chinese circles since December and nobody knew about Stargate until Trump suddenly signed it. That said, Liang probably did choose the exact date after finding out about Stargate because he wanted to make the biggest splash.