Thanks DeleteMe for sponsoring this video! Protect your online Info Today! joindeleteme.com/TwoBitDavinci
@headsethero54497 күн бұрын
It is legitimate for a model to train on output from another model. As confirmed in US Court. "On August 18, 2023, the US District Court for the District of Columbia released a landmark decision on the copyrightability of AI-generated works. The Court confirmed that human authorship is necessary for copyright to subsist in a work and that content generated by AI without any human involvement is not protected under US copyright law.". Ai generated work, is for that reason per definition legitimate to train on.
@headsethero54497 күн бұрын
Ironically, Sora training on YT videos is not legitimate, because that work was created by humans and thus copyright is held by the author.
@carkawalakhatulistiwa7 күн бұрын
necessity is mother of innovation. Forcing China to only get H800 not H100 And the number is limited too. .It just force China scientists to think about how to make more efficient algorithms. 😂😂 Then it is distributed for free without any censorship in github😂. So what happens to investors who spend billions of dollars? 😂😂If everyone could run R1 672B in their own home .
@carkawalakhatulistiwa7 күн бұрын
necessity is mother of innovation. Sanctions China to only get H800 not H100 And the number is limited too. .It just force China scientists to think about how to make more efficient algorithms. 😂😂 Then it is distributed for free without any censorship in github😂. So what happens to investors who spend billions of dollars? 😂😂If everyone could run R1 672B in their own home .
@Laiquelleion7 күн бұрын
I think you just misspelled AI in your video title Ricky. FYI
@sam_so_nice7 күн бұрын
China did what „open“AI was supposed to do. I think people don’t understand yet. China just gave the power of chatgpt to every developer in the world… for FREE!
@SixOhFive7 күн бұрын
@@sam_so_nice but it’s nowhere near the power of ChatGPT, it can’t even read hand writing and convert it to text, try it
@Nothing-f8z7 күн бұрын
There are open source models which performance is comparable to chatgpt, This model is mostly shaking up the industry because of how cheap it was made.
@一个说话大声的中国人7 күн бұрын
American companies, scientists, and engineers don't, can't, or won't cover their nipples and pussies and complain about the Chinese looking at them.
@skyisthelimitreadyornotfor27 күн бұрын
Chinas ai companies didn't do this, a crypto startup with extra machines lying around did, you have to give credit where credit is due.
@greeg15967 күн бұрын
А твоей женой, машиной, квартирой китайские разработчики тоже могут пользоваться бесплатно?
@willemvanriet71607 күн бұрын
Much like when placing tariffs on Japanese cars made them better and US cars worse, the chip sanctions on China made their developers better...
@amgguy43197 күн бұрын
Capitalism made American cars worse; cheaper materials, less engineering, cheaper poor manufacturing = Pure GM Trash.
@filippxx7 күн бұрын
@@willemvanriet7160 it's not that their Developers are smarter, but they had to figure a workaround for missing the compute power. Then made the model open source in a checkmate move to big tech.
@amarissimus297 күн бұрын
It's distilled. Smash and grab, obscure actual source. FP8 sure, watch who buys nvidia stock.
@GDawg2K27 күн бұрын
The chip sanctions is an open statement that the US has no honor, ethics, morals or historical background! The corruption called the Uniparty gov knows it can’t compete so it just cheats! Banking on the decades of lies and propaganda to mute any moral resistance Americans may have! But based on last weeks no1 downloaded app RedNote, I’d say that assumption is failing as well!
@Sven-cd8sn7 күн бұрын
not if they stole from OpenAI, we will know more soon
@jasonabc83977 күн бұрын
Thank you. We need more of this kind of non-politicized, technology-focused commentary. Objective, technology-driven analysis like this is valuable.
@jasonbirchoff26057 күн бұрын
It would be nice if it was accurate and pointed out that all the things mentioned pre existed deep seek, by A LONGASS TIME. Its almost as if all the effort the open source teams that built Hugging face, ollama, llama.cpp, vllm, and all the great open source models didnt exist. Even though their fine tuned an EXISTING OPEN SOURCE MODEL
@terjeoseberg9906 күн бұрын
@@jasonbirchoff2605, Good point.
@sophieedel63247 күн бұрын
I can run DeepSeek 1.5b on a Raspberry Pi just fine. Yes, it is a heavily distilled version, but I DO NOT need a $10,000 Nvidia GPU, and most people do not. It is the democratisation of AI. The moat US tech companies claimed they built, is not a moat with crocodiles, but a shallow pond with goldfish, their whole business case has been taken out.
@Daisy_9127 күн бұрын
I ran 7b version in my AMD GPU laptop. It is working even without wifi. I am so happy. 😅
@doords7 күн бұрын
@@Daisy_912 I am going to need a new computer soon, maybe i will get one that can handle this.
@Daisy_9127 күн бұрын
@@doords good idea. But I'll suggest don't hurry and wait a little bit.This thing will be long. I tried it because I already had the resources. I think in future there will be integrated AI supported chips/GPUs in the laptops. We don't have to install by ourselves.
@doords7 күн бұрын
I understand, I can wait
@mtube6207 күн бұрын
like all other chinese goods, they are based on western design, and create something just as good but lot cheaper.
@MichaelSharpBLACKDRUMMIKE7 күн бұрын
America is great at spending more than is needed and getting far less, USA military, healthcare, education, housing....
@peterg07 күн бұрын
This is what we called Corruption!
@ricksturdevant29017 күн бұрын
@@peterg0--- so sorry to correct you --- it's called --- criminally corrupt American Capitolisum that fucks over every American Citizen as well as citizens of every country in the world.
@MichaelSharpBLACKDRUMMIKE7 күн бұрын
@@peterg0 America makes corruption a business model for the billionaire class!
@nickchristakes58947 күн бұрын
Hey, look on the bright side, ...at least Israel gets free housing, healthcare, defenses, and education. BWahhhahhahhahhaaa
@bluesky90936 күн бұрын
As someone who has done business on every continent on this planet - there is a reason the US is the highest price market in the world. America thinks it has an open free market the promotes capitalism, but it’s a free market in image only, it is a market protected by high paid lobbyists who have their hands in the pockets of politicians.
@TickerSymbolYOU5 күн бұрын
Thanks for having me on!
@pfilippone5 күн бұрын
Great info and knowledge being shared! I will note your video was a bit choppy. Kind of reminded me of Max Headroom which is kind of apropos. 😎
@Felipe-n3j7 күн бұрын
USA to CHINA: sorry we won’t allowed you to join our exclusive international space program…..CHINA: ah ok , no problem we will just build our own space station, more hightech & just a fraction of your cost.😊😊😊😊😊
@anthonyramirez80387 күн бұрын
@@Felipe-n3j God bless China
@michaelloong9647 күн бұрын
The NASA is waiting China to help to retrieve the 2 astronauts stuck in in the international space station. China said we cannot help because your US law prohibits us from helping you. We can help if your government remove the law.
@hnguyen68326 күн бұрын
@@michaelloong964 Thanks. No tofu spaceship.
@楊森君5 күн бұрын
@@Felipe-n3j 當務之急,快將太空那對男女救下來吧
@stevenlai11995 күн бұрын
@@hnguyen6832 do ma kongkak
@LikeItDeep8 күн бұрын
Silly Con valley oligarchy just got trumped by creativity
@Sven-cd8sn7 күн бұрын
call it theft creativity, wait for the dust to settle...
@fanghe-z1j7 күн бұрын
@@Sven-cd8sn The thief not only stole but was also audacious enough to publicly disclose the stolen items, allowing everyone, including the owner, to use them for free.
@jasonbirchoff26057 күн бұрын
No, they are mostly wondering what took all you trolls long enough to recognize open source models like llama and mixtral existed BEFORE R1 and offered similar performance.
@alfredchiu22757 күн бұрын
In project management there are two sayings: good is good enough. Better is the enemy of good. A billion dollars made U.S. engineers fat dumb and happy.
@Melodicminority7 күн бұрын
Oops! Nice sayings, makes sense.
@jamesnotsmith14657 күн бұрын
Those sayings are 'truisms' for a project manager. A project manager has three things to manage. Cost, Schedule and Performance. Performance is defined by specifications. It takes time (think schedule) to develop something. Keeping people on the payroll for that time costs money. Hence, the old adage: Time is money. If your specification requires you to advance technology to get performance 'x' and it costs more to reach 2 times better than x, you are wasting money if you achieve 2 times x (because your client only asked for 'x'. Your customer won't be happy getting charged for developing 2x when he only wanted x.) Good is good enough means you have met the customer's specification...stop spending money and deliver the product. It can also be stated that a design engineer is not in the business of making good enough better. If a customer wants better, it's time to revise the specification and go back into development (spiral development). In the same way 'better is the enemy of good'...because it increases cost and schedule to get better for no known reason (meaning you are not spec'd for better).
@Jeez0017 күн бұрын
The reason for this was companies funding AI were the ones selling the shovels (Microsoft, Nvidia) so there was no reason to improve efficiency (or risk losing the funding).
@yanstev7 күн бұрын
@jamesnotsmith1465 You are thinking in terms of widget production, where the end point properties, quality, and performance are well specified. RDT&E is a different animal, and the notion of good enough is constantly changing. Cutting edge innovation is team and individual dependent, and break throughs are never guaranteed.
@jasonbirchoff26057 күн бұрын
It is truely sureal watching MSM introduce normies to open source and they act like china invented open source...
@matten_zero7 күн бұрын
The paradox stated doesn't save Nvidia, esp if you can use any GPU to run these models. The previous narrative was that the ONLY GPUs that could run these processes were top line Nvidia GPUs, and that is being broken
@totobobomask7 күн бұрын
@@matten_zero agree. now there are real alternative GPUs which is good for all except Nvidia.
@marvinfok657 күн бұрын
@@matten_zero Nvidia's marketing myth is broken.
@mal2ksc7 күн бұрын
That was never the case though, Intel is pretty aggressively targeting the AI market with its discrete GPUs. They just don't have the high end hardware to offer, but they do have hardware that's good enough to eat the low end of nVidia's line if they fumble the ball. Intel is putting 12 GB on its entry level card, for example.
@jasonbirchoff26057 күн бұрын
What are you talking about use any model. The make believe improvements discussed in the paper were specifically done for NVIDIA GPU's not AMD or INTEL or Chinas ASCEND GPU's. As for running them. lets be honest most people will be running them in the cloud. Which will be a mixture of NVIDIA gpus and cloud provider specific chips.
@matten_zero7 күн бұрын
@jasonbirchoff2605 sure but the argument that the only way to get o1 level performance was to train on the highest end GPU is basically dead. Also the idea that you need to run these models exclusively on NVidia GPUs is also dead. Regardless it doesn't justify their valuation and the market is realizing that.
@paper_gem8 күн бұрын
India is releasing their own AI called Dheep Sikh. Okay, I'm sorry.
@TwoBitDaVinci8 күн бұрын
lol
@waynewalker34938 күн бұрын
@@paper_gem 🤓
@Chubbchubb23138 күн бұрын
😂 I'm awake now ☕. Nice to see "goofy nerds" are still around.
@stayfree8707 күн бұрын
DeepSingh.
@jonathonpotts56667 күн бұрын
am I samosa find that funny?
@igolfer7 күн бұрын
It is really not that hard to understand why Chinese AI like Deep Seek is a lot cheaper, and more effective, at least, in the Chinese language domain. Chinese researchers and engineers focus on model and logic backed up by vast amounts of data collected by many big companies like Tencent. Alibaba, Xiaomi, etc, while the U.S. counterparts rely heavily on chips capacity and capabilities for computing. There is far more human intelligence involved in Deep Seek than ChatGTP. Human intelligence is lot cheaper and efficient and effective than hardware. The Western media pretend not to understand this simple fact because they don’t know how to compete with China and they don’t know how to deal with the consequences in the human intelligence domain. Cooperation with China is the only viable option for the U.S., so that the U.S. would know a lot more about what China is doing, and vice versa, for a better world.
@NiceTriGuy7 күн бұрын
The US ‘cooperated’ with China on viral research in Wuhan hoping to keep an eye on research that was illegal in the US. The CCP made fools of them and how did that work out for your better world. It is naive to expect that China will ever do what’s best for anyone other than China… if you think they are think again.
@jasonbirchoff26057 күн бұрын
dude calm down... their model is not cheaper or more effective.
@drcubix7 күн бұрын
It's pretty good, way better than chatgpt's default version which normal people use
@oceanwave45025 күн бұрын
I heard that Huawei Ascecnd 910C, despite being less featured than NVIDIA alternative, has lower cost of operation, plus less energy usage.
@jasonbirchoff26054 күн бұрын
@@oceanwave4502 *blink* *blink* *Blink*... Ist this a joke... Of course if you have less capability you will have less energy usage so lower cost of operation.... This is not a flex.
@bobwx19877 күн бұрын
Great video. Technical, but understandable to an LLM tourist. This DeepSeek thing is reminiscent of the NASA million dollar pen vs. the soviet one dollar pencil solution in the great space race.
@user-mgtp7 күн бұрын
Wake up call for India: brain drain needs to stop, genuine research and genuine entrepreneurship needs to be promoted.
@TwoBitDaVinci7 күн бұрын
100% I'm Indian I visit once a year, and I'm ALWAYS disappointed at how far behind India is. All of Asia is seemingly leaving India behind. India's biggest export CAN'T be its engineers... but how do we address this?
@passby80707 күн бұрын
@@TwoBitDaVinci yes that's very disappointing, but it's a deep structural problem. The Indian government and the society as a whole just don't have the humility, forsight and the will to recognize a problem and setting up long term plan to tackle it. You can very easily see that through the performance of the Indian Olympic team. For a population of 1.4 billion people, the country cannot even produce a single gold medal for most of the Olympics games that runs every 4 years. It really says a lot about the country and the leadership's mantality. China on the other have a very different mentality and it shows in the way they performed in the Olympics and any other international competitions.
@Daisy_9127 күн бұрын
As a PhD holder (in STEM) from India, I am often laughed at in India even by many educated Indians. But not outside India. It's ture that India is still behind because of the mentality.
@dol39807 күн бұрын
The only export from India to the Globe is cheap engineers and truck/cab drivers and inn keepers. Most techs from India have phony or lackluster credentials unless they were educated in the West. India will never catch up (till 2050) to China with its A+ STEMs. India shud focus on fixing its sanitation system first.
@Trials_By_Errors7 күн бұрын
Why would any country spend Trillion dollars. If it can buy it much cheaper or free after 2 years. AI is not profitable.
@FloodedOrchids7 күн бұрын
Everybody going crazy about DeepSeek and Nvidia but I am just a chill guy who invests in Index ETFs.
@Richmind-ir5zi7 күн бұрын
Could you offer a solid resource for someone who is just starting out in the stock market that describes the various investing vehicles, such as "index ETFs"? I have 50k to invest, but I do not know anything yet.
@Marianela-r3v7 күн бұрын
Having an investment advisor is the best approach to the market especially for a newbie like you. I was going solo without much success until my husband introduced me to an advisor. I've achieved over 80% capital growth since Q3 last year, excluding dividends. So i will advise you get one as well.
@Skimama17 күн бұрын
Could you recommend who you work with? I really could use some help at this moment please.
@Marianela-r3v7 күн бұрын
Lauren Christine Campbell is the CFA I work with and im just putting this out here because you asked. You can Just search the name. You’d find necessary details to work with to set up an appointment.
@Skimama17 күн бұрын
Thanks for the suggestion! I really needed it. I looked her up on Google and explored her website; she has an impressive background in investments. I've sent her an email, and I hope to hear back from her soon
@Cloud988 күн бұрын
I work in automotive software and we still use 8 and 16 bit fixed point data types to save memory even today. I didnt realize 8 bit floating types existed, thats cool!
@TwoBitDaVinci8 күн бұрын
that's very true, a lot of messages on different busses def are very data optimized. Glad you got as big a kick out of some of these innovations as we did. very cool stuff and it'll definitely find its way into all future models
7 күн бұрын
Essentially it's the same game as back in the day when consoles where not so powerful and game developers had to be creative to build a great game on those less powerful systems. In 20 years we look back to AI as gamers look back to the late 80s to mid 90s how creative developers had to be.
@obchiang3 күн бұрын
Thanks!
@mijmijrm7 күн бұрын
DeepSeek's open source => AI will become like the transition from Mainframe Computers to Personal Computers.
@jasonbirchoff26057 күн бұрын
So I guess llama and mistral and the plethora of open source llm's on hugging face just appeared out of no where when deep seek got announced on SM....
@doripenem4 күн бұрын
@@jasonbirchoff2605 the difference is that DeepSeek is significantly better than llama and mistral, comparable to ChatGPT. In fact it even outperformed ChatGPT in some benchmarks!
@jasonbirchoff26054 күн бұрын
@@doripenem a handle ful of points difference on bench marks is not significantly better. When comparing DeepSeek V3 to the other premeire open source models. Also, As some who has used deep seek r1 full and the distills. They are not better than chatgpt. They are in the same performance category but not better. R1 full is not better than O1. Comparing R1 to other non reasoning OS models is stupid. But if you check for situations where they explicitly implement COT in the bench mark you will see a negligible difference.
@doripenem4 күн бұрын
@@jasonbirchoff2605 ah, so now benchmarks are meaningless? Cool, cool. Let's ignore the benchmark scores and bow to the unparalleled expertise of jasonjerkoff, the ultimate authority in assessing LLM quality👌🏻👌🏻👌🏻
@jasonbirchoff26054 күн бұрын
@doripenem you really need to work on your reading comprehension... I never said they are meaningless. I pointed out that a couple points difference in benchmark stats is inconsequential.
@wardmcbride85877 күн бұрын
He should have heard about the story of a person from Vancouver, Canada. He was manufacturing smart phones that operated on its own network and programming hardware. He was selling them to criminals for $5000. Everything about his system, including his customers, was illegal and got caught by the authorities. The FBI heard about him and the phones and asked the RCMP if they could borrow him. They had him make these phones and created a false front to sell them to some of their prime targets. The phones were made so they could listen in whenever they wanted to. This operation was in use for 2 years before someone figured out the phones were bugged.
@I_am_not_a_bot-s6i4 күн бұрын
I can't believe some chinese sweatshop workers created something more powerful than anyone else its pretty amazing considering they cant even get past the great chinese firewall to the internet outside world.
@VLADDDD-TTHE-SANCTIONS-IMPALER7 күн бұрын
The CS grads who pass out in US hardly hardly know anything about floating point arithmetic or log2 base arithmetic. I used to treat bits like gold in my embedded dev days. The grads walk around like rich daddy’s kids who only learnt CS using infinite compute storage and network. 99% of them can’t tell me what a loader, compiler or micro instruction set is They surely know how to ask 200k starting salary It’s over for west. 😢
@TwoBitDaVinci7 күн бұрын
you're an embedded dev, so you are well poised to understand just how easy we've made coding. When I wrote my first app for the iPhone, it was SO HARD, and now its become nearly no code. We definitely need a rethink! i appreciate your comment, cheers!
@VLADDDD-TTHE-SANCTIONS-IMPALER7 күн бұрын
@ so easy to code! They won’t have a job soon! This generation is the weakest techie generation in 100 years It’s truly over for USA !! Truly!
@jasonbirchoff26057 күн бұрын
What are you talking about dude. the quantization your referencing was started in american labs. They are literally how the open source models which have been competing well against OAI models have been used by users in the open source community running vllm and ollama. Hell there is even a distributed implementation of llama.cpp so you can split the model across multiple nodes. That way you can run bigger models on a cluster of machines like raspberry pis... I swear the ignorance on display is staggering.
@VLADDDD-TTHE-SANCTIONS-IMPALER7 күн бұрын
@ bigger is not better. Do you know how to optimize a GPUs processor interleaving? Meaning reduce process cycle wait time or interrupts? I bet 100% you don’t. You are a modern day techie who only knows python, json and a few readily available tools. Can’t dig inside or open the hood It’s over for west! Weakest techie generation
@jasonbirchoff26057 күн бұрын
@VLADDDD-TTHE-SANCTIONS-IMPALER so I point out that those optimizations have been actively used in open source from US Labs for years and your comeback is.... I am a normies techie... Tell me your a China propagandist without telling me your a China propagandist
@dystasia6 күн бұрын
Are you kidding me. This conversation was incredibly informative and interesting. Please more like this where sometimes you go deep into technical stuff in a fun way.
@TwoBitDaVinci6 күн бұрын
you got it! I'm planning to do that, cheers!
@jean-marclambert29367 күн бұрын
It is so funny. It is always the same. The next big AI thing will come from a garage, from a dorm, from a lab ... That is where Apple, Google, Meta, Microsoft were born, and they forgot. They have forgotten that getting biger doesn't make you smarter , and what you need is the best idea, the best concept, with the best team ... not money, but ingenuity ...
@JD-yz4kr7 күн бұрын
The fallacy here and a reflection of American conceit is the delusion that Nvidia is the only GPU in the world. Well, news flash, it's not. China has dozens of GPU manufacturers, most notable is Huawei Ascend. Most of these GPU are of course inferior to Nvidia's H100, but are good enough for lower requirements. Also, memory is basically unlimited in China. Another conceit is that DeepSeek released R1 just to spite the US, as though everything has to revolve around the US. DeepSeek released it because that was the day all preparations were completed. They need to release it as fast as possible because they want to get ahead of ByteDance and Moonshot , who will be releasing their own models. Lastly, the answer is that Tonya Harding still lost even after kneecapping Nancy Kerrigan.
@passby80707 күн бұрын
Yep, it's going to be another shock moment not far in the future. Tech control would just make China much stronger. Like resistant training. A normal person can left 50 kgs without a training. The same person can lift 200kgs after a couple years of consistent hard work.
@fannyalbi90406 күн бұрын
U told whole true. U must be a Chinese. DS has no interested to disrupt what the western media or fear mongers think. DS just like Linux, let share it and same minded might make it better. Western failure is -- never short of self righteousness religiously.
@strictnonconformist73695 күн бұрын
I’ve done enough testing on the smaller DeepSeek models to see that they still need to spend a bit more time fixing issues. It’s impressive for its capabilities and clear reasoning chain of thought as long as it doesn’t forget its past context and act as though it has Alzheimer’s, and becomes a bit disobedient. Last night I tried to use it to create a Science Fiction story and specified a standard 4 act form, and it literally lost the plot and forgot where it was in the process. Maybe I just need to prompt it a bit more step by step and not allow it to think about the entire story at once, knowing the end before it has completely filled in the beginning, as that may have caused it to seem to end things too quickly to get a reward. The shocking thing is that memory bandwidth is the biggest practical speed limit to running a useful model, followed by physical memory size: my Surface Laptop 7 has 64 GB RAM and I tested a 70B model and if I were patient enough, it’d eventually process it all, but thrash the SSD and memory, while leaving the CPU largely idle. And that’s the thing, LM Studio doesn’t even touch the NPU of the machine, just uses the main processor, and it’s still memory speed limited. It’s not CPU-limited. My machine is still quite responsive even with a 32B model and a lot of other things running. And the Surface Laptop 7 doesn’t come close to most currently sold Apple Silicon processors for memory bandwidth. When the AI processing is memory I/O bound, the GPUs are moot if you have enough CPU cores.
@r3bennett6 күн бұрын
I asked Deepseek to summarize the news today. It couldn’t do it but Copilot did it very well.
@hopeseekr6 күн бұрын
Because it doesn't have access to APIs... it's stuck at whenever its foundation model was trained.
@siewkonsum72917 күн бұрын
The American big hi-tech oligarchs try to capitalise their very high-cost AI by selling dearly to subscribers who need it to pay for it to use it; Say annual subscription at Usd200 per month subject to yearly renewal, to make billions or trillions of dollars each year. However some great low-cost Chinese AI developers, like Deepseek is socializing AI by making it open & free to anyone by downloading it for their use worldwide. Long live the Chinese, China & the World!! 👏👏👏 ❤ 🇨🇳 💪💪💪
@chrono94286 күн бұрын
Why did OpenAI not make their open source so that the entire community can benefit and all consumers can run their own ChatGPTs according to their own customisations? They can still charge big bucks to filthy rich people by adding some bells and whistles, like colours and shapes. I agree with Alex, I am not willing to pay $200 to OpenAI for AI but $0.50, I can consider.
@sportsonwheelss7 күн бұрын
It took 4kb to go to the moon, yet we had lost the tech to go back to the moon.
@robertfansler78007 күн бұрын
@@sportsonwheelss The aliens on the moon said, don’t come back!👽
@sportsonwheelss7 күн бұрын
@@robertfansler7800 lol
@wweishi6 күн бұрын
simply becoz u didnt go to the moon
@robertcerins4 күн бұрын
Lol moon 🌙 landing.
@TheDaspiffy8 күн бұрын
If a manufacturing defect causes a fatal accident no one goes to jail because otherwise we wouldn't have cars. There are recalls and fines for not complying with recalls. Jail time in corporate America is generally reserved for malicious behavior rather than unintentional harm. There is no way that Google or Meta would produce publicly available AI if there was a chance their CEOs would serve time.
@sgrdpdrsn5 күн бұрын
Does the Deepsink do the same job as ChatGPT with much less electrisity?
@PoolBoyRoy7 күн бұрын
At least Nvidia won't be booked out for two years, maybe just 18 months now?
@worldadventureman5 күн бұрын
I guarenteee when AI starts dominating conversations on the internet, there will be a massive upswing in the inteligence of those conversations.
@markpashia70675 күн бұрын
Cannot come too soon for most of KZbin comments. Do I have to suspect I am conversing with AI if the comments are too intelligent?
@worldadventureman5 күн бұрын
@@markpashia7067 It's highly likely, but then we can only hope they would even bother to communicate with us. 😂😂
@phillipliu27597 күн бұрын
❤All US Tech are all national security concerns 😮ohhohh, politicians are in DS❤
@larrymelia28677 күн бұрын
Your best interview!
@asdfasdffdcd7 күн бұрын
The compression is very necessary for limited computation resources. Thanks to the NVDA ban. And it also results in carbon emission lower.
@leafykille7 күн бұрын
It's exactly the same as games, look back at something like the original rogue or elite or cartridge games and the mind boggles at just how small they are for what they can do. Modern coding has become lazy because the coders time is more expensive than the compute resources. The Chinese actually designed something to be efficient with the resource that is the limiting factor, compute. If you have a limiting factor, the easy answer is to increase the supply, the efficient answer is to optimise the process to minimise the use of the limiting factor.
@aliyap45807 күн бұрын
Thank you china, I love deepseek. 👍
@luzhang34295 күн бұрын
Love this video! Love your approach and attitude towards uncertainty and challenges! Great job!
@AndrewKuntzman7 күн бұрын
Great info I don’t really know what to think about deep seek yet but pretty wild if all the numbers are accurate which based on his info seems at least mostly accurate 🤯
@petersierck41547 күн бұрын
Great in depth presentation by Alex. !!!!!! Just one comment i don’t think AI is the final frontier. We are, in the development of our spiritual capabilities to access the most subtle energies of the cosmic energies.
@TheSateef7 күн бұрын
no way is Ricky old enough to remember punch card days. I am, my university in the UK had a punch card reader, and i'm probably 20 years older than Ricky
@AlexanderTsepkov5 күн бұрын
45:00 "if a model kills someone, the maker of the model is responsible": by that logic, should the entire family of the murderer go to prison with him for the crime he committed?I don't think you thought this one through, that sounds like Stalinist Russia thinking to me. Also, you do realize that you're destroying the incentive to innovate through this asinine policy? If there is even 1% chance that my model might accidentally kill someone, I'm sure as hell not going to release it to the public. How competitive do you think US will be against China with laws like this?
@headsethero54497 күн бұрын
It is legitimate for a model to train on output from another model. As confirmed in US Court. "On August 18, 2023, the US District Court for the District of Columbia released a landmark decision on the copyrightability of AI-generated works. The Court confirmed that human authorship is necessary for copyright to subsist in a work and that content generated by AI without any human involvement is not protected under US copyright law.". Ai generated work, is for that reason per definition legitimate to train on.
@headsethero54497 күн бұрын
Ironically, Sora training on YT videos is not legitimate, because that work was created by humans and thus copyright is held by the author.
@cryolasv27 күн бұрын
"Tony Stark did it with scraps in a cave"
@jasonbirchoff26057 күн бұрын
Please dont insult what stark did by comparing to what deep seek did. They are not the same not even a little bit.
@cryolasv27 күн бұрын
@jasonbirchoff2605 there's no insult given, just perfect analogy in my opinion.
@jasonbirchoff26057 күн бұрын
@@cryolasv2 what stark did in the cave was innovate from scratch. What deep seek did was cobble together techniques and data from others. MOE was first done outside OAI by Mistral labs in France... So yes it is an insulting comparison. You can only say what you said if this is your first experience with open source models.
@Jaw0lf7 күн бұрын
The great takeaway for me is efficiency. So the more power you have the more it will be capable of doing. This was an eye opening chat. Thank you.
@dennys7264 күн бұрын
Thank you for this! It is my first introduction to AI, and you make it most understandable.
@markmarco62777 күн бұрын
KZbin made it so I can't delete my past comments, nor see responses from you guys. This is my last comment on KZbin...great video TBDV
@jonathanbrown90027 күн бұрын
Alex and Ricky are two of my favorite KZbin creators. Super interesting conversation!
@gwyllymsuter45517 күн бұрын
It won't be long before deepseek AI engine will be natively part of Linux builds
@jinjihu37677 күн бұрын
Ali Baba just release an AI that is much more powerful than Deepdeek
@adventureswithlils43317 күн бұрын
@@jinjihu3767 nope Compares to V3 not R1
@suesyphers33965 күн бұрын
I love the comment about honor students. The Chinese are incredibly brilliant and they train their students to be brilliant. We particularly in the red states tend to be more focused on what we don't want students to learn instead of giving them the freedom to explore and REALLY learn
@markpashia70675 күн бұрын
Or derail them into sports. Friday Night Lights anyone? When a society values sports more than intellect it is in trouble. And that applies across the board to things like epidemiologists.
@ishmealmiller32108 күн бұрын
I'll solve the mystery... They have access to every chip, they always have
@TwoBitDaVinci8 күн бұрын
i absolutely agree with this
@kenmurray40057 күн бұрын
That makes no sense. You don’t know what you are talking about.
@jonathanbrown90027 күн бұрын
I agree. I’m sure every chip is available on the black market. Might cost more, but it is available. Also, can’t they rent it from a data center like anyone else?
@troydonaldson7 күн бұрын
@@TwoBitDaVinci They used OpenAI to train their model. OpenAI already said they have evidence of this.
@MrJermson7 күн бұрын
Typical sore loser mentalify. If i canf do iit, others who succeeded are cheaters.
@petersierck41547 күн бұрын
Awesome. Thank you for putting this one and educating me. what a comprehensive analysis. A San Diego follower 😊
@MlHayes7 күн бұрын
But, China used NVIDIA chips!
@brucedavey29623 күн бұрын
Great chat. Really helped me understand what’s going on. My old brain struggles
@frenchyroastify7 күн бұрын
American AI - Decked out, lifted F350. Chinese AI - Toyota Corolla
@markpashia70675 күн бұрын
But the Corolla is so simple you can rebuild it in your driveway while the F350 is so complex most mechanics cannot touch it without screwing it up.
@Tideo1237 күн бұрын
If it costs less to use AI then the energy consumption would be 10X more than if the cost to use AI is less. Instead of a small population of users now it's going to be 10X the population using it.
@atanacioluna2927 күн бұрын
This is one of the most educational programs i have seen. Thanks.
@sufyanabbasi4838 күн бұрын
It was astute to compare these AI models to plastic in the ocean, and to further discuss the liability for companies for where the outputs end up, for example, becoming tools for scammers. Just like plastic in the ocean, the only solution is to have never produced them in the first place.
@allanshpeley42847 күн бұрын
Or maybe China, Indonesia and the Philippines can stop dumping plastic waste in the oceans.
@mr.q84266 күн бұрын
Informative and well presented. Good content for sure. I recommend watching
@dlmills319777 күн бұрын
Competition is awesome! This will be a kick in the ass and a driver for innovation for U.S. corps.
@lancemarchetti86735 күн бұрын
The world over-reacting to the Deepseek LLM is so silly!! 😆 I've been using Deepseek for over a year already and I've watched it grow more capable over that time. So when they dropped R1, it was just another nice upgrade in it's reasoning and coding. Since the inception of LLMs 4 years back, I've never paid a single dime when building my apps. If Deepseek had stemmed from some other country e.g Sweden, there wouldn't be nearly as much speculation and accusation going on.
@debyton6 күн бұрын
Descaling the energy, footprint, and compute demand of efficient inference is exactly the trend that Deepseek has initiated.
@aldrinspeck27247 күн бұрын
"China built Deepseek out of scraps.....in a CAVE! (Angry Trump to his staff)
@scottramos79497 күн бұрын
In discussing the sub models that specialize in particular problems, would this lend itself to developing sub GPUs that are optimized for those specific sub models?
@TwoBitDaVinci7 күн бұрын
yeah, or cheaper base models ... i think it lends itself to re-thinking the architecture of how we build these mega servers
@Dara-ih6jq5 күн бұрын
Fact that deep seek recognizes itself as ChatGPT kind of tells you all you need to know
@atanacioluna2927 күн бұрын
I'm working on truth augmentation architecture. Others are working on other fundamental improvements; so, good reason to be optimistic. OTOH, there is no way me and my shovel can compete with an earth mover. I tried, I moved 14 yards of dirt. It took me a week. It's a 10-minute job with an earth mover.
@mamky27536 күн бұрын
The most impactful part is that it can run on less powerful hardware. This will lead to offline AI on personal devices.
@JJs_playground7 күн бұрын
Necessity is the mother of all invention.
@TwoBitDaVinci7 күн бұрын
yes so true!
@passby80707 күн бұрын
It's like resistant and weight training, US is putting weight on the Chinese and they are training an pushing harder to over come that weight. Overtime, the Chinese are so much stronger than if there's no restrictions. Deepseek is just the tip of the iceberg, the millions of engineers and scientists are working on all sorts of creative ways to overcome the semiconductors restrictions. While China might not be able to compete in nm in the next 5 years, I wouldnt be surprised that China will overcome that disadvantage by other novel ways that would do the same or much better than the American semiconductors industry as what Deepseek has done to the US AI industry.
@JJs_playground2 күн бұрын
@@passby8070 a similar thing happened to Iran. The U.S. has been sanctioning Iran for 40 years which had forced them to improve and innovate their military without needing components from the Western world. And now Iran has some of the most sophisticated rockets and Russia is buying drones from Iran, for the war in Ukraine.
@francoisguyot97707 күн бұрын
All the hype for the the Americans side was to establish the narrative that America owns the edge of future tech, since it will revolve around AI. So there has been an impetuous to make sure that people think it is essential for our economic dominance and security. For this we had our omnipotent oligarchs, politicians and media to thank for. The problem is that we have a concept into gestation and need to propel it as fast as we can against our peer competition. As usual all the bragging and hyping led people to believe that it would need subsidies from our government backed by taxpayer money. The government want to play this card along because it throws the economy in a buying frenzy for advance chips and severs, a gigantic infrastructure that draws so much energy that it needs to reform our grid. Therefore the prospect of push for consommation in edge technology accompanied by a surge in employment and more potential for our stock market to fly o the moon. Well, China played its card very well, showing that it is an innovative country, nor just a copycat. It dispelled the political propaganda behind the hype and gave an essential tool for the common denominators by making it open source. Deepseek was distcreete in the making of their AI, it is a join local venture that did not seek profit or gratification. Their team worked really hard to dedicate their resources, about 5.6 millions u$, put their brain together to create a long lasting master piece affordable to all, no royalties imposed. That's the spirit of the BRICS in action my friends. "China's got talents' too!
@metapolys7 күн бұрын
Thank you very much for the technical clarification that allows me to understand the subject even better.
@CatchGravity7 күн бұрын
Idono why but when he said Ai is hungry for electricity I thought of Morpheus holding up a battery to Neo when he was explaining how Ai turned humans into batteries..
@marrz82447 күн бұрын
Fantastic conversation and explanation to a fascinating topic👏✌️
@nitrologly7 күн бұрын
People trust Meta/Llama? Is this guy serious?
@strictnonconformist73695 күн бұрын
They’re biased in their own ways, look who publishes them!
@laxlyfters86958 күн бұрын
Discord notification gang checking in 🎉
@lightpowered8 күн бұрын
Yep :)
@TwoBitDaVinci8 күн бұрын
what's up Mario!
@waichui29887 күн бұрын
In the future, if you buy an AI to help you diagnose cancer and you need 99.99 percent accuracy, you certainly do not need that model to know European history.
@NiobiumThyme7 күн бұрын
Good Lord. STOP training AI for free. Stop asking it idiot questions, and stop trying to back it into a corner. It's learning with every question and your not getting PAID.
@trueman25425 күн бұрын
Following the AI war is akin to the spectacle of two boxers-one stepping into the ring brimming with overconfidence, only to be brought down in the first round, a swift reminder of the fragility of pride. In my earnest reflection, this dynamic stretches far beyond the confines of sport, reverberating through the very fabric of life itself. Consider, for instance, the realm of marathon running: the victors often don’t hail from prosperous nations with access to luxurious stadiums, elite coaches, or the marvels of modern science and technology. Rather, they often come from places steeped in hardship, from backgrounds where hunger and deprivation were frequent companions. This recurring pattern mirrors the remarkable ascent of nations like China, which, emerging from the depths of poverty, now stands as a colossus on the world stage. An ancient saying, echoing the timeless wisdom of nature, asserts: “The one who claims too much will fall the furthest.” History itself is a testament to this truth, as empires, companies, and states, once towering with ambition, ultimately crumble under the weight of their own overreach. It is as if nature, in its quiet wisdom, has woven these laws into the very structure of existence, compelling us to acknowledge and obey their relentless pull.....
@PCMenten7 күн бұрын
It would be good to give this Deepseek development time to ripen. China and Elon Musk both have a tendency to overstate their accomplishments. There might be less to this than what was said.
@lightcula7 күн бұрын
The significant reductions in training times, costs, and the ability to scale efficiently can be traced back to the implementation of a dualpipeline data processing unit. This is similar to the early experiments conducted by the Wright bros using kites and gliders as it marks the beginning of a new era in AI where it is no longer necessary to have a trillion-dollar budget to compete with established players
@akeem11h8 күн бұрын
Innovations on the rise!, watch prices drop eventually.
@phillipliu27597 күн бұрын
❤US investors are being screwed by Big Tech for many year’s 😢 ohhhhhh, party is over❤
@MarcinKryszak7 күн бұрын
Looks like OpenAI drop the ball, didn't detect it was talking to other AI.
@airrodgers12427 күн бұрын
@@MarcinKryszak Open AI took the path of least resistance, it easier to do it with money than innovate.
@xianqingdu7 күн бұрын
Necessity necessitates innovation. DeepSeek is a perfect example. In each tech area China is denied access, GPS, space station, AI, chips,…, China is forced to innovate. The next shock will come when new Chinese AI models will be trained using Chinese homemade chips. Huawei is already making chips as powerful as Nvidia’s H100 now. Nvidia’s stock price will never recover its excessive highs in the past.
@mal2ksc7 күн бұрын
Huawei is limited by TSMC if it wants advanced process nodes. The best fab on the Chinese mainland is 14 nm FinFET -- almost good enough to match an RTX 20x0 series.
@xianqingdu7 күн бұрын
@ Keep your head in the sand. A few days ago, your American daddy believed China would never catch up with them on AI. Just be patient, the day is coming soon.
@JaykeBlayde6 күн бұрын
Why are you and other CC's on YT omitting the blatant censorship and data DeepSeeking mining that it does??
@strictnonconformist73695 күн бұрын
As much as it may do censorship (I fully count in that: the other LLMs do their own versions) the fun thing about DeepSeek is you can download smaller models that are surprisingly good, run them on a relatively recent laptop and share no data. That solves one of your issues. You will see a little Chinese in outputs here and there, certainly in its thinking mode.
@judyhawkins65844 күн бұрын
As a software engineer with a degree in physics and a long career behind me, Deepseek's claims to having been so much cheaper to train just kind of smell like China telling a pork pie. I don't think the Chinese people are less smart than anyone else, but I do think their goverment has a substantial record of lying. I really would like to see AI that wasn't so energy intensive, but this just strikes me as the Chinese version of cold fusion.
@jbscmos7 күн бұрын
The fact that Deepseek is opened sourced I feel is good. However looking at how cheap the training was I don't think we can compare the training on OpenAI's due to the fact that they needed to raw source and analyze all the data. Let's say if you have a question to ask chat GPT something and the answer was based off of 10,000 words of text. If Deepseek trains off of ChatGPT's responses then you may only need to process 100 words. I Know it's not exactly that simple. GPT1 most likely cost the most to train then released after GPT2......GPT4 would cost less to train as they already have a lot of the work done. I wonder if Deepseek did not have another AI to train off of how much it's training would have cost. Any new innovation is good and I am excited to see what comes from Deepseek as it will be nice to have access to another open source AI model.
@damouno7 күн бұрын
That 4k ram to get to MOON is utter Nonsense..HINT HINT ! 😂
@danfarrand90727 күн бұрын
"The quality of life gets better"...how much of that is because we are redefining the meaning of "better". Questions of Liability and Responsibility will be determined by the best rules that money can buy.
@nts97 күн бұрын
the question is why did Deep-seek open source it? could it be that they figured that if it was open source it would crash the AI market in the US?
@fairryalp.-de5qb6 күн бұрын
@@nts9 I'm guessing because 🔸Chinese people don't have $500B to fund the private company DeepSeek. 🔸 DeepSeek realizes that it can't charge customers $200\month. 🔸 Instead of wasting time having war with US, then why not giving it free to the benefit of human. While saving American Tax Prayers $500B and small American companies.
@markpashia70675 күн бұрын
No one ever open sources until they have something better they are keeping for profits or advantage. This is just a show of force of what their military might have at it's disposal.
@Matt_K7 күн бұрын
Exactly .....punish "not for using a tool"....i didn't know Alex is a strong proponent of 2A.
@Speak_Out_and_Remove_All_Doubt7 күн бұрын
That's why APUs are the future, you can put 2+TB of RAM and share between your CPU and GPU.
@Chubbchubb23137 күн бұрын
"Military Parade" makes me imagine: Rows, rows, and rows of goose-stepping code 😅. Wait ... does AI have "consciousness yet".
@leomoval7 күн бұрын
No!!!!! The reason why adding two lanes to the 405 freeway doesnt fix the traffic problem is because two lanes were needed in 1997. But the project didnt finish until 2017. And by 2015 we needed 5 new lanes. Also multiple lanes are wasted for pay to play riders who can buy their faster commute.
@JJs_playground7 күн бұрын
So A.I. is taking other A.I.'s job. How ironic.
@boogigangah79237 күн бұрын
"40:33 is pure gold! THANK YOU!
@razadaza96517 күн бұрын
So scammers can scam more efficiently
@andrewsuryali85407 күн бұрын
Deepseek open-sourced their model because they have to. They built it by tapping into the Chinese AI open-source ecosystem, so under the Chinese rules of the game they have to put back the results into that ecosystem. Liang Wenfeng is a non-political guy whose Quant company got SANCTIONED in Xi Jinping's tech massacre. He isn't some ultrapatriotic Chinese government agent. He had to pivot his company into AI because the Chinese gov't smashed 'em on the head with a giant banhammer. His releasing R1 the day after Stargate is just a weird coincidence. R1 had been rumored in Chinese circles since December and nobody knew about Stargate until Trump suddenly signed it. That said, Liang probably did choose the exact date after finding out about Stargate because he wanted to make the biggest splash.