Building the Software 2 0 Stack (Andrej Karpathy)

  Рет қаралды 65,370

Databricks

Databricks

Күн бұрын

A lot of our code is in the process of being transitioned from Software 1.0 (code written by humans) to Software 2.0 (code written by an optimization, commonly in the form of neural network training). In the new paradigm, much of the attention of a developer shifts from designing an explicit algorithm to curating large, varied, and clean datasets, which indirectly influence the code. I will provide a number of examples of this ongoing transition, cover the advantages and challenges of the new stack, and outline multiple opportunities for new tooling.
To learn more: databricks.com/product/gettin...
About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: databricks.com/product/unifie...
Connect with us:
Website: databricks.com
Facebook: / databricksinc
Twitter: / databricks
LinkedIn: / databricks
Instagram: / databricksinc Databricks is proud to announce that Gartner has named us a Leader in both the 2021 Magic Quadrant for Cloud Database Management Systems and the 2021 Magic Quadrant for Data Science and Machine Learning Platforms. Download the reports here. databricks.com/databricks-nam...

Пікірлер: 59
@ExcessumGaming
@ExcessumGaming 5 жыл бұрын
This should be seen by a lot more people!
@godspeed133
@godspeed133 3 жыл бұрын
How had this got under 20,000 views and under like 10 comments?? This guy is literally one of the leaders in the industry, he's right at the bleeding edge. That's utterly diabolical
@paulcassidy4559
@paulcassidy4559 3 жыл бұрын
Compare with Ben Goertzel's discussions with Charles Hoskinson where they're literally openly sharing their ideas and thoughts around the bleeding edge in decentralized tech and decentralized AI applications with the entire world. Also getting low 5 digits view counts. Oh well. Just means people like you and I are ahead of the curve!
@wbrandler
@wbrandler 3 жыл бұрын
Only a small number can be at the bleeding edge by definition!
@TA-ob6kz
@TA-ob6kz 3 жыл бұрын
Tools for Datasets writing software.
@ihateyourusernames
@ihateyourusernames 3 жыл бұрын
​@@paulcassidy4559 agreed :) most of us are here due to a shared sense of foresight. good luck to all of you on your projects :)
@compact6264
@compact6264 3 жыл бұрын
I did labeling job in the past and this guy described it perfectly, I feel better now for sucking at labeling :)
@tipoomaster
@tipoomaster 3 жыл бұрын
"Elon was like, vision can see raindrops...And now it's my problem" I feel like that's a fair snapshot at life in the FSD team lol
@DogaOzgon
@DogaOzgon 3 жыл бұрын
This was very insightful, definitely more people needs to see this!
@chrissanchez1891
@chrissanchez1891 3 жыл бұрын
Ahead of his time. Very thought provoking.
@Tony-cj6jy
@Tony-cj6jy 5 жыл бұрын
6:45 This is a nice proof of concept to me and explains why they could make progress much faster. First is an example of a hybrid heuristics approach (used by Waymo and others) and a pure deep learning approach (used by Tesla). The second one is superior in recognizing parked cars but requires massive amounts of real-world data. Only Tesla has that real-world data, because it's system is installed in about half a million cars by now. All the other "experts" thaught it was too dangerous to test self-driving software in cars being driven by consumers. They also thaught it was necessary to have Lidar, still too expensive to apply on a massive scale.
@SyeamTechDemon
@SyeamTechDemon 4 жыл бұрын
Actually, Tesla does use heuristics for certain things, not everything is done through deep and machine learning approaches. Elon Musk says this himself during Autonomy Day, that sometimes it's quite stupid to have a neural network for simple things, you might as well just use heuristics.
@rafaellino7168
@rafaellino7168 3 жыл бұрын
@@SyeamTechDemon makes you wonder about this NN for running the wipers then... maybe he thought it would help the NN see through rain if it could recognize it
@MuscleTeamOfficial
@MuscleTeamOfficial Жыл бұрын
Karpathy, you will always be a legend in my book.
@eduardorivara6129
@eduardorivara6129 3 жыл бұрын
Great talk!
@GoogleUser-ee8ro
@GoogleUser-ee8ro 3 жыл бұрын
This whole idea of software 2.0 where we human define the scope/region/boundary of the problem and let the program/machine/AI find optimization(s) for the problem reminds one of how we approach PDE problems.
@Noiseofdrums
@Noiseofdrums 3 жыл бұрын
Andrej Legend Karpathy
@Level6
@Level6 4 жыл бұрын
Good!!
@liangcity
@liangcity 5 жыл бұрын
very good video
@IvanGarcia-cx5jm
@IvanGarcia-cx5jm 3 жыл бұрын
There is a book named "Working Effectively with Legacy Code" for procedural/algorithmic/classical (software 1.0) code. It would be nice to see a book like this for statistical based/deep learning networks code (software 2.0). I guess since all of this is relatively new there is no deep learning legacy code yet (although companies like Google might have a bunch of it already). But it would be nice to see how maintenance and refactoring would be done in not only in legacy code for NN, but also legacy network parameters that have been calibrated with potentially years of training (if that exists). I am not sure if this really matters for software 2.0.
@wimveninga1714
@wimveninga1714 Жыл бұрын
Wouldn't the "maintenance part be" on the data, the quality of the tagging and the process of continuously improving the model based on new data?
@IvanGarcia-cx5jm
@IvanGarcia-cx5jm Жыл бұрын
@@wimveninga1714 Makes sense. The maintenance is on the data to train the model. But it seems to me that the weights of the network are unmaintainable. The only things a DL engineer modifies are the model and training data, hence, that is the only thing to maintain. Maintaining weights are like maintaining binaries, it does not make sense.
@jimmyt_1988
@jimmyt_1988 3 жыл бұрын
So well said. Visionary. Let's do this guys!
@Vale46NL
@Vale46NL 2 жыл бұрын
who is this guy man
@quosswimblik4489
@quosswimblik4489 3 жыл бұрын
Could AI recode the net for more slower rural users. Could a company have a lite site maker. If I wanted to get rich I would make an eternal search engine that was always learning about what it less understood to autogenerate labeled datasets.
@scottdavidcraig
@scottdavidcraig Жыл бұрын
🎯
@colinmaharaj
@colinmaharaj 3 жыл бұрын
I wish I could work there. But I dont even have a 1st degree
@Trashbag-Sounds
@Trashbag-Sounds 3 жыл бұрын
Why don’t you just use the user wipe input to train? If there is ketchup on the windscreen, the driver will use the wiper stick to wipe. To you can detect this as an “automated system failure” and retrain on this. End to end. Isn’t the pretty straight forward? Even for gals positives. If it gets to excited in a tunnel, the driver should be able to turn it down or off. So just use this as an intervention. You do this with autopilot. Why not with the wipers
@SuperChooser123
@SuperChooser123 3 жыл бұрын
Well they probably have it a little harder than just throwing a bunch of pictures of a tunnel rear view to the trainer
@Splish_Splash
@Splish_Splash Жыл бұрын
it doesn't solve the problem with tunnels
@Trashbag-Sounds
@Trashbag-Sounds Жыл бұрын
@@Splish_Splash of cause. The user disables the wiper and this data is collected. So this way tunnel data gets send to Tesla in order to train on these .. automatically
@Splish_Splash
@Splish_Splash Жыл бұрын
@@Trashbag-Sounds but you will need huge amount of time to collect this data with tunnels and wipers, and they don't push new release of autopilot if it doesn't properly work
@Trashbag-Sounds
@Trashbag-Sounds Жыл бұрын
@@Splish_Splash well I assumed they already had a version on the road that wasn’t working properly
@TA-ob6kz
@TA-ob6kz 3 жыл бұрын
Tools for Datasets writing software.
@openroomxyz
@openroomxyz 3 жыл бұрын
Are people who annotate your data paid decently?
@rafaellino7168
@rafaellino7168 3 жыл бұрын
hahahaha
@frankhandroid1252
@frankhandroid1252 2 жыл бұрын
hmm. sounds like "we have no idea why it works or when it will not, but we sell it anyway"... Even a wiper can become safety relevant if it starts wiping like mad at the wrong moment, distracting the driver (as long as there is one).
@Splish_Splash
@Splish_Splash Жыл бұрын
there's a thing naming "testing" and "shadow mode"
@kythrathesuntamer9715
@kythrathesuntamer9715 3 жыл бұрын
I want to understand what he means by programming 2.0 so i'm here.
@frankhandroid1252
@frankhandroid1252 2 жыл бұрын
Also shouldn't dub it "2.0" it "1.0" is not replaced...
@blackthirt33n
@blackthirt33n 2 жыл бұрын
Andrej karpathy
@andzrit
@andzrit Жыл бұрын
Whole issue is: the presented notion of Software 2.0 collapses on it self. Suddenly the data is the entire program? No. The data is still the data. Software 2.0 is essentially Data 2.0 Software 2.0 would incorporate Data 2.0 and Networks 2.0 and so on Reactive, layered programming style with a functional category approach probably makes most sense for the Programming 2.0 of Software 2.0
@Splish_Splash
@Splish_Splash Жыл бұрын
functional programming bruh
@Vale46NL
@Vale46NL 2 жыл бұрын
A problem does not exist
@Vale46NL
@Vale46NL 2 жыл бұрын
To solve aproblem you must see no problem
@Vale46NL
@Vale46NL 2 жыл бұрын
A problem does not exist
@thomasharrison2741
@thomasharrison2741 3 жыл бұрын
08:42 - You know this is literally the secret to how God runs the Universe, right? lots of data, few rules. the Universe figures it out.
@maqboolfida786
@maqboolfida786 3 жыл бұрын
Why not use AI to label data? Won't that be more accurate?
@rafaellino7168
@rafaellino7168 3 жыл бұрын
you train the AI to label the data by having humans do it first. AIs have no idea of what they're looking at - they're (mostly) just smarter data processing engines using Big Data human input
@Vale46NL
@Vale46NL 2 жыл бұрын
wrong!
@Vale46NL
@Vale46NL 2 жыл бұрын
program space consisting problems is a waste of space
@Vale46NL
@Vale46NL 2 жыл бұрын
this is all wasted time thinking about
@Vale46NL
@Vale46NL 2 жыл бұрын
he doesnt say anything actually. just like stupid ai
@444haluk
@444haluk 3 жыл бұрын
Intelligence isn't about struggling to differentiate between the droplets a few cm away and the sun a few light minutes away. The end doesn't justify the means. Your company's entropy reduction methods suck. Big time.
@444haluk
@444haluk 3 жыл бұрын
This dude be like: do not try to explain it, it works... most of the time. What a naive problem solver.
Heroes of Deep Learning: Andrew Ng interviews Andrej Karpathy
15:11
Preserve Knowledge
Рет қаралды 176 М.
ISSEI funny story😂😂😂Strange World | Magic Lips💋
00:36
ISSEI / いっせい
Рет қаралды 110 МЛН
маленький брат прыгает в бассейн
00:15
GL Show Russian
Рет қаралды 3,7 МЛН
Кәріс тіріма өзі ?  | Synyptas 3 | 8 серия
24:47
kak budto
Рет қаралды 1,6 МЛН
Andrej Karpathy - AI for Full-Self Driving at Tesla
30:09
Matroid
Рет қаралды 358 М.
[1hr Talk] Intro to Large Language Models
59:48
Andrej Karpathy
Рет қаралды 1,8 МЛН
What is RAG? (Retrieval Augmented Generation)
11:37
Don Woodlock
Рет қаралды 65 М.
How I'd Learn to be a Data Analyst in 2024
13:17
Luke Barousse
Рет қаралды 208 М.
Tesla AI Andrej Karpathy on Scalability in Autonomous Driving
27:39
Preserve Knowledge
Рет қаралды 8 М.
Torvalds Speaks: Impact of Artificial Intelligence on Programming
5:05
Mastery Learning
Рет қаралды 679 М.
deeplearning.ai's Heroes of Deep Learning: Andrej Karpathy
15:11
DeepLearningAI
Рет қаралды 16 М.
[CVPR'21 WAD] Keynote - Andrej Karpathy, Tesla
31:32
WAD at CVPR
Рет қаралды 44 М.
Как открыть дверь в Jaecoo J8? Удобно?🤔😊
0:27
Суворкин Сергей
Рет қаралды 1,2 МЛН
Fiber kablo
0:15
Elektrik-Elektronik
Рет қаралды 2,2 МЛН
Купите ЭТОТ БЮДЖЕТНИК вместо флагманов от Samsung, Xiaomi и Apple!
13:03
Thebox - о технике и гаджетах
Рет қаралды 40 М.
Главная проблема iPad Pro M4 OLED!
13:04
THE ROCO
Рет қаралды 47 М.
Клавиатура vs геймпад vs руль
0:47
Balance
Рет қаралды 979 М.