Thank you so much for responding to my request for making a CUDA programming. I have donated 0.1 BTC to your account as a way to thank you. My professor has done so many hours trying to explain CUDA and none of my classmates really understood. I just can not believe that you do all this for free and that is why me and my classmates have decided to collect some funds to donate to you. Thanks for all that you do and please keep going.
@AhmadBazzi2 жыл бұрын
Thank you for the donation, it really means a lot !
@aishahoura26192 жыл бұрын
@@AhmadBazzi No thank you !
@mdrubelahmed64342 жыл бұрын
Thank you so much for responding to my request for making a CUDA programming.
@btspower38442 жыл бұрын
Wow amazing
@btspower38442 жыл бұрын
Wow amazing
@denizart22552 жыл бұрын
You just opened my eyes to parallel programming. Thanks for the quick overview.
@mdrubelahmed64342 жыл бұрын
Too hard to find high -quality content like this these days. Thank you so much
@nilsu19412 жыл бұрын
12:36 This guy is a God !
@sportscorridor40642 жыл бұрын
very nice
@mdrubelahmed64342 жыл бұрын
So beautiful
@aoungamingyt31602 жыл бұрын
Thank you so much. Probably the best introdution to CUDA with Python. The example you use, while very basic, touches on usage of blocks, which is usually omitted in other introduction-level tutorials. Great stuff! Hope you return with some more videos. I have subscribed!
@danielsalazar66372 жыл бұрын
Excelent
@sksk-lo8kc2 жыл бұрын
That was very well explained. I have only have taken one course, and you made it clearer than my professor or fellow students ever did.
@Drex.Yt12 жыл бұрын
Too hard to find high-quality content like this these days. Thank you so much
@leonelaguilera90592 жыл бұрын
this was such an excellent video
@tatldunyas24712 жыл бұрын
Just did my research and this guy is at one of the most prestigious universities in the world ! No wonder why his lectures come up neat !
@apogeetheboss99992 жыл бұрын
as a data scientist +2 years of experience, i ALWAYS learn something new with your content! please nich, never stop doing this things, and also, never cut your smile in your face, even if your are having bugs!!
@pantherofficial50592 жыл бұрын
I have been looking into gpu programming using numba and python for a while, this seems to be the best tutorial I was able to find so far.. . thank you
@excolabirbuyuyecek94382 жыл бұрын
LOL. Loved the graphic at 6:23! Brought tears to my eyes.
@basslvers45012 жыл бұрын
wanted to comment that the information in this presentation is very well structured and the flow is excellent.
@erenbasak76942 жыл бұрын
Hey this is super useful! I elected High Performance Computing and Microprocessors and Embedded Systems modules for my degree, and this channel has become my go-to guide.
@muradhesenov52452 жыл бұрын
the essence of Deep learning in a few lines of code... awesome
@RAVIShankar-bm4ou2 жыл бұрын
Thank you so much for this series! It's so clear and easy to follow
@adeeshaamabidu96162 жыл бұрын
Love the channel Nicholas, have recently graduated from an NLP Master's degree and seeing you explain stuff in a simpler way and your coding challenges is really helping me connect with the material I've learned! Keep it up and I'll keep watching!
@-oof10162 жыл бұрын
Ahmad , thanks for taking time to create these videos. It is unfortunate that people view your videos and then feel inspired to complain about a free gift. Folks could just keep it moving or add helpful insights.
@todotasks76452 жыл бұрын
Too hard to find high-quality content like this these days. ⚡
@tajaochrisciamae41212 жыл бұрын
what a passionate tutorial! I wish you were my professor for my parallel programming course. Well done!
@LouieVon2 жыл бұрын
This is the best introduction to CUDA I've seen, thanks a lot !
@user-sh6jw1ce3m2 жыл бұрын
#
@youtubemullim3192 жыл бұрын
I feel like Cuda has been demystified. Very glad I found your series.
@user-sh6jw1ce3m2 жыл бұрын
#
@astaadxofficials78132 жыл бұрын
I have no idea what kind of videos i am watching ... but i sure will learn
@rithusvlogtime33422 жыл бұрын
Fantastic tutorials on CUDA. You deserve more followers.
@user-sh6jw1ce3m2 жыл бұрын
Thanks for the comment... contact me for information and profitable investment strategies..⤴️
@teamchanel43842 жыл бұрын
I'm doing an internship in a research lab and I'll have to program some kernels to implement Blas primitives, this video really helps :)
@user-sh6jw1ce3m2 жыл бұрын
Thanks for the comment... contact me for information and profitable investment strategies..⤴️
@cavansirmahmudov2172 жыл бұрын
You saved me, i had to read the PointNet2 implementation for my BCS thesis. this made the job much easier!
@kelechijames55772 жыл бұрын
This is very helpful. Most people don't realize the overheads and code refactoring necessary to take advantages of the GPUs. I am going to refactor a simple MNIST training propgram I have which currently uses only Numpy. See if I can get meaningful improvements in training time.
@fatihcalidkan22542 жыл бұрын
holy shit, i was looking into this to speed up my mandelbrot-zooms and they are what you use as an example! This is a dream come true!
@SiTacuissem2 жыл бұрын
Interesting, but two remarks: Example 1: on my setup (3080Ti, i7-8700K, running in WSL2 with Ubuntu 22.04) vector multiplication runs actually *faster* on CPU (if you either use the vectorized formulation in MultiplyMyVectors with target "cpu" or, simply, a*b instead of the unnecessary for loop in the CPU code). IMO that is mostly due to the overhead of copying the data to the GPU memory. Example 2: to get a fair comparison, you should also use the JIT for FillArrayWithouGPU, decorating with @jit(target_backend="cpu"). Then, GPU array filling is still faster, but only by a factor of 2.
@Kvmizo2 жыл бұрын
This was by far one of the most enlightening videos you have put up on your channel. Thanks and keep up the good work!!
@mrfcbs12512 жыл бұрын
Oh Ahmad , your tutorials are incredible and inspiring....
@Fiekriekd2 жыл бұрын
and that's what I call a great tutorial. Thankyou sir. I wish you make more tutorials.
@user-sh6jw1ce3m2 жыл бұрын
Thanks for the comment... contact me for information and profitable investment strategies...⬆️
@mustafasamet27832 жыл бұрын
Ayyyy, so glad you like it @Patrick. For the last two weeks I've just been making videos on stuff I find hard or want to get my head around I figure it's not just me staring there at some of these concepts like huh?!? Thanks for checking it out!!
@lixaxel68152 жыл бұрын
Excellent example of vector addition of using for loop and using CUDA
@Animals-vi5wt2 жыл бұрын
Woah congrats @Ally 🎊 🎉 glad you’re enjoying the challenges, plenty more to come!!
@notachannel26012 жыл бұрын
It is effectively a very easy approach to harness the power of cuda in simple python scripts.
@turkceraplyrics67012 жыл бұрын
Hey Ahmad , I love watching your videos because of the way you tell the story. Great graphics mate. Love the reference to rocket man too... lol keep up the good work.
@merthanozer29642 жыл бұрын
Ohh, yes, Thank you, and the documentation at nvidia site about CUDA is highly professionally written. Thank you.
@TheTurksxayers2 жыл бұрын
Thank you for this great introduction to numba and more specifically numba+cuda.
@mipxello77492 жыл бұрын
Very well explained. The best CUDA explaination I have come across uptil now 😊😊. Keep up the spirits sir.👍👍
@user-sh6jw1ce3m2 жыл бұрын
Thanks for the comment... contact me for information and profitable investment strategies...⤴️
@yasincaferzade80692 жыл бұрын
Great video, I like this kind of video where you code some AI task counterclock, you teach us the concepts and show us the reality of implementing it👏
@emircoltu8752 жыл бұрын
Once you initialized lr to 0.0, I knew you were going to forget to change it lol. Love the challenges tho, keep doing them, I think it would be cool to see how you implement a neural network from scratch
@peloizol89472 жыл бұрын
I like how you did the website for documenting the video notes for reference later
@arifefe42642 жыл бұрын
KZbin questions are typically the best way to handle an issue or an issue request on Github on a project demo. I will do my best to answer when I have time.
@crystalannringor10252 жыл бұрын
Thank you so much for this video. It has helped me massively to prepare for my computer science exam.
@نٌے.نٌے.نٌےۦصہٰ̐كْٰٓاكہٰ̐ہٰ̐يي2 жыл бұрын
OHHHH MANNN, I thought about doing that but I was debating whether I'd hit the 15 minute deadline already. Good suggestion @Julian!
@mjmlangenihd87062 жыл бұрын
yes, you could do this by hand, which would be a great challenge in distributed computing to code by hand. Another option is to use a framework/platform like AWS Sagemaker to do distributed kmeans. Most organizations will do this.
@uniquevlogsbyadil2 жыл бұрын
This was oddly intense. Great job Nicholas! Even though you ran out of time, this video is still a win to me. 😉
@totallycz68192 жыл бұрын
You are a lifesaver @Spencer, will do it next time i'm on the streaming rig!
@prietjepruck Жыл бұрын
Thank you very much for this tutorial. I would love to have the code available because typing it in myself from the video is a bit hard especially with the atocomplete on all the time. Keep up the good work.
@beratcansamur15172 жыл бұрын
An insanely underrated series!!!
@user-sh6jw1ce3m2 жыл бұрын
Thanks for the comment... contact me for information and profitable investment strategies..⤴️
@pandamusic_tz2 жыл бұрын
The Knowledge of Ahmad knows no bounds.
@a.s.m.rashedchowdhury7842 жыл бұрын
opened my eyes to parallel programming
@user-sh6jw1ce3m2 жыл бұрын
Thanks for the comment... contact me for information and profitable investment strategies..⬆️
@bombosbikanal38582 жыл бұрын
Perfect Video! Saw was revealing to me to understand how it works. Thank you! I am a new subscriber of your channel. Regards from Buenos Aires, Argentina
@hilalkoskli62662 жыл бұрын
Wow It is really awesome! It is much better than a tutorial from university! Thanks!
@user-sh6jw1ce3m2 жыл бұрын
Thanks for the comment... contact me for information and profitable investment strategies...⬆️
@mendes.022 жыл бұрын
This is an academic example that shows the process of copying data to the GPU, doing a vectorized operation, then showing the results. Actually what makes sense on the GPU vs CPU is something I didn't cover, and am hoping other can figure out some cool ideas.
@tugrasolak78522 жыл бұрын
this is extremely helpful. you did an amazing job explaining the foundations
@user-sh6jw1ce3m2 жыл бұрын
Thanks for the comment... contact me for information and profitable investment strategies...⤴️
@enescakmak66992 жыл бұрын
It works on both AMD and NVIDIA. If you have CUDA code, you can convert it to HIP with their automated tool, there is very little CUDA specific that can't be just translated over.
@maher94226 ай бұрын
يعطيك العافية اخوي احمد بدي اطلب منك طلب بسيط انه تعمل نفس الكورس عربي عارف مش حيكون كثير المشاهدات. ولكن اخوانك يحتاجوك اكثر من الاجانب انا بفهم عليك بس فيه غيري بيحبوا المجال وبيحبوا يتعلموه بلغتهم. اذا انت ما تملك الوقت اسمحلي اترجم الفيديو واشرحه عندي بلايك من عندك على انك موافق.
@sachinram37832 жыл бұрын
Sir,make more detailed sessions on CUDA,your explanation is great
@likky22292 жыл бұрын
The video was very helpful for me. Many thanks to the author for developing his audience with interesting and useful content
@bodyprodaction97182 жыл бұрын
Well just built a new rig with a 980ti and a 4790k so I'm gonna put that to test. Thank you for your wonderful explanation :D
@theatlantisreport15952 жыл бұрын
This reminds me a lot of the computer tutorial tapes from the 90s
@yigitboran55652 жыл бұрын
That's mostly how it works. It's more like sorting the stones by its color and pattern and counting each variety. Using the CPU way, you would need to count each variety separately. If you have 100 different colors and patterns, that would take a long time to count (even if you could count extremely accurate and fast, similar to how the CPU makes up for it's lack of parallelism). The GPU way lets many people count them. Given 100 people (like the GPU), each person would count each variety at the same time.
@vanshd68842 жыл бұрын
PS. I really so moved for your stock price episode. thank you so sosososo much.
@imsoumyajitbag2 жыл бұрын
Awesome video !! It's preety cool to see such theoretical concepts coded and explained like this. Keep going Nich !!
@yusifhsnov18022 жыл бұрын
So stoked you liked it 🙏
@cemiltuna2 жыл бұрын
Thanks for the video, I found the first half and the wrap up really excellent.
@hgmalani212 жыл бұрын
It's very informative and a good intro to CUDA programming. Thanks very much!
@user-sh6jw1ce3m2 жыл бұрын
#
@gularif12 жыл бұрын
On the PC side Matrox was the first company to introduce GPU's . This was followed by ATI . NVidia came into the scene after the success of these 2 Canadian companies. Matrox's original 3D board was a 3 setboard with custom asics. I believe NVIDA actually acquired ATI. So yes, NVIDIA was not the first but they are the biggest in the space now. Matrox is still around but more involved in the industrial and nice markets.
@putraairdrop12602 жыл бұрын
It's great video programming sir,, hope the best for you
@mehmetak43492 жыл бұрын
What makes the CPU better than the GPU is that each core is clocked at a faster speed and has many built-in instructions like SSE, allowing data to be processed faster. This provides a tremendous benefit to programs that only run on 1 core. In rendering where multiple cores can be used, you would need the CPU to process pixels about 5+x faster to match the GPU's performance.
@pusher70512 жыл бұрын
glad to see you take it as a feedback and not as a hate comment
@agusexclusife25772 жыл бұрын
Can't wait to see Juan's better tutorial that he's definitely going to release :') lmao. Great video Ahmad .
@blackiselia80542 жыл бұрын
i need to say this: you are the gamechanger here!!
@weds82962 жыл бұрын
Thanks for making all these topics very approachable!
@ricardomilos58892 жыл бұрын
This was a great video to me, I have very limited C++ experience and was looking for an explanation of CUDA. Another video like this could easily have been 70-80% over my head. This one was only about 15% whoosh. And now I actually find C++ interesting again!
@denizugurbiltekin6222 жыл бұрын
Thank you so very much. This is the exact kind of material I was looking for on this very specific subject. Kudos.
@keremkipri94362 жыл бұрын
Thanks for the video, subscribed! A suggestion : this small change to your code would demonstrate a real-world gradient descent solution for linear regression with noisy data. E.g. :
@aysezuhre86062 жыл бұрын
Ahmad sorry for bother you, the problem was not installing Cuda Toolkit, srly I hate people who doesnt watch full video closely and ask stupid questions....and now I m one of them :D. Thx alot for this tutorial in 2 months i will try write my own GPU operator for my program, would be interting if this will be faster than CPU. (Btw using normal Visual code in python 3.10 env. on win 11, so far so good. (Altrough i have some code output delay problem when using openCV for some strange reason)
@aliyensagaltc31212 жыл бұрын
You are bloody watching a master at work xD
@TheGameboyTheDream1232 жыл бұрын
@nvidia I personally think the way you did the demonstration was perfectly sufficient. IMO, fancy graphics are unnecessary. Good job.
@coolboy_04592 жыл бұрын
I was needing this!!! Thanks a lot, Sir!!!!
@besttwitcher45692 жыл бұрын
YESSSS, right?! Glad you liked it Miguel!
@Voicemelod2 жыл бұрын
Amazing! I'm learning so much watching you code. Thank you for sharing.
@Muhannad_ALAZZO2 жыл бұрын
wold love to see a video on what are a few CUDA programming challenges
@kolaybreaworlds31782 жыл бұрын
Also, the CT5 simulator from 1981 may not count as being from the '70s or '60s, but from what I understand, the CT5 was capable of realtime, rasterized, 3D polygonal rendering and was $20 million at the time. It used gouraud shading, if memory serves. There were several other CT (continuous tone) simulators developed by E&S in the '70s that did something similar or of much lower capability than the CT5 of '81. There was also the Digistar planeteriums that date back to the early '80s, and the Picture System goes back to at least the early '80s. Might be vector or raster, not entirely sure myself, though.
@MTHHC2 жыл бұрын
Hey, thanks for explanation! Very well done 👍 I am downloading CUDA 💪
@mehmetplgx80252 жыл бұрын
Love your videos. Please don't stop!
@vipyt85502 жыл бұрын
It can be found in O(1). As far as I remember the formula is derived using LDU decomposition or Diagonalising a matrix, for matrix exponentiation.
@Cardexs2 жыл бұрын
This is really helpful for my computing. Thank you.
@yapmabekanka71512 жыл бұрын
This is amazing! Thank you for taking effort to make it!
@floricicalaurentiumarian1672 жыл бұрын
Best CUDA tutorial ever...can you please make a tutorial on opencl and MPI?
@user-sh6jw1ce3m2 жыл бұрын
Thanks for the comment... contact me for information and profitable investment strategies...⬆️
@bekirok45032 жыл бұрын
I tried to follow this on my Windows 10 machine. The function you call as at 7:16 is still create_fractal() and not mandel_kernel() so I don't see why it is faster. When I changed it to mandel_kernel(), it complained that I hat to provide a launch configuration, telling the gpu how many grids and blocks to create. I added it like so (First properly setting a grid and block variable): mandel_kernel[grid, block](-2.0, 1.0, -1.0, 1.0, image, 20). It then worked and really was nearly 100x faster than the jit version.
@aliarda97192 жыл бұрын
Thanks a million @Lakshman!! I try to keep it pretty tight so it’s a good challenge otherwise I know I’ll just talk for 22 minutes anyway😅
@alidoruk55882 жыл бұрын
This guy is so underrated.
@wintergacha53702 жыл бұрын
Very nice tutorial. I really liked it. It's brief, to the point and very clear. Thanks. Could you please make a video for the same example but in Linux?
@herseydenbirazck20512 жыл бұрын
HEYYYYY PHIL!! Long time no see, thanks a mil!!
@halilkuser84552 жыл бұрын
Sorry for bothering you, the problem was not installing Cuda Toolkit, srly I hate people who doesnt watch full video closely and ask stupid questions....and now I m one of them :D. Thx alot for this tutorial in 2 months i will try write my own GPU operator for my program, would be interting if this will be faster than CPU. (Btw using normal Visual code in python 3.10 env. on win 11, so far so good. (Altrough i have some code output delay problem when using openCV for some strange reason)
@dragnn14302 жыл бұрын
Dear Ahmad, you are 30 years old only doing post-doc ? I'm sorry but this to me sounds very underrated. Postdocs are not always well compensated for their work but spend a lot of time working and doign research. If i were you, i'd invest more time on my youtube channel, rather than doing something that does not compensate well.
@Марат-ъ1в5у2 жыл бұрын
Йдйж
@Марат-ъ1в5у2 жыл бұрын
----
@Марат-ъ1в5у2 жыл бұрын
-----1--11
@Марат-ъ1в5у2 жыл бұрын
Уу3уййфйфйфяфффЯффффыяяыыяфяфыффффффыыяяяЯш
@altairgeo74623 ай бұрын
@@Марат-ъ1в5у гений
@saplamacay15792 жыл бұрын
Ahmad , great video. You have a great way of explaining things and help a lot of people. IMO a lot of the critic you get - such as here is unfunded. By the way, I do not see any video postings by Juan???. I am trying to get my KZbin channel started and hope that in 10 years time I will be 1/10th as good as Ahmad 👍.