How Memory Usage Slows Down Your Programs

  Рет қаралды 19,943

Jacob Sorber

Jacob Sorber

Күн бұрын

Пікірлер: 68
@reptilicusrex4748
@reptilicusrex4748 3 жыл бұрын
This video was very informative and really gets the point across using a well-chosen simple example. Thanks for the effort.
@hidden8990
@hidden8990 3 жыл бұрын
I like this one a lot. It moves the direction of data oriented design. I don’t know what your experience is with that, but I think an introduction to that vs object oriented, and what use cases would make one more efficient than the other would could be a good one-off video.
@rogo7330
@rogo7330 3 жыл бұрын
I thinking about how to make DOP and OOP help each other. DOT is more about how to actually place data in memory, but it lacks readability in complex things. OOP on the other hand can be done in C with structs and pointers to real objects, but i curious how much it will cost if i will use OOP-ish structs in runtime.
@badwolf8112
@badwolf8112 3 жыл бұрын
i feel like OOP doesnt necessarily stop you from optimizing. you can use OOP in many ways, and C++'s creator talks a lot about performance and reliability for systems, so i doubt using OOP makes things worse (many of C++'s abstractions have no overhead)
@sanderbos4243
@sanderbos4243 Жыл бұрын
@@badwolf8112 Using OOP doesn't stop you from optimizing, but it's just that things like for example virtual function calls can both bring the performance down and make it much harder to debug. Think of a line, where OOP is on the left and Data Oriented Design is on the right. The left is generalized to be very extensible, but can be slower and harder to debug. The right is more specialized to a specific task using knowledge of what you will *actually* use it for in practice, and can be faster and easier to debug. This principle of flexibility vs excellence also holds for a lot of other things, like AI: you either have an AI that is meh at a ton of different tasks, or have a state-of-the-art chess AI that excels at the one thing it was made for. If you try to design a real-life factory that is able to create *any* type of car, then your factory will be incredibly complicated and slow. The clue is that the more you decide to constrain what your program is capable of doing, the more performant and simpler you will be able to write it. OOP applies less constraints, which comes at several costs. This is an overgeneralization, because pretty much every programming technique has its place, but look up "object oriented programming vs data oriented design" for countless articles and videos if you want to know more.
@RobBCactive
@RobBCactive Жыл бұрын
OOP is for long term adaptability & encapsulation, when you're solving real problems correctness is key. Requirements change with time and you may need to add features and data types later, which were not even considered when the project began without updating 100's of files. A data oriented design works well for things like games, the data tables might even be extracted and processed from build edit tools that use objects to produce the level data that the runtime uses for high efficiency max fps with specificity not flexibility.
@diconicabastion5790
@diconicabastion5790 3 жыл бұрын
It can make a hell of a difference. I created a program a while back that created roughly 40,000 rooms connected together in a maze. Initially it could take hours. By the time I was done it was under 1 second. That was on a single thread.
@internetsfinest8839
@internetsfinest8839 3 жыл бұрын
It’s amazing how fast you can make a program. I was writing a program that will seek out a certain line, but it was extremely slow. The solution was to put the entire file into an array, and the program went from a couple minutes to a single second.
@JacobSorber
@JacobSorber 3 жыл бұрын
Yes indeed. The more you understand how the underlying layers work (and don't work), the more often you can see opportunities to improve things.
@sparten1527
@sparten1527 3 жыл бұрын
I'm taking an advanced computer architecture course which deals with optimizing memory access times in the cache. This is really interesting how it ties in with the knowledge I've learned in class!
@emreozdemir9358
@emreozdemir9358 3 жыл бұрын
So someone finally point out the importance of cahce with an excellent example. Thank you so much sir.
@JacobSorber
@JacobSorber 3 жыл бұрын
You are welcome.
@GAMarine137
@GAMarine137 4 ай бұрын
Appreciate these detailed videos
@billowen3285
@billowen3285 3 жыл бұрын
I love your channel Jacob!
@raghavsrivastava2910
@raghavsrivastava2910 3 жыл бұрын
Yes this professor is amazing
@JacobSorber
@JacobSorber 3 жыл бұрын
Thanks, Bill.
@cernejr
@cernejr 3 жыл бұрын
Helpful reminder. And good to see actual numbers to get a feel for how much the cache actually matters.
@realdragon
@realdragon 2 жыл бұрын
It makes sense, I would never think of that
@pathayes2224
@pathayes2224 3 жыл бұрын
As always, an excellent presentation. Caching is key to performance. I once developed with a processor which had this option switched off inadvertently,. As a result it crawled along, despite its fast clock speed. Also, file IO effects speed, because it is slow. This is a big area
@debanjanbarman7212
@debanjanbarman7212 3 жыл бұрын
Sir, please make a series on Linux kernel development.
@umpoucosobreconhecimentos
@umpoucosobreconhecimentos 3 жыл бұрын
Thank you for this very useful explanation about how memory locality can affect program performance. Amazing explanation
@nunorodrigues5628
@nunorodrigues5628 3 жыл бұрын
OOOOOOH i get it! And yes, please do a video on C++ and vectors regarding this issue, i would love to learn about it.
@matthewkott8863
@matthewkott8863 3 жыл бұрын
Very interesting. Intuitive, once (if) you think about it. By the way, Jacob, have you done a video on Big O already? That's a topic that really baffles me, as I simply have a hard time recognising the patterns that determine the most significant terms.
@JacobSorber
@JacobSorber 3 жыл бұрын
I haven't. I'll add it to the list and see what I can do.
@iltrovatoremanrico
@iltrovatoremanrico 3 жыл бұрын
Very informative, thank you!
@foadsf
@foadsf 3 жыл бұрын
I don't do C for work. I just watch your videos as meditation. you are amazing! 🖖
@farikunaziz6504
@farikunaziz6504 3 жыл бұрын
Can u give overview about rust language
@badwolf8112
@badwolf8112 3 жыл бұрын
(how) do we know that the faster way to access would be faster on all computers? can you make a video about profiling, and in particular, how do we know profiling on one computers makes a difference across all computers?
@JacobSorber
@JacobSorber 3 жыл бұрын
Good topic idea. Thanks! In C, arrays are laid out in a specific way. So, this example is going to see pretty consistent performance across nearly all machines, since the same elements will be close to each other each time. But, yeah, there definitely are situations where performance will be more machine specific. I'll see what I can do.
@lawrencedoliveiro9104
@lawrencedoliveiro9104 2 жыл бұрын
I once took some MATLAB code written by a client, made a series of tweaks to it, and showed it running a lot faster. I did all my testing on a Linux box. Unfortunately, the client needed to deploy it on a Windows installation. And on that machine, all the versions ran at pretty much the same speed. ∗Sigh∗ ...
@dmitripogosian5084
@dmitripogosian5084 Жыл бұрын
Back n the 80-s, anybody who had 2 hours of C instruction, would never write the second loop. It was just an common knowledge that C is stores 2D arrays in column major order, while Fortran is row major, so you iterate in C on the second index inside the loops, and in Fortran on the first
@happyTonakai
@happyTonakai 2 жыл бұрын
Good video, number 1000 thumb up!
@skilz8098
@skilz8098 3 жыл бұрын
It's not just memory, but any kind of system resource such as file access, threads, etc that can affect your applications performance...
@sman3424
@sman3424 3 жыл бұрын
Does spatial locality still hold with malloced memory since virtual memory could only be contiguous in the address space, but not physically?
@8292-d6n
@8292-d6n 3 жыл бұрын
Yes it does. Your cache line is usually 64 Bytes long (could differ especially on embedded systems) meaning every time you access somewhere in memory it loads 64B into the cache. If your malloc'ed data is larger than you will have a delay each time you enter a new section that wasn't loaded into cache. Also note that whatever you load into cache will be removed as soon as new data is coming in but this behavior depends on your cache sizes and strategy.
@jenidu9642
@jenidu9642 2 жыл бұрын
Why don't you time it with clock()? That way you can make sure the initialization is measured
@TheVertical92
@TheVertical92 3 жыл бұрын
This always segfaults on my machine if my ROWS/COLS are too high. Seems like max stack allocation makes trouble🤷‍♂ Edit: Oh i see. When i declare the array globally it works, if i declare it within main() it segfaults.
@benjaminshinar9509
@benjaminshinar9509 3 жыл бұрын
could you show the hit-rate of the two loops with cache-grind?
@JacobSorber
@JacobSorber 3 жыл бұрын
Probably. I'll take a look.
@tauqeerakhtar2601
@tauqeerakhtar2601 3 жыл бұрын
This is all about spatial locality. It is implementation based. May be the opposite will happen if implementation of memory is opposite.
@raghavsrivastava2910
@raghavsrivastava2910 3 жыл бұрын
Great video.
@dwaynestgeorge2558
@dwaynestgeorge2558 2 жыл бұрын
Thanks
@matteolacki4533
@matteolacki4533 3 жыл бұрын
Just out of curiosity, how fast was it without allocating anything?
@CR3271
@CR3271 Жыл бұрын
8:45 I know this wasn't the main point, but I just have to say stl vector -- at least the MS VS implementation -- is a scam. I regularly get 4x speed out of my home-spun dynamic array template. Maybe you could give the boys at MS a lesson in memory access 😂
@LoesserOf2Evils
@LoesserOf2Evils 3 жыл бұрын
Would using dynamic allocation address this issue? I fear -- I mean, 'think' -- not because of stopping to allocate a new cell in twoarray every so often. I fear -- I mean, 'think' -- dynamic allocation would increase the execution time partly because of the access time.
@JacobSorber
@JacobSorber 3 жыл бұрын
Allocating the space would take extra time, but once you have the memory allocated, I don't know that it would be any slower. Maybe a little if having the accesses further apart affects cache use, but I would expect them to be nearly the same (except for the extra cost of the malloc/free calls). I guess I'll have to try it out.
@LoesserOf2Evils
@LoesserOf2Evils 3 жыл бұрын
Thank you, Dr. @@JacobSorber. In summary, execution would slow because of the allocation time for each cell but not because of memory access time, right?
@ВладДок-д8щ
@ВладДок-д8щ 3 жыл бұрын
It would be slower due to additional inderection. And allocator could give you any address so locality goes out of the window. That's a serious problem in Node based data structures like trees and such
@leokiller123able
@leokiller123able 3 жыл бұрын
So all global and static variables are stored on the cache?
@JacobSorber
@JacobSorber 3 жыл бұрын
Caching happens at a lower level-at the load/store level. It doesn't know anything about whether the variable is global, local, static...whatever. If your program accesses memory it be cached.
@leokiller123able
@leokiller123able 3 жыл бұрын
@@JacobSorber so when you store memory in heap using malloc the memory also gets cached? I quite don't understand when memory is stored in cache or not
@JacobSorber
@JacobSorber 3 жыл бұрын
@@leokiller123able It does when you use it.
@leokiller123able
@leokiller123able 3 жыл бұрын
@@JacobSorber okay so then if all memory gets cached when you use it, why do people say that accessing memory on heap is slower that on stack? If it goes in cache anyway shouldn't it be the same speed resuired to access it?
@JacobSorber
@JacobSorber 3 жыл бұрын
@@leokiller123able This comes down to two things. First, there's the cost of allocating and freeing memory. That adds overhead. Second, the heap can get a bit fragmented with bits of data scattered all over the place. The call stack is a simple linear data structure, and all of your local data for a function call will all be colocated in its stack frame. So, you often get better locality (hence better cache performance).
@jwbowen
@jwbowen 3 жыл бұрын
Now do the same thing in Fortran and see how the timings shake out
@LinucNerd
@LinucNerd 3 жыл бұрын
'Mike Acton' made a presentation here on youtube about Data-Oriented Design (kzbin.info/www/bejne/qImTeqeMerudfsU), and he touched a bit on the mentality that "The compiler can fix it"... Although the compiler can do a lot, there are a lot of things it can't do. The compiler is not some magic software that knows everything, although built by intelligent people, there are limitations. There also seems to be a growing trend nowadays against OOP and C++, and I'm not entirely sure why... I don't know C++, it's too hard for me, but I wonder why it's become so trendy to hate on it.
@nngnnadas
@nngnnadas 3 жыл бұрын
cache invalidation
@47Mortuus
@47Mortuus 3 жыл бұрын
That's it? Matrix traversal? There's so much more to this... Let's just hint at the "Von Neumann Bottleneck", which suggests that ultimately, any program is limited by memory bandwidth when it comes to performance.
@ohwow2074
@ohwow2074 3 жыл бұрын
His second for loop probably increased cache miss rate to %100 lol. That's why it was slow.
@embeddedbastler6406
@embeddedbastler6406 3 жыл бұрын
As we are talking about cache locality. Wouldn't a video about data-oriented programming be very insteresting?
@JacobSorber
@JacobSorber 3 жыл бұрын
It would.
@irwainnornossa4605
@irwainnornossa4605 2 жыл бұрын
Openning braces belong to a new line!
@HimanshuSharma-sd5gk
@HimanshuSharma-sd5gk 3 жыл бұрын
New video idea. arbitrary precision arithmetic in c.
@HydeFromT70s
@HydeFromT70s 3 жыл бұрын
I appreciate the effort but this video shows HOW to use the arrow operator and not WHEN to use it...
@adityavsx
@adityavsx 3 жыл бұрын
likeeee uuu smmm
Memory Allocator Errors and XMalloc
9:48
Jacob Sorber
Рет қаралды 10 М.
Pulling Back the Curtain on the Heap
21:38
Jacob Sorber
Рет қаралды 38 М.
The Best Band 😅 #toshleh #viralshort
00:11
Toshleh
Рет қаралды 22 МЛН
1% vs 100% #beatbox #tiktok
01:10
BeatboxJCOP
Рет қаралды 67 МЛН
How to Check Your Pointers at Runtime
14:12
Jacob Sorber
Рет қаралды 32 М.
Just How Bad is Mixing Memory?
10:02
Linus Tech Tips
Рет қаралды 3,9 МЛН
What is an object pool, and how to create one in C?
23:14
Jacob Sorber
Рет қаралды 16 М.
How to make memory read-only in your C programs.
12:57
Jacob Sorber
Рет қаралды 21 М.
7 Outside The Box Puzzles
12:16
MindYourDecisions
Рет қаралды 258 М.
When do I use a union in C or C++, instead of a struct?
11:18
Jacob Sorber
Рет қаралды 71 М.
But, what is Virtual Memory?
20:11
Tech With Nikola
Рет қаралды 338 М.
A const int is not a constant.
9:16
Jacob Sorber
Рет қаралды 70 М.