Stop using std::vector wrong

Рет қаралды 171,653

Күн бұрын

Пікірлер: 774

@TheCherno 4 ай бұрын

What do you want to see next? 👇 Don’t forget you can try everything Brilliant has to offer-free-for a full 30 days, visit brilliant.org/TheCherno . You’ll also get 20% off an annual premium subscription.

@hanspeterbestandig2054 4 ай бұрын

A comparison between std::vector and std::list ? 😉 …and then std:set, std::map and the differences of std::unordered_set, std::unorderd_map … 😏

@JohnDoe-sq5nv 4 ай бұрын

@@hanspeterbestandig2054 Instead of just a comparison I'd like to see a video of cases where he has personally preferred one over the other, like what problem did this data structure solve. I've rarely found myself using lists, but when I have they have been invaluable. Rarely see myself using maps over unordered, priority_queues over deques, or stacks over deques. And lots of my usage of various data structures are just out of habit, but I chose them to solve a specific problem they might not be the best solution for.

@gccore 4 ай бұрын

A video about Memory Orders. Did you use any lock-free structure in your game engine? Does it bring any performance to your engine?

@iso-c 4 ай бұрын

memory safety. shortcuts what you can take so that you can product faster out. how to make code easily readable.

@iso-c 4 ай бұрын

video about people who optimising their code about replasing list to vector and how much they use their time to it compared to time how much their app will run in this universum.

@RevolutionaryUsername 4 ай бұрын

Please don’t forget to add this to the C++ series playlist. A lot of beginners need to see this

@realishak 4 ай бұрын

He says literally the same things in the vector video in the playlist

@danielmilyutin9914 4 ай бұрын

... and some Rusty guys who see only disadvantages in C++ :)

@theairaccumulator7144 4 ай бұрын

@@danielmilyutin9914 rust has the same thing tho idk what they're seeing

@echo_the_developer 4 ай бұрын

@@danielmilyutin9914 Rusty guys, nice lol

@brod515 4 ай бұрын

I think we should stop recommending c++ as a beginner language in this day and age. it's fine if you are looking for a job in it but in general it's not really that good of a language (like honestly I've used it)

@ekaktusz 4 ай бұрын

Another thing I would add to this, is to always mark your move constructor noexcept if you want the vector to use it. In this case it didn't cause problems, since the reserved size, and no vector resize occured in this example. But if a resize did occur, the vector probably would use the copy constructor instead of move if it's not noexcept. So always mark your move constructor noexcept if you can.

@poopingnuts 3 ай бұрын

this really helped me. Thank you.

@boas_ 4 ай бұрын

To solve this issue, C++26 will add a std::inplace_vector, a "dynamically-resizable, fixed capacity, inplace contiguous array". It has the benefits of both an array and a vector, it's capacity is fixed but it's size is dynamic, and it's stored on the stack.

@Kaptime 4 ай бұрын

I can't tell if this is satire or not.

@futuremapper_ 4 ай бұрын

@@Kaptimeit’s not

@DynamicalisBlue 4 ай бұрын

Dunno why it took them so long, using std::array with a manual counter is kinda annoying. I do find a hybrid model more convenient though. A vector that has two allocators, one in-place with fixed capacity and then if that exceeds, it will fallback to heap allocated storage. Useful for when the array usually doesn't exceed a certain size but can.

@shadergz 4 ай бұрын

Guys! take a loot at boost::container::small_vector ahhahahhhhhhhhhhhh

@boas_ 4 ай бұрын

@@Kaptime Why would it be satire? I am sorry if I made any mistakes, this is just my interpretation of what's said on cpppreference

@isodoubIet 4 ай бұрын

The discussion in the second part is wrong in the sense that the only reason you got all those copies was because your instrumentation code forced them to be there -- overload resolution will prefer the manually-added copy constructor over the compiler-generated move constructor. Had you _not_ written the copy constructor, a move would've happened instead. That is, it's not true that you need to supply a move constructor yourself. In the vast majority of cases the compiler will write one for you and it'll usually be correct, particularly if what you have are just dumb structs (even if they contain more complicated types like vector).

@isodoubIet 4 ай бұрын

As for whether one should still use push_back, IMO yes. It's true that emplace_back subsumes the same functionality so in principle there's no loss of expressivity if you just use emplace_back everywhere. However, 1. using push_back signals intent and more importantly 2. it's not a template, so error messages will happen at the call site instead of deep in the standard library in xmemory or some other implementation-specified header. push_back should also compiler faster and lead to a smaller binary, for the same reason.

@ХузинТимур 4 ай бұрын

Exactly this. There is no need to optimize copies of integer sized PODs (and even 16 integers copying is still OK).

@IntrinsicGameStudio 4 ай бұрын

@@isodoubIet Was looking for this comment ^_^ correct!

@aarong2374 4 ай бұрын

Does every compiler generate the move constructor?

@isodoubIet 4 ай бұрын

@@aarong2374 Yes, it's required by the standard. If it doesn't it's a bug.

@izikpfirrmann8775 4 ай бұрын

You also could have mentioned std::span, which is a meant as a view into a contiguous buffer (like std::vector/std::array) similar to std::string_view is a non-owning view into a std::string (or any contiguous buffer of char)

@coarse_snad 4 ай бұрын

As someone who works with Rust a lot, I'm sad to see std::span mentioned so rarely. We at least have people using std::string_view nowadays, but it's unfortunate that many people don't know about similar generalized concepts. In fairness though, I have colleagues who take arguments as &Vec in Rust and don't think to just change it to &[T], so i suppose this problem is language agnostic.

@carlpittenger 4 ай бұрын

@@coarse_snadthat &Vec issue should be pretty easy to detect using clippy if not a compiler warning

@ensuretime 4 ай бұрын

@@coarse_snad I ended up implementing my own span, but I call it *view* because it's more intuitive, so I have a list of defined types, such as: view view view ... They derive from view and implement their own operations, such as arithmetic operations for integers and string operations for char/wchar, etc. span really is a class that allows you to write very concise code.

@samuraijosh1595 3 ай бұрын

@@ensuretime Haskell which is where this concept was inspired from orginally called it view too, I thin

@ABaumstumpf 4 ай бұрын

6:08 - that is categorically wrong. The cost of using heap-allocation is the actual allocation. Once it is allocated there is no difference anymore. 9:57 - compile that with a not-ancient compiler and optimisation enabled: The result is most likely 0 allocations - the compiler is allowed to remove those. 16:35 - emplace_back would also be 0 allocations - that is mandated by the language. 19:05 - the reason it does not have a move-constructor is cause you disabled it by giving it a user-declared copy-constructor. had you not done that your class would be a simple aggregate-type, those operations would all be compiler-generated (with some other nice benefits) and you'd not see copies/moves either. With vector you only want to use reserve if you either know the exact number of elements already, or you have measured that there is a performance-problem and you have also measured that you can get a good enough heuristic that your preallocation actually is significantly faster. If you dont know then you can very easily end up with nearly the same number of allocations but a lot higher re-allocation and more memory-traffic.

@almightysapling 4 ай бұрын

Yeah, I was almost willing to forgive his giant "stack is faster than the heap" text hoping he would elaborate. Then he elaborated wrongly. I can see people duplicating values all over the stack just to avoid using the heap as a result of this advice.

@TheOnlyJura 4 ай бұрын

Of course it will be 0 allocations for a simple program. But for a more complex one? I wouldn't be so sure about that.

@redcrafterlppa303 4 ай бұрын

I usually share his opinion but coming from ccp weekly knowledge he is nit picking over stuff the compiler already does in most scenarios. It's often times better to write expressive code than performant code. Because they might work the same way after optimizations.

@justinzhao9831 4 ай бұрын

I think access heap is still slightly slower due to indirection. You first need to read the address of the heap block from the pointer variable.

@ABaumstumpf 4 ай бұрын

@@justinzhao9831 "You first need to read the address of the heap block from the pointer variable." That is the same with the stack.

@literallynull 4 ай бұрын

Sonic pro tip: preallocate memory beforehand

@bestopinion9257 4 ай бұрын

If you know how much memory you need, you do not need a vector.

@RagePower2000 4 ай бұрын

@@bestopinion9257 not necessarily, there’s a lot of cases where you know the minimum size but you still want the capability to expand the buffer without seg faulting.

@bestopinion9257 4 ай бұрын

@@RagePower2000 That's a contradiction. It is either fixed size or unknown expandable.

@musicismylife6172 4 ай бұрын

@@bestopinion9257 No it's not a contradiction. In some scenarios you know beforehand that you will be dealing with data in some range(between 100 and 10000 elements for example) that goes to thousands It would be good for optimisation to preallocate some memory so it wouldn't waste some time increasing capacity and copying inside arrays

@CoolDude911 4 ай бұрын

It is common to get something where you get an image line by line and get the metadata at the beginning to pre-allocate the size. Preallocating a maximum size image would not be as memory efficient.

@nordgaren2358 4 ай бұрын

Technically, std::array is stored wherever the memory you are using for it, is stored. If you have an object that has an std::array, that array is going to be stored wherever that object is. If you put that object on the heap, then the std array is stored on the heap. In your second example, if you make a static array to store the colors, that data would be stored in the .data section or .rdata section. (Or SOMEWHERE within the PE or ELF. I have also seen static data get stuck in the .text section. Haha) The important part is that it's not going to create any additional memory. Just minor nitpick though. :)

@e22z6 4 ай бұрын

If it’s a global variable then it will get stored in .data section

@SirusStarTV 3 ай бұрын

@@e22z6 does const makes them reside in .rdata? (checked in dumpbin, yes it does)

@robertvetter1011 3 ай бұрын

You are mixing up RAM memory storage with sections in the executable file. Both are different things.

@nordgaren2358 3 ай бұрын

@@robertvetter1011 no they aren't. The executable sections are literally stored in RAM. They are only stored elsewhere when on disk.

@styleisaweapon Ай бұрын

@@robertvetter1011 gonna have to agree with @nordgaran2358, .data and .data? is the best possible places to have fixed sized global buffers and from where I am standing its not even a question. The greatest sin is that after we left C behind, we also left behind clear and explicit allocations. We gave up the hard computer science for mathematician abstractions. Its true in all of the languages that followed soon after. If its in .data then it was allocated at *compile time*. If I can reliably make a compiler do that then thats a very big win. There is no room for that sort of anything in the mathematician abstractions because if there was, they would make a new abstraction without it.

@xugro 4 ай бұрын

I'm pretty sure the push back function is an amortized constant so preallocating just halves the copies. Misusing the resize can also make it go from a constant to O(n) insertion.

@Kurushimi1729 4 ай бұрын

Sorry I don't understand what you're saying. If I'm inserting n elements into a vector I don't understand how it could possibly be done faster than O(n)

@xugro 4 ай бұрын

@@Kurushimi1729 I was talking about complexity per insertion

@xugro 4 ай бұрын

@@Kurushimi1729 Meaning: inserting n elements normally is O(n) amortised. Misusing resize *can* make it O(n²)

@Kurushimi1729 4 ай бұрын

@@xugro ah I got it thanks

@dn275 4 ай бұрын

Thank you so much for making this video! It’s super helpful for someone like myself who’s self-taught and doesn’t have a good grasp of the inner workings. I really appreciate how you explain the thought process, the different approaches, and show how you can dig in deeper to verify for yourself. You’re a great teacher!

@Jplaysterraria 4 ай бұрын

Reserve has a fun pitfall of not following the geometric growth c: this may cause more allocations if e.g. you have a 1k element vector and reserve 100 elements; the reserve would allocate for 1.1k elements, while using push/emplace_back would allocate for 1.5k elements. If the reserve is done multiple times, it can cause real performance issues.

@oracleoftroy 4 ай бұрын

Reserve is allowed to overallocate, and I believe that using reserve on most implementations will follow the normal geometric growth of that implementation. On the other hand, resize has this issue on all implementations, IIRC. But in either case, caution is needed as it is vary easy to do worse than just letting the container manage its own size. You are right and Cherno should be more cautious recommending manually handling the vector's size. That works best if you know you want exactly N elements and will never change it, but in most cases it is better to just let vector handle it, at least until you have a performance profile showing that it is suboptimal.

@ensuretime 4 ай бұрын

@@oracleoftroy That's why I wrote my own vector, I can control the type of growth in compile-time, for a vector that will deal with blocks or buckets, growing exactly is more sensible because of the size of the chunks...

@petarpetrov3591 3 ай бұрын

Correction: stack memory is not faster than heap. It is literally the same memory. It is only the allocation step that is slower for heap ( in fact it is undetermined ). The reason why stack might be faster in some occasions is the fact it is pre-allocated and hard to cache miss.

@shiinondogewalker2809 2 ай бұрын

in order to access stack data the cpu will use the value in a stack register and an offset to get the address. In order to access a heap data the cpu will use the value in stack register and an offset to get the address to a pointer, and dereference that to get the address. After getting the base address to something like an array then yes, stack and heap memory are just as fast, but getting the address of the data is one more step for heap.

@petarpetrov3591 2 ай бұрын

@@shiinondogewalker2809 True but irrelevant IMO. 1-2-3 access for register versus million access for RAM via segments.

@shiinondogewalker2809 2 ай бұрын

@@petarpetrov3591 for one big task like that, sure. If you have many small lookups it's the other way around. Consider it irrelevant or not it's wrong to say it's only the allocation step that's different. I can agree that it's mostly irrelevant though

@johnwellbelove148 2 ай бұрын

Stack and heap memory are not necessarily the same memory. In an embedded environment, memory may be implemented in separate fast and slow physical memory devices with the stack and heap configured to live in either one.

@petarpetrov3591 2 ай бұрын

@@johnwellbelove148 they are. even in embedded, dunno what you re talking about.

@Bolpat 3 ай бұрын

15:00 The question of reserve or resize is basically: Is the element cheap to default-construct and to copy-assign? Yes? Then use resize. Otherwise use push_back. The difference can be massive, as every push_back requires checking and setting the capacity, plus writing the data. A copy-assignment just writes the data. The default-initialization for trivial data types is cheaper than a capacity check-and-decrement. If the size won’t increase, use a std::unique_ptr and write in all the elements. A little less overhead than std::vector. In C++20, there is std::make_unique_for_overwrite to create a unique_ptr with uninitialized elements.

@Bolpat 15 күн бұрын

Naive mathematician: I want to do simple vector stuff, but std::vector has no mathematical operations defined on it, so I added some operator overloads. Me, a C++ dev: Nooo, don’t do that. You want to use a std::valarray, that already implements those. Mathematician: Oh, okay… What is a std::vector then? Me: An extendable array. It has zero conceptual relationship with vector spaces. Mathematician: C++ naming is stupid. Me: yes-meme.jpg

@hanspeterbestandig2054 4 ай бұрын

BTW In the meanwhile the owner (Adam) of this Tetris project was so kind to accept my Pull Request in which I taught him about these Issues as part of a Fork of his Project. This means that the latest version of the discussed code is now fixed in this repeating copy of the vectors. This storage then is referenced by a *const reference* that refers to these cached data when needed. Hence the code now performs lazy allocation of the Resources upon the first access and stores (caches) it *on a single point*. BTW its not only the Vector of Colors that are managed within a std::vector).

@tfraven 4 ай бұрын

Great video! I've known about this for a while, and it's surprising how many programmers overlook the importance of understanding how many copies and movements are happening in deep memory when using std::vector. You absolutely nailed it in explaining why this matters. It's not just about using the right tools, but knowing how they work under the hood. Thanks for shedding light on this important topic!

@hbobenicio 4 ай бұрын

Great video! Just a side note... 8 calls to malloc (or standard c++ new operator) not necessarily heap allocate 8 times. it allocates in pages of memory, not every call. but I totally got the point of the video, which is awesome btw!

@Raspredval1337 4 ай бұрын

operating system gives memory to the process in pages, allocators (like malloc) then break those pages into chunks and give the chunks to the programmer. In order to reuse chunks you need to keep track of the chunks. That means you need at least 2 pools of chunks: one for unused chunks, one for occupied chunks. Plus, it makes sense to keep small and big chunks together respectively to minimize memory fragmentation. Even it the call to malloc (or new) looks very simple it's actually not very simple at all

@ohwow2074 4 ай бұрын

That's true. But even with that, allocations are slow.

@hbobenicio 4 ай бұрын

@@ohwow2074 Yes yes, totally should be avoided if possible of course.

@SirusStarTV 3 ай бұрын

It uses placement new operator?

@wissensfrosch 4 ай бұрын

A std::array is not a general replacement for std::vector. If you return a std::array from a function by value, all its elements will be copied. Returning a vector by value is a lot cheaper. (Sure, in most cases we have RVO.) Another important point, though, is that stack space is limited. Your program might work for small test examples, but will crash if you suddenly use it with larger std::arrays. This does not happen with std::vector. In certain cases it might be a lot smarter to use a vector to future proof your program for larger inputs. I’m also not sure about performance differences in accessing arrays or vectors. This might be true if you only store 5 ints, but for larger sizes the overhead of the initial indirection is negligible. Caches don’t play much of a role for performance comparisons if you iterate over a hundred structs or more. You should only make sure that you don’t continuously resize a vector by calling push_back (like you have mentioned). But, don’t initialize the vector with a size or resize it. (Almost) always use reserve() instead because default constructing objects (not just ints which actually stay uninitialized) is a performance killer. If you always use resize() you don’t have to think twice. And finally, only use emplace_back if you want to construct the object in place. Otherwise the general consensus is to use push_back to avoid nasty errors. If you move an already existing object into the vector with push_back it is not slower than emplace_back. But, it is safer and you actually want to push back an object in this case and cannot emplace it.

@kiverismusic 4 ай бұрын

You mentioning the code review episode which inspired this vid reminds me of a teacher looking at your paper during an exam, then reminding the whole class not to make some very specific mistake 😅

@collynchristopherbrenner3245 3 ай бұрын

And here I am, basking in the C# bliss of using List all the time with no idea of how it affects performance lol

@MagnificentImbecil Ай бұрын

Thank you for the information in this video. If I may make an observation (possibly wrong, please feel free to kindly correct me): I think that: Moving an `std::array` requires time approximately proportional to the number of elements (and the exception safety (error safety) level of this operation depends on the exception safety (error safety) level of the move operations for the elements). Moving `an std::vector` only requires a fixed (very small) amount of time whatever the number of elements (and always offers the strongest possible level of exception safety (error safety): it is always a no-throw (no-fail) operation, whatever the exception safety (error safety) level of the move operations for the elements -- and in fact works even when the elements cannot be moved, e.g. mutexes and condvars).

@somniad 4 ай бұрын

this is so interesting! I knew all this in theory, but all of these look to me, eyeballing it, like the compiler should be able to get rid of the extraneous allocations on its own. I've been told before things like "portability isn't the only reason to pick C over assembly - you're also probably not cleverer than the compiler!" but I had no idea that this doesn't actually even begin to translate to the jump from C to C++, not at all!

@zamf 4 ай бұрын

There are cases where push_back is more suitable and the distinction is quite simple: if you have an object of type Data already constructed, use push_back (preferably with std::move), and if you're constructing the Data object as you're inserting it into he vector use emplace_back. If you're using emplace_back for objects that were already constructed you're unnecessarily calling the copy/move constructor.

@przemeknowak7350 4 ай бұрын

I think it also worth to mention that if we use vector that may be resized, it is good to mark move constructor objects that vector stores as noexcept

@fbafelipe7666 4 ай бұрын

In the tetris code, when it iterates the vector it is also making a copy of Position. He could avoid that by changing that for to "for (const Position &item : tiles)"

@WiktorWandachowicz 7 күн бұрын

Or use "for (const auto& item : tiles)" for simplicity. In some code bases that construct could be just fine.

@maxscriptguru 4 ай бұрын

Your a natural teacher. And your editing is spot on too: Short and quick. Love it! fantastic video.

@Iamine1981 3 ай бұрын

For the use of std::array, good to mention also that the size needs to be a COMPILE TIME known, otherwise not useful.

@ScorpioHR 4 ай бұрын

My ex was an STD vector. She spread them all over the town!

@BobrLovr 3 ай бұрын

will always love this one

@mojojojo890 2 ай бұрын

lmao

4 күн бұрын

should have used push_back instead of emplace_back. (If you know you know XD)

@anon_y_mousse 3 ай бұрын

This is why I made arrays first-class objects in my language. If you use a question mark as part of the array size when declaring the array, then it's dynamic and on the heap, otherwise it'll be static and on the stack. For most operations the compiler attempts to optimize usage of dynamic arrays as views, like I do for strings. In fact, the default string type is a string view in my language. Of course, someone will invariably wonder how you control the allocator when these types are built-in, and for that you can use the template syntax to substitute your own allocator.

@PedroOliveira-sl6nw 4 ай бұрын

Since you touched the subject, it would be nice to make videos about the inline vectors of some libraries and the pmr::vector of the standard library

@ptrblz 22 күн бұрын

Good video, really summed up a lot of impotant concepts about writing efficient c++ code.

@RealDaveTheFreak 3 ай бұрын

Thanks! ❤

@delarosomccay 3 ай бұрын

Some of us are super old school and were around when STL was the "new kid on the block". It wasn't part of the C++ standard until 1998. These were days when you had a dog eared copy of Knuth's algorithms book (TAOCP) on your desk for quick reference ;) Anyway, the reason one is heap based and the other stack is pretty simple. The std::array is basically just a decorator for an array with some sugar thrown on like iterators and range checking. A std::vector, on the other side basically does a malloc to create a buffer. It will then exponentially grow that buffer as it needs to. Also, heap has always been slower than stack. Unless something has changed. In the world of x64 it does require a few extra instructions, but heap memory is not contiguous, so there is overhead with the memory manager (which I also wrote low level back in the day using Borland C++ and Assembler :P ). Stack is contiguous. In terms of complexity they are both O(1) for access, but the std::vector will add overhead for insertion. If it needs to resize that operation will take O(N), so in the long run std::vector is going to be slower. It's really not something you would typically optimize for - remember Knuth's warning: Premature optimization is the root of all evil. Use them for what they are used for. If you have a fixed buffer and you know it will never change, use a std::array, if you need a dynamic buffer use std::vector. These are patterns that should be familiar to all developers regardless of what frameworks or libraries they are using. Just saying. Now let me get off my X Gen soap box :P

@martinrodriguez1329 2 ай бұрын

What about huge data structures arrays? Say you need a million elements, should you use the stack or the heap? Should you use , or just an array? I'm aware the stack has a fixed size and it's not that big, but I'm also aware you can change the size of the stack so... Which one to use?

@Omnifarious0 2 ай бұрын

You missed talking about amortized constant time for push_back despite all the allocations. I think that's important to cover because it explains why vector chooses to increase the size by a multiplicative factor each time.

@Alturnator 4 ай бұрын

Before C++20 emplace_back didn't work with aggregate initialization, so you had no choice but to have it invoke the move constructor: emplace_back(Aggregate{x}) But, when you start doing that, push_back and emplace_back pretty much do the same thing. One other thing worth noting about emplace_back is that it returns a reference to the constructed element since C++17, which may be useful in certain situations.

@ymi_yugy3133 3 ай бұрын

Interesting to see your take on the STL. I know that Chromium, which I'd say falls in the category of performance sensitive real time applications is actually use as much of it as possible. They have their own abseil library for cases where STL implementations are lackluster but they generally try to avoid homegrown solutions.

@matspatpc Ай бұрын

One point I think should be made here: If you NEED to have a dynamically sized container (for example large number of entries, so stack won't work [although in modern 64-bit systems, allocating megabytes or gigabytes on the stack does work reasonably well], or it needs to be passed back to the caller without the caller knowing its size at compile time), then std:vector [or optimised version of it] should be the preferred option, rather than say std::list or std::deque - those add even more overhead in both copy and traversing it. The fact that it is contiguous in memory is VERY useful. Also, copying a std::array may well be worse than copy/moving a std::vector.

@rbaleksandar 3 ай бұрын

Fun fact - allocating more than you asked for also happens in other languages. I remember asking on SO why a supposedly empty dictionary has such a large size when I first discovered this.

@sanyanders 4 ай бұрын

I tried to write the same code in CLion and I'm glad that Clang-Tidy linter highlighted me these issues and recommended to use `emplace_back` instead of `push_back` and highlighted unnecessary temporary object creation when I tried `emplace_back(Data(i))`. Not even saying about `const T&` stuff

@chickenbobbobba 3 ай бұрын

keep in mind you dont always want to stack allocate arrays, especially if they are large. you only get 1mb of stack on windows and 8mb on linux, which is fine for small stuff but past a point you want to keep it fairly freed up

@koonhanong2267 3 ай бұрын

💡One thing that could have been highlighted (though you have briefly mentioned it), is that moves are not free. When a move is done on a temporary, both objects need to be created first, before the temporary is moved (and also destroyed). This is precisely why emplace_back() which forwards the arguments is good 👍

@theobarollet7619 3 ай бұрын

I think their is an important idea to have because we often add elements to vectors but sometimes we need to remove them. If we don't have a LIFO or something we sometimes need to remove elements in the middle of a vector. If the order does not matter the correct way to remove elements is to swap the element to remove with the last element THEN remove the last element so it is constant time and not linear. Actually I think this kind of algorithmic optimization is more important than machinery optimization that is a super complex topic in the end.

@kanishk2345 25 күн бұрын

Yes this is the video I have been searching for. That actually tells what's going on in the memory with vectors

@user-0xDEEDBEEF 3 ай бұрын

It is nothing wrong to return a vector from a function. It is a move semantic or compiler's return value optimisation which allows not to copy returned value. Also every container has "reserve" function to avoid multiple dynamic allocation/copy in a container. But in general I agree with you that C++ has big disadwantage in compatison to C. It is required paying much more attention for performance of the application.

@juancarlospizarromendez3954 3 ай бұрын

In C++, "struct is class" although trying to use the compiler option -fno-rtti that is not good for the enormous classes taxonomy. For cleaner design, v.push_back(Data(i)) may not give same performance as v.push_back(i), so that the former idea maybe unoptimized.

@askmiller 3 ай бұрын

So for the example of a function which returns a vector, I'm of the opinion that if you are intending to return a dynamically sized chunk of memory, it is better to use the heap. If you want to use the stack, you're stuck between two options, copying the structure up the stack in the return line of the function, or requiring that the caller knows how much memory you're going to need and passing in a reference. Having to allocate and deallocate space in the heap is not that big of a performance hit if done correctly. The blanket advice of "avoid heap" isn't nuanced enough in my opinion. What is more important is your emphasis that people understand how these data structures work. But of course, let's not forget that the compiler can often do the work for you and writing maintainable code is arguably more important than optimized code in those examples.

@garyp.7501 4 ай бұрын

Thanks, this is a good reminder on letting emplace_back do the construction, vs calling std::move(data)

@lightps5515 Ай бұрын

Fun Fact: emplace_back uses "placement new" operator under the hood to create the object in the pre allocated memory avoiding both copying or moving stuffs.

@AdonaiDio 4 ай бұрын

What a clever and simple way to find these "leaks" in the code. I will definitely try to use this.

@skilz8098 4 ай бұрын

One of the things that I think that wasn't mentioned between the uses of std::array, std::vector isn't just in knowing how many items you'll need or have within your containers, but also the lifetimes of those objects.

@clerothsun3933 3 ай бұрын

reserve(N) doesn't grow to N, but to _at least_ N. It's an important distinction as capacity() will usually be higher than what you just reserved.

@feetbots 20 күн бұрын

At 11:13 I'm pretty sure if you enabled optimisations when compiling, the compiler will allocate memory from where that function was called and use a reference to that memory, avoiding the copy that would have been made

@SouravChakraborty-y3w Ай бұрын

Genius! Although I like to know Visual Studio keyboard shortcuts, reason it gives motivation to code more.

@absention390 4 ай бұрын

Usually most of your videos go over my head as I only took like two C++ classes but I could understand this one and I really enjoyed it, 10/10 and also shared with a CS C++ graduate friend

@mr.anderson5077 4 ай бұрын

Thanks Cherno, please teach custom allocator, arena allocator next from scratch. Also speak about template specialization using std forward and more about templated classes Thanks a ton in advance

@WagnerSchmitt 17 күн бұрын

Great video, awesome explanation. One quick question: When using the std::array you have to do the assigning as it has no emplace method. Isn't that creating copies of Data like the ones we were trying to avoid when we changed push_back to emplace?

@Alturnator 4 ай бұрын

While I believe it's good to show the differences, I think the comparison isn't entirely fair. By having the copy and move constructor modify global state, you're introducing side effects, which effectively prevents copy/move elision from happening. Remove those side effects and the copies/moves created by push_back(Data(i)) and emplace_back(Data(i)) will actually be elided.

@OCTAGRAM 4 ай бұрын

If performance of stack is desired, one can consider using arena. Arena is like user defined additional stack

@shiinondogewalker2809 2 ай бұрын

without knowing much about arena, it can't reach identical performance of stack right since the CPU has registers specifically for the stack pointers. You'd have to put those in memory or something instead I imagine to emulate it with Arena

@OCTAGRAM 2 ай бұрын

@@shiinondogewalker2809 Generic memory allocation may require many hoops, several thousands of code statements: braching, failed branch prediction, merging memory holes. Arena, even if not backed by registers, saves some of that trouble. Once array has been allocated, its address can go to local variable, and local variable can be mapped to a register. Additional register for every array allocated on arena, and if CPU is out of registers, then local variables on stack for storing pointers to arrays in arena.

@karbonaterol4966 2 ай бұрын

for fixed-size arrays, the C Arrays (with type[size]) are usually faster than the std::array because they have less overhead

@DenshinIshin 29 күн бұрын

I just tried on godbolt with O3 optimization, and it essentially produce the exact same assembly in both case for clang and gcc. Instead of doing that, just use the clearer, more easily understood code, that is less prone to errors, and then, and only then, if you have a performance issues, measure it to see if you need to rewrite it another way, instead of doing premature micro optimizations like those. Compilers are smart enough, don't try to outsmart them if you don't know for sure it's not doing exactly what you want.

@fairy8141 13 күн бұрын

std::array doesn't have overhead! It has a single member, which is a C array. So no worries, use std::array to your heart's content.

@dhickey5919 4 ай бұрын

Amazing! Beyond the Tetris example, can you explain what this memory management change might look like in a larger application? Maybe not an air traffic control system but something where larger consequences can play out.

@h7qvi 4 ай бұрын

Moving amounts to a shallow struct copy, and happens when the source operand is known to be expiring.

@xfinnerjx 4 ай бұрын

Great show and tell of how something and simple as std:Vector isn't the silver of collections of data that a lot of people think it is unless through and planning are used first.

@parthivreddy7989 3 ай бұрын

Can you similarly talk about other STL features and how to write better code(performance), and one suggestion when doing some optimization talk about the tradeoffs rather than completely discarding an option, and it would be more engaging if you actually show the runtime after showing the number of allocations, using counters,tracers and profilers offered by kernel

@TryboBike 4 ай бұрын

For any task where size matters ( giggty ) - where vector sizes reach thousands of elements - the allocation cost incurred by the incremental push_back probably does not matter. Storing objects with a non-trivial copy semantics in a vector is asking for trouble either way - as a push-back ( or emplace ) that forces a reallocation will move / copy the content to a new location - incurring a pretty hefty cost. For games such usually means frame drops. The main benefit of std::array is that it is 'constexpr' able, which may mean 0 allocations, 0 moves and 0 copies.

@alpizzayer 4 ай бұрын

Thank you so much for this one ❤ I love your C++ tutors so much.

@jaysistar2711 4 ай бұрын

I understand that "on the stack" and "on the heap" is mostly true, but being as pedantic as I am, I would say that `std::array` is "inline", and `std::vector` is a "owning pointer to a heap allocarion.

@hanspeterbestandig2054 4 ай бұрын

5:35 Hoppla! 😳 why is std::array stored on the *Stack* ? It uses the storage *dependent* from the context/location* it lives* ! For example: If you use it within a local (stack managed) scope, then you‘re right. But If one uses it globally, then it *does not* use the stack but the global space which usually is the bss section within the program. If a std::vector is part of a class or struct an one creates an instance by using new then the containing array is also part of this memory which is the heap! For std::vector you’re totally right: This container *likely* uses the heap for its storage because since it is dynamic it will (likely) employ new to allocate its storage space and new means heap! However this is dependent from the implementation. Funny side note: In my career I stumbled over an quite clever implementation that attempts to optimize small Allocations by reserving a certain space for its storage as a fixed array (aka intrinsic storage). If this will not be sufficient, then it starts to use new/ delete to expand this Space upon larger storage demands. This design decision was clever for this ( embedded) Software, because the application code was designed not to exceed these limits to keep the performance high. Furthermore Embedded Software should pretend to use dynamically allocation of memory due to the risk of memory fragmentation… But this is another Story…

@lengors7327 4 ай бұрын

I think he uses stack intercahngeably with being alocated depending on location (which he should probably clarify to be fair). As for std::vector always using the heap, is this really a requirement? I.e. is it part of the spec? Genuine question, as I would assume it could possibily have a small "stack" allocation (more specifically storage dependent on location, like an array) for small vectors, no? Something like std::string does if Im not mistaken

@hanspeterbestandig2054 4 ай бұрын

@@lengors7327Exactly! You got the point! Thanks for this valuable explanations! 👍👏👏👏

@collynchristopherbrenner3245 3 ай бұрын

Wow, I had never thought about adding capacity config for a dynamic array before. I kind of figured that this is kind of redundant concer ing how this is a requirement of a static array and so you could just use a static array. But I see the value of automatic resizability after capacity config.

@Tyrano840 Ай бұрын

I don't use vectors very often as I used sometimes get confused between a vector and vectors. I usually use an Array as long as I now the size and a dynarray or list if I am not certain of the size I need.

@szabotudor 3 ай бұрын

Push back can actually be better when you have a vector of some small data type (bellow 8 bytes on 64 bit platforms, and bellow 4 bytes on 32 bit platforms) ESPECIALLY for built-in types like int, size_t, and other numerical types, because those have special optimizations around them in compilers when copying them, rather than the reference to it This is not ALWAYS true, but 99% of the time, copying a number is better than copying a reference/pointer to it, especially when it's smaller than 8/4 bytes (again, depending on the platform) PS: these optimizations don't really matter unless you're gonna move around millions or billions of numbers per second.

@DaveMacara89 4 ай бұрын

This video sort of confirmed my suspicions of a lot of what I see with Vector usage, e.g. misuse of the ->data() method (underlying pointer).

@MrAbrazildo 4 ай бұрын

4:20, if array is the most used, I don't know. But std::vector is certainly more comfortable, and almost as the same speed as array. Even nowadays, when clang seemed to achieve more performance for array, it's only ~5%, according to my benchmarks. Meaning a game (depending on vector performance) with 57 FPS would run at 60 with array. That also means vector is being deployed on stack - otherwise it would never reach this performance. vector is also more comfortable to use because of pushing_back things makes the size grows proportionally, without the worry for seg fault, when reaching stack's limits. But of course, I always use vector::reserve 1st, to avoid new "allocations" at every push_back.

@TheCherno 4 ай бұрын

std::vector is also an array, I wasn’t talking about std::array specifically

@MrAbrazildo 4 ай бұрын

@@TheCherno Are you saying that std::array is slower than C-array? I read somewhere that the standard granted same speed, by keeping its inner structure.

@Robbatog1 4 ай бұрын

@@MrAbrazildo That's not what he's trying to say. He means "across all programming languages, the most used data structure is the one that stores a set of objects in contiguous memory". The name "Array" is the most common name for this data structure. Both std::vector and std::array are "Arrays" in this sense.

@wolpumba4099 4 ай бұрын

I am just trying to learn std:pmr (polymorphic memory resource). This gives more control of where the memory comes from and can help making code that uses std::pmr::vector (and other STL datatypes) more robust and faster. This gives you one solution to store a vector inside stack memory that you allocated with std::array. I just haven't found a good explaination, yet and I'm currently very confused about it.

@mobslicer1529 4 ай бұрын

i just wrote an implementation of it, it was super fun (and it helped me fix a lot of bugs in my linked list and memory allocator implementations)

@henrikholst7490 4 ай бұрын

This is a great video and explanation. With a hot take like the one you had I wasn't too sure it would be useful. Love to be surprised!

@0240tutor 3 ай бұрын

Cherno, wonderful video as usual. You showed a practical review of how to use vector. The theory behind your practical lesson in fact is here in this book "Effective C++ Programming" by Scott Meyer And it was presented by Jon Kalb, Google shows it first hit. They are having more precise approaches for lvalue vs rvalue vs universal value, move vs copy then they introduce us to & and && for which case, finally why do you need emplace_back and which unique case you need to use push_back not emplace_back Please review that book to get the reasoning behind having those functions that you dag into. Wonderful video again thanks.

@soonts 4 ай бұрын

Good for beginners, but here’s a few comments. The C++ standard library is no longer called STL, it was years ago but now it’s just “the standard library”. std::vector is one of the very few standard collections which is OK even for performance-critical stuff. The only few times when I replaced it recently when I wanted to bypass malloc/free C heap i.e. I know my data is very large and I wanna page aligned zero initialized memory directly from the OS kernel i.e. VirtualAlloc or mmap. Modern compilers are smart enough to eliminate temporaries caused by push_back. For simple elements like your example they often automatically inline everything at least in release builds, compiling push/emplace into equivalent machine code.

@MagnificentImbecil Ай бұрын

Thank you. I just wonder whether `std::vector` could simply be adjusted to use `VirtualAlloc` (via the second template argument) instead of being entirely replaced -- then one could still benefit from much of the work already done by the authors of `std::vector`...

@soonts Ай бұрын

@@MagnificentImbecil Technically sure, but I just didn’t need any of that stuff. When vectors are small and growable, C runtime with its heap is faster than OS kernel APIs for memory management. For that one use case I didn’t need them to grow, but these things were huge, literally gigabytes of memory. Free zero initialization, and alignment by memory page boundary, was exactly what I needed for these huge blocks of memory. And another thing, std::vector itself is not that complicated. The interesting stuff on top of these standard collections come from header, and these algorithms happily accept raw pointers instead of iterators.

@Vencentguo 4 ай бұрын

I was going to comment on the fact that the case made in this video actually demonstrated how good the vector is since the cost of allocation is amortized which is explained a bit later. Also, the copy and move constructor is irrelevant to the topic in using the vector since other data structures will behave similarly to how your customized data type set up copy and move constructors. However, the process of analyzing allocations here is solid and in fact, lacking in most developers I've seen in other language users, so kudos.

@allNicksAlreadyTaken 4 ай бұрын

You can also use a stack-backed arena allocator and plug it into std::vector with pmr.

@bit_by_snake 4 ай бұрын

You should cover something like "How to approach to optimisation" or Optimisation to an existing codebase in general. (of course for beginners)

@delusionalaar4031 4 ай бұрын

I love you takes and insight. Never gets old.

@ARKSYN 3 ай бұрын

EASTL is pretty great, but I do have one question regarding it you may or may not know the answer to: What is the correct way to override the new operator? They mention briefly that you need to and give an example but don't talk further about it.

@lanchanoinguyen2914 4 ай бұрын

very useful,i can take this in mind.I still like std vector because it's convenient for a clean code.I may think about vector.reserve or vector(presize) if i had known size.

@davidfrischknecht8261 Ай бұрын

ReSharper and SonarLint will both flag calls to push_back() and suggest replacing them with calls to emplace_back().

@basti2k100 3 ай бұрын

The same behavior is for std::string. A better Implementation of a vector is in QT. Here the objects are copy-on-write. If you create a copy of the object, then there increment only a reference counter. So there are copy online a mangagent structure of some bytes. If you modify 1 copy then there class create a new sepeate copy of the data. So here the management of the data its better then the stl.

@robertvetter1011 3 ай бұрын

The reason Qt had to do this was that when Qt was created there were no modern features like move constructors in C++. It was way before C++ 11 was released.

@basti2k100 2 ай бұрын

@@robertvetter1011 Its also possible to extend the normal vector with COW. class CowVector { private: std::vector * internalData; size_t * linkCounter; }; With the construction of the Class setup the internalVector and the linkcounter = 1. If the linkcounter is equals 1, so you can read and write on the internalVector. If the linkcounter is bigger then 1, the internalVector is readOnly. If you want write, you create a new copy of the internalVector. The new structure has then the Linkcounter = 1 and is RW. A copy-constructor have copy both pointers and increment the value of the linkCounter. And the destructor have decrement the linkCounter. If the linkCounter == 0 then the destructor has to been delete both pointer. So you can theoretically extend the existing Vector with COW without using C++11 features. Be careful the example doesn't contains the public functions. Furthermore my example isn't threadsafe.

@michaeljackson1147 4 ай бұрын

This video is A++. Seriously. This will save a lot of time and headache for a lot of people The title, content and timing of release. I was just in the middle of going back and benchmarking + studying the different containers. Thanks for this fr because emplace_back is so important lol

@art0007i 4 ай бұрын

It's c++ actually

@malekith6522 3 ай бұрын

Headache? No. This is functioning code and it will work. It’s just a lack of awareness of the language and performance. It’s usually a problem of discipline and knowledge. BTW, in production, those issues will usually be caught by a static analyzer.

@michaeljackson1147 3 ай бұрын

@@malekith6522 Not sure what you're arguing about with the headache part but the point of the comment is that the video is helpful lol pretty simple.

@malekith6522 3 ай бұрын

@@michaeljackson1147 For me, headaches are synonymous with bugs. Some mistake that breaks something… therefore, I said it’s not helpful in preventing a headache. But anyway, it is a very useful video.

@michaeljackson1147 3 ай бұрын

@@malekith6522 Ah, but couldn't you say lack of language awareness and performance can contribute to "headaches", regardless? lol I see what you mean though really. Further more, bugs can be included in the entire knack, in regards to discipline and knowledge. Just picking back at you though at this point :P

@StormLord07 2 ай бұрын

As far as all this is true just wanted to add, don't blindly use emplaces instead of push or moving data. If you already have data copy/move it, move if it's no longer used in the scope you want it, copy otherwise.

@SuperSpeed52 4 ай бұрын

How would the std::array example look like at the end? since it doesn't have the emplace_back function as std::vector, would you be forced to use the struct's copy constructor?

@CrazyMineCuber 3 ай бұрын

The real solution should be to create a compile time array, with statically known size. That way, you can add more items to the array and still have even 0 stack allocations. The value will just be loaded into memory during load time.

@animanaut 4 ай бұрын

maybe a before/after benchmark would have been nice to show the impact

@galacticminx 4 ай бұрын

7:37 What stands out the most to me is not darta vs dayta, but the way you pronounce here... heeya.

@thomasfrewer1328 4 ай бұрын

Can we see some numbers on how much difference this stuff makes in program performance? How much faster does it run, how much memory do you save?

@Raspredval1337 4 ай бұрын

8:30 that's UB. You've effectively replaced the default *operator* *new* allocator with *malloc* . It's not guaranteed that default *operator* *new* uses *malloc* , thus memory blocks allocated with *new* \ *delete* might not be compatible with *malloc* \ *free* The 'correct' solution would be to also overload the *operator* *delete* with a *free* wrapper

@aboliguu1168 4 ай бұрын

Yeah but he was only allocating simple ints and in a very simple example program. Doesn’t matter

@Raspredval1337 4 ай бұрын

@@aboliguu1168 UB always matters. Even if the program does what you think is supposed to happen, UB prevents some (a lot actually) optimizations. It might be some micro optimizations here and there, and sometimes compiler can opt out huge chunks of dead code or even replace heap allocations with stack allocations. Plus, nobody wants their code to segfault, even if the only downside is an annoying error beep at the end of the program 🤷

@aboliguu1168 4 ай бұрын

@@Raspredval1337 the point is that in this TINY example program it didn’t matter. You are splitting hairs here.

@simonw3858 4 ай бұрын

Whats UB?

@Raspredval1337 4 ай бұрын

@@simonw3858 Undefined Behavior. Basically, there're some things, that a language (or library developer) assume as correct. And if the user breaks that assumption, then it's on the user. Good example is indexing into the memory that doesn't exist(like int array[ 5 ]; std::cout

@code_871 4 ай бұрын

we need more videos like this !!!

4 ай бұрын

What about implicit move constructors and copy elision?

@WiktorWandachowicz 7 күн бұрын

Just curious: when you define your own version of "operator new" (based on malloc), don't you also have to define "operator delete" (based on free)? I mean to pair the two and make memory deallocation work correctly?