Back to Basics: Iterators in C++ - Nicolai Josuttis - CppCon 2023

  Рет қаралды 38,386

CppCon

CppCon

Күн бұрын

Пікірлер: 60
@premkumaru8662
@premkumaru8662 Жыл бұрын
This is an amazing talk. Thank you for this Nicolai Josuttis.
@tylovset
@tylovset Жыл бұрын
I struggle with why the committee included caching of the first match. Wouldn't it be sufficient with a special function for the rare cases it is desired or needed? // e.g. something like template auto cached_filter(Iter start, Iter end, const auto& pred) { start = std::find_if(start, end, pred); return std::ranges::subrange(start, end) | std::views::filter(pred); }
@MarcusAseth
@MarcusAseth 16 күн бұрын
He is soo good at explaining things 👏
@bryce.ferenczi
@bryce.ferenczi Жыл бұрын
I'm sure someone thought of std::views::cached during the standardization, I can't see why that wasn't chosen over the internal caching nonsense. Where std::views::cached saves begin() and any other data required for ad-hoc "amortized-constant time" requirements.
@dgkimpton
@dgkimpton Жыл бұрын
For real, whatever happened to "if you don't need it, you don't pay for it"? Surely in a pipeline like this vec | cached_begin | filter(greater_than(40)) would have been perfectly reasonable?
@ABaumstumpf
@ABaumstumpf Жыл бұрын
Since C++20 the committee has delivered more and more ivory-tower broken changes (there were a few in 17 already). Now with the proposals in 26 it is becoming just absurd.
@ConceptInternals
@ConceptInternals Жыл бұрын
I am not sure if I get this right. If we have the requirement of amortised constant time in begin(), why does the standard say "This is undefined behaviour if the resulting value doesn't satisfy the filter predicate"? The statement of standard is a more strong statement than the consequence of prefetching. It could be like: "Running same filter again on container is undefined behaviour, if the elements of underlying container are changed"
@AbamaCai
@AbamaCai Жыл бұрын
Great talk from Nicolai Josuttis. 点赞。👍
@Roibarkan
@Roibarkan Жыл бұрын
Great talk. I’m not sure but I think that the fact that the committee chose to make some scenario “undefined behavior” gives them the opportunity of changing (defining) that scenario in subsequent standard versions. Of course in the case of std::views::filter_view it will be hard to ‘fix’ the situation without relaxing the requirement of begin() taking amortized constant-time (but potentially such relaxation could be made (there’s also a potential problem keeping ABI if we eliminate the caching in filter_view)
@benjaminfranklin329
@benjaminfranklin329 Жыл бұрын
I think the question of undefined behaviour as is often the case is to allow optimisations rather than leaving it open to change. Most of the explicitly mentioned UB is either because it can't be known at compile time or to allow optimisations by the library or compiler.
@AnthonyDentinger
@AnthonyDentinger Жыл бұрын
Relaxing time complexity has been done before; e.g. since c++11 std::list can return its size in constant time (whereas before it was in linear time, apparently because of the splice member function). I suppose if real-world practice shows the views thing causes too many issues, they might change it later; the ranges stuff isn’t too widely used yet.
@anon_y_mousse
@anon_y_mousse 11 ай бұрын
Believe it when I tell you that the committee can change any behavior and it won't matter if it was undefined before or not. They may dislike breaking backwards compatibility, and avoid it whenever possible and then some, but they have done so in the past and could for any revision. Even the C committee has done so and has again with C23.
@AlfredoCorrea
@AlfredoCorrea Жыл бұрын
48:00 std::ranges::remove already returns the (valid) subrange with elements removed.
@Drastonar
@Drastonar 4 ай бұрын
That filter bug is actually easy to fix, just test the filter in the cached begin before returning it, if it fails, start at the real begin. Also, if the container is modified before the cached begin, isn't it also an undefined behavior?
@CppCon
@CppCon Жыл бұрын
Congratulations Nico! Top must watched recent tech talk of the week. techtalksweekly.substack.com/p/tech-talks-weekly-2-where-web-tech
@bareminimum-mrtz
@bareminimum-mrtz 5 ай бұрын
Thankyou so much !!!
@ADHD101Thrive
@ADHD101Thrive 3 ай бұрын
very informative
@juancarlospizarromendez3954
@juancarlospizarromendez3954 2 ай бұрын
if caching is used then invalidating/updating the caching maybe required (it is similar to the CPU cache, web page cache, etc).
@danielelupo5224
@danielelupo5224 Жыл бұрын
why they did not "simply" implemented two different version of std::view::filters, one for const iterator that can be optimized like now, and one for non const iterator that can change values, so we can have the best of both world, speed on first case and coherence in second case...
@riven7855
@riven7855 Жыл бұрын
Thanks for the great talk. I also have a question: why does set/map support bidirectional iterators ? Shouldn't that be an implementation detail ?
@ndykhng
@ndykhng 11 ай бұрын
They are sorted, so there should be one logical direction
@llothar68
@llothar68 4 ай бұрын
No, thats the important features of a set. At least now that we have unordered versions.
@juancarlospizarromendez3954
@juancarlospizarromendez3954 2 ай бұрын
Because set/map may be implemented as either hash table (optionally with their doubly linked nodes), balanced binary tree or B+-like tree.
@johnwalker8952
@johnwalker8952 Ай бұрын
Should you have given an example (around 17:30) then tell the audience that this no longer works in the latest version? All while reassuring your listeners that you are 'protecting' them?
@YongweiWu
@YongweiWu 2 ай бұрын
25:00 `pos += 2;` invokes undefined behaviour when the container has an odd number of elements....
@karstenmeinders4844
@karstenmeinders4844 Жыл бұрын
What's going on behind the scenes?
@jangolab
@jangolab Жыл бұрын
Great talk. That whole std::view caching situation is very concerning. A mandatory optimization that induces semantic errors and promotes undefined behavior. With all due respect, it is unnecessarily patronizing and it makes the committee seem much less resourceful and discerning than it once was.
@StrikeEagleIII
@StrikeEagleIII Жыл бұрын
Yeah we should be taking foot guns out of C++ instead of adding them
@Evan490BC
@Evan490BC Жыл бұрын
@@StrikeEagleIII This is not possible, I'm afraid. All C++ constructs must adhere to the std::Footgunnable Concept.
@germanassasin1046
@germanassasin1046 11 ай бұрын
from the way I understood it, caching as an optimization was not the goal. nico kinda shows it from a wrong side. what really happened is that there is no sensible way to define multipass filter view with mutation, and whether you cache or not, does not make a difference. it is not a c++ pitfall but a pitfall of all languages where you can mutate multipass lazy filter, even rust has this footgun. so comittee decided since they cannot define this behaviour, they won't, and since they won't define this behaviour then caching will not bring any more problems. and while I do understand why they did this, I'd prefer version with no caching.
@anon_y_mousse
@anon_y_mousse 11 ай бұрын
Perhaps interesting to no one, but the "solution" I came up with in my own language is to allow for warning against the use of certain functions. For instance, I provide a list class as part of my standard library, but I have a preprocessor directive that warns against using it in its entirety, and for multiple element iteration and indexing it gives a second warning that the operation will be computationally expensive. I'm sure that people who love lists will find it childish and insulting, but for years I've told people to use arrays for everything they can because it's a superior data structure in nearly every way and apparently so have others. Of course, I still provide the functionality to do random access of a list, or to increment iterators more than once at a time, such as it += 5; and I even provide a slice operation for lists. I really don't get why C++ restricts so much in the name of speed, when warning against use of something would be more useful. Annoying, yes, but apparently people these days want annoying compilers.
@sufiyanadam
@sufiyanadam 4 ай бұрын
What's your language called?
@limlam22
@limlam22 8 ай бұрын
Some say the juniors say "yes, chef" after scrum with Nicolai.
@letsbegin_staytrue
@letsbegin_staytrue Жыл бұрын
Great talk !! but i didn't get how remove algorithm replaces ... anyone help here
@frisnitfrisnit
@frisnitfrisnit Жыл бұрын
If I understanding what you’re asking correctly, the remove was just moving the elements down, and left the last 2 elements where they were. I think what he didn’t mention was that the size reduces by 2. So if you iterate over the new range, it’ll ignore the 2 values at the end as they’re no longer within size, even if they’re technically still there in memory. But if you ignored that new range and continued to use the old range (IIRC that’s what the example was showing), it would still see the 2 removed elements (as you’re now using an incorrect range.
@leowribeiro
@leowribeiro 11 ай бұрын
I know that Iterators give a standard way to access containers, like a glue he said, but I just don't like them, when possible, I much rather use the integer indexes, but I do see their utility.
@ABaumstumpf
@ABaumstumpf Жыл бұрын
"Contiguous iterator" seems like a concept failure to me: having a contiguous memory layout does not in any way depend on the elements being randomly accessible. You can very well have say a container that guarantees it has contiguous memory but that does not allow backwards iteration. I have seen libraries that had compiletime sized lists with contiguous memory. They were memcopyable, contiguous, and not random accessible.
@dovahsenbrom836
@dovahsenbrom836 Жыл бұрын
it's related to cache locality mate
@anon_y_mousse
@anon_y_mousse 11 ай бұрын
Indeed, it's a linguistic error. I would think ranged iterator might make for a better name as it applies specifically to a range.
@gast128
@gast128 2 ай бұрын
Drop the caching of the filter view with these liabilities. Libraries should work straightforward; I don't have time to memorize all the internals or exceptional cases.
@juancarlospizarromendez3954
@juancarlospizarromendez3954 2 ай бұрын
Beware of the Bidirectional, Random-access and Contiguous range/iterators, because the '--' and '-=' operators maybe dangerous of UB. To understand easily it, .end() is always out of the container/range and one step after of the last element, and .begin() is always in of the container/range and pointing to the first element. In the forward iteration, there is not problem. But in the backward iteration, there is a problem, how to catch the pointer-like ".begin() - 1" that should be out of the container/range? For backward iteration, trying to swap .begin() and .end() is a nightmare for me. My proposal is .begin_minus_one() as the another .end(), and .end_minus_one() as the another .begin(). There is an example that prevents the UB for a Bidirectional range/iterator: // to start either in the begin, in the last or in the middle position: //auto pos = coll.begin(); //auto pos = coll.end_minus_one(); //auto pos = coll[555]; for (;(pos != coll.end()) && (pos != coll.begin_minus_one());) { code using incrementers and decrementers ... ++pos; ... --pos; ... as if it is bidirectional }
@fredoverflow
@fredoverflow Жыл бұрын
slide 19, second loop: If the vector has an odd number of elements, the very last pos+=2 will jump over the end iterator and invoke UB.
@eugnsp
@eugnsp Жыл бұрын
Yeah, that's a bug in the slide.
@ABaumstumpf
@ABaumstumpf Жыл бұрын
"will jump over the end iterator and invoke UB." Who says that this is UB? I can't find anything in the standard saying that this would be UB. Random-access iterators make no mentioning of not being allowed to increment the end-iterator. Other iterator-traits do have that restriction.
@dzmitrykulik9913
@dzmitrykulik9913 Жыл бұрын
Did you mean the second loop on the slide? Can you explain why do you consider it as UB? In case of odd number of elements, on the last step `pos` will be behind the .end() - that is right, but it will not be used, am I right?
@kuhluhOG
@kuhluhOG 10 ай бұрын
59:00 Imo, that's not just a Gotcha anymore, that's just actively user-hostile...
@ujin981
@ujin981 Жыл бұрын
56:45 "we have intentionally designed the filter view in a way that one of the major use cases we use a filter for (which is fix the broken elements) has undefined behavior". Worse is better. "The group that standardized this behaviour does not think this is a bug." There should be a law prohibiting the use of C++ for any new software.
@germanassasin1046
@germanassasin1046 11 ай бұрын
but this is indeed not a bug. you shouldn't mutate through a filter view while multipassing not only in c++ but pretty much in every language where said thing is possible. there is no way to define such behavior even without caching .begin(). single pass should be fine and it is more of a wording issue, not a fundamental design flaw. if you are willing, you are always welcome to propose the wording which would allow single pass with mutation.
@RevolutionaryUsername
@RevolutionaryUsername 3 ай бұрын
I mean this isn’t too different from how inserting in a for-each loop can invalidate the iterators. It’s annoying but it’s not a heinous design choice
@МишаБобров-и1з
@МишаБобров-и1з 7 ай бұрын
all C++ above the 17 standard in one phrase "is syntactic sugar"😂
@oliep
@oliep 11 ай бұрын
I think this is a really good and important talk, especially for new C++ programmers. But I am kind of frustrated, that Nicolai talks about "we" all the time. To my understanding this is the work of Alex Stepanov and Paul McJones. Here's a link where Alex tells the story and even explains why iterator is kind of a bad naming and how that happend: kzbin.info/www/bejne/bmXKeY2dhaiZZ9Usi=73xPLaJhP-9IJicy&t=1199
@DuRoehre90210
@DuRoehre90210 Жыл бұрын
.end() name has been a bad idea from the start. They should have named it "endex".
@Ptr-NG
@Ptr-NG Жыл бұрын
"We use c++, b'cause we dont like bad performance" ...interesting
@jerrymoostache7821
@jerrymoostache7821 18 күн бұрын
lesson: never use views
@alanwest6949
@alanwest6949 Жыл бұрын
Write our own views then. Because if you’re looking through a stained glass window, you wouldn’t go buy a new stained glass window after painting the bench and planting a tree outside?
@smaplessmap5355
@smaplessmap5355 Жыл бұрын
Guys you stick in past. Today we need short fast information. Not day long talks about simple standards!
@toby9999
@toby9999 Ай бұрын
I prefer the long format. I don't suffer from that modern, short attention span problem. If you don't like it, go find a short one. Some of us prefer the full details.
@egonkirchof
@egonkirchof 10 ай бұрын
Are you talking about iterators in 2023 as something new in C++ ? No wonder people use Python.
@toby9999
@toby9999 Ай бұрын
I don't follow your logic. Iterators are still part of the language, even now in 2024. And Python sucks. You can't develop high-performance applications in Python.
@swastikacharyya
@swastikacharyya Жыл бұрын
CppCon is a blessing 🫶 Thanks to all of you and Happy New Year.
Ful Video ☝🏻☝🏻☝🏻
1:01
Arkeolog
Рет қаралды 14 МЛН
Thank you mommy 😊💝 #shorts
0:24
5-Minute Crafts HOUSE
Рет қаралды 33 МЛН
The Only Unbreakable Law
53:25
Molly Rocket
Рет қаралды 356 М.
Back to Basics: C++ API Design - Jason Turner - CppCon 2022
1:00:42
"Simple Made Easy" - Rich Hickey (2011)
1:01:39
Strange Loop Conference
Рет қаралды 108 М.
Back to Basics: Lambdas - Nicolai Josuttis - CppCon 2021
1:05:21