How to Map Files into Memory in C (mmap, memory mapped file io)

  Рет қаралды 112,163

Jacob Sorber

Jacob Sorber

Күн бұрын

Пікірлер: 178
@beegbraining
@beegbraining 3 жыл бұрын
The background music makes it sound like we're solving a murder case. Great video by the way
@langstonnash5454
@langstonnash5454 3 жыл бұрын
pro tip : watch movies at flixzone. Been using them for watching a lot of movies lately.
@kabirkhalid2499
@kabirkhalid2499 3 жыл бұрын
@Langston Nash Yea, have been using flixzone for months myself :D
@cadearturo3879
@cadearturo3879 3 жыл бұрын
@Langston Nash definitely, I have been watching on flixzone for years myself :)
@aidenkylo4349
@aidenkylo4349 3 жыл бұрын
@Langston Nash Yup, I have been using Flixzone for since december myself :)
@aldomelvin1991
@aldomelvin1991 3 жыл бұрын
@Langston Nash yea, I've been using flixzone for years myself :)
@8015908
@8015908 3 жыл бұрын
NIce quick to the point videos. Lectures and textbooks often go too in-depth and confuse more than is. Your videos give brief insight, and then shows an actual functioning demostration.
@JacobSorber
@JacobSorber 3 жыл бұрын
Thanks. Glad you're enjoying them.
@LinucNerd
@LinucNerd 6 жыл бұрын
Thank god someone finally made a video on mmap that made sense! Thank you sooooooo much
@JacobSorber
@JacobSorber 6 жыл бұрын
You're welcome. Glad it made sense.
@LinucNerd
@LinucNerd 6 жыл бұрын
Also, why is the like/dislike ratio hidden? The quality of your other videos (including this one) are fine.
@JacobSorber
@JacobSorber 6 жыл бұрын
Good question. I guess I hadn't checked that option.
@topG448
@topG448 6 жыл бұрын
This man is a boss! Your video has some excellent stuffs. Please keep sharing :)
@JacobSorber
@JacobSorber 6 жыл бұрын
Will do. Glad you liked it.
@kotravaijm250
@kotravaijm250 2 жыл бұрын
NO ONE explains like you do. Thank You!
@djpolyester
@djpolyester 3 жыл бұрын
Thank you Jacob! However from the man page: `if neither O_CREAT nor O_TMPFILE is specified, then mode is ignored.` You don't have to use S_IRUSR and S_IWUSR as arguments. They should be ignored. Wanted to point out in case. Appreciate your great work!
@rafabaranowski513
@rafabaranowski513 6 жыл бұрын
I tried to understand this whole day. You made this in few minutes :)
@tigerrabbit3261
@tigerrabbit3261 6 жыл бұрын
ayyyy this is awesome very very good! you made mmap very easy to understand. craving for more !
@Jonathondelemos
@Jonathondelemos 3 жыл бұрын
The background music caught me off guard at first. After you executed the code, I said wtf like ten times and the music really made me laugh.
@zrodger2296
@zrodger2296 Жыл бұрын
This is exactly what I need to know for a project I'm working on. Thank you!
@officialsterlingarcher
@officialsterlingarcher 5 жыл бұрын
For my OS course one of our assignments is tbe classic consumer producer problem but with processes. One of the portions for this assignment is being able to write to a file from one of our consumer processes but I was having the hardest time understanding how to share a file in memory between processes. This definitely cleared it up! Thanks :D
@JacobSorber
@JacobSorber 5 жыл бұрын
Glad I could help.
@islandcave8738
@islandcave8738 3 жыл бұрын
Wow, this is amazing. My mouth was gaping when I saw this.
@anycaroliny7900
@anycaroliny7900 3 жыл бұрын
Perfect explanation!
@deepshankarjha5344
@deepshankarjha5344 4 жыл бұрын
amazing video . simplicity and clearity. hatsoff
@thepickicool97
@thepickicool97 4 жыл бұрын
you saved my life, and i'm not overreacting ! Thank you !!!
@MECHANISMUS
@MECHANISMUS Жыл бұрын
When could regular io be preferred over mmap?
@justwanderin847
@justwanderin847 2 жыл бұрын
I am on Linux and can not access more than the 4 gig limit. I have 64 bit and 32 gig ram and when I do the mmap it appears to work (8 gig), but when I try to access past the 4 gig, it gives sig memory err. Now I am using gnuCOBOL and have the table defined in Linkage and get the pointer set all is well. It works for 4gig or less. Is there something in my Linux system that I need to change as in the config ? (I use Ubuntu and PopOs) on two diff machines.... I should be able to mmap and access 90 gigs (should be no limit in linkage section. ?
@leokiller123able
@leokiller123able 2 жыл бұрын
Hey, great video ! I have one question though, how do you append characters at the end of the file this way ? Do we need a reallocation or something ? EDIT: I found a way: First I tried allocating more space with mmap but surprisingly it doesn't append character that I `strcat` at the end, so after a little bit of digging I found that if you edit files with `mmap` the size of the file is not automatically updated like with `write`, so we need to call `truncate` (or `ftruncate`) to manually change the size
@MrUmang40
@MrUmang40 6 жыл бұрын
Put everything aside.....did anyone notice.......the background music in this video.....it is so intense.... actually I loved it.......
@JacobSorber
@JacobSorber 6 жыл бұрын
Yeah, it felt like an intense topic at the time. Glad you liked it. :)
@PflanzenChirurg
@PflanzenChirurg 4 жыл бұрын
Whoever reads this, spread Jacob Sorbers Channel as it has been done with Cherno. The quality is good, the insight is insane! Have a nice day
@JacobSorber
@JacobSorber 4 жыл бұрын
Thanks. All sharing is definitely appreciated.
@PflanzenChirurg
@PflanzenChirurg 4 жыл бұрын
@@JacobSorber i see your channel already with 200k subs, you trended on my youtube recommendations since yesterday. I just can beg u to show us more as C is evenly Interesting as C++. Suggestions are: IPC, Sockets, Designpatterns in practical usecases because on my opinion you can learn that in C the best way. Lean and Mean
@strayedaway19
@strayedaway19 3 жыл бұрын
So freaking cool, thank you for this !
@arturesMC
@arturesMC 4 жыл бұрын
Hi. Is there any difference to memory acces time between MMAPed file, and just using MMAP to allocate memory? What if I want a fast way of loading stuff, but then I also need a high performance runtime?
@shaharrefaelshoshany9442
@shaharrefaelshoshany9442 3 жыл бұрын
This is gold !!
@ArunPrabhath
@ArunPrabhath 5 жыл бұрын
Hi, your videos are awesome. I would like to request you to post a video on shared listening socket descriptors which is shared across different processes.
@PflanzenChirurg
@PflanzenChirurg 4 жыл бұрын
you mean inter process communication?
@jinti4epicclips
@jinti4epicclips 3 жыл бұрын
Thanks dude!
@benedictionbora5902
@benedictionbora5902 2 жыл бұрын
I get errors using "open", "close", "write", and "read", saying that they are "invalid in C99", how do I fix this (I am using Vim on MacOS BigSur)? "fopen", and "fclose" work fine, but I just wanted to try the other functions.
@jeyanthinba9142
@jeyanthinba9142 5 жыл бұрын
This video helped a lot...Thank you So Much
@JacobSorber
@JacobSorber 5 жыл бұрын
You're welcome. I'm glad it helped.
@pagenotfound_code_404
@pagenotfound_code_404 4 ай бұрын
What are the drawbacks of using this vs the usual fopen, fseek, etc? I guess more memory used?
@dayol2026
@dayol2026 3 жыл бұрын
Thanks for the video!!
@rafaelnagel5253
@rafaelnagel5253 4 жыл бұрын
Thank you! Excellent content.
@JacobSorber
@JacobSorber 4 жыл бұрын
You're welcome. Glad you liked it!
@pankajkushwaha3793
@pankajkushwaha3793 3 жыл бұрын
Great tutorial .
@sghsghdk
@sghsghdk 4 жыл бұрын
How about appending to files - is that possible ?
@karlforshaw
@karlforshaw 6 жыл бұрын
Thank you for posting this
@JacobSorber
@JacobSorber 6 жыл бұрын
Happy to! Glad it was useful.
@paulwomack5866
@paulwomack5866 4 жыл бұрын
mmap is awesome and cool, and easy to use. Just RTFM. But for professional use, it's useless without exception based error handling. Because errors that you could handle via return failure code using read/write/seek (etc) just become memory errors. I don't know of a way to do high grade error checking when using mmap().
@godnyx117
@godnyx117 2 жыл бұрын
Like what?
@MECHANISMUS
@MECHANISMUS Жыл бұрын
Do we have to page align the mapped memory like it's shown in manpage example?
@realdragon
@realdragon Жыл бұрын
It actually can be useful but I'm using fscanf to get float from file but I don't know if it would work with mmap
@milossimicsimo
@milossimicsimo 4 жыл бұрын
Hi this is super cool example. I have one question...in this example you did some change to the content of the file, but if I want to increase size of the file? For example, I have some data in that file and I want to add more data to it. Is that something that could be done?
@SlideRSB
@SlideRSB 2 жыл бұрын
What's with the slasher movie music?
@adreto2978
@adreto2978 Жыл бұрын
When to use shm_open() over open(). Online people say in modern POSIX environments theyre basically equivalent? Is this true.
@utubeuser907
@utubeuser907 5 жыл бұрын
Nice videos.. to the point and no BS.. btw, what is the editor you are using? Is it on Windows?
@yomanos
@yomanos 5 жыл бұрын
Late response, but it's called Atom
@maellet.9707
@maellet.9707 5 жыл бұрын
great explanations, thanks !
@JacobSorber
@JacobSorber 5 жыл бұрын
You're welcome. Let me know if there are other topics you would like to see discussed on the channel.
@kathiravankathir3089
@kathiravankathir3089 5 жыл бұрын
Hi, when mapping memory with new empty file (opened with O_TRUNC) , 1 ) SIGBUS is generated when accessing the memory ,if i pass the size as page size to mmap. 2)SIGSEGV is generated when accessing memeory , if i pass the size obtained from fstat fun help me to resolve this !! Thanks.
@TimothyEBaldwin
@TimothyEBaldwin 4 жыл бұрын
No need to enlarge the file (for example by using ftruncate) to avoid SIGBUS and not write outside mapping (which can be larger or smaller than the file) to avoid SIGSEGV and other undefined behaviour.
@nabilelouahabi6735
@nabilelouahabi6735 4 жыл бұрын
T'es un amour
@JacobSorber
@JacobSorber 4 жыл бұрын
Merci beaucoup, my friend.
@CrazedMachine
@CrazedMachine 3 жыл бұрын
So why wouldnt we use mmap every time in lieu of fopen?
@jcialdella
@jcialdella 3 жыл бұрын
How do I move a memory file of dictionary words into hcreate/hsearch?
@thom9909
@thom9909 Жыл бұрын
awesome channel
@JacobSorber
@JacobSorber Жыл бұрын
thanks!
@Rebecca-sv3vd
@Rebecca-sv3vd 4 жыл бұрын
I still don't understand why this is faster. If we are in essence just hijacking the VM system's way of swapping from disk to memory, then why not let the VM system handle it? Also, what if the file is super big (either more than you have in memory or close to what you have in memory), if we're mmapping to memory (requesting memory from the OS), doesn't this cut into it's reservoirs?
@JacobSorber
@JacobSorber 4 жыл бұрын
With mmap, you are letting the VM system handle it. And, it's not *always* faster. It usually is, though. The virtual memory system already has techniques it uses for swapping memory in and and out of RAM from disk. So, if the file is super big, those techniques will just bring in a part of the data as you need it.
@dhruvakumar6964
@dhruvakumar6964 2 жыл бұрын
Thank you.
@ProkenKey
@ProkenKey 3 жыл бұрын
Can you show an example of offset?
@bonbonpony
@bonbonpony 4 жыл бұрын
So when you write something to that memory, _when_ does the operating system write this data into the file? I don't suppose it does it immediately, because that would dramatically slow down the access to that memory :q But in that case, my modifications to that memory won't appear in the file immediately, and they can be lost when there's a power outage in the meantime :q Is there some way to control when the operating system flushes those data to the file, or to cause it on demand? I'm also a bit dubious about your benchmarks, because from I've been told, `stdio` functions use `mmap` under the hood to do its own buffering. E.g. when you `fseek` to some location in the file, it mmaps a piece of the file into its internal buffer beginning from that offset in the file. I think what might have slowed down your "without `mmap`" version, is those multiple calls to `fseek` being done inside the loop, because that would most likely cause a lot of calls to `mmap` under the hood whenever you jump to an offset that is not in the range of currently mapped buffer (beside the function call overhead). Also, when you use `MAP_PRIVATE` with `PROT_WRITE`, the changes won't actually be written to the original file :q The system makes a private copy of the file in memory, that is swapped as any other virtual memory region. It won't be swapped into the original file. P.S. What's with that dramatic horror music? :q Is it supposed to scare us from using that dangerous C language? :D
@bonbonpony
@bonbonpony 4 жыл бұрын
OK, never mind, I'm answering my own questions :D The data stored in the memory-mapped region are _not_ written directly to disk until the system needs to _swap_ it (e.g. when it needs to grant physical memory to other processes). In that case, two different things may happen, depending on whether you used `MAP_PRIVATE ` or `MAP_SHARED`: For `MAP_SHARED`, the original file is used as the swap space, so the system dumps the content from physical memory into the original file, then uses that physical memory at a different virtual address for a (possibly) different process. For `MAP_PRIVATE`, the system makes a separate (private) copy of the data from the file in virtual memory of the process, and in that case they are *never* saved into the original file! Instead, the system swaps them as any other virtual memory - into a separate swap file/partition - when it needs to use that physical memory for something else, and reads it back from that swap file/partition into memory when the process needs it again. So the actual writing to disk is rather unpredictable and controlled by the system (it happens when it needs to swap that memory). But there is a way to request it on demand: using `msync` call. Also, `munmap` does it naturally when you finish using that mapping.
@edwingarcia5043
@edwingarcia5043 2 жыл бұрын
@@bonbonpony interesting
@michaelclift6849
@michaelclift6849 4 жыл бұрын
What if my modifications change the size of the file? Do you have a suggestion for how to handle that?
@bonbonpony
@bonbonpony 4 жыл бұрын
In that case, you have to change the size of the file the usual way, i.e. by seeking to some offset past the end of the file and writing some random byte there. This has nothing to do with memory mapping, though, because memory mapping just makes a 1:1 map between some part of the file and some range of virtual addresses. You can't "insert" data into that mapped region and expect it to shift the data after it to higher addresses, as you do when inserting text in text editors. Instead, it works more like the overwrite mode in text editors: you can only overwrite what's already in there, inside that mapped region.
@mariamka
@mariamka 2 жыл бұрын
Hello. First of all let me thank you for yoyr videos. They really teach a lot. I'm just a beginner and have still a lot to learn. Would you mind to answer my question? Do I need to allocate memory when using mmap func or it is already not causing leaks? Thank you)
@JacobSorber
@JacobSorber 2 жыл бұрын
Thanks. I'm glad the videos have been helpful. I'm not sure if I'm completely understanding your question (I'm guessing you meant "deallocate"). But, as with most things in computing, it depends. If you are using mmap to map a file into memory , it's probably a good idea to unmap that memory (using munmap) when you're done with it. If you're using mmap to write your own memory allocator (probably not something you will do as a beginner, but you do you), you might keep empty pages around to satisfy future requests. In that case, I wouldn't call it a memory leak, because you have a plan for reusing the memory. And, of course, if your program is about to terminate, and you're trying to maximize speed, you can just leave it and have the OS clean up your pages for you. A lot of terminal programs that just do one thing (like ls) often do this. It's still a memory leak, but (on laptops/desktops/servers) memory leaks are generally only problematic for programs that run for a long time and allocate a lot of memory blocks.
@kibme5189
@kibme5189 18 күн бұрын
Thanks for this and I am in no way saying it as hate but "music" doesn't belong in educational videos. I was distracted so many times (it is my undiagnosed adhd lol) again thank you.
@ignacionr
@ignacionr 2 жыл бұрын
why do you have books in Thai language on your shelf?
@thomaspalade9946
@thomaspalade9946 4 жыл бұрын
bravo tata
@homelessrobot
@homelessrobot 4 жыл бұрын
next level big-brain mmaping is when the file isn't a regular file, so it doesn't have anything at all to do with a disk. Like a device file. The fd for the file has to be seekable (random access... like memory), but thats the only restriction. This is how you get memory mapped device IO into userspace for efficient userspace device drivers/controllers. You can even double this back on itself to do your own userspace virtual addressing my remapping mapped portions of /proc/self/mem into a more convenient location.
@davidramsay9321
@davidramsay9321 4 жыл бұрын
Works nice when compiling in Linux but the function is not available in Linux or Windows Mingw :(
@inanismailov
@inanismailov 5 жыл бұрын
Your code, for some reason, does not compile when I use GCC or G++ in my makefile. After some fiddling I managed to get it to compile but when I run it it the file size is either 1 or 0 (its an entire novel) which is incorrect, or a segfault occurs. Any ideas on what could be causing this?
@inanismailov
@inanismailov 5 жыл бұрын
nevermind... thanks for the help :D
@JacobSorber
@JacobSorber 5 жыл бұрын
Glad you got it sorted out. Do you mind telling me what the issue was, for posterity's sake?
@darkcheese1
@darkcheese1 4 жыл бұрын
what kind of speed tool do you use? ;)
@mohssenelg4501
@mohssenelg4501 2 жыл бұрын
can i read my file line by line using mmap()
@mostafaomar5441
@mostafaomar5441 4 жыл бұрын
What is the difference between using mmap for a file and reading the whole file into memory in the beginning, accessing it in the memory as much as you want, then writing it back to the disk before the program ends? Does mmap work somehow similar to this? Also, what does mmap do when the file is way too big to fit in the memory?
@JacobSorber
@JacobSorber 4 жыл бұрын
One difference is that you will write much less code, using mmap. The main advantage is that you leverage your operating system's virtual memory system to handle things like 1) what to read into memory when, 2) what to write out when, and 3) what to do when it doesn't fit into memory. The complete answer would be too long to give in a comment, but say you have a large file, but you only need to read part of it (some at the beginning and some somewhere in the middle-but you don't know where until you read the stuff at the beginning). You can use read/fread/fseek to get just the bits of the file that you want, but the code might get a bit complicated. With mmap, this will happen automatically. The OS will just page in the parts you actually access (assuming it's using demand-paging). Hope that helps.
@feastures
@feastures 5 жыл бұрын
I've actually crashed Linux, long time ago, by allocating more memory than was physically available. Allocation (malloc()s) went all okay. But then I started writing to that memory, the kernel crashed at a certain point. I ran the same test program on SunOS, which didn't have this issue.
@JacobSorber
@JacobSorber 5 жыл бұрын
Software bugs happen in operating systems, too. Glad they seem to have fixed that issue.
@BHASKAR26able
@BHASKAR26able 4 жыл бұрын
Will this only help in printing string by string or can we also write complete output into file without doing so?
@JacobSorber
@JacobSorber 4 жыл бұрын
mmap just maps the bytes into memory. You can modify them however you choose, line by line, or just write in one big memcpy or memset operation. Or, you can just change a random byte in the middle of the file. It just treats the whole thing like a big array. It doesn't care how you access that array.
@spartacuspro88
@spartacuspro88 6 жыл бұрын
Is this just more convenient than allocating a buffer with malloc and then writing to it using fread, or are there other benefits?
@JacobSorber
@JacobSorber 6 жыл бұрын
For me, it's mostly about convenience. In some cases, it might also be a bit faster.
@HeavyRainMeditation
@HeavyRainMeditation 4 жыл бұрын
sys/mmap.h give some error in "lie no such directory"
@cursedbeats3795
@cursedbeats3795 4 жыл бұрын
it s sys/mman.h
@MECHANISMUS
@MECHANISMUS Жыл бұрын
into or onto?
@andrewdunbar828
@andrewdunbar828 4 жыл бұрын
Have you got any videos about learning Khmer?
@JacobSorber
@JacobSorber 4 жыл бұрын
No. Not really. Would you like to see more Khmer on the channel? 🤔
@andrewdunbar828
@andrewdunbar828 4 жыл бұрын
@@JacobSorber Well I was looking for a video on mmap but I'm also a language nerd and there are no good Khmer channels by foreigners. There's plenty on Chinese, Japanese, Korean, Thai, and Vietnamese. There's a niche waiting for you man!
@coolshailendra2805
@coolshailendra2805 6 жыл бұрын
is it possible to update mapped memory when base file is updated ? Suppose independent process A has read file "file.txt" where data is abcd. Later process B updated this file as abcde. How mapped memory can get this updated data ?
@JacobSorber
@JacobSorber 6 жыл бұрын
Good question. Try it out and see. I think it probably won't work. I did a quick test and didn't see the changes reflected, but there might be a way to get it to update in both directions. In general, things get a little tricky when processes are messing with an open file concurrently.
@coolshailendra2805
@coolshailendra2805 6 жыл бұрын
Hi , I tried it. With map_shared it is possible but content in file should be updated with open ( with c or python). :). All these are working fine , just one problem left that can we do mmap on a empty file ? Bcoz if we do we get a out of bound pointer which will lead to segm fault.
@JacobSorber
@JacobSorber 6 жыл бұрын
Can you explain exactly what you did that worked? Are you modifying the file in the same process that mapped the file? For the empty file case, just use "truncate" to extend the file length after creating it before mapping it.
@coolshailendra2805
@coolshailendra2805 6 жыл бұрын
Jacob Sorber I followed following steps. 1 process p1 open file as read only (as I don't wanna write ) 2 p1 mmap opened fd it with MAP_SHARED. 3 p1 print content from pointer got from mmap. 4 p1 add break in this process. I used cin/scanf 5 p2 write in same file from python from another session. 6 print again. . New changes will be reflected here.
@JacobSorber
@JacobSorber 6 жыл бұрын
What OS are you using? I tried something similar and didn't see the changes. Trying to figure out what we're going differently. Thanks.
@CerbTheUnidog
@CerbTheUnidog 6 жыл бұрын
What happens if you have an mmap'ed file in a running program, but you delete the underlying file?
@JacobSorber
@JacobSorber 6 жыл бұрын
Good question. You should try it out. I haven't explored all of the different failure scenarios, but I just tried deleting a mapped file on a simple example (on Linux), and it didn't change the program's behavior. My suspicion is that it's removing the directory entry, but not removing the file blocks while the mapping is active, but I could be wrong.
@sibendupaul6250
@sibendupaul6250 6 жыл бұрын
@Jacob, in what kind of scenarios, does the memory mapped files will be useful? can it be used to dump data directly to the RAM rather than disk while getting content from TCP socket?
@JacobSorber
@JacobSorber 6 жыл бұрын
Memory-mapped files are typically used to efficiently read large files or to edit files in place. You can use them for writing as well, but mmap won't expand the file size so you would have to do that separately. If you're getting data from a TCP socket and trying to dump it to disk, I doubt that memory mapped files will make things any easier for you. You might get some performance benefit, but I'm not sure.
@sibendupaul6250
@sibendupaul6250 6 жыл бұрын
Thanks for the reply. I am trying to directly write the data obtained from a TCP socket and write it to the virtual memory of the process by using mmap files. Not on to the disk first. Any idea how good/bad it will be?
@JacobSorber
@JacobSorber 6 жыл бұрын
Is your goal to put the data into RAM or onto the disk? Whatever you do, the data from the socket will start out in memory (in the process's virtual memory). If you want it to be written to disk, you can use either conventional file IO or memory mapped file IO. In this case, I think conventional file IO would probably be simpler to program and might be just as fast (because with mmap, you'll have to keep increasing the file size). In the end, you could implement both and time them and see which is faster.
@sibendupaul6250
@sibendupaul6250 6 жыл бұрын
My goal is to put the data into RAM (process virtual memory which is already allocated). Then How to directly put into the RAM. Consider, the size issue of the file is not there. Any suggestions now?
@JacobSorber
@JacobSorber 6 жыл бұрын
So, when you call "read" to get data from the socket, you give it a pointer (an address) to a place in memory that you want the data to be copied to. That can be a global array, or something on the stack or heap. At that point, it's in RAM (in the process's virtual memory).
@emiliocanton
@emiliocanton 5 жыл бұрын
Hey! Great videos. Can I change the file descriptor which was mapped to memory (e.g. I mmap fd 0 and once I've mapped it I change the fd to 1 so changes are made on that file)
@JacobSorber
@JacobSorber 5 жыл бұрын
Thanks. And, good question. So, are you thinking that you would call mmap once with a block of memory and then call mmap again with another file descriptor? I haven't ever tried to switch the file descriptor for a mapped block of memory. Sounds like an intriguing experiment. Just want to make sure I understand exactly what you have in mind.
@bonbonpony
@bonbonpony 4 жыл бұрын
If you close the file that is referenced by `fd`, it doesn't affect the memory mapping at all (and it doesn't actually close the file either), because `mmap` stores the file descriptor within the mapping, and it increases the reference count on that descriptor, so that the system knew that someone else is still using it. (That's why calling `fclose` on it doesn't actually close the file.) From GNU libc manual: "A new reference for the file specified by filedes is created, which is not removed by closing the file." www.gnu.org/software/libc/manual/html_node/Memory_002dmapped-I_002fO.html Also, if you modify the file on disk externally (especially if you change its size), the manual says that: "The effect of changing the size of the underlying file of a mapping on the pages that correspond to added or removed regions of the file is unspecified."
@ТомаТома-у9э
@ТомаТома-у9э 6 жыл бұрын
love you
@267praveen
@267praveen 3 жыл бұрын
What's the windows equivalent for this ?
@JacobSorber
@JacobSorber 3 жыл бұрын
I believe the function is call CreateFileMappingA, but it's been a while.
@TrippleXD545
@TrippleXD545 Ай бұрын
whats with the call of duty zombies?
@R4ngeR4pidz
@R4ngeR4pidz 4 жыл бұрын
Why is the background music so ominous? This is making me think this method is dangerous in some way o.o
@athinakyriakou4440
@athinakyriakou4440 6 жыл бұрын
So, the mmap() allocates memory in the heap like malloc()?
@JacobSorber
@JacobSorber 6 жыл бұрын
Sort of. Malloc actually uses mmap to get the memory that it then hands out to your program. Think of it this way. Mmap allows you to request blocks of memory from the kernel (the operating system's privileged code), but those blocks have to be a multiple of the system's page size (4096 on most systems). I actually made another video on this (kzbin.info/www/bejne/joeYpIWXiLunjrM). Hopefully, that helps fill in the rest of the details.
@TheMentalGentelman
@TheMentalGentelman 5 жыл бұрын
This is awesome; no more read and write calls. Whoo! Just a question though, I assume this works well with small to medium files but does this work with large files? Like, say, a one GB file?
@JacobSorber
@JacobSorber 5 жыл бұрын
You should try it out and see. (Yeah, it should within reason.)
@bonbonpony
@bonbonpony 4 жыл бұрын
This is PRECISELY what `mmap` is used for: peeking through huge files fast as if they were memory. Of course it wouldn't be very smart to map a several-gigs file as memory (unless you actually have that much of RAM, or at least swap space, but the latter would be slow anyway), because you rarely need the entire file at the same time. Instead, you map just some portion of the file (a buffer) as memory, do whatever you need with it, then you can map a different piece of the file at the same memory buffer, do some stuff with it, etc. You can think of it like opening that file in a text editor - after all you don't see the entire file in your editor, just some "window" of it (e.g. one page / one screen of text). But you can now move that window over the file to see different parts of it through that window. So the memory buffer is your "window", and mapping different parts of the file into that window at different times is like scrolling over that text ;) The good thing is that this way you don't have to load the entire file into memory to peek its last couple of bytes - now you can just map the part you're interested of into memory. Better still is that with `mmap` the part of the file that you mapped into memory doesn't have to take any actual RAM space until it's actually needed! :> The system just remembers that this range of addresses is mapped to that range of data in the file, but it loads it into actual physical memory only when someone tries to peek them. Until then, the pages that has not been accessed yet remain unmapped. Similarly, the system may swap some of the pages of your mapped memory back into the file (or swap file/partition) when it needs to free some physical memory for something else, until you try accessing those pages again (in which case it loads them back into memory).
@leikang8653
@leikang8653 5 жыл бұрын
genius!
@ohreally1021
@ohreally1021 4 жыл бұрын
Thanks for the vid! Which platforms will this work on?
@JacobSorber
@JacobSorber 4 жыл бұрын
This should work on MacOS and Linux. Most Unix-based OSes support mmap. On windows, look into the CreateFileMapping function.
@obinator9065
@obinator9065 2 жыл бұрын
1:55 the criminal music lol
@lordadamson
@lordadamson 5 жыл бұрын
I love you
@a4e69636b
@a4e69636b 3 жыл бұрын
The music sounds creepy, but thank you for the video.
@wenglish1968
@wenglish1968 4 жыл бұрын
Unfortunately, mmap isn't portable to Windows :-(
@bananalord8575
@bananalord8575 3 жыл бұрын
Great video why the creepy music tho?
@yjc149
@yjc149 2 жыл бұрын
nice
@liamquinlan3456
@liamquinlan3456 5 жыл бұрын
Why are you calling fseek at all? fopen fully buffers the file by default, and fgetc increments the FILE structure's position counter already. Also I trust you are rebooting between benchmark runs so that the first run's residuals in memory/disk cache aren't slanting the scales? (lastly, "time" makes me sad. perf stat?)
@JacobSorber
@JacobSorber 5 жыл бұрын
The point of using fseek is to not read sequentially because I expect to get very similar results if I read sequentially. As far as the timing runs. I don't recall exactly what I did, but it wasn't intended to be a scientifically-publishable experiment. Just a quick demo. I'm sure I played around with different orderings and different files, but I'm guessing I didn't reboot the machine between runs. Also, sorry to add to your sadness. Again, I was just going for a quick and dirty test, and I figured more people know and use time. I do like perf-stat. Might be a good topic for a future video.
@liamquinlan3456
@liamquinlan3456 5 жыл бұрын
@@JacobSorber see that's kinda what I mean though If you expect to get similar results *without* the random access component, then you're claiming the random access component is what's probably making the difference. Thing is, that means this test mostly boils down to showing you that array indexing is faster than calling an extern function. For what it's worth, that's probably still true at optimization levels other than what you used here...
@12crenshaw
@12crenshaw 2 жыл бұрын
I wish it was so easy with winapi...
@khomo12
@khomo12 9 ай бұрын
👍👍👍
@jenaipour2mn
@jenaipour2mn 2 жыл бұрын
great video, thank you. Music is awful though...
@IsaacC20
@IsaacC20 2 ай бұрын
The background music is extremely distracting.
@jodiethemathgenius9204
@jodiethemathgenius9204 3 жыл бұрын
omg lego
@ahrarcorson6452
@ahrarcorson6452 5 жыл бұрын
Has anyone ever told you, you look exactly like Mathew Macounaghy
@JacobSorber
@JacobSorber 5 жыл бұрын
You're the second. It's the hair, right?
@ahrarcorson6452
@ahrarcorson6452 5 жыл бұрын
@@JacobSorber hahah of course bro
@MrTomro
@MrTomro 4 жыл бұрын
a really cool vid but bruh that music
@thanlim7749
@thanlim7749 4 жыл бұрын
You must speak Khmer. I see the Khmer dictionary in the background. I speak Khmer too :)
@JacobSorber
@JacobSorber 4 жыл бұрын
ល្អណាស់ រៀននៅណា ? មកពីស្រុកខ្មែរទេ?
@thanlim7749
@thanlim7749 4 жыл бұрын
Jacob Sorber wow you do speak Khmer:). That is awesome!!! Where did you learn it? Why did you want to learn Khmer. ខ្ញុំចេះពីស្រុកខ្មែរ
@JacobSorber
@JacobSorber 4 жыл бұрын
@@thanlim7749 I lived in Phnom Penh and Kampong Cham from 1998 to 2000.
@thanlim7749
@thanlim7749 4 жыл бұрын
Jacob Sorber that is awesome!
@thanlim7749
@thanlim7749 4 жыл бұрын
Jacob Sorber can I ask you a question about programming. For example, struct donuts *ring; How can I insert integers into that ring pointer? I am not sure if we can do that ? Thanks
@michaelfrank1048
@michaelfrank1048 2 жыл бұрын
This doesn't work. Trying to access indices of the string returns random numbers instead of the actual characters.
@morganroberts5946
@morganroberts5946 5 жыл бұрын
You kinda remind me of Matthew Mcconaughey
@JacobSorber
@JacobSorber 5 жыл бұрын
A few people have said that. I'm guessing it's the hair. :)
@michaelespinoza4562
@michaelespinoza4562 Жыл бұрын
Life sucks now that i am an adult. I wish i could be a teenager again.
@littledev09
@littledev09 7 ай бұрын
'Baccha chor'
@charleshwankong
@charleshwankong 3 жыл бұрын
ABrAhAm lInCoLn
@LITHIUMINWATER
@LITHIUMINWATER 3 жыл бұрын
great video, thanks!
How processes get more memory. (mmap, brk)
6:50
Jacob Sorber
Рет қаралды 74 М.
CS 134 OS-7: Memory-Mapped Files
6:03
Neil Rhodes
Рет қаралды 31 М.
The evil clown plays a prank on the angel
00:39
超人夫妇
Рет қаралды 53 МЛН
UFC 310 : Рахмонов VS Мачадо Гэрри
05:00
Setanta Sports UFC
Рет қаралды 1,2 МЛН
Леон киллер и Оля Полякова 😹
00:42
Канал Смеха
Рет қаралды 4,7 МЛН
What if I try to malloc WAY too much memory?
4:32
Jacob Sorber
Рет қаралды 70 М.
i wrote my own memory allocator in C to prove a point
5:23
Low Level
Рет қаралды 408 М.
How to make memory read-only in your C programs.
12:57
Jacob Sorber
Рет қаралды 21 М.
File I/O With Memory Mapping Using Python mmap
23:02
Real Python
Рет қаралды 7 М.
Why Linux Has This Syscall?!
2:05:25
Tsoding
Рет қаралды 53 М.
How do I access a single bit?
11:07
Jacob Sorber
Рет қаралды 23 М.
Garbage Collection (Mark & Sweep) - Computerphile
16:22
Computerphile
Рет қаралды 251 М.
Mastering Memory: Allocation Techniques in C, C++, and ARM Assembly
17:05