"Compacting the Uncompactable" by Bobby Powers

Рет қаралды 28,409

Strange Loop Conference

Күн бұрын

Пікірлер: 35

@n8style 5 жыл бұрын

very interesting work! thanks for repeating the questions before answering too

@MatthijsvanDuin 2 жыл бұрын

Beware that this can have performance impacts on some cpu architectures, e.g. on the ARM Cortex-A8 configured with 32K L1 data cache, if you've got an even page that aliases an odd page (by even/odd I'm referring to bit 12 of the virtual address) then accessing a cache line via one page will evict that same cache line if last accessed via the other page

@Verrisin 5 жыл бұрын

"Well, there already is one indirection.... how could we abuse it?" ...... Damn.... I would not think of that, and I would certainly not think it's possible... I am very impressed.

@Verrisin 5 жыл бұрын

oh, right, there are still the size classes and other things.... well, impressive nonetheless

@Verrisin 5 жыл бұрын

34:49 - omg, that is amazing!

@peepzorz 5 жыл бұрын

Finally, I can download more RAM!! On a more serious note, I wonder if it would be feasible to incorporate this functionality directly into Linux at the kernel level? If so, would there be any way to ensure user-space code isn't running MESH on top of kernel provided MESH?

@willrandship 5 жыл бұрын

As they've described it, it's just a modification to the libc. If you preload a userspace malloc replacement, it would replace the existing malloc, and only one instance of the algorithm would ever run per program. As long as the two implementations replace the same functions there shouldn't be a conflict, since the compactor runs independently for each program. If the system version was substantially different in structure then some issues might start to crop up.

@virkony 5 жыл бұрын

Linux kernel knows nothing about allocation within page or data segment. It is responsibility of user-spaces allocator.

@FreeScience 2 жыл бұрын

I've been trying out mimalloc, which can also be used through LD_PRELOAD. While not "compacting" as I understand it, they claim this: " *free list sharding* : instead of one big free list (per size class) we have many smaller lists per "mimalloc page" which reduces fragmentation and increases locality -- things that are allocated close in time get allocated close in memory. (A mimalloc page contains blocks of one size class and is usually 64KiB on a 64-bit system)."

@EmeryBerger 5 жыл бұрын

Code here: github.com/plasma-umass/mesh

@rickyhan7023 5 жыл бұрын

Really cool. Thanks for sharing

@christianbarnay2499 5 жыл бұрын

I think there's one important point missing in the presentation and I'm surprised nobody asked about it. What happens after the mesh to free virtual space that conflicts with already allocated space in another virtual page of the same mesh? Is it marked as occupied to avoid conflicting allocations or do you un-mesh the pages when a conflicting allocation occurs?

@EmeryBerger 5 жыл бұрын

The pages are separately marked.

@JobvanderZwan 5 жыл бұрын

So could this also be used to lower or remove the need for extra memory in languages with a GC? And on a similar not: the talk shows an example of Ruby with Mesh, but would there be benefits to building a GC "from the ground up" with the techniques of Mesh?

@EmeryBerger 5 жыл бұрын

(1) re: reducing the need for extra memory with GC: yes, it can reduce the memory footprint of systems with garbage collection, as long as they use malloc/free (e.g., Ruby & Python) (2) re: GC + Mesh: we are exploring exactly that question!

@omeryehezkely3096 5 жыл бұрын

No, GC code frees unused memory by copying the live allocations into a clean space. The need for this clean space is the root cause why GC code isn't highly efficient in memory usage.

@FoldoutCouch 5 жыл бұрын

The last audience question is similar and the speaker's answer is no. I wasn't convinced, but listen to 39:27 and decide on your own.

@OMGclueless 5 жыл бұрын

He mentioned this in the last minute or so in response to a question. GCs can do something even better, which is to move and compact individual allocations. And they have to do this anyways to identify and remove unreachable objects from the graph of live objects. So there's not much benefit to using this technique which is basically opportunistically mashing together pages when possible to recover some fraction of unused memory when you could just recover all of it by compacting the whole heap.

@user-py9cy1sy9u 5 жыл бұрын

No. With GC you need to make a tradeof of throughput, latency and memory consumption. If you want low memory consumption there are GCs that dont have big memory overhead but they slow down you application

@tobinbaker383 Жыл бұрын

I don't understand how this won't lead to VMA fragmentation over time (as VM regions are remapped to "meshed" regions of physical memory), in which case you're just trading one form of fragmentation for another. Has this been tested with very long-running, high-fragmentation server workloads?

@NoNameAtAll2 5 жыл бұрын

what stops allocation on virtual page that was meshed?

@Ratstail91 2 жыл бұрын

This is really cool! I'm tinkering with some things (like a new interpreted language) that might benefit from this, eventually... edit: dang, windows is WIP.

@artzoc 5 жыл бұрын

Wow, I am pretty impressed. Are there any cases when MESH cannot improve performance/memory (re)usage?

@willrandship 5 жыл бұрын

Any program that allocates memory without ever freeing it would see no change, as would programs that only use the stack. For example, the typical "hello world" in C does not dynamically allocate any memory in userspace. (printf with a const cstring input compiles to puts, which, in glibc at least, just does a kernel "write" syscall with no allocation, which writes to the stdout file)

@Verrisin 5 жыл бұрын

I am confused why the shuffle vector is thread local.... All threads can allocate... - Do all threads have their own pages as well?

@PrivateSi 5 жыл бұрын

Great stuff, should be built into the main virtual memory system.

@endian675 2 жыл бұрын

Very interesting! Sadly it seems that the project is all but abandoned by the academics mentioned in the video.

@Verrisin 5 жыл бұрын

Ok, I like this! When will the be the default malloc implementation? *EDIT:* Ok, I was excited and tried this. I think many people had the same first thought: *Chrome!* ..... *segfault* ... ok, but firefox works, right? .... also segfault... - Ok, well... git works fine, but doesn't really need it. XD I might try a few other things, but I think, now I know *_why._* (Don't get me wrong: I still think it's amazing. Life is just not this nice... Maybe it will work with some work, but clearly those programs do some crazy things...)

@Verrisin 5 жыл бұрын

also: chrome doesn't use default malloc anyway, so... that's a thing.

@EmeryBerger 5 жыл бұрын

Please post this as an issue with steps to reproduce these errors (details like OS, versions of Firefox, etc.). Thanks! github.com/plasma-umass/mesh

@Verrisin 5 жыл бұрын

2:08 ... wait, you have that page ... it will most likely not result in a segfault, you will just read the wrong data.... - as far as I understand - Maybe I am wrong. (but if I had pointer to that place a moment ago, it will probably not cause segfault... - is what I think anyway - I don't even do low level programming, though...)

@Verrisin 5 жыл бұрын

I am not nitpicking: I am trying to make sure I understand it correctly.