What is NUMA?

Рет қаралды 83,222

5 жыл бұрын

**********************************
Thanks for watching our videos! If you want more, check us out online at the following places:
+ Website: level1techs.com/
+ Forums: forum.level1techs.com/
+ Store: store.level1techs.com/
+ Patreon: / level1
+ L1 Twitter: / level1techs
+ L1 Facebook: / level1techs
+ L1/PGP Streaming: / teampgp
+ Wendell Twitter: / tekwendell
+ Ryan Twitter: / pgpryan
+ Krista Twitter: / kreestuh
+ Business Inquiries/Brand Integrations: Queries@level1techs.com
IMPORTANT Any email lacking “level1techs.com” should be ignored and immediately reported to Queries@level1techs.com.
-------------------------------------------------------------------------------------------------------------
Intro and Outro Music By: Kevin MacLeod (incompetech.com)
Licensed under Creative Commons: By Attribution 3.0 License
creativecommons.org/licenses/b...

Пікірлер: 192

@account0199 5 жыл бұрын

"If anybody is gonna make Non-uniform Memory access exciting, I'm your guy" earned my like right then and there man...

@hristobelchev829 3 жыл бұрын

@smccrode 5 жыл бұрын

Ma-ia-hii Ma-ia-huu Ma-ia-hoo Ma-ia-haa

@aarontheblackfox 5 жыл бұрын

You have restored my faith in humanity. I came to this video hoping for this comment.

@YuzolainYuzolain 5 жыл бұрын

Salut

@jurie911 5 жыл бұрын

I knew from the title lol....

@konarider443 5 жыл бұрын

I was looking for this comment!!!! Yes!!!!!!!

@Kazrael 5 жыл бұрын

^ this

@oldschool1079 5 жыл бұрын

*Wendel ECC-ing (Error Correcting) the crap out of himself in this video :D

@oldschool1079 5 жыл бұрын

Ah those first world problems, mixing up your cores for threads, and your 8 core CPU for your 16 core cpu. Haha :P

@FutureChaosTV 5 жыл бұрын

Content found nowhere else! Thank you! :-)

@chomper720 5 жыл бұрын

Anyone else getting stuck on 2:54 no matter what?

@Totalschaden-dd4bp 5 жыл бұрын

same here

@AdriansNetlis 5 жыл бұрын

Yep, exactly what's happening.

@smilo_don 5 жыл бұрын

I got to 2:57, jumped forward a few seconds and it continued. weird.

@MrFlatox 5 жыл бұрын

Wendell has packet loss

@b2bb 5 жыл бұрын

Just change the quality of the stream from whatever you're running it at.. Mine went through just fine.

@james2042 5 жыл бұрын

How to make NUMA fun, step 1: be Wendell, step 2: you're already done

@koplakjaya7131 5 жыл бұрын

techporn

@DeepStone-6 5 жыл бұрын

Numa? Is it as bad as Ligma?

@andrewe3165 5 жыл бұрын

but not as bad as smegma. i'll get my coat

@vicious12394 4 жыл бұрын

I think joe really likes ligma

@edsknife 4 жыл бұрын

What's Ligma?

@vicious12394 4 жыл бұрын

@@edsknife joe mama likes to ligma balls

@noble_lemon 7 ай бұрын

Ligmanuma is worse

@stevenstevelinck 5 жыл бұрын

this is the first channel i consciously subscribed. it looks to me you know what your talking about and you produce serious videos with real info. well done.

@frozenfar7231 5 жыл бұрын

love the graphics on the TV. but really thank you so much for the forum and this great videos.

@peter2f6 5 жыл бұрын

Wendell, You make superb videos! Keep up the great work, Sir!

@Tosuzu0321 5 жыл бұрын

Probably the few but I love watching these informative videos.

@ASG16_4 5 жыл бұрын

Super informative video! glad I took the time to watch

@krisspkriss 5 жыл бұрын

Having a NUMA system requires a bit of management to get the best performance, but once you get used to it, the pros outweigh the hassles.

@phil.4688 5 жыл бұрын

I second that! I bought a couple of used Xeon E5-2687W in 2016 (16 physical cores total, 3.1 GHz base / turbo 3.8, for less than $600) and it's so versatile (64 GB RAM, 7 PCIe slots linked to either CPU 1 or 2 etc.) I love that workstation / server. Performance is on par with 16-cores Threadrippers. Thank you, Ebay!

@charlesturner897 5 жыл бұрын

Just for a quick rundown, because I don't really know much about numa, if I have 2 processors (x5690s) and a have a game running on one processor's cores but the GPU is on the PCIe lanes for the others I'm theoretically losing preformance?

@phil.4688 5 жыл бұрын

@@charlesturner897 yes. In particular the thread running the GPU should be in core 0 of the CPU controlling its PCIe lanes. Since the host OS would typically use core 0 of CPU 1 (most devices being attached to it I suppose), running the GPU off of CPU 2 may yield a smoother experience in some cases. You can check your topology to know which core numbers to target for the guest. If hyperthreaded, give it pairs accordingly (e.g. on a 16C/32T dual socket system from Intel, CPU 2 physical cores are 8-15 and their hyperthreads are cores 24-31.

@Neumah 5 жыл бұрын

Love it! This is high quality nerd content. And you can tell Wendell means business because of the awesome shirt.

@Rohrschacht 5 жыл бұрын

These kind of deep insights are the reason I am a fan of this channel

@tomsimonis 5 жыл бұрын

Very informative! thanks. Had some giggles along the way.

@Vespassassina 3 жыл бұрын

This is an excellent video. Thanks for taking the time to properly explain NUMA.

@SR-fi8ef 5 жыл бұрын

Pure class, great work!

@Kenny_Ded 5 жыл бұрын

Thanks for this video. I learned something today.

@b2bb 5 жыл бұрын

For the hiccup at 2:54 ----------- > Just change the quality of the stream from whatever you're running it at in the middle of the hitch. It will push through.

@michaelmyers4484 4 жыл бұрын

The R710 seems to be THE thing for home labs and learning, I see it in almost every server-related video on KZbin. :)

@seabass6106 4 жыл бұрын

very interesting! thanks Wendell!!

@alessandrosuppini943 5 жыл бұрын

Man, you’ve managed to clarify to the rest of us such a complex topic... you are a champion Wendell! Love your channel buddy, keep posting Threadripper stuff (maybe a comparison between 2990wx and 2790wx once released in Oct) 👍

@rochr4 5 жыл бұрын

Fantastic content, ty.

@phil.4688 5 жыл бұрын

Awesome timing. I was re-watching your Threadripper livestream from January regarding Looking Glass / VFIO, as I'm in the process of setting up such a system on my dual Xeon E5-2687W (very decent workstation chips, 2012). I'm hesitating between Arch and Fedora for the host OS... #nerdproblems

@oldschool1079 5 жыл бұрын

I get it when Wendell explains NUMA latency using the old x2 socket Xeon server with 2 physical CPU slots and 2 separate I/O riser slots, 2 RAM banks slapped together with that Intel QuickPath, because when cpu1 has to ask cpu2 for some I/O or memory request because maybe the stuff it needs in on the other I/O riser/RAM bank, but I want to know would ANY latency be present on an x399 system if there is only ONE I/O (1 GPU on PCI-E x16 slot) connected directly to TR CPU and only ONE of those RAM banks were populated? TR then only has one of those on the motherboard it can use when running a game for example If using 2 GPU slots and all RAM slots were populated it is easy to understand why then latency would be introduced. Great video Wendell! You made NUMA easy to understand. I wish you would soon make a video about how to properly assign AMD Ryzen/TR CPU cores to a Virtual Machines in VMware Workstation and Hyper-V when virtualizing stuff for most optimized VM experience. EDIT: NVM I watched the video till the end :P seems it does not matter how many slots are populated if you do not limit the app or game to use specific cores on specific CCX.

@Wourghk 5 жыл бұрын

NUMA NUMA node, NUMA NUMA NUMA node

@TwoWheeledDecaf 5 жыл бұрын

Very informative video, thankyou

@cheesefries7436 4 жыл бұрын

Very interesting video, thanks

@srmunir 5 жыл бұрын

Loved the video! You know what you're talking about

@Daniel-zr4pk 5 жыл бұрын

this channel is absolute gold

@jagardina 5 жыл бұрын

Good video, very informative.

@Gengh13 5 жыл бұрын

Awesome, very good explanation.

@MrMoxes 5 жыл бұрын

Yes, teach us Sensei. Class is in session.

@techfan7808 5 жыл бұрын

This is a great Video Wendell. Could you take a deeper dive and explain why we would, or would not want to manage memory, cpu cores, for every situation to optimize performance?

@Pheatrix 5 жыл бұрын

Great timing! My next exam is about parallelcomputing and parallelprogramming. Which includes NUMA / UMA. This video was really great and helpful!

@Conenion 5 жыл бұрын

Great! So they will teach you the most important thing. It's Amdahl's Law. en.wikipedia.org/wiki/Amdahl%27s_law Then you know why Cinebench render bench is such a deceptive benchmark for general (desktop/gaming...) performance.

@Pheatrix 5 жыл бұрын

Pretty much every benchmark is deceptive. Unless you would use pretty much the same programms you would use normally. Even Firestrike is deceptive for gaming performance. Especially when considering SLI. And sometimes even the build-in benchmarks in games are far from accurate of in-game performance. You should use a benchmark according to your use case. If you really want to render stuff on your CPU I guess cinebench is a great benchmark. In every other use case: not really great. And from what I've heard Windows has problems dealing with the number of threads on a 2990WX so you also have to consider the OS used for benchmarking / your use case. PCs are complex. There is no way there can be one easy answer (or performance number) that says everything you need to know....

@Conenion 5 жыл бұрын

> Pretty much every benchmark is deceptive. Cinebench render bench is an _absolute_ best case scenario. One has to be aware of that fact.

@davey3765 5 жыл бұрын

Using memory benchmarks I noticed a general improvement on the 1950X using UMA regarding peak bandwidth, I was unable to test latency

@morningreis5018 5 жыл бұрын

This is a great video Wendell, and I think it's going t be an important consideration going in the future because AMD would not have been able to achieve high core counts without multiples dies and thus multiple NUMA nodes, and Intel isn't going to compete without following a similar path. I have been experimenting with my Skylake-X processor (7820X) and on some games, I get a vastly improved framerate by setting core affinity manually. But the Skylake-X is still on one node, so I'm not sure what the exact cause would be. It's more apparent with older titles that do not multithread well. WoW is a good example, and I get best framerates in that choosing two logical cores which are not on the same physical core (ie cores 0 and 2, or 13, 15 etc). I will have to test on GTA V. I don't know if it's do do with the userland/system level processes you mentioned. I was originally thinking it was due to Intel using a Mesh instead of the Ring Bus. I'd like to hear your thoughts on that. I love this sort of content, keep it up!

@WillmerWonsang 4 жыл бұрын

Great job explaining Nonuniform Memory Access.

@syth-1 5 жыл бұрын

off topic but i just installed a new ssd into my laptop (that was suppose to go into my future build but was just collecting dust - and yes after installing chrome i watched an level1tech vid ^^) but im amazed at how in many different aspects an ssd has given light to this dying laptop... no more stuttery video playback is the biggest change,

@ricardoabh3242 2 жыл бұрын

It was riveting!

@suehunt622 2 жыл бұрын

that was pretty cool............... thank you wendel...........

@leviathanpriim3951 5 жыл бұрын

thanks Wendell

@myselfremade 5 жыл бұрын

Vrei sa pleci dar nu ma, nu ma iei nu ma, nu ma iei, nu ma, nu ma, nu ma iei Nu Ma, Nu Ma Iei, nu ma, nu ma, nu ma iei Chipul tau si dragostea din tei Mi-amintesc de ochii tai kzbin.info/www/bejne/gZ7Xq4R5iM1prs0

@BK-id4ft 5 жыл бұрын

What a classic

@nicoladellino8124 5 жыл бұрын

Nice video

@rossmpostpro 5 жыл бұрын

Enjoy the content man, can I just check something quickly though? So if all the cores are on one node, is NUMA not possible? Does it depend on there being a interconnect such as quickpath or more than one physical socket? Thanks man

@junkerzn7312 5 жыл бұрын

UMA mode doesn't just lie. It lies AND it also shuffles the memory so (for example), each block of 256 bytes alternates between NUMA nodes. This increases the chances that general memory allocations by the OS will cover both nodes and spread the memory load out. Some will have higher latencies, some will have lower latencies. On average it works out pretty well because latency is just one of several stall conditions that can slow a CPU down. The mechanism that makes UMA work reasonably well is this shuffling. Basically, one of the low address bits is exchanged with a high address bit in the physical address space. This shuffling appears to require symmetric NUMA. I have never been able to enable any sort of UMA mode on my 2990WX, and I'm guessing the reason is that the 2990WX has assymetric NUMA, where two of the four CPU dies have no memory connected to them at all. I guess the address bit shuffling hardware doesn't work for that particular situation. That is something AMD should fix. The 2990WX would benefit greatly from having a UMA mode. Your description of the spectre mitigation is ... well, it's actually pretty wildly off. Sorry. Though you got fairly close. First, the system overhead is not really an issue of switching between unrelated processes. There is no need to segregate processes to avoid that. There is overhead there, but it is not synchronous... it's no in the 'hot path' that would interfere with a program on a continuous basis and effect its performance. Where the spectre mitigation really hurts is when a user process does a system call. This is a synchronous operation, it always occurs on the same cpu as the process making the call. The syscall trampoline must issue the Spectre related MSRs to firewall the cache effects and these MSR's eat around 2uS (2000nS) of overhead when all is said and done. It's very nasty. The same context switch issue gets even worse when we are talking about VM's. VM's have to execute the spectre mitigation not only inside the VM, but the host also has to execute the mitigation when the VM punts out to the host (which it does for a lot of things, particularly if you can't map PCIe devices directly into the guest). The Spectre MSR's don't just clear the L1 cache. In fact, I'm pretty sure they don't clear the L1 cache at all. They primarily have to clear the branch cache and mess around with the call/return hardware cache in the cpu core. And they also have to firewall speculative execution from crossing the boundary. The Meltdown fix essentially requires isolating the MMU map between user and supervisor mode, which means that %cr3 has to be written to on the context switch (e.g. for a system call). Twice in fact, one to get into the kernel, and one to get out of the kernel. This overhead adds 100-300nS or so depending on the cpu. AMD is less vulnerable to Spectre and invulnerable to Meltdown. It is less vulnerable to Spectre because it uses full 64-bit address tags in its branch cache tags while Intel uses fewer address bits and XOR's the higher address bits into the lower address bits. This allows a user program (on Intel) to directly control the branch cache for memory locations in kernel memory space. And for Meltdown, Intel CPUs will speculatively read memory through the TLB while ignoring the U (supervisor/user) bit in the PTE, which allows a user program to mess with the L1 caches related to any kernel memory location. AMD's speculative reads honor the U bit and do not have this problem. So, basically, meltdown adds 100-300ns of overhead, and Specter adds 2000nS of overhead to any system call. This is also true for interrupts. That is a lot of overhead considering that the nominal overhead without the mitigations is typically only 70nS. -Matt

@mduckernz 5 жыл бұрын

Good explanation. This matches what I know about these vulns too.

@Zarcondeegrissom 5 жыл бұрын

I remember having major headaches just trying to not have threads stomp all over system process back in the day, and the only fix was just as aggravating as you could not set an affinity and have it stick. the instant the thread or process closes that affinity is forgotten the next time the thread or process starts up, so you basically had to manually set the affinity every single time you opened an app (on XP through win7). I've yet to even bother looking at win10 affinity, as it's a not worth the effort kind of thing at this point, and easy to just assume setting affinity on windows is more effort than it is worth. As for non-windows, well, yeah, remembering affinity has been a thing going way back, and it has always been a set it and forget it kind of thing, lol. I had thought about looking at the 'topo' of my FX8350 comp (win7), however, I hadn't had the time to do more than just think about it in passing. And I suspect I already know just how bad the M5A97 (rev original POJ) is laid out, lol. The top 16x slot will not post with a GPU over 50 or so watts, so the GTX1050ti had to go in the lower 4x slot (ouch), and I guess like the AM3-99 chipset (I forget the full name of it) that slot goes through the chipset. I don't game on the system, so it works well enough, lol. And I know the 4x PCIe 2.0 bus is NOT holding back the GPU, as the GPU has no prob going to 100% load with the bus never peeking over 50% usage from my testing some time ago. Great vid Wendal and Crew. B)

@Fee.1 5 жыл бұрын

You make me regret not going forward with computer science after hurricane Katrina :/

@randy206 Жыл бұрын

I run a 5950x. I'm going to try process lasso and or try modifying the core affinity for my games. I know that my system is not running numa but there is still some latency with spreading a task across multiple ccd's as far as I understand. Thanks for this.

@yassine_t 5 жыл бұрын

This is gold :D

@nastassia19 5 жыл бұрын

I couldn't help but notice DisplayCal on the monitor in the background. Doing some calibration? Nifty piece of software really.

@BasilLange 5 жыл бұрын

How could he ever survive those long podcasts with the other guy. His depth of knowledge is great.

@Zarrx 5 жыл бұрын

haha I was just going through the best performance guide for VMware and never heard of NUMA even though it was mentioned to have it enabled and to use vNUMA never heard of it before hand - gonna score some pts with the boss

@CharlesLijt 4 жыл бұрын

Who comes here after watching Linus Epyc video?

@lawrencedoliveiro9104 5 жыл бұрын

09:46 Something else for the Windows users to download separately and install ... and then figure out how to keep up-to-date.

@baudneo 5 жыл бұрын

This dude does a good job of explaining things without boring the sh*t out of me.

@chromerims Жыл бұрын

verdict: you indeed made NUMA exciting (0:53) 👍 Thank you, Wendell

@knightmarex13 5 жыл бұрын

Flush level1? All I can think about now is Wendell, Ryan, and Krista being flushed

@skshandilyamari 2 жыл бұрын

what is that software running on the big monitor? I want that :-)

@spongeyperson 5 жыл бұрын

The NUMA Graph that you showed kinda reminds me of the "It's a Unix System" on jurassic park, except this is a real Unix-ish system xD

@solar3mpire 5 жыл бұрын

Using ProcessLasso since 2015

@mechanicalfluff 5 жыл бұрын

15:38 - don't worry wendell, we won't flush you :P

@alejandroberistain4831 5 жыл бұрын

I love your explanations...lol

@justarandomname420 5 жыл бұрын

Engagement!

@gavinearls2935 5 жыл бұрын

Engagement X2

@duvalchris21 Жыл бұрын

thanks was helpful... not a gamer but good to know when it comes to Sql Server.

@fredEVOIX 4 жыл бұрын

here's one of those report you talk about ;) mine I do that for an old game (torchlight 2) it already had stuttering problems on a 9900K (8c) but on 3960x (24c) it was way worse so I did the set affinity thing and only select threads 40-47 (4 cores+MT) and tada frametimes and fps go from a rollercoaster to pretty much an horizontal line, older Forza Horizon games on UWP/MS store also had that big problem as the game was decrypting the "secure" data of the game aka the textures and that in a open world game meant your core0 was at 100% use all the time basically don't know if they fixed that but it was a complaint often found in the officialk Turn10 forums, process lasso is also something game modders/tweakers have used for years

@TheShorterboy 5 жыл бұрын

Sweet I'm saved my 4K TV gets confused when I try to run 1080p, so I'm not losing anything sticking with 4K.

@shariryaniv1978 4 жыл бұрын

Great job!!! Please, next time try to address the the issue at hand in regard to network card and network performance... :-). Thank you 🙏

@daviddow5591 5 жыл бұрын

Powershell!

@Ph42oN 5 жыл бұрын

But is there really any benefit to running numa mode on threadripper when you can just set affinity to cores within 1 die or 1 ccx, depending how many threads it benefits from? Even on ryzen 1600x setting affinity to threads within 1 ccx can improve framerate on some games. Also, if you want some more advanced tweaking, process hacker is program that can set affinity individually for every thread separately, but unfortunately you will have to do that every time manually for every thread when you open program.

@lawrencedoliveiro9104 5 жыл бұрын

1:43 Topology ≠ topography!

@glbernini0 5 жыл бұрын

Clive Cussler National Underwater And Marine Agency Great adventure books!

@Tom-nh5lr 2 жыл бұрын

I’m so confused so what’s better for latency no numa node or multiple

@DAVIDGREGORYKERR 5 жыл бұрын

This is very interesting and should make the DIDs simulator EF2000 Tact-com run much faster

@free2killoriginal 5 жыл бұрын

Great

@ronaldagorsah7954 4 ай бұрын

Let him talk about NUMA ...love your videos. Thanks for the schooling XD

@Luredreier 5 жыл бұрын

The picture on the screen is all white during most of the video for me... Can you add pictures/screenshots of those in the description?

@jthompson120db 5 жыл бұрын

thats strange the video won't play past a little after 2:45 , just skipped a little ahead and its playing fine which I hope it still counts as a view in the system

@elahn_i 5 жыл бұрын

Same happened to me.

@JasonOfTGA 5 жыл бұрын

hehehe. I clicked like because my 2009 Macpro Pro died (2 x 3.46GHz Xeons with 32 Gig ECC Memory). Now I'm back on my ~2008 core2 system, which I upgraded to a Quad core from South Korea on eBay, and maxxed the memory to 8gB. The 'old' workhorses you are showing made the internet way better. Time will tell if the newbies can do something awesome ;)

@clintrussell 5 жыл бұрын

I'm sure there is a song named numa,, I'm humming it now

@warmwaffles 5 жыл бұрын

engagement

@andljoy 5 жыл бұрын

Hey a dell R710 ( well google search app) runs my works 90TB raw raid just fine !

@MuhannadALAmeri 5 жыл бұрын

👍

@arielerosa3204 3 жыл бұрын

numa is the coachella of it yeeiii

@isbestlizard 4 жыл бұрын

They still have 16MB of cache.. that's like.. the entire main memory of my first PC. it ran windows 95 in 16 meg :O

@Fee.1 5 жыл бұрын

Changing the CPU affinity sounds like drug/chemical’s affinity for the receptor it’s seated on. Flushing L1 cache sounds like taking a receptor antagonist while the receptor is occupied by an agonist. What process has the authority to “kick” off you off your set affinity ? Anything?

@networkingjoe3635 5 жыл бұрын

Hey I just got a workstation off ebay... the machine only POSTs when I disable NUMA... when I enable it, it's basically a dice roll if it POSTs and 100% doesn't get into the OS. What could be the issue?

@josh223 5 жыл бұрын

at 2:54 the video is broken, thanks youtube

@HiMyNameIsColdguy 5 жыл бұрын

I remember numa numa having a dance. Where is the dance?

@jayxu6355 5 жыл бұрын

isn't flushing L1 cache related to resolving the meltdown issue ?

@peterjansen4826 5 жыл бұрын

The big question: why doesn't MS allocate cores which aren't doing anyghing (sort of, no load) to games? Why put a part of the game its load on the same CPU core as Windows? That doesn't make any sense to me. I noticed that Wendell is running Ubuntu. Preparations for the PCIe passthrough on Ubuntu tutorial?

@Conenion 5 жыл бұрын

You (usually) do not pin a thread to a core. Threads wander from one core to another.

@peterjansen4826 5 жыл бұрын

That can give problems for Ryzen (CCX) if Windows/Linux lets threads wander from one thread to another in between different CCX's. However, the point is, if a game only uses up to 6 cores and the CPU has 8 cores while the CPU has 4 cores per CCX, then why not reserving 4 cores on 1 CCX and 2 cores on the other for the game and confine Windows to those two remaining cores. My point was that you want for games to use as many cores on the same CCX as possible.

@Conenion 5 жыл бұрын

> However, the point is, if a game only uses up to 6 cores and the CPU has 8 cores A game (or any other application) does not "use 6 cores" if you want to be fully correct. It uses/has on *average* 6 *software* threads that are runnable (i.e. not blocked/waiting). Threads are dynamic, they come and go. And there are many more threads than hardware cores. There is no direct relation from a thread to a core. It is up to the scheduler (and depending on priorities, time a thread has already run etc.) to decide which thread runs on which core. If a higher prioritized thread becomes runnable the scheduler has to pick a core that is either idling or another thread has to be stopped. The scheduler however will try to keep a thread on one core as long as possible, to allow for the cache of that core to be kept hot. NUMA is making all of this much more complicated as the scheduler must be aware of it. With SMP things are easier since all cores are created equal (including SMT/Hyper Threading "cores").

@odomobo 5 жыл бұрын

Peter Jansen A particular core can't have 2 things running at the same time -- there's only 1 thread ever running per core. It's just that the cores are timesliced by the processor. In reality, a game can go for periods of time with all threads halted (not running on any cores), and then suddenly a bunch of threads spring to life. So, if windows receives other work to do (from the OS or other background processes) while some game threads are sleeping, it sees unused cores and may put them to use. A good analogy is a restaurant. Each table will be used by dozens of parties during a day, but only 1 at a time. If the restaurant had your favorite table reserved 24/7, they'd miss out on business when you weren't there, if they were otherwise at max capacity.

@RaVuK100 5 жыл бұрын

Only 5:00 in video already like clicking noise

@silasmayes7954 5 жыл бұрын

So the os scheduler is a better for dealing with specter and meltdown than any changes to the microcode? Obviously on lower core count processors this isn't the case, but for servers why is Intel responsible for fixing the issue? Please correct me if I'm misunderstanding this.

@triplefurious618 3 жыл бұрын

Does hp z600 has numa?

@Tom-nh5lr 2 жыл бұрын

He’s saying it improves performance but then saying it increases latency

@andibiront2316 5 жыл бұрын

Intel server grade hardware also supports "UMA". I should say... you really can't make a NUMA topology become UMA by flipping a switch. The moment you have two CPUs with an IMC on each one you have NUMA nodes. What you can do, and both Intel and AMD support this, is setting node-interleaving on. This will present an UMA like topology to the OS (I'd rather call it Sufficiently Uniform Memory Architecture or SUMA, as some people do). This will only change the memory addressing to be interleaved, so when you access memory in sequence you will hit local memory, remote memory and so on. The latency will "look" uniform but it is not actually. I've read about workloads that prefer this predictable approach, but I've never seen a case where it is necessary in real life.