Unlocking your CPU cores in Python (multiprocessing)

  Рет қаралды 313,275

mCoding

mCoding

Күн бұрын

Пікірлер: 236
@tdug1991
@tdug1991 2 жыл бұрын
It's also worth noting that smaller chunk sizes may be better for unpredictably distributed job times, as one runner may randomly grab many expensive jobs, and lock the pool when the rest of the processes finish. Great video, as always!
@ArtML
@ArtML 2 жыл бұрын
\o/ Yay! Long waited multiprocessing video! Always appreciate the humor in intros! :D Thanks a lot, I am on a path of making parallelization / multiprocessing to become a second nature in my coding - these videos help greatly! More topic suggestions: - Simple speed-ups using GPUs - Panda speedup by Dask - unlocking multiple cores - Numba, JAX and the overview of JIT compilers - Cython, and the most convenient (easy-to-use) wrappers for C++ implementations - All about Pickling, best practices/fastest ways to write picklers for novel objects
@ajflink
@ajflink 2 жыл бұрын
And GPU speedups without Nvidia.
@unusedTV
@unusedTV 2 жыл бұрын
Your video is about two years late for me! I was working on a heat transfer simulation in Python where we had to compare hundreds of different input configurations. I knew about the GIL and multiprocessing in general outside of Python, but had to figure out myself how to get it to work. Eventually I settled on a multiprocessing pool and it worked wonders, because now we could run 32 simulations in parallel (Threadripper 1950x). Quick caveat that I don't hear you mention: a lot of processors have hyperthreading/SMT (intel/amd respectively), showing double the amount of cores in the task manager. In our case we found that spawning a process for each physical core provided better results than using all logical cores.
@lawrencedoliveiro9104
@lawrencedoliveiro9104 2 жыл бұрын
5:07 Threading is also useful for turning blocking operations into nonblocking ones. For example, asyncio provides nonblocking calls for reading and writing sockets, but not for the initial socket connection. Simple solution: push that part onto a separate thread.
@SpeedingFlare
@SpeedingFlare 2 жыл бұрын
That pool thing is so cool. I like that it spawns as many processes as there are cores available. I wish my work had more CPU bound problems
@michaellin4553
@michaellin4553 2 жыл бұрын
The funny thing is, adding random noise is actually a useful thing to do. It's called dithering, and is used nearly everywhere in signal processing.
@tommucke
@tommucke 2 жыл бұрын
You would however apply it to the analog signal at about half the sampling rate in order of getting better results for the digital signal (and smoothen it with a capacitor afterwards). It makes no real sense to add it on the digital side which is the only thing python can do
@gamma26
@gamma26 2 жыл бұрын
@@tommucke Unless you're doing image processing and want to achieve that effect I suppose. Pretty niche tho
@maxim_ml
@maxim_ml 2 жыл бұрын
It can be used as data augmentation in training a speech recognition model
@louisnemzer6801
@louisnemzer6801 Жыл бұрын
'I'm going to need those sound files with random noise added in my email inbox by five pm' 😅
@lawrencedoliveiro9104
@lawrencedoliveiro9104 2 жыл бұрын
9:27 Actually, there is a faster way of sharing data between processes than sending picklable objects over pipes, and that is to use shared memory. Support for this is built into the multiprocessing module. However, you cannot put regular Python objects into shared memory: you have to use objects defined by the ctypes module. These correspond to types defined in C (as the name suggests): primitive types like int or float, also array and struct types are allowed. But avoid absolute pointers.
@ЕвгенийКрасилов-о9о
@ЕвгенийКрасилов-о9о Жыл бұрын
Aren't Managers a way to store shared python classes (via register)?
@jakemuff9407
@jakemuff9407 2 жыл бұрын
Great video! Maybe some more "real world" examples would be useful. Knowing that my code *could* be parallelized and actually parallelizing the code are two very different things. I've found that knowledge of multithreading in python does not translate to automatic code speed up. And of course no two problems are the same.
@tytywuu
@tytywuu 2 жыл бұрын
I think it is more about doing experiments on asyncio/threading/multiprocessing on your own - everyone has different Python use cases
@ibrahimaba8966
@ibrahimaba8966 2 жыл бұрын
multithreading is for io bound tasks, i use multiprocessing with zeromq to do some extensive image processing tasks!
@chndrl5649
@chndrl5649 2 жыл бұрын
Take crawling as example, it would be a huge time saver if you want crawl multiple words at a time
@chndrl5649
@chndrl5649 2 жыл бұрын
It all depends on how you can split your work.
@v0xl
@v0xl 2 жыл бұрын
python is not the right tool for high performance applicatons anyway
@zemonsim
@zemonsim 2 жыл бұрын
This video was so helpful ! I recently converted my mass encryption script to use multiprocessing. To encrypt my dataset of 450 Mb of images, it went from an estimated 11 hours to just 10 minutes, doing the work at around 750 Kb per second.
@OutlawJackC
@OutlawJackC 2 жыл бұрын
Your explanation of the GIL makes so much more sense than other people :)
@jesuscobos2201
@jesuscobos2201 2 жыл бұрын
Love your videos. I usually watch all of them just for fun but this has enabled me to speed up a very heavy optimization for my science stuff. Ty for your dedication. I can ensure that it has real world implications :)
@michaelwang3306
@michaelwang3306 2 жыл бұрын
The clearest explanation on this topic that I have ever seen! Really nice!! Thanks for sharing!
@knut-olaihelgesen3608
@knut-olaihelgesen3608 2 жыл бұрын
You are actually the best at advanced python videos! Love them so much
@superzcreationz2032
@superzcreationz2032 4 ай бұрын
This is what I was searching for.....very very very useful video...😊...that's why I subscribed you 😊
@etopowertwon
@etopowertwon 2 жыл бұрын
Multiprocessing helped me a lot recently. I had a script that periodically loads lots tons of small XML from netshare, process them and save locally, single thread ran in 30 seconds, multiprocessed ran in about 6 seconds.
@unotoli
@unotoli 2 жыл бұрын
So well explained. One nice2have thing - quick tip on how to debug (see summary of time2process) most cpu-intensive tasks (functions, like wav transformation in this case).
@daniilorekhov9191
@daniilorekhov9191 2 жыл бұрын
Would love to see a video on managing shared memory in multiprocessing scenarios
@Lolwutdesu9000
@Lolwutdesu9000 2 жыл бұрын
While I'm not using multi-threading in my current work, I'll definitely save this video so I can one day return to it!
@zlorf_youtube
@zlorf_youtube 2 жыл бұрын
Me learns a new python thing... Starts using it in every fucking uncessary place. Feels good. Really good thing to talk about typical pitfalls.
@eldarmammadov7872
@eldarmammadov7872 Жыл бұрын
liked the way to speak about all three modules asynchio, threading, multiprocessingin one vidoe
@codewithjc4617
@codewithjc4617 2 жыл бұрын
This is great content, I’m a big fan of C++ and Python and this is just amazing
@robertbrummayer4908
@robertbrummayer4908 2 жыл бұрын
Great video! Man, your videos are awesome. And every time I learn a little bit and get a little bit better, like you say :) Best wishes from Austria!
@flybuy_7983
@flybuy_7983 2 жыл бұрын
THANK YOU MY BROTHER FROM ANOTHER COUNTRY AND ANOTHER FAMILY!!!
@piotradamczyk6740
@piotradamczyk6740 2 жыл бұрын
I was looking for this kind of lessons for years. please do more.
@pamdemonia
@pamdemonia 2 жыл бұрын
It's really interesting to see the threading results, (avg ~.2.5sec per file, but only 7.6 sec total). Cool.
@walterppk1989
@walterppk1989 Жыл бұрын
Brilliant video. Absolutely flipping gold
@dlf_uk
@dlf_uk 2 жыл бұрын
What are the benefits/drawbacks of this approach vs using concurrent.futures?
@bersi3306
@bersi3306 2 жыл бұрын
Answer reside in the difference between concurrency and parallelism. When to use them also makes a lot of difference (here "CPU bounds" problems to solve with parallelism vs "I/O bounds" problems to solve with concurrency). You should also check (in the concurrent side) the difference between a Threaded function vs a coroutine.
@nocturnomedieval
@nocturnomedieval 2 жыл бұрын
This is so good and clear. A must share. BTW, how these techniques relate to the case when you are using numba with option parallel=True?
@HitAndMissLab
@HitAndMissLab 6 ай бұрын
Thank you for summarising it so wall and in a such intelligible way.
@imbesrs
@imbesrs 2 жыл бұрын
You are the only person i keep video notis on for
@lawrencedoliveiro9104
@lawrencedoliveiro9104 2 жыл бұрын
2:07 Remember that “I/O” can also include “waiting for a user to perform an action in a GUI”.
@POINTS2
@POINTS2 2 жыл бұрын
Yes! Pool is the way to go. Definitely an improvement the threading and allows you to not have worry about the GIL.
@jonathandawson3091
@jonathandawson3091 2 жыл бұрын
Not always an improvement. A process costs a lot more overhead as he explained in the video. Other languages don't have the stupid GIL, hope that it's also removed from python someday.
@sebastiangudino9377
@sebastiangudino9377 2 жыл бұрын
@@jonathandawson3091 It's a safety measure, it'll probably never be removed from python. If you really need "unsafe" threads you could probably just write your threaded function from c and inter-opt it with python. What her that's actually worth it is up you you but a lot of times it is not
@lawrencedoliveiro9104
@lawrencedoliveiro9104 2 жыл бұрын
The GIL is an integral part of reference-counting memory management. Getting rid of it completely means moving to Java-style pure garbage collection, where even the simplest of long-running scripts could end up consuming all the memory on your system. There is a project called “nogil”, which sets out to loosen some GIL restrictions a bit. That should give some useful speedups, without abandoning the GIL altogether.
@matejlinek287
@matejlinek287 2 жыл бұрын
Wow, finally a mCoding video where I didn't learn anything new :-D Thank you so much James, now I can rest in peace :)
@austingarcia6060
@austingarcia6060 2 жыл бұрын
I was about to do something involving multithreading and this video appeared. Perfect!
@JohnZakaria
@JohnZakaria 2 жыл бұрын
If numpy / scipy do the computations in C land, why don't they release the GIL and aquire it back when the computation is done? When writing a C++ module using pybind11, you have the option to release the Gil, granted that you are doing pure C++.
@julius333333
@julius333333 2 жыл бұрын
pretty sure it does
@JohnZakaria
@JohnZakaria 2 жыл бұрын
@@julius333333 if it did, then threads would speed up the computation. Just like i/o calls that do release the GIL
@jheins3
@jheins3 2 жыл бұрын
Not an expert but far and based on your comment, you probably know 100x more than I do. With that being said I am going to speculate that the traditional behavior of numpy/scify follows a standard api call to an external C/C++ optimized library (a dll in windows). The API is essentially a function that initiates the c-land magic. For error handling and for how the GIL works, the function call waits to receive the output from c-land before handing it back. Because the API is essentially a function call, the GIL cannot be released till the function returns. Again that's a guess.
@HomoSapiensMember
@HomoSapiensMember 2 жыл бұрын
really appreciate this, struggled understanding differences between map and imap...
@iUnro
@iUnro 2 жыл бұрын
Hello. Can you explain what is the difference between multiprocessing and concurrent futures package? For me they look the same so I wonder why did you chose one over another.
@joshuaowen1941
@joshuaowen1941 2 жыл бұрын
I love your videos man! Absolutely love them!
@riccardocapellino9078
@riccardocapellino9078 2 жыл бұрын
I tested this on my old code used for my thesis, which basically performs the same calculation hundreds of times with no I/O (calculates flows in a aircraft engine turbine stage). Took me 10 minutes to adjust the code and made it 40% FASTER
@quillaja
@quillaja 8 ай бұрын
god that's so much easier than what i've been doing writing all the coordination junk around queue
@marvalmoments2099
@marvalmoments2099 2 жыл бұрын
Great teaching, simple and effective, I've using this Multiprocessing with my coroutin, my program is flying, lol
@mingyi456
@mingyi456 2 жыл бұрын
Please make a video about pickable objects and pickling, I would like to know more about it.
@lawrencedoliveiro9104
@lawrencedoliveiro9104 2 жыл бұрын
2:49 Re “I say "CPU" a bunch but i actually mean "core"” --- remember that the term “core” for “CPU” was coined by Intel (and possibly other chipmakers) when they started putting the circuitry for multiple CPUs onto a single chip. The distinction isn’t really important, except that some proprietary server software from that time had licence fees that were calculated per-CPU, but somehow this was relaxed into “per-CPU-chip slot”. This way, if you had multiple CPUs in one chip, you didn’t have to pay as much as if the chips were in separate slots (which was quite common in servers in those days). Why did it matter? I guess to prevent a revolt by customers angry over licence fees ...
@user-zu1ix3yq2w
@user-zu1ix3yq2w 2 жыл бұрын
i went down a rabbit hole, MP, numba, cython, pypy... The speedup people can get is insane.
@nocturnomedieval
@nocturnomedieval 2 жыл бұрын
Could you please help me to find the answer: numba with option parallel=True how it relatesto cores/threads/process? @D:
@triola_3
@triola_3 2 жыл бұрын
5:43 that sounded like the minecraft oof
@caiomazzaferroadami
@caiomazzaferroadami Жыл бұрын
Can somebody help me out? I'm trying to put some of the things he mentioned in the video in practice and ran through something weird. In 5:50, he uses the iterable object from pool.imap_unordered() to print the return arguments from 'etl' function (filename and duration) for each element in the sounds list. I'm trying to do something similar, but my function (equivalent to his 'etl') returns just one argument instead of two. However, when I try to print each element from that iterable object, my program just freezes and I have to kill it. I can't figure out what's wrong. Note: when I convert it into a list, i. e. list(pool.imap_unordered(fcn, iterable)), it seems to work fine for some reason.
@HerChip
@HerChip 2 жыл бұрын
6:28 “blocks”; what do mean bij blocking? Without this result for loop, the processes dont seem to start: why is that? What if i dont need a for loop to do something with “results”?
@ДмитроПрищепа-д3я
@ДмитроПрищепа-д3я 2 жыл бұрын
Then you use a map instead of imap.
@mme725
@mme725 2 жыл бұрын
Nice, might play with this when I get off work later!
@cjsfriend2
@cjsfriend2 2 жыл бұрын
You should do a video on using logging alongside with the multiprocessing pool
@tobiasbergkvist4520
@tobiasbergkvist4520 2 жыл бұрын
On Linux/macOS you can use the fork-syscall to "send" things that can't be pickled, but only when using `Process`, and not when using `Pool`, since the process needs to get all the unpickleable data at startup, and can't receive it after it has started. The child processes inherits the parents memory with copy-on-write when using `fork`, meaning it only creates a copy of the memory if an attempt to modify it is made.
@m0Ray79
@m0Ray79 2 жыл бұрын
And don't forget that pure Python is not an only option. Pyrex, which is translated to C/C++, opens even more broad bridges towards performance.
@ДмитроПрищепа-д3я
@ДмитроПрищепа-д3я 2 жыл бұрын
Why use Pyrex when there's Cython tho?
@m0Ray79
@m0Ray79 2 жыл бұрын
@@ДмитроПрищепа-д3я Pyrex is a python language superset. Cython is its translator. I metioned it in my videos.
@ДмитроПрищепа-д3я
@ДмитроПрищепа-д3я 2 жыл бұрын
@@m0Ray79 Cython is also a python superset tho. And no, Cython isn't a translator for Pyrex, it's a separate thing that was influenced by Pyrex back then. And Pyrex is kinda dead with its last stable release being 12 years old.
@m0Ray79
@m0Ray79 2 жыл бұрын
​@@ДмитроПрищепа-д3я The syntax and the whole idea was introduced in Pyrex, I'm still calling it the old name. Ok, let's say Pyrex became Cython. And the file extension is still .pyx.
@SkyFly19853
@SkyFly19853 2 жыл бұрын
Very useful for video game development.
@goowatch
@goowatch 2 жыл бұрын
You should preferably use per-core display to better show what you want to explain. Thanks for sharing your experience.
@talhaibnemahmud
@talhaibnemahmud 2 жыл бұрын
Much needed video. I recently had to use multiprocessing for Image Processing & AI Game Assignment at the university. Although I used concurrent.futures.ProcessPoolExecutor() , this seems like a good option too. Maybe a comparison between these different options? 🤔
@Roule_n_Scratche
@Roule_n_Scratche 2 жыл бұрын
Hey mCoding, could you make an video about Cython?
@saketkr
@saketkr 5 ай бұрын
Awesome! This was so so helpful. Could you also make one about all these, i.e, asyncio, multi-threading, multi-processing and then workers, please?
@EvanBurnetteMusic
@EvanBurnetteMusic 2 жыл бұрын
This is great! Thanks! Would love a guide on how to use shared memory with multiprocess. I've been optimizing a wordle solver that looks for five words with 25 unique letters as in the recent Stand Up Maths video. On my 8 core machine, each subprocess ends up using half a gig of memory! My data structure is a list of variable length sets. With pool I have to resort to pool.starmap(func, zip(argList1, argList2)) to pass all the data I need into each subprocess. Compared with my naive manual multiprocess implementation, the mp pool version is 30% slower. I'm hoping it can be faster with shared memory. Again, I really appreciate that you created an almost real world problem to demonstrate multiprocessing. It gave me the context I needed to implement this with my program.
@volbla
@volbla 2 жыл бұрын
I tried using multiprocessing on my prime number sieve where each process have to write to the same array. It didn't really end up being faster (i'm probably bottlenecked by ram speed), but i did get the shared memory to work with numpy arrays. In your main process you do: shared_mem = SharedMemory(name = "John", create = True, size = #bytes) an_array = np.ndarray((#elements,), dtype = #type, buffer = shared_mem.buf) # Put your data in the array And in each subprocess you reference the memory by basically doing the same thing again. shared_mem = SharedMemory(name = "John") an_array = np.ndarray((#elements,), dtype = #type, buffer = shared_mem.buf) # Do something with the data In this case it was also useful to pass the process inputs through a Queue rather than function arguments. Then they only have to be instantiated once, even when consuming a lot of unpredictable data.
@EvanBurnetteMusic
@EvanBurnetteMusic 2 жыл бұрын
@@volbla Thanks for the queue tip I will definitely be trying that out!
@volundr
@volundr 2 жыл бұрын
This is very useful, thank you
@ali-om4uv
@ali-om4uv 2 жыл бұрын
It would be great if you could show if this can be used for Ml hyperparamerer tuning and other Ml tasks.
@codedinfortran
@codedinfortran Жыл бұрын
thank you. This made it all very clear.
@joshinils
@joshinils 2 жыл бұрын
A video on how to figure out which pieces take the most time and optimizing for time would be great. what profilers are there for python, how do i use them, how do i use them right?
@peterfisher3161
@peterfisher3161 2 жыл бұрын
"what profilers are there for python" Spyder and PyCharm have built in profilers.
@joshinils
@joshinils 2 жыл бұрын
@@peterfisher3161 ah, so I'd have to use those IDEs, not VS code... ok I'd rather have some cli solution or one that works with vs code.
@replicaacliper
@replicaacliper 2 жыл бұрын
Scalene is an amazing profiler especially on Linux
@peterfisher3161
@peterfisher3161 2 жыл бұрын
@@joshinils Quickly looking up I found cProfile, which is a built-in and can be used from the terminal. Not much popped up on VS code.
@jbusa5dimvzgkiik
@jbusa5dimvzgkiik 2 жыл бұрын
I've found yappi + gprof2dot to be really useful to find where asyncio applications are spending the CPU time.
@neelroshania7116
@neelroshania7116 2 жыл бұрын
This was awesome, thank you!
@unperrier
@unperrier 2 жыл бұрын
Can't wait for PEP 554 multiple interpreters to be mainline.
@thomasmontoya302
@thomasmontoya302 15 күн бұрын
The coffee machine will never see this coming
@firefouuu
@firefouuu 2 жыл бұрын
I still not sure why the wavfile.read is able to run in parallel thread despite the GIL. Is it just because it's C code ? So, if for any reason this was written in pure python this would not work ?
@plays1361
@plays1361 2 жыл бұрын
Great video, the program works great
@imnotkentiy
@imnotkentiy 2 жыл бұрын
-It is the end -ha. All this time i've only been using 1/16 of my true power, behold -nani?!
@rabin-io
@rabin-io Жыл бұрын
Any chance for a follow-up using this inside of a Class? And compare it with pathos.multiprocessing?
@TheGmodUser
@TheGmodUser 2 жыл бұрын
Soo, what do you do of the object isn't pickable?
@GaryHost-qs9pg
@GaryHost-qs9pg Жыл бұрын
Very well done video. thank you
@adityaalmighty1
@adityaalmighty1 Жыл бұрын
The function inside Pool() does not read global variables. Can you please show a way to fix that? It has something to do with this Queue() class, isn't it? The Docs are a bit confusing
@MihaiNicaMath
@MihaiNicaMath 2 жыл бұрын
What is this, a CPU monitor window for ants? It needs to be at least 3 times as big! Joking aside, I enjoyed the video and learned something! The pitfalls are especially helpful. Thank you :)
@MithicSpirit
@MithicSpirit 2 жыл бұрын
3:24 huh, so the python standard doesn't require a GIL?
@Darios2013
@Darios2013 2 жыл бұрын
Thank you for great explanation
@Kamel419
@Kamel419 2 жыл бұрын
I had to solve a complex problem similar to this and ended up needing to use a specific sequence of queues and workers to solve it. I think I ended up with 6 total workers, each with a "parent" worker flowing into it. I think it would be neat to showcase something like this
@aleale550
@aleale550 2 жыл бұрын
Great video! You could do a follow up parallel computing video using Dask?
@dinushkam2444
@dinushkam2444 2 жыл бұрын
Great video Very interesting stuff
@ewerybody
@ewerybody Жыл бұрын
This is cool and all for relatively small python scripts. What if I have a UI (maybe Qt for Python) and want to kick off some work on a pool of processes. I wouldn't want these processes to load (or even execute) any of the UI code 🤔
@czupryn0135
@czupryn0135 2 жыл бұрын
If i have an lost of x,y coordinats and i need to calculate distance between each one of them. so to make it faster i cut the 1000 elements array into 5 samller 200 elemnts arrays. than how do i make fisrt core process 1 array, second core the 2 one and so on?
@jdsahr
@jdsahr 2 жыл бұрын
This was just absolutely fantastic. I'm using this for processing radar data (numpy is involved), and the speedup is great! But because "experience is a dear school, and a fool will have no other" I did spend several hours banging my head against the following: >with Pool(8) as p: > print( p.map(my_etl_func, a_list_of_filenames) ) This works fine, but if you replace p.map() with p.imap(), then the print() statement prints out an address of some kind of iterator. The same thing happens with p.imap_unordered(), of course. The issue is that p.map() returns a conventional list, but p.imap() and p.imap_unordered() return an iterator. You can print(list_thing) and something useful happens, but when you print(an_iterator_thing) you get gobbledegook that isn't useful. It took me hours to figure out what was going on; hopefully that is hours that no one else has to spend. But, I have to admit that I probably learned more than those who will benefit from my folly. ---- For those who care about large binary datasets, I recommend HDF5 / h5py.
@mCoding
@mCoding 2 жыл бұрын
Great to hear you are getting value out of multiprocessing! Yes this is a common thing in Python where many iterables are lazy (like the builtin map). They return an iterator and if you really want a list just call list on them.
@felixfourcolor
@felixfourcolor Жыл бұрын
More videos on threading/asyncio please 😊
@ЭльмарИдрисов-г5э
@ЭльмарИдрисов-г5э 2 жыл бұрын
Are there any potential dangers/threats when using these methods? I understand that you can slow your program down instead of giving it speed, but besides that ? Any dangers to the computer itself or the data source (if it is coming from a database) ?
@tehseensajjad1003
@tehseensajjad1003 2 жыл бұрын
Im learning stuff myself though here's what i can say about databases. Corrections/additions are welcome. Usually there are specialized drivers for doing stuff asynchronously with the database. Also ACID should take care of not ruining the database. As for damage to the computer, no. This is the intended way of doing things in a multi core processing unit. Dont be scared to push your computer. Altough the robot uprising hasnt happened yet, its safe to say, Computers are not humans.
@ЭльмарИдрисов-г5э
@ЭльмарИдрисов-г5э 2 жыл бұрын
@@tehseensajjad1003 , thank you for your reply. I am planning to experiemnt with some of these methods for my projects. Let's see how many "time" gains it will give.
@tehseensajjad1003
@tehseensajjad1003 2 жыл бұрын
@@ЭльмарИдрисов-г5э It can get very confusing trying to design your program around doing stuff parallel or concurrent at first, but it'll click one day. Good luck friend.
@etopowertwon
@etopowertwon 2 жыл бұрын
You don't want to do non-atomic operations that can leak outside. Like in SQL check first something with SELECT and INSERT it if it was not found "if not sql("SELECT Id FROM Table WHERE Table.Foo=1"): sql("insert into Table(Foo) values(1)")" Two processes can try to insert the same value to the table at the same time.
@nathanthreeleaf4534
@nathanthreeleaf4534 5 ай бұрын
What if instead of completing a process in each pool, the data that is returned from each "pool" needs to be stored somehow?
@replicaacliper
@replicaacliper 2 жыл бұрын
I'm using Numba to optimize a program and I'm getting ~100% CPU usage. I want to run this program multiple times with independent parameters. In this case, would multiprocessing provide any real benefit over running the program one at a time?
@SageBetko
@SageBetko 2 жыл бұрын
If Numba is already fully utilizing all CPU cores, then no, the overhead of adding Python’s multiprocessing into the mix will probably just slow things down.
@pschweitzer524
@pschweitzer524 Жыл бұрын
Now the question: would running threads within each multiprocess process be even faster?
@pedroarthurstudart1999
@pedroarthurstudart1999 6 ай бұрын
5:42-5:43 and a reference to Mincraft damage taking.
@ArgumentumAdHominem
@ArgumentumAdHominem Жыл бұрын
Great video. It would be super nice to have a worked example showing strong scaling. I have found that python multiprocessing is really not that great in terms of performance. Imagine the basic scenario: you need to compute a very expensive function of index 'i' between 0 and 1000. The function takes the same time for each index. There are no shared resources between processes. Naively one would expect the performance to scale as the inverse of the number of cores, but it is actually significantly worse, and I still don't fully understand why.
@AntonioZL
@AntonioZL 2 жыл бұрын
Very useful. Thanks!
@mahmoudshihab
@mahmoudshihab 2 жыл бұрын
I didn't quite understand pitfall number 3, when you showed: `items = [np.random.normal(size=10000) for _ in range(1000)] ` Why is this a pitfall? Also, for the fib demonstration... For some reason, fib took 1.35s vs nfib took 35.05s Even the normal implementation took less time than multiprocessing at 12.93s I even copied the fib and n_fib from your github to ensure that I wasn't doing something wrong But I can't seem to replicate your results
@s7gaming767
@s7gaming767 2 жыл бұрын
This helped a lot thank you
@maheshcharyindrakanti8544
@maheshcharyindrakanti8544 2 жыл бұрын
took me a while due to mistake, but it works thanks
@aadithyavarma
@aadithyavarma 2 жыл бұрын
Doesn't Python use pass by reference instead of pass by value, so does passing an large object to a method really matter here?
@ДмитроПрищепа-д3я
@ДмитроПрищепа-д3я 2 жыл бұрын
That's true, but here we pass that to another process, which happens by value (well, almost, it's pickled, passed as a binary data and then unpickled inside of another process).
@aadithyavarma
@aadithyavarma 2 жыл бұрын
@@ДмитроПрищепа-д3я Since individual processes don't share memory, the data needs to be copied for each process. That's makes sense. Thanks!
@jaimedpcaus1
@jaimedpcaus1 2 жыл бұрын
This was a great Vid. 😊
@Malins2000
@Malins2000 2 жыл бұрын
Great Vid! When I learned to use mp was via Process object. Latest application was training TF models on GPU. I got some optimizing algorythm that searches for best Hyperparameters on models. Calculations for next parameter set to check take some time (after 50 points takes a lot of time tbh - longer then model training). So I created mp.Process() objects that deals with parameter search, and then communicates (via mp.Pipe() ) to process that builds and trains models on GPU (to avoid multiple processes access hardware the same time). Usage of mp.Queue helps with communication ;) It works great! keeps both GPU and CPU cores busy all the time :D but I've never had to use Pool though :P So mp.Process is closer to me :D
@lakeguy65616
@lakeguy65616 Жыл бұрын
I have a somewhat related question(s). I have a function where I open a file, perform a number of functions and then write the file to disk. without multiprocessing, it takes 1-2 minutes per file. I've modified my code to take advantage of the multi-cores on my pc. Its reduced the time by a factor of 3+. My problem is that its maxing out the CPU at 100% until the function finishes which means I can't use the pc for any other purpose while the multiprocessing is taking place. Heres my question. How can I reduce the work load on the CPU (even if it takes a little longer)? To process 100 files take at least 45 minutes. eventually I have 500+ files to process.... Any ideas? thank you!
@Sky007-f1k
@Sky007-f1k Жыл бұрын
from multiprocessing import Pool # Specify the number of cores to use num_cores = 4 # Change this to the desired number of cores with Pool(processes=num_cores) as pool: # Your code here Hope it helps
@necbranduc
@necbranduc 2 жыл бұрын
Awesome! What about using apply_async vs map?
@hicoop
@hicoop 2 жыл бұрын
Such a good video!
@rohitathithya3964
@rohitathithya3964 7 ай бұрын
@7:59 bruh! and slapping the like, odd number of times , wow
@anon_y_mousse
@anon_y_mousse 2 жыл бұрын
Once upon a time, multi-processing required multiple full CPU's, so it's a very understandable speako. It might also show your age. Although, it might make for an interesting video to make a Beowulf cluster with RPi's and show how to program it to calculate something in parallel. Pi itself is obvious and easy, but perhaps how to do video encoding or 3D scene rendering would be a great fit.
@harrytsang1501
@harrytsang1501 2 жыл бұрын
The best way to talk about multiprocessing and task scheduling is with RTOS. The important parts are in some 2000 lines of C and it's amazing for embedded systems
@anon_y_mousse
@anon_y_mousse 2 жыл бұрын
@@harrytsang1501 It might be pretty cool if he did a whole video series showing beginner methods in one and more advanced methods in an other. Using RPi OS with Python for the beginner series, RTOS and C for the more advanced.
@percythemagicpenguin
@percythemagicpenguin 2 жыл бұрын
I'm kicking myself for not learning this stuff earlier.
Next-Level Concurrent Programming In Python With Asyncio
19:19
ArjanCodes
Рет қаралды 185 М.
threading vs multiprocessing in python
22:31
Dave's Space
Рет қаралды 601 М.
黑天使被操控了#short #angel #clown
00:40
Super Beauty team
Рет қаралды 61 МЛН
Сестра обхитрила!
00:17
Victoria Portfolio
Рет қаралды 958 М.
25 nooby Python habits you need to ditch
9:12
mCoding
Рет қаралды 1,8 МЛН
Compiled Python is FAST
12:57
Doug Mercer
Рет қаралды 121 М.
Python is NOT Single Threaded (and how to bypass the GIL)
10:23
Jack of Some
Рет қаралды 111 М.
Python Generators
15:32
mCoding
Рет қаралды 144 М.
5 Good Python Habits
17:35
Indently
Рет қаралды 687 М.
Why You Should Think Twice Before Using Returns in Python
21:27
ArjanCodes
Рет қаралды 51 М.
Why Are Threads Needed On Single Core Processors
16:07
Core Dumped
Рет қаралды 219 М.
5 Useful Python Decorators (ft. Carberra)
14:34
Indently
Рет қаралды 110 М.
CONCURRENCY IS NOT WHAT YOU THINK
16:59
Core Dumped
Рет қаралды 127 М.
黑天使被操控了#short #angel #clown
00:40
Super Beauty team
Рет қаралды 61 МЛН