now *this* is my kind of computerphile video. low level, technical, mathematical, great stuff.
@Satchboy719 жыл бұрын
+Rob Mckennie Agreed. I hope they do an "extras" video on this topic.
@Crobisaur9 жыл бұрын
+Rob Mckennie If you want to learn more about image processing, a great book (and not too expensive) is ISBN-13: 978-0123965493.
@indianakernick37889 жыл бұрын
+xIsheyx Yeah me too!
@matthiaswandel9 жыл бұрын
To do a realiztic blur, you really should convert pixel values to light values (de-gamma), then filter, then convert back to pixel values. Gamma is typically around 2.3. which means, light intensity is pixel value raised to 2.3. If you don't do this, dark smears over light in ways that it doesn't do in real life.
@lukasdon00079 жыл бұрын
+Matthias Wandel Excellent comment. Photoshop actually has an option for this (to perfom RGB-blends with linear gamma) However, the problem is not just with blurs. It's for pretty much all operations you perform. Which begs the question: why not work in a linear color space alltogether, and only apply gamma to the output. That's how lightroom tends to do stuff: gamma is only applied for visual preview and for output, but internally it's all a linear colorspace. In Photoshop, you can choose to do all your work in a colorspace such as LAB, which would solve most of your problems. However, sadly LAB is a bit of an underdog because it requires people to learn a new colorspace and adapt their workflow accordingly.
@lcq929 жыл бұрын
+Lukas don How's this option called in Photoshop ?
@Tymon00009 жыл бұрын
+Matthias Wandel Yea and by some reason this is not default option in many programs.
@lesselp9 жыл бұрын
+Matthias Wandel You`re a woodwork guy,Wandel.This stuff is out of your area.
@db112nl9 жыл бұрын
+lesselp Haha but you forget he also does the video editing !
@LanIost9 жыл бұрын
I'm so happy now. I've been coding since I was a kid and never quite (had to but still...) understood how this sort of thing worked mostly because any time I tried to read/watch anything on the subject it SOLELY explained it using math too complex for me at the age I was researching it at OR was a snooze-fest. This was done in a way that IMMEDIATELY made sense and stayed that way the entire time. THANK YOU for finally solving this mystery for me!
@DeJayHank9 жыл бұрын
+LanIost If you like programming and would like to try do some image processing of your own I would recommend the OpenCV library (available for C++ Python and Matlab). It's very user friendly and popular, so it's easy to google for help. Loading an image and applying a Gaussian blur is as easy as (using python as an example): import cv2 myImg = cv2.imread('path/to/your/image.png'') myImg = cv2.GaussianBlur(myImg, (5,5), 2) cv2.imshow('Blurred image', myImg) cv2.waitKey(0)
@WhyFi599 жыл бұрын
This video lead me to wondering something. Although some things are taught about how images are represented and manipulated by computers in CS courses and other such resources, nothing is taught about sound, that I could find. That is a shame. I would like to see something about digital sound on Computerphile, if possible.
@TheBluMeeny9 жыл бұрын
+WhyFi59 Meto!
@tobsco29 жыл бұрын
Have a look at the videos on the Xiph.Org site, they're the guys that made the FLAC and ogg vorbis codecs.
@quantumsmith3719 жыл бұрын
Digital sounds for the most part is signal processing with Fourier Transformations. It lends itself more to Mathematics and EE.
@WhyFi599 жыл бұрын
Quantam Smith Sure, but I'm sure there's a programatic way of dealing with sound, otherwise there wouldn't be sound files and you'd require a dedicated sound card just to process sound (which is actually opposite to how most PCs are, with dedicated graphics cards but no dedicated sound cards). Maybe I'm messing things up though. tobsco2 Thank you for the suggestion.
@quantumsmith3719 жыл бұрын
***** most sound cards are advanced DACs they may do some sound processing, but for the most part they turn 1s and 0s into an analog signal. Programmatically sound is transformed using an FFT(Fast Fourier Transform). In which the constants of the Fourier series are changed which affects the sound wave. If you are interested in sound files I would say that is almost entirely under the field of compression as an uncompressed sound file is nothing more than a list of intensity values. If that is your thing then you should look at Discrete Cosine Tranforms and how they relate to compression
@gunjeetsingh908 жыл бұрын
Mike Pound and Tom Scott..Ill watch any videos these guys make..
@napillnik9 жыл бұрын
Having a square kernel is the correct way to do blurs, but it's also very computationally intensive. If you want something fast, but with the same effect, you do 2 passes through an image, applying 2 kernels each time. One time the kernel is a line vector, and the second time it is a column vector. So if you want a blur with a kernel that's 20x20, you need to make 400 multiplications for each pixel, and 399 additions, which is 799 operations. You could achieve the same with 2 times passing the images with just a 1x20 vector and 20x1 vector. For each pixel you make 20 multiplications and 19 additions, and you do that twice, amounting to 39 operations per pixel, which is 10 times faster than a square kernel. The advantage grows exponentially as the kernel size grows. The first approach (square kernel) has an algorithm complexity of O(n^2), while the second approach has the complexity O(n). Try it out. The effects are the same visually. The images will be different, but you won't be able to tell which one is the real Gaussian Blur.
@yawarjamal9094 жыл бұрын
I wish I had an opportunity to enroll in his class. I would have enrolled in each and every single class of his. Just simply love Dr Mikes teaching style and his passion towards teaching.
@Soundole7 жыл бұрын
This was a fantastic explanation, and I really appreciated the graphics used to demonstrate the processes.
@fukyougooglification8 жыл бұрын
i love the way this guy talks, awesome
@SyntekkTeam9 жыл бұрын
Great video, however I was a little dissapointed that no examples were shown. I would have liked to have seen how an image looks different when a gaussian vs a mean blur is applied and also how various standard deviations and radii would affect the image.
@evilakah19 жыл бұрын
+Arend Peter Castelein Download photoshop, try it yourself. It's really easy to do, and worth trying. Cheers! :)
@joseph100979 жыл бұрын
+Arend Peter Castelein Exactly! Thought the exact same thing.
@B20C08 жыл бұрын
Also not hard to include it. Of course I can do it myself but it kinda misses the point of an educational video to talk about the theory and don't display the outcome.
@meepk6337 жыл бұрын
Also, there were examples. Just look closer.
@Triantalex2 ай бұрын
false.
@EliteWheatProdctionZ2 жыл бұрын
As an outsider, this explanation was super easy to grasp and gave me a ton of insight into image processing, cheers Dr. Pound!
@xavinitram969 жыл бұрын
pleaaaase more videos with Mike!!! These are great!
@tminus2006 жыл бұрын
I suggest making a video discussing the compression and manipulation of audio files. There are some parallels between manipulating audio files and compressing JPEG files via Discrete Cosine Transform. It would be excellent to connect the dots! Great work!
@whileimgaming6 жыл бұрын
Dr. Mike Pound maybe the best. He has the patience to do arts and crafts and _really_ explain things. Superb!
@PixelOutlaw9 жыл бұрын
A lot of color changing filters can be explained by simply using the intensity of a pixel to map along a range of colors. No kernel matrix required. (Colorize, Sepia etc). For a fun bonus, you can make a nice gradient of colors representing a hightmap landscape of ocean and hills and map that to Perlin Noise for procedural 2D continents or maps. :D
@glyochi85754 жыл бұрын
6:51 i like how the editor cut to that guy closing his calculator even though its not necessary
@AlexanderMcNulty927 жыл бұрын
he casually gave us the most concise summation of Standard deviation i've ever heard.
@ayushgupta53897 жыл бұрын
Man you are awesome! Every video of your channel is so easy to understand and has in depth knowledge!
@DroidTsuenik94 жыл бұрын
3:06 @Computerphile The Average filter in Photoshop is NOT using the mean/uniform kernel. All this average does is it finds the average value of all selected pixels and assigns it to all of them. The mean filter that the video talks about corresponds to the "Blur" option under Photoshop's filters.
@Faxter3139 жыл бұрын
I've actually heard some lectures about this stuff. But this described some aspects from another perspective - that made some things clearer to me. Thanks!
@Eugensson9 жыл бұрын
He forget to mention that if you make central value positive, and border values negative then you have a sharpness filter. and if you make centre value zero then you have a border filter.
@rjfaber19919 жыл бұрын
+Dmítrij Ačkásov Makes absolute sense, yes. Thanks for that bit of trivia!
@martinsavc32029 жыл бұрын
+Robert Faber Yeah, but not quite. If you want to find edges, you want to subtract values on one side from the values on the other. Or subtract the center from the surrounding values. You need to make sure that the weights in the kernel sum to 0 - this makes sure that when all the values are the same - a flat surface, the result of the convolution is 0. To sharpen an image you usually calculate the edge image and add it back to the original. This can be achieved using a single kernel, with the sum of the values being positive - usually you would increase the center value.
@Roxor1289 жыл бұрын
+Dmítrij Ačkásov It also works the other way around: negative centre and positive everywhere else will also do sharpening.
@DeJayHank9 жыл бұрын
+Martin Savc More more info about simple edge detection, I suggest people look up Sobel filter. One of the most basic edge highlightning filters.
@shannonbruner5429 жыл бұрын
+DeJayHank decedent
@jstbillz8 жыл бұрын
Wouldn't mind a full image processing course from you guys !
@sayantansarkar44095 жыл бұрын
That linking of sd, mean to gauss kernel was so good, it cleared a lot of confusions , thanks 🙂
@ZombieKilla969 жыл бұрын
Aaaah now my linear algebra class is starting to sound a lot more useful!
@Tjekr8 жыл бұрын
The video's with Dr Mike are really good. Very interesting and nice pace in the editing
@logicalfundy9 жыл бұрын
An interesting optimization of the two filters shown here (and other "separable" filters) is that, even though this example shows a 2D kernel - you can actually use two 1D kernels. One vertically, the other horizontally. On small kernel sizes, this doesn't make much of a difference. However, as the kernel gets large, it makes a tremendous difference. Imagine, if you will, applying a 25x25 kernel on every pixel of the image. That's averaging 625 values per pixel. Now, imagine if you did it in 1D: one pass is 1x25, the other 25x1. Now you're averaging only 50 values per pixel, which is over 12 times as fast. I was actually experimenting with making poster-size images once and the open source software I was using had the 2D kernel, and it was quite slow for large kernel sizes. I was able to make the blur much faster by doing that plus using multiple threads.
@logicalfundy9 жыл бұрын
You blur the image that was created by the previous pass.
@knighty9 жыл бұрын
Something that probably should have been mentioned relating to gaussian blurs is that the window for them mathematically is infinite. The function tends to zero but never reaches it, and thus every pixel technically has an effect on every other pixel. The actual kernel size is determined by some arbitrary cut off where the function reaches a sufficiently low value that it has no perceivable effect on the result.
@dominiccasts9 жыл бұрын
+knighty It would have been nice if they had mentioned this, as well as the number of standard deviations one should typically cut off the blur window. In my experience, 3 standard deviations is enough (at least for 8 bits per channel), but the reasons why one should choose that or another value, and the error that would result, would be an interesting video on its own.
@michaelpound98919 жыл бұрын
+knighty +Shadowfury333 We did record some stuff on this, but sometimes it better not to make them too long! I use +-2.5 standard deviations for my kernel radius, because that covers 98% of the weight under the distribution. Anywhere above two is going to look alright, but too large and obviously efficiency will be affected. The separability of the gaussian kernel (mentioned in some other comments) helps keep the speed up.
@knighty9 жыл бұрын
+Michael Pound Oh yeah totally, it would be insane to try to go into every little detail, I just felt like it was notable because there was mention of kernel size changing because of gaussian radius and thought it might have been worth mentioning even briefly why we do change/restrict it.
@dl66919 жыл бұрын
+knighty but that's a property of the normal distribution that he used to explain it with.
@jumanaalali83547 жыл бұрын
you always save me before my finals ...thank you
@SharpblueCreative9 жыл бұрын
As a photoshop user and designer I can relate to this. Interesting to see what's going on under the hood when I'm manipulating an image
@reefstarnes8392 жыл бұрын
I was looking for a video like this. Thank you for the simple explanation!
@somtoachu57046 жыл бұрын
hey Computerphile this is the first of your videos am ever understanding
@AntOfThy9 жыл бұрын
For more information on image convolution search for image magic examples, convolution. Also closely related is morphology, which in a sense is a image manipulation technique that includes convolution.
@Psychopatz2 жыл бұрын
I'm always fascinated about how our technology works, thank you sir for giving us some basic abstraction to it, its more of a marvel now than a magic to me
@DeJayHank9 жыл бұрын
Love that you're doing videos about image filtering. You should do one about Canny Edge detection now that you've laid out the basics :D
@michaelpound98919 жыл бұрын
+DeJayHank Canny's next :)
@RisinT969 жыл бұрын
Oh man, when he said kernel convolution it reminded me of the horrible "sampling and reconstruction" from "signals and systems" where you have to convolute the sampled signal with a kernel to get back the original signal. (and the whole Nyquist-Shannon sampling theorem)
@darksteel789 жыл бұрын
I feel your pain. After I took a signals and systems class I never wanted to do a Fourier/Laplace transform by hand or impulse response ever again. Lol.
@RisinT969 жыл бұрын
darksteel78 my exam is in a couple days and I hope that I will not have to do any of those again without the help of matlab. :)
@darksteel789 жыл бұрын
good luck :)
@paabloojc8 жыл бұрын
+darksteel78 hnmm
@username172344 жыл бұрын
Kernel convolution is my new favourite fancy alias for a weighed average.
@sammcewan95445 жыл бұрын
6:37 "This is where we cut" Couldn't do the quick maffs himself ha
@Triantalex2 ай бұрын
false.
@fr33doom9 жыл бұрын
awesome explanation!
@vabalokis9 жыл бұрын
always wondered how they worked , and its calculated easier than i imagined , cool stuff
@alihsanelmas5 жыл бұрын
A great video for getting into image filtering and processing!
@승현-r4q3 жыл бұрын
Very informative, many thanks!
@tymothylim65503 жыл бұрын
Thank you very much for this video! I learnt a lot from these explanations!
@konstantinrebrov6756 жыл бұрын
Great Lecture! Great Professor! Great Chanell!
@BigyanDhital9 жыл бұрын
This was very interesting. Want more of these
@danielgrace78878 жыл бұрын
Dr Mike Pound is the dude!
@harchan62742 жыл бұрын
Thanks a lot for this video, its was really helpful
@WiskerousPirate8 жыл бұрын
Great Video!! Very Informative!!
@reda29100 Жыл бұрын
3:14 who needs a blur when you have a potato! jk CF but that montage was lit! 6:58 that's exactly what we humans call "not edges." Blurry images, well, blur the lines between high contrasting pixels of colors (thhe marginalized ones), which we would say about "where are the edges?" It's a "fault", or a flaw of the images/hardware (input) themselves, not the processing algorithm itself.
@hugabuga-il61254 жыл бұрын
Undestood👏👏👏 thanks♥♥♥
@Aevatheone7 жыл бұрын
Awesome explaination! Exactly what I needed to fill i gaps!
@HaiHoang-iv9rj3 жыл бұрын
A great explanation
@normconel29079 жыл бұрын
I hope they do more of this easy to understand stuff
@sina_m_123 Жыл бұрын
Great Explained!
@boydzhang8 жыл бұрын
very illustrative, thanks!
@eldelarechorga5 жыл бұрын
Very well explained
@chandusr16 жыл бұрын
Thanks for the video, great explanations !!! I spent 1 hr in some online video lecture(I don't want to name it) to understand what filters are and this video just explained me in ~8 min. I'm learning computer vision, could you guys please make a video on 'Image descriptors'
@trashkassett9 жыл бұрын
Great video! Any plans on showing interpolation on images? Would be great to show the standard nearest, bi-linear and bi-cubic interpolation to regular grid images, but also scattered data interpolation to irregular grids for reconstructing images using RBF or Gaussian Process.
@noureldinmoustafatutorials25122 жыл бұрын
amazing stuff👌
@frankowalker46624 жыл бұрын
Thank you for this. I've been creating and using these filter kernals (?) since the 90's, and never knew what they were or how they worked. LOL. PPaint on the Amiga lets you set your own parameters for some cool effects.
@jdgrahamo9 жыл бұрын
Very interesting and helpful, thank you.
@AG-lc5mn8 жыл бұрын
Loved the video! I'm studying neuroscience and it helps a lot
@eduardofernandes78893 жыл бұрын
How so?
@teekanne159 жыл бұрын
What Kind of accent is his? I think that's one of the best sounding ones I've heard and the voice is really good too
@buggaboo2707 Жыл бұрын
4:34 The area under the bell curve is equal to 1 ( 100% )... you can't just draw a bell curve fatter ( more variance ), without making it shorter at the mean Thank you for the informative video though, I learned from it :)
@annanyatyagip6 ай бұрын
thank you this was great!
@spasovan4 жыл бұрын
very nice explaination
@jackdumanat499 жыл бұрын
2d convolution is also used to make john conway's game of life. I used to make it such that each reference cell check all the cells around it then decides to die or live. Then i came a cross this that does everything and is more efficient than my original code.
@Man0fSteell7 жыл бұрын
AMAZING !! THANK YOU SO MUCH!!
@Vipenstrike9 жыл бұрын
Thanks for the video!
@whiteshampoodev9 жыл бұрын
Thanks ALOT for this video!
@mahoneg6 жыл бұрын
Blast from the past for me. In the 80s I was doing edge detection with Gould equipment at NYU. Then I went to finance. ...
@AuthenticTerrificRickCastle9 жыл бұрын
wow this is completely amazing I was wondering how different blurs differ! could you please explain how so-called "Photoshop Lens Blur" works? The one with bokeh effect and different polygonal shapes of aperture. How similar it is to photoshop "shape blur"?
@nightblind6 жыл бұрын
Maybe a video for how cheby1 and Butterworth work? Basically a data filtering video. Cheers, awesome explanation boys.
@zihanqiao28505 жыл бұрын
Really nice!
@omgimgfut9 жыл бұрын
super informative
@jorgevicencio35734 жыл бұрын
thanks a lot for this lesson
@31337flamer8 жыл бұрын
best computerphile vids.. :O the standard gaussian blur in photoshop had bad edges cuz it used empty pixel data from outside the picture .. changed in cs3 ^^ box blur is still best but gaussian has nice edges now
@YalnekH8 жыл бұрын
Those Gaussian's should have different peak heights!
@fluffigverbimmelt7 жыл бұрын
+1
@alexj01017 жыл бұрын
great video.
@cpt_4meric44 жыл бұрын
You sir, are amazing :3
@Tumbolisu9 жыл бұрын
But simply taking the average will give you something that doesn't look as good. All numbers stored in an image are actually the squareroot of the original real-life thing (if it would be a photo). So the math needed to get the correct value will need some x^2 kind of stuff. This is most noticable when bluring the two edges where one edge is red and the other is blue, or red and green, or blue and green.
@L4Vo59 жыл бұрын
+Tumbolisu 1. They already explained that in a video 2. this is how to handle an image, if some images have to be square rooted (im sure there are some kind of HD color images where that doesn't happen) or not is another problem (else i should say that even before blurring the image, you need to turn on the computer!! wich is true, but it's another problem).
@Mohamed.wael7 Жыл бұрын
crystal clear!
@rjfaber19919 жыл бұрын
So how does the "Despeckle" option in Photoshop work? I always find that a much better way to remove noise than just blurring everything, because blur always preserves at least some of the noise, especially if the noise is coloured and not monochromatic...
@Folopolis9 жыл бұрын
+Robert Faber Assuming it works similarly to the noise removing filter in GIMP, it's a layering of many filters. First, it computes the median and quartile values of the pixels in the original image under the kernel, to find outliers. It then applies a Gaussian filter, but applying where the outliers are.
@rawlamb51977 жыл бұрын
I would have liked to have seen more talk about how you get a picture to be expressed as a grid of numbers and how you would do this on an RGB image. For example, how does the pixel Mike first manipulates become just 64? I'm assuming this was done on a gray image because that's probably the easiest way to explain filters before introducing colour channels. Great video, anyways!
@hamrikada39185 жыл бұрын
thank you sir !
@TrebleWing9 жыл бұрын
This guy is my favorite
@DannyJulian776 жыл бұрын
How would you define noise in an image? An abrupt change in color from one pixel to another?
@nixie24629 жыл бұрын
Now please please, do a sharpen filter explanation!
@ThalesII9 жыл бұрын
What sort of optimization could be done to avoid having to look up several pixels for every one you have to process?
@MattiEskelinen9 жыл бұрын
+ThalesII Look up the convolution theorem and fast fourier transforms. The theorem basically says that one can calculate a convolution of two functions (in this case, the image and the kernel) as a product of the Fourier transforms of the functions. Using fast discrete fourier transforms, this is more efficient than the naive convolution presented here.
@Lanyovan9 жыл бұрын
+ThalesII If you cut the image into smaller pieces and process piece by piece a lot of pixel lookup will be faster due to caching (at least faster than going line by line on the whole image). Filters can also be parallelized well since the target pixels are calculated independently.
@fafetico9 жыл бұрын
+ThalesII You could still optimize convolution by using separable and recursive filter algorithms. Basically, in separable filters, instead of using a nxn convolution mask you would use a 1xn and a nx1, making it more efficient. And in recursive filters, you would use the outputs of previously processed pixels in order to lower the number of pixels you have to look for the next pixel being processed, reducing the number of computations needed.
@Theraot9 жыл бұрын
Search the document "Image Convolutionwith CUDA" you should find it on nvidia dot com as convolutionSeparable dot pdf - happy reading.
@johnvonhorn29428 жыл бұрын
So Dr Pound's Kernal of one's is called "Poundland" because everything is one pound. Coined a new SI unit there - The Poundland. Surely that's got to be worth a Nobel Prize? I'll swing by Nottingham later on and pick up my PhD. You're welcome.
@sagnikdas78775 жыл бұрын
4:21 are you sure of the diagram?got to preserve aea.
@zutru4 жыл бұрын
Why are you talking about "Convolutions" while doing a "Correlation"? Sure, for symmetrical kernels you will get the same result, but in general, they are not the same!
@ahmedaraby11135 жыл бұрын
thanks very much
@harsh77046 жыл бұрын
can someone explain me: while comparing mean blur kernel and Gaussian blur kernel , why we divide by 16 not 9 in case of Gaussian blue kernel
@crunchyduck6 жыл бұрын
Harshvardhan Matta It's the value of all the multiplication done. 1+2+1+2+4+2+1+2+1.
@World_of_OSes6 жыл бұрын
What do you do in your work?
@Sh-hg8kf3 жыл бұрын
Sorry if my question is silly, I'm a computer noob. What do those pixel value nunbers really represent in the image? Color? Brightness?
@235deer5 жыл бұрын
For the gaussian filter how did you come up with that square of numbers (x1, x2, x1, etc.) A gaussian is intended to take numbers at random, I thought?
@Deddiward7 жыл бұрын
What does 'intensity of the pixel' mean exactly? I mean, a pixel is composed of R,G,B,alpha values.. do you apply the matrix convolution to all 4 of them? Or do you do something different?
@BenevolentXMachine8 жыл бұрын
So interesting.
@quenz.goosington9 жыл бұрын
It would have been nice to mention the new deep learning based filters like deepdream and neuralart.
@RoySchl9 жыл бұрын
+44thats44oars well those are something entirely different, i would not even call those things filters at all.
@quenz.goosington9 жыл бұрын
Well, Mike in the video describes a filter as something that will "take some image, process it, and come up with some output". Yes, they may be completely different an new, but I'd still say they're filters.
@Theraot9 жыл бұрын
+44thats44oars the matter is that filters - at lesat conventionally - are pure*, in the sense that they ONLY "take some image, process it, and come up with some output". While those you mention has other inputs. *: deterministic, no additional inputs, no side effects.
@quenz.goosington9 жыл бұрын
You could say that the actual algorithm is an extra input. You could also say that deepdream/neuralstyle's extra inputs are just part of their respective algorithms, and aren't necessarily "extra". I guess it depends how you wanna see it, but to me, they aren't /really/ that fundamentally different.
@nhilario76384 жыл бұрын
This is a very old video. But I'm actually starting at all this and I tried programming a function that does exactly that (an image convolution), but when I calculate a reasonable kernel size for the gaussian blur (as well as the kernel values themselves) the image just gets darker... It's so frustrating because I normalized it, so the sum of everything is 1. I'm getting pretty desperate to be honest!!