Backpropagation Neural Network - How it Works e.g. Counting

202,632 views

RimstarOrg

Days ago

Comments: 334
@carloschau9310 8 years ago
Love your plain language and the way you explain with a visual example! Thank you!
@AppliedScience 8 years ago
Great explanation!
@RimstarOrg 8 years ago
+Applied Science Thanks!
@TheMrRedBeard 8 years ago
I have read up on neural networks quite a few times in the past and had only seen them as trivial things until now. Your video provided that epiphany/eureka moment for me. Thanks!
@Soundole 7 years ago
Thanks so much for producing this video, this has brought me closer to understanding neural networks than any other resource online!
@MatthewBishop64 7 years ago
"Do math with all these numbers" - the secret sauce of neural networks. :)
@pawelsberg 8 years ago
Thank you for showing "bias" neurons. I was struggling to get the same effect by constructing a part of the network that would always give "1", but that only works if you have at least one non-zero input. The "bias" solution is nicer.
@zinebslam5398 8 years ago
Out of all the videos I've watched, this is the only one that gives such a clear and understandable model! Thanks.
@abhimanyusingh4281 8 years ago
Best explanation with simplicity so far
@prabuskleo 7 years ago
Out of all of them, this is the best introductory video for NNs! Thanks, prof. :)
@InnovationBlast 8 years ago
Wow, my friends recently won first place at our school's research symposium with work on neural networks and machine learning. Thanks for teaching me some more, now I can surprise them with my new knowledge!
@favs5286 8 years ago
Very informative and student-friendly explanation!
@leskuelbs9558 8 years ago
Very cool! Your explanation makes me want to play with neural networks again. Thanks.
@ahamedimad4554 8 years ago
Thanks a lot. This looks like exactly what I need for my final year research.
@IHAVE1ARM 8 years ago
Great video for people who are just starting out with neural networks.
@IamTheTimBecker 8 years ago
Thanks for the simple, visual explanation!! It was exactly what I was looking for
@raresdavid 7 years ago
Wow, this is the best explanation of backpropagation I've ever seen. Great job!
@RimstarOrg 7 years ago
Thanks!
@Erodrigufer 8 years ago
Thanks a lot for making this video. It was really interesting. I would really like to see a video in which you explain your code in detail :) I think I'm not the only one interested in such a video.
@bobsmithy3103 8 years ago
I finally understand backpropagation now. Thanks a ton!
@edwardkwartler2647 8 years ago
Excellent and helpful backprop explanation. Thanks.
@meowmeowmotherpucker 7 years ago
Props where props are due, this is just an excellent example for explaining basic neural network functionality and backpropagation. Thanks a whole lot!
@johnmadden9613 8 years ago
I *FINALLY* understand backpropagation! Thank you!
@MikePorterII 8 years ago
This was a really great video, and even better, the totally lucid and cogent explanations. THANK YOU!!!
@ericso836 8 years ago
Very clear and packed full of knowledge, yet understandable!
@sethsachin86 7 years ago
Very good illustration!
@PraveenJose18551 8 years ago
Have you thought about making a "how schlieren photography works" video? I think that's a topic that is right up your alley.
@IncroyablesExperiences 8 years ago
Great job!
@svsujeet 8 years ago
Beautiful. Thanks for the effort.
@NaimaZakaria 8 years ago
I love your videos! Please make more on neural net and general AI topics. Like, what the heck is the difference between all the different frameworks?
@RimstarOrg 8 years ago
Thanks! I was thinking of doing just that, a comparison of all the different types of neural networks and sample applications.
@NaimaZakaria 8 years ago
Looking forward to it :)
@karnkraftverk 7 years ago
Great explanation, I think I understand the basics of it now!
@electronicsNmore 8 years ago
Excellent video, and excellent animations!
@RimstarOrg 8 years ago
+electronicsNmore Thanks!
@StoneFlange 8 years ago
Well done, sir :) Thanks for the video and explanation! I'm already thinking about how I can use this, haha.
@carlosgil2691 8 years ago
Very clear and concise. Thank you!
@mohammadmaniat1040 8 years ago
Thanks for the explanation of ANNs.
@DkMark-pu8hi 8 years ago
Great explanation. My teacher's poor explanation confused everyone in the class.
@malgailany 8 years ago
Impressive video, thanks.
@hiphopobenzema 8 years ago
Thank you for explaining this! I always wondered how machine learning worked at the base level.
@RimstarOrg 8 years ago
+hiphopobenzema You're welcome! This is just one type of machine learning, but it's a very flexible one. I've used this same neural network code, just with a different user interface, to do fairly decent character recognition. Sadly it was for a GUI that no longer exists, otherwise I would have shown that one too.
@USWaterRockets 8 years ago
+RimstarOrg What GUI would that be? There are not that many to guess from.
@RimstarOrg 8 years ago
+USWaterRockets QNX Windows running on the QNX OS. I bet you didn't guess that one! :) I don't think I ever ported it to Photon under QNX, but nowadays QNX seems more focused on GTK and HTML5 for GUIs and I might try it with one of those. I guess I could train it under GTK but also have the trained network on HTML5 for people to try out (I used to be involved with the still awesome QNX OS in various capacities.)
@USWaterRockets 8 years ago
No, that was not on my short list. Pretty cool!
@gimpau 7 years ago
Thanks, it's all much clearer now.
@megamef 8 years ago
How are you so knowledgeable on such a wide range of subjects?
@RimstarOrg 8 years ago
+Stephen Methuen Probably not as many subjects as I'd like :), but for what I do know, it's probably because I let my curiosity carry me away.
@boningli7158 7 years ago
Best one I've ever seen, so easy to understand.
@realxvrw 8 years ago
This tutorial is awesome
@BiranchiNarayanNayak 7 years ago
Excellent tutorial on NNs.
@niazalikahn5845 7 years ago
This video is very helpful. I got lots of information from it, thank you so much.
@melvin6228 5 years ago
I code in vim! I code in emacs! I code in Wordpad and made a neural net while you were arguing.
@peaceandlove5855 5 years ago
In 6 minutes!! Stop it, don't waste my brain's neurons.
@nawkwan 8 years ago
Hi Steve, thanks for providing us with an overview of how backpropagation works. I understood most of what you presented in the video. Can you explain how the activation function works because that's really where the magic is? While I am new (no prior knowledge) to this topic, this "activation function" appears to serve as a function that normalizes the output values, which are compared to the expected values of the training set.
@minecraftermad 7 years ago
ALL I WANT IS THE MATH IN DETAIL
@jenneh8821 6 years ago
This was very succinct and super helpful! Thanks again!!!
@thebeststooge 8 years ago
Just here having my brain melt with all of this.
@RimstarOrg 8 years ago
+Dark Alchemist You mean... having your neural networks melt with all of this. :)
@thebeststooge 8 years ago
RimstarOrg Precisely, LOL.
@dfortaeGameReviews 7 years ago
Nice job. Thanks for sharing!
@GregPerry 7 years ago
Artfully done, kudos RimstarOrg.
@arielrisso9632 8 years ago
Hi, I understand the bias as one more dimension, but is the bias neuron necessary to balance the network? Why one bias? Why not 2, 4, 5, or 1000 biases?
@RimstarOrg 8 years ago
The use of bias unit is really just a programming trick to make the code simpler. Originally each of the hidden and output units would have a threshold value that would determine if the unit would fire i.e. whether or not you'd do calculations in the next layer based on that unit. To simplify the code, bias units were created that did much the same job. So you still need a threshold of some kind. If you have only one value for the bias then you need only one bias unit. In the example in this video the bias is 1 for all hidden and output units and so only one bias unit is needed. Maybe there are cases where you'd need different bias values for different layers, in which case you'd have a different bias unit for each layer.
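To make the bias-unit trick concrete, here is a minimal C sketch (illustrative names, not the video's actual backprop.c): the bias is just one extra input that is always 1, with its own trainable weight, so training treats it like any other weight.

    /* Weighted sum into one hidden or output unit, with the bias as an
     * extra input fixed at 1.0 instead of a separate threshold value. */
    double unit_input(const double *inputs, const double *weights, int n,
                      double bias_weight)
    {
        double sum = bias_weight * 1.0;   /* the bias unit always outputs 1 */
        for (int i = 0; i < n; i++)
            sum += weights[i] * inputs[i];
        return sum;   /* this sum then goes through the activation function */
    }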
@nikkinishant 7 years ago
Hi, I wanted to know how we select the activation function by looking at the independent variables. I used SPSS for building a NN and it selects either logit or tanh. Please help.
@nickoutram6939 8 years ago
This is a good simple start on how a neural network derives its outputs from a set of weights and functions, but I would like to have seen a little more about how the error between what is expected and what is output is 'propagated back' through the network and impacts the weights, especially w.r.t. how it goes back from the hidden layer, which is not itself an output (what is the delta of expected vs actual for a hidden neuron?). As things stand, I have to wade through code without really understanding the principle. So it's almost perfect but missing an extra minute or two, IMO. Anyway, thanks for posting!
@RimstarOrg 8 years ago
I think making that clear would take more than a minute or two. But yeah, I agree I should go into that in more detail. There's a lot of interesting stuff there.
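In the meantime, here is a hedged sketch of that missing detail for a 3-layer network with sigmoid units (array names, sizes, and the learning rate are assumptions, not the library's code). The answer to the hidden-neuron question above: a hidden unit has no target value, so its delta is its share of the blame, the weighted sum of the output deltas it fed into.

    #define N_HID 3
    #define N_OUT 3
    #define RATE  0.5   /* assumed learning rate */

    /* One backpropagation step. With sigmoid activation s(x),
     * s'(x) = s(x)*(1 - s(x)), hence the value*(1 - value) factors. */
    void backprop_step(const double hid[N_HID], const double out[N_OUT],
                       const double target[N_OUT],
                       double who[N_HID][N_OUT],   /* hidden-to-output weights */
                       double delta_hid[N_HID])
    {
        double delta_out[N_OUT];

        /* Output units: error is a direct comparison with the target. */
        for (int k = 0; k < N_OUT; k++)
            delta_out[k] = out[k] * (1.0 - out[k]) * (target[k] - out[k]);

        /* Hidden units: blame is propagated back through the weights. */
        for (int j = 0; j < N_HID; j++) {
            double blame = 0.0;
            for (int k = 0; k < N_OUT; k++)
                blame += who[j][k] * delta_out[k];
            delta_hid[j] = hid[j] * (1.0 - hid[j]) * blame;
        }

        /* Update the hidden-to-output weights; the input-to-hidden
         * weights are updated the same way using delta_hid. */
        for (int j = 0; j < N_HID; j++)
            for (int k = 0; k < N_OUT; k++)
                who[j][k] += RATE * delta_out[k] * hid[j];
    }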
@parnabsanyal5750 8 years ago
Very clean...
@sunebrian1423 6 years ago
I suggest you add the time library to the code. The gcc compile has an error without it. BTW, good tutorial, thanks.
@RimstarOrg 6 years ago
Thanks. I don't see anything which would require it and I'm not getting any errors. Which file is it having a problem with, backprop.c or testcounting.c? Do you have the error line?
@sunebrian1423 6 years ago
$ gcc -c backprop.c
backprop.c: In function 'bkp_create_network':
backprop.c:65:25: warning: implicit declaration of function 'time' [-Wimplicit-function-declaration]
  srand((unsigned int) time(NULL));
@RimstarOrg 6 years ago
Oh! I totally forgot about that. I scanned the code looking for things like sleep() and delay() thinking that there was no reason I would have used them. Thanks. I'll fix it.
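For anyone hitting the same warning, the fix is the missing include; a minimal sketch of the relevant bit (the wrapper function is illustrative, not the library's code):

    #include <stdlib.h>   /* srand() */
    #include <time.h>     /* time() -- the declaration gcc was missing */

    void seed_rng(void)
    {
        srand((unsigned int) time(NULL));
    }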
@user-ed7gm7ol8k 8 years ago
Great video.
@nikodembartnik 8 years ago
Amazing explanation, but I can't understand one thing. I already wrote all the stuff in C#, but I'm stuck on the backpropagation. I calculate the error for the outputs, and what's next? How do I calculate the errors for the next weights (between the input and hidden layers)? How should I know what the value should be in this place? This is the last thing needed to get it working :( Can anybody help? Thanks
@RimstarOrg 8 years ago
Maybe someone else can try but that's too much for me to explain in a comment and too much for a video (which is why I didn't.) All I can suggest is to download the code library from rimstar.org/science_electronics_projects/backpropagation_neural_network_software_3_layer.htm (part way down on the page) and look at the backprop.c file. The bkp_learn() function has the top level part of the learning code. It's in C though. I don't know how close that is to C#. There's also neuralnetworksanddeeplearning.com/chap2.html
@nikodembartnik 8 years ago
Thanks for the answer, I will check it out.
@GaryWee111 8 years ago
Thank you for your video!
@Pompiduskus 7 years ago
Thank you! This is awesome
@VVe11erMichae1 8 years ago
Thanks for the video, it has really helped me understand backpropagation, especially with the simplicity of the counting example. I wonder how this could be used in a deep learning context?
@RimstarOrg 8 years ago
Deep learning neural networks use the same basic approach but with more layers and slightly different training. They usually also connect some of the layers a little differently in something called a convolutional layer, in addition to a few other connection approaches depending on what they want to do with it. Training is sometimes done with two layers at a time initially. For example, for an image recognition deep learning network they might initially train the network two layers at a time with a huge amount of sample data for many iterations. That gives the network a basic understanding of image structure (edges, basic shapes, ...). Then they do backpropagation training with their particular images containing the objects they care about, but this time training all layers at the same time like in this video.
@TehFingergunz 7 years ago
This video is very helpful! Thank you!
@yulianloaiza 6 years ago
Thank you!! Very clear and understandable.
@otomik1 8 years ago
Beautiful! Can you recommend any sources for further reading about neural networks?
@RimstarOrg 8 years ago
+Veseyron Thanks! I've been enjoying this online book lately neuralnetworksanddeeplearning.com/chap1.html. If anyone else has suggestions I'd love to hear them.
@TheKobeyaoming01 7 years ago
Wow... this is so GREAT... I'm taking a course in machine learning, and this makes what I'm doing much clearer. Wow, just wow.
@lucamatteobarbieri2493 8 years ago
Very nice!
@Vikasthale 8 years ago
That was useful!!! Thanks for the video.
@houdalmayahi3538 5 years ago
This is a good explanation. Thanks mate!
@krellin 8 years ago
Thank you for the effort, this is so far one of the best intros to neural networks I've seen. I wonder if it would be easy to convert this code to Java? I'm no C programmer, but as far as I know only the .c and .h files are the actual sources, right?
@RimstarOrg 8 years ago
Thanks. I've converted comparable C code to Java before and based on that, it should be doable. Keep in mind though that since Java is an interpreted language (as far as I know, I haven't done a lot of Java) it will run slower. And yes, everything's in the .c and .h files. The Makefile just defines how it all goes together and how to build it.
@krellin 8 years ago
It's not interpreted ;) If one has knowledge of L caches and generally how things work at a low level, then one can produce Java code that will be JIT-compiled to very efficient assembly that takes advantage of all the special CPU instructions, caches, etc. So I don't worry about performance; right now I'm trying to compensate for my forgotten math knowledge with some practical pieces of code.
@krellin 8 years ago
I'm not saying that it will be faster than fine-tuned awesome C libraries, but it will definitely outperform R or Python...
@Amacio 8 years ago
Hey mate, maybe you should check out this site: neuroph.sourceforge.net/ They already have a lot of stuff for neural networks in Java.
@krellin 8 years ago
thanks
@BGBTech 8 years ago
I personally haven't had many interesting results with NNs, though granted, I generally tried them with larger cube-shaped networks and typically fixed-point math. It seems one needs a fairly big net to have much hope of doing anything useful with an image (such as trying to recognize an object in the picture), but then one runs into needing to make it fast enough to evaluate everything at a decent framerate (the goal being to operate in some semblance of real time on reasonably modest hardware, such as reasonably cheap ARM SoCs). Getting it both fast enough and getting useful output have thus far remained elusive. I've had more interesting results with genetic algorithms and genetic programming, though still with relatively few "actually useful" results. I had OK results using GAs to tune parameters for a video codec, even if it doesn't tune them exactly as I would want, which requires lots of fiddling with the code that evaluates how "good" the results are. There's sort of an issue where it tunes for "numbers look pretty good but output quality looks kind of like crap". For GP-type stuff, I have generally had the best results with simplistic ISAs, typically using fixed-width instructions, word-oriented memory, and "memory mapping" all the inputs and outputs. The advantage of this is, at least theoretically, that if it finds something "good" you can convert it into a more efficient native-code program. IME, reg/mem ISAs seem to work better for this than stack-machine ISAs. For most "actually useful" tasks (in a computer vision sense), I've had better results writing code to do stuff more directly (using strategies similar to those used in some of my video codecs). A lot of it is simple things, like finding the average and ranges (standard deviation) for blocks of pixels, allowing the more expensive per-pixel checks to be skipped in the majority of cases. I've gotten things like basic spatial perception working (via stereo inference), but tasks like object recognition are a fair bit harder.
@RimstarOrg 8 years ago
+Brendan Bohannon Interesting path you've been on. I originally wanted to go down a similar path around 20 years ago but got sidetracked. I hadn't run across ISAs before. I'll read up.
@BGBTech 8 years ago
RimstarOrg I am not sure how common it is to do it this way (this isn't really a serious sub-project, just idle fiddling thus far). Basically, I do something along similar lines to a fairly small/simplistic microcontroller or DSP, and essentially have the genetic algorithms work on what would be the ROM image (generally, ROM space may be around several k-words). My designs typically use word-oriented memory with a word-oriented ISA (with 16- or 32-bit instructions). Sometimes vector instructions (e.g. SIMD) or use-case-specific built-in features may be included, such as dealing with 4x4 pixel blocks (typically color-cell technology), which may partially overlap with the SIMD ops. Sometimes accommodations are made, like using gray-coding on the instructions, or allowing call/return by label IDs rather than by address (typically ROM addresses don't survive mutation/breeding all that well). I have wondered some about the possibility of adding some NN capabilities to these, possibly with memory-mapped NN IO and a "neuro-ROM" section or similar (probably with 128- or 256-bit neurons), which would need to cooperate somehow with the main ROM. For simulation, behaviors are evaluated, typically on a points system, with some events carrying strong penalties (such as crashing the virtual CPU, ...). Generally, best scores win. The specifics of the simulation depend a bit on the task, anything from sequential evaluation to running a largish number of virtual agents simultaneously. The goal of such a thing would be to hopefully get something on par with insects, or at least compete well with traditional finite-state-machine AIs. Results typically fall short of real-world insect-like skills (not usually that good at responding to surroundings or displaying survival skills, and often better at finding weird/undesirable ways of gaming the simulation, ...). Relatively few of these tests have given the agents 3D-rendered visual inputs, though, mostly due to the cost of 3D rendering the view for a largish number of virtual agents (as a result, most of these tests have been 2D with top-down sensory perception). As noted before, sensory input is generally memory-mapped, with a small part of the IO space for movement outputs and other things. Nothing thus far has really given particularly noteworthy results, though. Or such...
@ai2221 7 years ago
Hello sir, how does the backpropagation algorithm work with a multilayer feed-forward neural network in terms of recognizing an image of a leaf?
@RimstarOrg 7 years ago
You should look for a convolutional neural network instead. The reason is that a leaf can be in any orientation and at any location in the image, and a simple neural network like the one in this video doesn't handle that well. That's why convolutional neural networks were invented. So for images of leaves I'd treat the input units as a rectangular array of pixels. For example, for a 128x128 image, the first row of the image would be input units 0 to 127, the second row of the image would be input units 128 to 255, and so on. If all you're looking for is whether the image is a leaf or not, then I'd have two outputs, one for yes and one for no. You might have to train it with things that aren't leaves too. In any case, the neural network in this video is a multilayer feed-forward neural network. It might not seem like it because I take the outputs and feed them back to the inputs for one demonstration. But if you're just looking for the next highest binary number, then it's just feed-forward.
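As a hedged sketch of the pixel-to-input mapping just described (names are illustrative; scaling each pixel into 0..1 is an assumption, not from the video):

    #define IMG_W 128
    #define IMG_H 128

    /* Flatten a 128x128 grayscale image into the network's input units:
     * row 0 becomes inputs 0..127, row 1 becomes inputs 128..255, etc. */
    void image_to_inputs(const unsigned char pix[IMG_H][IMG_W],
                         double inputs[IMG_H * IMG_W])
    {
        for (int row = 0; row < IMG_H; row++)
            for (int col = 0; col < IMG_W; col++)
                inputs[row * IMG_W + col] = pix[row][col] / 255.0;
    }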
@freesoul2677 7 years ago
Amazing! Thank you.
@haroldsu 7 years ago
Awesome, and thank you!
@deslomeslager 8 years ago
Roughly 20 years ago I applied the backprop NN, training it with real stock values, of course with the idea of trying to guess the next value: will the stock rise or go down? Of course there is science behind the NN, and although it could predict the new stock value fairly well, it could not beat a fairly simple rule of thumb: if the value goes up, then anticipate it will go up the next day as well, and the same for going down. (But if it stays the same, that rule of thumb is wrong, of course.) Training on bits and bytes with a known (and fixed) outcome works fairly well.
@RimstarOrg 8 years ago
+deslomeslager I gave this same backprop NN code to a friend of mine around 10 years ago who wanted to do the same with the TSX (Toronto Stock Exchange). He had some sort of subscription that gave him download access to all the historical data. I don't think he ever went far with it though.
@Gary1q2 7 years ago
Awesome visuals and well said. Everything was explained well except for how the backpropagation formula worked :'(
@RimstarOrg 7 years ago
The thing is, there is no one backpropagation formula. Backpropagation is the act of figuring out your error and then propagating it back through the network, adjusting the weights using any of a number of different algorithms (e.g. en.wikipedia.org/wiki/Stochastic_gradient_descent#Extensions_and_variants).
@Gary1q2 7 years ago
OHHHHHHH THANKS RIMSTAR
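For reference, those weight-adjustment algorithms all share one underlying gradient-descent shape; in LaTeX, with learning rate \eta and error E:

    w_{ij} \leftarrow w_{ij} - \eta \frac{\partial E}{\partial w_{ij}}

The variants differ in how the gradient is estimated and scaled (momentum, adaptive learning rates, and so on).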
@zy2870 6 years ago
Would like to hear more about how you set up the hidden layer!
@yummytums6882 5 years ago
Probably it was arbitrary, honestly. You can use any number of 'units' (or nodes, as I like to call them) to train it. You could also have as many hidden layers as you want; he only used one. More nodes presumably make it learn faster, but this comes at the cost of making the network slower, and it's really not guaranteed to have substantial effects.
@vivekchand19 7 years ago
Very helpful. Thanks a lot!
@ulob 7 years ago
What if you used two hidden layers? Would it learn faster or slower? Also, does it always converge to similar weights when you train with a different random seed? Can it fail to learn for some random seed, or is it guaranteed to work?
@armouredheart5389 7 years ago
The number of hidden layers is actually arbitrary, unless it's hardcoded, that is. The more layers, the more operations have to be performed to train it. However, I would wager that there is a sweet spot with the optimum number of layers that lets it learn fast without being slowed down by hardware limitations. Disclaimer: I am still learning, so I may be wrong.
@MrSatyavinay 7 years ago
Great, got a rough idea of how backpropagation works :D
@Murderface666 7 years ago
I see a lot of videos about using this to train programs to do something, but nobody is really going into detail on the process of how to really use the network to perform a task. For example, if I wanted to use it to train a pixel to navigate a maze with street lights to stop on red, slow on yellow and go on green (travelling in a straight line), I wouldn't know where to begin to implement such behavior even after studying various implementations of neural network examples.
@RimstarOrg 7 years ago
One way to do it is to have the inputs be the low resolution input from a camera and the output would be five neurons, for example: stop, forward, backward, left, right. Those outputs would be commands for the robot's motors. You'd train it by first using a remote control to control the robot as you drive it through the maze. Do it many times. As you do that, you'd record what the camera sees and what you told the motors to do. That creates your training dataset, just like the one I talk about in this video. Then you'd train the neural network using that training dataset. Once it's trained, you'd then bypass the remote control and have the robot use the neural network instead to make its decisions. That's how these ones do it hackaday.com/2017/06/06/self-driving-rc-cars-with-tensorflow-raspberry-pi-or-macbook-onboard/. This is called supervised learning since you train it using a dataset containing inputs and expected outputs.
@rylaczero3740 7 years ago
Partisan Black read the deep learning book, bro
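As a hedged sketch of what one record in that supervised training dataset might look like (the camera resolution and all names are assumptions):

    #define CAM_W 32
    #define CAM_H 24   /* assumed low-resolution camera */
    #define N_CMD 5    /* stop, forward, backward, left, right */

    /* One training record for the maze robot: what the camera saw,
     * paired with what the human driver commanded at that moment. */
    struct training_sample {
        double camera[CAM_H * CAM_W];   /* network inputs */
        double command[N_CMD];          /* expected outputs, one-hot,
                                           e.g. {0,1,0,0,0} = forward */
    };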
@dansierra2222 7 years ago
So I'm in my first year of computer science, but I had been coding for 2 years before that. Is this too advanced for me? I am really interested.
@RimstarOrg 7 years ago
The programming side of it shouldn't be too advanced. The math side may or may not be depending on how deep you want to go. Most people these days just use the libraries/frameworks instead of writing their own from scratch so there's less math involved.
@erikm9768 8 years ago
Great video!!!
@FSXgta 8 years ago
I recommend you check out SethBling; he used this technique to make Mario learn to play and complete levels.
@veluart 7 years ago
I'm using Code::Blocks to run your program. When I build it, it creates only the object file, not the executable file. Please help me figure out how to run it.
@RimstarOrg 7 years ago
Are you getting any error messages? What OS are you using? I can only vouch for the latest version of Code::Blocks, 16.01. You should be able to go to the Build menu and choose Run. A command line window should pop up with the program already running in it (at least that's what happens in Windows Vista). If that doesn't happen, and you didn't get any errors, then bring up a command line window of some sort (depends on your OS) and go to the testcounting folder/directory. Under there, go to the bin folder and then the Release folder. The executable should be there. At a command line you should be able to just type the executable name and it will run. If you're running Linux, the current directory may not be in your path, in which case you would have to type ./testcounting to run it.
@veluart 7 years ago
I got it and it's working perfectly :-) If possible, can you send me the Java or Python program for the same?
@RimstarOrg 7 years ago
Glad to hear it! What was the problem, in case it'll help me help others in the future? Funny that you should ask for a Python version. I'm working on a Python version using TensorFlow (Google's neural network framework, www.tensorflow.org/) as part of learning TensorFlow. But I don't currently have any plans for doing straight-up Java or Python versions.
@dkkoala1 8 years ago
One question: you input the bias value into the activation function as an "x" value (if we say the activation function looks like y = ax+b), but shouldn't it be used as the b value and therefore added on afterwards? The reason I ask is that I understood the bias as an adjustment value meant to move the function along the x axis. Can it be used for that, or is it only used as an adjustment of the input into the function?
@RimstarOrg 8 years ago
I'm keeping the bias as a separate unit with weights in order to simplify the training. One approach is to do it as I suspect you're describing (if I understand you correctly) and just have a bunch of separate b values. Those values would have to be adjusted during training, which is okay. But an equivalent way is to instead create a separate unit, the bias, whose value is 1 and then have weights connecting that bias unit to the hidden and output units. Then during training we adjust those weights, just like we'd adjust all the other weights in the network. It's a trick to simplify the training. Instead of adjusting all the other weights and also the bias values, we just have a bunch of weights to adjust. That's my understanding from where I originally learned all this in Scientific American magazine, Sept 1992, in the "The Amateur Scientist" article and from which I got my original code base. I didn't point it out in the video, or show it in the animations, but I initialize the values of all the weights to a value between -1 and 1.
@RimstarOrg 8 years ago
As my other reply stated, the way I'm doing the bias is equivalent to what you're expecting. To answer your second question, I've never thought of it as moving the function along the x axis. I guess I look at it more intuitively. The bias is a replacement for the threshold in the old perceptron model. With perceptrons, after all the input values have been multiplied by the weights, we next need to decide whether the sum total is high enough to fire the neuron (hidden unit). For that, the sum total is compared to a threshold value, and if it's greater than the threshold then the neuron fires, otherwise it doesn't (its output is 0). The threshold has long since been replaced by the bias. Instead of testing against a threshold, we add on a bias which adds to or subtracts from the sum total, moving it closer to or away from 1. To give you a reference, search for the word "bias" here: neuralnetworksanddeeplearning.com/chap1.html
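A minimal sketch of that initialization (rand()-based; the actual backprop.c may do it differently):

    #include <stdlib.h>

    /* Give every weight, including the bias weights, a random starting
     * value between -1 and 1, as mentioned above. */
    double random_weight(void)
    {
        return 2.0 * ((double) rand() / RAND_MAX) - 1.0;
    }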
@kirandeepsingh9144 8 years ago
I think there is a problem with my college internet. All other websites work well there, but your links don't open. But thanks, I tried them on my phone and they opened and worked well. Thanks for your instant reply.
@SinanAkkoyun 7 years ago
Ohh, this is amazing!!! Can you explain how to write this source code from scratch?
@marinellovragovic1207 8 years ago
If it is possible to teach a neural network to count in binary (something that can be easily executed on a computer), then in theory it is possible to execute programs/applications/games, hell, even a whole operating system with a neural network (when its max error percentage is low enough) and be able to run it on a different device (if that device can handle executing the network).
@marinellovragovic1207 8 years ago
Take Super Mario Bros. for the NES as an example. You've got to have a training set, and now it depends: if you want accuracy in the output (so glitches may occur too), then you need every possible RAM state the console can have while executing the game (a cycle-accurate NES emulator can generate the savestates to train the net on). If you just want the basic concept of the game itself (only everything that was intended to be in the game, so no glitches or bugs), then you can't use every piece of information in RAM (the only useful things would be the remaining lives, the random number generator, and maybe the timer, so it doesn't need to be taught to interpret numbers). The network can find out the rest through visual input. And this input has to be taken from someone playing the game and showing enough while doing so, so the network has enough training data.
@marinellovragovic1207 8 years ago
The second training method, however, seems not too convenient, as the training data is heavily based on video footage, which takes up too much storage space. And this footage has to be, if possible, raw footage without compression, which is even more data to deal with.
@marinellovragovic1207 8 years ago
Then you would most likely focus on the first training method. The neural network itself would be a recurrent one, because the output of a game is frame-wise and thus a sequence of data. The result would be an RNN that takes the input of the controller buttons and (every frame after the first) the current RAM state. The output would be the image of the game (whose resolution I'm not sure of right now). Then just implement the ANN where you want to run it and voilà! Some people are maybe thinking this is just a normal integrated circuit, which it may be if you used perceptrons. I'm thinking of using sigmoids to be able to shrink the number of hidden layers you'd need.
@marinellovragovic1207 8 years ago
I'm just wondering whether every possible RAM state isn't even bigger than the raw video footage you would need for the second training method, focusing on the concept rather than accuracy... hmmmhhh.......
@ChamuthChamandana 7 years ago
Finally understood.
@wassollderscheiss33 6 years ago
Nicely done! Maybe I'll host a channel like this myself when I'm old :-p
@robthorn3910 8 years ago
Downloaded the backprop.zip to my Linux system. Had to add a line to the Makefile, ALIB = -lm, to get it to compile and run. Very nice. This project raises a question regarding 'deep networks': suppose I wanted to build a network where each unit, instead of being a single perceptron, was an ANN like yours? In other words, stacking this 3-layer design both in depth and height. Any thoughts on how that might work out? Obviously backpropagation would have to ripple through the layers; that would be interesting. Or maybe each sub-layer ANN would just be trained separately to detect certain input patterns.
@RimstarOrg 8 years ago
Thanks. I'll look into adding -lm. I'm not sure about using ALIB though. Isn't that supposed to be for listing .a files? -lm is using the -l option to tell the linker to link in the math library. It sounds like you're looking for something like a convolutional deep network. I've played with them in the 90s but haven't since. Pretty much every deep network these days have convolutional layers in them.
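For reference, a hedged sketch of linking in the math library with the linker's -l option (variable and target names are assumptions, not the project's actual Makefile; the recipe line must start with a tab):

    LDLIBS = -lm

    testcounting: testcounting.o backprop.o
            $(CC) -o testcounting testcounting.o backprop.o $(LDLIBS)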
@tigerresearch2665 7 years ago
This is not a counter, it's an adder: it adds the number 1 to whatever the input is. If you took books from a shelf and handed them to your assistant, whose job is to count them, and every time you hand over a book the assistant asks you "How many books have we had so far?", then takes your answer and adds one to it... would you say your assistant is counting the books for you?
@RimstarOrg 7 years ago
I hadn't considered it might be an adder rather than a counter, but thinking on it, I'd still call it more of a counter than an adder. What we're doing is giving it a number and asking what comes next. The 111 to 000 case still works for a counter since a rotary counter also wraps around like that (www.surplussales.com/shafthardware/ShaftH-3.html).
@zakeryclarke2482 7 years ago
RimstarOrg counting is adding by 1, why is this an argument 😂
@rylaczero3740 7 years ago
Really, is the glass half full or half empty?
@voidisyinyangvoidisyinyang885 4 years ago
@@zakeryclarke2482 The machine isn't thinking - ask Sir Roger Penrose. Calculating isn't consciousness. So the machine is learning an output that the human has already set up for it to learn. So it's not real learning, and thus not really adding. It's calculating.
@waldemarleise7201 8 years ago
Thanks for the video. I am quite new to programming and I have an issue with saving the net. What I do is, right before your bkp_destroy_network(net); in the main function, I call:

    if (bkp_savetofile(net, "savedData.txt") == -1)
        perror("File not saved");

Everything runs without any errors, but the content of the file is unreadable. Any tips? Thanks again.
@RimstarOrg 8 years ago
The file isn't in a readable format. However, I just did some testing and realized that before I made this library available I introduced a bug into the bkp_loadfromfile() function so you wouldn't be able to load it back in afterwards anyway. I'll fix this in the next day or two and upload a new version. I'll let you know when it's there.
@RimstarOrg 8 years ago
Okay. The fixed version is now tested and on the website (rimstar.org/science_electronics_projects/backpropagation_neural_network_software_3_layer.htm). backprop.c changed since that contains the code for the bkp_savetofile() and bkp_loadfromfile() functions. Note that I changed the file format slightly, adding a file type and the neural network type to the beginning of the file to help with backward compatibility in the future. That means if you've saved one now, you'll have to re-save it with the new code to produce a file in the new format. Let me know if that's a problem. backprop.h did not change. I also added examples of loading and saving to testcounting.c (or main.c if you're using the CodeBlocks version), though it's pretty much what you've already done. The documentation on the webpage also explains it more and contains the examples. Thanks for bringing this up.
@waldemarleise7201 8 years ago
RimstarOrg awesome, thanks. Whoop whoop :)
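For others wanting to do the same, a hedged usage sketch based on this thread: bkp_savetofile() is called as the commenter showed, and bkp_loadfromfile() is assumed here to take the same (network, filename) arguments; check the documentation on the webpage for the exact signatures.

    /* Fragment only: net is assumed to have been set up earlier with
     * bkp_create_network() and trained. */
    if (bkp_savetofile(net, "savedData.txt") == -1)
        perror("File not saved");

    /* ... later, to restore the trained network ... */
    if (bkp_loadfromfile(net, "savedData.txt") == -1)
        perror("File not loaded");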
@daihesham2302 6 years ago
Very helpful, thank you.
@nand3kudasai 8 years ago
Awesome!!! Coding C in WordPad, that's hardcore. Nice explanation, great video! I loved it. So much work!!! You even made it in JavaScript, that's really awesome. Try uploading it to Bitbucket or Git.
@RimstarOrg 8 years ago
+Jerónimo Barraco Mármol Thanks! I haven't done anything in a while that was complex enough to need an IDE, or even a debugger. But in helping +André Gasoli in the comments here he made me aware of CodeBlocks and I might become hooked on it for a while.
@nand3kudasai 8 years ago
RimstarOrg I seriously recommend QtCreator, I love it.
@RimstarOrg 8 years ago
+Jerónimo Barraco Mármol Thanks. I'll have a look.
@JGunlimited 8 years ago
+RimstarOrg Can I recommend Sublime Text! It's exceptionally exquisite to use.
@RimstarOrg 8 years ago
+JG Oh, neat. I just watched their demo on their front page. Lots of cool features there. Loving that preview on the right-hand side. Thanks!
@quaerenz 8 years ago
Thanks!
@kirandeepsingh9144 8 years ago
Your website is not opening. Is there a problem?
@RimstarOrg 8 years ago
I just tried all the links in the video description and they all worked. Maybe there was a temporary problem. If it's still happening let me know where you're clicking and I can see if the link is bad. For the neural network page, here is the link rimstar.org/science_electronics_projects/backpropagation_neural_network_software_3_layer.htm
@MultiJoniboy 7 years ago
What's the activation function at 4:05? How is it defined, and what does it do? Please help.
@RimstarOrg 7 years ago
The code used for this counting network uses (1.0 / (1.0 + exp(-x))). One thing it does is make the value lie between 0 and 1. I've heard of other reasons for it, one being that if you didn't do it then the whole thing would be linear, i.e. there would be many things the neural network couldn't learn. The activation function adds non-linearity. I'm still trying to wrap my head around that explanation. These days there are other formulas used too, a very popular one being to simply set any values that are less than zero to zero and let anything above zero go through unchanged.
@MultiJoniboy 7 years ago
Thanks for the detailed answer :)
@pcred567 7 years ago
RimstarOrg Yes, the activation function is for the purpose of non-linearity. If you chose to use no activation function at all (i.e. the identity function), you could collapse the entire network into one layer for essentially the same reason that the linear sum of several linear functions is another linear function (actually in this case more specifically it's the product of several matrices of linear functions being equal to a single matrix). The linear combination of a set of (good) activation functions is, however, basically irreducible. The formula you speak of that uses what is basically a high-pass filter is called the Rectified Linear Unit (ReLU). The only place I'd seen them used before was in convolutional nets. I didn't know this before looking into it, but evidently it has some very useful properties of nonlinearity (since it's mostly a linear function the gradient is constant above zero and it's harder to get stuck in a local minimum, it is scale-invariant, etc.), in addition to being extremely simple to compute.
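The two activation functions discussed in this thread, as a minimal C sketch (the sigmoid is the formula quoted above; ReLU is the "zero below zero" rule):

    #include <math.h>

    /* Sigmoid: squashes any input into (0, 1); used by this network. */
    double sigmoid(double x)
    {
        return 1.0 / (1.0 + exp(-x));
    }

    /* ReLU: 0 below zero, unchanged above; popular in newer networks. */
    double relu(double x)
    {
        return x > 0.0 ? x : 0.0;
    }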
@Temporelucis 7 years ago
Thanks a lot!
@priyankalokhande4916 7 years ago
Hello sir, your video on backpropagation was so helpful. I have a doubt regarding the activation function. How do we decide which activation function we should use for a network, and how does it work in backpropagation?
@RimstarOrg 7 years ago
I wrote an article for Hackaday (hackaday.com/2017/04/11/introduction-to-tensorflow/) where I rewrote this binary counting neural network in Tensorflow. Search through it for where I talk about relu, sigmoid and softmax. In the article, you'll see that I found that the relu activation function worked much better for the hidden units, instead of sigmoid. But I found that just by testing them both. I did have to set the initial values for the weights to a suitable value to get that to work though. For the activation for the output units, I went with sigmoid instead of relu or softmax. This is also explained in the article. Basically, for this binary counter the outputs should be between 0 and 1 and sigmoid does that, whereas relu just sets anything below 0 to 0 and leaves everything above 0 alone. Softmax was no good for the output because softmax is for doing classification i.e. you want one or a few output units to fire more strongly than all the others, maybe because they represent the types of object in an image. But with the binary counting neural network, all output units were equally important and so I used sigmoid. There's also tanh, which I don't get to, which sets the output to between -1 and 1. So which one you use really depends a little on trial and error but also on what your neural network is trying to do.
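For completeness, a hedged textbook sketch of the softmax mentioned above (not code from the article): exponentiate each output and normalize so they sum to 1, which suits classification where one unit should stand out.

    #include <math.h>

    /* Softmax over n output values: results are positive and sum to 1. */
    void softmax(const double *in, double *out, int n)
    {
        double total = 0.0;
        for (int i = 0; i < n; i++) {
            out[i] = exp(in[i]);
            total += out[i];
        }
        for (int i = 0; i < n; i++)
            out[i] /= total;
    }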
@quentinquadrat9389 6 years ago
Hi @RimstarOrg, nice work and well explained. Nothing wrong, but just to inform you that in your code you are mixing float and double literals: 1.0 is a double while 1.0f is a float. Maybe I'm wrong, but intermediate computations are made in double and then cast to float, instead of being done directly in float.
@RimstarOrg 6 years ago
Thanks! I'll look into it.
@compilatopia 8 years ago
I really thank you for this video.
@RimstarOrg 8 years ago
You're very welcome. It's a fascinating area.