Gradient Boosting and XGBoost in Machine Learning: Easy Explanation for Data Science Interviews

  Рет қаралды 36,353

Emma Ding

Emma Ding

Күн бұрын

Пікірлер: 72
@emma_ding
@emma_ding Жыл бұрын
Many of you have asked me to share my presentation notes, and now… I have them for you! Download all the PDFs of my Notion pages at www.emmading.com/get-all-my-free-resources. Enjoy!
@SanuSatyam
@SanuSatyam Жыл бұрын
Thanks a lot. Can you please make a video on Time Series Analysis? Thanks in Advance!
@anand3064
@anand3064 8 ай бұрын
Beautifully written notes
@PhucHoang-ng4vh
@PhucHoang-ng4vh 10 ай бұрын
just read out loud, no explanation at all
@emmafan713
@emmafan713 Жыл бұрын
I am confused about the notation, so h_i is a function to predict r_i and r_i is the gradient of the loss function w.r.t the last prediction F(X). so h_i should be similar to r_i why h_i is similar to gradient of r_i
@Heinz3792
@Heinz3792 6 ай бұрын
I believe there is an error in this video. r_i is the gradient of the loss function w.r.t. the CURRENT F(X), i.e. F_i(X). The NEXT weak model h_i+1 is then trained to be able to predict r_i, the PREVIOUS residual. Alternatively all this could be written with i-1 instead of i, and i instead of i+1. TLDR: Emma should have called the first step "compute residual r_i-1", not r_i. And in the gradient formula, she should have written r_i-1.
@jennyhuang7603
@jennyhuang7603 Жыл бұрын
For 5:10, why the MSE delta r_i is Y-F(X) instead of 2*(Y-F(X))? or is the coefficent doesn't matter?
@zhenwang5872
@zhenwang5872 Жыл бұрын
I usually watch Emma's video when I doing revision.
@annialevko5771
@annialevko5771 11 ай бұрын
Hi! I have a question, how does the parallel tree building work? Because based in the gradient boosting it needs to calculate the error from the previous model in order to create the new one, so I dont really understand in which way is this parallelized
@shashizanje
@shashizanje 6 ай бұрын
Its parallelized in such a way that , during formation of tree , it can work parallel....means it can work on multiple independent features parellaly to reduce the computation time....suppose if it has to find root node, it has to check information gain of every single independent feature and then decide which feature would be best for root node...so in this case instead of calculating information gain one by one, it can parallely calculate IG of multiple features....
@MahdiShayanNasr
@MahdiShayanNasr 4 ай бұрын
An excellent video
@aaronsayeb6566
@aaronsayeb6566 2 ай бұрын
there is a mistake in the representation of algorithm. the equation for ri, L(Y, F(X)), and grad ri = Y-F(X) can't hold true at the same time. I think ri= Y-F(X) and grade ri should be something else (right?)
@kandiahchandrakumaran8521
@kandiahchandrakumaran8521 4 ай бұрын
Excellent video Many thanks. Could you kindly make a video for time to event with survival SVM, RSF, or XGBLC?
@jet3111
@jet3111 Жыл бұрын
Thank you for the very informative video. It came up at my interview yesterday. I also got a question on time series forecasting and preventing data leakage. I think it would great to have a video about it.
@Leo-xd9et
@Leo-xd9et Жыл бұрын
Really like the way you use Notion!
@emma_ding
@emma_ding Жыл бұрын
Thanks for the feedback, Leo! I tried out a bunch of different presentation methods before this one, so I'm glad to hear you're finding this platform useful! 😊
@elvykamunyokomanunebo1441
@elvykamunyokomanunebo1441 Жыл бұрын
Hi Emma, I'm struggling to understand how to build a model on residuals: 1) Do I predict the residuals and then get the mse of the residuals? What would be the point/use of that? 2) Do I somehow re-run the model considering some factor that focuses on accounting for more of the variability e.g. adding more features(important features) which reduce mse/residual? Then re-running the model adding a new feature to account for remaining residual until there is no more reduction in mse/residual?
@poshsims4016
@poshsims4016 Жыл бұрын
Ask Chat GPT every question you just typed. Preferably GPT-4
@Heinz3792
@Heinz3792 6 ай бұрын
It's important to understand what the residual is. The residual is a vector giving a magnitude of the prediction error AND the direction, i.e. the gradient. Thus, regarding your questions: 1) we predict the residual with a weak model, h, in order to know in what direction to move the prediction of the overall model F_i(X) so that it is reduced. We assume h makes a decent prediction, and thus we treat it like the gradient. 2) we then calculate alpha, the regulation parameter, in order to know HOW FAR to move in the direction of the gradient which h provides. I.e., how much weight to give model h. Minimizing the loss function gives us this value, and keeps us from over or undershooting the step size.
@objectobjectobject4707
@objectobjectobject4707 5 ай бұрын
Okay subscribed !
@wallords
@wallords 10 ай бұрын
How do you add L1 regularization to a tree???
@ermiaazarkhalili5586
@ermiaazarkhalili5586 Жыл бұрын
Any chance to have slides?
@NguyenSon-ew9wn
@NguyenSon-ew9wn Жыл бұрын
Agree. Hope to have that note
@emma_ding
@emma_ding Жыл бұрын
Yes! Download all the PDFs of my Notion pages at emmading.com/resources by navigating to the individual posts. Enjoy!
@nihalnetha96
@nihalnetha96 3 ай бұрын
is there a way to get the notion notes?
@MacadamMarcus-y1x
@MacadamMarcus-y1x 2 күн бұрын
Miller Christopher Clark Brian Hernandez Daniel
@RichardLentz-g5k
@RichardLentz-g5k 2 күн бұрын
Clark Maria Lopez Gary Lopez Christopher
@riswandaayu5930
@riswandaayu5930 11 ай бұрын
Hallo Miss, thankyou for the knowledge, Miss can I request your file in this presentation ?
@ellazeaustin1404
@ellazeaustin1404 Күн бұрын
Rodriguez Timothy Taylor Christopher Perez Gary
@JamesaGray-b1l
@JamesaGray-b1l Күн бұрын
Lee Angela Garcia Richard Walker William
@MacadamMarcus-y1x
@MacadamMarcus-y1x 13 күн бұрын
Jones William Martinez Melissa Brown Patricia
@SrabontyAfrin-f8p
@SrabontyAfrin-f8p 2 күн бұрын
Smith Mary Clark George Johnson Ronald
@BuckleLynn-v7s
@BuckleLynn-v7s 13 күн бұрын
Jackson Carol Smith Eric Lopez Kenneth
@MillDonald-g3p
@MillDonald-g3p 13 күн бұрын
Thomas Jessica Harris Cynthia Wilson Timothy
@khguhxjddVudhijo
@khguhxjddVudhijo 4 күн бұрын
Lopez Jennifer Clark Joseph Martin Matthew
@patrickgilbers3160
@patrickgilbers3160 8 күн бұрын
Clark Kevin Taylor Sandra Lee Thomas
@BurneJonesClaire-b1v
@BurneJonesClaire-b1v 13 сағат бұрын
Perez Paul Williams Donna Williams Paul
@EdwardHernandez-l4z
@EdwardHernandez-l4z 14 күн бұрын
Thomas Scott Lewis Paul Lewis George
@RanjsaBiswase
@RanjsaBiswase 17 күн бұрын
Thomas Kevin Brown Sarah Martin Thomas
@LisaRamos-y4s
@LisaRamos-y4s 4 күн бұрын
Jones Matthew Brown Jennifer Williams Angela
@richardpaterson3888
@richardpaterson3888 9 күн бұрын
Garcia Sandra Davis Jose Walker Jennifer
@ExGuess-w1i
@ExGuess-w1i 13 күн бұрын
Anderson Mary Clark Richard White Nancy
@GraceElla-f2x
@GraceElla-f2x 18 күн бұрын
Lee Ruth Martinez Robert Jones Linda
@SimonMaria-y7r
@SimonMaria-y7r 8 күн бұрын
Smith Lisa Rodriguez Eric White Kenneth
@rosalindwalkman5742
@rosalindwalkman5742 6 күн бұрын
Lewis David Thomas David Davis Deborah
@WendellSaddler
@WendellSaddler 16 күн бұрын
Anderson Deborah Young John Davis Laura
@JustinGreek-b4g
@JustinGreek-b4g 7 күн бұрын
Lewis Mary Gonzalez Linda Moore Barbara
@EisenhowerMarshall-q4j
@EisenhowerMarshall-q4j 5 күн бұрын
Clark Paul Harris Karen Moore Shirley
@DawnIngram-e8x
@DawnIngram-e8x 17 сағат бұрын
Lee Amy Young Donna White Sarah
@MaxPenelope-w4j
@MaxPenelope-w4j 26 күн бұрын
Anderson Betty White Steven Smith Gary
@PitmanBoyd-m9z
@PitmanBoyd-m9z 7 күн бұрын
Johnson Amy Anderson Kevin Miller Susan
@HarrodQueena-p8o
@HarrodQueena-p8o 10 күн бұрын
Lee Barbara Lewis Helen Clark Christopher
@BettyBarry-u2m
@BettyBarry-u2m 12 күн бұрын
Davis Linda Davis Sarah Gonzalez Dorothy
@Susan-l5n7d
@Susan-l5n7d 11 сағат бұрын
Taylor Elizabeth Taylor Mary Davis Michelle
@JeffreyWest-p2u
@JeffreyWest-p2u 3 күн бұрын
Taylor Donald Lee Anna Taylor Mary
@faisalsal1
@faisalsal1 7 ай бұрын
She just read the text with zero knowledge about the content. U no good.
@TnikusaMoni-i7p
@TnikusaMoni-i7p 5 күн бұрын
Wilson Angela Rodriguez Sharon Brown Ronald
@MimiKnight-n7u
@MimiKnight-n7u 11 күн бұрын
Moore William Clark Christopher Wilson Ruth
@ShirleyWheeler-j2o
@ShirleyWheeler-j2o 8 күн бұрын
Robinson William Hernandez Dorothy Anderson Jason
@KarenPerez-w9d
@KarenPerez-w9d 19 күн бұрын
Taylor Jeffrey Moore Cynthia Lopez Anthony
@jacquelinecroftoon2071
@jacquelinecroftoon2071 2 күн бұрын
Miller Helen Garcia Charles Gonzalez Sarah
@TruckJob-t5h
@TruckJob-t5h 4 күн бұрын
Hernandez Jessica Lee Barbara Davis Anna
@CharlesJackson-i7q
@CharlesJackson-i7q 26 күн бұрын
Jackson Kimberly White Timothy Thompson Gary
@harrisongibbse7663
@harrisongibbse7663 8 күн бұрын
Jones Jessica Brown David Hernandez Michael
@StoweLauren-t1c
@StoweLauren-t1c 15 күн бұрын
Martinez Jessica Lee Donna Young Richard
@agathakeatase9243
@agathakeatase9243 4 күн бұрын
Gonzalez Shirley Walker Brian Harris James
@GuntherPatrick-o3u
@GuntherPatrick-o3u 14 күн бұрын
Lee David Lewis Matthew Harris Shirley
@marisafraser3404
@marisafraser3404 4 күн бұрын
Lopez Mark Walker Larry Wilson Donald
@marisafraser3404
@marisafraser3404 14 күн бұрын
Rodriguez Michelle Harris Paul Lewis William
@LewisHughes-d2o
@LewisHughes-d2o 4 күн бұрын
Martinez Barbara Miller John Davis Mark
@SaimirBogdani-t6j
@SaimirBogdani-t6j 9 күн бұрын
Gonzalez Jason Brown Barbara Hall Ronald
@Melissa-zi4xq
@Melissa-zi4xq 12 сағат бұрын
Hall Robert Taylor Carol Brown Shirley
Gradient Boosting : Data Science's Silver Bullet
15:48
ritvikmath
Рет қаралды 61 М.
АЗАРТНИК 4 |СЕЗОН 3 Серия
30:50
Inter Production
Рет қаралды 947 М.
Girl, dig gently, or it will leak out soon.#funny #cute #comedy
00:17
Funny daughter's daily life
Рет қаралды 42 МЛН
Angry Sigma Dog 🤣🤣 Aayush #momson #memes #funny #comedy
00:16
ASquare Crew
Рет қаралды 50 МЛН
Colorado Energy Office (CEO) Annual Vendor Training - Fall 2024
1:05:33
Colorado Energy Office
Рет қаралды 4
Boosting - EXPLAINED!
17:31
CodeEmporium
Рет қаралды 50 М.
Gradient Boosted Trees for Classification Explained
11:32
Mazen Ahmed
Рет қаралды 1,5 М.
ML Was Hard Until I Learned These 5 Secrets!
13:11
Boris Meinardus
Рет қаралды 301 М.
XGBoost: How it works, with an example.
15:30
Sundog Education with Frank Kane
Рет қаралды 96 М.
All Machine Learning algorithms explained in 17 min
16:30
Infinite Codes
Рет қаралды 10 М.
681: XGBoost: The Ultimate Classifier - with Matt Harrison
1:09:56
Super Data Science: ML & AI Podcast with Jon Krohn
Рет қаралды 5 М.
АЗАРТНИК 4 |СЕЗОН 3 Серия
30:50
Inter Production
Рет қаралды 947 М.