it seems needlessly complicated. All the rescaling is unecessary for tree models. I am also not sure that because there is 0.3 correlation between two one hot market variables that we should remove market_id altogether. I didn't even follow what was happening with the PCA. Were the PCA variables even considered important to the model? You threw out some engineered features that might have been useful, and then put them back in the end? You might have thought of demand / supply ratio in terms of (total dashers - busy dashers)/(total open orders). And just because two variables are quite correlated doesnt mean their difference or their quotient wont also be useful. If i were doing this problem I would choose one robust baseline model like random forest, much less likely to overfit than boosted models. Time permitting i may try something more exotic, to compare with the baseline. And you don't need to worry about scaling, as i said, with trees. Also - when you have high cardinality categorical features, i would just choose the top 10 in terms of frequency and bucket the others - then do OHE. Or switch them to target encoding. I have to say all the stuff with VIF i have never seen before, I wonder did it actually help? How bad are your results if you just choose the top 10 most important features as decided by random forest and run the model again with just those features. Forget PCA and VIF. So what is your ultimate RMSE? something like 1000 seconds? I got <900 but i was working with a dataset where i junked most of the Nans and dupes. and extreme values, e.g. times over 2 hours. So i was left with ~90% of the original data in my training. Perhaps that is how I got a lower rmse. In the end i used only the top 7 features. (No PCA or VIF). You should talk more about the result in terms of how feasible it is. 18mins on average is not bad i guess, given the huge variability in execution time this three way marketplace can produce. Its a lot of moving parts. Anyway, i am not sure what new folks will think of this exposition. I think it will scare them, and that. is a shame. Being a DS isn't just about showcasing how many techniques you know, sometimes you take a breath and close your eyes before just diving in.
@XxTheRaksoBHAxX10 күн бұрын
Great video, learnt a lot but when i run the code the files are correctly moved to datasets but they also still exist in the original directory. Wouldn't we have to update the directory that csv_files() looks in as they are now stored in datasets instead?
Hello, sharing the links to the articles cited in the video, will help to get the most of this .
@chiuvinson5035Ай бұрын
great video! thank you so much! my results are showing only 50 rows of the df but the channel has over 200 videos. how can I get all of the others past the first page? i feel like it has to do with the pageToken maybe?
@dima8832Ай бұрын
Very useful! Thank you for sharing the process
@shubhamgattani5357Ай бұрын
Toooo many GIFs..please keep the video simple. Have some mercy on my eyes 🙂
@shubhamgattani5357Ай бұрын
Too many GIFs in this video....was very tough on my eyes!
@yalichacham9977Ай бұрын
So essentially learn ai and how to organize data for it?
@yalichacham9977Ай бұрын
Do you think people who code will be replaced by ai?
@technophilecorner72852 ай бұрын
you explain everything so well!! grreat job!!!
@ThangTatNaoNguyenHuuTri2 ай бұрын
Lol. Commercial products turn the whole academic field into trash. You gotta have the commercial mind, then.
@fazdatasciencetechcodingml56622 ай бұрын
Game changer
@gupsau2 ай бұрын
I am missing Nate's explanations. Please send me playlist of vidoes only by Nate on SQL.
@thequiickbrownfox2 ай бұрын
excellent excellent tutorial!
@crusader83312 ай бұрын
Guys leave data science for us buisness people. You computer science grads make software. You have studied to make tools not use tools.
@martinleung88052 ай бұрын
this is hilarious thanks for the laugh
@dmax93242 ай бұрын
You are very talented at teaching and explaining everything with no bloat while also not assuming anything about your audience. It was an excellent series, and I have learned a lot. Thank you very much!
@stratascratch10 күн бұрын
Appreciate it!
@StEvUgnIn2 ай бұрын
For someone who lives in Continental Europe, I am experimenting a different reality where everything that counts to recruiters is the credential and the level of experience of the candidate. I feel they should test the ability to write codes more instead of basing their choice on their gut.
@cuicuidev3 ай бұрын
Skill issue
@cobana47303 ай бұрын
the doom posting is crazy
@RitwikDandriyal3 ай бұрын
This is something I relate to so much! Spent the last year interviewing with 10 different companies and every single one is a completely different experience where people don't know what they're looking for. The worst, and the most frustrating experience I had was with this company that's into electronic equipment manufacturing that was setting up a new ML team. The title of the role had everything in it (Data Scientist, Software Engineer, R&D Engineer). I was really excited for the role and pretty much aced all the screening and ML rounds with ML researchers and PHDs. Everything was set in place and I was 100% sure of getting the offer (the panel was also confident about me), until this "Software Engineering" final round spawned out of nowhere. I have never done much leetcode in my life as a Data Scientist, and absolutely bombed the interview as I was asked a backtracking question that I had no idea of how to solve. Till this day, this has been the most frustrating experience ever.
@AshishBangwal3 ай бұрын
hey ritwik, was it a south-asian company, and do you have research exp or PHD?, i am in pre-final so just looking for simple advise on how to break into the ML field from experienced peeps just like you, Thanks.
@spektriye3 ай бұрын
bro think he fireship
@sankhuz3 ай бұрын
But everything is now just AI
@sitrakaforler86963 ай бұрын
hahaha YES. Bloody hard but yeah...
@sebamango20943 ай бұрын
Whats the solution then?
@josephmargaryan3 ай бұрын
I had to go through 5 rounds of interviews. In the final interview, they asked me to prove Poincare conjecture, and I just happened to know how to do it, and that's why I got the job. Ever since I got the job I have been asked to clean Excel sheets and provide summary reports to the head of sales
@vunguyen22463 ай бұрын
Omg
@teistensean72273 ай бұрын
Lol wtf
@divinefavour12893 ай бұрын
are you for real?
@josephmargaryan2 ай бұрын
@@divinefavour1289 Yes. During my first interview, I talked with a guy with a computer science PhD. Then, my following interview was with a guy holding a PhD in biomedical engineering. Then I talked with the head of sales and some guy with a PhD in nanoscience. Then, my last interview was with my current manager, who holds a PhD in math from Oxford. No kidding. The head of sales is butchering me because I shifted the numbers from the Excel files. They were unstructured…
@PickleNoDie3 ай бұрын
had an interview for a software engineer job the other day, bluntly asked the guy about the really dodgy Glassdoor reviews.... told me the place is a hellscape and that 10 people have quit in the last 2 months... then asked me if I was ready to continue.... think we are done here lol ( though I really appreciated his honesty)
@rashim3 ай бұрын
Requirements of DSA, Devops, System Design is also an issue for other specializations. The whole Computer Science interview process is f*cked up
@viswesz3 ай бұрын
thanks for addressing this issue, was struggling for years
@awesomeGuss3 ай бұрын
REAL man i wish i saw this before I got sold on the hype for real
@andiuptown17113 ай бұрын
Just do swe
@mdreid3 ай бұрын
Pronounce “Kubernetes” wrong? Rejection.
@amyrlexiion46883 ай бұрын
When everyone thought future of AI is AGI, its actually API
@vyas13 ай бұрын
This is too funny 🤣
@enghimanshu3 ай бұрын
so what should i study ...... currently doing ml and i know web dev and devops
@aymenoulmi76723 ай бұрын
that's 3, 14 to go good luck am switching to data engineering before i know more of these
@PKperformanceEU3 ай бұрын
75k😂😂 3x it and i may consider it
@hwang16073 ай бұрын
Ai generated voice
@stratascratch3 ай бұрын
Yup. I'll be the first to admit. It makes it much easier to produce videos so I can focus more on content
@tablettablete1863 ай бұрын
@@stratascratchWhich program do you use? I am thinking about using Tortoise TTS
@Spectacurl3 ай бұрын
I manage to land an amazing job that I actually like, but my last interview was a fucking topology problem… You know, the math field used in relativity. If I wasn’t a physicist that just happened to re read a topology book just because, I wouldn’t pass that interview. Why topology? Why not?
@jothamprince87653 ай бұрын
🤣🤣 wtf ??
@vincentadultman62263 ай бұрын
Wtf is going on? Why did they even expect you to know that? Are you a PhD?
@Nnm263 ай бұрын
even the fucking script is chatgpt generated, this shit is generated entirely by ai
@stratascratch3 ай бұрын
No the script is not AI generated. Just the voice. It makes it easier to produce so I can focus on the content.
@xyzabc123-o1l3 ай бұрын
question, as this is what im currently doing-- im working a normal basic programming job to pay my bills while also working on solving my own problems using my own ML(and otherwise) solutions, in hopes of starting my own business, specifically to avoid all of the problems youve described
@goodboy41293 ай бұрын
i was asked to explain the working of a transformer model for an internship position in india they were expecting to pay me 250 dollars per month😭
@Bhavishya_est3 ай бұрын
Purchasing Power Parity
@IarukaSkYouk3 ай бұрын
jesus fuck man. no wonder why you guys all study like mad. the competition and level of rejection is so high
@srikrishna25613 ай бұрын
That's great for an Internship right ? (In India)
@aliensamv39973 ай бұрын
@@srikrishna2561 true i did my internship for free
@goodboy41293 ай бұрын
@@srikrishna2561 Not really it required me to shift to another city The rent and transport alone will take up that much, security guards earn that much but don't need a college degree for it 😄
@dhillaz3 ай бұрын
Any unicorn with all these skills doesn't apply to employers, they apply to *investors*
@alexisdamnit90123 ай бұрын
As a Sr ML Engineer and AI engineer I can confirm this interview process is exhausting to the core. Once you hit 30, you’re already over the grind and you just want to enjoy work life balance - so studying for weeks on end for a stupid interview is really exhausting
@martindbp3 ай бұрын
Try it's with small kids as well...
@my_study_channel-po4bn3 ай бұрын
For fresher ml engineer they need SQL, ML ops, Pipelines, spark, pytorch, tensorflow, keras powerbi or tableau, Azure,gcp or aws or everything, Excel , statics and maths, NLP , python or R or both . Now LLM with some magic also needed
@awesomeGuss3 ай бұрын
you forgot to add ML theory too
@quickpert13823 ай бұрын
@@awesomeGuss or fking darknet. I still do not understand why they ask for tensorflow, it is a red flag. Also he forgot bash.
@e4043 ай бұрын
ML is the new fullstack
@LuccaCedeño-p7m3 ай бұрын
So true
@harshtripathi41133 ай бұрын
Not just full, but an overflown stack.
@YouHaveToLoveMe3 ай бұрын
It's like our universe, it's expanding like anything every single day
@quickpert13823 ай бұрын
The fullstack with PhD
@jimenezluis333 ай бұрын
@@quickpert1382 pHD to do any stupid non sense job in the company or fill spreadsheets or rearrange the disasters of someone else
@lex4943 ай бұрын
I was able to land a junior data science position this year, after working in business intelligence for 2 years before. I was promised building data pipelines, forecasting, python etc. just to end up doing power bi off sql queries for non tech execs and sales people who tbh don’t even bother using them
@EobardUchihaThawne3 ай бұрын
u mean ur work is mostly about data prep?
@teistensean72273 ай бұрын
Probably reality thou, unless he work in the Silicon Valley but majority of company still use excel, power bi, tableau too i supposed
@parl81503 ай бұрын
I mean, the field was started by researchers and mathematicians. The knowledge built up is really huge here. No wonder half of the positions are for phd’s.
@elgatodelamuerte3 ай бұрын
You missed the point - the experience is irrelevant if it's not going to be put to use. What's the point in hiring a ph.D only for him to use Excel and PowerBI
@parl81503 ай бұрын
@@elgatodelamuerteI meant, that a historical kind of momentum is here. The bigtech hires Phds for actual complex work, and all of the other companies be like “they hire phds for this, we will also do that”
@fr52293 ай бұрын
@@elgatodelamuerte warehousing talent
@doofus82 ай бұрын
@@elgatodelamuerteno phd is working with excel or powerBI .... what world are you living in... only undergrads get hired for those jobs... I can even once consider for masters guy but definitely not phd lol
@developerashish68493 ай бұрын
This is the most relatable video ever. I wish i would've have chosen another field. Interviewers expect everything man, i have been rejected so many times because i didnt know something which is no where related to the Job description. I recently gave an interview for NLP engineer and they asked something related to computer vision which i used to work on 3 years ago, i obviously have forgot many things so couldn't answer it, and they rejected me. They somehow thinks that a ML Engineer should somehow know everything related to AI + software engineering + DSA. Man i m tired of trying to crack this field.
@Xnozea3 ай бұрын
You guys are getting interviews? Best I can get is application rejection e-mail. Man I gave 4 years for application rejections.
@fr52293 ай бұрын
You learned the entire modern stack + ML in 4 years?
@Xnozea3 ай бұрын
@@fr5229 No. Not the entire SWE stack. I am an economics graduate(have econ masters too). I came from data analysis route and started with R. Because of my background ML and Statistical Modelling was kinda easy. Then I started to learn SQL and Python. And then I learned data engineering and data science tools for GCP(such as BigQuery, VertexAI, Dataproc, Cloud Run for Streamlit and Shiny applications yada yada). I also used all of these as either learning projects or you know something a little bit more advanced projects. Solved many questions on StrataScratch. I did lots of web scraping and EDA and ML stuff in my free time because they were fun really. Learning process lasted like a little more than 3 years. I was also busy with my econ masters classes and thesis. Last year I wasted my time with a shitty Business Analytics internship(for 8 months) where you only needed Excel. I did my work with pandas there I only used Excel for collobration. Even that internship wasnt enough I guess.
@JamesGelok3 ай бұрын
@@fr5229 don't think Xnozea said that
@Xnozea3 ай бұрын
@@fr5229 Oh man I gave you a lengthy answer and KZbin removed it lol. Long story short. I learned modern ML stack(r, python,sql, relevant GCP services from data engineering to application deployment) minus traditional SWE stack (since I am an Econ graduate. I also have econ masters.) within 3 years(during my masters). Last year I wasted my time with a bad Business Analytics internship where Excel was sufficient to do the work.Thats why I said 4.
@Xnozea3 ай бұрын
@@fr5229 bro I tried to answer your question 2 times. KZbin keeps removing them.The answer was 3. Then I found a bad internship.