Full stack data science tools
8:27
Пікірлер
@shraddhadeshmukh708
@shraddhadeshmukh708 5 күн бұрын
The vedio i was looking for.Thank you so much .
@gabrieldurkin7366
@gabrieldurkin7366 10 күн бұрын
it seems needlessly complicated. All the rescaling is unecessary for tree models. I am also not sure that because there is 0.3 correlation between two one hot market variables that we should remove market_id altogether. I didn't even follow what was happening with the PCA. Were the PCA variables even considered important to the model? You threw out some engineered features that might have been useful, and then put them back in the end? You might have thought of demand / supply ratio in terms of (total dashers - busy dashers)/(total open orders). And just because two variables are quite correlated doesnt mean their difference or their quotient wont also be useful. If i were doing this problem I would choose one robust baseline model like random forest, much less likely to overfit than boosted models. Time permitting i may try something more exotic, to compare with the baseline. And you don't need to worry about scaling, as i said, with trees. Also - when you have high cardinality categorical features, i would just choose the top 10 in terms of frequency and bucket the others - then do OHE. Or switch them to target encoding. I have to say all the stuff with VIF i have never seen before, I wonder did it actually help? How bad are your results if you just choose the top 10 most important features as decided by random forest and run the model again with just those features. Forget PCA and VIF. So what is your ultimate RMSE? something like 1000 seconds? I got <900 but i was working with a dataset where i junked most of the Nans and dupes. and extreme values, e.g. times over 2 hours. So i was left with ~90% of the original data in my training. Perhaps that is how I got a lower rmse. In the end i used only the top 7 features. (No PCA or VIF). You should talk more about the result in terms of how feasible it is. 18mins on average is not bad i guess, given the huge variability in execution time this three way marketplace can produce. Its a lot of moving parts. Anyway, i am not sure what new folks will think of this exposition. I think it will scare them, and that. is a shame. Being a DS isn't just about showcasing how many techniques you know, sometimes you take a breath and close your eyes before just diving in.
@XxTheRaksoBHAxX
@XxTheRaksoBHAxX 10 күн бұрын
Great video, learnt a lot but when i run the code the files are correctly moved to datasets but they also still exist in the original directory. Wouldn't we have to update the directory that csv_files() looks in as they are now stored in datasets instead?
@akshitabindal2444
@akshitabindal2444 15 күн бұрын
kzbin.info/www/bejne/qHWWcqx4bqesgZYsi=dkp6s-fp1cWplikb
@bilybak2
@bilybak2 22 күн бұрын
Dude, are you shadow ban??
@jayprakashkakde7767
@jayprakashkakde7767 29 күн бұрын
This is so True.
@alejandromarquezcarrillo9474
@alejandromarquezcarrillo9474 Ай бұрын
Hello, sharing the links to the articles cited in the video, will help to get the most of this .
@chiuvinson5035
@chiuvinson5035 Ай бұрын
great video! thank you so much! my results are showing only 50 rows of the df but the channel has over 200 videos. how can I get all of the others past the first page? i feel like it has to do with the pageToken maybe?
@dima8832
@dima8832 Ай бұрын
Very useful! Thank you for sharing the process
@shubhamgattani5357
@shubhamgattani5357 Ай бұрын
Toooo many GIFs..please keep the video simple. Have some mercy on my eyes 🙂
@shubhamgattani5357
@shubhamgattani5357 Ай бұрын
Too many GIFs in this video....was very tough on my eyes!
@yalichacham9977
@yalichacham9977 Ай бұрын
So essentially learn ai and how to organize data for it?
@yalichacham9977
@yalichacham9977 Ай бұрын
Do you think people who code will be replaced by ai?
@technophilecorner7285
@technophilecorner7285 2 ай бұрын
you explain everything so well!! grreat job!!!
@ThangTatNaoNguyenHuuTri
@ThangTatNaoNguyenHuuTri 2 ай бұрын
Lol. Commercial products turn the whole academic field into trash. You gotta have the commercial mind, then.
@fazdatasciencetechcodingml5662
@fazdatasciencetechcodingml5662 2 ай бұрын
Game changer
@gupsau
@gupsau 2 ай бұрын
I am missing Nate's explanations. Please send me playlist of vidoes only by Nate on SQL.
@thequiickbrownfox
@thequiickbrownfox 2 ай бұрын
excellent excellent tutorial!
@crusader8331
@crusader8331 2 ай бұрын
Guys leave data science for us buisness people. You computer science grads make software. You have studied to make tools not use tools.
@martinleung8805
@martinleung8805 2 ай бұрын
this is hilarious thanks for the laugh
@dmax9324
@dmax9324 2 ай бұрын
You are very talented at teaching and explaining everything with no bloat while also not assuming anything about your audience. It was an excellent series, and I have learned a lot. Thank you very much!
@stratascratch
@stratascratch 10 күн бұрын
Appreciate it!
@StEvUgnIn
@StEvUgnIn 2 ай бұрын
For someone who lives in Continental Europe, I am experimenting a different reality where everything that counts to recruiters is the credential and the level of experience of the candidate. I feel they should test the ability to write codes more instead of basing their choice on their gut.
@cuicuidev
@cuicuidev 3 ай бұрын
Skill issue
@cobana4730
@cobana4730 3 ай бұрын
the doom posting is crazy
@RitwikDandriyal
@RitwikDandriyal 3 ай бұрын
This is something I relate to so much! Spent the last year interviewing with 10 different companies and every single one is a completely different experience where people don't know what they're looking for. The worst, and the most frustrating experience I had was with this company that's into electronic equipment manufacturing that was setting up a new ML team. The title of the role had everything in it (Data Scientist, Software Engineer, R&D Engineer). I was really excited for the role and pretty much aced all the screening and ML rounds with ML researchers and PHDs. Everything was set in place and I was 100% sure of getting the offer (the panel was also confident about me), until this "Software Engineering" final round spawned out of nowhere. I have never done much leetcode in my life as a Data Scientist, and absolutely bombed the interview as I was asked a backtracking question that I had no idea of how to solve. Till this day, this has been the most frustrating experience ever.
@AshishBangwal
@AshishBangwal 3 ай бұрын
hey ritwik, was it a south-asian company, and do you have research exp or PHD?, i am in pre-final so just looking for simple advise on how to break into the ML field from experienced peeps just like you, Thanks.
@spektriye
@spektriye 3 ай бұрын
bro think he fireship
@sankhuz
@sankhuz 3 ай бұрын
But everything is now just AI
@sitrakaforler8696
@sitrakaforler8696 3 ай бұрын
hahaha YES. Bloody hard but yeah...
@sebamango2094
@sebamango2094 3 ай бұрын
Whats the solution then?
@josephmargaryan
@josephmargaryan 3 ай бұрын
I had to go through 5 rounds of interviews. In the final interview, they asked me to prove Poincare conjecture, and I just happened to know how to do it, and that's why I got the job. Ever since I got the job I have been asked to clean Excel sheets and provide summary reports to the head of sales
@vunguyen2246
@vunguyen2246 3 ай бұрын
Omg
@teistensean7227
@teistensean7227 3 ай бұрын
Lol wtf
@divinefavour1289
@divinefavour1289 3 ай бұрын
are you for real?
@josephmargaryan
@josephmargaryan 2 ай бұрын
@@divinefavour1289 Yes. During my first interview, I talked with a guy with a computer science PhD. Then, my following interview was with a guy holding a PhD in biomedical engineering. Then I talked with the head of sales and some guy with a PhD in nanoscience. Then, my last interview was with my current manager, who holds a PhD in math from Oxford. No kidding. The head of sales is butchering me because I shifted the numbers from the Excel files. They were unstructured…
@PickleNoDie
@PickleNoDie 3 ай бұрын
had an interview for a software engineer job the other day, bluntly asked the guy about the really dodgy Glassdoor reviews.... told me the place is a hellscape and that 10 people have quit in the last 2 months... then asked me if I was ready to continue.... think we are done here lol ( though I really appreciated his honesty)
@rashim
@rashim 3 ай бұрын
Requirements of DSA, Devops, System Design is also an issue for other specializations. The whole Computer Science interview process is f*cked up
@viswesz
@viswesz 3 ай бұрын
thanks for addressing this issue, was struggling for years
@awesomeGuss
@awesomeGuss 3 ай бұрын
REAL man i wish i saw this before I got sold on the hype for real
@andiuptown1711
@andiuptown1711 3 ай бұрын
Just do swe
@mdreid
@mdreid 3 ай бұрын
Pronounce “Kubernetes” wrong? Rejection.
@amyrlexiion4688
@amyrlexiion4688 3 ай бұрын
When everyone thought future of AI is AGI, its actually API
@vyas1
@vyas1 3 ай бұрын
This is too funny 🤣
@enghimanshu
@enghimanshu 3 ай бұрын
so what should i study ...... currently doing ml and i know web dev and devops
@aymenoulmi7672
@aymenoulmi7672 3 ай бұрын
that's 3, 14 to go good luck am switching to data engineering before i know more of these
@PKperformanceEU
@PKperformanceEU 3 ай бұрын
75k😂😂 3x it and i may consider it
@hwang1607
@hwang1607 3 ай бұрын
Ai generated voice
@stratascratch
@stratascratch 3 ай бұрын
Yup. I'll be the first to admit. It makes it much easier to produce videos so I can focus more on content
@tablettablete186
@tablettablete186 3 ай бұрын
​@@stratascratchWhich program do you use? I am thinking about using Tortoise TTS
@Spectacurl
@Spectacurl 3 ай бұрын
I manage to land an amazing job that I actually like, but my last interview was a fucking topology problem… You know, the math field used in relativity. If I wasn’t a physicist that just happened to re read a topology book just because, I wouldn’t pass that interview. Why topology? Why not?
@jothamprince8765
@jothamprince8765 3 ай бұрын
🤣🤣 wtf ??
@vincentadultman6226
@vincentadultman6226 3 ай бұрын
Wtf is going on? Why did they even expect you to know that? Are you a PhD?
@Nnm26
@Nnm26 3 ай бұрын
even the fucking script is chatgpt generated, this shit is generated entirely by ai
@stratascratch
@stratascratch 3 ай бұрын
No the script is not AI generated. Just the voice. It makes it easier to produce so I can focus on the content.
@xyzabc123-o1l
@xyzabc123-o1l 3 ай бұрын
question, as this is what im currently doing-- im working a normal basic programming job to pay my bills while also working on solving my own problems using my own ML(and otherwise) solutions, in hopes of starting my own business, specifically to avoid all of the problems youve described
@goodboy4129
@goodboy4129 3 ай бұрын
i was asked to explain the working of a transformer model for an internship position in india they were expecting to pay me 250 dollars per month😭
@Bhavishya_est
@Bhavishya_est 3 ай бұрын
Purchasing Power Parity
@IarukaSkYouk
@IarukaSkYouk 3 ай бұрын
jesus fuck man. no wonder why you guys all study like mad. the competition and level of rejection is so high
@srikrishna2561
@srikrishna2561 3 ай бұрын
That's great for an Internship right ? (In India)
@aliensamv3997
@aliensamv3997 3 ай бұрын
@@srikrishna2561 true i did my internship for free
@goodboy4129
@goodboy4129 3 ай бұрын
@@srikrishna2561 Not really it required me to shift to another city The rent and transport alone will take up that much, security guards earn that much but don't need a college degree for it 😄
@dhillaz
@dhillaz 3 ай бұрын
Any unicorn with all these skills doesn't apply to employers, they apply to *investors*
@alexisdamnit9012
@alexisdamnit9012 3 ай бұрын
As a Sr ML Engineer and AI engineer I can confirm this interview process is exhausting to the core. Once you hit 30, you’re already over the grind and you just want to enjoy work life balance - so studying for weeks on end for a stupid interview is really exhausting
@martindbp
@martindbp 3 ай бұрын
Try it's with small kids as well...
@my_study_channel-po4bn
@my_study_channel-po4bn 3 ай бұрын
For fresher ml engineer they need SQL, ML ops, Pipelines, spark, pytorch, tensorflow, keras powerbi or tableau, Azure,gcp or aws or everything, Excel , statics and maths, NLP , python or R or both . Now LLM with some magic also needed
@awesomeGuss
@awesomeGuss 3 ай бұрын
you forgot to add ML theory too
@quickpert1382
@quickpert1382 3 ай бұрын
@@awesomeGuss or fking darknet. I still do not understand why they ask for tensorflow, it is a red flag. Also he forgot bash.
@e404
@e404 3 ай бұрын
ML is the new fullstack
@LuccaCedeño-p7m
@LuccaCedeño-p7m 3 ай бұрын
So true
@harshtripathi4113
@harshtripathi4113 3 ай бұрын
Not just full, but an overflown stack.
@YouHaveToLoveMe
@YouHaveToLoveMe 3 ай бұрын
It's like our universe, it's expanding like anything every single day
@quickpert1382
@quickpert1382 3 ай бұрын
The fullstack with PhD
@jimenezluis33
@jimenezluis33 3 ай бұрын
​@@quickpert1382 pHD to do any stupid non sense job in the company or fill spreadsheets or rearrange the disasters of someone else
@lex494
@lex494 3 ай бұрын
I was able to land a junior data science position this year, after working in business intelligence for 2 years before. I was promised building data pipelines, forecasting, python etc. just to end up doing power bi off sql queries for non tech execs and sales people who tbh don’t even bother using them
@EobardUchihaThawne
@EobardUchihaThawne 3 ай бұрын
u mean ur work is mostly about data prep?
@teistensean7227
@teistensean7227 3 ай бұрын
Probably reality thou, unless he work in the Silicon Valley but majority of company still use excel, power bi, tableau too i supposed
@parl8150
@parl8150 3 ай бұрын
I mean, the field was started by researchers and mathematicians. The knowledge built up is really huge here. No wonder half of the positions are for phd’s.
@elgatodelamuerte
@elgatodelamuerte 3 ай бұрын
You missed the point - the experience is irrelevant if it's not going to be put to use. What's the point in hiring a ph.D only for him to use Excel and PowerBI
@parl8150
@parl8150 3 ай бұрын
@@elgatodelamuerteI meant, that a historical kind of momentum is here. The bigtech hires Phds for actual complex work, and all of the other companies be like “they hire phds for this, we will also do that”
@fr5229
@fr5229 3 ай бұрын
@@elgatodelamuerte warehousing talent
@doofus8
@doofus8 2 ай бұрын
​@@elgatodelamuerteno phd is working with excel or powerBI .... what world are you living in... only undergrads get hired for those jobs... I can even once consider for masters guy but definitely not phd lol
@developerashish6849
@developerashish6849 3 ай бұрын
This is the most relatable video ever. I wish i would've have chosen another field. Interviewers expect everything man, i have been rejected so many times because i didnt know something which is no where related to the Job description. I recently gave an interview for NLP engineer and they asked something related to computer vision which i used to work on 3 years ago, i obviously have forgot many things so couldn't answer it, and they rejected me. They somehow thinks that a ML Engineer should somehow know everything related to AI + software engineering + DSA. Man i m tired of trying to crack this field.
@Xnozea
@Xnozea 3 ай бұрын
You guys are getting interviews? Best I can get is application rejection e-mail. Man I gave 4 years for application rejections.
@fr5229
@fr5229 3 ай бұрын
You learned the entire modern stack + ML in 4 years?
@Xnozea
@Xnozea 3 ай бұрын
​@@fr5229 No. Not the entire SWE stack. I am an economics graduate(have econ masters too). I came from data analysis route and started with R. Because of my background ML and Statistical Modelling was kinda easy. Then I started to learn SQL and Python. And then I learned data engineering and data science tools for GCP(such as BigQuery, VertexAI, Dataproc, Cloud Run for Streamlit and Shiny applications yada yada). I also used all of these as either learning projects or you know something a little bit more advanced projects. Solved many questions on StrataScratch. I did lots of web scraping and EDA and ML stuff in my free time because they were fun really. Learning process lasted like a little more than 3 years. I was also busy with my econ masters classes and thesis. Last year I wasted my time with a shitty Business Analytics internship(for 8 months) where you only needed Excel. I did my work with pandas there I only used Excel for collobration. Even that internship wasnt enough I guess.
@JamesGelok
@JamesGelok 3 ай бұрын
@@fr5229 don't think Xnozea said that
@Xnozea
@Xnozea 3 ай бұрын
@@fr5229 Oh man I gave you a lengthy answer and KZbin removed it lol. Long story short. I learned modern ML stack(r, python,sql, relevant GCP services from data engineering to application deployment) minus traditional SWE stack (since I am an Econ graduate. I also have econ masters.) within 3 years(during my masters). Last year I wasted my time with a bad Business Analytics internship where Excel was sufficient to do the work.Thats why I said 4.
@Xnozea
@Xnozea 3 ай бұрын
@@fr5229 bro I tried to answer your question 2 times. KZbin keeps removing them.The answer was 3. Then I found a bad internship.
@themaskedpsalmistasmr
@themaskedpsalmistasmr 3 ай бұрын
Crazy