As a junior data analyst I felt hurt when you "assumed you already did data collection" 😂 it's basically the most daunting part of the job. Deploying models is fun! Building dockers that don't work is not!
@ghettoghost2657 5 months ago
This is a great page
@flonga2302 1 year ago
Thank you for the great R content, James
@james-h-wade 1 year ago
I’m glad you like it!
@kyleGrealis 8 months ago
Thanks, James, this was an excellent tutorial.
@00moumou00 7 months ago
Thank you very much. This is fantastic.
@TURALOWEN 1 year ago
Suggestions: (1) work on a dataset that one can monitor with real data, (2) deploy the model API to AWS and/or Posit Connect, (3) showcase drift when it happens, and show ways to handle it. I would learn a lot from these items. Thanks for the video!
@james-h-wade 1 year ago
Thank you for the suggestions. Those are great ideas for future videos.
@hanselpalencia2822 1 year ago
@@james-h-wade Seconding the Posit Connect point; it would be really great to get a view into how it's done there. I've had issues deploying, as there are hardly any good walkthroughs!
@dreznik 1 year ago
AUC is an unreliable metric if classes are imbalanced; predicted probabilities need to be adjusted to "undo" the stratified sampling. You should keep a randomly sampled hold-out set to verify the performance.
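A minimal sketch of the probability adjustment mentioned above, assuming the model was trained on data resampled to a different class balance than the real population. The function name and the prevalence values are hypothetical, and this is the standard prior-shift correction, not code from the video:

```python
def correct_for_resampling(p_sampled, train_prevalence, true_prevalence):
    """Map a probability from a model trained on resampled data
    back to the original (true) class balance."""
    # Re-weight the positive and negative evidence by how much each
    # class was over- or under-represented in the training sample.
    pos = p_sampled * true_prevalence / train_prevalence
    neg = (1.0 - p_sampled) * (1.0 - true_prevalence) / (1.0 - train_prevalence)
    return pos / (pos + neg)


# Assumed numbers: trained on a 50/50 balanced sample, real positive rate 5%.
raw_scores = (0.2, 0.5, 0.9)
adjusted = [correct_for_resampling(p, 0.5, 0.05) for p in raw_scores]
```

The mapping is monotone, so it changes calibration but not the ranking of predictions; a rank-based metric such as AUC comes out the same before and after. The randomly sampled hold-out set is where the adjusted probabilities can be checked against the real class balance.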
@aephidayatuloh 7 months ago
Great video, now I can deploy my model as an API. Can you make a video like this for deploying a plumber API to a Vercel app project? It would be helpful, since if I use Hugging Face the Space must be public and people can access my R code files.
@ARandomDataGuy 1 year ago
Great series! The model deployment part was a bit confusing: why HuggingFace, what is HuggingFace, other options, etc.
@james-h-wade 1 year ago
Thanks for sharing that. I'm thinking that should be a topic for a future video. There are many to choose from, and it's hard to understand the differences. My advice is to use the one that works. Posit Connect is the easiest to use in my experience, but it's a pro product.
@staticalmo 1 year ago
@@james-h-wade yes, easy, but with a price, even for amateurs. I don't know of anything easier than HF yet, even though the Docker part might scare people
@tarasst6887 1 year ago
🎉🎉🎉
@staticalmo 1 year ago
Which Quarto function did you use to create the chunk with numbers on the right that show the explanation?
@ntran04299 1 year ago
Hello James, I have a question. I see that you did EDA first, then split the data into train and test sets. Shouldn't I do EDA after the split to avoid data leakage?
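A minimal sketch of the order the question describes, assuming toy data: split first, then explore only the training rows so the test set stays unseen. The helper, column names, and data are made up for illustration and are not the video's R code:

```python
import random
import statistics


def train_test_split(rows, test_fraction=0.2, seed=42):
    """Shuffle a copy of the rows and split off a test set."""
    rng = random.Random(seed)
    shuffled = rows[:]  # copy so the caller's list is left untouched
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Made-up data: 100 rows with one feature "x" and a binary label "y".
rows = [{"x": float(i), "y": i % 2} for i in range(100)]
train, test = train_test_split(rows)

# Exploratory summaries are computed on the training rows only.
train_mean_x = statistics.mean(r["x"] for r in train)
train_positive_rate = statistics.mean(r["y"] for r in train)
```

Any decision that comes out of the exploration (feature choices, transformations, thresholds) is then based on the training rows alone, and the test rows are only touched for the final evaluation.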
@dreznik 1 year ago
The MLOps workflow should become a tool that hides the code and only exposes options.