The Value of Synthetic Data
2:48
Жыл бұрын
K-Means Clustering using Python
11:21
Logistic Regression using Python
18:38
How to write a data science blog
2:19
Python Sets and Set Theory
10:30
6 жыл бұрын
About me + Getting a Data Science Job
30:25
Install Git on Windows
5:16
7 жыл бұрын
Install R and RStudio on Mac
4:50
7 жыл бұрын
Пікірлер
@pedroelias2097
@pedroelias2097 4 күн бұрын
This is the best Decision Tree video i've found on the internet. I am at med school and learning about ML, this helped me so much, thx !!!
@chadplaysfifa2363
@chadplaysfifa2363 14 күн бұрын
Why no comments? Your video helped me a lot. Thank you.
@MichaelGalarnyk
@MichaelGalarnyk 14 күн бұрын
@@chadplaysfifa2363 Im glad you liked it and appreciate your comment! Ill be evenually making a couple more videos this year (random forests and something finance related)
@srujanachowdary8301
@srujanachowdary8301 2 ай бұрын
in which website can we execute it
@arjannijk7647
@arjannijk7647 2 ай бұрын
Dear Michael, thank you for this great explanation! As a newb, it took me some time to realise how to actually get the figure shown (figure.show()). I have another question: how do I get the figure to show the 'names' of my data points? I want to take a closer look at outlier cases, so it would be great if the figure would show their case number. Hope you can help!
@skyyystudy9853
@skyyystudy9853 6 ай бұрын
Thankssss
@aniket33591
@aniket33591 8 ай бұрын
thank you, very helpful
@Worldwidenigespam
@Worldwidenigespam 8 ай бұрын
Top man. Great video.
@hope5614
@hope5614 8 ай бұрын
thanks
@saheedadio
@saheedadio 8 ай бұрын
Thank you so much for your video. Would it be possible to adapt this for a 3D-histogram? Thank you so much for your help
@Slim-bob
@Slim-bob 10 ай бұрын
Thank you! 🙂
@MichaelGalarnyk
@MichaelGalarnyk 10 ай бұрын
You're welcome! I will make another video soon enough :)
@italobaciliere2081
@italobaciliere2081 10 ай бұрын
Thanks!!
@MichaelGalarnyk
@MichaelGalarnyk 10 ай бұрын
Happy to help
@chrisw1462
@chrisw1462 10 ай бұрын
Wow.. light green brackets on a white background... not very well thought out.
@MichaelGalarnyk
@MichaelGalarnyk 10 ай бұрын
what a delightful comment.
@StalkedHuman
@StalkedHuman 11 ай бұрын
Is it correct that anaconda needs 300 GB space? AWS free tier is 30 GB . The AWS anaconda subscription wants a new instance to have large size EC2.. but I set it (the launch new 🆕 instance from the AWS subscription page) to the free tier and it seems to have anaconda installed. I tried installing anaconda manually on a regular default EC2 but it says it failed to finish because of no space 🌌
@tomrhee1
@tomrhee1 11 ай бұрын
I wonder if you could go over the Rolling Linear Regression one more time. Thank you, Mike.
@forcse113
@forcse113 11 ай бұрын
The Best One
@MichaelGalarnyk
@MichaelGalarnyk 11 ай бұрын
thanks!
@PRAVEENKUMAR.Vbtech_cse2020
@PRAVEENKUMAR.Vbtech_cse2020 Жыл бұрын
ill get this error (base) sathish@SATHISHs-Air ~ % snotebook zsh: no matches found: local[2]
@kelvinmoran6455
@kelvinmoran6455 Жыл бұрын
😋 Promo*SM
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
If you want to learn about image augmentation for deep learning (synthetic image data for deep learning), check out A Data Odyssey's video: kzbin.info/www/bejne/Z3nPn6iZhqxmfKcsi=uwBKIb3dTzuAzhnd
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
This is a free preview video from the Synthetic Data: Advanced Concepts and Applications Course: www.linkedin.com/learning/synthetic-data-advanced-concepts-and-applications/unlock-the-power-of-synthetic-data 00:00: What is a robust dataset? 00:27: Real data lacks enough rare classes and edge cases (healthcare and PII) 00:47: Identify edge cases and rare classes in real data 01:02: Check that real data is representative of real-world scenarios with minimal domain gaps 01:15: Make sure your model validation process has enough real data edge cases in the test set 01:28: Don't let your machine learning model learn that all synthetic data is an edge case 1:40: Synthetic data can be iteratively improved over time to account for more edge cases
@thedivyasingh
@thedivyasingh Жыл бұрын
Thank you for such simple information.
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
This is a free preview video from the Synthetic Data: Advanced Concepts and Applications Course: www.linkedin.com/learning/synthetic-data-advanced-concepts-and-applications/unlock-the-power-of-synthetic-data 00:00: Why do you need synthetic data 00:26: Privacy issues with real data (healthcare and PII) 00:46: Safety issues with real data (self-driving cars and drones) 01:20: Real data collection and labeling are not scalable (annotating medical images) 02:04: The real data you have is not ideal (not enough data to properly train a model)
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
Code: github.com/mGalarnyk/Python_Tutorials/blob/master/Sklearn/KMeans/KMeans.ipynb 00:00: What is K-Means Clustering (KMeans.ipynb) 01:05: Interactive K-Means Visualization (www.naftaliharris.com/blog/visualizing-k-means-clustering/) 03:13: Import Libraries, Visualize Data, and Standardize Data (KMeans.ipynb) 03:58: Clustering Data using K-Means (KMeans.ipynb) 04:35: Visually Evaluate the Clusters (KMeans.ipynb) 06:05: Compute the Silhouette Score for your Clusters (KMeans.ipynb) 06:53: K-Means Potential Assumptions/Issues (KMeans.ipynb)
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
Code: github.com/mGalarnyk/Python_Tutorials/blob/master/Sklearn/HierarchicalClustering/HierarchicalClustering.ipynb 00:00: Introduction (HierarchicalClustering.ipynb) 00:14: Hierarchical Clustering Animation (HierarchicalClustering.ipynb) 00:45: Linkage Criterion Determines which Distance to Use Between Sets of Observations (HierarchicalClustering.ipynb) 01:31: How Hierarchical Clustering Works (HierarchicalClustering.ipynb) 01:50: Comparing K-Means and Hierarchical Clustering Algorithms (HierarchicalClustering.ipynb) 02:33: K-Means Starting Position of Cluster Centers Problem (HierarchicalClustering.ipynb) 02:46: Hierarchical Clustering in Python
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
After fitting a random forest on data, you can view all the individual decision trees. This visualization was made for the blog post Visualizing Decision Trees with Python (Scikit-learn, Graphviz, Matplotlib): medium.com/p/1c50b4aa68dc
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
Code: github.com/mGalarnyk/Python_Tutorials/tree/master/Sklearn/Logistic_Regression Logistic Regression Visualization: mfviz.com/binary-predictions/ 00:00: Introduction (LogisticRegression.ipynb) 01:05: Visualizing Relationship between worst_concave_points and diagnosis (LogisticRegression.ipynb) 03:13: What is Logistic Regression (LogisticRegression.ipynb) 03:47: Showing Predictions for Logistic Regression (LogisticRegression.ipynb) 05:46: Advantages and Disadvantages of Logistic Regression (LogisticRegression.ipynb) 07:10: Evaluating Logistic Regression with accuracy and a confusion matrix (LogisticRegression.ipynb) 08:35: True Negatives, False Positives, False Negatives, and True Positives (LogisticRegression.ipynb) 09:27: Sensitivity, Specificity, Type 1 Error, and Type 2 Error (LogisticRegression.ipynb) 11:37: Trading True Positive and True Negatives through an ROC Curve (LogisticRegression.ipynb) 12:49: Interactive Logistic Regression Visualization (mfviz.com/binary-predictions/) 14:51: Logistic Regression on the Titanic Dataset
@blurabbit6476
@blurabbit6476 Жыл бұрын
Unhearable, the audio is really bad
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
There is a new version of this video here: kzbin.info/www/bejne/q4DadJyQgM9lqMk That was my old mic.
@stlo0309
@stlo0309 Жыл бұрын
thanks!
@leonardopoveromo8611
@leonardopoveromo8611 Жыл бұрын
is this the same course is also available on linkedin learning?
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
Nope! This was for a university. some similar content but a lot is different.
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
Code here: github.com/mGalarnyk/Python_Tutorials/tree/master/Sklearn/CART Video based on this blog: medium.com/p/9663d683c952 09:31: Train Test Split (TrainTestSplit.ipynb) 18:17: Decision Tree Exercise with Titantic Data (ExerciseDecisionTree.ipynb) 18:52: Solution to Decision Tree Exercise with Titantic Data (ExerciseDecisionTreeSolution.ipynb) 19:18: Arrange Data into Features Matrix and Target Vector (ExerciseDecisionTreeSolution.ipynb) 21:02: Split Data into Training and Testing Sets (ExerciseDecisionTreeSolution.ipynb) 21:12: Fit a Decision Tree on the Titantic Dataset (ExerciseDecisionTreeSolution.ipynb) 21:56: Make Predictions on the Testing Set and Calculate the Accuracy (ExerciseDecisionTreeSolution.ipynb) 22:10: Compare the Testing Accuracy to the Null Accuracy (ExerciseDecisionTreeSolution.ipynb) 23:38: Confusion Matrix of Titanic Predictions (ExerciseDecisionTreeSolution.ipynb) 24:14: Feature Importance Metric from Decision Trees (ExerciseDecisionTreeSolution.ipynb) 24:52: Creating a Decision Tree Visualization using Matplotlib and Graphviz (ExerciseDecisionTreeSolution.ipynb)
@abdulsammadsaeed1199
@abdulsammadsaeed1199 Жыл бұрын
Thanks!
@glennmatlin1952
@glennmatlin1952 Жыл бұрын
@GereHadush
@GereHadush Жыл бұрын
Great step by step video and still works. Thank you Micheal!
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
I guess I did a good job years ago :) I will make more videos this year (on synthetic data) and quite a few next year
@alikarimi8994
@alikarimi8994 Жыл бұрын
Good materials. Needs to work on teaching and presentation skills. Don’t need to say “okay” after every sentence!
@kartik_exe_
@kartik_exe_ Жыл бұрын
This video seems helpful, but the sound is so feeble.
@Rigvedbhargav
@Rigvedbhargav Жыл бұрын
thank u so much sir.... was trying to do this and was pissed
@seansullivan6986
@seansullivan6986 Жыл бұрын
Thanks. this is very helpful.
@FxnkyFresko
@FxnkyFresko Жыл бұрын
Why I am getting this error in 4:45 ? Error in FUN(left) : invalid argument type
@luizrenato2714
@luizrenato2714 Жыл бұрын
The link no longer works :'( Can you provide the new one (if it exists), please?
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
I updated the link to builtin.com/machine-learning/pca-in-python. Thanks for asking!
@msteele919
@msteele919 Жыл бұрын
Thanks so much Michael
@Adhisya
@Adhisya Жыл бұрын
I have random forest, gradient boosting, and adaboost model prediction. Can I visualize it in one scatter plot? I have been hardly thinking to figure it out actually. If anyone knows how to visualize the model in one scatter plot like PCA, I really apreciate it.
@murielsamuels4370
@murielsamuels4370 Жыл бұрын
Thank you
@MichaelGalarnyk
@MichaelGalarnyk Жыл бұрын
Happy to help!
@吳育綺-j7y
@吳育綺-j7y 2 жыл бұрын
Thank you for providing so perfect video. I also meet the problem that my connection stays setting up for hours... and never complete. I tried to fresh page but it didn't work either. Is there any solution?
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
Which Ubuntu version are you using? did you adjust the script with your username?
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
Also if you figure out the problem, can you share the solution? i havent run into that problem before except when i messed up the script
@吳育綺-j7y
@吳育綺-j7y 2 жыл бұрын
​@@MichaelGalarnyk Thank you for your reply. I use ubuntu 20.4 and also changed my username in right way. It works when I terminated instances and retry all steps on the next day. I think it due to there are too many users using it at the same time.
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
​@@吳育綺-j7y Too many users is not what I expected. Thank you for getting back to me!
@SMARTAICOMMUNITY
@SMARTAICOMMUNITY 2 жыл бұрын
Thank you, Michael! greatly appreciate the step by step process! very helpful.
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
Happy you found this useful!
@sarahs2018
@sarahs2018 2 жыл бұрын
Great video, saw the post on LinkedIn
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
Thanks! I’m glad you found it valuable!
@abohashem5935
@abohashem5935 2 жыл бұрын
should i choose ubunto even if i am windows user?
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
Yes! JupyterHub officially does not support Windows and running something on AWS doesn't effect what you are using for a laptop/desktop. Any other questions?
@abohashem5935
@abohashem5935 2 жыл бұрын
@@MichaelGalarnyk yeah it keeps give me 404 error thanks in advance
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
@@abohashem5935 Did you check the blog section titled "404 page not found" www.kdnuggets.com/2023/01/setup-jupyterhub-tljh-aws-ec2.html?fbclid=IwAR1aswRoPmV4Q-3tviwAq4TedBxj_-HopnguGS3TXRBYjJeWEY8y2u9vuE8
@victor_harvard_data_science
@victor_harvard_data_science 2 жыл бұрын
Great video! Thanks! I have a problem, my connection stays setting up for hours... any idea why?
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
How big is your instance? Also, have you tried refreshing your browser?
@victor_harvard_data_science
@victor_harvard_data_science 2 жыл бұрын
@@MichaelGalarnyk t2.2xlarge, di I need more?
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
@@victor_harvard_data_science Sorry for the late reply. Let me know what landed up working for you so I can add it to the blog post comments section Did refreshing the browser help? You don't need larger instance. If you are an IAM user, do you have the appropriate permissions. If that isn't the problem? Did you replace <admin-user-name> with the admin user? I had a similar error one time when I forgot to do that one time.
@victor_harvard_data_science
@victor_harvard_data_science 2 жыл бұрын
@@MichaelGalarnyk Hey no worries, thanks for getting back to me. I did not have permission to request that instance. I don't know why, I email them and then they granted me permissions. But it took them like 4 days. I decided to use lambda labs and in 5 minutes I had an instance running with 8 teslas v100. Way easier, check them out if haven't. Thanks again.
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
If you are having trouble with this following this tutorial, I made an updated tutorial: kzbin.info/www/bejne/nmWoZY2sis2MkKc Accompanying blog: www.kdnuggets.com/2023/01/setup-jupyterhub-tljh-aws-ec2.html?fbclid=IwAR1aswRoPmV4Q-3tviwAq4TedBxj_-HopnguGS3TXRBYjJeWEY8y2u9vuE8
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
Accompanying Blog Post: www.kdnuggets.com/2023/01/setup-jupyterhub-tljh-aws-ec2.html?fbclid=IwAR1aswRoPmV4Q-3tviwAq4TedBxj_-HopnguGS3TXRBYjJeWEY8y2u9vuE8
@rashmirikhi9861
@rashmirikhi9861 2 жыл бұрын
This was really helpful. The process got hustle-free
@MichaelGalarnyk
@MichaelGalarnyk 2 жыл бұрын
Happy you found this useful!
@Zippo_1234
@Zippo_1234 2 жыл бұрын
Fantastic. This helped me out a lot