A Gentle Introduction to Pandas Data Analysis (on Kaggle)

  Рет қаралды 121,629

Rob Mulla

Rob Mulla

Күн бұрын

In this video you will learn the basics of how to use pandas in python for data science. Rob Mulla, kaggle grandmaster, will walk through the tutorial in a kaggle notebook. This pandas python tutorial for data science is great for anyone new to learning this technology. Pandas is a MUST for anyone getting into data science with python. Python is the #1 coding language for data science and has been growing over the years as an essential tool, with Pandas being the main data wrangling module. In this video we discuss the basics of how to use pandas including...
Timestamps:
00:00 Introduction
02:44 Series data structure
05:04 DataFrame data structure
10:01 Reading in data
12:17 Inspect The Data
15:25 Columns and Rows
19:00 Subsetting Data
25:45 Casting dtypes
29:00 Creating new column
31:19 Adding new Row
33:05 Plot Examples
36:45 Save out the output as a csv
Follow me on twitch for live coding streams: / medallionstallion_
Link to kaggle notebook used in the tutorial: www.kaggle.com/robikscube/the...
* KZbin: / @robmulla
* Twitch: / medallionstallion_
* Twitter: / medalliondata
* Kaggle: www.kaggle.com/robikscube
#DataScience #Pandas #Python #Coding

Пікірлер: 149
@soumyadrip
@soumyadrip 2 жыл бұрын
Timestamps: Introduction: 0:00 Pandas data structure: Series 2:44 Pandas data structure: Series 5:04 Reading in data: 10:01 Inspect The Data: 12:17 Columns and Rows: 15:25 Subsetting Data: 19:00 Casting dtypes: 25:45 Creating new column: 29:00 Adding new Row: 31:19 Plot Examples: 33:05 Save our output: 36:49
@robmulla
@robmulla 2 жыл бұрын
You are awesome @somuSan! Thanks so much for tagging these.
@soumyadrip
@soumyadrip 2 жыл бұрын
@@robmulla Nah no problem 😅, you are the awesome here, creating such a great end to end tutorial.
@Abdolahy
@Abdolahy 9 ай бұрын
Hi there Rob! I wanna thank you for this fantastic tutorial you made on EDA with Python Pandas library. That really was the most impressive EDA tutorial I've ever watched on YT.
@nasranruwaidi
@nasranruwaidi 11 ай бұрын
Amazing introduction for pandas. This video alone just already covered most of my reporting needs.Thank you
@chacehawkins4708
@chacehawkins4708 8 ай бұрын
First pandas or python tutorial I was able to watched more than 10 min of in one sitting. Great job. Looking forward to following you here and on twitch
@kapamagicman
@kapamagicman Ай бұрын
Really great! I love how you go through step by step and with the end to end examples. Thank you!
@Cpt_Diabetes
@Cpt_Diabetes 5 ай бұрын
absolutely loved this video, one of the best walkthroughs out right now
@jorgeimperial-sosa9097
@jorgeimperial-sosa9097 Жыл бұрын
Excellent intro. I had been toying with pandas but had not fully grasped the concepts of series and dataframes.
@chq012
@chq012 Жыл бұрын
Wow, great video! Very much enjoyed it. I will definitely watch your other videos to learn many more techniques. Thank you.
@user-vl4vb5zo1w
@user-vl4vb5zo1w 3 ай бұрын
I have been using pandas quite a while,learned a lot from your video. Your presentation was very good to follow through. Thanks for your hard work.
@byte_me_xd-hk5zt
@byte_me_xd-hk5zt 10 ай бұрын
your intro i immediately was like you're my hero!
@dennisalbarello2551
@dennisalbarello2551 7 ай бұрын
Great video! Interesting way to make a Pandas Introduction. Thank you a lot to share with us your knowlegde!
@zhiyang2553
@zhiyang2553 10 ай бұрын
excellent video! amazing introduction! thank you a lot !
@gabrielacolen7281
@gabrielacolen7281 9 ай бұрын
THank you so much for this video. The quality is amazing and you are such a good teacher.
@gabrielbiacchi6169
@gabrielbiacchi6169 9 ай бұрын
You explain these in a very clear way,! Thank u sir 🙏
@kirtwedel9275
@kirtwedel9275 5 ай бұрын
Fantastic video! Thanks much for the lesson.
@bashar9200
@bashar9200 Жыл бұрын
Thank you!! I never get tired of your videos!!!
@robmulla
@robmulla Жыл бұрын
I appreciate you watching them!!
@JHornsby89
@JHornsby89 Жыл бұрын
Excellent tutorial - Really clear with excellent explanations and concise steps. I've found it useful as a codealonng with a different set of data. Just to get to grasps with Pandas. Thank you!
@robmulla
@robmulla Жыл бұрын
Awesome! So glad you enjoyed and learned from it.
@davidcrdz1966
@davidcrdz1966 11 ай бұрын
First I want to thank you for sharing your skill and time creating these videos. I thought I was doing pretty well until we got to the Columns and rows section...haha I type the same "set_index" as you did but I kept getting an error. I found out I could run "reset_index", then I did "set_index" again and it worked.
@Dongnanjie
@Dongnanjie 4 ай бұрын
Love it. Thank you Rob!
@ibowman_UCLA_BRAIN
@ibowman_UCLA_BRAIN Жыл бұрын
This video deserves to be the most liked Gentle Introduction to Pandas Data Analysis on KZbin.
@robmulla
@robmulla Жыл бұрын
Really apprecaite that feedback. Share it with all your friends and maybe we can make it the most liked!
@travistexian
@travistexian 27 күн бұрын
Awesome information, plainly stated
@BarryPennock
@BarryPennock 4 ай бұрын
So cool! Pandas for Python makes things so much easier.
@rrio7171
@rrio7171 8 ай бұрын
best I found on YT! underrated video, there should be more likes
@laurenceturpin1409
@laurenceturpin1409 11 ай бұрын
Thank you for doing the video I learnt a lot about pandas please keep making videos like this.
@AlbertoChillon
@AlbertoChillon 11 ай бұрын
Thank you very much, an amazing tutorial!!
@betterliving747
@betterliving747 5 ай бұрын
One of the best explanations thank you
@ItsMePeterB
@ItsMePeterB Ай бұрын
Thank you for the tutorial!
@wesleyweel8007
@wesleyweel8007 Жыл бұрын
Excellent video, after 2 decades of experience in the software industry I cannot stress more how important it is to have a strong foundation in the basics inorder to attempt something more advanced. Wonderful job
@robmulla
@robmulla Жыл бұрын
Really appreciate your endorsement! I completely agree about knowing the basics first.
@utica2burn
@utica2burn Жыл бұрын
Interesting comment! I have been using python and pandas for years and thought to myself today I should go back and check the basics - so here I am! I find myself still googling every time I have to do simple stuff like change column order or make a copy of a data frame.
@ilyesb2271
@ilyesb2271 2 жыл бұрын
Thank's for everything you share with us
@robmulla
@robmulla 2 жыл бұрын
My pleasure! I apprecaite the positive feedback!
@jakubkopczynski779
@jakubkopczynski779 Жыл бұрын
Thanks for the video, I've learnt a lot! I tried casting different dtypes on the imported Excel spreadsheet columns and found out an interesting issue, when I summed up the numbers in the same column with float64 and int64 dtypes, I got different results (no decimal points involved). Honestly I have no idea why it's working like that! I guess it has to be something with Excel formatting...
@uchegodswill-iv4cd
@uchegodswill-iv4cd 9 ай бұрын
yes, thanks a lot. l learnt so much. its been interesting and i am going to watch all you videos.
@ke30_
@ke30_ 2 жыл бұрын
Your channel is a goldmine ty!
@robmulla
@robmulla 2 жыл бұрын
Thanks!
@andresfrancojunor
@andresfrancojunor Жыл бұрын
That was really good ! Thank you !
@robmulla
@robmulla Жыл бұрын
Glad you found it helpful. Please share it with anyone else you think might also learn from it!
@turboblitz4587
@turboblitz4587 Жыл бұрын
Hey I think this is a great tutorial for actual pandas beginners. I have been using pandas for quite some time now, and still learned something. However, an advanced Video would be super great, because I see you doing all sorts of super advanced stuff in your coding videos that blow my mind and that I don't understand! Cheers
@robmulla
@robmulla Жыл бұрын
Thanks for watching. I’m glad you think it’s a good beginners! I need to think through what I could include for an intermediate/advanced tutorial. My pandas noob video has some good examples/tips if you haven’t already checked that out.
@turboblitz4587
@turboblitz4587 Жыл бұрын
@@robmulla Hey thanks :) No I think I didnt watch it gonna check it out
@turboblitz4587
@turboblitz4587 Жыл бұрын
@@robmulla I checked out the video and yes, I have learned a lot in this one! Also in the timeseries forecasting video. Now I feel like a noob haha
@anirbanc88
@anirbanc88 Жыл бұрын
this is such a great tutorial, thank you so much!
@robmulla
@robmulla Жыл бұрын
Glad it was helpful! Please share with others who you think might also learn from it.
@brandoncyoung
@brandoncyoung 8 ай бұрын
just getting into DS and kaggleis great the notebook is so easy to use access to datasets is great. thanks for sharing!
@robmulla
@robmulla 8 ай бұрын
Glad it was helpful!
@mohan250s
@mohan250s Жыл бұрын
watching your videos one by one bro, awesome work as usual
@robmulla
@robmulla Жыл бұрын
Thanks a ton, so glad you are learning from them. Make sure to comment and share so the videos get picked up by the youtube algorithm.
@moniquebrasilbaptista1989
@moniquebrasilbaptista1989 Жыл бұрын
Amazing. Thank you!
@PaquiCamus
@PaquiCamus Жыл бұрын
I watched the whole video and I am so amazed at how easy is to do data analysis nowadays. I am the generation that used: Lotus to Excel, due to their limitation we needed to combine Statistical Packages(R, SAS), Graphing(Grapher), and 3D surface mapping (Surffer) plus Visual Basic. Now I have decided to learn Python plus others to go back to some old data for recalibration that was impossible deal with due to memory overloading. Panda is very powerful. Thank you very much for sharing your knowledge.
@robmulla
@robmulla Жыл бұрын
Glad the video showed you something new. Pandas is a great skill to master and can be really powerful for doing things that aren’t possible to do in excel with large datasets.
@PaquiCamus
@PaquiCamus Жыл бұрын
@@robmulla Yes, you are right. I am learning it and struggling with Jupyter Notebook and Kagle. I like both. I still have a long way to go!
@CarolinaMunoz-vy3ni
@CarolinaMunoz-vy3ni Жыл бұрын
Although I watched this video a long time ago, I wanted to go over a basic explanation. Thanks for sharing your knowledge.
@robmulla
@robmulla Жыл бұрын
Glad it was helpful!
@otenyop
@otenyop 7 ай бұрын
Wow this was a great video!
@RoadTo10KsubsWithoutAnyVideos
@RoadTo10KsubsWithoutAnyVideos Жыл бұрын
Very well explained. easily understood . Didn"t have any issues following along. Good job, brother 😀😀
@robmulla
@robmulla Жыл бұрын
Glad it helped! I apprecaite the feedback.
@100themagician
@100themagician 11 ай бұрын
Amazing video, thank you Rob
@robmulla
@robmulla 11 ай бұрын
Thanks for watching. What with a friend!
@davdeveloper
@davdeveloper 11 ай бұрын
This video was so useful. Thank you so much!
@robmulla
@robmulla 11 ай бұрын
You're so welcome! Thanks for watching.
@user-mn4me8wv5d
@user-mn4me8wv5d 5 ай бұрын
Thanks! So well explained.
@robmulla
@robmulla 5 ай бұрын
Glad you enjoyed it!
@gospelmoto2833
@gospelmoto2833 9 ай бұрын
Got a new sub here. Thanks for your video.
@ArenitaHernandez
@ArenitaHernandez 10 ай бұрын
Amazing! Thanks!
@swannschilling474
@swannschilling474 Ай бұрын
Great content!!
@pashkinzon
@pashkinzon 9 ай бұрын
Wonderful video - explanation is nice and clean, very intuitive narrative, thank you!
@metalflames777
@metalflames777 3 ай бұрын
21:38 Wow!! I never knew you could use underscores in place of commas for the larger numbers!
@aditipandey1769
@aditipandey1769 9 ай бұрын
Thank you Sir
@wwpharmacist
@wwpharmacist Жыл бұрын
For sure I enjoy this tutorial
@robmulla
@robmulla Жыл бұрын
Glad you did!
@AdobadoFantastico
@AdobadoFantastico Жыл бұрын
You're very good at tutorializing.
@robmulla
@robmulla Жыл бұрын
🙏
@stifferdoroskevich1809
@stifferdoroskevich1809 Жыл бұрын
Amazing video!!!
@robmulla
@robmulla Жыл бұрын
Glad you liked it!!
@amitavroydev
@amitavroydev 6 ай бұрын
Great video
@n.zisanyalvac6538
@n.zisanyalvac6538 Жыл бұрын
thank you! greetings from Turkey :)
@robmulla
@robmulla Жыл бұрын
My pleasure. Glad to know there is someone watching from Turkey!
@shahbazsaeed7090
@shahbazsaeed7090 7 ай бұрын
please start a series of ML from scratch like linear regression, logistic regression, decision tree, random forest, k means clustering with python please sir please ..... i m really understanding your teaching style as you teach through hands on
@antoines8843
@antoines8843 Жыл бұрын
What is the shortcut you use @27:16 ??? To copy paste the code while replacing the variable inside?
@heitorrapela
@heitorrapela 2 жыл бұрын
Good content! 😄
@robmulla
@robmulla 2 жыл бұрын
Thanks so much Heitor!
@pkprasads
@pkprasads 10 ай бұрын
Thank you so much.
@robmulla
@robmulla 10 ай бұрын
Always welcome!
@BrutalNewby
@BrutalNewby 4 ай бұрын
Have a like good man. Gonna watch all of your videos from 0 ;)
@sandraoriji8351
@sandraoriji8351 2 жыл бұрын
Awesome 👍
@robmulla
@robmulla 2 жыл бұрын
Thanks Sandra! Glad you liked it.
@andrewkurian726
@andrewkurian726 10 ай бұрын
thank you
@madiva45
@madiva45 6 ай бұрын
Hello from dom rep next to aiti, how i change the back ground color of kaggle note book .
@thechristan019
@thechristan019 6 ай бұрын
Hi folks Does anyone know what keyboard he is using? Thanks in advance
@stylesg7818
@stylesg7818 Жыл бұрын
Thank you
@robmulla
@robmulla Жыл бұрын
You're welcome
@DiegoSilva-dv9uf
@DiegoSilva-dv9uf 10 ай бұрын
Valeu!
@NAC79
@NAC79 2 жыл бұрын
Going on 1 full week of trying to change the font between Jupyter now Kaggle. Idk why I'm obsessed with it
@robmulla
@robmulla 2 жыл бұрын
Did you figure it out? Have you watched my tutorial about jupyter/python notebooks?
@NAC79
@NAC79 2 жыл бұрын
@@robmulla I have not. Adding it to the que now. Thanks!
@boubi9329
@boubi9329 2 жыл бұрын
Hello, I saw one of your post in the youtube forum saying that your Timestamps didn’t worked out properly, I see now that it works. Did you do anything for it to work, any report sent to youtube ? Did the Timestamp worked after you hit the 1000 subscribers or the problem was there before ?
@robmulla
@robmulla 2 жыл бұрын
Hey! Yes. that was really frustrating. I didn't do anything special. It took a few months but then started working. I hope you get it figured out too.
@nikunjgorani8964
@nikunjgorani8964 10 ай бұрын
i cant convert the dtype of likeCount into int64 Can some1 hlp me
@wiscgaloot
@wiscgaloot Жыл бұрын
For the last few weeks I've been struggling to learn how to do with Python what I do in 10 minutes with Excel. And a few months ago I spent half a day figuring out most of the steps to do the same analysis in Matlab. Dataframes are ridiculously difficult to deal with I'm learning it because several colleagues said that if I'm looking for a new systems engineering job, I should know Python. Numpy is easy. Pandas dataframes are obscure and clunky.
@robmulla
@robmulla Жыл бұрын
Interesting to hear you actually find numpy easier to work with than dataframes. I find the opposite because dataframes have an index and column names. It might just take some time and then it will click for you, or maybe you will find something else that works better for your use case. Hope the video was helpful and good luck on your journey.
@ghrangelr
@ghrangelr 2 жыл бұрын
Hello, What key do you use to open the menus?
@robmulla
@robmulla 2 жыл бұрын
Which menu is that? You might want to check out my video on jupyter notebooks when I talk about the keyboard shortcuts I often use.
@filmssharecenter3293
@filmssharecenter3293 Ай бұрын
great
@terenceochuo701
@terenceochuo701 8 ай бұрын
Getting errors while reading csv file
@jonesPossibly
@jonesPossibly Жыл бұрын
Hi - why is the series of 'thing' in mydf dataframe an object dtype, but the 'count' is an int64 dtype? Why is 'thing' not a 'string' dtype?
@robmulla
@robmulla Жыл бұрын
I'm not sure, if "thing" is a string it will show as "object" if you count on it it will produce a number integer.. but I'm not sure exactly what you are asking.
@jonesPossibly
@jonesPossibly Жыл бұрын
@@robmulla hiya - thanks for the reply - at kzbin.info/www/bejne/lXbFYaiqfreXodk you can see that the series containing stings has a dtype of object, whereas the series containing the integers has a dtype of int64. I was just wondering why its object and not string?
@marlowisws
@marlowisws Жыл бұрын
Would be helpful to mention Tab kicks off the auto-complete for us n00bs
@robmulla
@robmulla Жыл бұрын
Good point. Also shift-tab for docstrings!
@speedyg2295
@speedyg2295 Жыл бұрын
What is I have an Excel or csv that has 999 rows but I want to delete 101 thru 999 rows? or Just read the first 100 rows only. Then save that as a new DataFrame as only those 100 rows.
@robmulla
@robmulla Жыл бұрын
.head(100) will give you just the first 100 rows.
@speedyg2295
@speedyg2295 Жыл бұрын
@@robmulla Thanks for the info
@caiyu538
@caiyu538 Жыл бұрын
thumb up. Thumb up
@robmulla
@robmulla Жыл бұрын
🙏
@CarolinaMunoz-vy3ni
@CarolinaMunoz-vy3ni Жыл бұрын
Hello Rob, Can you help me with this error, please? I can't figure out what i did wrong. Thanks a lot :) df['likeCount'] = pd.to_numeric(df['likeCount'].astype('str')) TypeError: 'Series' object is not callable
@robmulla
@robmulla Жыл бұрын
Strange. Don’t know why it isn’t working. Do you need to set the type to string? Try just dunking to numeric on the column without astype(‘str’)
@CarolinaMunoz-vy3ni
@CarolinaMunoz-vy3ni Жыл бұрын
@@robmulla Sorry again, I have a question, one more time. What is the best way to work with numbers with decimals, for example, I have a field that has decimal numbers, but I want to convert it to an int.
@annikaw5068
@annikaw5068 Жыл бұрын
yass
@robmulla
@robmulla Жыл бұрын
😉
@annikaw5068
@annikaw5068 Жыл бұрын
@@robmulla You just gained a new subscriber✅😊
@txreal2
@txreal2 Жыл бұрын
Can I use Google Colab to follow along also?
@robmulla
@robmulla Жыл бұрын
Yes! It's actually really easy. Just click the three dots in the top right corner of the kaggle notebook and then click "download code" you then can open it in google colab. You would also need to download and link the data. Why do you prefer colab over a kaggle notebook? They are very similar.
@obayram4615
@obayram4615 Жыл бұрын
Very good 🙂👋👋👋👋👋👋😋
@robmulla
@robmulla Жыл бұрын
Thank you very much
@ericbroun4657
@ericbroun4657 Жыл бұрын
@vaporr5929
@vaporr5929 Жыл бұрын
At minute 12:40, I believe you meant first "5 rows" and *not* first "5 columns"
@robmulla
@robmulla Жыл бұрын
Ah! Good catch. Maybe I did it on purpose to make sure you were paying attention 😏
@vaporr5929
@vaporr5929 Жыл бұрын
@@robmulla I truly appreciate the learning content you provide to the masses on your channel. Looking forward to more tutorials on Pandas, liked and subscribed!
@nakul469
@nakul469 3 ай бұрын
29:20 - it is giving me value error
@NAC79
@NAC79 2 жыл бұрын
How many years have you been doing Data Science?
@robmulla
@robmulla 2 жыл бұрын
Thanks for asking. I've been working with data for over 10 years but doing data science specifically for about 6.
@felixakwerh5189
@felixakwerh5189 Жыл бұрын
do you have a discord channel??
@robmulla
@robmulla Жыл бұрын
Yes I do! Join! discord.gg/KnsDbstv
@average-jojo-enjoyer
@average-jojo-enjoyer Жыл бұрын
My time stamps still not working -_-
@robmulla
@robmulla Жыл бұрын
Oh man. I’m sorry. That was really frustrating when it happened to me.
@punamjadhav7801
@punamjadhav7801 10 ай бұрын
Dataframe 1 / Table 1 MaterialID |Unit Selling Price |Unit Cost A | 100 | 80 B | 200 | 140 C | 150 | 100 D | 250 | 230 E | 225 | 215 Dataframe 2 / Table 2 Month | Quantity Sold | Material ID Jan | 10 | A Feb | 5 | E Mar | 25 | C Jan | 5 | D Feb | 15 | B Mar | 2 | A Which month highest total sale amount achieved Which month highest profit amount observed Change Quantity Sold from 5 to 7 programatically and find out revised Which month highest total sale amount achieved Which month highest profit amount observed
@abhishekrai1060
@abhishekrai1060 Жыл бұрын
Don't use twitch. Will subscribe you here though
@robmulla
@robmulla Жыл бұрын
Thanks!
@samarumugam4833
@samarumugam4833 8 ай бұрын
Hi Rob why are addressing Hey u tube ,seems to be quite odd or some sort , mind you U tube is not watching yours Videos we guys are watching our Likes and Views makes your day worth while , So Gentle request be direct to source please address us by " Hi guys " or something .. 🙏🙏🤞Any way Great Videos by You bro. Keep it Up. God Bless You...
@CarolinaMunoz-vy3ni
@CarolinaMunoz-vy3ni 2 жыл бұрын
Hello Rob, i followed your tutorial and i had an error when you created a new columns. Can you help me with this error. Thank you very much for the great job. /opt/conda/lib/python3.7/site-packages/ipykernel_launcher.py:1: SettingWithCopyWarning: A value is trying to be set on a copy of a slice from a DataFrame. Try using .loc[row_indexer,col_indexer] = value instead
@robmulla
@robmulla 2 жыл бұрын
Hey Carolina. This message usually comes up when you are trying to edit a subset of a previously defined dataframe. The best fix is to add `.copy()` after subsetting and renaming a dataframe. So before you might have: df_small = df.query('thing > 10') Change to: df_small = df.query('thing > 10').copy()
@nikunjgorani8964
@nikunjgorani8964 10 ай бұрын
@@robmulla Hey Rob I am getting this error during converting the likecount into integer IntCastingNaNError Traceback (most recent call last) Cell In[126], line 2 1 df['viewCount'].astype('int') ----> 2 df['likeCount'] = df['likeCount'].astype('int').copy()
Speed Up Your Pandas Dataframes
11:15
Rob Mulla
Рет қаралды 66 М.
Learning Pandas for Data Analysis? Start Here.
22:50
Rob Mulla
Рет қаралды 71 М.
GADGETS VS HACKS || Random Useful Tools For your child #hacks #gadgets
00:35
McDonald’s MCNUGGET PURSE?! #shorts
00:11
Lauren Godwin
Рет қаралды 35 МЛН
ОДИН ДОМА #shorts
00:34
Паша Осадчий
Рет қаралды 6 МЛН
Exploratory Data Analysis with Pandas Python
40:22
Rob Mulla
Рет қаралды 410 М.
Audio Data Processing in Python
19:52
Rob Mulla
Рет қаралды 140 М.
7 Python Data Visualization Libraries in 15 minutes
15:03
Rob Mulla
Рет қаралды 65 М.
Make Your Pandas Code Lightning Fast
10:38
Rob Mulla
Рет қаралды 174 М.
25 Nooby Pandas Coding Mistakes You Should NEVER make.
11:30
Rob Mulla
Рет қаралды 255 М.
The Ultimate Coding Setup for Data Science
17:35
Rob Mulla
Рет қаралды 101 М.
Exploratory Data Analysis in Pandas | Python Pandas Tutorials
32:13
Alex The Analyst
Рет қаралды 102 М.
GADGETS VS HACKS || Random Useful Tools For your child #hacks #gadgets
00:35