14. Project - 9 | Data Analysis | IMDB Movie Dataset | Python Pandas Project

14. Project - 9 | Data Analysis | IMDB Movie Dataset | Python Pandas Project | Kaggle Dataset

Рет қаралды 62,453

Күн бұрын

Пікірлер: 74

@faisalshaikh4480 3 жыл бұрын

WOW, Man!! you have done such amazing work for the Data science community. May God bless you for the wonderful work that you are doing. I just completed this playlist and now jumping on to Data visualization. Looking forward for Machine learning playlist

@DataThinkers 3 жыл бұрын

Thank you so much for your comment. Glad you like this video.thanks

@TonderayiKanoz 5 ай бұрын

Visualizing the location of missing data at 7:55 has been a great one. Many analyst videos have never showed me this. Awesome!!

@Mars7822 3 жыл бұрын

Many people can jump into this field because of your videos... Keep it up

@DataThinkers 3 жыл бұрын

Thanks for your comment. Glad you like this video. Sure. Thanks

@jetspray3 2 жыл бұрын

This is the best of all the others I have seen. I just like how you show the column you are working on and then pull everything together to get the result you want and in that way I start understanding what you are doing. Thumbs up to you, I wish there were other videos on KZbin like that.

@DataThinkers 2 жыл бұрын

Glad you liked it. You can watch other videos from this play list. kzbin.info/aero/PL_1pt6K-CLoDMEbYy2PcZuITWEjqMfyoA

@muralikumaar9456 2 жыл бұрын

One of the best Pandas Project playlist you have made. Your channel is so much underrated. Create more machine learning playlist .Wishing you all the best

@DataThinkers 2 жыл бұрын

Thanks for your comment. because of subscribers like you, our channel is growing. Sure I will create. Keep watching. Thanks.

@surbhirautaray6872 9 ай бұрын

You are awesome. Doing amazing work.

@leolee7884 Жыл бұрын

Amazing job my friend! You are amazing! Thank you for a wonderful job!

@shakilahammed1887 Жыл бұрын

I correctly answered 20 question of 22. Thanks

@DataThinkers Жыл бұрын

Excellent!

@sketchytv1321 2 жыл бұрын

very informative and easy to understand.Subscribed

@DataThinkers 2 жыл бұрын

Thanks for your comment. Glad you liked it. Keep watching.

@l.kennethwells2138 2 жыл бұрын

Thank you so much for this. This tutorial just raised my confidence level.

@shubhammeshram8504 8 ай бұрын

Thank you. Great teaching.😀

@petroliumengineer 2 жыл бұрын

I Really like the video you make, and i learned a lot from it I would like to suggest an easier way to solve the latest 4 questions: question number 20 : bin_edges = [data['Rating'].min() ,data['Rating'].quantile(0.25) ,data['Rating'].quantile(0.75) ,data['Rating'].max() ] bin_names = ['Average', 'Good', 'Excellent'] data['Rating_cat_my_method'] = pd.cut(data['Rating'], bin_edges, labels=bin_names) Question 21,22 and 23 : data['Genre_list']=data['Genre'].apply(lambda x:x.split(',')) data_exploded=data.explode('Genre_list') data_exploded['Genre_list'].value_counts()

@DataThinkers 2 жыл бұрын

Thanks for your comment.Glad you like this video.keep watching.thanks

@abhishekpatil492 2 жыл бұрын

Excellent, great work .....

@DataThinkers 2 жыл бұрын

Thanks for your comment. Glad you like this video. Keep watching. Thanks

@riyaz8072 3 жыл бұрын

Please create more EDA videos please.. Your videos are so good.

@DataThinkers 3 жыл бұрын

Thanks for your comment. Glad you like this video. Sure i will. Thanks

@piyushpathak7311 3 жыл бұрын

Sir we want more projects like this it's awesome plz 🙏🙏 upload sir..

@DataThinkers 3 жыл бұрын

Thanks for comment dear. I'm working on it. Upload it shortly.Thanks

@piyushpathak7311 3 жыл бұрын

@@DataThinkers thanks sir upload it soon..

@petroliumengineer 2 жыл бұрын

Thank you very much.. i see that it is easier to use EXPLODE function to answer Genre questions instead of multiple for loops

@DataThinkers 2 жыл бұрын

Thanks for your comment.Glad you like this video.keep watching.thanks

@miguelandrade982 3 жыл бұрын

Thank you, Sir. I did really enjoyed this video. Keep up this great work My solution for the last question: from collections import Counter Counter(data.Genre.apply(lambda x: pd.Series(x.split(','))).stack().values)

@DataThinkers 3 жыл бұрын

Thanks for your comment. Glad you like this video. Thanks

@arfanariyanto297 Жыл бұрын

Thx u very much, its help a lot

@AustralianYoutuber Жыл бұрын

Hello Dear, Great way to teach!! I am getting error for below queries. 1. Create a new column profit in df_movie by subtracting the variables gross and budget. 2. Create a new categorical column sequelcat in df_movie which takes the value sequel if the movie is a sequel, and original otherwise. 3. Find the five most profitable original movies and print their movie title and profit to the console (sorted as highest profit first). 4. Find the five least profitable sequel movies and print these movie title and profit to the console (sorted as lowest profit first).

@DataThinkers Жыл бұрын

check my code : github.com/DataThinkers/Data-Analytics-Projects-Code

@prabhavsingh654 Ай бұрын

"let me" sir. Thank You...

@cloykorea9874 2 жыл бұрын

fabulous broooooooooooooo

@DataThinkers 2 жыл бұрын

Thanks for your comment. Glad you like this video. Keep watching. Thanks

@janakiyeluripati6368 3 жыл бұрын

Sir, can you make videos on regular expressions in pandas. Amazing tutorials

@DataThinkers 3 жыл бұрын

Thanks for your comment. Sure i will.

@kishanmaurya422 3 жыл бұрын

nice explanation sir , can make a video on web scraping using selenium and converting it in data frame and then excel

@DataThinkers 3 жыл бұрын

Thanks for your comment. Glad you like this video.sure i will try.thanks

@jalego800 Жыл бұрын

Hello sir, @09:44, it seems that we didn't drop those NAN rows...so the row of 'Grindhouse' is still included in our dataset, right?

@ranjithraghunathan1267 3 жыл бұрын

Amazing

@DataThinkers 3 жыл бұрын

Thanks for your comment. Glad you like this video.

@ranjithraghunathan1267 3 жыл бұрын

@@DataThinkers can you also do videos similar python projects for 1) consolidating multiple excel files in a folder. 2) using loops, nested loops with multiple examples etc

@DataThinkers 3 жыл бұрын

@@ranjithraghunathan1267 First of all thanks for your comment. sure I will do that

@fakerrain 3 жыл бұрын

I smashed that like button so very hard. This was amazing content and I learned so much from this video. I will be for sure working through all the playlist. Are there any books you recommend to read for this info? Thank you again for this video.

@DataThinkers 3 жыл бұрын

😄😄 Thanks for your comment. Glad you like this video. Actually I'm not following any book. You can follow my video tutorials. Thanks.

@mohseenmohammed46 2 жыл бұрын

excellent

@DataThinkers 2 жыл бұрын

Thanks for your comment. Glad you like this video. Thanks

@mohseenmohammed46 2 жыл бұрын

@@DataThinkers please make more videos on complex datasets

@DataThinkers 2 жыл бұрын

Sure 👍. Thanks

@riyaz8072 3 жыл бұрын

Please share the playlist of these EDA

@DataThinkers 3 жыл бұрын

Data Analysis Projects | Pandas Projects | Data Analysis With Python Pandas | Case Studies: kzbin.info/aero/PL_1pt6K-CLoDMEbYy2PcZuITWEjqMfyoA

@shailesh_jain_ Жыл бұрын

sir, what is code of visualization of sum of votes based on year in barplot or countplot

@hairavyadav6579 Ай бұрын

if you show how to replace null value using statistics its more better then dropping the null value.

@mohamad5005 2 жыл бұрын

Thank you but I think there is an easy way instead of the for loop, if there is , could you illustrate what it is?

@Zelalem-k1c Жыл бұрын

Wow, you crashed. Thak you. you helped me so much to work on my project.

@himanshushorts7143 Жыл бұрын

Question 22 Another way list1 = df[‘Genre’].str.split(‘,’) list2 = [] for i in list1: list2 += i list2 = list(set(list2)) #typecast Print(len(list2))

@syedabaduruunnisa3099 Жыл бұрын

If I want to take top 10 movies which is successfull and even after bad reviews how to get this output.. Pls help

@NangiMugira Жыл бұрын

What about finding the genre with the highest rating.

@maurocruz1824 2 жыл бұрын

37:00

@akashyeole2258 Жыл бұрын

Please tell me uniqueness of this project ?? My teacher is asking me ....please tell me uniqueness of this project

@DataThinkers Жыл бұрын

Based on the list of data analytics questions for the IMDB movie dataset, the uniqueness of your project could stem from the following aspects: Comprehensive Data Analysis: Your project appears to cover a wide range of data analysis tasks, including data exploration, data cleaning, descriptive statistics, data aggregation, and data visualization. This comprehensive approach allows for a holistic analysis of the IMDB movie dataset, providing a thorough understanding of the data and its various aspects. Deep Dive into Movie Attributes: Your project involves examining various attributes of movies, such as title, runtime, revenue, rating, genre, director, and voting, among others. This multifaceted analysis allows for a detailed investigation of different movie characteristics, enabling insights into different dimensions of the movie industry and audience preferences. Data Cleaning and Pre-processing: Your project includes steps to check for missing values, drop them, and identify duplicate data. This demonstrates a meticulous approach to data cleaning and pre-processing, which is crucial for obtaining accurate and reliable analysis results. Time-based Analysis: Your project involves analyzing movies on a yearly basis, such as finding the highest average voting and revenue by year, as well as counting the number of movies per year. This time-based analysis provides insights into how movie-related metrics have evolved over time, revealing trends, patterns, and changes in the industry. Genre Analysis: Your project includes examining movie genres, such as finding unique genre values, counting the number of films per genre, and classifying movies based on genre. This genre-based analysis adds a unique dimension to your project, allowing for insights into popular genres, genre trends, and genre-specific characteristics of movies. Rating and Revenue Relationship: Your project involves investigating the relationship between movie ratings and revenue, as well as finding the average rating for each director. This analysis provides insights into whether ratings affect revenue and sheds light on the interplay between critical acclaim and commercial success in the movie industry. Presentation of Results: Your project may involve presenting the analysis results in a visually appealing and informative manner, such as displaying top 10 movies, creating visualizations, and summarizing statistics. This approach enhances the interpretability and usability of your project, making it more engaging and accessible to stakeholders.

@akashyeole2258 Жыл бұрын

@@DataThinkers Plz reply in short 2-3 line ...... Uniqness of this project

@DataThinkers Жыл бұрын

The uniqueness of the project analyzing the IMDB movie dataset lies in its comprehensive data exploration, holistic data cleaning and pre-processing, and diverse analysis tasks, including time-based, director-wise, revenue and rating, movie classification, and genre analysis. This approach provides valuable insights into various aspects of the movie industry, such as trends over time, director performance, factors impacting movie success, and genre preferences, making the project distinct and informative.