Python Sentiment Analysis Project with NLTK and 🤗 Transformers. Classify Amazon Reviews!!

  Рет қаралды 413,461

Rob Mulla

Rob Mulla

Күн бұрын

Пікірлер: 436
@nixonsebastian2892
@nixonsebastian2892 2 жыл бұрын
great content, this deserves a million views... {'roberta_neg': 0, 'roberta_neu': 0, 'roberta_pos': 100}😀
@robmulla
@robmulla 2 жыл бұрын
Haha. Best comment! Pinned.
@xBaphometHx
@xBaphometHx Жыл бұрын
Pos should be 1, since the maximum value is 1. lol
@48-tarunsalgotra81
@48-tarunsalgotra81 Жыл бұрын
​@@robmulla plz give ur what's app no
@smi14172
@smi14172 Жыл бұрын
Good one!!😅
@Thikondrius
@Thikondrius Жыл бұрын
I don't often left comments on youtube but, finally someone that explains everything from scratch...I am a JS developer. And it's really cool your that you explain every piece of code. That really helped, I was able to understand everything.
@robmulla
@robmulla Жыл бұрын
Hey! I really apprecaite this comment. Thanks so muc.
@atharvpatawar8346
@atharvpatawar8346 2 жыл бұрын
Huge thank you to you!!! I recently participated in a ML hackathon and they had sentiment analysis as one of their problem statements. I had watched your video prior to the competition and used hugging face whereas everyone else used the standard vader. I ended up getting the highest accuracy and placed first, all in my second year of engineering. Genuinely, can’t thank you enough for the information! Team random_state42
@mohammedmehdi1940
@mohammedmehdi1940 2 жыл бұрын
Mil gaya tu yaha
@robmulla
@robmulla 2 жыл бұрын
This is so awesome! Thanks for sharing. I posted a screenshot of your comment on twitter, hope that's ok!
@bhaumik3118
@bhaumik3118 2 жыл бұрын
Btw huge fan of your statistics' notes Mr. Patawar, didn't expect to find you here.
@mohammedmehdi1940
@mohammedmehdi1940 2 жыл бұрын
@@bhaumik3118 i also study statistics from mr patawar
@TANISHQTHUSE
@TANISHQTHUSE 10 ай бұрын
nice man
@ibrahimkhanjabarkhail
@ibrahimkhanjabarkhail 8 ай бұрын
Just completed it. I really enjoyed working on it. Your way of teaching is just awesome!
@chairjacker
@chairjacker Жыл бұрын
I like the pace at which you teach this content it is relaxed and very enjoyable to watch for me.
@alexthe2
@alexthe2 Жыл бұрын
I'll admit I watched this on two times speed, but those were the best spend 21 minutes of the day! Very helpful and we'll explained!
@SaurabhSingh-oi5ev
@SaurabhSingh-oi5ev Жыл бұрын
Your videos like gem to me learned a lot your use of modules packages are like cherry on cake. Currently I'm working as an Jr. Data scientist in KPMG but man oh man you taught me many things thank you 😊 🙏
@robmulla
@robmulla Жыл бұрын
Great to hear you enjoyed the video. Data science is a never ending learning journey for all of us!
@IndianHacker-hisBest
@IndianHacker-hisBest Жыл бұрын
Bro, I just need to talk to u. I wanted to ask few questions regarding the profile you are working on. I have secured a job with Deloitte but want to switch to KPMG (Gurgaon).
@Nitesh717
@Nitesh717 2 жыл бұрын
Hey brother , you just provided the best NLP sentiment project , your channel deserve million+ subscriber , nd now I am just one new subscriber now to reach you there
@robmulla
@robmulla 2 жыл бұрын
Thank you so much 😀
@dgr8a1
@dgr8a1 Жыл бұрын
You are my newly found Python mentor. Good content Rob
@robmulla
@robmulla Жыл бұрын
Happy to be! There are a lot of good channels out there.
@jerrywang3225
@jerrywang3225 2 жыл бұрын
Your channel is a gem, thanks so much for the free course.
@robmulla
@robmulla 2 жыл бұрын
Glad you enjoyed it. Thanks for watching!
@thisisvazqz
@thisisvazqz 4 ай бұрын
I've just recently found myself interested in Computer Vision and NLP and I've finally gotten to the right content creators, this video absolutely rocks! And I fouind it 2 years late, I wonder how far are you now in this topic, if ever you come back to this comment section could I ask how did you get so experienced in this topic and how did you learn how to tackle all this problems? Thank you!
@louie0187
@louie0187 Жыл бұрын
This may be the test tutorial on any language/library/app I have ever watched. One part, very concise and well explained. Thank you.
@robmulla
@robmulla Жыл бұрын
Glad it was helpful! This comment makes me really happy and excited to make more tutorials!
@bazoo513
@bazoo513 Жыл бұрын
More of an appetite wetter. to make any use of it, I have to learn Python first 😀 But then, that's valuable by itself.
@adityabhatt04
@adityabhatt04 2 жыл бұрын
Thanks for posting the awesome tutorial. Would love to learn more from you.
@robmulla
@robmulla 2 жыл бұрын
Thanks for watching and learning!
@anishshah4850
@anishshah4850 Жыл бұрын
Great tutorial, for anyone facing the error of tensor_size more than 514 need to add the max_length as an argument in tokenizer... def polarity_scores_roberta(example): encoded_text= tokenizer(example, return_tensors='pt', truncation=True, max_length=512) # (max_length should be 512) output= model(**encoded_text) scores= output[0][0].detach().numpy() scores= softmax(scores) scores_dict= { 'roberta_neg': scores[0], 'roberta_neu': scores[1], 'roberta_pos': scores[2] } return scores_dict
@juan.o.p.
@juan.o.p. 2 жыл бұрын
Really interesting video. I've been following a lot of your tutorials lately and I must say that I really like the way you explain things, it's so easy to understand and follow along. Thank you!
@robmulla
@robmulla 2 жыл бұрын
Thanks so much for the feedback Juan. It's always hard to tell when I'm recording these if they are any good, so it's great to hear that it is helpful to you.
@CodewithTanveer-g8m
@CodewithTanveer-g8m 4 күн бұрын
this tutorial is very helpfull for me when i was learning sentiment analysis. Love it
@ayushapoorva
@ayushapoorva 2 жыл бұрын
great content, perhaps the best material I found on sentiment analysis in youtube!!!
@robmulla
@robmulla 2 жыл бұрын
Thanks for the compliment Ayush! That means a lot to me.
@farhadnikhashemi8681
@farhadnikhashemi8681 Жыл бұрын
Thanks for such a wonderful tutorial. I used your shared data on my own with Google Collab and worked so well. Just I had to download a few more libraries for tokenization. Wonderful content and I truly enjoyed it.
@kenyan_patriot-e9t
@kenyan_patriot-e9t Ай бұрын
A very terrific video! I am about to start a sentiment analysis project and this is absolute gem. Makes you want to explore everything there is in the subject
@HchggUgyjgg
@HchggUgyjgg Ай бұрын
Hey , Can you explain to me what's a dataset and a model ?
@ChitranshThakurM22AI543
@ChitranshThakurM22AI543 5 ай бұрын
00:01 In today's video, we'll explore sentiment analysis on Amazon reviews using traditional and more complex models. 02:26 Importing and reading data for sentiment analysis 07:28 Tokenization and part of speech tagging in NLTK 09:55 Introduction to VADER for sentiment analysis 15:12 Looping through Amazon review data to calculate polarity scores. 17:33 Perform sentiment analysis with NLTK and 🤗 Transformers 22:04 Explains the positive, neutral, and negative sentiments in Amazon reviews 24:24 Transformer-based deep learning models from Hugging Face are easy to use and powerful 28:45 Introduction to sentiment analysis with NLTK and Transformers 31:02 Running sentiment analysis on text using Vader and Roberta 35:28 Comparing vader and roberta sentiment analysis scores using seaborn's pair plot. 37:45 Vader model less confident compared to Roberta model 41:59 Hugging Face Transformers makes sentiment analysis simple and efficient 44:09 Explored models and ran sentiment analysis on Amazon reviews. Crafted by Merlin AI.
@fabricembida4526
@fabricembida4526 10 ай бұрын
Good, very good video! You cannot imagine how valuable this kind of video is for someone like me who is trying to transition to data science...
@brindhaganesan3580
@brindhaganesan3580 Жыл бұрын
I’m so glad I found this channel!!
@robmulla
@robmulla Жыл бұрын
Me too!
@AndrewSeywright
@AndrewSeywright Жыл бұрын
Thank you so much for this step by step process it has opened up all sorts of new analysis opportunities for our customer insights. Really well explained and easy to follow
@mateusbalotin7247
@mateusbalotin7247 2 жыл бұрын
Amazing content man! Your channel and videos deserve a lot more attention. Hope you have an amazing week!!
@robmulla
@robmulla 2 жыл бұрын
Thanks so much. I really appreciate the feedback. Please consider sharing the video with anyone else you think might learn from it.
@DmiHindi
@DmiHindi 5 ай бұрын
Really must watch video. I must say that I really like the way you explain things, it's so easy to understand and follow along. Thank you!
@sootybuu2963
@sootybuu2963 2 жыл бұрын
This was a good tutorial. I'm trying to get my feet wet in data analytics and found myself overwhelmed while trying to read the NLTK documentation, so thanks for the structured guidance. I'm working on analyzing sentiment across a dataset I've gathered myself, so I wasn't following along in kaggle and hit a hiccup as AutoModelForSequenceClassification requires pytorch and I initialized a python 3.10 environment. Oopsy poopsy. All the same, you made my headache significantly less daunting. Thank you. :)
@robmulla
@robmulla 2 жыл бұрын
Thanks so much. I’m glad it helped you get started with NLTK it can be a lot easier when you see it in action once. Setting up an environment that works with all the packages can also sometimes be frustrating so I can relate!
@davv02
@davv02 Жыл бұрын
just did all of that as a thesis by myself without knowing you made a video about it lol, luckily I've used a different Bert model from hug face at least. Nice video btw!
@robmulla
@robmulla Жыл бұрын
Thanks!
@798185xz
@798185xz 6 ай бұрын
Who are you? My saver! I was asked to conduct a sentiment analysis on reviews from my internship. I was doing computer vision at the graduate school. New to NLP. Thanks God.
@sachingupta5155
@sachingupta5155 Жыл бұрын
I find the topic really interesting , the way you explain were pretty articulated and having a fundamental approach
@kaifahmedkhan
@kaifahmedkhan 10 ай бұрын
Great content. I am doing a project in my uni where I need to do sentiment analysis on book reviews. This helped me a lot. Thanks.
@srishtikaranth
@srishtikaranth Жыл бұрын
i cannot thank you enough , you saved my 6th semester
@srivastavshubh
@srivastavshubh 28 күн бұрын
i am a beginner and i understood everythinggg
@pavlostsoukias8147
@pavlostsoukias8147 2 жыл бұрын
Rob, you are the Best! Thank you for all the quality content you are uploading! Greetings from Greece!
@robmulla
@robmulla 2 жыл бұрын
Thanks so much Pavlos for watching. Sending a 💙 to Greece.
@ngominhhieu6602
@ngominhhieu6602 8 ай бұрын
A great video! Many thanks for your valuable content.❤
@naderbazyari2
@naderbazyari2 Жыл бұрын
I am so happy to have discovered your channel. Many thanks friend.
@priyanshnegi03
@priyanshnegi03 Жыл бұрын
Really great, helped me a lot in my project!
@robmulla
@robmulla Жыл бұрын
Glad it helped. Thanks for watching.
@ColaWen
@ColaWen 9 ай бұрын
Awesome! I am shocked that everything is so efficient and amazing. THANKS!
@robmulla
@robmulla 9 ай бұрын
Glad it was helpful! Share the video with friends.
@tusharguys1234
@tusharguys1234 6 ай бұрын
🎯 Key points for quick navigation: 00:00 *🎬 Introduction to Sentiment Analysis* - Introduction to natural language processing (NLP) and sentiment analysis. - Overview of the project, including using traditional techniques like VADER and more advanced models like RoBERTa. - Explanation of the dataset used for sentiment analysis, which consists of Amazon food reviews with ratings. 03:00 *📊 Data Preprocessing and Exploration* - Importing necessary libraries for data analysis and visualization. - Reading the dataset and performing basic exploratory data analysis (EDA). - Downsampling the dataset for quicker analysis and showcasing the structure of the data. 05:05 *📈 Exploring Sentiment Distribution* - Analyzing the distribution of sentiment scores based on review ratings. - Visualizing the distribution of sentiment scores across different star ratings using bar plots. - Observing the relationship between review ratings and sentiment scores. 07:00 *🧠 Introduction to NLTK for Sentiment Analysis* - Overview of NLTK (Natural Language Toolkit) and its capabilities for text processing. - Demonstrating tokenization and part-of-speech tagging using NLTK. - Explaining the process of chunking text into entities using NLTK. 10:48 *📉 Sentiment Analysis with VADER* - Introduction to VADER (Valence Aware Dictionary and sEntiment Reasoner) for sentiment analysis. - Understanding how VADER assigns sentiment scores based on individual words. - Applying VADER sentiment analysis to example sentences and the food review dataset. 23:41 *🔍 Advanced Sentiment Analysis with RoBERTa* - Introducing RoBERTa, a transformer-based deep learning model for contextual understanding. - Preprocessing text and encoding it for analysis using RoBERTa's tokenizer. - Applying the pre-trained RoBERTa model to perform sentiment analysis on text data. 29:05 *📊 Comparing Vader and Roberta sentiment analysis models* - Demonstrated how to print scores from both Vader and Roberta sentiment analysis models. - Created a scores dictionary for both models to store negative, neutral, and positive scores. - Illustrated the difference in sentiment analysis results between the Vader and Roberta models using a negative review as an example. 35:52 *📈 Comparing sentiment scores across models and reviewing examples* - Utilized Seaborn's pair plot to compare sentiment scores between Vader and Roberta models. - Reviewed examples where the sentiment analysis model contradicted the actual review sentiment, showcasing nuances in language understanding. - Examined instances where both models misinterpreted the sentiment of reviews, highlighting the limitations of bag-of-words approaches like Vader. 42:08 *🤖 Simplifying sentiment analysis with Hugging Face Transformers pipeline* - Demonstrated how to use Hugging Face Transformers pipeline for sentiment analysis, simplifying the process to just two lines of code. - Showcased the ease of changing models and tokenizers within the pipeline for different analysis tasks. - Provided examples of sentiment analysis using the pipeline, showcasing its efficiency and accuracy. Made with HARPA AI
@evansala7814
@evansala7814 10 ай бұрын
Great video. Your explanations were very clear and concise and easy to follow.
@SuperMjJang
@SuperMjJang Жыл бұрын
I've watched bunch of ML videos and you are THE TOP! 👍👍👍
@TugelaCo
@TugelaCo Жыл бұрын
I rarely comment on YT videos but this is amazing! +1 subscriber!
@robmulla
@robmulla Жыл бұрын
That really means a lot to me. Thanks for leaving a comment.
@abhishekpadmanabhan3945
@abhishekpadmanabhan3945 9 ай бұрын
Excellent video, started coding with chatgpt, and this adds a new layer of info , thank you mate :) Subd
@fpishita2725
@fpishita2725 Ай бұрын
very good, thank you for your effort and passion!!!
@carlossamperquinto2777
@carlossamperquinto2777 Жыл бұрын
This video is incredibly helpful! Thanks!
@stevebim000
@stevebim000 2 жыл бұрын
Extremly useful, super easy to understand! Thank you so much for a great and valuable video !!
@robmulla
@robmulla 2 жыл бұрын
Really appreciate the feedback. Comments like this make me want to keep making more videos!
@monty510
@monty510 10 ай бұрын
Great video, I am starting to understand NLP much more. Thank you so much!
@josiel.delgadillo
@josiel.delgadillo 2 жыл бұрын
Just found your channel through Twitter. Great work, I am doing research in sentiment analysis and related to a lot of the video. Cool stuff! I will have to use the pariplot, I typically use a confusion matrix.
@robmulla
@robmulla 2 жыл бұрын
Awesome Josiel. Glad you find it helpful. Check out some of my other videos if you have time and share the video with friends!
@kmkushad
@kmkushad Жыл бұрын
Thanks for the video, we have a school project to do anything coding related and while my classmates are using scratch I wanted to do something flashier, and some kind of language analysis seemed the way to go. I'll use this video as inspiration.
@robmulla
@robmulla Жыл бұрын
I love it! Good luck on your project !
@techingenius2540
@techingenius2540 Жыл бұрын
insane
@robmulla
@robmulla Жыл бұрын
@@techingenius2540 in the membrane?
@chrisogonas
@chrisogonas Жыл бұрын
Great resource! Thanks Rob.
@robmulla
@robmulla Жыл бұрын
Glad you liked it! Thanks for watching.
@patrickonodje1428
@patrickonodje1428 2 жыл бұрын
I founf this video immensely helpful Rob Thanks
@robmulla
@robmulla 2 жыл бұрын
So glad you found it helpful!!
@nandanhegde532
@nandanhegde532 Жыл бұрын
Great Content, thanks man
@robmulla
@robmulla Жыл бұрын
Thanks!
@vinitkumarpatel1030
@vinitkumarpatel1030 6 ай бұрын
Very good explanation . Thanks a lot❤❤
@ademhilmibozkurt7085
@ademhilmibozkurt7085 2 жыл бұрын
What a video! I lovee this. Please keep continue this content. Greetings
@robmulla
@robmulla 2 жыл бұрын
Thank you! Will do, Adem!
@666rony
@666rony 2 жыл бұрын
crystal clear explanation thanks my friend
@robmulla
@robmulla 2 жыл бұрын
Glad you liked it!
@analysis_maestro_taha
@analysis_maestro_taha Жыл бұрын
Thank you very much for this video. I'm new to the field of Data Analysis and related disciplines so this sentimental analysis project is pretty insightful for me.
@robmulla
@robmulla Жыл бұрын
Glad you found it helpful
@Curious_Citizen0
@Curious_Citizen0 2 жыл бұрын
Pls make more such videos, that was great. I am a data engineer and wants to move to Data Science, please make videos for guidance also. Love from India
@robmulla
@robmulla 2 жыл бұрын
I will! Hope this video was helpful for you in your journey into data science.
@deepeshrajak3407
@deepeshrajak3407 Жыл бұрын
your content is goldmine
@robmulla
@robmulla Жыл бұрын
Thank you sir! Share the goldmine with others!
@ahmadnawaz3683
@ahmadnawaz3683 Жыл бұрын
Rob you are the best. Hands Down mate.
@ivanalonso1460
@ivanalonso1460 2 ай бұрын
Really good content. Liked and subscribed!
@engmohammedbahanshal5204
@engmohammedbahanshal5204 Жыл бұрын
Thanks for great model ideas.
@robmulla
@robmulla Жыл бұрын
Glad you like them!
@it029-shreyagandhi5
@it029-shreyagandhi5 11 ай бұрын
Great work🎉🎉🎉🎉 ty for this amazing video .Your explanation , flow , content everything is up to the mark 🚩
@seblewongelawash5891
@seblewongelawash5891 Жыл бұрын
Thank you! Great content and easy to understand!
@robmulla
@robmulla Жыл бұрын
Appreciate that!
@nguyenhuyhoangk18hcm37
@nguyenhuyhoangk18hcm37 6 күн бұрын
thank you for your videos. This video is useful for my project in the future. Instead of using English dataset, I can train the Vietnamese dataset!
@sindhumatipanigrahi3801
@sindhumatipanigrahi3801 Жыл бұрын
Thank you so much. This tutorial helped me in my project. Thanks a lot.
@spicytuna08
@spicytuna08 Жыл бұрын
wow. speechless. both you and ml.
@rajatshukla2605
@rajatshukla2605 Жыл бұрын
Extremely helpful! Thanks a bunch!
@sebastianbenitez4401
@sebastianbenitez4401 2 жыл бұрын
thank you for this content! Great quality! Now subscribed!
@robmulla
@robmulla 2 жыл бұрын
Thanks so much for watching!
@karthiksheggoju738
@karthiksheggoju738 Жыл бұрын
I really liked this video a lot, it answered lot of my questions, thanks a lot.
@world_news261
@world_news261 Ай бұрын
Hi bro I am from india and I like your video and your explanation and english is so understandable love you bro❤❤❤❤
@analyticswithadam
@analyticswithadam 2 жыл бұрын
This is a great video, thanks a lot.
@robmulla
@robmulla 2 жыл бұрын
Glad you like it. Thanks for watching
@PriteshRPatel-lr5uh
@PriteshRPatel-lr5uh 9 ай бұрын
loved what you did, but would be nice to show how you got the amazon data as well. Plus, do you have any videos on sentiment analysis for company stocks?
@Leszt_kb
@Leszt_kb Жыл бұрын
This video was genius and very helpful thank you
@jstello
@jstello 2 жыл бұрын
how you don't have 100k subs, defeats me.
@robmulla
@robmulla 2 жыл бұрын
Hah. Thanks Juan. Maybe someday 😊
@rishirajmathur07
@rishirajmathur07 Жыл бұрын
Great content. Please do more content model which solves attrition prediction for org. Very complex subject because its hard to find already made models on such topics. It would be great help if you can make something attrition prediction model with variables more than 45-50.
@nishanths5724
@nishanths5724 8 ай бұрын
24:45 the hugging face model is not laoding properly
@daredevilxrage
@daredevilxrage Жыл бұрын
The huggingface model , should it require any preliminary dataset while we are importing it?
@CaribouDataScience
@CaribouDataScience 2 жыл бұрын
Very interesting!!
@robmulla
@robmulla 2 жыл бұрын
Thanks!
@Midhun938
@Midhun938 Жыл бұрын
Love from India ♥️
@robmulla
@robmulla Жыл бұрын
Thanks! ❤️
@jilanikashif
@jilanikashif 2 жыл бұрын
Great Content, We need more tutorial on Transformers please
@robmulla
@robmulla 2 жыл бұрын
Glad you liked it. Anything specific about transformers you would like to see? Huggingface has so many of them for various NLP tasks.
@jilanikashif
@jilanikashif 2 жыл бұрын
@@robmulla Please explain Transformers and BERT architect. Also tutorial with use case in current industry
@talhabedir3812
@talhabedir3812 Жыл бұрын
Hey Rob, great content man, it helps big time! I just cannot find you conveniently find at 26:00. I work with PyCharm so nothing is that automatic for me. Where can I download those files?
@robmulla
@robmulla Жыл бұрын
You can download the data via Kaggle! Check the notebook in the description. Hope that helps.
@talhabedir3812
@talhabedir3812 Жыл бұрын
@@robmulla Hey man, for some reason I can find the Amazon food reviews but cannot find this one on the Input tab of your Kaggle Notebook.
@usamaarif5763
@usamaarif5763 7 ай бұрын
Thanks for this video, it was descriptive, well structured and well explained. I have two questions and I would appreciate if you can give your opinion and guidence on that. 1. At the end of the day star reviews and sentiment are giving the same results so how can we justify going through all this process when we already have a very good indication of user sentiment based on the star reviews. 2. How can we get the strength and weakness of the product based on the reviews using the sentiment analysis.
@thuhuong-it0107
@thuhuong-it0107 2 жыл бұрын
great!! i hope you will create video more than!! tkssssssssss
@robmulla
@robmulla 2 жыл бұрын
Thank you, I will. I appreciate you watching.
@astitwapanwar9621
@astitwapanwar9621 Жыл бұрын
dude in 26:03 while writing the pertained model from hugging face it throwing an error. "Connection error, and we cannot find the requested files in the cached path. Please try again or make sure your Internet connection is on. " and my connection is very good I had run this around 40 times with good connection and still throwing that error and also changed the model from hugging face please help me on this
@robmulla
@robmulla Жыл бұрын
You might want to check and make sure the source hasn't changed from the hugging face site. They might have changed this specific model and your refrence might need to be updated.
@dailypolyglot2815
@dailypolyglot2815 Жыл бұрын
Had the same problem. Just solved it. Unlike aveage laptop, Kaggle notebook is not connected to internet. To get an internet access with your Kaggle notebook you need to go through a phone verification. Look for the notebook option menu on the right side.
@pythonicd1239
@pythonicd1239 Жыл бұрын
​@@dailypolyglot2815 thank you so much!
@ryrylc
@ryrylc 2 жыл бұрын
Awesome video. Would be great to see you follow the sentiment analysis with a topic analysis. I’ve seen a few different options out there (LDA, Top2Vec and BERTopic), but would love to see your take on it.
@robmulla
@robmulla 2 жыл бұрын
Great suggestion! I'll keep that in mind for future videos.
@GaurangDave
@GaurangDave 2 жыл бұрын
@@robmulla Looking forward to that!! :)
@OnLyhereAlone
@OnLyhereAlone Жыл бұрын
@robmulla, great presentation but I have looked through videos on your channel, it appears you have not done one on finetunning a BERT model with custom dataset. I am particularly wanting to learn how you would finetune a BERT model for multiclass text classification, maybe on Google collab. I think many of us subscribers would love it. Thanks.
@timdentry9754
@timdentry9754 2 жыл бұрын
One of the best tutorials on Vader and the Huggingface Transformers I have seen. One question I had: How is the confidence score calculated on the Pipeline model and is there a way to evaluate the model's performance on these calculations?
@robmulla
@robmulla 2 жыл бұрын
Thanks so much for the feedback. Glad you found it helpful. Evaluating the model performance is a bit tricky without ground truth labels. The output of the Pipeline model is essentially the probability the model predicts of each class given the dataset it was trained on. Check out the actual model description on the huggingface site here along with the noted limitations: huggingface.co/distilbert-base-uncased-finetuned-sst-2-english Specifically this part is interesting: ``` Based on a few experimentations, we observed that this model could produce biased predictions that target underrepresented populations. For instance, for sentences like This film was filmed in COUNTRY, this binary classification model will give radically different probabilities for the positive label depending on the country (0.89 if the country is France, but 0.08 if the country is Afghanistan) when nothing in the input indicates such a strong semantic shift. In this colab, Aurélien Géron made an interesting map plotting these probabilities for each country. ```
@timdentry9754
@timdentry9754 2 жыл бұрын
@@robmulla FWIW - I reached out to the creator of this and what I was told is that the score is calculated using the activation function after the final layer of the neural net. It is used to determine polarity (and is not a confidence score). The model returns an array with the score for each polarity, and the larger is the prediction. The values will always be positive, regardless of the actual sentiment class tagged to the text. This is unlike Vader's model which provides a composite polarity score that could be a positive or negative float based on the inferred sentiment (positive, negative, neutral).
@robmulla
@robmulla 2 жыл бұрын
@@timdentry9754 thanks for clarifying. Cool that you got a response from the creator!
@superfreiheit1
@superfreiheit1 2 ай бұрын
Awesome teaching quality. Can you create a coursera course
@DattaSaiSrinivasBaswa
@DattaSaiSrinivasBaswa Жыл бұрын
Hi, Sir. It was really a great video. What should I do if I want to calculate the accuracy score of both models? Is there any formula for that? I have worked with various machine learning models, but NLP appears to be quite distinct from all of them. As a newcomer to NLP, I'm still in the learning phase. Your reply would be very helpful to me.
@Geepee11
@Geepee11 Жыл бұрын
Hello, did you figure out how to do this?
@V3geta420
@V3geta420 11 ай бұрын
Is there a other source then Kaggle where you got that csv from ??
@francofmm
@francofmm 10 ай бұрын
New viewer and sub!! great work!!!
@manasghosh3709
@manasghosh3709 7 ай бұрын
Excellent explanation and material. Thank you for your efforts in making learning enjoyable. A brief query about reviews that are negative (5 stars) and positive (1 stars), where the algorithm is unable to forecast the relevancy score. Regarding these kinds of situations, how would you advise handling them??
@MuhammadHanif-tj3dr
@MuhammadHanif-tj3dr 5 ай бұрын
thank you sir. you are my savior
@jenniferchi2117
@jenniferchi2117 Жыл бұрын
Thank you so much for this video tutorial! I wanted to ask if you created the Amazon review dataset from scratch or was it already pre-made from somewhere else?
@mishuo1983
@mishuo1983 6 ай бұрын
It is really a wonderful video! I just wonder @Rob, do we need to do Cross Validations? Are there any hyper-parameters that we also need to optimise? How to do the cross validations here in the NLP? Just like the normal ML Cross Validation process? Should we worry about the overfitting under-fittings problems? How would the learning curve look like with and without the cross validations? Thanks
@MehakFatima-mx1ix
@MehakFatima-mx1ix Жыл бұрын
Amazing video! One question though. Initially we tokenized the data, found their part of speech and then grouped them into entities. However the vader and roberta model were ran on the raw example. does it mean that data cleaning/manipulation like dropping stop words etc isnt required for the models or did i understand it incorrectly?
@prithviiyer
@prithviiyer Жыл бұрын
from what i understood, the vader model automatically doesn't include stop words so just by using it it gets rid of them.
@siddharthuzumaki
@siddharthuzumaki Жыл бұрын
23:42 Step 3. Roberta Pretrained Model. RoBERTa base sentiment I am getting a value error. That is we cannot find the requested files in the cached path. Please try again or make sure your Internet connection is on. But My internet connection is good. What can I do about it?
@lohithburra5353
@lohithburra5353 Жыл бұрын
yeah im getting the same problem ValueError: Connection error, and we cannot find the requested files in the cached path. Please try again or make sure your Internet connection is on.
@FallenJakarta
@FallenJakarta 2 жыл бұрын
Thank you. Great content
@robmulla
@robmulla 2 жыл бұрын
Glad you enjoyed it! Make sure you sub and share!
@gangxaaku
@gangxaaku 2 жыл бұрын
Top-notch 🔥 !!
@robmulla
@robmulla 2 жыл бұрын
Thanks Akshat!
@DailyVibz
@DailyVibz 7 ай бұрын
WOW! Help me learn some Python of this level ! i am now at 0. learning to install it.
@osmanson8212
@osmanson8212 2 жыл бұрын
abi eline koluna sağlık çok güzel olmuş. türkçe karakterleri cozememdik
@robmulla
@robmulla 2 жыл бұрын
Thanks?!
Build Your First Pytorch Model In Minutes! [Tutorial + Code]
31:32
Exploratory Data Analysis with Pandas Python
40:22
Rob Mulla
Рет қаралды 509 М.
Правильный подход к детям
00:18
Beatrise
Рет қаралды 11 МЛН
Chain Game Strong ⛓️
00:21
Anwar Jibawi
Рет қаралды 40 МЛН
99.9% IMPOSSIBLE
00:24
STORROR
Рет қаралды 31 МЛН
Tuna 🍣 ​⁠@patrickzeinali ​⁠@ChefRush
00:48
albert_cancook
Рет қаралды 145 МЛН
Visualizing transformers and attention | Talk for TNG Big Tech Day '24
57:45
Natural Language Processing with spaCy & Python - Course for Beginners
3:02:33
Why Does Diffusion Work Better than Auto-Regression?
20:18
Algorithmic Simplicity
Рет қаралды 398 М.
The Secret to 90%+ Accuracy in Text Classification
10:34
Pritish Mishra
Рет қаралды 51 М.
Natural Language Processing in Python
1:51:03
PyOhio
Рет қаралды 1,2 МЛН
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4 МЛН
What is Sentiment Analysis?
10:05
IBM Technology
Рет қаралды 33 М.
Правильный подход к детям
00:18
Beatrise
Рет қаралды 11 МЛН