Python Sentiment Analysis Project with NLTK and 🤗 Transformers. Classify Amazon Reviews!!

317,439 views

Rob Mulla

A day ago

In this video you will go through a Natural Language Processing Python project, creating a sentiment analysis classifier with NLTK's VADER and a Hugging Face RoBERTa transformer. The project is to classify the sentiment of Amazon customer reviews. 🤗 provides some great open source models for NLP: huggingface.co/models. We will look at the difference between model outputs from the two packages and compare the results. Sentiment analysis is an important tool for data scientists to use in language modeling.
Link to Kaggle Notebook: www.kaggle.com/robikscube/sen...
Timeline:
00:00 Intro
01:10 Setup + NLTK
10:44 VADER Model
23:42 RoBERTa Model
35:51 Compare Results
Follow me on twitch for live coding streams: / medallionstallion_
My other videos:
Speed Up Your Pandas Code: • Make Your Pandas Code ...
Speed up Pandas Code: • Make Your Pandas Code ...
Intro to Pandas video: • A Gentle Introduction ...
Exploratory Data Analysis Video: • Exploratory Data Analy...
Working with Audio data in Python: • Audio Data Processing ...
Efficient Pandas Dataframes: • Speed Up Your Pandas D...
* YouTube: youtube.com/@robmulla?sub_con...
* Discord: / discord
* Twitch: / medallionstallion_
* Twitter: / rob_mulla
* Kaggle: www.kaggle.com/robikscube
#nlp #python #machinelearning #huggingface

Comments: 382
@nixonsebastian2892 1 year ago
great content, this deserves a million views... {'roberta_neg': 0, 'roberta_neu': 0, 'roberta_pos': 100}😀
@robmulla 1 year ago
Haha. Best comment! Pinned.
@xBaphometHx 1 year ago
Pos should be 1, since the maximum value is 1. lol
@48-tarunsalgotra81 1 year ago
@@robmulla plz give ur what's app no
@smi14172 5 months ago
Good one!!😅
@AndrewSeywright 1 year ago
Thank you so much for this step by step process it has opened up all sorts of new analysis opportunities for our customer insights. Really well explained and easy to follow
@sachingupta5155 7 months ago
I find the topic really interesting. The way you explained it was pretty articulate and took a fundamental approach.
@chairjacker 7 months ago
I like the pace at which you teach this content it is relaxed and very enjoyable to watch for me.
@user-hk6le3bx4c 1 month ago
Just completed it. I really enjoyed working on it. Your way of teaching is just awesome!
@alexthe2 5 months ago
I'll admit I watched this at two times speed, but those were the best-spent 21 minutes of the day! Very helpful and well explained!
@fabricembida4526 3 months ago
Good, very good video! You cannot imagine how valuable this kind of video is for someone like me who is trying to transition to data science...
@juan.o.p. 1 year ago
Really interesting video. I've been following a lot of your tutorials lately and I must say that I really like the way you explain things, it's so easy to understand and follow along. Thank you!
@robmulla 1 year ago
Thanks so much for the feedback Juan. It's always hard to tell when I'm recording these if they are any good, so it's great to hear that it is helpful to you.
@kaifahmedkhan 3 months ago
Great content. I am doing a project in my uni where I need to do sentiment analysis on book reviews. This helped me a lot. Thanks.
@mateusbalotin7247 1 year ago
Amazing content man! Your channel and videos deserve a lot more attention. Hope you have an amazing week!!
@robmulla 1 year ago
Thanks so much. I really appreciate the feedback. Please consider sharing the video with anyone else you think might learn from it.
@farhadnikhashemi8681 8 months ago
Thanks for such a wonderful tutorial. I used your shared data on my own with Google Colab and it worked so well. I just had to download a few more libraries for tokenization. Wonderful content and I truly enjoyed it.
@evansala7814 2 months ago
Great video. Your explanations were very clear and concise and easy to follow.
@naderbazyari2 8 months ago
I am so happy to have discovered your channel. Many thanks friend.
@it029-shreyagandhi5 4 months ago
Great work🎉🎉🎉🎉 ty for this amazing video .Your explanation , flow , content everything is up to the mark 🚩
@monty510 3 months ago
Great video, I am starting to understand NLP much more. Thank you so much!
@jerrywang3225 1 year ago
Your channel is a gem, thanks so much for the free course.
@robmulla 1 year ago
Glad you enjoyed it. Thanks for watching!
@stevebim000 1 year ago
Extremely useful, super easy to understand! Thank you so much for a great and valuable video!!
@robmulla 1 year ago
Really appreciate the feedback. Comments like this make me want to keep making more videos!
@Thikondrius 1 year ago
I don't often leave comments on YouTube but, finally, someone who explains everything from scratch... I am a JS developer, and it's really cool that you explain every piece of code. That really helped; I was able to understand everything.
@robmulla 1 year ago
Hey! I really appreciate this comment. Thanks so much.
@josiel.delgadillo 2 years ago
Just found your channel through Twitter. Great work, I am doing research in sentiment analysis and related to a lot of the video. Cool stuff! I will have to use the pairplot, I typically use a confusion matrix.
@robmulla 2 years ago
Awesome Josiel. Glad you find it helpful. Check out some of my other videos if you have time and share the video with friends!
@pavlostsoukias8147 2 years ago
Rob, you are the Best! Thank you for all the quality content you are uploading! Greetings from Greece!
@robmulla 2 years ago
Thanks so much Pavlos for watching. Sending a 💙 to Greece.
@SuperMjJang 10 months ago
I've watched bunch of ML videos and you are THE TOP! 👍👍👍
@louie0187 1 year ago
This may be the best tutorial on any language/library/app I have ever watched. One part, very concise and well explained. Thank you.
@robmulla 1 year ago
Glad it was helpful! This comment makes me really happy and excited to make more tutorials!
@bazoo513 1 year ago
More of an appetite whetter. To make any use of it, I have to learn Python first 😀 But then, that's valuable by itself.
@ngominhhieu6602 1 month ago
A great video! Many thanks for your valuable content.❤
@atharvpatawar8346 1 year ago
Huge thank you to you!!! I recently participated in a ML hackathon and they had sentiment analysis as one of their problem statements. I had watched your video prior to the competition and used hugging face whereas everyone else used the standard vader. I ended up getting the highest accuracy and placed first, all in my second year of engineering. Genuinely, can’t thank you enough for the information! Team random_state42
@mohammedmehdi1940 1 year ago
Found you here
@robmulla 1 year ago
This is so awesome! Thanks for sharing. I posted a screenshot of your comment on twitter, hope that's ok!
@bhaumik3118 1 year ago
Btw huge fan of your statistics' notes Mr. Patawar, didn't expect to find you here.
@mohammedmehdi1940 1 year ago
@@bhaumik3118 i also study statistics from mr patawar
@TANISHQTHUSE 3 months ago
nice man
@carlossamperquinto2777 1 year ago
This video is incredibly helpful! Thanks!
@adityabhatt04 2 years ago
Thanks for posting the awesome tutorial. Would love to learn more from you.
@robmulla 2 years ago
Thanks for watching and learning!
@Nitesh717 1 year ago
Hey brother, you just provided the best NLP sentiment project. Your channel deserves a million+ subscribers, and I am now one new subscriber to help you get there.
@robmulla 1 year ago
Thank you so much 😀
@SaurabhSingh-oi5ev 1 year ago
Your videos are like gems to me. I learned a lot, and your use of modules and packages is like the cherry on the cake. Currently I'm working as a Jr. Data Scientist at KPMG but, man oh man, you taught me many things. Thank you 😊 🙏
@robmulla 1 year ago
Great to hear you enjoyed the video. Data science is a never ending learning journey for all of us!
@IndianHacker-hisBest 9 months ago
Bro, I just need to talk to u. I wanted to ask few questions regarding the profile you are working on. I have secured a job with Deloitte but want to switch to KPMG (Gurgaon).
@dgr8a1 1 year ago
You are my newly found Python mentor. Good content Rob
@robmulla 1 year ago
Happy to be! There are a lot of good channels out there.
@abhishekpadmanabhan3945 2 months ago
Excellent video, started coding with chatgpt, and this adds a new layer of info , thank you mate :) Subd
@ayushapoorva 1 year ago
great content, perhaps the best material I found on sentiment analysis in youtube!!!
@robmulla 1 year ago
Thanks for the compliment Ayush! That means a lot to me.
@brindhaganesan3580 1 year ago
I’m so glad I found this channel!!
@robmulla 1 year ago
Me too!
@analysis_maestro_taha 1 year ago
Thank you very much for this video. I'm new to the field of Data Analysis and related disciplines so this sentiment analysis project is pretty insightful for me.
@robmulla 1 year ago
Glad you found it helpful
@rajatshukla2605 8 months ago
Extremely helpful! Thanks a bunch!
@ColaWen 1 month ago
Awesome! I am shocked that everything is so efficient and amazing. THANKS!
@robmulla 1 month ago
Glad it was helpful! Share the video with friends.
@sindhumatipanigrahi3801 8 months ago
Thank you so much. This tutorial helped me in my project. Thanks a lot.
@karthiksheggoju738 7 months ago
I really liked this video a lot, it answered lot of my questions, thanks a lot.
@sootybuu2963 1 year ago
This was a good tutorial. I'm trying to get my feet wet in data analytics and found myself overwhelmed while trying to read the NLTK documentation, so thanks for the structured guidance. I'm working on analyzing sentiment across a dataset I've gathered myself, so I wasn't following along in kaggle and hit a hiccup as AutoModelForSequenceClassification requires pytorch and I initialized a python 3.10 environment. Oopsy poopsy. All the same, you made my headache significantly less daunting. Thank you. :)
@robmulla 1 year ago
Thanks so much. I’m glad it helped you get started with NLTK it can be a lot easier when you see it in action once. Setting up an environment that works with all the packages can also sometimes be frustrating so I can relate!
@blanka_herceg 10 months ago
This video was genius and very helpful thank you
@davv02 1 year ago
Just did all of that as a thesis by myself without knowing you made a video about it lol. Luckily I used a different BERT model from Hugging Face at least. Nice video btw!
@robmulla 1 year ago
Thanks!
@sebastianbenitez4401 1 year ago
thank you for this content! Great quality! Now subscribed!
@robmulla 1 year ago
Thanks so much for watching!
@srishtikaranth 1 year ago
i cannot thank you enough , you saved my 6th semester
@kmkushad 1 year ago
Thanks for the video, we have a school project to do anything coding related and while my classmates are using scratch I wanted to do something flashier, and some kind of language analysis seemed the way to go. I'll use this video as inspiration.
@robmulla 1 year ago
I love it! Good luck on your project !
@techingenius2540 1 year ago
insane
@robmulla 1 year ago
@@techingenius2540 in the membrane?
@priyanshnegi03 1 year ago
Really great, helped me a lot in my project!
@robmulla 1 year ago
Glad it helped. Thanks for watching.
@chrisogonas 1 year ago
Great resource! Thanks Rob.
@robmulla 1 year ago
Glad you liked it! Thanks for watching.
@seblewongelawash5891 1 year ago
Thank you! Great content and easy to understand!
@robmulla 1 year ago
Appreciate that!
@patrickonodje1428 1 year ago
I found this video immensely helpful, Rob. Thanks.
@robmulla 1 year ago
So glad you found it helpful!!
@ademhilmibozkurt7085 1 year ago
What a video! I lovee this. Please keep continue this content. Greetings
@robmulla 1 year ago
Thank you! Will do, Adem!
@jenniferchi2117 1 year ago
Thank you so much for this video tutorial! I wanted to ask if you created the Amazon review dataset from scratch or was it already pre-made from somewhere else?
@francofmm 2 months ago
New viewer and sub!! great work!!!
@666rony 1 year ago
crystal clear explanation thanks my friend
@robmulla 1 year ago
Glad you liked it!
@engmohammedbahanshal5204 1 year ago
Thanks for great model ideas.
@robmulla 1 year ago
Glad you like them!
@ahmadnawaz3683 7 months ago
Rob you are the best. Hands Down mate.
@kimnhunguyent1489 1 year ago
Hi, thank you for the amazing video. Your presentation was informative and insightful. Looking forward to your future content! Btw, I want to ask how can I save my expected result, it seems like I had a good training and dont want to keep going. What should I do in this situation ? Thank you
@TugelaCo 1 year ago
I rarely comment on YT videos but this is amazing! +1 subscriber!
@robmulla 1 year ago
That really means a lot to me. Thanks for leaving a comment.
@spicytuna08 8 months ago
wow. speechless. both you and ml.
@ryrylc 1 year ago
Awesome video. Would be great to see you follow the sentiment analysis with a topic analysis. I’ve seen a few different options out there (LDA, Top2Vec and BERTopic), but would love to see your take on it.
@robmulla 1 year ago
Great suggestion! I'll keep that in mind for future videos.
@GaurangDave 1 year ago
@@robmulla Looking forward to that!! :)
@marcodigennarobari 6 days ago
great stuff!!
@mohit_hada 1 year ago
Pls make more such videos, that was great. I am a data engineer and wants to move to Data Science, please make videos for guidance also. Love from India
@robmulla 1 year ago
I will! Hope this video was helpful for you in your journey into data science.
@rishirajmathur07 9 months ago
Great content. Please do more content model which solves attrition prediction for org. Very complex subject because its hard to find already made models on such topics. It would be great help if you can make something attrition prediction model with variables more than 45-50.
@andreascalenghe8068 9 months ago
Great content, thanks
@-zak-7048 14 days ago
what an absolute legend
@NisaRoy-jo2wi 2 months ago
Great content.thank u
@rachmanmohammad6210 1 year ago
Thank you. Great content
@robmulla 1 year ago
Glad you enjoyed it! Make sure you sub and share!
@PriteshRPatel-lr5uh 2 months ago
loved what you did, but would be nice to show how you got the amazon data as well. Plus, do you have any videos on sentiment analysis for company stocks?
@nandanhegde532 1 year ago
Great Content, thanks man
@robmulla 1 year ago
Thanks!
@analyticswithadam 1 year ago
This is a great video, thanks a lot.
@robmulla 1 year ago
Glad you like it. Thanks for watching
@savichopra9083 9 months ago
very usefulll!
@OnLyhereAlone 10 months ago
@robmulla, great presentation, but I have looked through the videos on your channel and it appears you have not done one on fine-tuning a BERT model with a custom dataset. I am particularly wanting to learn how you would fine-tune a BERT model for multiclass text classification, maybe on Google Colab. I think many of us subscribers would love it. Thanks.
@anishshah4850 1 year ago
Great tutorial. For anyone facing the tensor size error (more than 514), add max_length as an argument to the tokenizer:

def polarity_scores_roberta(example):
    # tokenizer, model and softmax are the objects set up earlier in the notebook
    # truncation=True with max_length=512 cuts over-long reviews down to size
    encoded_text = tokenizer(example, return_tensors='pt', truncation=True, max_length=512)
    output = model(**encoded_text)
    scores = output[0][0].detach().numpy()
    scores = softmax(scores)
    scores_dict = {
        'roberta_neg': scores[0],
        'roberta_neu': scores[1],
        'roberta_pos': scores[2]
    }
    return scores_dict
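Editor's sketch: a rough plain-Python picture of what truncating to max_length=512 means for a long review. This is not the tokenizer's actual implementation; the helper name truncate_ids, the special-token ids (0 and 2) and the made-up token ids are all assumptions for illustration only.

```python
BOS_ID, EOS_ID = 0, 2   # assumed start/end special-token ids, for illustration only
MAX_LENGTH = 512        # the limit passed as max_length in the comment above


def truncate_ids(token_ids, max_length=MAX_LENGTH):
    """Keep only the leading tokens that fit, leaving room for two special tokens."""
    body = token_ids[: max_length - 2]
    return [BOS_ID] + body + [EOS_ID]


long_review = list(range(1000, 1700))  # 700 made-up token ids standing in for a long review
encoded = truncate_ids(long_review)
print(len(encoded))  # the model only ever sees this many positions
```

The trade-off is that everything past the cut is ignored, so a review whose tone changes near the end may be scored on its opening alone.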
@merwinjosepha3897 2 months ago
Thank you so much
@gangxaaku 2 years ago
Top-notch 🔥 !!
@robmulla 2 years ago
Thanks Akshat!
@deepeshrajak3407 1 year ago
your content is goldmine
@robmulla 1 year ago
Thank you sir! Share the goldmine with others!
@zikrifisehaye323 5 months ago
THANK YOU!
@MoAlarawi 10 months ago
Great content.
@user-bc5wf2qq2r 3 months ago
Amazing!
@henkhbit5748 1 year ago
Clearly explained and the comparison vaders versus transformers is quite interesting. I see that transformers Bert model is much better in understanding nuances in sentences. Do you know what kind of algorithm textblob used? I just bumped to this channel when searching for sentiment analysis and like the content very much and subscribed also.
@robmulla 1 year ago
Thanks for subscribing! I'm glad you learned something new. I've never used textblob but it says it's a "lexicon-based approach" so I'm guessing it's similar to VADER.
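Editor's sketch: to make "lexicon-based" concrete, here is a toy scorer that looks each word up in a small dictionary of valences and squashes the sum into the range (-1, 1). It is not VADER's or textblob's actual code. The lexicon and the name toy_compound are made up, and the x / sqrt(x² + 15) squashing only mirrors how VADER's compound score is commonly described, so treat that as an assumption.

```python
import math

# Hypothetical mini-lexicon; real lexicons hold thousands of human-rated words.
TOY_LEXICON = {"great": 3.0, "love": 3.2, "good": 1.9, "bad": -2.5, "terrible": -3.1}


def toy_compound(text, alpha=15.0):
    """Sum the valence of each known word, then squash the total into (-1, 1)."""
    words = [w.strip(".,!?").lower() for w in text.split()]
    total = sum(TOY_LEXICON.get(w, 0.0) for w in words)
    return total / math.sqrt(total * total + alpha)


print(toy_compound("Great product, I love it!"))  # positive
print(toy_compound("Terrible, really bad."))      # negative
print(toy_compound("It arrived on Tuesday."))     # no known words
```

A word-by-word lookup like this has no notion of context, which is one reason a transformer model can read sarcasm or negation that a lexicon misses.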
@akshatbhatnagar3571 1 year ago
Great video. Also, is there a way to include the number of retweets or followers in the sentiment analysis process?
@manasghosh3709 1 day ago
Excellent explanation and material. Thank you for your efforts in making learning enjoyable. A brief query about reviews that are negative (5 stars) and positive (1 stars), where the algorithm is unable to forecast the relevancy score. Regarding these kinds of situations, how would you advise handling them??
@usamaarif5763 6 days ago
Thanks for this video, it was descriptive, well structured and well explained. I have two questions and I would appreciate it if you could give your opinion and guidance on them. 1. At the end of the day, star reviews and sentiment are giving the same results, so how can we justify going through all this process when we already have a very good indication of user sentiment based on the star reviews? 2. How can we get the strengths and weaknesses of the product from the reviews using sentiment analysis?
@sudurimabanerjee4612 1 month ago
Thanks for the video. Very well explained. Is there any token limit for the transformer based Roberta model ?
@jilanikashif 1 year ago
Great Content, We need more tutorial on Transformers please
@robmulla 1 year ago
Glad you liked it. Anything specific about transformers you would like to see? Huggingface has so many of them for various NLP tasks.
@jilanikashif 1 year ago
@@robmulla Please explain Transformers and BERT architect. Also tutorial with use case in current industry
@mohammedkastali7096 3 months ago
Important lesson, thanks
@eleonorpatak4698 1 year ago
hey sir! thx for the tuto!! for an end to end project , can we save those models example roberta with pickle to deploy it on the web or is there other method for this kind of models?
@sebastianwefers8579 1 year ago
Hey Rob, great content as always. I am currently working on sentiment prediction project and labelling customer review data with its corresponding sentiments to later train a supervised model on it. How would we for example deal with observations that have a positive sentiment score but a very low customer rating? Can this be considered noise and we simply remove it or do we just assume weird human behaviour and leave it in? Kind regards
@amaansaigal5508 5 months ago
Could also be a result of people using ratings and comments as two different mediums of feedback. I often use ratings to indicate the quality and comments to write about anything I found that was peculiar or negative, not necessarily reflecting the overall quality of the product, just highlighting the bad features.
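Editor's sketch: one way to look at the mismatch discussed in this thread is to pull out the reviews where the star rating and the model disagree and read them by hand before deciding whether they are noise. The rows below are made up, and the column names (Score for stars, roberta_pos and roberta_neg for model outputs), the helper name disagreements and the 0.8 threshold are assumptions.

```python
# Made-up rows; 'Score' is the star rating, the roberta_* values are model outputs.
reviews = [
    {"Id": 1, "Score": 5, "roberta_pos": 0.97, "roberta_neg": 0.01},
    {"Id": 2, "Score": 1, "roberta_pos": 0.91, "roberta_neg": 0.03},
    {"Id": 3, "Score": 1, "roberta_pos": 0.02, "roberta_neg": 0.95},
    {"Id": 4, "Score": 5, "roberta_pos": 0.05, "roberta_neg": 0.88},
    {"Id": 5, "Score": 3, "roberta_pos": 0.40, "roberta_neg": 0.30},
]


def disagreements(rows, threshold=0.8):
    """Return rows where the stars and the model point in opposite directions."""
    flagged = []
    for row in rows:
        low_star_but_positive = row["Score"] <= 2 and row["roberta_pos"] >= threshold
        high_star_but_negative = row["Score"] >= 4 and row["roberta_neg"] >= threshold
        if low_star_but_positive or high_star_but_negative:
            flagged.append(row)
    return flagged


flagged_ids = [row["Id"] for row in disagreements(reviews)]
print(flagged_ids)
```

Reading the flagged rows usually shows a mix of sarcasm, mis-clicked stars, and reviewers using the text for a different purpose than the rating, which helps decide whether to drop or keep them.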
@CaribouDataScience 1 year ago
Very interesting!!
@robmulla 1 year ago
Thanks!
@timdentry9754 1 year ago
One of the best tutorials on Vader and the Huggingface Transformers I have seen. One question I had: How is the confidence score calculated on the Pipeline model and is there a way to evaluate the model's performance on these calculations?
@robmulla 1 year ago
Thanks so much for the feedback. Glad you found it helpful. Evaluating the model performance is a bit tricky without ground truth labels. The output of the Pipeline model is essentially the probability the model predicts for each class, given the dataset it was trained on. Check out the actual model description on the Hugging Face site, along with the noted limitations: huggingface.co/distilbert-base-uncased-finetuned-sst-2-english
Specifically this part is interesting:
```
Based on a few experimentations, we observed that this model could produce biased predictions that target underrepresented populations. For instance, for sentences like This film was filmed in COUNTRY, this binary classification model will give radically different probabilities for the positive label depending on the country (0.89 if the country is France, but 0.08 if the country is Afghanistan) when nothing in the input indicates such a strong semantic shift. In this colab, Aurélien Géron made an interesting map plotting these probabilities for each country.
```
@timdentry9754 1 year ago
@@robmulla FWIW - I reached out to the creator of this and what I was told is that the score is calculated using the activation function after the final layer of the neural net. It is used to determine polarity (and is not a confidence score). The model returns an array with the score for each polarity, and the larger is the prediction. The values will always be positive, regardless of the actual sentiment class tagged to the text. This is unlike Vader's model which provides a composite polarity score that could be a positive or negative float based on the inferred sentiment (positive, negative, neutral).
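Editor's sketch: the "activation function after the final layer" described here is, for this kind of classifier, typically a softmax over the raw outputs (logits), which is also what the softmax call in the RoBERTa snippet elsewhere in this thread does. A minimal standard-library version, using made-up logits rather than real model output:

```python
import math

LABELS = ("roberta_neg", "roberta_neu", "roberta_pos")


def softmax(logits):
    """Turn raw final-layer outputs into positive values that sum to 1."""
    peak = max(logits)  # subtracting the max keeps exp() numerically stable
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def to_scores_dict(logits):
    """Attach a label to each softmax output."""
    return dict(zip(LABELS, softmax(logits)))


# Hypothetical logits for one review (not produced by a real model)
scores = to_scores_dict([-1.2, 0.3, 2.5])
predicted = max(scores, key=scores.get)  # the largest score is the prediction
print(scores, predicted)
```

Every value is positive and the values sum to 1, so the sign never carries meaning here; that is the contrast with VADER's single compound score, which can be negative.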
@robmulla 1 year ago
@@timdentry9754 thanks for clarifying. Cool that you got a response from the creator!
@sahilkakkar5628 1 year ago
Thank you
@jbie4590 10 months ago
thanks man
@muslumyildiz5694 1 year ago
you are awesome.. thanks a lot..
@robmulla 1 year ago
Thanks for watching. Share with a friend!
@lawalsontomiwa1925 1 year ago
great tutorial , quick question sir ... does the hugging face model understand emojis 😃🤬 and can it be translated to the score points of the sentiments results
@thuhuong-it700 1 year ago
great!! i hope you will create video more than!! tkssssssssss
@robmulla 1 year ago
Thank you, I will. I appreciate you watching.
1 year ago
Nice work
@robmulla 1 year ago
Thanks for watching!
@setarehfasihi8090 8 months ago
Very good, thanks. Do you have any tutorial regarding readability tests in Python with many texts in an Excel file?
@mohan250s 1 year ago
you are awesome bro
@robmulla 1 year ago
No, YOU are awesome. Thanks for watching.
@huansun5384 1 year ago
Very well explained video and clear guidance! I have a question about the preprocessing part of the text before putting it into the tqdm sia loop, do we directly put the raw data into it, or do we do the tokenize, remove stop words and stuff first, and then go for the sentiment analysis? Looking forward to your reply!
@robmulla 1 year ago
Hey Huan! Glad you found the video helpful. I'm not sure about the loop you are referring to but typically the text needs to be tokenized, but depending on the model it may handle that within the predict function. Hope that helps.
@huansun5384 1 year ago
@@robmulla Hi Medallion, got it and that makes sense, thanks for the clarification!
@navaneeths4694 1 year ago
Great content, Really loved the explanation. I'm new to sentiment analysis but was wondering this : My objective is to score a set of reviews online of products, so shouldn't i first do a set of text pre-processing like normalization, spell check , lemmatization, tokenization before feeding each sentence into the pre-trained transformers model ? . How much of a difference would this cause in accuracy of predictions ?
@robmulla 1 year ago
Stoked you enjoyed it Navaneeth! This video only scratches the surface. The tokenization and preprocessing of the text is usually built into the model pipeline and would depend on the model you are using. I'm not sure about how it would impact the accuracy, but for VADER, for instance, I believe stop words are removed. Worth looking into for sure!
@ShahZ 8 months ago
@Rob, this another one of your masterpiece. Almost 300 comments and counting. How about a refresher on a newer Deep Learning Model :)
@HrisavBhowmick 10 months ago
Nice video! Now instead of sentences suppose there were paragraphs . What will be your approach to find sentiment of paragraphs?
@baneledludlu7983 1 year ago
Great content, wow!😍,how on earth did you master all that😅.!
@robmulla 1 year ago
Thanks for the feedback. I'm still learning every day just like you. The great part about data science is that there is always something new to master.