Introduction to NLP | Bag of Words Model

Рет қаралды 34,079

Normalized Nerd

Күн бұрын

Пікірлер: 61

@অচিনমানুষ-জ৪খ Жыл бұрын

I am Bengali as well. I am glad that you are doing this and feeling proud of you brother.

@balipavankalyan5008 2 жыл бұрын

Rocked in very less time. This can only come with very keen knowledge.

@fahdciwan8709 4 жыл бұрын

excellent vids brother ... keep going. You've compressed months & months of learning into a easy-to-learn videos ... please don't stop

@NormalizedNerd 4 жыл бұрын

Thanks a lot for appreciating the effort :D

@shivanineeli5392 3 жыл бұрын

My 4 months struggle to learn nlp got done in 23 min. Thanks a ton bro!

@NormalizedNerd 3 жыл бұрын

Glad to hear that bro! ❤

@sumitchhabra2419 4 жыл бұрын

I am at present studying the Machine Learning A-Z course from Udemy. Trust me, the tutor has no Idea how to explain stuff and it is the best selling course on Udemy for Data Science. I scratched my head while I was going through his tutorial, then I came here and all my doubts were cleared. Coding in python is not a difficult task but understanding the concept is the most important thing. And I got that understanding of Bag of words Model from your tutorial. Thanks a lot for your help. P.S. One who understands the concept and has strong Fundamentals has the ability to explain stuff in the simplest manner possible. Please keep it Simple like this in upcoming tutorials also. ALREADY SUBSCRIBED.

@NormalizedNerd 4 жыл бұрын

Thanks a lot @Sumit Chhabra. I'll try my best to maintain this level of simplicity :)

@rajaroy4731 3 жыл бұрын

thanks a lot man, after two days of such boring videos on youtube finally found a good one to learn NLP, great work man keep going on machine learning and other data science topic, it's really rare to find such a great video.

@NormalizedNerd 3 жыл бұрын

Thanks a lot mate! Keep supporting...

@dimaelzein9908 3 жыл бұрын

I like the video. Simple and straight to the point. Keep it up!

@NormalizedNerd 3 жыл бұрын

Thanks! :D

@my_day_my_day Жыл бұрын

Amazing. W8ing for more video😊

@dien2971 2 жыл бұрын

Thank you for your best explanation!

@akalankagamage8112 4 жыл бұрын

Really Amazing and very clear . Keep this up . New subscriber .

@NormalizedNerd 4 жыл бұрын

Thanks for the sub!

@vindyaabeygunawardena6222 2 жыл бұрын

excellent. thanks Normalised Nerd

@thegamooo7310 4 жыл бұрын

Very nice explanation also covered much things in very less time. Keep it up Man👍

@NormalizedNerd 4 жыл бұрын

Glad to hear that!

@soumadeepacharya9513 Жыл бұрын

Thanks brother❤

@prajnapn5946 3 жыл бұрын

Well explained !!!

@NormalizedNerd 3 жыл бұрын

Thanks!!

@shaminmohammed672 3 жыл бұрын

Thank you sir 🙏

@thetensordude 4 жыл бұрын

chalie jao dada!!!

@NormalizedNerd 4 жыл бұрын

Thank you bhai!

@thetensordude 4 жыл бұрын

Can you make some videos about seaborn?

@sumitchhabra2419 4 жыл бұрын

What does CountVectorizer do in this model ? Does it convert the words in the instances/Document into 0/1 please suggest.... Thanks for your help in advance.

@NormalizedNerd 4 жыл бұрын

It'll generate the feature matrix that I started drawing at 7:06

@oludotunode 3 жыл бұрын

Shouldn't you initialize the regular expression outside the loop?

@elirhm5926 3 жыл бұрын

Thank you so much

@christinavalavani1561 3 жыл бұрын

Please make a video or two about neural machine translation. With an example.

@elirhm5926 3 жыл бұрын

how can we go back to the original sentence from X_test? I mean how can I see what sentences the algorithm doesn't classify correctly?

@lpereira95 3 жыл бұрын

Compare y_pred and y_test. The indices where they don't match are the mistaken samples. Then use that indices to access the sentences in X_test. I hope it helps.

@user-jz3wo1om2c 4 жыл бұрын

you are great.

@apoorvshrivastava3544 5 жыл бұрын

Great, New subscriber

@NormalizedNerd 5 жыл бұрын

Thanks a lot :D

@doyourealise 4 жыл бұрын

amazing keep posting videos :)

@NormalizedNerd 4 жыл бұрын

Will do 😁

@kanchangandhi1200 4 жыл бұрын

Very informative video , thank you for uploading the NLP series , having one queries how can we use auto text summary generator in other human lanaguages text like(Japanese , chinese & Korean)????? your reply would be very helpfull.

@NormalizedNerd 4 жыл бұрын

Unfortunately there's no library for that can summarize every language. However, you'll find many github repos where people have built text summarizers for other languages using the same method!

@assiaben1220 3 жыл бұрын

Hello Normalize Nerd, i've got an error in y(target) y = data.as_matrix(['Review_class'], AttributeError: 'DataFrame' object has no attribute 'as_matrix', By the way, thanks 💚 the tutorial is very clear and well explained . 👏Bravo

@vindyaabeygunawardena6222 2 жыл бұрын

change it into data.(['Review_class'].to_numpy().

@user-or7ji5hv8y 3 жыл бұрын

Is X_train the bag of words?

@NormalizedNerd 3 жыл бұрын

X_train is the feature matrix.

@roopeshroope5988 3 жыл бұрын

We first split the data and do Preprocessing right why you perform on whole dataset

@NormalizedNerd 3 жыл бұрын

Yes, ideally we should first split then preprocess. Here, the text preprocessing will remain the same for both train and test set so I did them together. However, I also formed the BOW model on the whole data; which is not the correct way. We should build it only on the training set then apply it on the test set. I did it just to make things a little easier.

@user-or7ji5hv8y 3 жыл бұрын

How can we access the text files used?

@NormalizedNerd 3 жыл бұрын

I've provide the link in the video description

@dhusor9633 4 жыл бұрын

plz a make tutorial how nlp is work on Bangla text datasets.

@NormalizedNerd 4 жыл бұрын

Sure I'll...stay tuned!

@apoorvshrivastava3544 5 жыл бұрын

why we use deimitor as '/t'

@NormalizedNerd 5 жыл бұрын

In the .txt file, the values are separated by tab('/t') just like the values are separated by a comma in a .csv file. In pandas we have the function read_csv(reads .csv files by default). We need to pass the parameter to read tab-separated files.

@sumitchhabra2419 4 жыл бұрын

One suggestion here. Please ZOOM-IN your screen while you are explaining the coding part. Press Ctrl and scroll up from your mouse, it will zoom in. It puts strain on our eyes and understanding the coding part becomes a punishment.

@NormalizedNerd 4 жыл бұрын

Point noted. Thanks for the feedback.

@mdarifulislam2736 4 жыл бұрын

ভাই আমিও বাঙালি.! ❤❤❤

@NormalizedNerd 4 жыл бұрын

বেশ ভালো লাগলো আপনার কমেন্টটি পেয়ে। চ্যানেলটিকে আপনার পরিচিতদের মধ্যে শেয়ার করার অনুরোধ রইল। ❤️

@rakeshmanathana 4 жыл бұрын

Is it 'Lov' or 'Love' the root?

@NormalizedNerd 4 жыл бұрын

Good question. The thing is...stemming should give us 'lov'. But, Porter stemmer gives us 'love'. I guess the reason lies in the details of Porter stemmer's implementation.