Word2Vec Easily Explained- Data Science

Views: 174,516

Comments: 128

@karthip23 5 years ago

I was trying to understand word2vec for the past two years with many videos. You made it clear today with just this 20 minute stuff. You are simply amazing :)

@ajeetsinghshekhawat2696 3 years ago

Guys, please use "words = model.wv.key_to_index" in place of "words = model.wv.vocab" in code line 60, as per the gensim update. Thanks Krish sir for all the efforts you made for the data science community.
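
A minimal sketch of the change described above, assuming `model.wv` exposes `key_to_index` on gensim 4.x and `vocab` on older releases. The helper name `vocab_words` and the stand-in objects are hypothetical; they are used only so the snippet runs without gensim installed.

```python
from types import SimpleNamespace


def vocab_words(wv):
    """Return the vocabulary of a gensim KeyedVectors-like object as a list.

    gensim >= 4.0 exposes `key_to_index`; older releases exposed `vocab`.
    """
    if hasattr(wv, "key_to_index"):
        return list(wv.key_to_index)
    return list(wv.vocab)


# Stand-ins for `model.wv` (illustrative only, not real gensim objects).
new_style = SimpleNamespace(key_to_index={"vision": 0, "india": 1})
old_style = SimpleNamespace(vocab={"vision": object(), "india": object()})

print(vocab_words(new_style))  # same words either way
print(vocab_words(old_style))
```

With a trained model, the call would be `words = vocab_words(model.wv)`.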

@melisavidiera3673 3 years ago

You're a savior!!!

@abhishekpratapsingh9283 2 years ago

At 3:05 you say that TF-IDF also does not store semantic information, but in the TF-IDF video you said it stores words semantically, unlike Bag of Words.

@BalaguruGupta 4 years ago

your tutorials are so good, watching only once is enough to understand the concept. thank u sir

@shobhitsrivastava2123 5 years ago

sir, please upload practical videos of Glove and Bert

@linusthelab 3 years ago

Thanks!

@AnalyticsMaster 5 years ago

I am looking for an explanation of how the vectors are derived. Most of the other YouTube videos that I have seen did not explain this. I was expecting it in your videos, but here too only the Python implementation is explained. How the vectors are derived mathematically is missing here as well. I would appreciate it if you could elaborate on that, since you have a special talent for explaining complex things in a simplified manner.

@Someonner 5 years ago

For that you can go to Stanford's CS224n.

@arushkharbanda 4 years ago

I was also looking for the same.

@maulindusarkar4581 3 years ago

kzbin.info/www/bejne/i6K1dHijp9tknbM

@karaniitgn0908 3 years ago

That requires good knowledge of probability.

@shawkyahmad 2 years ago

Hi, did you find the answer?
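
For the derivation asked about in this thread, a brief sketch of the skip-gram objective as given in the original word2vec papers (the video itself covers only the Python side). Here $T$ is the corpus length, $c$ the context window size, $V$ the vocabulary, and $v_w$, $v'_w$ the input and output vectors of word $w$; this notation is an assumption and does not come from the video.

```latex
\[
J(\theta) = \frac{1}{T} \sum_{t=1}^{T} \sum_{\substack{-c \le j \le c \\ j \ne 0}} \log p(w_{t+j} \mid w_t),
\qquad
p(w_O \mid w_I) = \frac{\exp\!\left({v'_{w_O}}^{\top} v_{w_I}\right)}{\sum_{w \in V} \exp\!\left({v'_{w}}^{\top} v_{w_I}\right)}
\]
```

Training adjusts the vectors to maximize $J$ by gradient ascent; in practice the full softmax in the denominator is usually approximated, for example with negative sampling.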

@sandipansarkar9211 4 years ago

Superb video once again, Krish. All my doubts about word2vec are now gone. Thanks

@siddharthsingh7717 2 years ago

Requesting you to upload more videos on BERT, Transformers, LSTM, GRU, etc. in the NLP playlist. It would be of great help. Thanks Krish for making such amazing videos.

@medley5670 3 years ago

Following ur NLP playlist... I must say u are very good at explaining each and every concept clearly. Thank you so much for the effort that u have put into creating this amazing playlist. I took a course to learn NLP but your playlist is far better than the course. Thank you sir!

@amaljose116 3 years ago

Love the conceptual videos, have been searching everywhere.

@mohammedfaisal6714 5 years ago

Excellent Explanation Zabardast Bhai 😎

@sahiltrivedi69 3 years ago

Thank you very much for this video, super helpful 👍👍👍

@akshayakki8969 5 years ago

@Krish Naik Sir please make videos on data structures and algorithms...you are a great teacher 🙏🙏🙏🙏

@sanandapodder5027 4 years ago

Great explanation. You made the complex topic very simple, sir. Thank you very much. One request: please upload all the PPTs you showed in this NLP series.

@nareshkatturi9012 3 years ago

Thank you krish 🙏

@manjunath.c2944 5 years ago

Superb job, Krish. Kindly make a video on BERT, which would be very helpful.

@MAhmadian 3 years ago

Thanks Krish. Why didn't you strip the punctuation from the input text? Are you expecting to get some useful information from it?

@Maths_With_Rahul 3 years ago

Sir, there is one change in Gensim from version 3.8.0 to 4.0.0: model.wv.vocab has changed to model.wv.key_to_index (with the model initialized as in your video). Thank you.

@thepresistence5935 3 years ago

Thanks!

@BharathKumar-vs8fm 5 years ago

Krish, please make a video on Glove model and pickle model

@amruthasankar3453 1 year ago

Thank you sir ❤️🔥

@riteshpatil7230 3 years ago

Hello, can we use Word2Vec in the same way as Bag of Words and TF-IDF for training a classification model? If yes, how do we do it? If no, then how exactly can we see whether Word2Vec is overcoming the drawbacks of TF-IDF or not?
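
One common way to do this (an assumption here, not something shown in the video) is to average the word vectors of each document into one fixed-length feature vector and pass those rows to any classifier, much as one would with Bag of Words or TF-IDF rows. The sketch below uses made-up vectors in a plain dict; `document_vector` is a hypothetical helper name.

```python
def document_vector(tokens, word_vectors):
    """Average the vectors of the tokens found in `word_vectors`.

    Returns None when no token has a vector.
    """
    found = [word_vectors[t] for t in tokens if t in word_vectors]
    if not found:
        return None
    dim = len(found[0])
    return [sum(vec[i] for vec in found) / len(found) for i in range(dim)]


# Toy 3-dimensional vectors (made up for illustration, not trained).
toy_vectors = {
    "good": [1.0, 0.0, 2.0],
    "movie": [3.0, 2.0, 0.0],
}

features = document_vector(["good", "movie", "unseen"], toy_vectors)
print(features)  # one fixed-length feature row per document
```

Averaging discards word order, so whether it beats TF-IDF on a given task is something to check on held-out data rather than assume.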

@Xnshau 2 years ago

Great explanation. How do I evaluate the performance of two or more models trained on the same dataset?

@akash_thing 4 years ago

Hi Krish! Can you make a video on converting a whole data frame's words to vectors using Word2Vec, as you have not completely explained it?

@keerthivasini 4 years ago

Sir, Please do a similar video about how to Implement GloVe to vectorize text documents using Python.

@venkatkrishnan9442 3 years ago

Nice explanation. But one thing I didn't understand is how the words you showed are similar. As far as I can see, they all have different meanings.

@saramohammadinejad298 3 years ago

Amazing tutorials!

@lanceabhishek6727 2 years ago

Can you make a video on how to deal with class balance in NLP, active learning, and when to use w2v and when to use TF-IDF? P.S. thanks for your content

@pavankumarpotta4565 3 years ago

Krish sir, can you show how to create our own Word2Vec?

@ljtutorials2447 2 years ago

Hello sir, very much impressed by your video. I wanted to know whether we can use a Hindi or Punjabi corpus instead of English. Please reply.

@ashimmaity64 5 years ago

Awesome, all my doubts are clear now. Please make a video on TF-IDF Word2Vec.

@krishnaik06 5 years ago

Check my NLP playlist

@debatradas9268 3 years ago

thank you so much

@SudipPandey 4 years ago

Excellent explanation sir. I have 2 questions: 1) Is word2vec different from word embedding, or is it a form of word embedding? 2) Can we use word2vec in both machine learning and deep learning?

@shashireddy7371 4 years ago

Hi Sudip, word embedding is a general technique for representing a document or word in vector form, like one-hot encoding, dummification, etc. Some embedding techniques are: 1) Bag of Words 2) TF-IDF 3) word2vec (it captures semantic information: details of the surrounding words). I hope this helps you :)

@padmaparameshwaran4986 3 years ago

Did you get an answer for this? I have the same question now.

@cer_oz 1 year ago

Hi Krish, why did you tokenize the text to sentences rather than the words? Is there a special reason for that? that would give almost the same result.

@aminumyau1040 3 years ago

Hello Mr krish. Please help me with video tutorial of fake news detection using machine learning algorithms with word2vec as feature extraction method.

@sumayyaafreen3499 1 year ago

Hi, thanks for making such wonderful videos!!! Small doubt: NLTK doesn't support the Urdu language. Which library can be used for Urdu, then?

@suvarnadp1806 5 years ago

Sir, please make a video on elastic search engine

@arsiblack2404 3 years ago

Please make a case study: sentiment analysis using SVM with word2vec for feature selection

@sowmyabhat1297 3 years ago

Please explain Drain parser algorithm implemented for parsing log files.

@DS_AIML 4 years ago

Good try, Krish. Even though I got the concept of using Word2Vec, it is not connected well with the code. Please create one Python script with the full implementation.

@prasanthdevarapalli 4 years ago

The words "not", "haven't", etc. should be excluded from the stopword list when performing stopword removal. These words are very useful when constructing bigrams, as Word2Vec captures semantic meaning. Correct me if I am wrong.
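
A minimal sketch of the idea in this comment: remove stopwords but keep negations. The stopword set below is a tiny hypothetical stand-in (a real list, such as NLTK's English one, is much longer), and `remove_stopwords` is an illustrative name.

```python
# A tiny illustrative stopword list (hypothetical, not NLTK's).
STOPWORDS = {"he", "is", "a", "the", "not", "no", "haven't"}
NEGATIONS = {"not", "no", "haven't"}


def remove_stopwords(tokens, keep=NEGATIONS):
    """Drop stopwords but keep the words listed in `keep`."""
    effective = STOPWORDS - set(keep)
    return [t for t in tokens if t not in effective]


sent1 = "he is a good boy".split()
sent2 = "he is not a good boy".split()

print(remove_stopwords(sent1))
print(remove_stopwords(sent2))           # "not" survives
print(remove_stopwords(sent2, keep=()))  # plain removal loses the negation
```

With plain removal both sentences reduce to the same tokens, which is the loss of meaning the comment is pointing at.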

@GauravSharma-ui4yd 5 years ago

Plz continue the deployment of ML models series

@kushalchakrabarti240 4 years ago

Why did we not use lemmatization or stemming here? Won't that make the system more smooth?

@nehamanpreet1044 4 years ago

Please make videos on Glove and Bert

@sachin143ful 4 years ago

How about using N-grams with Bag of Words? Example: sent1: he is good boy. sent2: he is not good boy. Using stop words, "not" will be removed.

@reshmachikate5713 2 years ago

Do we not require either of the stemming or lemmatization while converting words into vectors here?

@ranjan4495 5 years ago

Sir, I downloaded the nltk library, but word2vec_sample is not getting downloaded. It says it is outdated. How do I complete the download?

@nithinmamidala 5 years ago

What is semantic information? Do you have any material related to that? Please tell me.

@haziq7885 3 years ago

is there a need to lemmatize or stem before we do word2vec? Thanks!

@thelastone1643 5 years ago

Thank you very much. Can I use word2vec to predict the 10 most frequent words that come before a specific word and the 10 most frequent words that come after it? And how?
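
A note on this question, hedged: a trained word2vec model ranks words by vector similarity and does not store how often one word directly precedes or follows another. Those frequencies can be counted straight from the tokenized corpus, as in this sketch; the toy corpus and the name `neighbour_counts` are assumptions for illustration.

```python
from collections import Counter


def neighbour_counts(sentences, target):
    """Count the words directly before and directly after `target`."""
    before, after = Counter(), Counter()
    for tokens in sentences:
        for i, tok in enumerate(tokens):
            if tok != target:
                continue
            if i > 0:
                before[tokens[i - 1]] += 1
            if i + 1 < len(tokens):
                after[tokens[i + 1]] += 1
    return before, after


corpus = [
    ["i", "like", "deep", "learning"],
    ["we", "like", "deep", "models"],
    ["deep", "learning", "works"],
]
before, after = neighbour_counts(corpus, "deep")
print(before.most_common(10))  # most frequent words just before "deep"
print(after.most_common(10))   # most frequent words just after "deep"
```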

@bharathreddy4806 5 years ago

Sir, please extend this video by explaining the latest ELMo & BERT (including hands-on).

@prashanthkolaneru3178 5 years ago

Can we give word2vec vectors as input to machine learning models?

@mayanksinghal473 4 years ago

Hi Krish, your videos really help us a lot. Could you please make a video on the skip-gram and CBOW models of word2vec?
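
As a rough illustration of the difference between the two models, here is how their training examples can be built from a token list: skip-gram predicts each context word from the center word, while CBOW predicts the center word from its context. This is a simplified sketch (no subsampling or negative sampling), and the function names are hypothetical.

```python
def skipgram_pairs(tokens, window=2):
    """Return (center, context) pairs for every word within `window`."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs


def cbow_examples(tokens, window=2):
    """Return (context_words, center) examples, the CBOW direction."""
    examples = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        context = [tokens[j] for j in range(lo, hi) if j != i]
        examples.append((context, center))
    return examples


tokens = ["king", "rules", "the", "land"]
print(skipgram_pairs(tokens, window=1))
print(cbow_examples(tokens, window=1))
```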

@nikhilsharma6218 4 years ago

How did the vector get 100 dimensions, and what does that indicate? And what is the logic for finding the vocab in the algorithm, and how does the algorithm perform that?

@ArathiK-s8u 10 months ago

Can anyone tell me why whitespace is removed twice in the preprocessing part?

@omernaeem1388 4 years ago

Sir, please also explain how text embedding is done.

@nik7867 4 years ago

Is it basically following a percentile system on the vectors to find similar words?

@starsailor984 4 years ago

what is the floating number beside most_similar? is it cosine similarity?
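
For reference, gensim documents the score returned next to each word by `most_similar` as a cosine similarity. A minimal pure-Python sketch of that measure follows; the toy vectors are made up.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between vectors `a` and `b`."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


print(cosine_similarity([1.0, 0.0], [1.0, 0.0]))   # same direction
print(cosine_similarity([1.0, 0.0], [0.0, 1.0]))   # orthogonal
print(cosine_similarity([1.0, 0.0], [-1.0, 0.0]))  # opposite direction
```

Values near 1 mean the two word vectors point in nearly the same direction; values near 0 mean they are unrelated in the learned space.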

@ravikiran1284 5 years ago

Please do a video on glove

@praveshbisaria5303 4 years ago

Sir upload video for Glove and BERT too.

@akashacharya1046 3 years ago

sir, why didn't you remove punctuations?

@arjunbali2079 4 years ago

thank you sir

@barax9462 3 years ago

I'm tasked with implementing w2v multi-categorical classification from scratch, but I'm confused about what exactly the input to the network is, that is, x1, x2 and x3. Is x1 the 1st word in a document? Or is it the 1st element of a word's embedding vector? For instance, if cat=[0.1, 0.8, 0.7], is x1 then 0.1? I'm really confused about this in general.
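
A hedged sketch for this question: in the standard word2vec network, the input for a word is a one-hot vector over the vocabulary, so x1, x2, x3 are the one-hot entries, and the embedding is the row of the input weight matrix that the one-hot vector selects. (If the task is instead a classifier built on top of already-trained vectors, the embedding elements themselves would be the inputs.) The vocabulary and weights below are made up.

```python
vocab = ["cat", "dog", "fish"]
word_to_index = {w: i for i, w in enumerate(vocab)}

# Toy input-to-hidden weight matrix: one row (embedding) per vocabulary word.
W = [
    [0.1, 0.8, 0.7],  # cat
    [0.4, 0.2, 0.9],  # dog
    [0.5, 0.5, 0.1],  # fish
]


def one_hot(word):
    """Network input: a vector with a single 1 at the word's index."""
    x = [0.0] * len(vocab)
    x[word_to_index[word]] = 1.0
    return x


def hidden_layer(x):
    """Multiply the input row vector `x` by `W`."""
    return [sum(x[i] * W[i][j] for i in range(len(W))) for j in range(len(W[0]))]


x = one_hot("cat")
print(x)                # x1, x2, x3 are the one-hot entries
print(hidden_layer(x))  # equals the "cat" row of W
```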

@rafsunahmad4855 3 years ago

make a video on glove

@renuroy6096 4 years ago

Can you please add the video for Topic modelling and Text Summarization?

@maxwellochieng4924 3 years ago

so there is no stemming or lemmatization in W2V

@utpalbandyopadhyay1633 4 years ago

Sir please provide us with an easy-to-preprocess chatbot dataset....

@Skandawin78 4 years ago

How do I do information extraction to grab sentences for a particular context from multiple websites? Can you point me to the right approach or source?

@NareshKumar-ir3ye 5 years ago

Excellent video. Can we build a text summarizer using word2vec?

@ouryly1541 5 years ago

Hi sir, I am your biggest fan. In this video, did you use a pre-trained word2vec from gensim to get the embedding vectors of the sentences, or did you just train this word2vec on the sentences?

@lamnguyentrong275 4 years ago

Thank you, but if you added some subtitles, it would be easier for us. From VIETNAM

@souravghosh2450 9 months ago

I clicked on the link - "Career Transition Advice and Real Life Data Scientist Journey" but it gives an output as "This channel do not exist". Pls update on this. Thanks

@vaibhavikumari384 3 years ago

Sir, I am actually getting an error while executing:
# Training the Word2Vec model
model = Word2Vec(sentences, min_count=1)
words = model.wv.vocab
(the error is coming in this last line). Please help.

@jaydhanwant4072 4 years ago

People are afraid of AI taking over humanity. Also AI: Vikram also looted satish :D

@prachigopalani5399 4 years ago

Sir, please upload Glove embeddings and BERT Model

@satishm8316 3 years ago

What do you mean by semantic data?

@trexmidnite 3 years ago

Which particular stuff?

@MukeshKumar-dk6mc 4 years ago

You didn't teach me the types of NLP, please make a video about this.

@niteshchotaliya8849 5 years ago

Do you provide classes?

@omfuke3083 5 years ago

Why didn't you use stemming and lemmatization instead of regex?

@SuperShiva619 4 years ago

Krish any plan to cover Latent Semantic Analysis ?

@mohitkaushik3660 3 years ago

Sir, I am facing a problem while installing gensim, please help me with that.

@joyeetamallik5063 5 years ago

can you please explain what is this join for? I wanted to join but not sure what are these and how this works. It would be really great if you can explain. :-) thanks

@gyaan3101 4 years ago

During the preparation of the dataset I did this, sir. Later, while training the word2vec model, the words show up in the output as individual letters. Could you please help me out with this?
corpus=[]
sentences=nltk.sent_tokenize(paragraph)
for i in range(len(sentences)):
    review=re.sub('[^a-zA-Z]',' ',sentences[i])
    review=review.lower()
    review=review.split()
    review=[word for word in review if word not in set(stopwords.words('english'))]
    review=' '.join(review)
    corpus.append(review)
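
A likely cause, offered as an assumption: gensim's `Word2Vec` expects an iterable of sentences where each sentence is a list of tokens. After `' '.join(review)`, each corpus entry is a single string, and iterating over a string yields its characters, hence the one-letter "words". The sketch below keeps the token lists instead; the stopword set, sample text, and regex sentence splitter are simplified stand-ins for the NLTK calls in the comment.

```python
import re

# Illustrative stopword list and text (assumptions; the comment above uses
# NLTK's stopwords and sent_tokenize on the video's paragraph).
STOPWORDS = {"i", "have", "a", "the", "is"}
paragraph = "I have a vision. The vision is India."

# Simple stand-in for nltk.sent_tokenize.
raw_sentences = [s for s in re.split(r"(?<=[.!?])\s+", paragraph) if s]

corpus = []
for sentence in raw_sentences:
    review = re.sub("[^a-zA-Z]", " ", sentence).lower().split()
    review = [word for word in review if word not in STOPWORDS]
    corpus.append(review)  # keep the token list; do not ' '.join(...) it

print(corpus)
print(list("vision"))  # what a joined string would be split into
```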

@pratyushkanojia3650 3 years ago

Using the same command i am unable to import the gensim library

@sougataghosh4101 3 years ago

Hi Krish, if my text data is Vietnamese or Hebrew, which process will be best to convert text data to vector?

@shivanireddy4701 4 years ago

Can you please explain POS tagging? thanks

@BalaguruGupta 4 years ago

can you please do a tutorial on Glove

@datasciencegyan5145 5 years ago

After applying w2v, can we proceed with sentiment analysis using the selected words, as the sentences have a huge number of words?

@GamingEver07 4 years ago

can we use word2vec in sentiment analysis?

@kothapallysharathkumar9743 5 years ago

Could you please make a video on a career in the NLP domain, and say where to start, like a curriculum?

@sadabratakonar4219 4 years ago

How to change the number of dimensions of a word vector in word2vec?
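
For reference, gensim's `Word2Vec` takes the vector dimensionality as a constructor argument, named `vector_size` in gensim 4.x and `size` in earlier releases, with a default of 100. The helper below is a hypothetical convenience that picks the right keyword for a given version string; it runs without gensim installed.

```python
def dimension_kwargs(gensim_version, dims):
    """Return the keyword argument that sets the vector size.

    gensim 4.x names it `vector_size`; earlier releases named it `size`.
    """
    major = int(gensim_version.split(".")[0])
    name = "vector_size" if major >= 4 else "size"
    return {name: dims}


print(dimension_kwargs("4.3.2", 50))
print(dimension_kwargs("3.8.3", 50))

# Assumed usage with gensim installed (not executed here):
#   import gensim
#   from gensim.models import Word2Vec
#   model = Word2Vec(sentences, min_count=1,
#                    **dimension_kwargs(gensim.__version__, 50))
```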

@srijitasaharoy2228 4 years ago

how can i construct sent2vec from facebook word2vec model?

@manavmanoj3870 1 year ago

can anybody explain why the vector has exactly 100 dimensions

@harikrishnam3473 4 years ago

Hi Krish, I tried the same steps but got KeyError: "word 'infosys' not in vocabulary". Could you please guide me?

@salvindsouza7053 4 years ago

That word is not present in the vocab. Check the spelling.
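
Besides spelling, another possible cause (an assumption, since the commenter's code is not shown) is the `min_count` setting: gensim's `Word2Vec` defaults to `min_count=5`, which drops words that occur fewer than five times, whereas the video passes `min_count=1`. This sketch imitates that filter with a plain counter; `surviving_vocab` and the toy sentences are illustrative.

```python
from collections import Counter


def surviving_vocab(sentences, min_count):
    """Words that occur at least `min_count` times across all sentences."""
    counts = Counter(tok for tokens in sentences for tok in tokens)
    return {word for word, n in counts.items() if n >= min_count}


sentences = [["infosys", "hires", "engineers"], ["engineers", "build", "software"]]

print("infosys" in surviving_vocab(sentences, min_count=1))
print("infosys" in surviving_vocab(sentences, min_count=5))
```

Checking membership first, for example `"infosys" in model.wv.key_to_index` on gensim 4.x, avoids the KeyError.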

@paneercheeseparatha 1 year ago

that's nice that he calls "woman" as "human" because most of the people don't consider them as human.

@deenasiva2829 4 years ago

Sir, How to extract the keyword using Word2vec?

@ashok9588 4 years ago

best one

@mandarkulkarni823 4 years ago

Hi @krish naik can i make payment through gpay 299/- to join as a member,to access live videos...gpay id which you have given in previous playlist description.

@krishnaik06 4 years ago

No Mandar... you have to go through the YouTube channel itself... it is handled by YouTube

@mandarkulkarni823 4 years ago

@krishnaik06 Sir, I am facing a problem joining the channel, maybe the bank servers are down. If there is any other mode of payment, please let me know @krish naik