I was trying to understand word2vec for the past two years with many videos. You made it clear today with just this 20 minute stuff. You are simply amazing :)
@ajeetsinghshekhawat26963 жыл бұрын
Guys please use "words = model.wv.key_to_index" in place of "words = model.wv.vocab" in code line 60, as per gensim update. Thanks Krish sir for all the efforts you made for data science community.
@melisavidiera36733 жыл бұрын
You're a savior!!!
@abhishekpratapsingh92832 жыл бұрын
At 3:05 you are saying, in TF - IDF also semantic information is not stored but in video of TF - IDF you said its stores words semantically unlike Bag if words.
@BalaguruGupta4 жыл бұрын
your tutorials are so good, watching only once is enough to understand the concept. thank u sir
@shobhitsrivastava21235 жыл бұрын
sir, please upload practical videos of Glove and Bert
@linusthelab3 жыл бұрын
Thanks!
@AnalyticsMaster5 жыл бұрын
I am looking for some explanation of how the vectors were derived. Most of the other youtube videos that i have seen did not explain this. I was expecting that in your videos, but here also only the python implementation is explained. How the vectors got derived mathematically is missing here too. Would appreciate if you could elaborate on that, since you have a special talent of explaining complex things in a simplified manner.
@Someonner5 жыл бұрын
For that you can go to cs224n Stanford.
@arushkharbanda4 жыл бұрын
I was also looking for the same.
@maulindusarkar45813 жыл бұрын
kzbin.info/www/bejne/i6K1dHijp9tknbM
@karaniitgn09083 жыл бұрын
That requires good knowledge of probability.
@shawkyahmad2 жыл бұрын
Hi, did you find the answer ?
@sandipansarkar92114 жыл бұрын
Superb video once again Krish. All my doubts about word2vec are noe gone.Thanks
@siddharthsingh77172 жыл бұрын
Requesting you to upload more videos on BERT, Transformers, LSTM, GIU etc in NLP Playlist .It would be of great help and Thanks Krish for making such amazing videos..
@medley56703 жыл бұрын
Following ur NLP playlist... I must say u are very good at explaining each and every concepts clearly. Thank you so much for the effort that u have put in creating this amazing playlist. I took a course to learn NLP but your playlist is far better than the course. Thank you sir!
@amaljose1163 жыл бұрын
Love the conceptual videos, Have been searching everywhere.
@mohammedfaisal67145 жыл бұрын
Excellent Explanation Zabardast Bhai 😎
@sahiltrivedi693 жыл бұрын
Thank you very much for this video, super helpful 👍👍👍
@akshayakki89695 жыл бұрын
@Krish Naik Sir please make videos on data structures and algorithms...you are a great teacher 🙏🙏🙏🙏
@sanandapodder50274 жыл бұрын
Great explanation.You made the complex topic very simple sir.Thank you very much. One request please upload all the ppts you shown in this nlp series.
@nareshkatturi90123 жыл бұрын
Thank you krish 🙏
@manjunath.c29445 жыл бұрын
superb job krish kindly video on Bert which will be very helpful
@MAhmadian3 жыл бұрын
Thanks Krish. Why you did't manage out the punctuations from the input text? Are you expecting to get some useful information from them?
@Maths_With_Rahul3 жыл бұрын
Sir, one modification is there in Gensim from 3.8.0 to 4.0.0 version model.wv.vocab has changed to model.wv.key_to_index (model initialized according to your video) Thank You
@thepresistence59353 жыл бұрын
Thanks!
@BharathKumar-vs8fm5 жыл бұрын
Krish, please make a video on Glove model and pickle model
@amruthasankar3453 Жыл бұрын
Thankyou sir❤️🔥
@riteshpatil72303 жыл бұрын
Hello, can we use Word2Vec in same way like Bag of Words and TFIDF for training a classification model ? If yes, how do we do it ? If no, then how exactly can we see whether Word2Vec is overcoming drawbacks of TFIDF or not?
@Xnshau2 жыл бұрын
Geat explanation. How do i evaluate the performance of two or more models trained on the same dataset?
@akash_thing4 жыл бұрын
Hi Krish ! Can you make the video on converting whole data frame words to vectors using Word2Vec as you have not completely explained it.
@keerthivasini4 жыл бұрын
Sir, Please do a similar video about how to Implement GloVe to vectorize text documents using Python.
@venkatkrishnan94423 жыл бұрын
Nice explanation. But one thing I didn't understood is how the words you showed are similar. I can see all are of different meaning only
@saramohammadinejad2983 жыл бұрын
Amazing tutorials!
@lanceabhishek67272 жыл бұрын
can you make a video on how to deal with class balance in nlp, active learning , and when to use w2v and when to use tfidf. ps thanks for your content
@pavankumarpotta45653 жыл бұрын
krish sir, can you show how create our own Word2Vec
@ljtutorials24472 жыл бұрын
Hello sir very much impressed by your video. I wanted to know whether we can have hindi or punjabi corpus too instead of English. Pls reply
@ashimmaity645 жыл бұрын
awesome all my doubt are clear now .please make a video on tfidf word to vec.
@krishnaik065 жыл бұрын
Check my NLP playlist
@debatradas92683 жыл бұрын
thank you so much
@SudipPandey4 жыл бұрын
excellent explanation sir,I have 2 questions..1)Is the word2vec different than word embedding or is it the same form of word embedding 2)can we use word2vec in both machine learning and deep learning.
@shashireddy73714 жыл бұрын
Hi Sudip, Word embadding is general technique to represent document or word in vector form. Like we do one hot encoding , dummification etc.. Some of embedding tecniques are : 1) Bag or word 2) TF-IDF 3) word2Vec (its capture semantic information: word sequence detailes). I hope this might help you :)
@padmaparameshwaran49863 жыл бұрын
Did you get answer for this ? Have the same question now
@cer_oz Жыл бұрын
Hi Krish, why did you tokenize the text to sentences rather than the words? Is there a special reason for that? that would give almost the same result.
@aminumyau10403 жыл бұрын
Hello Mr krish. Please help me with video tutorial of fake news detection using machine learning algorithms with word2vec as feature extraction method.
@sumayyaafreen3499 Жыл бұрын
Hi ..Thanks for making such wonderful videos!!! Small doubt ..NLTK doesn't support Urdu language. Then which library can be used for URDU?
@suvarnadp18065 жыл бұрын
Sir, please make a video on elastic search engine
@arsiblack24043 жыл бұрын
Please, make study case: sentiment analysis svm with feature selection word2vec
@sowmyabhat12973 жыл бұрын
Please explain Drain parser algorithm implemented for parsing log files.
@DS_AIML4 жыл бұрын
Good Try Krish.Even though i got the concept of using WordtoVec. it not connected well with the code.Please create one Python code for full implementation.
@prasanthdevarapalli4 жыл бұрын
The words "not", " haven't" etc should be excluded while performing stopword removal. These words are very useful when constructing bigrams as Word2Vec takes semantic meaning. Correct me if I am wrong.
@GauravSharma-ui4yd5 жыл бұрын
Plz continue the deployment of ML models series
@kushalchakrabarti2404 жыл бұрын
Why did we not use lemmatization or stemming here? Won't that make the system more smooth?
@nehamanpreet10444 жыл бұрын
Please make videos on Glove and Bert
@sachin143ful4 жыл бұрын
How about using N-gram with bag of words? Example: sent1: he is good boy. sent2: he is not good boy using stop words. "not" will be removed..
@reshmachikate57132 жыл бұрын
Do we not require either of the stemming or lemmatization while converting words into vectors here?
@ranjan44955 жыл бұрын
Sir, I downloaded the nltk library, but word2vec_sample is not getting downloaded. It says out dated, how to get it completed?
@nithinmamidala5 жыл бұрын
what is semantic information? do you have any material related to that please tell me.
@haziq78853 жыл бұрын
is there a need to lemmatize or stem before we do word2vec? Thanks!
@thelastone16435 жыл бұрын
Thank you very much. Can use word2vec to predict the most frequent 10 words that come before a specific word and the most frequent 10 words that come after that specific word? and how?
@bharathreddy48065 жыл бұрын
sir please extend this video by explaining the latest ELMO & BERT.(including handson)
@prashanthkolaneru31785 жыл бұрын
Can we give word2vec input to machine learning models
@mayanksinghal4734 жыл бұрын
Hi Krish, Your videos really helpful us a lot. Could you please make a video on skip gram and cbow model of word2vec?
@nikhilsharma62184 жыл бұрын
how does the vector war got 100 dimension and what does it indicate?? and what is logic of finding vocab in algo and how algo. is performing that??
@ArathiK-s8u10 ай бұрын
Can anyone tell me as to why in the pre processing part white space is removed twice?
@omernaeem13884 жыл бұрын
Sir Text embedding bhi bta dy kis trha krty han plz
@nik78674 жыл бұрын
is it following basically percentile system in vector of finding similar words?
@starsailor9844 жыл бұрын
what is the floating number beside most_similar? is it cosine similarity?
@ravikiran12845 жыл бұрын
Please do a video on glove
@praveshbisaria53034 жыл бұрын
Sir upload video for Glove and BERT too.
@akashacharya10463 жыл бұрын
sir, why didn't you remove punctuations?
@arjunbali20794 жыл бұрын
thank you sir
@barax94623 жыл бұрын
I'm tasked to implement w2v multicagorical classification from scratch,,, but I'm too confused on what is the input the network exactly that is the x1, x2 and x3,,, I mean is x1 the 1sr word in a document? Or is it the 1st element in "a word embedded vector“ for instance if" cat=[0.1, 0.8, 0.7] then x1 is 0.1,,, Im really confused about this generally
@rafsunahmad48553 жыл бұрын
make a video on glove
@renuroy60964 жыл бұрын
Can you please add the video for Topic modelling and Text Summarization?
@maxwellochieng49243 жыл бұрын
so there is no stemming or lemmatization in W2V
@utpalbandyopadhyay16334 жыл бұрын
Sir please provide us with an easy-to-preprocess chatbot dataset....
@Skandawin784 жыл бұрын
How to do information extraction to grab sentences for a particular context from multiple websites ?? Can you point me to the right approach or source
@NareshKumar-ir3ye5 жыл бұрын
Excellent vidoe. Can we have text summarizer using word2vec?
@ouryly15415 жыл бұрын
Hi sir, I am your biggest fun. In this video, have you used a pre-trained word2vec from gensim to get the embedding vectors of sentences or you just trained this word2vec with sentences.
@lamnguyentrong2754 жыл бұрын
thank you but if you put some subtitle, it would be easier for us, from VIETNAM
@souravghosh24509 ай бұрын
I clicked on the link - "Career Transition Advice and Real Life Data Scientist Journey" but it gives an output as "This channel do not exist". Pls update on this. Thanks
@vaibhavikumari3843 жыл бұрын
sir actually i m getting error while executing # Training the Word2Vec model model = Word2Vec(sentences, min_count=1) words = model.wv.vocab(error is coming in this line) plz help
@jaydhanwant40724 жыл бұрын
People afaid of AI to take over humanity, Also AI: Vikram also looted satish :D
@prachigopalani53994 жыл бұрын
Sir, please upload Glove embeddings and BERT Model
@satishm83163 жыл бұрын
What do you mean Semantic Data?
@trexmidnite3 жыл бұрын
Which particular stufff?
@MukeshKumar-dk6mc4 жыл бұрын
You didn't thought me types of NLP plz make video about this....
@niteshchotaliya88495 жыл бұрын
Do you provide classes?
@omfuke30835 жыл бұрын
why you didn't use stemming and lemmatization instead of regex
@SuperShiva6194 жыл бұрын
Krish any plan to cover Latent Semantic Analysis ?
@mohitkaushik36603 жыл бұрын
sir i am facing problem while installing genism please help me that
@joyeetamallik50635 жыл бұрын
can you please explain what is this join for? I wanted to join but not sure what are these and how this works. It would be really great if you can explain. :-) thanks
@gyaan31014 жыл бұрын
During the preperation of dataset I did this sir, later while training word2vec model words are showing the output as individual letters.... could you pls help me out with this corpus=[] sentences=nltk.sent_tokenize(paragraph) for i in range(len(sentences)): review=re.sub('[^a-zA-Z]',' ',sentences[i]) review=review.lower() review=review.split() review=[word for word in review if word not in set(stopwords.words('english'))] review=' '.join(review) corpus.append(review)
@pratyushkanojia36503 жыл бұрын
Using the same command i am unable to import the gensim library
@sougataghosh41013 жыл бұрын
Hi Krish, if my text data is Vietnamese or Hebrew, which process will be best to convert text data to vector?
@shivanireddy47014 жыл бұрын
Can you please explain POS tagging? thanks
@BalaguruGupta4 жыл бұрын
can you please do a tutorial on Glove
@datasciencegyan51455 жыл бұрын
After applying w2v, can we proceed for sentiment analysis through the selected words as the sentence is having huge amount of words
@GamingEver074 жыл бұрын
can we use word2vec in sentiment analysis?
@kothapallysharathkumar97435 жыл бұрын
Could you please on Make a video carrer in NLP Domain. And give where to start like curriculum
@sadabratakonar42194 жыл бұрын
how to change number of dimensions of a word in word2vec
@srijitasaharoy22284 жыл бұрын
how can i construct sent2vec from facebook word2vec model?
@manavmanoj3870 Жыл бұрын
can anybody explain why the vector has exactly 100 dimensions
@harikrishnam34734 жыл бұрын
Hi Krish, I tried same steps, KeyError: "word 'infosys' not in vocabulary" could you please guide me.
@salvindsouza70534 жыл бұрын
that word is not present in vocab. check the spelling.
@paneercheeseparatha Жыл бұрын
that's nice that he calls "woman" as "human" because most of the people don't consider them as human.
@deenasiva28294 жыл бұрын
Sir, How to extract the keyword using Word2vec?
@ashok95884 жыл бұрын
best one
@mandarkulkarni8234 жыл бұрын
Hi @krish naik can i make payment through gpay 299/- to join as a member,to access live videos...gpay id which you have given in previous playlist description.
@krishnaik064 жыл бұрын
No Mandar ...u have to go through youtube xhannel itself...it is handled by youtube
@mandarkulkarni8234 жыл бұрын
@@krishnaik06 Sir i am facing problem for joining channel,my be bank servers are down,any other mode of payment plz let me know @krish naik
@shivakrishnareddy58555 жыл бұрын
Sir can "Data Protection Act" can effect the jobs in data science