TFIDF : Data Science Concepts

  Рет қаралды 30,355

ritvikmath

ritvikmath

Күн бұрын

Пікірлер: 113
@vinson2233
@vinson2233 3 жыл бұрын
I'm really glad to choose this video instead wasting my time watching 30minutes explanation of tf-idf. Great job for explaining this
@pohkeamtan9876
@pohkeamtan9876 3 жыл бұрын
This is really good. Concise , straight to the point, and there is no need to show a line of code !
@akshikaakalanka
@akshikaakalanka 2 ай бұрын
always the best place to look for a concept explained. Always grateful.
@stanlukash33
@stanlukash33 3 жыл бұрын
I started googling tf-idf and then I was like "Hey, maybe that guy has a video on it", and you do! Thanks!
@ritvikmath
@ritvikmath 3 жыл бұрын
😂 "that guy" says you're welcome
@stanlukash33
@stanlukash33 3 жыл бұрын
@@ritvikmath haha sorry, Ritvik!
@rt58528
@rt58528 4 жыл бұрын
Being a math lover, within a minute of your explanation I became your fan, was always in a search of videos like this
@mango-strawberry
@mango-strawberry 10 ай бұрын
true. his channel hasn't been picked up by KZbin yet.
@hemantsah8567
@hemantsah8567 4 жыл бұрын
Your videos before sleep... Keep nightmares away...
@ritvikmath
@ritvikmath 4 жыл бұрын
aww, thanks !
@summerxia7474
@summerxia7474 2 жыл бұрын
Such a clear explanation!!! Much better than my teacher in the class. Why can't they just make it this simple? Thank you so much.
@ritvikmath
@ritvikmath 2 жыл бұрын
no problem!
@mosca-tse-tse
@mosca-tse-tse 4 жыл бұрын
Excellent teaching! Perfectly designed, clearly explained and not even one sentence that would be redundant. I’m your fan my friend 👍🏼🙏🏼
@ritvikmath
@ritvikmath 4 жыл бұрын
Wow, thank you!
@leod1740
@leod1740 2 жыл бұрын
@@ritvikmath Yes excellent explanation
@gunbac74
@gunbac74 4 жыл бұрын
I read this explanation in a book, but not as clear as this video. Well done!
@ritvikmath
@ritvikmath 4 жыл бұрын
thanks!
@Scar_
@Scar_ Жыл бұрын
Thank you for this! You saved me much time! Your explanation is legit!
@ritvikmath
@ritvikmath Жыл бұрын
Thanks!!
@imdadood5705
@imdadood5705 3 жыл бұрын
I wish to have your coherence when explaining. Awesome explanation as always.
@BhuvaneshSrivastava
@BhuvaneshSrivastava 4 жыл бұрын
I like your videos first and then start watching your Data Science videos because I am sure that after I am done watching it, I will like it anyway. Keep it up.. 🙏
@ritvikmath
@ritvikmath 4 жыл бұрын
Wow, thank you!
@luuz_study_yt7123
@luuz_study_yt7123 2 жыл бұрын
Thank you for the video, we are working at a Movie recommender System and this helps a lot for NLP.
@elsywehbe2897
@elsywehbe2897 Жыл бұрын
Your examples are excellent! Thank you!
@ritvikmath
@ritvikmath Жыл бұрын
You're very welcome!
@michaelhaag3367
@michaelhaag3367 2 жыл бұрын
Lucid explanation, my man back at it again!
@David-nw6rz
@David-nw6rz 3 жыл бұрын
When using the whiteboard, your videos are even better than with pen and paper! Thanks for your videos!
@alexfeng75
@alexfeng75 11 ай бұрын
great video with depth and simplicity at the same time!
@maefiosii
@maefiosii 3 жыл бұрын
that explanation was so smooth and clear.. great job
@ritvikmath
@ritvikmath 3 жыл бұрын
Thank you :)
@martand_05
@martand_05 4 жыл бұрын
What a classy explanation. So good man!
@ritvikmath
@ritvikmath 4 жыл бұрын
Much appreciated!
@hariharvyas4741
@hariharvyas4741 2 ай бұрын
What if the word we are checking does not appear in any of the document, then in the denominator it would be 0 which is not possible
@butterfly34457
@butterfly34457 10 ай бұрын
So simple and concise! Thank you so much!
@mila4real1
@mila4real1 Жыл бұрын
Cool! Loved your simple but extremely efficient explanation
@srijitbhattacharya6770
@srijitbhattacharya6770 2 жыл бұрын
Excellent , simply briliant
@_instanze_
@_instanze_ Жыл бұрын
Youre an excellent explanar man. And I don't mean that lightly (I rarely compliment people wallah). You got a knack. Truly! Subscribed!!
@ritvikmath
@ritvikmath Жыл бұрын
I appreciate that!
@cesarreinoso2203
@cesarreinoso2203 3 жыл бұрын
Awesomeeee Simple and Clear
@balaganesh3440
@balaganesh3440 Жыл бұрын
Outstanding explanation!
@hannahb.9454
@hannahb.9454 8 ай бұрын
This came in clutch, thanks
@VishalKhopkar1296
@VishalKhopkar1296 2 жыл бұрын
if the word 'healthcare' did occur in all 3 speeches, but occurs in the Obama speech 26 times, but only once in Clinton's and Bush's speeches. Using this mechanism, the IDF of healthcare would still be 0, but since the word has been used a considerably large number of times in the Obama speech, it is definitely important
@redpz
@redpz Жыл бұрын
in a more realistic situation the # of D would be much larger so cases like this would be extremely rare
@TheTranscending
@TheTranscending Жыл бұрын
Good point
@hameddadgour
@hameddadgour 2 жыл бұрын
Great presentation!
@amadios9874
@amadios9874 2 жыл бұрын
That was crystal clear, thanks
@ericzhang5987
@ericzhang5987 3 жыл бұрын
Excellent explanation !
@mohamedshatarah7264
@mohamedshatarah7264 2 ай бұрын
Thank you so much for explaining this clearly sir
@nehimomo
@nehimomo 2 жыл бұрын
veyr great explanation, much better than my lecturer
@vaibhavmourya65
@vaibhavmourya65 Жыл бұрын
Great explanation buddy🙌🏻
@MrKqsami
@MrKqsami 2 жыл бұрын
Great Job sir!
@bananalord8575
@bananalord8575 8 ай бұрын
Sweet and simple!
@adrianramirez9729
@adrianramirez9729 2 жыл бұрын
Amazing explanation!
@pallavijog912
@pallavijog912 2 жыл бұрын
Nice explanation. Thanks!
@eramy1
@eramy1 4 жыл бұрын
Good explanation in a simple way... keep doing well man
@ritvikmath
@ritvikmath 4 жыл бұрын
Thanks a ton!
@athena9357
@athena9357 Жыл бұрын
You saved me! My professor explained this in 3 hours, I watched it 2 times and I don't get it. This guy explained the same concept in 7 minutes and I get it!
@Justrelaxx101
@Justrelaxx101 2 жыл бұрын
Perfectly explained
@warislthong3149
@warislthong3149 Жыл бұрын
Excellent !
@dalvirsingh4070
@dalvirsingh4070 3 жыл бұрын
Explanation was awesome!
@devidurga392
@devidurga392 Жыл бұрын
clear cut explanation. Thank you
@Shaan11s
@Shaan11s 11 ай бұрын
YES! I get it now, much love bro
@RedditFam
@RedditFam 2 жыл бұрын
Very useful! Thank you Sir!
@user-or7ji5hv8y
@user-or7ji5hv8y 3 жыл бұрын
Clear and concise.
@ernestanonde3218
@ernestanonde3218 2 жыл бұрын
Powerful...Thank yoiu
@robertodigiacomo3910
@robertodigiacomo3910 2 жыл бұрын
Great explanation
@sebastiancamilopuertogalin4478
@sebastiancamilopuertogalin4478 2 жыл бұрын
For any given word/term, we want to know how important is that term for a given document, relative to the entire corpus of documents. E.g. for Clinton these subset of words is really important in his inauguration speech, relative to the other inaguration speeches. TF-IDF is simply a multiplication of the metrics TF (term frequency) and IDF (inverse document frequency).
@AKapich
@AKapich Жыл бұрын
Very succinct explanation, thank you very much
@ritvikmath
@ritvikmath Жыл бұрын
You are welcome!
@22malman
@22malman 3 жыл бұрын
Superb!!
@fustigate8933
@fustigate8933 2 жыл бұрын
Nice explanation!
@chaitu2037
@chaitu2037 3 жыл бұрын
Very well explained
@cleansquirrel2084
@cleansquirrel2084 4 жыл бұрын
Awesome video!!
@ritvikmath
@ritvikmath 4 жыл бұрын
Thanks!
@k_anu7
@k_anu7 3 жыл бұрын
If anyone dislikes this explanation god will have to come down to explain him/her.
@almonddonut1818
@almonddonut1818 2 жыл бұрын
Thank you so much!!! 🤩
@emna143
@emna143 Жыл бұрын
Hi, first of all, thanks for the great explanation. I have watched your videos about Word2Vec and TF-IDF, and I need help, please. I'm a student working on a project about binary classification of SQL injection attacks. The dataset I have contains two columns: 'sentence' and 'label.' I need to extract features, but I'm confused about which technique to use: Word2Vec or TF-IDF. Can you help me decide?
@mariapazherrera4306
@mariapazherrera4306 3 жыл бұрын
Amazing!!!!!
@mupetman1214
@mupetman1214 2 жыл бұрын
Would you advise to take out stopping words and run tdidf on the new set of documents?
@ralphhennen5769
@ralphhennen5769 4 жыл бұрын
How do you model multiple objects associated to a term class: Dental Care: United Health Care, Blue Shield, ..., by state? This becomes contextual and local within the text - how close is the word dental care in the text to UHC, for instance. The result would show which states address dental care in their health insurance regulations and which insurance companies make it available - both in a positive and negative way. Understand that this is a narrow example. Thanks
@mango-strawberry
@mango-strawberry 10 ай бұрын
damn.. that was a solid explanation
@mitadrubanerjeechowdhury9092
@mitadrubanerjeechowdhury9092 3 жыл бұрын
Amazing stuff, thanks man for letting me pass the exam.
@MYanton1994
@MYanton1994 2 жыл бұрын
thank you very much
@Begooder
@Begooder 3 жыл бұрын
many thanks
@negusuworku1871
@negusuworku1871 11 ай бұрын
iT IS REALLY NICE. KEEP IT UP
@ritvikmath
@ritvikmath 11 ай бұрын
Thanks a lot 😊
@MultiRockxD
@MultiRockxD 2 жыл бұрын
Is this a good tool to create a top of "important" words in a dataset? or it just helps to see the relevance in a particular document, I want to use it so I can maybe sum all the tdidf of all the documents and create a top words but I don't know if this is the best approach/solution to what I want, thank you in advance
@superbatman1462
@superbatman1462 4 жыл бұрын
Nice Explanation
@ritvikmath
@ritvikmath 4 жыл бұрын
Thanks!
@pushkarparanjpe
@pushkarparanjpe 3 жыл бұрын
This is a great explanation. Thanks. I have a question about differences between the implementation described in this video and another implementation commonly found on the web. Can you explain how these two details would impact the final representation: 1) Term frequency simply calculated as term count 2) Applying vector normalisation (L2) to the document vector obtained in this video Another question which is more open-ended: why is TfIdf still relevant ? Or less provocatively - is there a sweet spot where one would prefer TfIdf over the modern dense vector representations (such as word2vec, doc2vec, etc.) ?
@zephyrsurfteam
@zephyrsurfteam 4 жыл бұрын
Great video! Thanks! I would love to see more content on TFIDF.
@ritvikmath
@ritvikmath 4 жыл бұрын
Noted!
@sia-watsonlee
@sia-watsonlee 2 жыл бұрын
amazing
@ai-force3792
@ai-force3792 Жыл бұрын
very Good
@ritvikmath
@ritvikmath Жыл бұрын
Thanks
@Soutehkeshan
@Soutehkeshan 2 жыл бұрын
Useful :)
@ritvikmath
@ritvikmath 2 жыл бұрын
Glad you think so!
@0xjrr
@0xjrr 4 жыл бұрын
love that in this alternative timeline the last speech is from Obama
@skeletonrowdie1768
@skeletonrowdie1768 4 жыл бұрын
a certain president would really bias the vocabulary data
@swagatggautam6630
@swagatggautam6630 Жыл бұрын
I wonder why my teachers couldn't explain so simply.
@sepideh1111
@sepideh1111 Жыл бұрын
Thanks , great teacher if I could I would have given you 3 thumb
@vasundharasingh8216
@vasundharasingh8216 11 ай бұрын
in cases where all the 3 documents contain the word, even if 2 of them contain the word only once and the 3rd doc contains it a 100 times, tf idf would be 0 as idf would be 0. isn't this misleading then?
@xxxxxx-wq2rd
@xxxxxx-wq2rd 4 жыл бұрын
but if healthcare appears 100 times in one document, and only once in each of the other 2 documents, then the result will be zero!
@nicholasdavis9529
@nicholasdavis9529 3 жыл бұрын
This was my question. If you found out let me know.
@nicholasdavis9529
@nicholasdavis9529 3 жыл бұрын
Great video btw, best explanation.
@durasaksham
@durasaksham 4 ай бұрын
It was that easy
@aleynapolat1545
@aleynapolat1545 2 жыл бұрын
Bro, you are a good narrator but a bad organizer. It would be better that the next time you write on the board more regularly in order to make it easier to follow what you sayin
@australianperson2582
@australianperson2582 4 жыл бұрын
Thanks for politicising education with that exclusion with that example, unsubbed - so partisan.
@ritvikmath
@ritvikmath 4 жыл бұрын
Sorry to see you go, it was not my intention to politicize but rather just to use this as an example.
@abdulbasit0123
@abdulbasit0123 8 ай бұрын
That was a great explanation, Thanks 🤍
@jb_kc__
@jb_kc__ 10 ай бұрын
your explanations are great bro cut to the heart of the issue + ensure conceptual understanding 🫡🫡
@ritvikmath
@ritvikmath 10 ай бұрын
Thank you so much 😀
LSTM Networks: Explained Step by Step!
34:48
ritvikmath
Рет қаралды 29 М.
Collaborative Filtering : Data Science Concepts
12:03
ritvikmath
Рет қаралды 59 М.
Quilt Challenge, No Skills, Just Luck#Funnyfamily #Partygames #Funny
00:32
Family Games Media
Рет қаралды 55 МЛН
So Cute 🥰 who is better?
00:15
dednahype
Рет қаралды 19 МЛН
Sigma Kid Mistake #funny #sigma
00:17
CRAZY GREAPA
Рет қаралды 30 МЛН
BM25 : The Most Important Text Metric in Data Science
18:12
ritvikmath
Рет қаралды 12 М.
Word2Vec : Natural Language Processing
13:17
ritvikmath
Рет қаралды 41 М.
Cross Validation : Data Science Concepts
10:12
ritvikmath
Рет қаралды 40 М.
What is TF-IDF for Beginners (Topic Modeling in Python for DH 02.01)
10:40
Python Tutorials for Digital Humanities
Рет қаралды 13 М.
Hidden Markov Model : Data Science Concepts
13:52
ritvikmath
Рет қаралды 137 М.
Random Forests : Data Science Concepts
15:56
ritvikmath
Рет қаралды 49 М.
Learn Machine Learning Like a GENIUS and Not Waste Time
15:03
Infinite Codes
Рет қаралды 447 М.
Quilt Challenge, No Skills, Just Luck#Funnyfamily #Partygames #Funny
00:32
Family Games Media
Рет қаралды 55 МЛН