45 seconds in and you have already asked all of the correct questions. you have my attention, sir.
@kop-lg7lo Жыл бұрын
And my like
@ssathessa Жыл бұрын
16:00 in and you have completely lost my attention, along with my hopes for being to do this as a 1st timer😂
@funkdefied15 ай бұрын
@@kop-lg7loand my comment
@TC-Loom Жыл бұрын
After watching dozens of hours of similar videos this year, this is the best one. Thank you
@carpal4489Ай бұрын
ah yes javascript and co way better huh?
@erikmejiass Жыл бұрын
I don’t usually post comments, but in this case would do an exception to thank you for the easy to understand and to the point explanation. It’s rarely found these days. Hats off to you 💪
@RabbitHoleSyndrome Жыл бұрын
Thanks! Glad it was helpful!
@eyemazed Жыл бұрын
damn i love when i dive head into NLP content, get instantly overwhelmed and then a video like this gets randomly recommended and instantly clears away about 120 questions that i had. thanks man
@RabbitHoleSyndrome Жыл бұрын
Awesome!
@Richardziani Жыл бұрын
Hey, I totally feel you! 😍 It's amazing how stumbling upon the perfect video can make all those overwhelming questions disappear in an instant! 🙌 Thank you so much for sharing your experience, it's truly inspiring! 🌟 Keep diving headfirst into NLP content, my friend! 💪
@Mullheimer Жыл бұрын
It's amazing how Google just reads your mind. At this point it is just getting retarded. Every day exactly the right yt vid comes up. I've been querying large pdfs today via chatgpt...
@justinyoung1762 Жыл бұрын
This is the best ML for JS tutorial I've seen. SUPER helpful that you started with foundational topics. Thanks for making this.
@mortocks Жыл бұрын
First tutorial that actually makes sense and explains the ecosystem without hand waving. Thanks!!!!!!
@RabbitHoleSyndrome Жыл бұрын
You bet! Thanks for watching 😃
@kiiikoooPT3 ай бұрын
that is actually not true, all the basic scripts in this video start with a hand waving instead of an "Hello World"
@mohamedamine_478 ай бұрын
While I didn't originally come to watch the TypeScript implementation, I found myself fully immersed in your explanations. It wasn't until I'd watched the first 30 minutes that I realized the video was over an hour long. Thank you for sharing this!
@fille.imgnry10 ай бұрын
This is sooo good. The way your thinking, explaining, digging deeper, wanting to understand why things are happening the way they do. Thank you!
@MadeInJack5 ай бұрын
Man, you are the reason why KZbin is such an awesome platform. Thank you so much for this golden course!
@comptvlee Жыл бұрын
Insanely helpful content. You've been one of the content leaders carrying the burden of disseminating all these complex topics into my head. Started with the supabase docs all the way here and I'm grateful!!! This is game changing for my career
@RabbitHoleSyndrome Жыл бұрын
I appreciate the kind words. Glad they have been helpful!
@jimg82969 ай бұрын
Wow that was comprehensive and informative. Never realized there was so much involved in selecting models for creating embeddings. Thank you so much!
@johngrant719710 ай бұрын
I love that you rock Typescript in domains usually reserved for Python.
@sheikhakbar20679 ай бұрын
It's almost 3:00 in the morning, I am going to sleep and watch this excellent video tomorrow!
@JoEl-jx7dm8 ай бұрын
Same but different day
@SamirLohiya-sm7ze8 ай бұрын
This is a great video, and it makes me feel like there is no need to refer to any other video. What a great piece of content!
@zeelthumar Жыл бұрын
I have never watched full tutorial that contains javascript bcz I'm a python guy but for the first time I have gone through the whole tutorial and obviously I subscribed...................U are just awesome...............keep it up
@chrissparks8377Ай бұрын
With a heavy CompSci / Math background, I found this to deliver CRAZY VALUE ❤ in terms of knowledged gained. You touched on the data structures, algorithms, Web API's, JavaScript ecosystem trends, and most importantly, the fundamentals of text embeddings and modern trends around their usage and deployment in an approachable, hands-on manner. 🤯 I'm always on KZbin, but literally had to look for where I could comment -- this might be my 3rd ever... Fantastic work 👏 🎉
@RabbitHoleSyndromeАй бұрын
Glad it was helpful 😄
@shabarinathk8954Ай бұрын
27:50 : Bro started counting from 0 , what a legend and a true engineer
@richardyim89147 ай бұрын
This was crazyyy useful. Not a software or web developer, but trying to use embeddings for academic research. Really great walkthrough!
@mahuzedada2 ай бұрын
There's a tool that allows you to manage embedding without code
@ninjagaskin Жыл бұрын
This beats watching Netflix. Thank you for such a well-put-together tutorial/lecture on embeddings 101. I would really value your advice (and this community too) on a question I have for a personal project: I am attempting to build a pdf analyzer for financial reports - 10ks, Annual Reports, Financial Statements, etc... What is the best metric / category you recommend I use for my embedding model and LLM too? the goal is to allow the upload of any pdf, embed the pdf, store in a vector db, and then query some question answering along with memory capabilities. - new subscriber
@RabbitHoleSyndrome Жыл бұрын
Great use case! I would focus on a retrieval based model which is best suited for question-answer style matching. In terms of which one - I think you will just need to try some of them and test to see if your queries are matching with the content you expect. Good luck (thanks for the sub)!
@scign Жыл бұрын
I love how my head comes up with a question and you go down that exact rabbit hole! You just won a new subscriber here.
@RabbitHoleSyndrome Жыл бұрын
Nice! Thanks for the sub!
@lofikiwii Жыл бұрын
My man!!!! Huge! (it's tony), glad to see a new video! And this was one a monster video! Its jammed pack with so much information man. Keep it going my man!
@cesar_ai_eng Жыл бұрын
I've watched just two of your videos and I feel like I've learnt 6 months of ML Engineering. Many many thanks
@RabbitHoleSyndrome Жыл бұрын
Glad to help!
@abagatelle Жыл бұрын
I'm.....overwhelmed! Learn't that it's a very long road. Very pleased I found your excellent channel.
@JohnnysaidWhat Жыл бұрын
I used clip to embed some fields of study and then feed it user input which would "fuzzy map" to my embeddings. Worked so so well and saved me a ton of heartache. Didn't realize this model could do text2image and vice versa. Will have to try that out as well!
@danielbusquets32827 ай бұрын
Very good video. I just implemented a semantic search engine in my app and it works like magic
@M110ification10 ай бұрын
This is the tutorial I've been trying to find. I like how you describe the different components of the system and didn't just start with a walk thru how to setup a dev environment and other mundane pieces.
@pranjalagnihotri6072 Жыл бұрын
Thank you so so much for this banger video few days ago I was working on a side project (inspired by Wesbos talk) to find the similarity between a tweet replies and group them together I first used bag-of-words to generate word vectors and then used OpenAI embeddings only to find out I need to setup a paid account because of that I had to pause my side project and I would say right after this I am going and will complete my projects using huggingface models thank you so much. I explored HF but was not able to gather such details like you explained everything in video❤
@RabbitHoleSyndrome Жыл бұрын
Awesome! Glad it was helpful 😃
@papa4614 Жыл бұрын
Yes very nice to see your still doing vids ❤❤❤ there is so much your teaching us, thanks for that. I hope you still do the part 3 video series on the electron screensaver project 🤩 stay save
@Muennighoff Жыл бұрын
Amazing video! Will work on adding a ranking for the individual MTEB task tabs. Also we're working on adding more tasks (e.g. code embeddings) & other models to the leaderboard! 🤗
@RabbitHoleSyndrome Жыл бұрын
That’s amazing Niklas, thanks for the update!
@cklim78 Жыл бұрын
Great! Look forward to the updates especially on the code embeddings task, may I know any targeted date when the updates will be released?
@superwin065 ай бұрын
Thank you for covering a lot of sub topics without going deep too much! I learned a lot
@eru6ite Жыл бұрын
Dang! I'm not even halfway and I've already learned so much. Amazing content, mate! 🍻
@shuaiber Жыл бұрын
Fantastic content. Loved the contextual information in the beginning which is relevant to understand all the different components in the ecosystem.
@rickgeyer96856 ай бұрын
This is absolutely the best explanation of embeddings I have ever seen. Thanks so much for this excellent video!
@yuri.caetano Жыл бұрын
Bro your video is hell good edited and content is good, congratulations
@bakistas20 Жыл бұрын
I love how detailed your tutorials are! Keep on
@SantamChakraborty11 ай бұрын
Thank you ! That was amazingly lucid and well put together. Keeping an eye out for your future videos.
@AngelEduardoLopezZambrano Жыл бұрын
This video got me into the rabbit hole of your channel! You just got a new subscriber. Excited to see what you'll come up with next. Particularly, I'd like to see how to fine tune text generation models in modest hardware using javascript. Thanks!
@RabbitHoleSyndrome Жыл бұрын
Will keep that in mind - thanks for the sub!
@jervx8292 ай бұрын
Sir, Thank you 🫡. I have difficulty focusing to videos like this and I watched many tutorials before and often got bored & ended up closing it. But your explanation was so good and I enjoy learning from you. Thank you for sharing your knowledge to us God bless you sir! I wished I discovered you earlier when I was still on college last year, my capstone project could've been different.
@jervx8292 ай бұрын
For those who trying on Nextjs 14 app router (server side). Don't use pnpm. Switch back to npm
@rikvermeer1325 Жыл бұрын
Hmmz... finally a good overview on embeddings that is in depth AND understandable I will watch this multiple times
@snsa_kscc Жыл бұрын
Another banger, bro! You are on 🔥. I love the fact ML ecosystem is morphing with TS/JS web dev. Massive thanks, this was a joy to watch. Have a nice one.
@RabbitHoleSyndrome Жыл бұрын
Appreciate it & thanks for watching! Very excited to see the new possibilities in the TS/JS ecosystem.
@stephenk86326 ай бұрын
Amazing video! Thanks. It was uncanny how questions would be popping in my head and you'd answer them in the next sentence lol. Thanks again
@JunYamog11 ай бұрын
Thanks I made it on the end of the rabbit hole. Interesting to learn that you can do embeddings in a browser.
@neoblackcyptron Жыл бұрын
I'm glad I found this channel. It's very rare to come across super knowledgeable people like the owner of this youtube channel in the AI field who really know their stuff. I am planning to start an AI consulting business, cutting costs reducing token costs is a big deal for me, this really helped. By the way I heard AI consultants who know their stuff: ML, Genetic Algorithms. Robotics, Computer Vision and LLMs with some RPA thrown in can charge $200/hour is that a fair rate or am I underselling me skills. I don't want to ruin it for the rest of my peer by quoting too low.
@roguesherlock Жыл бұрын
man this was so helpful. Thank you! Also, I loved all the btws, fyis, fun facts and side quests haha
@stephensinclair276 Жыл бұрын
That was one of the best videos I have seen on this topic. Going to look for more!
@martinp3839 Жыл бұрын
Excellent! Especially the way you build up the episode from explaining core concepts to a layman..glad I stumbled upon your channel..
@RabbitHoleSyndrome Жыл бұрын
Glad it was helpful!
@martinp3839 Жыл бұрын
I am new to javascript and typescript. When I cloned the repo and tried executing "npm run dev" in the "HuggingFace/apps/embeddings-huggingface/src", get this error "TypeError [ERR_UNKNOWN_FILE_EXTENSION]: Unknown file extension ".ts"", Hit a roadblock. Kindly advise how I can fix this.
@RabbitHoleSyndrome Жыл бұрын
Sounds like you may be running the code as JavaScript (TypeScript has a compile step). The easiest way to get off the ground is probably to install ts-node and run your TS files through that.
@ymroddi Жыл бұрын
Just a side note for those using transformer models in production - interacting with the model should be done in a worker per official docs so it is off the main thread. I know your example was very bare bones so this wasn't an issue.
@RabbitHoleSyndrome Жыл бұрын
Great call!
@shinchima11 ай бұрын
just the ticket - cheers for uploading
@browland601 Жыл бұрын
Love the level of detail here! I'm working on an idea to do text classification using sentence embeddings as input features and then doing logistical regression. If the labels (binary classifier for now) correspond well with the embeddings generated from training examples, then my hope is the logistical regression model will work with inputs not seen before but close semantically. One of the considerations is which embedding model to use, so lots of food for thought here!
@theepicosityofpizza Жыл бұрын
This is how they tested classification performance in the benchmark mentioned, so you're on the right track
@omijmangukiya8984 Жыл бұрын
i suggest using setfit
@hasaniqbal233 Жыл бұрын
Really appreciate you doing these videos using TS and not Python.
@borknagarchile10 ай бұрын
Your video is amazing, you pretty much see all the possible the use cases for when you are trying to implement something like this. Thanks 🎉
@Bruh_6887 ай бұрын
Love you man, thanks for coding and making this tutorial in JavaScript/TypeScript.
@jagrat123544 ай бұрын
Every sec of the video was worth watching
@PatrickSteil10 ай бұрын
Wonderful video... helping to teach us how all this stuff is working under the covers... new subscriber!
@crometica Жыл бұрын
It's incredibly valuable, and you've gained a fan ❤. Keep up the fantastic work!
@IgnoreMyChan Жыл бұрын
I love your 2010-like editing with all the micro-cuts making it very exhausting to watch and listen to. 😂
@moy2010 Жыл бұрын
Probably the best content I have watched on the subject. Kudos!
@RabbitHoleSyndrome Жыл бұрын
Happy to hear! Thanks for watching 😃
@grant_vine Жыл бұрын
Best hour and 24 mins spent this week 😊
@khangvutien2538 Жыл бұрын
With this video, you got a new subscriber. Thanks.
@vinitjha_ Жыл бұрын
Great explaination. You simplified every bit of detail👏
@mytechnotalent Жыл бұрын
Great job and very detailed I like the JS internals showing the steps.
@RabbitHoleSyndrome Жыл бұрын
Awesome!
@ThunderboltPath10 ай бұрын
I appreciate so much that you made a video about this as I’m learning. But more than 90 mins was really long and I had to skip over some detailed explanations to get to see the demo 😂 So if anything, I’d hope you did it in 2 parts, one video for just the demo of these new technologies (what they can do for me) and another going into details behind everything. That’s my 2 cents… Loved the content 🎉😊
@luis96xd Жыл бұрын
Fantastic video, everything was well explained! Thanks Wow, it was an excellent solution to get Array length 1 of embeddings forking the model and adding the label Sentence Transformer, you really research a lot before making a video, great job! 👏💯 I learned a lot, of programming of types, of model parameters and outputs
@RabbitHoleSyndrome Жыл бұрын
Glad it helped!
@TheTaygan6 ай бұрын
Great video with awesome explanations... well done
@jakubwerner118 ай бұрын
So well explained, thank you!
@aksh1618 Жыл бұрын
Another great one! I'm really hooked onto the high quality content you're producing! On a related note, which one of these open source embedding models would you consider using for something like ClippyGPT, and are actually you planning to switch to one of these?
@RabbitHoleSyndrome Жыл бұрын
Thanks for watching! Great question - we will definitely consider it. Will have to do a deeper dive, but ideally a model that performs well with retrieval. Also need to consider practicalities around deployment and operation, so usually smaller model = better.
@danjsy Жыл бұрын
Great video thank you an excellent channel. For corporate use cases where they are predominantly Microsoft houses, for retrieval use cases can you see past the new OpenAI implementations? Thanks !
@Swanidhi Жыл бұрын
Love the rabbit hole content! Subscribed!
@RabbitHoleSyndrome Жыл бұрын
Thanks for the sub!
@gnomewc22 Жыл бұрын
Man ur just a natural teacher, thank you!
@monterourena Жыл бұрын
Amazing job! Please continue doing these videos 🙌🏻
@eliaweiss1 Жыл бұрын
wow!!! amazing stuff, thanks for all this info!
@handlez41110 ай бұрын
Awesome video! Thank you!
@SmallTownDev Жыл бұрын
By far the most informative resource 🚀
@Franx570 Жыл бұрын
I'm only 30 minutes in, but Jesus this video is gold 👏👏
@shtrumj7 ай бұрын
please more content , more depth, loved it, subscribed.
@DavitBarbakadze8 ай бұрын
Man I wish you'd create those videos more often. Maybe do a Patreon acoount. I'd gladly pay.
@N4LNba7777 ай бұрын
Amazing video, thanks for saving me a ton of time!
@RishuKumar-zo2hc Жыл бұрын
Very informative Please release more videos Love your content
@sirishkumar-m5z2 ай бұрын
An excellent comparison of embedding tools! Another free option worth exploring is Hugging Face's Transformers, which offers robust embedding qualifications. #AIEmbeddings #HuggingFace
@lakinmohapatra Жыл бұрын
Very nice explanation. Thanks a lot
@jdkh89 Жыл бұрын
Wow I learned so much from this. Thank you so much!
@RabbitHoleSyndrome Жыл бұрын
Glad it was helpful!
@hamzakyamanywa9792Ай бұрын
loved watching this!
@gunerdown Жыл бұрын
You got my sub, great content!
@sarojregmi20010 ай бұрын
I think they saw your video and added the sentence transformer label in there.
@mohegyux4072 Жыл бұрын
lol I didn't know the name of the channel until minute 40, subscribed
@RabbitHoleSyndrome Жыл бұрын
Thanks for the sub! 😄
@lucamiamiflorida5065 Жыл бұрын
best video on the topic, thanks!
@RabbitHoleSyndrome Жыл бұрын
Glad it was helpful!
@manishbabbar9969 Жыл бұрын
Really liked your content. Thanks for sharing such engaging information.
@RabbitHoleSyndrome Жыл бұрын
Thanks, glad it helped!
@KelvinWKiger Жыл бұрын
Olala! Thank you, thank you... and thank you very much. Take care 🍀
@justine_chang3910 ай бұрын
this video is amazingly good
@LWC Жыл бұрын
Really great video. Thank you for it!
@RabbitHoleSyndrome Жыл бұрын
You bet!
@artem-yw8km Жыл бұрын
cool realy improved inference results
@adrienazie Жыл бұрын
Thanks for the time and effort you have put into creating this video. Great content and explanation.
@aliandiazperez7602 Жыл бұрын
Excellent video tutorial and teaching skills!!!!!
@RabbitHoleSyndrome Жыл бұрын
Thank you! Cheers!
@ahmedali-hy9ew Жыл бұрын
Keep it up with the great content, really love the way you explain stuff, we want more ai tutorials please.
@raphauy9 ай бұрын
Thank you very much!
@OG-ok9zh Жыл бұрын
Did you have a chance to compare the performance regarding total request time vs execution time? Intuition tells me this could be MUCH faster than making a network call, server-side. Did you get the chance to verify this however?
@RabbitHoleSyndrome Жыл бұрын
Great question - no I haven’t tested this yet. Agreed on the intuition though - my gut is that network latency will almost always be greater than inference time. Also worth pointing out you have more control on availability (don’t have to worry about degraded third party API performance which I have experienced in the past)
@OG-ok9zh Жыл бұрын
@@RabbitHoleSyndrome I'll check it out and get back to you today then! Let's see!
@eyemazed Жыл бұрын
just a few questions though - are text embedding models language agnostic? and by language i mean the language text is written in, and if not, are there any benchmarks showcasing the performance? the second question is the token limit, is there a token limit for the embedding generation, in other words how big of a text can you provide to the embedding API?
@artistry7919 Жыл бұрын
19:22 answers the second question
@oscarljimenez5717 Жыл бұрын
You meant programming languages? If yes, well ML and AI are only math, so yes, i actually make my tesis on AI using Java at the time. You just need to understand the math and convert it to code.
@RabbitHoleSyndrome Жыл бұрын
W.r.t. languages - it will all depend on what languages the models were trained on. On the MTEB leaderboard you can see which languages were used (mostly English). You may have the best luck with the Bitext Mining models or models titled/tagged ‘multilingual’. Overall probably best to look into each model and their description on a case by case basis to understand which may be best for your use case (for now).
@elierh4429 ай бұрын
Love your content, keep it up! 👌
@orenmizr24 күн бұрын
that was very interesting. thanks🙏
@RabbitHoleSyndrome19 күн бұрын
Thanks for watching 😄
@daryladhityahenry Жыл бұрын
Hi! This is crazy good content. I learn so much! Thank you so much.. And have a question though, I'm on minute 39:00 when talking about embedding dimension size. Is the lower dimension size more likely to not have word vectored? I mean, maybe word like "eggyolk" ( maybe ) doesn't registered because the embedding id already full so it can't get the vector value from that word? If yes, that is really a trade off that we need to choose carefully right? Or... If we just use normal vocabulary, it should be okay with small dimension? Thankss!!!!!!!!