Comments
@stefleur 1 day ago
Probably a silly question, but how is this whole complicated process better than simply copying and pasting from the URL?
@ppp3812 1 day ago
Are there any scrapers available for LinkedIn and Instagram?
@MeinDeutschkurs 1 day ago
Crawl4ai sounds perfect!
@joepropertykey3612 1 day ago
Keep it up, bro. This is what I deal with every day (parsing PDFs). In an average month I get 200 different files that are updates to cities' code violation or permit records. An update can be one page or 400 pages, and they all have crazy unstructured text in ornate table designs. I looked at 'Markdown' on Friday, just because it would get the text out of a PDF and keep the line text and line spacing so I could parse it a different way. How can I point this at a local file on the PC?
@john_blues 1 day ago
The android in the thumbnail looks like he's DJing. Like he's ready to drop a sick beat...NOW!
@GetzAI 1 day ago
Great review. Please do a review on ScrapeGraphAI. Maybe a comparison to Uncle Code's Crawl4AI? I like Crawl4AI and hope UC incorporates PDF options.
@engineerprompt 1 day ago
thanks, yes, both of them are on my TODO list.
@mjacfardk 1 day ago
Yes PLEASE, do videos on {Crawl4AI and ScrapeGraphAI}, and thank you for everything you do and your time 🙏
@engineerprompt 1 day ago
Yes, it's on my list.
@thesimplicitylifestyle 1 day ago
We must create order from the messiness! 😎🤖
@jimlynch9390 1 day ago
It showed some promise, except it flaked out with an overflow error. On the pages it did seem to convert, it scrambled the data and lost some of it. These pages are primarily transactions in a table with columns separated by whitespace. The pages with plain text worked a bit better.
@TimTruth 1 day ago
I just use Selenium WebDriver and JavaScript or jQuery to interact with pages and get the parts I want. If they use Cloudflare or other bot blocking, you can run JS in the console, use the copy command, and then paste into a txt file.
@d.d.z. 1 day ago
Is there a learning path you can recommend? I'm generating reports from a website using Python and looking for an alternative. Thanks in advance.
@ahassan7270 1 day ago
Thank you so much for sharing this valuable information. It is absolutely helpful.
@jarad4621 1 day ago
For Jina Reader, the API key is free for 1 million tokens, which was 570 sites. After that you pay 10 for 500 million tokens, which is about 250k sites. That is totally insane value, so just pay the tiny amount for much better rate limits.
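As a rough check on the numbers in the comment above, here is a minimal sketch that scales the quoted free-tier usage linearly. It assumes every site costs about the same number of tokens, which real pages will not; the constants come straight from the comment and are the commenter's estimates, not official pricing. On that assumption the paid tier works out to roughly 285,000 sites, a little above the 250k quoted.

```python
# Figures quoted in the comment; treat them as estimates, not official pricing.
FREE_TOKENS = 1_000_000        # tokens on the free API key
SITES_ON_FREE_TIER = 570       # sites the commenter converted with those tokens
PAID_TOKENS = 500_000_000      # tokens in the paid bundle mentioned


def average_tokens_per_site(tokens: int, sites: int) -> float:
    """Average token cost of one site, assuming all sites are similar."""
    return tokens / sites


def sites_for_tokens(tokens: int, ref_tokens: int, ref_sites: int) -> int:
    """Scale the reference usage linearly; integer maths avoids float drift."""
    return tokens * ref_sites // ref_tokens


per_site = average_tokens_per_site(FREE_TOKENS, SITES_ON_FREE_TIER)
paid_sites = sites_for_tokens(PAID_TOKENS, FREE_TOKENS, SITES_ON_FREE_TIER)

print(f"about {per_site:.0f} tokens per site")
print(f"about {paid_sites:,} sites for the paid bundle")
```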
@spectre123 1 day ago
Thanks for this video. Can you make a video on pretraining data that is plain text stored under the "text" key, e.g. {"text": "Text contained in document n°1"}? And how much text do we need for good fine-tuning results? Thanks.
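The format described in this question is commonly stored as JSON Lines, one {"text": ...} object per line. The sketch below shows one way to produce and read it back; the helper names `to_jsonl` and `from_jsonl` are hypothetical, and nothing here is taken from the video's notebook.

```python
import json
from typing import Iterable, List


def to_jsonl(documents: Iterable[str]) -> str:
    """Serialise each document as one JSON object with a single "text" key."""
    lines = [json.dumps({"text": doc}, ensure_ascii=False) for doc in documents]
    return "\n".join(lines) + "\n"


def from_jsonl(payload: str) -> List[str]:
    """Read the documents back, skipping blank lines."""
    return [json.loads(line)["text"] for line in payload.splitlines() if line.strip()]


docs = ["Text contained in document n°1", "Text contained in document n°2"]
jsonl = to_jsonl(docs)
print(jsonl, end="")
```

Writing `jsonl` to a file with UTF-8 encoding gives a dataset that can be loaded line by line.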
@JPy90 1 day ago
great thx!
@planetgamecommunity817 1 day ago
I need these materials very much. Can you share the code and API, brother?
@engineerprompt 1 day ago
link to the notebook is in the video description.
@engineerprompt 1 day ago
If you want to build robust RAG applications based on your own datasets, this is for you: prompt-s-site.thinkific.com/courses/rag
@BoHorror 1 day ago
If I just wanted the model to speak in a certain way, and I have a PDF full of examples, what would I need to do?
@engineerprompt 1 day ago
If it's just the tone, you could potentially use few shot prompting to get it working
@BoHorror 1 day ago
@engineerprompt So, just a simple example: the input would be "Speak like Jolly Roger" and the output would be Jolly Roger speaking?
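The few-shot prompting suggested in this thread can be sketched as follows: a handful of input/output pairs in the target tone are placed ahead of the new input, so the model continues in the same style. The function name, the "Input:"/"Output:" labels, and the pirate examples are all hypothetical; in practice the pairs would be pulled from the PDF of examples.

```python
from typing import List, Tuple


def build_few_shot_prompt(
    instruction: str, examples: List[Tuple[str, str]], query: str
) -> str:
    """Assemble an instruction, styled example pairs, and the new input."""
    lines = [instruction, ""]
    for user_text, styled_reply in examples:
        lines.append(f"Input: {user_text}")
        lines.append(f"Output: {styled_reply}")
        lines.append("")
    # Leave the final output empty so the model completes it in the same tone.
    lines.append(f"Input: {query}")
    lines.append("Output:")
    return "\n".join(lines)


# Hypothetical examples of the desired tone.
EXAMPLES = [
    ("Say hello.", "Ahoy there, matey!"),
    ("Say goodbye.", "Fair winds to ye!"),
]

prompt = build_few_shot_prompt(
    "Reply in the voice of a jolly pirate.", EXAMPLES, "Ask for directions."
)
print(prompt)
```

The resulting string can be sent to any chat or completion model as a single prompt.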
@mohsenghafari7652 1 day ago
Hello. Thank you for your efforts and very good training. Does it work in other languages?
@engineerprompt 1 day ago
According to the repo creator, it should.
@mohsenghafari7652 1 day ago
Hello. Thank you for your efforts and very good training. It is very difficult for us to set up an API and use it. Can you tell me what we should do for free use? Thank you.
@khaledbouzaiene3959 2 days ago
But if my data includes something like dialogue, how can it be structured so that there is one instruction for each response?
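One possible way to handle the dialogue case raised here is to pair each user turn with the assistant turn that follows it. This is only a sketch under assumed conventions: the speaker labels "user" and "assistant", the "instruction" and "response" keys, and the function name are all hypothetical, and multi-turn context is deliberately dropped.

```python
from typing import Dict, List, Tuple


def dialogue_to_pairs(turns: List[Tuple[str, str]]) -> List[Dict[str, str]]:
    """Turn (speaker, text) turns into one instruction per response."""
    pairs = []
    pending = None  # most recent user turn still waiting for a reply
    for speaker, text in turns:
        if speaker == "user":
            pending = text
        elif speaker == "assistant" and pending is not None:
            pairs.append({"instruction": pending, "response": text})
            pending = None
    return pairs


# Hypothetical conversation.
dialogue = [
    ("user", "What is RAG?"),
    ("assistant", "Retrieval-augmented generation."),
    ("user", "Why use it?"),
    ("assistant", "To ground answers in your own documents."),
]

for pair in dialogue_to_pairs(dialogue):
    print(pair)
```

If earlier turns matter for the answer, the instruction could instead be built from the whole conversation up to that point.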
@imdb6942 2 days ago
To get instant feedback you MUST use a WebSocket, not an HTTP POST. Also use streaming playback so that new data coming down the WebSocket is played immediately; then you'll get your 153 ms. I can share the code with you, I just don't know how to do that here.
@engineerprompt 2 days ago
thanks for pointing this out. Would love to look at the code. You can email me: engineerprompt at gmail or reach out on discord :)
@mjaym30 2 days ago
Very interesting video indeed!! Could you please create a video on how to use colBERTv2 for embedding with pg_vector for persistent storage?
@rahulrajeev9763 2 days ago
Really helpful
@adriantang5811 2 days ago
Thank you so much and I can't wait for your next exciting video.
@fightsfortheuser 2 days ago
Awesome analysis, thank you!
@aifortune 3 days ago
I'm all in. Better price than ElevenLabs.
@Cedric_0 3 days ago
I was working on a project where I need to use my local language, but I'm having issues with the Coqui AI TTS library. Any other alternative that is easy to use would be helpful. Thank you.
@engineerprompt 3 days ago
Try meloTTS
@Cedric_0 3 days ago
Thank you, I will try it
@Beetgrape 3 days ago
Is it faster than Deepgram?
@engineerprompt 3 days ago
Yes, on the playground. The Cartesia team recommends streaming. I am going to test that and report.
@hnb13686 3 days ago
This is not completely open source, so don't report it as such with the clarification only coming midway through the video.
@sobeck6900 1 day ago
What do you mean it's not completely open source?
@GAllium14 3 days ago
What software do you use for those super smooth zooms?
@engineerprompt 3 days ago
It's called Screen Studio. It's only for Mac.
@gorripotinikhileswar7087 3 days ago
Hey, can we use this offline?
@engineerprompt 3 days ago
Yes
@cristian_palau 3 days ago
Thank you for sharing these excellent tools!
@MrLogansrun35 3 days ago
Why do they censor these models? AI should remain unbiased and present facts when asked, not give you reasons why it cannot answer a question just because the truth may offend. Facts don't care about feelings. Glad they have overcome censorship.
@Larsbor 3 days ago
OK, as usual the lack of a GUI destroys it for me. 😢
@trusterzero6399 3 days ago
Grow out of that and a world will open up
@Larsbor 3 days ago
I am uncertain about Marker. It is for scientific use, but it says it removes footers, which is where you normally put your sources and appendix links. So?!
@themorethemerrier281 3 days ago
This sounds very interesting, but I will need to learn some Python environment basics before I can put this to the test. A solution like this could help me a lot!
@johntdavies 3 days ago
Thanks for posting Verbi. I wanted to get it to speak more than just English. I couldn't find any Cartesia models that were anything other than English or American, but ElevenLabs has great multilingual support. The following change in text_to_speech() enables ElevenLabs to speak quite a few languages:

elif model == 'elevenlabs':
    ELEVENLABS_VOICE_ID = "Rachel"
    client = ElevenLabs(api_key=api_key)
    audio = client.generate(
        text=text,
        voice=ELEVENLABS_VOICE_ID,
        output_format="mp3_22050_32",
        model="eleven_multilingual_v2"
    )
    elevenlabs.save(audio, output_file_path)
@chjpiu 3 days ago
Hi, do you know how much RAM is required for this application? I tried, but it said that it was out of memory. My laptop has 16 GB RAM w/o Nvidia GPU. Thanks a lot
@drgutman 3 days ago
Meh, I thought it was a better local TTS... oh well.
@user-yi2mo9km2s 3 days ago
Nobody would pay for services while we can do it on our own PC locally.
@user-yi2mo9km2s 3 days ago
No thanks for advertising.
@michalbiros6221 3 days ago
Oh boy, it's three times more expensive than Google's premium voices and only includes English. Skipped.
@pawan3133 3 days ago
Thanks for another great video!! Can you please make a video or at least share the material on fine-tuning a quantized mistral v0.3 model
@engineerprompt 3 days ago
In general, you want to load the model in 4-bit. Look at my finetuning videos using unsloth.
@KCM25NJL 3 days ago
They still have natural cadence issues, which is a hard problem to solve.
@engineerprompt 3 days ago
Yes, I think this is just the alpha version so hopefully will get better over time.
@mohsenghafari7652 3 days ago
thanks
@ScottzPlaylists 3 days ago
I'm interested in open source only... can't finish watching. Thumbs down, sorry.
@Beetgrape 3 days ago
Dude, I wanna deploy this on Hugging Face as an API. Make a tutorial on this.
@engineerprompt 3 days ago
deployment series is coming soon, will give you an idea on how to do this.
@gregsLyrics 3 days ago
Brilliant vid - it is a godsend. OCRing a PDF is just not workable, period. I gave up on attempting parsing PDF. This new information is amazing and I am once again excited.
@engineerprompt 3 days ago
Glad it was helpful!
@MrKarlyboy 3 days ago
If you wanted to plug this into a chatbot, the pricing does not add up. I've done some number crunching, and it won't get you far even with a basic, smallish customer doing, say, 1,000 to 3,000 chats a month, which isn't a lot. Most engines price by audio sequence, every 15 s or 1 min. More good engines are emerging. Even our low-end customers usually run at a concurrency of 3 to 5, and that's about the smallest model. So far we have done hundreds of millions of chats, and hundreds of millions of live chats too, so we are getting into the billions. The market is competitive. Some of the new Google Studio voices are comparable, and Deepgram too. Sure, these are nice voices, but for a streaming API that is cost-competitive, sorry, but no, unless the pricing model radically improves. It's early days, so hopefully there will be new models, new options, and a realization. I suggest you take, say, 5,000, 10,000, 30,000, and 100,000 chats, work out the average transcript size on the bot side, and average out the characters. You will see my point!
@engineerprompt 3 days ago
that's a valid argument. Hopefully they will be able to reduce their price as they scale.
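The back-of-the-envelope exercise suggested in this thread can be sketched as below. Both the average characters per chat and the price per million characters are made-up placeholders, not any vendor's actual rates; substitute your own transcript averages and the published pricing before drawing conclusions.

```python
def monthly_tts_cost(
    chats: int, bot_chars_per_chat: int, price_per_million_chars: float
) -> float:
    """Estimated monthly cost of voicing the bot side of every chat."""
    total_chars = chats * bot_chars_per_chat
    return total_chars * price_per_million_chars / 1_000_000


# Hypothetical inputs: replace with measured averages and real pricing.
AVG_BOT_CHARS_PER_CHAT = 800
PRICE_PER_MILLION_CHARS = 50.0

for volume in (5_000, 10_000, 30_000, 100_000):
    cost = monthly_tts_cost(volume, AVG_BOT_CHARS_PER_CHAT, PRICE_PER_MILLION_CHARS)
    print(f"{volume:>7,} chats/month: {cost:,.2f}")
```

Because the cost is linear in both volume and characters per chat, doubling either one doubles the bill, which is why per-character pricing dominates at chatbot scale.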
@christopherchilton-smith6482 3 days ago
I wonder how far away we are from arbitrarily high accuracy on tasks like this.
@engineerprompt 3 days ago
To be honest, when it comes to voice models, open source models are lagging behind!