Creating J.A.R.V.I.S. powered by GROQ and Python

Рет қаралды 143,638

AI FOR DEVS

Ай бұрын

👨‍💻 Learn To Build Real-World AI Solutions ai-for-devs.com

Пікірлер: 147

@danield9368 Ай бұрын

🎉 Thank you! Great job, you inspired me. I’ve subscribed to your channel now.

@222inverter 26 күн бұрын

this is great!...you explain it clearly and it's easy to understand as you go along!...thanks ...subbed!

@mickelodiansurname9578 Ай бұрын

To be honest here I have absolutely no clue why it took YT so long to recommend this channel... the algo is losing its touch if you ask me. I get video about cats and stupid crap. Yet a coding channel, the content I watch several hours of a day while working... not so much! Jesus wept... rant over, subscribed.

@md.hasiburrahmanmonim9755 Күн бұрын

Awesome tutorial! I'm gonna try it... Will give a follow up. In Sha Allah

@dfsed1057 Ай бұрын

Love this - I created this same project last year, but set it aside because of latency issues. I will see how grok changes the game now

@jnevercast Ай бұрын

I find that GPT-3.5 has much better latency than GROQ. In my own voice assistant I use GPT-3.5 for most of the interactions and use OpenAI function calling to defer tasks to other models, for me that's just GPT-4, but it could be GROQ if the workload is large enough that speed is more important than latency

@AIZone1254 3 күн бұрын

your lecture is not for beginners but for a pro level

@huzaifa_a4602 Ай бұрын

Awesome tutorial. Thank you. ♥

@neunmalelf Ай бұрын

Timing and Step by Step explanations = Well done! 👍

@ai-for-devs Ай бұрын

Thanks a lot.

@bosmantiberiu225 Ай бұрын

Oh wow, this tutorial is awesome! I will try it step by step :) Thank you!

@Machiuka Ай бұрын

Awesome tutorial. Thank you for sharing!

@juliovac2113 Ай бұрын

wow man love how you explain. Subbed!

@carloshenrimatos 6 күн бұрын

You gained another subscriber, fantastic video!!!

@ai-for-devs 5 күн бұрын

Thank you so much! I'm glad you enjoyed the video. Welcome to the community!

@starsoldier3643 28 күн бұрын

You sir are a coding machine. Was a pleasure to watch a master such as yourself. Learning a little python myself.

@Blooper1980 Ай бұрын

Cool concept

@sajjaddehghani8735 12 күн бұрын

Great tutorial. easy to understand and fast ,

@ai-for-devs 11 күн бұрын

Thanks! I made sure to speed up the tutorial so you wouldn't fall asleep halfway through! 😄

@matzcontreras Ай бұрын

thanks for sharing dear Sebastian!

@Makisb12 Ай бұрын

It is amazing!

@antaressas3260 7 күн бұрын

The thing that makes me laugh in this video is seeing Adam Savage trying to remove the sheet by pushing it up from the front instead of pulling it from behind.

@JustLennyBenny 6 күн бұрын

That is funny af XD From all people surely he would understand that concept >.>

@ai-for-devs 5 күн бұрын

It happens to the best of us, even Adam Savage. Glad you enjoyed the video!

@alexany4619 4 күн бұрын

Pizza Funghi with mushrooms... 😂

@sebastiankuschmierz Ай бұрын

Klasse Video, sehr clean, auf den Punkt 🙂

@ai-for-devs 24 күн бұрын

Danke Dir

@johnnysim1985 Ай бұрын

Awesome video! Is there a way to adjust settings in Groq to just answer the questions directly instead of adding friendly fluff? I'm following your example and everything works great, but I when returning the translations I get "Sure, here is a translation of promp_here in German:..."

@ai-for-devs Ай бұрын

Glad you liked the video! To get direct answers without extra fluff, try adjusting your prompt to explicitly request a straightforward response: { "role": "system", "content": "Provide a concise, one-sentence answer without unnecessary details." }

@PaulyWollyUTube Ай бұрын

I loved your video and was inspired. I wanted to get the code, so I joined... and you have some interesting courses, but I do not see J.A.R.V.I.S. anywhere on the site courses

@ai-for-devs Ай бұрын

@PaulyWollyUTube You can find it here: www.ai-for-devs.com/products/real-time-ai-mastery-voice-smart-assistants

@mathewlefebvre7335 11 күн бұрын

Awesome channel. Subbed!

@ai-for-devs 11 күн бұрын

Thank you, Mathew! Welcome to the channel!

@-SomeWhereSomehow- 12 күн бұрын

I have built my personal AI Assistant using Neural AI and Chatterbot but this one is Amazing!

@ai-for-devs 11 күн бұрын

Woah, that's cool! Thanks for checking out my AI assistant.

@-SomeWhereSomehow- 11 күн бұрын

@@ai-for-devs 😁 I will be building another one using this

@screamingiraffe Ай бұрын

very nice, now create a self-contained version that doesn't rely on internet resources.

@ai-for-devs Ай бұрын

Absolutely, that's a fantastic suggestion! Implementing a self-contained version using local language models for text-to-speech (T2S) and speech-to-text (S2T), along with integrating a Groq alternative like Mistral or Llama 2 with LM Studio or Ollama, could indeed run on my Mac Studio. However, achieving the same level of quality and performance might require a substantial investment in new GPUs.

@Bigtuna411 26 күн бұрын

@@ai-for-devs Awesome video! WhisHper might be an option to keep things local. Online vendors that offer Privacy* are still hacked too often (see NordVPN). I'd rather run my own security than trust a third party to be meticulous and honest.

@patwaljee 25 күн бұрын

Thank you

@The1neo34 10 күн бұрын

Hey! Where can I get a copy of the code for the web app? I saw the link to the course but just wondering if you provide a github repo for the youtube videos?

@KaradevNetLtd Ай бұрын

can be this set on raspberry pi 4/5 and make with cpp/c#/php or any other then python/java/javascript ??? some language that all will can understand and can program it ?

@adityatiwari3646 26 күн бұрын

Thanks a lot sir finally get to know how basically a web application works in Full Stack , Thankyouu So much💌🌟

@ai-for-devs 26 күн бұрын

... and you also have mastered the AI part. Congrats 🙂

@adityatiwari3646 26 күн бұрын

@@ai-for-devs Thanks to you!!...🙃

@adityatiwari3646 26 күн бұрын

Just a question why you didnt used the pyttsx3 library for generating the reposnses and taking the user input in audio by pyttsx3 library . Using deepgram and making temporary files for the audio input and output , with interpreting can be a heavier task and make the respond slower than using pyttsx3 module for responses and output in audio format , without interpreting with temporary files. That will be more suitable ig , and make the program more faster .

@ai-for-devs 24 күн бұрын

Great approach 🙌

@ntisithoj Ай бұрын

Excellent tutorial.... but am I missing something? You say "We paste the prepared code for the index page", but I do not see where this prepared code exists?

@ai-for-devs Ай бұрын

This is a video from my platform, ai-for-devs.com. Each section includes a Download Section. However, you don’t need to join just to access the code; you can simply send me a PM on Discord at discord.gg/xPBHz9tP, and I’ll provide you with access to the source code.

@Undecentralized 18 күн бұрын

How close are we from having a Jarvis typw AI Assistamt that can be used from Computer to Smart Phone? It would seem like we would have had it by now with all the AI stuff coming out.

@ai-for-devs 17 күн бұрын

Totally

@Cryptowski Ай бұрын

Vielen Dank ;)

@ai-for-devs Ай бұрын

Bitte sehr! Es war mir ein Vergnügen.

@ToolmakerOneNewsletter Ай бұрын

So, can we use the GROQ "Jarvis" interface to access a real LLM like Claude 3 or GPT 4?

@ai-for-devs Ай бұрын

Absolutely, the shown interface can be used with LLMs like Claude 3 or GPT-4. Just exchange the groq call to be a call to GPT or Claude.

@JulienReszka Ай бұрын

Which solution for tts and stt would you recommend ? What do you think about Whisper and WhisperSpeech ? Edit: I just tried Whisper, it works pretty well. Could save some stt money. Didn't figure out how to make WhisperSpeech work yet.

@ai-for-devs Ай бұрын

That's pretty cool! Yes, you're on the right track. For both text-to-speech (TTS) and speech-to-text (STT), there are local alternatives that could replace cloud-based solutions, potentially saving costs in the long run.

@robertheinrich2994 Ай бұрын

it is a 314b model (certainly not a reference to pi) I would have a hard time to run it on my laptop using LMstudio, because of memory restrictions, whereas I can run llama2 70b, mixtral, miqu and other models on my CPU, and offload some parts to the GPU. and my laptop has a 7th gen i7 and a GTX 1070. it's not great, some tasks take up to half an hour. I would love a 70b version of groq.

@actuallyaceit Ай бұрын

In this video he is using Groq's (with a Q) api. He is not running Twitter's recently open sourced model Grok (with a K) on his local machine. I was confused too when I first heard about Groq after only knowing about Grok.

@robertheinrich2994 Ай бұрын

@@actuallyaceit I know, but I would love to see a model by X that is 70b, so I can run it locally

@marcchapleau8343 Күн бұрын

Hayden Pannetiere look alike

@SkyVioletPanda 24 күн бұрын

In your video at 08:38 you mention pasting the prepared code, for the index.html template but I'm not seeing where to get that code? Edit: Nvm. I'll just type it manually.

@ai-for-devs 24 күн бұрын

www.ai-for-devs.com/pl/2148299694

@arnavghosh4242 5 күн бұрын

You should try Langchain or transformers to make it even more powerful

@ai-for-devs 5 күн бұрын

Completely Right 👍 We actually added some more stuff in our tutorials at ai-for-devs.com

@mackdigest Күн бұрын

This is very impressive. It's given me an idea of a way to better implement AI into my companies workflow.

@General_DDay Ай бұрын

Excellent video. Can this be modified to have an avatar speak the answers when asked?

@ai-for-devs Ай бұрын

Maybe with solutions like www.heygen.com/streaming-avatar. Let me check and come back with a new video ;-)

@warchief-e8273 2 күн бұрын

Bro is moving fast

@SteelWolf13 Ай бұрын

Nice. Lost me at using a wav file from a previous video since this was a new recommend channel.

@ai-for-devs Ай бұрын

Sorry to hear that

@SteelWolf13 Ай бұрын

@@ai-for-devs Just means I have to go watch that other video and see your take on it.

@ginkhoba Ай бұрын

Danke. Auch native-speaker ;-)

@ai-for-devs Ай бұрын

Bitte sehr!

@kamalkamals Ай бұрын

the question here is why exactly deepgram instead other solution ?? and why not choose a free package like fast-whisper ?

@ai-for-devs Ай бұрын

Great question! Choosing between Deepgram and solutions like Fast Whisper often comes down to specific needs and preferences. While Deepgram is renowned for its high speed, making it one of the fastest solutions currently available, it's true that free alternatives like Fast Whisper can be very appealing, especially for those on a budget or with less urgent speed requirements. Each option has its strengths, and there's no one-size-fits-all answer.

@kamalkamals Ай бұрын

@@ai-for-devs thank u for this clarification and i can add deepgram not support all languages.

@alitomix 16 күн бұрын

Do I need to install llama on my pc? or it runs a llama image from groq?

@ai-for-devs 11 күн бұрын

You don't need to install LLaMA on your PC; it runs a LLaMA directly on Groq.

@boltonky 6 күн бұрын

The more i think about AI the more i realise how much it will change the way games are played, saying this tho it also causes humans to be more stupid as like auto-correct we rely on them for information. Either way awesome project

@ai-for-devs 5 күн бұрын

Absolutely, I can imagine, that assessing the quality of AI-generated work will likely become a major aspect of human jobs in the future.

@physicsguybrian Ай бұрын

Where is the beautiful woman from the thumbnail? She is the Jarvis we all need!

@ai-for-devs Ай бұрын

Maybe she could indeed be the avatar for J.A.R.V.I.S. in the next video! Who knows, it might just bring a whole new level of charm and interaction to our AI assistant!

@lionlight9514 Ай бұрын

This is really cool, can you buy a groq chip yet?

@ai-for-devs Ай бұрын

Currently, the pricing for Groq's cards is around $20,000 each. Given this price point, direct purchase and deployment of Groq hardware might be a significant investment 😅

@eointolster Ай бұрын

Not sure if you’d find it useful but I’ve made something similar but much slower using anthropic with function calls. I use eleven labs at the moment but after your video seriously looking at groq.

@bradleybrown8428 Ай бұрын

Really? I've just skimmed this video but I know 11labs has much better voices.

@eointolster Ай бұрын

@@bradleybrown8428 elevenlabs has much better voices but slower response

@RealLexable Ай бұрын

Erkennt es also auch automatisch deutsch eingesprochene Sprache?

@ai-for-devs Ай бұрын

Das sollte grundsätzlich möglich sein. Ich würde den prompt noch entsprechend anpassen, dass die Ausgabe auch auf Deutsch erfolgt.

@3-deez Ай бұрын

I have paid for a membership but cannot find where I get the github access and discord as well as the extra video lessons, please help

@ai-for-devs Ай бұрын

Please send a short email to sebastian@ai-for-devs.com to receive your Discord invitation. Additionally, include your GitHub username in the email to secure access to the ai-for-devs GitHub organization. We look forward to your participation.

@3-deez Ай бұрын

@@ai-for-devs Thanks, have done so and look forward to participating

@georgemontgomery1892 13 күн бұрын

Nah, If it doesn't sound like Paul Bettany, I don't want it.

@ai-for-devs 11 күн бұрын

In that case, we'll need to utilize ElevenLabs instead of Deepgram. 😉

@georgemontgomery1892 11 күн бұрын

@@ai-for-devs Do it

@xxmarissxx7690 13 күн бұрын

is it possible to use this code on a raspberry pi with an ALU core without the need for an internet connection??

@ai-for-devs 11 күн бұрын

It's probably possible to generate similar answers, but not with the same inference speed.

@adityatiwari3646 26 күн бұрын

the command "export DG_API_KEY={key}" this is the command for mac , i tried "stex DG_API_KEY {key}" and also tried "ste DG_API_KEY={key}" after both the commands to i got the same "Exception: DeepgramApiError: Invalid credentials. (Status: 401)" Error i tried searching solution but didn't got , how to solve this one?i have made the api key as a member also but still facing this error.......

@ai-for-devs 26 күн бұрын

Please try first to set the key directly in the code. If this works you know that the key is correct.

@adityatiwari3646 26 күн бұрын

@@ai-for-devs Yea i tried doing manually in the code first still getting the same error then i tried doing manually in environment variable by creating a new one and setting up the API Key manually there the issue solved!!....

@cristianscl 20 күн бұрын

you mean you write the api letter by letter insteead of copyng and paste?@@adityatiwari3646

@undeadgaming2102 Ай бұрын

Hello sir i want to ask that it can only chat. I want to make it so that it can access current information, open apps, surf web, cpature screenshot and analyze it and may more how can we do it please help.

@ai-for-devs Ай бұрын

To expand the AI's abilities to include real-time information access, app interactions, web surfing, and image analysis, you can leverage Groq's function calling capability together with other LMs. We have built something similar (with GPT instead of Groq) in the past. Have a look at kzbin.info/www/bejne/jnunla2tfZt1iNE

@undeadgaming2102 Ай бұрын

@@ai-for-devs thanks for the reply and information

@merion297 Ай бұрын

Ah, too bad they don't have a Hungarian Text2Speech model. I hope they'll make it soon. :(

@ai-for-devs Ай бұрын

Oh, that's unfortunate. I know that Fast Whisper supports Hungarian, which might be useful for you. You can find more details here: replicate.com/vaibhavs10/incredibly-fast-whisper. We have used it in kzbin.info/www/bejne/eHPRiHiuqbeXaqM

@merion297 Ай бұрын

@@ai-for-devs Thank you so much! :)

@timothymaggenti717 Ай бұрын

Wow your source code of 28 lines is amazing, I will not be using it, thanks

@Atanepes 29 күн бұрын

Hello I am programmer too I and I am learning Web Development and i got your video after 4 weeks I subsed your channel. and is there already code and how to start?

@ai-for-devs 24 күн бұрын

🙏 On ai-for-devs.com we have an AI Fundamentals Course.

@SkyVioletPanda 24 күн бұрын

PROBLEM: I seem to keep getting the same error: "Exception: DeepgramApiError: Invalid credentials. (Status: 401)". I tried to create a new key and still got the same error. The variable definitely exists in my environment (Had to manually create the variable using "conda env config vars set my_var=value" since I use conda to manage my virtual envs). SOLUTION: I ended up creating a '.env' file and setting the API key in there (DG_API_KEY="keyValue"). For conda users, I installed dotenv using "conda install -c conda-forge python-dotenv" . Seemed to work that way. I decided to just comment this in case anyone runs into the same errors as I did. BTW, 'export' is for mac users, for windows users use 'set'. If your using conda like me, to set a variable in the environment use "conda env config vars set my_var=value" and then restart the environment.

@ai-for-devs 24 күн бұрын

Have you tried to set the key directly in the code?

@cristianscl 20 күн бұрын

hI, IVE GOT THE same problem, generating key is now different from the video because of the permissions, have you solve it? if yes, please tell me. thank you

@superfliping 18 күн бұрын

Whats next, show your skills? 1. CodeCraft Duel: Super Agent Showdown 2. Pixel Pioneers: Super Agent AI Clash 3. Digital Duel: LLM Super Agents Battle 4. Byte Battle Royale: Dueling LLM Agents 5. AI Code Clash: Super Agent Showdown 6. CodeCraft Combat: Super Agent Edition 7. Digital Duel: Super Agent AI Battle 8. Pixel Pioneers: LLM Super Agent Showdown 9. Byte Battle Royale: Super Agent AI Combat 10. AI Code Clash: Dueling Super Agents Edition

@ai-for-devs 17 күн бұрын

I have more terminator vision in mind. Stay tuned.

@darkreader01 Ай бұрын

Clicking a button to record and stop recording does not feel like JARVIS. It would be more real if it was in terminal and the audio is processed in realtime without the need of clicking any button.

@ai-for-devs Ай бұрын

I concur. My initial implementation was designed to listen for any sounds exceeding a predefined threshold, rather than requiring a manual button press. However,I aimed to keep the code as straightforward as possible. I'll be sharing the alternative JS source code on ai-for-devs.com.

@roody_io Ай бұрын

@@ai-for-devsthis could probably be embedded into an alexa echo skill or other assistant platforms without too much trouble i imagine. figure out a good function library or or agent framework.. now were cooking

@LandonShipman-iDzynes Ай бұрын

this is EXACTLY what I have been waiting to do for a year now! @@roody_io - reply back if you figured this out already!

@mrboltik6503 Ай бұрын

@ai-for-devs You can implement wake word library like hey siri by using porcupine by picovoice for example

@Bielzin-te8on 13 күн бұрын

I made a program in python which does it. It records in real time using pyaudio and recorder, then I when silence threshold is achieved it starts recording till the volume gets above threshold, then it converts to text using STT and I receive a answer using GPT-2 model from huggingface. The UI is a 3D brain(mesh to points) using vtk embbed with python code so when the user is speaking the points which make the brain start color changing according to CHUNK audio times a constant to alter the shaders of each point in perspective. I use GPT-2 because my computer is old and only has CPU, I already made it an APK but I haven't uploaded it in Github

@XaljakalX 10 күн бұрын

anyone can find the index html code anywhere?

@XaljakalX 3 күн бұрын

not me been trying to find it but in vain . without it we cannot test anything

@vaibhavsingh8122 Ай бұрын

But look at the size of the cotton. Isn't it cheating ?

@david_6220 Ай бұрын

Aber die Übersetzung ins Deutsche war überhaupt nicht korrekt.

@ai-for-devs Ай бұрын

Die Betonung war nicht ganz sauber, aber "This is a test" => "Das ist ein Test" sollte doch eigentlich passen?

@g.s.3389 Ай бұрын

you do not need deepgram... you can install and use speech_recognition and with a bit of python you do not need any button but you can interact directly with any ollama model... at the same time once you get an answer you can use text2 speech which in instegrate in any linux/mac machine which is "SAY"... might not be amazing but not very different from what you got..... so at the and no money spent in on-line services and with ollama on your PC you have everything on your pc, zero money...

@skaus123 Ай бұрын

yes ,i agree too. beeing independant of any third party online services for tts or stt is more important even if it means a littler more code involved.

@ai-for-devs Ай бұрын

Absolutely. While we've explored various alternatives to Deepgram (see our last videos), its simplicity and speed were key reasons for its inclusion in the tutorial. As you've insightfully noted, the beauty of technology, much like Lego, lies in its modularity and the freedom it offers to interchange components.

@greatjobbuddy Ай бұрын

I think he's just trying to educate on the service and what you can do with it as an example. He's not saying it's the best way or only way to do it. I get what you're saying, but why don't you create a video tutorial and I swear to god I'll watch it because all these tools are good to know in your tool belt and so is doing it as you mentioned. So go make a video so I can do it too!

@nokturnihs 12 күн бұрын

Not very secure... Still decent for a tutorial!

@ai-for-devs 11 күн бұрын

I agree! In a production environment, it's crucial to spend much more time on hardening security.

@squiddymute Ай бұрын

deepgram is not free forever however

@ai-for-devs Ай бұрын

Indeed, Deepgram isn't free indefinitely. It offers robust features for speech-to-text and text-to-speech conversion, making it a valuable tool, but its cost becomes a factor for long-term use.

@squiddymute Ай бұрын

@@ai-for-devs i have the impression your replies are generated automatically by an LLM 🤣

@ai-for-devs Ай бұрын

Sure, a real human would not work on Sunday.