hello sir i need your help on this topic i have elastic beanstalk enviroment i want ec2 instance only ipv6 addresses because aws now charge on ipv4 address but i am stuck on this pls help
@SyedWajahatQuadri11 күн бұрын
Could you please tell me what web server the web application is running on? Is it Nginx or Apache?
@JonatanBerko17 күн бұрын
Great video! Do you have or know of any guides on how to do this in cdk? Thanks for the great content!
@blanky_nap19 күн бұрын
Welcome back! :)
@MrStupiunoАй бұрын
Best video for the topic on the internet. Thank you so much!
@felixalvarez1089Ай бұрын
Hi txs for your very clear video, i am worry about the ec2 m5.xlarge price for a demo test , any recommendation? Gracias
@TrevorSullivanАй бұрын
Hello Felix, you can spin up m5.xlarge using EC2 Spot Instances and it costs much less. Also, make sure you shutdown or terminate the EC2 instance right away when you're done using it.
@felixalvarez1089Ай бұрын
@TrevorSullivan txs for your answer
@nicolamonacaАй бұрын
Hey Trevor, great tutorial, I've learnt a bunch of new stuff and was able to run my Ollama server on GPU, however I've noticed that when I ask purely textual requests Ollama runs 100% on GPU, while if I ask llama3.2-vision to describe an image, it does it 49%CPU/51%GPU, and as such it's really slow. Do you happen to know how we can force llama3.2-vision to use 100% GPU? Thanks!
@TrevorSullivanАй бұрын
Hey there, thanks for your feedback on the video. I don't have enough experience with Ollama to answer that question. Have you tried starting a discussion on the Ollama GitHub repository? Maybe someone else has already noticed this behavior.
@averymathieuАй бұрын
so helpful. wanna try this. hopefully ec2 free trial can be used to test this
@CarsonBlackАй бұрын
Great explanation and well presented! Thanks for leaving in the parts where you struggle to get it to write something executable. It’s important for us to understand the reality of using these tools and how much actual time will be spent tweaking the prompts. Also gives a realistic sense of the reliability of LLM output.
@SherrythegamerАй бұрын
Could you make another video based on the current state of offerings available for prompt flow?
@carlososorio4556Ай бұрын
Thank you so much Sir!! Your tutorials are great!! Newly, thank you!! Subscribed
@xosnrdevАй бұрын
We can’t local folder be used ??
@themooman2 ай бұрын
What an amazing tutorial. Thank you man!!! Super helpful
@mirabdullah1382 ай бұрын
I am getting an error when trying to use Claude as a model to test the Knowledge Base saying: "Your request rate is too high. Reduce the frequency of requests." Any idea what this might be due to?
@TrevorSullivan2 ай бұрын
Hmmm that's odd. Have you tried any other models?
@mirabdullah1382 ай бұрын
@@TrevorSullivan It seems it's been an issue as of late. I did try Titan and that did work - still no luck with Anthropic though.
@TrevorSullivan2 ай бұрын
@@mirabdullah138 that's weird. I usually use Claude Sonnet. Maybe put in a support ticket and see if they can help? It doesn't make sense that you would get that error on a service that you pay for based on actual usage.
@blanky_nap2 ай бұрын
This kind of tutorial I wish I had back in a day I learned AWS! As alwasys really informative video!
@nietoavi2 ай бұрын
Guys at 11:47 When it came time to install nvidia-utils, I had problems with the library and driver version (nvidia-utils-550) [message: Failed to initialize NVML: Driver/library version mismatch NVML library version: 550.107]. The solution was to check the package names and I saw that "nvidia-utils-550-server" existed and that's the one I installed and it worked. I assume that the distributions have been updated and now it has to match the driver that was installed (nvidia-driver-550-server).
@TrevorSullivan2 ай бұрын
Thanks for sharing your solution! I think I've run into this before, where the driver and utility versions mismatched.
@d4663r2 ай бұрын
great tutorial, excellent explanations! thank you sir!
@dmac16532 ай бұрын
Ran into this guy when i was teaching myself rust a while back, was super happy for him to pop up here as well. Content is super high quality.
@TrevorSullivan2 ай бұрын
Thanks for sharing your kind comment! BTW I am also a motorcycle rider, but I haven't been riding the last couple years due to health problems. Enjoy your bike! 🙂
@jamesrruff2 ай бұрын
Amazon needs to improve their setup instructions. So much time lost until finding your video. Thank you.
@boilami2 ай бұрын
Amazing Explanations Thanks!
@harryge84192 ай бұрын
Awesome. Very thorough and impressive. Thank you.
@bongimusprime79812 ай бұрын
Great content, thanks! Couple questions: 1. What do you use to track your clipboard history? That was neat 2. Say you generate a flow you really like. How do you productionize it afterwards? Particularly, what is supposed to be the usable artifact of building one of these flows? Is it to build out a library of prompts, and this flow was used just to run the process end to end?
@TrevorSullivan2 ай бұрын
Thank you! Here are answers to your questions. 1. I use the built-in Windows 11 clipboard manager. You can use WIN + V to activate it. 2. I'm guessing they intend to release APIs to access your Prompt Flows, although it seems they're completely missing during the preview. 🤷🏻♂
@blanky_nap2 ай бұрын
in my rust on aws jorney it turned out that rust sdk indeed requires two hops to fetch credentials, which is not the case for boto3, which worked out of the box in the same container, where i tried to run rust app. now revisiting this video for a deep dive on a topic.
@SumitSingh-wp9lc3 ай бұрын
How can I timely invoke agent after a certain, after every 10 min for that too I have to declare the lambda function .
@amliuyong3 ай бұрын
can you share all your install commands and docker compose yaml file?
@muhammadhananasghar31023 ай бұрын
Informative
@sandeepsahu17163 ай бұрын
Hi sir , please help me.. may i know in aws Console how to check what version if instance metadata service my ec2 is using, i am looking to get information it should show imdsv2.. please let me know where to check in aws Console
@TrevorSullivan3 ай бұрын
Hello, you can see this setting in the instance details, around the 15:30 mark in the video.
@ouchlock3 ай бұрын
Thanks, very clear. I wish it had an option to build and deploy from local machine.
@farzadmf3 ай бұрын
Nice walkthough; question: is it a requirement to use IPV6? Can we use v4 instead?
@TrevorSullivan3 ай бұрын
Yes of course you can use IPv4! As I mentioned in the video, I just use IPv6 because my ISP (Starlink) supports it, and I try to encourage its usage broadly. Thanks for your kind comment.
@farzadmf3 ай бұрын
Thank you for the response!
@prajwalram43623 ай бұрын
Can’t we change the output of the prompts so that we can supply that output to the iterator? Currently it’s not allowing it to pipe with iterator as iterator is expecting array as the input type
@TrevorSullivan3 ай бұрын
I haven't been able to figure out how to use the iterator yet. Last I checked, the documentation was lacking. Any examples of how to use it. This feature is still in preview unfortunately.
@prashant23293 ай бұрын
Very great tutorial, was looking for the exact same thing (y)
@prashant23293 ай бұрын
Can't we use the hosted OpenWebUI web interface to download additional or remove existing model instead of exec'ing into the docker container ?
@TrevorSullivan3 ай бұрын
Hello Prashant, yes I believe that's possible! I am pretty new to OpenWebUI, and am more familiar with using Docker commands, so that's why I went this route in the video. Great point!
@prashant23293 ай бұрын
@@TrevorSullivan that’s really very useful tutorial, probably you’re the first (and only yet) to make tutorial on this. Thanks a lot! Hosting my own LLM soon, I do have 1 qq though. I was reading through some AWS posts and found some details about Inf1 and Trn1 (Trainium and Inferentia) types AWS recommending, and wanted to know your thoughts on which would be the best option to go with for the instance type Inf1 or the G type ? Would love to hear your thoughts on this 🙂
@AndrewCLatham3 ай бұрын
You definitely can. I have been doing that a lot.
@prashant23293 ай бұрын
@@AndrewCLatham thank you 👍🏻
@prashant23293 ай бұрын
Why not use Inf1.xlarge for the EC2 instance type instead?
@GeethDulanjana3 ай бұрын
How much cost for this external private CA bundle?
@TrevorSullivan3 ай бұрын
The private CA certificate doesn't cost you anything. You just generate it with the open source "step" CLI tool. Is that what you're asking?
@nulops3 ай бұрын
What’s is the cost for acceptable spec.
@jace_randАй бұрын
This is what I want to know, is this practical?
@ManishJindalmanisism3 ай бұрын
This was really helpful. Very clear. No rushing through. Can you make a video on how to register both on-prem database like Postgres and RDS database to AWS Datazone?
@TrevorSullivan3 ай бұрын
Hello Manish, thanks so much for your kind comment, and suggestion for another topic! I love the idea to create a video on how to register a relational database with AWS DataZone. I haven't explored that service very much yet, but it would be a great learning opportunity, as always! 😊
@OlavSodal4 ай бұрын
Spot on. Thanks! 🙂
@TrevorSullivan3 ай бұрын
Hello Olav, thanks for checking out our videos! I'm glad this was useful for you. What other topics would you be interested in seeing?
@AndrewLatham-t8k4 ай бұрын
Brilliant!
@TrevorSullivan3 ай бұрын
Thanks so much, Andrew! I'm glad this was helpful. 🙏🏻
@ShubhamRajpurohit-c2s4 ай бұрын
Don't this have any memory concept suppose i say my name is shubham and in next query i ask it what's my name
@MdTohidulIslam-h6q4 ай бұрын
Nice! Thanks for your good initiative.
@farzadmf4 ай бұрын
I think you could set "OLLAMA_BASE_URLS=ollama:11434" for the WebUI (instead of using the IP)
@babusivaprakasam98464 ай бұрын
Your conversation flow is smooth as Prompt flow :). Good job mate. Subscribed
@blanky_nap4 ай бұрын
the previous video for me was like clicky-clicky in the console, but this one makes definitely more sense since due to integration with lambda (as you mentioned) the use cases are limited by imagination. I am not yet that familiar with bedrock api and i ask myself how can i integrate this feature in a real project, since neither me, nor the users of my app will login to AWS to play with the workflow. Would be great if you could also mention this in future videos. Thanks for great content! Keep going!
@blanky_nap4 ай бұрын
Interesting one. I am curious how complicated the flow will be in real life scenarios and if it’s manageable with all those prompt and debug units
@TrevorSullivan4 ай бұрын
Yeah, I think basic workflows will be easier. It's nice to be able to "split" your LLM logic into multiple components though, rather than having to orchestrate these kinds of workflows with your own custom logic. I see some potential here, if they invest in improving its capabilities. They will also need to provide a mechanism to export / import workflows, so they can easily be shared. Maybe I missed this option, or they'll add it later?
@hassanbhatti4 ай бұрын
Appreciate for producing this content.
@lesptitsoiseaux4 ай бұрын
Great video Trevor! If you ever do a follow-up on how to use a KB with an Agent that'd be great. FYI, I'm working on a recommendation engine that gets data from a Bedrock KB and wanting to have it as a frontpage bot. For newbies, it is a bit confusing how to set this up. A prompt flow allowed me to clean the kb results via a lambda but a prompt flow doesn't give me the ability to subsequently drill down on the resultset: it has no memory. An agent has memory but how do I guide it along with the need to clean up the results from a kb via lambda etc. I'm going in circle.... Anyway, I love your videos, thank you for posting them! 😊
@codeinrust4 ай бұрын
Thanks for your kind comment, and the video idea! Have you checked out the new Prompt Flows feature? That might accomplish what you're trying to do easily, combining Knowledge Bases and Agents. It's in preview, and was just announced a month ago. I just learned about it a few minutes ago, when I logged into the Bedrock console!
@codeinrust4 ай бұрын
Actually, I now see you commented 11 days ago on the AWS video about Prompt Flows. So, apparently you already know about it. 😉
@codeinrust4 ай бұрын
And now I see your specific comment about Prompt Flows .... "doesn't give me the ability to subsequently drill down on the resultset". Hmmmm, I would have to think about how to solve that. I am pretty new to LLMs and Bedrock in particular! If you're ever interested in chatting and brainstorming about the topic, let me know.
@lesptitsoiseaux4 ай бұрын
@@codeinrust Hi! How to reconcile the concept of a Bedrock Agent (for memory) and the use of prompt flows... I don't think I can! My use case is a presale bot recommendation engine. So far, getting the results via Lambda from a bedrock kb works, but I really hope I can find out how to get the bot to drill down/filter results previously obtained when the user provides more precision. It is a fun adventure that is for sure.
@ubaidmundewadi63684 ай бұрын
Nice explanation
@Whatever07044 ай бұрын
Great video. Afaik the eventbridge does guarantee event order. I need to create a simple architecture where we have a HR system (not known yet) where I would like to use a event based architecture. But the order of the event must be correct, like employee created -> employee updated -> employee deleted. I then need to have several system receiving these events.
@_deva3 ай бұрын
@Whatever0704 send event to SQS fifo queue that will ensure the order
@namalweerasakara64 ай бұрын
Good clear content. Was really helpful for me. Thank you.
@vivekgsp4 ай бұрын
Thanks for the nice demo. If there is no matched rule, where does the incoming message goes in the event bus? Is it stored somewhere for sometime?
@_deva3 ай бұрын
@vivekgsp no rule is matched then nothing will be store as far as i know, if u need all the event need to be store better to have write two rule with one with no filter condition and send to archive it will available till the standard days. Other the original rule