Пікірлер
@ripper863
@ripper863 Күн бұрын
hello sir i need your help on this topic i have elastic beanstalk enviroment i want ec2 instance only ipv6 addresses because aws now charge on ipv4 address but i am stuck on this pls help
@SyedWajahatQuadri
@SyedWajahatQuadri 11 күн бұрын
Could you please tell me what web server the web application is running on? Is it Nginx or Apache?
@JonatanBerko
@JonatanBerko 17 күн бұрын
Great video! Do you have or know of any guides on how to do this in cdk? Thanks for the great content!
@blanky_nap
@blanky_nap 19 күн бұрын
Welcome back! :)
@MrStupiuno
@MrStupiuno Ай бұрын
Best video for the topic on the internet. Thank you so much!
@felixalvarez1089
@felixalvarez1089 Ай бұрын
Hi txs for your very clear video, i am worry about the ec2 m5.xlarge price for a demo test , any recommendation? Gracias
@TrevorSullivan
@TrevorSullivan Ай бұрын
Hello Felix, you can spin up m5.xlarge using EC2 Spot Instances and it costs much less. Also, make sure you shutdown or terminate the EC2 instance right away when you're done using it.
@felixalvarez1089
@felixalvarez1089 Ай бұрын
@TrevorSullivan txs for your answer
@nicolamonaca
@nicolamonaca Ай бұрын
Hey Trevor, great tutorial, I've learnt a bunch of new stuff and was able to run my Ollama server on GPU, however I've noticed that when I ask purely textual requests Ollama runs 100% on GPU, while if I ask llama3.2-vision to describe an image, it does it 49%CPU/51%GPU, and as such it's really slow. Do you happen to know how we can force llama3.2-vision to use 100% GPU? Thanks!
@TrevorSullivan
@TrevorSullivan Ай бұрын
Hey there, thanks for your feedback on the video. I don't have enough experience with Ollama to answer that question. Have you tried starting a discussion on the Ollama GitHub repository? Maybe someone else has already noticed this behavior.
@averymathieu
@averymathieu Ай бұрын
so helpful. wanna try this. hopefully ec2 free trial can be used to test this
@CarsonBlack
@CarsonBlack Ай бұрын
Great explanation and well presented! Thanks for leaving in the parts where you struggle to get it to write something executable. It’s important for us to understand the reality of using these tools and how much actual time will be spent tweaking the prompts. Also gives a realistic sense of the reliability of LLM output.
@Sherrythegamer
@Sherrythegamer Ай бұрын
Could you make another video based on the current state of offerings available for prompt flow?
@carlososorio4556
@carlososorio4556 Ай бұрын
Thank you so much Sir!! Your tutorials are great!! Newly, thank you!! Subscribed
@xosnrdev
@xosnrdev Ай бұрын
We can’t local folder be used ??
@themooman
@themooman 2 ай бұрын
What an amazing tutorial. Thank you man!!! Super helpful
@mirabdullah138
@mirabdullah138 2 ай бұрын
I am getting an error when trying to use Claude as a model to test the Knowledge Base saying: "Your request rate is too high. Reduce the frequency of requests." Any idea what this might be due to?
@TrevorSullivan
@TrevorSullivan 2 ай бұрын
Hmmm that's odd. Have you tried any other models?
@mirabdullah138
@mirabdullah138 2 ай бұрын
@@TrevorSullivan It seems it's been an issue as of late. I did try Titan and that did work - still no luck with Anthropic though.
@TrevorSullivan
@TrevorSullivan 2 ай бұрын
@@mirabdullah138 that's weird. I usually use Claude Sonnet. Maybe put in a support ticket and see if they can help? It doesn't make sense that you would get that error on a service that you pay for based on actual usage.
@blanky_nap
@blanky_nap 2 ай бұрын
This kind of tutorial I wish I had back in a day I learned AWS! As alwasys really informative video!
@nietoavi
@nietoavi 2 ай бұрын
Guys at 11:47 When it came time to install nvidia-utils, I had problems with the library and driver version (nvidia-utils-550) [message: Failed to initialize NVML: Driver/library version mismatch NVML library version: 550.107]. The solution was to check the package names and I saw that "nvidia-utils-550-server" existed and that's the one I installed and it worked. I assume that the distributions have been updated and now it has to match the driver that was installed (nvidia-driver-550-server).
@TrevorSullivan
@TrevorSullivan 2 ай бұрын
Thanks for sharing your solution! I think I've run into this before, where the driver and utility versions mismatched.
@d4663r
@d4663r 2 ай бұрын
great tutorial, excellent explanations! thank you sir!
@dmac1653
@dmac1653 2 ай бұрын
Ran into this guy when i was teaching myself rust a while back, was super happy for him to pop up here as well. Content is super high quality.
@TrevorSullivan
@TrevorSullivan 2 ай бұрын
Thanks for sharing your kind comment! BTW I am also a motorcycle rider, but I haven't been riding the last couple years due to health problems. Enjoy your bike! 🙂
@jamesrruff
@jamesrruff 2 ай бұрын
Amazon needs to improve their setup instructions. So much time lost until finding your video. Thank you.
@boilami
@boilami 2 ай бұрын
Amazing Explanations Thanks!
@harryge8419
@harryge8419 2 ай бұрын
Awesome. Very thorough and impressive. Thank you.
@bongimusprime7981
@bongimusprime7981 2 ай бұрын
Great content, thanks! Couple questions: 1. What do you use to track your clipboard history? That was neat 2. Say you generate a flow you really like. How do you productionize it afterwards? Particularly, what is supposed to be the usable artifact of building one of these flows? Is it to build out a library of prompts, and this flow was used just to run the process end to end?
@TrevorSullivan
@TrevorSullivan 2 ай бұрын
Thank you! Here are answers to your questions. 1. I use the built-in Windows 11 clipboard manager. You can use WIN + V to activate it. 2. I'm guessing they intend to release APIs to access your Prompt Flows, although it seems they're completely missing during the preview. 🤷🏻‍♂
@blanky_nap
@blanky_nap 2 ай бұрын
in my rust on aws jorney it turned out that rust sdk indeed requires two hops to fetch credentials, which is not the case for boto3, which worked out of the box in the same container, where i tried to run rust app. now revisiting this video for a deep dive on a topic.
@SumitSingh-wp9lc
@SumitSingh-wp9lc 3 ай бұрын
How can I timely invoke agent after a certain, after every 10 min for that too I have to declare the lambda function .
@amliuyong
@amliuyong 3 ай бұрын
can you share all your install commands and docker compose yaml file?
@muhammadhananasghar3102
@muhammadhananasghar3102 3 ай бұрын
Informative
@sandeepsahu1716
@sandeepsahu1716 3 ай бұрын
Hi sir , please help me.. may i know in aws Console how to check what version if instance metadata service my ec2 is using, i am looking to get information it should show imdsv2.. please let me know where to check in aws Console
@TrevorSullivan
@TrevorSullivan 3 ай бұрын
Hello, you can see this setting in the instance details, around the 15:30 mark in the video.
@ouchlock
@ouchlock 3 ай бұрын
Thanks, very clear. I wish it had an option to build and deploy from local machine.
@farzadmf
@farzadmf 3 ай бұрын
Nice walkthough; question: is it a requirement to use IPV6? Can we use v4 instead?
@TrevorSullivan
@TrevorSullivan 3 ай бұрын
Yes of course you can use IPv4! As I mentioned in the video, I just use IPv6 because my ISP (Starlink) supports it, and I try to encourage its usage broadly. Thanks for your kind comment.
@farzadmf
@farzadmf 3 ай бұрын
Thank you for the response!
@prajwalram4362
@prajwalram4362 3 ай бұрын
Can’t we change the output of the prompts so that we can supply that output to the iterator? Currently it’s not allowing it to pipe with iterator as iterator is expecting array as the input type
@TrevorSullivan
@TrevorSullivan 3 ай бұрын
I haven't been able to figure out how to use the iterator yet. Last I checked, the documentation was lacking. Any examples of how to use it. This feature is still in preview unfortunately.
@prashant2329
@prashant2329 3 ай бұрын
Very great tutorial, was looking for the exact same thing (y)
@prashant2329
@prashant2329 3 ай бұрын
Can't we use the hosted OpenWebUI web interface to download additional or remove existing model instead of exec'ing into the docker container ?
@TrevorSullivan
@TrevorSullivan 3 ай бұрын
Hello Prashant, yes I believe that's possible! I am pretty new to OpenWebUI, and am more familiar with using Docker commands, so that's why I went this route in the video. Great point!
@prashant2329
@prashant2329 3 ай бұрын
@@TrevorSullivan that’s really very useful tutorial, probably you’re the first (and only yet) to make tutorial on this. Thanks a lot! Hosting my own LLM soon, I do have 1 qq though. I was reading through some AWS posts and found some details about Inf1 and Trn1 (Trainium and Inferentia) types AWS recommending, and wanted to know your thoughts on which would be the best option to go with for the instance type Inf1 or the G type ? Would love to hear your thoughts on this 🙂
@AndrewCLatham
@AndrewCLatham 3 ай бұрын
You definitely can. I have been doing that a lot.
@prashant2329
@prashant2329 3 ай бұрын
@@AndrewCLatham thank you 👍🏻
@prashant2329
@prashant2329 3 ай бұрын
Why not use Inf1.xlarge for the EC2 instance type instead?
@GeethDulanjana
@GeethDulanjana 3 ай бұрын
How much cost for this external private CA bundle?
@TrevorSullivan
@TrevorSullivan 3 ай бұрын
The private CA certificate doesn't cost you anything. You just generate it with the open source "step" CLI tool. Is that what you're asking?
@nulops
@nulops 3 ай бұрын
What’s is the cost for acceptable spec.
@jace_rand
@jace_rand Ай бұрын
This is what I want to know, is this practical?
@ManishJindalmanisism
@ManishJindalmanisism 3 ай бұрын
This was really helpful. Very clear. No rushing through. Can you make a video on how to register both on-prem database like Postgres and RDS database to AWS Datazone?
@TrevorSullivan
@TrevorSullivan 3 ай бұрын
Hello Manish, thanks so much for your kind comment, and suggestion for another topic! I love the idea to create a video on how to register a relational database with AWS DataZone. I haven't explored that service very much yet, but it would be a great learning opportunity, as always! 😊
@OlavSodal
@OlavSodal 4 ай бұрын
Spot on. Thanks! 🙂
@TrevorSullivan
@TrevorSullivan 3 ай бұрын
Hello Olav, thanks for checking out our videos! I'm glad this was useful for you. What other topics would you be interested in seeing?
@AndrewLatham-t8k
@AndrewLatham-t8k 4 ай бұрын
Brilliant!
@TrevorSullivan
@TrevorSullivan 3 ай бұрын
Thanks so much, Andrew! I'm glad this was helpful. 🙏🏻
@ShubhamRajpurohit-c2s
@ShubhamRajpurohit-c2s 4 ай бұрын
Don't this have any memory concept suppose i say my name is shubham and in next query i ask it what's my name
@MdTohidulIslam-h6q
@MdTohidulIslam-h6q 4 ай бұрын
Nice! Thanks for your good initiative.
@farzadmf
@farzadmf 4 ай бұрын
I think you could set "OLLAMA_BASE_URLS=ollama:11434" for the WebUI (instead of using the IP)
@babusivaprakasam9846
@babusivaprakasam9846 4 ай бұрын
Your conversation flow is smooth as Prompt flow :). Good job mate. Subscribed
@blanky_nap
@blanky_nap 4 ай бұрын
the previous video for me was like clicky-clicky in the console, but this one makes definitely more sense since due to integration with lambda (as you mentioned) the use cases are limited by imagination. I am not yet that familiar with bedrock api and i ask myself how can i integrate this feature in a real project, since neither me, nor the users of my app will login to AWS to play with the workflow. Would be great if you could also mention this in future videos. Thanks for great content! Keep going!
@blanky_nap
@blanky_nap 4 ай бұрын
Interesting one. I am curious how complicated the flow will be in real life scenarios and if it’s manageable with all those prompt and debug units
@TrevorSullivan
@TrevorSullivan 4 ай бұрын
Yeah, I think basic workflows will be easier. It's nice to be able to "split" your LLM logic into multiple components though, rather than having to orchestrate these kinds of workflows with your own custom logic. I see some potential here, if they invest in improving its capabilities. They will also need to provide a mechanism to export / import workflows, so they can easily be shared. Maybe I missed this option, or they'll add it later?
@hassanbhatti
@hassanbhatti 4 ай бұрын
Appreciate for producing this content.
@lesptitsoiseaux
@lesptitsoiseaux 4 ай бұрын
Great video Trevor! If you ever do a follow-up on how to use a KB with an Agent that'd be great. FYI, I'm working on a recommendation engine that gets data from a Bedrock KB and wanting to have it as a frontpage bot. For newbies, it is a bit confusing how to set this up. A prompt flow allowed me to clean the kb results via a lambda but a prompt flow doesn't give me the ability to subsequently drill down on the resultset: it has no memory. An agent has memory but how do I guide it along with the need to clean up the results from a kb via lambda etc. I'm going in circle.... Anyway, I love your videos, thank you for posting them! 😊
@codeinrust
@codeinrust 4 ай бұрын
Thanks for your kind comment, and the video idea! Have you checked out the new Prompt Flows feature? That might accomplish what you're trying to do easily, combining Knowledge Bases and Agents. It's in preview, and was just announced a month ago. I just learned about it a few minutes ago, when I logged into the Bedrock console!
@codeinrust
@codeinrust 4 ай бұрын
Actually, I now see you commented 11 days ago on the AWS video about Prompt Flows. So, apparently you already know about it. 😉
@codeinrust
@codeinrust 4 ай бұрын
And now I see your specific comment about Prompt Flows .... "doesn't give me the ability to subsequently drill down on the resultset". Hmmmm, I would have to think about how to solve that. I am pretty new to LLMs and Bedrock in particular! If you're ever interested in chatting and brainstorming about the topic, let me know.
@lesptitsoiseaux
@lesptitsoiseaux 4 ай бұрын
@@codeinrust Hi! How to reconcile the concept of a Bedrock Agent (for memory) and the use of prompt flows... I don't think I can! My use case is a presale bot recommendation engine. So far, getting the results via Lambda from a bedrock kb works, but I really hope I can find out how to get the bot to drill down/filter results previously obtained when the user provides more precision. It is a fun adventure that is for sure.
@ubaidmundewadi6368
@ubaidmundewadi6368 4 ай бұрын
Nice explanation
@Whatever0704
@Whatever0704 4 ай бұрын
Great video. Afaik the eventbridge does guarantee event order. I need to create a simple architecture where we have a HR system (not known yet) where I would like to use a event based architecture. But the order of the event must be correct, like employee created -> employee updated -> employee deleted. I then need to have several system receiving these events.
@_deva
@_deva 3 ай бұрын
@Whatever0704 send event to SQS fifo queue that will ensure the order
@namalweerasakara6
@namalweerasakara6 4 ай бұрын
Good clear content. Was really helpful for me. Thank you.
@vivekgsp
@vivekgsp 4 ай бұрын
Thanks for the nice demo. If there is no matched rule, where does the incoming message goes in the event bus? Is it stored somewhere for sometime?
@_deva
@_deva 3 ай бұрын
@vivekgsp no rule is matched then nothing will be store as far as i know, if u need all the event need to be store better to have write two rule with one with no filter condition and send to archive it will available till the standard days. Other the original rule