Hey everyone, let's explore Azure OpenAI and all the options and resiliency considerations! Please make sure to read the description for the chapters and key information about this video and others.
⚠ PLEASE NOTE ⚠
🔎 If you are looking for content on a particular topic search the channel. If I have something it will be there!
🕰 I don't discuss future content nor take requests for future content so please don't ask 😇
🤔 Due to the channel growth and number of people wanting help I no longer can answer or even read questions and they will just stay in the moderation queue never to be seen so please post questions to other sites like Reddit, Microsoft Community Hub etc.
👂 Translate the captions to your native language via the auto-translate feature in settings! kzbin.info/www/bejne/rGbFZmZjhcx4o6s for a demo of using this feature.
Thanks for watching! 🤙
@Dikimkd (1 month ago)
I am LOVING these AI content videos!
@NTFAQGuy (1 month ago)
Glad you're finding them helpful!
@RistoRatilainen (1 month ago)
This was a very informative “summary” of the design options for the service in question. 🤝
@MrAmgadHasan (1 month ago)
Thanks so much for covering this. I especially loved the fact that the Azure OpenAI resource is regional even if the deployment is global. Just one thing about prompt caching: we cache the output of the tokenization step and of the prompt processing step as well (calculating the KV cache).
@NTFAQGuy (1 month ago)
Good info on the prompt caching. Thanks!
@DavidManouchehri (1 month ago)
Are you seeing prompt caching actually working in production? I have yet to see a single request for gpt-4o or gpt-4o-mini work with caching on Azure OpenAI.
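One way to check the question above from the client side is to read the usage block that comes back with each chat completion. The sketch below is a minimal helper, not an official tool: it assumes the response carries `usage.prompt_tokens_details.cached_tokens` (the field the chat completions API uses to report cache hits) and that prompts need at least 1,024 tokens to be cacheable, which is the documented minimum at the time of writing and may change. The function name `cached_token_report` and the sample payload are made up for illustration.

```python
from typing import Any, Mapping

# Assumption: documented minimum prompt length for prompt caching; may change.
MIN_CACHEABLE_PROMPT_TOKENS = 1024


def cached_token_report(usage: Mapping[str, Any]) -> dict:
    """Summarise prompt-cache hits from the `usage` object of one response."""
    prompt_tokens = int(usage.get("prompt_tokens", 0) or 0)
    # The details object can be missing or null on some responses.
    details = usage.get("prompt_tokens_details") or {}
    cached = int(details.get("cached_tokens", 0) or 0)
    return {
        "prompt_tokens": prompt_tokens,
        "cached_tokens": cached,
        "cache_hit": cached > 0,
        # Prompts shorter than the minimum are never expected to hit the cache.
        "eligible": prompt_tokens >= MIN_CACHEABLE_PROMPT_TOKENS,
        "cached_fraction": cached / prompt_tokens if prompt_tokens else 0.0,
    }


if __name__ == "__main__":
    # Hand-written payload for illustration only, not a captured response.
    sample_usage = {
        "prompt_tokens": 2048,
        "completion_tokens": 120,
        "prompt_tokens_details": {"cached_tokens": 1024},
    }
    print(cached_token_report(sample_usage))
```

If `eligible` is true on repeated requests that share the same long prefix and `cache_hit` stays false, that is worth raising with support; if `eligible` is false, the prompt is simply too short to be cached.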
@elprofesornet8897 (1 month ago)
Great video, thanks John. I can now translate this information to my customers.
@VirtualPackets (1 month ago)
Really nice breakdown and whiteboard session, makes perfect sense, thx again John.
@NTFAQGuy (1 month ago)
Glad you liked it!
@LifeisbetterwithaMalinois (1 month ago)
Thank you sir John😊
@NTFAQGuy (1 month ago)
You're welcome 😊
@akhil.varierm1899 (1 month ago)
First comment. 😀 As always, amazing content and presentation. Thanks John.
@NTFAQGuy (1 month ago)
Appreciate you checking it out!
@denver-reed (23 days ago)
I'm a bit confused on the intelligent routing plus the need for APIM. If a global deployment has intelligent routing to figure out which endpoint is best given utilization + latency, how does APIM fit in?
@NTFAQGuy (19 days ago)
Resilience across endpoints, PTU to pay-as-you-go, etc., as I talk about in the video.
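To make the reply concrete: intelligent routing works inside a single global deployment, while a gateway layer sits in front of several separate endpoints and decides which one to call. The sketch below shows that idea as priority-ordered failover in plain Python. It is not APIM policy code and not the method from the video; `Backend`, `Throttled`, `call_with_failover`, and the `send` callable are all hypothetical names, and the example URLs are placeholders.

```python
from dataclasses import dataclass
from typing import Any, Callable, Sequence, Tuple


@dataclass(frozen=True)
class Backend:
    name: str  # e.g. "ptu" or "paygo" (illustrative labels)
    url: str   # placeholder endpoint URL, not a real resource


class Throttled(Exception):
    """Raised by `send` when a backend is throttled (HTTP 429) or unavailable."""


def call_with_failover(
    backends: Sequence[Backend],
    send: Callable[[Backend, Any], Any],
    payload: Any,
) -> Tuple[str, Any]:
    """Try backends in priority order and return (backend name, response).

    `send` is supplied by the caller and performs the actual request; it
    should raise Throttled when the next backend ought to be tried.
    """
    errors = []
    for backend in backends:
        try:
            return backend.name, send(backend, payload)
        except Throttled as exc:
            # Record the failure and spill over to the next backend.
            errors.append(f"{backend.name}: {exc}")
    raise RuntimeError("all backends failed: " + "; ".join(errors))


# Priority order: provisioned throughput first, pay-as-you-go as overflow.
BACKENDS = [
    Backend("ptu", "https://ptu.example.invalid"),
    Backend("paygo", "https://paygo.example.invalid"),
]
```

Listing the PTU deployment first means the reserved capacity is used whenever it is available, and traffic only spills over to pay-as-you-go when the first backend reports throttling. A real gateway would typically add retry delays and health tracking on top of this.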