In this video, I discuss how to use FastAPI for local model serving. FastAPI is a modern, high-performance Python web framework for building APIs. It is designed to be easy to use while staying fast, which makes it a good fit for serving machine learning models, including large language models (LLMs).
We'll also discuss how to use Ray to scale MLOps and LLMOps workloads, and how to apply these techniques to applications like Stable Diffusion. Ray is an open-source framework for scaling Python applications from a single machine to a large cluster, providing a simple way to manage and distribute workloads. Enjoy!
FastAPI Docs:
fastapi.tiango...
HTTP Methods:
www.anyscale.c...
FastAPI For Machine Learning
github.com/Fou...
Why Ray and FastAPI?
www.ray.io/ray...
Ray and FastAPI Deployment
www.anyscale.c...
LinkedIn
/ ai-kadhim
Twitter
/ ai_kadhim