Deploy Hugging Face models on Google Cloud: from the hub to Vertex AI

  Рет қаралды 1,177

Julien Simon

Julien Simon

3 ай бұрын

In this series of three videos, I walk you through the deployment of Hugging Face models on Google Cloud, in three different ways:
- Deployment from the hub model page to Inference endpoints ( • Deploy Hugging Face mo... ), with the Google Gemma 7B model,
- Deployment from the hub model page to Vertex AI (this video), with the Microsoft Phi-2 2.7B model,
- Deployment directly from within Vertex AI ( • Deploy Hugging Face mo... ), with the TinyLlama 1.1B model.
Get started at huggingface.co :)
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at / julsimon or Substack at julsimon.substack.com. ⭐️⭐️⭐️

Пікірлер: 9
@spicule123
@spicule123 2 ай бұрын
This is fantastic!
@juliensimonfr
@juliensimonfr 2 ай бұрын
Yes, I like it too :)
@SO-vq7qd
@SO-vq7qd 25 күн бұрын
Thank you!
@juliensimonfr
@juliensimonfr 17 күн бұрын
You're welcome!
@recaia
@recaia 3 ай бұрын
If I upload a code like this and I want to send and receive data through my website on my website, which is programmed in Python, and on another host, how is that done? Or should I create an API and upload it to Cloud?
@juliensimonfr
@juliensimonfr 3 ай бұрын
The endpoint is just another HTTPS API. You can invoke it from your app. This blog post has a good example medium.com/@ashika.umanga/deploying-custom-fine-tuned-llms-on-vertex-ai-6f96752f9fc1, see the "Connecting to the endpoint and performing inference" section.
@barber5937
@barber5937 2 ай бұрын
Can I deploy my own huggingface model to vertex? My model says it is endpoints compatible but I don't see anything indicating vertex ai compatibility or how to achieve that
@juliensimonfr
@juliensimonfr 2 ай бұрын
Yes, assuming your model is based on a supported architecture (Llama, etc.). If you create a model card with the appropriate tags (architecture, task type, etc), the "Deploy" button will let you deploy to AWS, Google, etc. See huggingface.co/docs/hub/model-cards for more, as well as model card for well-known models (google, meta, etc.)
@barber5937
@barber5937 2 ай бұрын
@@juliensimonfr Awesome, thanks julien
Deploy a custom model to Vertex AI
5:27
Mark Ryan
Рет қаралды 9 М.
Как бесплатно замутить iphone 15 pro max
00:59
ЖЕЛЕЗНЫЙ КОРОЛЬ
Рет қаралды 8 МЛН
Why Is He Unhappy…?
00:26
Alan Chikin Chow
Рет қаралды 63 МЛН
ЧУТЬ НЕ УТОНУЛ #shorts
00:27
Паша Осадчий
Рет қаралды 10 МЛН
Vertex AI Search Hello World
8:16
Mark Ryan
Рет қаралды 1,7 М.
Serving Machine Learning models with Google Vertex AI
17:35
ML Engineer
Рет қаралды 9 М.
Deploy models with Hugging Face Inference Endpoints
16:45
Julien Simon
Рет қаралды 15 М.
How To Deploy ML Models With Google Cloud Run
20:10
Patrick Loeber
Рет қаралды 46 М.
How to get predictions from an ML model
6:27
Google Cloud Tech
Рет қаралды 33 М.
Rate This Smartphone Cooler Set-up ⭐
0:10
Shakeuptech
Рет қаралды 6 МЛН
$1 vs $100,000 Slow Motion Camera!
0:44
Hafu Go
Рет қаралды 28 МЛН
Частая ошибка геймеров? 😐 Dareu A710X
1:00
Вэйми
Рет қаралды 4,9 МЛН