No video

Deploy Hugging Face models on Google Cloud: directly from Vertex AI

  Рет қаралды 8,665

Julien Simon

Julien Simon

Күн бұрын

In this series of three videos, I walk you through the deployment of Hugging Face models on Google Cloud, in three different ways:
- Deployment from the hub model page to Inference endpoints ( • Deploy Hugging Face mo... ), with the Google Gemma 7B model,
- Deployment from the hub model page to Vertex AI ( • Deploy Hugging Face mo... ), with the Microsoft Phi-2 2.7B model,
- Deployment directly from within Vertex AI (this video), with the TinyLlama 1.1B model.
Get started at huggingface.co :)
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at / julsimon or Substack at julsimon.substack.com. ⭐️⭐️⭐️

Пікірлер: 9
@LindsayHiebert
@LindsayHiebert 3 ай бұрын
Great job! Thank you!!!
@juliensimonfr
@juliensimonfr 3 ай бұрын
Glad it was helpful!
@LindsayHiebert
@LindsayHiebert 3 ай бұрын
Julien , excellent episodes. Do you have a cheat sheet reference guide for your video series to develop top level app development skills with Hugging Face models. I am looking for your recommendations for the best path to learn and practice with the best practices and tools. Thanks much!
@juliensimonfr
@juliensimonfr 3 ай бұрын
Hi Lindsay, if you're new to Hugging Face, I recommend starting at huggingface.co/learn, in particular the NLP course and the cookbook. Then, you can start diving deeper into whatever makes more sense to you: cloud, optimization, etc.
@LindsayHiebert
@LindsayHiebert 3 ай бұрын
Thanks much Julien!
@brightworld7550
@brightworld7550 19 күн бұрын
Very helpfull thank you so much. One question. When it comes to pricing. Does Vertex AI bill you based on the usage (how many seconds the model is running) or for as long as the model is running 24/7?
@juliensimonfr
@juliensimonfr 17 күн бұрын
It's instance-based, so you pay for instance time as long as it's up.
@donkeroo1
@donkeroo1 3 ай бұрын
I see at 2:14 you have an error for the gemma model deployment, I am getting the same error for many of the models from HF, however I did manage to get tinyllama working. Any advice on chasing down the error? Logs are telling me the models already exist. Thank you for the guidance!
@juliensimonfr
@juliensimonfr 2 ай бұрын
Looks like the first few days were a bit rough :) Are you still seeing errors? If yes, please post at discuss.huggingface.co
Google Releases AI AGENT BUILDER! 🤖 Worth The Wait?
34:21
Matthew Berman
Рет қаралды 229 М.
Getting started with Gemini on Vertex AI
18:09
Google Cloud Tech
Рет қаралды 1,1 М.
What it feels like cleaning up after a toddler.
00:40
Daniel LaBelle
Рет қаралды 89 МЛН
Пранк пошел не по плану…🥲
00:59
Саша Квашеная
Рет қаралды 7 МЛН
A Star Is About to Explode (And You'll Be Able to See It)
8:45
StarTalk
Рет қаралды 2,1 МЛН
SageMaker JumpStart: deploy Hugging Face models in minutes!
8:23
Serving Machine Learning models with Google Vertex AI
17:35
ML Engineer
Рет қаралды 9 М.
🤗 Hugging Cast S2E3 - Deploying LLMs on Google Cloud
32:11
HuggingFace
Рет қаралды 1,6 М.
Run your own AI (but private)
22:13
NetworkChuck
Рет қаралды 1,3 МЛН
Google Vertex AI Agent Builder Tutorial
14:50
Architecture Bytes
Рет қаралды 10 М.
The most important AI trends in 2024
9:35
IBM Technology
Рет қаралды 231 М.
An LLM journey speed run: Going from Hugging Face to Vertex AI
38:24
Google for Developers
Рет қаралды 1,9 М.
What it feels like cleaning up after a toddler.
00:40
Daniel LaBelle
Рет қаралды 89 МЛН