Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models.
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs.
In this video, Mofi Rahman and Ali Zaidi walk through the process of deploying Gemma on GKE with TPUs using JetStream.
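The deployment walked through in the video can be sketched as a Kubernetes manifest targeting a GKE TPU node pool. This is a minimal illustration only, not the exact manifest from the guide linked below: the container image, port, chip count, and topology values are placeholders that will differ per setup.

```yaml
# Hypothetical sketch of a JetStream server Deployment on a GKE TPU node pool.
# Image, port, chip count, and topology are assumed placeholder values.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: jetstream-gemma
spec:
  replicas: 1
  selector:
    matchLabels:
      app: jetstream-gemma
  template:
    metadata:
      labels:
        app: jetstream-gemma
    spec:
      nodeSelector:
        # GKE node labels used to schedule pods onto TPU slices;
        # the accelerator type and topology depend on your node pool.
        cloud.google.com/gke-tpu-accelerator: tpu-v5-lite-podslice
        cloud.google.com/gke-tpu-topology: 2x4
      containers:
      - name: jetstream
        image: JETSTREAM_SERVER_IMAGE  # placeholder: your JetStream server image
        ports:
        - containerPort: 9000  # placeholder serving port
        resources:
          requests:
            google.com/tpu: "8"  # TPU chips requested; must match the slice topology
          limits:
            google.com/tpu: "8"
```

Requests and limits for `google.com/tpu` must be equal, and the chip count has to line up with the node pool's topology; see the guide in this description for the tested manifests.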
Find Gemma on Hugging Face - huggingface.co/google
Find Gemma on Kaggle - www.kaggle.com/models/google/...
Follow along the guide: cloud.google.com/kubernetes-e...
Find other guides for serving Gemma and other AI/ML resources for GKE: g.co/cloud/gke-aiml
Find other resources for learning about Gemma: ai.google.dev/gemma