LLaVA: A large multi-modal language model

Multimodal AI: LLMs that can see (and hear)

LLaVA 1.6 is here...but is it any good? (via Ollama)

Learn Colors Magic Lego Balloons Tutorial #katebrush #shorts #learncolors #tutorial

Une nouvelle voiture pour Noël 🥹

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

We Attempted The Impossible 😱

LLaVA: A large multi-modal language model

Рет қаралды 5,805

Learn Data with Mark

Learn Data with Mark

Күн бұрын

Пікірлер: 10

@Jerad-wu3xt Ай бұрын

Just found your site. I’m learning so much. You do a great job explaining things. Thanks for all you do to help people.

@thisurawz Жыл бұрын

Can you do a video on finetuning a multimodal LLM (Video-LlaMA, LLaVA, or CLIP) with a custom multimodal dataset containing images and texts for relation extraction or a specific task? Can you do it using open-source multimodal LLM and multimodal datasets like video-llama or else so anyone can further their experiments with the help of your tutorial. Can you also talk about how we can boost the performance of the fine-tuned modal using prompt tuning in the same video?

@aragaodan Жыл бұрын

So cool. GenAI is a never ending stream of fun.

@AdamMonago Жыл бұрын

Nice job on this video series Mark.

@StudyWithMe-mh6pi

@StudyWithMe-mh6pi Жыл бұрын

Nice job 👏👏

@kenbajema Жыл бұрын

Thanks for posting. I have it working but I do see an error in Cygwin when I run it regarding a missing cl.exe which exists but it seems to be working

@learndatawithmark

@learndatawithmark Жыл бұрын

It might be worth posting the error as a GitHub issue so they can look into it, but good it works even with the error!

@vishalnakey4703

@vishalnakey4703 Жыл бұрын

👏

@PeterCorless Жыл бұрын

Me, immediately honing in on the misspelling of "instruction" at the 17 second mark. 🫠

@learndatawithmark

@learndatawithmark Жыл бұрын

Didn't spot that haha, good one!

Multimodal AI: LLMs that can see (and hear)

21:19

Multimodal AI: LLMs that can see (and hear)

Shaw Talebi

Рет қаралды 5 М.

LLaVA 1.6 is here...but is it any good? (via Ollama)

5:41

LLaVA 1.6 is here...but is it any good? (via Ollama)

Learn Data with Mark

Рет қаралды 16 М.

Learn Colors Magic Lego Balloons Tutorial #katebrush #shorts #learncolors #tutorial

00:10

Learn Colors Magic Lego Balloons Tutorial #katebrush #shorts #learncolors #tutorial

Kate Brush

Рет қаралды 45 МЛН

Une nouvelle voiture pour Noël 🥹

00:28

Une nouvelle voiture pour Noël 🥹

Nicocapone

Рет қаралды 9 МЛН

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

00:41

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

King jr

Рет қаралды 7 МЛН

We Attempted The Impossible 😱

00:54

We Attempted The Impossible 😱

Topper Guild

Рет қаралды 56 МЛН

Fine-Tuning Multimodal LLMs (LLAVA) for Image Data Parsing

53:43

Fine-Tuning Multimodal LLMs (LLAVA) for Image Data Parsing

Farzad Roozitalab (AI RoundTable)

Рет қаралды 5 М.

Fine-tune Multi-modal LLaVA Vision and Language Models

51:06

Fine-tune Multi-modal LLaVA Vision and Language Models

Trelis Research

Рет қаралды 30 М.

Small Language Models Explained: The Future of Business Transformation

32:24

Small Language Models Explained: The Future of Business Transformation

Ragnar Pitla (Make it Happen)

Рет қаралды 15 М.

Fine-tuning Multi modal LLMs (Llama 3.2 Vision)

8:32

Fine-tuning Multi modal LLMs (Llama 3.2 Vision)

Data Science in your pocket

Рет қаралды 885

Qwen Just Casually Started the Local AI Revolution

16:05

Qwen Just Casually Started the Local AI Revolution

Cole Medin

Рет қаралды 126 М.

Video #202 MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

14:02

Video #202 MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Data Science Gems

Рет қаралды 323

Fine-tuning Llama 3.2 on Your Data with a single GPU | Training LLM for Sentiment Analysis

21:47

Fine-tuning Llama 3.2 on Your Data with a single GPU | Training LLM for Sentiment Analysis

Venelin Valkov

Рет қаралды 10 М.

8 AI Tools I Wish I Tried Sooner

16:10

8 AI Tools I Wish I Tried Sooner

Futurepedia

Рет қаралды 300 М.

NuExtract: An LLM that extracts information

4:08

NuExtract: An LLM that extracts information

Learn Data with Mark

Рет қаралды 1,6 М.

Screen Speak - A multimodal AI Assistant that transforms screenshots into AI analysis

20:51

Screen Speak - A multimodal AI Assistant that transforms screenshots into AI analysis

John Capobianco

Рет қаралды 369

Learn Colors Magic Lego Balloons Tutorial #katebrush #shorts #learncolors #tutorial

00:10

Learn Colors Magic Lego Balloons Tutorial #katebrush #shorts #learncolors #tutorial

Kate Brush

Рет қаралды 45 МЛН