LLaVA: A large multi-modal language model

  Рет қаралды 5,805

Learn Data with Mark

Learn Data with Mark

Күн бұрын

Пікірлер: 10
@Jerad-wu3xt
@Jerad-wu3xt Ай бұрын
Just found your site. I’m learning so much. You do a great job explaining things. Thanks for all you do to help people.
@thisurawz
@thisurawz Жыл бұрын
Can you do a video on finetuning a multimodal LLM (Video-LlaMA, LLaVA, or CLIP) with a custom multimodal dataset containing images and texts for relation extraction or a specific task? Can you do it using open-source multimodal LLM and multimodal datasets like video-llama or else so anyone can further their experiments with the help of your tutorial. Can you also talk about how we can boost the performance of the fine-tuned modal using prompt tuning in the same video?
@aragaodan
@aragaodan Жыл бұрын
So cool. GenAI is a never ending stream of fun.
@AdamMonago
@AdamMonago Жыл бұрын
Nice job on this video series Mark.
@StudyWithMe-mh6pi
@StudyWithMe-mh6pi Жыл бұрын
Nice job 👏👏
@kenbajema
@kenbajema Жыл бұрын
Thanks for posting. I have it working but I do see an error in Cygwin when I run it regarding a missing cl.exe which exists but it seems to be working
@learndatawithmark
@learndatawithmark Жыл бұрын
It might be worth posting the error as a GitHub issue so they can look into it, but good it works even with the error!
@vishalnakey4703
@vishalnakey4703 Жыл бұрын
👏
@PeterCorless
@PeterCorless Жыл бұрын
Me, immediately honing in on the misspelling of "instruction" at the 17 second mark. 🫠
@learndatawithmark
@learndatawithmark Жыл бұрын
Didn't spot that haha, good one!
Multimodal AI: LLMs that can see (and hear)
21:19
Shaw Talebi
Рет қаралды 5 М.
LLaVA 1.6 is here...but is it any good? (via Ollama)
5:41
Learn Data with Mark
Рет қаралды 16 М.
Une nouvelle voiture pour Noël 🥹
00:28
Nicocapone
Рет қаралды 9 МЛН
We Attempted The Impossible 😱
00:54
Topper Guild
Рет қаралды 56 МЛН
Fine-Tuning Multimodal LLMs (LLAVA) for Image Data Parsing
53:43
Farzad Roozitalab (AI RoundTable)
Рет қаралды 5 М.
Fine-tune Multi-modal LLaVA Vision and Language Models
51:06
Trelis Research
Рет қаралды 30 М.
Small Language Models Explained: The Future of Business Transformation
32:24
Ragnar Pitla (Make it Happen)
Рет қаралды 15 М.
Fine-tuning Multi modal LLMs (Llama 3.2 Vision)
8:32
Data Science in your pocket
Рет қаралды 885
Qwen Just Casually Started the Local AI Revolution
16:05
Cole Medin
Рет қаралды 126 М.
8 AI Tools I Wish I Tried Sooner
16:10
Futurepedia
Рет қаралды 300 М.
NuExtract: An LLM that extracts information
4:08
Learn Data with Mark
Рет қаралды 1,6 М.