Image Recognition with LLaVa in Python

  Рет қаралды 15,749

NeuralNine

NeuralNine

Күн бұрын

Пікірлер: 36
@yuvrajkukreja9727
@yuvrajkukreja9727 4 ай бұрын
Awesome, man! I was not aware of customizing Ollama with this kind of Python script! Thanks :)
@ammadkhan4687
@ammadkhan4687 9 күн бұрын
Hi, I love all your videos. Could please make a video on getting structured output using ollama. I have use-case to extract specific information from the image and get the output so that automatically the data will be added in database. thanks in advance.
@NiceTechViews5403
@NiceTechViews5403 12 күн бұрын
impressive this llva! my original plan was to detect objects via yolo7 ..give the detected objects to ollama to get some text..and let this text then sound via a loudspeaker. llva ist detecting much more object i guess!? - thx for your video 🙂
@blackstonesoftware7074
@blackstonesoftware7074 5 ай бұрын
This is quite useful! It gives me some great ideas for my own local apps!
@joebywan
@joebywan 2 ай бұрын
Rad video, thanks dude. Why's the image path take a list, but supplying multiple images to it doesn't work?
@wasgeht2409
@wasgeht2409 5 ай бұрын
Thanks :) Is it possible to use this model as an ocr alternativ to get for example informationen from a jpeg image which is an id-card ?
@sumukhas5418
@sumukhas5418 5 ай бұрын
This will be too much heavy for just that Instead considering yolo would be a better option
@wasgeht2409
@wasgeht2409 5 ай бұрын
@@sumukhas5418 Thanks for the answer :) Actually I am trying pytesseract to read id-card information, which are photographed by a phone and the results are not very good :/ Do you have some ideas, how I could get some better results?
@derekchance8197
@derekchance8197 4 ай бұрын
Are there models that recognize a photo and then vectorizes it?
@AlissonSantos-qw6db
@AlissonSantos-qw6db 5 ай бұрын
Nice, very helpful! Is it possible to create embeddings of pictures with the model?
@declan6052
@declan6052 2 ай бұрын
How can I modify this code to use my local GPU? It seems to default to my CPU but can't find any way to do this easily
@NiceTechViews5403
@NiceTechViews5403 12 күн бұрын
it is using my GPU..i have py39, CUDA 11.2 and cuDNN 8, 2019 Visual Studio, GTX 1660TI “Tuning sm_75”
@R8R809
@R8R809 5 ай бұрын
Thanks for the video, how to make sure that I install Ollama on the GPU not on the CPU?
@GuillermoGarcia75
@GuillermoGarcia75 5 ай бұрын
Riding the awesomeness wave again!
@rajm5349
@rajm5349 3 ай бұрын
can we get the answer in different languages as per the client requrement just like in hindi or tamil or japanese etc if possible
@yuvrajkukreja9727
@yuvrajkukreja9727 4 ай бұрын
how to add long term memory in this local llm ???
@jaykrown
@jaykrown 3 ай бұрын
This was very helpful, my first time getting results from a multimodal LLM directly using Python.
@brpatil_007
@brpatil_007 3 ай бұрын
Is ollama and llava is free to use and I have spec 16GB/1TB RTX 3050Ti what no. of model is suitable for my device 13B one or else. And I already using ollama basic 4GB model in my device is it ok to run 13B model and some Other model like OpenAi or Gemini API??
@giovannicordova4803
@giovannicordova4803 5 ай бұрын
If my local ram is 8 gb, which ollama model would you recommend to use?
@WebWizard977
@WebWizard977 5 ай бұрын
deepseek-coder ❤
@WebWizard977
@WebWizard977 5 ай бұрын
deepseek-coder ❤
@aaronbornmann9835
@aaronbornmann9835 2 ай бұрын
Thanks for your help you legend
@timstevens3361
@timstevens3361 Ай бұрын
what gpu ? how much vram ?
@fastmamajama
@fastmamajama 4 ай бұрын
wow this is too easy to be real. i am using opencv to record videos of flying saucers. i could record images and use llama to verify if there is a flying saucer in it. can i also search videos with videos: instead of images:?
@potatoes1000
@potatoes1000 5 ай бұрын
is this fully offline? I am not sure you downloaded the 13B 7.4Gb package
@naturexmusic2567
@naturexmusic2567 3 ай бұрын
Help me out ,it took less than 10 seconds to get the output , but for me it is like taking 3mins to run , of course it runs , i am happy but it is too late
@santhosh-j7e
@santhosh-j7e 2 ай бұрын
My computer takes more than an hour , the system is installed with a 4GB 3060 GPU , what can I do
@naturexmusic2567
@naturexmusic2567 2 ай бұрын
@@santhosh-j7e I dont know man , i was like working it for my hackathon , i tried like all pc ,like pentium , i3 , i5 ,i7 but no difference.
@Isusgsue
@Isusgsue 5 ай бұрын
What a nice vid. Can I do a ai without using open ai ?
@antonpictures
@antonpictures 5 ай бұрын
rag - webcam - selfawareness - speech --> tutorial pls
@aoa1015
@aoa1015 5 ай бұрын
How much RAM and VRAM needed ?!
@RedFoxRicky
@RedFoxRicky 5 ай бұрын
With 4-bit quantization, for LLaVA-1.5-7B, it uses less than 8GB VRAM on a single GPU, typically the 7B model can run with a GPU with less than 24GB memory, and the 13B model requires ~32 GB memory. You can use multiple 24-GB GPUs to run 13B model
@Justwil07
@Justwil07 4 ай бұрын
7.5 Gb ?????
@Tech_Distro
@Tech_Distro 4 ай бұрын
It's 4.7gb for 7b version
@syedmokarromhossain4867
@syedmokarromhossain4867 5 ай бұрын
First comment 😊😊😊
@arjuntt2604
@arjuntt2604 5 ай бұрын
oh im too fast
Book Recommendation System in Python with LLMs
24:33
NeuralNine
Рет қаралды 7 М.
Image Annotation with LLava & Ollama
14:40
Sam Witteveen
Рет қаралды 31 М.
人是不能做到吗?#火影忍者 #家人  #佐助
00:20
火影忍者一家
Рет қаралды 20 МЛН
Enceinte et en Bazard: Les Chroniques du Nettoyage ! 🚽✨
00:21
Two More French
Рет қаралды 42 МЛН
Quando eu quero Sushi (sem desperdiçar) 🍣
00:26
Los Wagners
Рет қаралды 14 МЛН
Face Recognition With Python 3.10 Tutorial (Webcam)
18:59
Indently
Рет қаралды 116 М.
DuckDB in Python - The Next Pandas Killer?
19:32
NeuralNine
Рет қаралды 36 М.
Python 50 Useful Tips
41:59
DataScience&Coding José Comé
Рет қаралды 6 М.
Image classification with Python and Scikit learn | Computer vision tutorial
32:28
Computer vision engineer
Рет қаралды 58 М.
Denoising Images with OpenCV in Python
10:10
NeuralNine
Рет қаралды 4,6 М.
Build Your Own News Hub in Python - RSS Feed Aggregator
22:25
NeuralNine
Рет қаралды 7 М.
Movie Recommender System in Python with LLMs
25:01
NeuralNine
Рет қаралды 10 М.
Unlocking The Power Of AI: Creating Python Apps With Ollama!
12:12
Matt Williams
Рет қаралды 33 М.
Build a Python AI Image Generator in 15 Minutes (Free & Local)
17:20
5 Custom Python Decorators For Your Projects
25:40
NeuralNine
Рет қаралды 9 М.
人是不能做到吗?#火影忍者 #家人  #佐助
00:20
火影忍者一家
Рет қаралды 20 МЛН