Image Recognition with LLaVa in Python

Рет қаралды 15,749

NeuralNine

Күн бұрын

Пікірлер: 36

@yuvrajkukreja9727 4 ай бұрын

Awesome, man! I was not aware of customizing Ollama with this kind of Python script! Thanks :)

@ammadkhan4687 9 күн бұрын

Hi, I love all your videos. Could please make a video on getting structured output using ollama. I have use-case to extract specific information from the image and get the output so that automatically the data will be added in database. thanks in advance.

@NiceTechViews5403 12 күн бұрын

impressive this llva! my original plan was to detect objects via yolo7 ..give the detected objects to ollama to get some text..and let this text then sound via a loudspeaker. llva ist detecting much more object i guess!? - thx for your video 🙂

@blackstonesoftware7074 5 ай бұрын

This is quite useful! It gives me some great ideas for my own local apps!

@joebywan 2 ай бұрын

Rad video, thanks dude. Why's the image path take a list, but supplying multiple images to it doesn't work?

@wasgeht2409 5 ай бұрын

Thanks :) Is it possible to use this model as an ocr alternativ to get for example informationen from a jpeg image which is an id-card ?

@sumukhas5418 5 ай бұрын

This will be too much heavy for just that Instead considering yolo would be a better option

@wasgeht2409 5 ай бұрын

@@sumukhas5418 Thanks for the answer :) Actually I am trying pytesseract to read id-card information, which are photographed by a phone and the results are not very good :/ Do you have some ideas, how I could get some better results?

@derekchance8197 4 ай бұрын

Are there models that recognize a photo and then vectorizes it?

@AlissonSantos-qw6db 5 ай бұрын

Nice, very helpful! Is it possible to create embeddings of pictures with the model?

@declan6052 2 ай бұрын

How can I modify this code to use my local GPU? It seems to default to my CPU but can't find any way to do this easily

@NiceTechViews5403 12 күн бұрын

it is using my GPU..i have py39, CUDA 11.2 and cuDNN 8, 2019 Visual Studio, GTX 1660TI “Tuning sm_75”

@R8R809 5 ай бұрын

Thanks for the video, how to make sure that I install Ollama on the GPU not on the CPU?

@GuillermoGarcia75 5 ай бұрын

Riding the awesomeness wave again!

@rajm5349 3 ай бұрын

can we get the answer in different languages as per the client requrement just like in hindi or tamil or japanese etc if possible

@yuvrajkukreja9727 4 ай бұрын

how to add long term memory in this local llm ???

@jaykrown 3 ай бұрын

This was very helpful, my first time getting results from a multimodal LLM directly using Python.

@brpatil_007 3 ай бұрын

Is ollama and llava is free to use and I have spec 16GB/1TB RTX 3050Ti what no. of model is suitable for my device 13B one or else. And I already using ollama basic 4GB model in my device is it ok to run 13B model and some Other model like OpenAi or Gemini API??

@giovannicordova4803 5 ай бұрын

If my local ram is 8 gb, which ollama model would you recommend to use?

@WebWizard977 5 ай бұрын

deepseek-coder ❤

@WebWizard977 5 ай бұрын

deepseek-coder ❤

@aaronbornmann9835 2 ай бұрын

Thanks for your help you legend

@timstevens3361 Ай бұрын

what gpu ? how much vram ?

@fastmamajama 4 ай бұрын

wow this is too easy to be real. i am using opencv to record videos of flying saucers. i could record images and use llama to verify if there is a flying saucer in it. can i also search videos with videos: instead of images:?

@potatoes1000 5 ай бұрын

is this fully offline? I am not sure you downloaded the 13B 7.4Gb package

@naturexmusic2567 3 ай бұрын

Help me out ,it took less than 10 seconds to get the output , but for me it is like taking 3mins to run , of course it runs , i am happy but it is too late

@santhosh-j7e 2 ай бұрын

My computer takes more than an hour , the system is installed with a 4GB 3060 GPU , what can I do

@naturexmusic2567 2 ай бұрын

@@santhosh-j7e I dont know man , i was like working it for my hackathon , i tried like all pc ,like pentium , i3 , i5 ,i7 but no difference.

@Isusgsue 5 ай бұрын

What a nice vid. Can I do a ai without using open ai ?

@antonpictures 5 ай бұрын

rag - webcam - selfawareness - speech --> tutorial pls

@aoa1015 5 ай бұрын

How much RAM and VRAM needed ?!

@RedFoxRicky 5 ай бұрын

With 4-bit quantization, for LLaVA-1.5-7B, it uses less than 8GB VRAM on a single GPU, typically the 7B model can run with a GPU with less than 24GB memory, and the 13B model requires ~32 GB memory. You can use multiple 24-GB GPUs to run 13B model