ComfyUI Tutorial Series: Ep11 - LLM, Prompt Generation, img2txt, txt2txt Overview

14,250 views

pixaroma

1 day ago

Comments
@AndreyJulpa 1 day ago
A lot of useful information in one video without any fluff. Thank you!
@DezorianGuy 3 months ago
Somehow, other AI YouTubers are prominently promoted by YouTube and Google, but you're truly the only tutor one needs to have, imho.
@Searge-DP 3 months ago
Great video. It's always fun to see my LLM node in action :) If you want more control over the seed on the LLM node, you can also turn random_seed into an input and connect either a primitive node or another seed generator, which gives you the option to use a different seed every time you run the workflow. I've also tried the GGUF versions of Phi-3.5-Mini, for example from bartowski/Phi-3.5-mini-instruct-GGUF on Hugging Face, and those give great results with smaller LLM models and less VRAM use.
@SebAnt 3 months ago
Wow!!
@pixaroma 3 months ago
Thanks, I already did that in the simple workflow at the end of the video, the one that only generates prompts, by adding the seed as an input :) I left the Flux workflow without a random seed in case people want to generate more images from the same prompt, so it is faster. Thanks for the models, I'll have to check them out. Do you know of any model similar to Florence that gives better descriptions from images?
@pixaroma 3 months ago
Can you also add an image input to the node? Or a new node that has that input, so I can get a description from an image? It kind of worked with a path to the image, but I didn't want to add extra nodes just to get the paths in string format and then use instructions on them.
@Searge-DP 3 months ago
@@pixaroma No, the node and models don't support images directly. It also can't load files from your drive. In your video it didn't actually look at the images when you pasted the file paths, it picked up the subject from the image filename. What I usually do is combine what you have here: 1. load image 2. get caption with florence2 3. refine caption with searge-llm 4. ??? 5. profit
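To illustrate the point about file paths: a text-only LLM receives the path as a plain string, so the only image-related information it has is whatever words appear in that string. The sketch below is a minimal illustration under that assumption. The path and the helper name `words_visible_to_text_llm` are hypothetical and do not come from the video or from the Searge node.

```python
from pathlib import PureWindowsPath


def words_visible_to_text_llm(image_path: str) -> list[str]:
    """Return the words a text-only model could pick up from a path string.

    The file is never opened: only the characters of the path are used.
    """
    stem = PureWindowsPath(image_path).stem  # file name without the extension
    return [word for word in stem.replace("-", "_").split("_") if word]


# Hypothetical example path, not taken from the video
print(words_visible_to_text_llm(r"C:\images\woman_portrait_01.png"))
```

With a descriptive file name the model has enough to invent a plausible subject. With a random file name there is nothing to go on, which matches what other commenters report in this thread.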
@ivo_tm 3 months ago
Another great video! As MrBeast says, the next step after 10k subscribers is not 20k, but 100k !!! Keep it up!
@lakislam.photoArt 2 months ago
i can only say : u r unbelievable. your tutorials are out of this world 👏👏👏
@pixaroma 3 months ago
Update: The Searge node cannot provide descriptions of images; use it for prompt generation instead. Use Florence for generating descriptions from images. Therefore, use Searge with LLM for prompt generation from text instructions, and Florence 2 for generating prompts from images. Join the conversation on Discord discord.gg/gggpkVgBf3 or in our Facebook group facebook.com/groups/pixaromacommunity. You can now support the channel and unlock exclusive perks by becoming a member: kzbin.info/door/mMbwA-s3GZDKVzGZ-kPwaQjoin
@digitalblizz9102 2 months ago
I am here to say: You're the best explainer out there when it comes to ComfyUI. Keep it up!
@pixaroma 2 months ago
Thank you ☺️
@baheth3elmy16 3 months ago
Great video! Thank you for the diversity in your videos!
@camilocortes8074 3 months ago
do not have words to describe the amount of value this video provides, congrats!!
@WasamiKirua 3 months ago
thanks for your free content. you always bring a lot of value to the community for free. hope will never change
@odrammurks1497 3 months ago
Thank you so so much for these videos. You're doing an amazing job at explaining! ❤
@НастенькаНастя-ь8ц 2 months ago
Your lessons are really very detailed and useful. I'm doing well, it's fantastic, thank you for your work
@pixaroma 2 months ago
Glad you like them 🙂 and thanks for support on membership
@kaiserinvictoria4897 3 months ago
Straight to the point and useful stuff as always, keep going!
@fernandohildebrand6319 10 days ago
Thank you. The best tutorial I've found.
@rmeta3391 3 months ago
Excellent work, very detailed on Discord. Well organized and easy to follow.
@GenoG 3 months ago
Very well done my friend!! I've been using the Searge LLM for a few days while eagerly awaiting your video. And, as usual, I learned a BUNCH of new ideas and tech tips from this video. I love that you included the Img2Text options!! Thank you! You earned FiDolla!! 😍
@59Marcel 3 months ago
Another well explained and easy to follow tutorial. Looking forward to giving these a try. Thanks.
@pixaroma 3 months ago
Thank you ☺️
@jgag777 3 months ago
Your videos are awesome and the explanations are accessible. Thank you.
@jorgeluismontoyasolis9800 3 months ago
Thank you so much! Really useful, as always. Greetings from Perú
@GenoG 3 months ago
Thanks!
@pixaroma 3 months ago
Thanks again for continuous support ☺️
@Ozstudiosio 26 days ago
new enjoyment day with new video thats all :)
@pixaroma 26 days ago
Thanks ☺️
@BryceHaymond 1 month ago
Great tutorials so far. Giving an LLM a path to an image does not let it see the image; that is not how those models work, but you probably figured that out.
@pixaroma 1 month ago
Yes, I found out later :) It needs a vision model.
@joeshmoe4207 8 days ago
Honestly baffling given the quality of the other parts of the video.
@jennifertsang6572 3 months ago
Great video!! Always learning something from you!!
@OhItsAnthony 2 months ago
22:40 On Windows 10 you can hold Shift and right click and the right click menu will have an option to 'Copy as Path', so no need to go to the File Explorer ribbon menu.
@pixaroma 2 months ago
thank you for the shortcut 🙂
@dmitrykonovalov9366 16 days ago
Very very good tutorial
@MaxwellSmi41483 20 days ago
Been absolutely loving this tutorial series and the accompanying Discord channel. I have been following without any issues or setbacks until this episode. The Searge node cannot be installed because it requires Python 3.11 at most, but ComfyUI is currently running on 3.12.7. I have tried everything, including backing up the python_embeded directory and installing a 3.11 version in a new python_embeded dir. When I run this, it tells me that there is no module called comfy, so it crashes on line 1 of main, which is import comfy.options. If anyone else has experienced this or can help me with this frustration, I would be very grateful indeed!! 🙂 Keep up the great work, learned so much and always grateful for the workflows you include 💯👍
@pixaroma 20 days ago
A member of our Discord made an installer that works and comes with those nodes and a few others already installed. You can use it to install ComfyUI in a different folder: github.com/Tavris1/ComfyUI-Easy-Install
@MarekCezaryWojtaszek 12 days ago
I have the same problem: Searge needs Python 3.11, while the current version of ComfyUI ships with Python 3.12.7. I was just about to downgrade my python_embeded to tackle that, but apparently it won't work either. Did you manage to find a solution for that issue?
@pixaroma 12 days ago
@MarekCezaryWojtaszek You can install this ComfyUI build in a different folder; it comes with all the settings and those nodes installed. Ivo, a member of our Discord, made this installer. I haven't found any other solution yet. github.com/Tavris1/ComfyUI-Easy-Install
@huongnguyen105 1 day ago
thanks, very useful tutorial
@DivineSwd 2 months ago
This video is so good *-*
@ASAS-ci3xu 3 months ago
Thank you so much man. Your great work guided me to explore the ComfyUI workflow from zero to a little understanding. By the way, your opening and closing animation are great too! Is it Kling too? How could you do that? Can you do an episode about this? Menu, cursor, animated background, etc... very attractive! Thanks!
@pixaroma 3 months ago
Depending on the animation, I use multiple tools; if I get some free time, I will do a video on that. I use Kling, CapCut with keyframe animation, and a program called DP Animation Maker.
@SumoBundle 3 months ago
Awesome tutorial like always.
@Redemptionz2 1 month ago
best tutorial!!
@B3D 3 months ago
Thank you for the tutorial, it helped me so much.
@logman121 2 months ago
Well explained tutorial! Thanks a lot! Is there a way to add my own input so that it is always appended to the text generated by Florence? Thank you!
@pixaroma 2 months ago
You can use a Text Concatenate node from WAS Node Suite to combine any text you want from different sources. Add a primitive node, or a positive prompt node from the Easy Use custom nodes, and connect it to Text Concatenate; do the same with the output from Florence, and the result is the two texts joined together.
@logman121 2 months ago
@@pixaroma Thanks for quick reply. It works! :)
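The text concatenate idea from this exchange can be sketched in plain Python. This is only an illustration of the logic, not the WAS Node Suite implementation; the function name `concatenate_text`, the delimiter, and the sample strings are assumptions made for the example.

```python
def concatenate_text(*parts: str, delimiter: str = ", ") -> str:
    """Join non-empty text parts, similar in spirit to a text concatenate node."""
    cleaned = [part.strip() for part in parts if part and part.strip()]
    return delimiter.join(cleaned)


fixed_text = "my_trigger_word"  # hypothetical fixed phrase or LoRA trigger word
caption = "a woman standing in a field at sunset"  # stand-in for Florence output

print(concatenate_text(fixed_text, caption))
```

Because the fixed text is joined on after the caption is generated, it never passes through the captioning model or the LLM, so it cannot be rewritten or dropped along the way.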
@majdmmb3839 7 days ago
Thank you.
@erans 3 months ago
Hi, thanks for the great tutorial - regarding image -> prompt, sometimes i want to use an existing image (and its prompt) - but with my own lora. is there a way to 'insert' my trigger word inside the generated prompt that was made out of an existing image?
@pixaroma 3 months ago
You can add a Text Concatenate node, which will combine your existing prompt with your own text, just like I do in episode 7, where I combine my styles (which are in fact multiple prompts) with the actual prompt.
@YanXiao_AI 3 months ago
thank you for your tutorial
@tmkongen 22 days ago
I get import failed on the Searge node install, tried multiple times. Folder created and file to put in that folder is where it should be. Any ideas?
@pixaroma 22 days ago
Some people had problems on certain configurations; it has something to do with dependencies. Several posted about it on Discord, and the only solution found so far was to install ComfyUI with the node already installed. Someone from the community made an installer: github.com/Tavris1/ComfyUI-Easy-Install
@mayankgupta2937 2 months ago
Amazing videos as usual. If I try to add LoRA keywords to the prompt, they get lost while going through the LLM. Is there any way to work around that?
@mayankgupta2937 2 months ago
used your concatenator idea, worked out well :D
@pixaroma 2 months ago
Yes, Text Concatenate should work; you can add extra things :)
@DeeAreDee 2 months ago
How would we do this with a caption generator that doesn't have its own dedicated node? For example, Joy Caption Alpha Two? I have a Load Checkpoint node with Joy Caption Alpha Two's .safetensors file, but I don't know how to connect that to images to generate captions. Thank you!!!
@pixaroma 2 months ago
Not sure; only if you find a node for that. That was the problem for me too: there are many models and they are hard to integrate. In another episode I also covered the Ollama version, which only works with Ollama models.
@videomark9283 8 days ago
Hello! I tried to do everything described, but when I start the queue, it says "Failed to import transformers.models.mega.configuration_mega because of the following error (look up to see its traceback): No module named 'transformers.models.mega.configuration_mega'". Where did I go wrong?
@pixaroma 8 days ago
When you start ComfyUI, check whether the Searge node actually got installed; most people had that problem because the node was not installed completely. You can try this installer, which sets up ComfyUI in a different folder and installs those nodes automatically. A member of our Discord made it, and it worked for people who could not install the node: github.com/Tavris1/ComfyUI-Easy-Install
@videomark9283 8 days ago
@@pixaroma , thanks a lot! You are № 1
@freewheelburning8834 1 month ago
You have probably figured out something with creating a prompt from a path. I used a Mistral Q8 GGUF model and gave it a path to a .JPG image of Mt. Rushmore. I also reworded your prompt instructions to "Generate a prompt from the image but don't use the path or the file name" and got a very nice image, almost exactly like the picture. So unless they updated the model, I'm guessing it's the wording or the image extension type.
@logudesignstudio 6 days ago
After installing, the Searge LLM node shows up as missing. How do I fix this?
@pixaroma 6 days ago
Someone from the Discord community made this ComfyUI installer you can try; it automatically installs all the nodes I used. I didn't find a way to fix it manually, but this worked for many who had the same problem as you, so give it a try: github.com/Tavris1/ComfyUI-Easy-Install
@GigaTvn 3 months ago
Are you sure the Mistral LLM model can see an image when you give it a path? I think it would need some additional programming to do that. It may recognize the file name, which is why it produces a subject like a woman's portrait or architecture, but the resulting image is not similar. I tested with a random file name and the model was not able to "see" the image for me. :) Thanks for the video!
@pixaroma 3 months ago
No, it seems it doesn't see the image. I talked with the creator of the node, and he said it just invented a prompt from the image file name, so use Florence for that. I will try to find another model and node that is better for image captioning. LLaVA from Ollama seems to know how to read images, and I saw something with Joy in the name. I will do more research. So the Searge node is better for text to text, and Florence for image to text.
@GigaTvn 3 months ago
@@pixaroma Great, waiting for more videos! :)
@FabioAI_Oficial 1 month ago
I have a question. which one is better? Ollama node or this Florence 2? 🤔
@pixaroma 1 month ago
Well, for me Florence was faster. Ollama let me choose from a wider variety of models, so it depends on the PC specs, but Ollama took more VRAM to run, so I don't use it much. I mostly just use ChatGPT, which doesn't take any of my VRAM, so I can generate faster.
@FabioAI_Oficial 1 month ago
@pixaroma thank you for your help!
@FabioAI_Oficial 1 month ago
@pixaroma After installing everything, I have problems with ComfyUI_Searge_LLM about "llama_cpp_cuda". I followed the instructions and nothing happened. Do you maybe have another solution? Thank you
@pixaroma 1 month ago
@FabioAI_Oficial Some people had that problem on certain PC configurations, but I don't know what triggers it. You could try this version made by a member of our Discord, which has Searge already installed with the right dependencies; just install this ComfyUI in a new folder: github.com/Tavris1/ComfyUI-Easy-Install
@karamsahoo6852 1 month ago
Class apart!
@Fayrus_Fuma 3 months ago
Thank you! I'm sure this feature will help a lot of people in creating interesting designs and creative solutions. (Still waiting for your Upscale videos).
@pixaroma 3 months ago
I didn't forget about upscaling; it will be ready this month, I just need to do more tests.
@ESheridan 2 months ago
Hello! I have a problem. When I try to download Florence-2-base Prompt, I get an error: DownloadAndLoadFlorence2Model The checkpoint you are trying to load has model type `florence2` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date. How can I solve this issue? Thanks in advance
@pixaroma 2 months ago
You can try updating transformers to see if that works; maybe you have an old version of ComfyUI and your Manager didn't update successfully. You can go to the update folder and run the bat file that updates ComfyUI and its dependencies. Or you can go to the python_embeded folder, type cmd in the address bar, press Enter, and then run this command: .\python.exe -m pip install --upgrade transformers
@ESheridan 2 months ago
@pixaroma Hello! Thanks for your help, but after the update I still have the error: DownloadAndLoadFlorence2Model The checkpoint you are trying to load has model type `florence2` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
@pixaroma 2 months ago
@ESheridan I am not sure. That node should just download the model and use it; maybe it needs a different version of some dependencies, but it is hard to tell. If nothing works, just install a new ComfyUI in a different folder.
@ESheridan 2 months ago
@pixaroma I realize! Thank you for your help. I have a question: which Florence model is the best, in your opinion?
@pixaroma 2 months ago
@ESheridan I used a few from there, but now there is Llama 3.2, so I will probably need to do some research on that. I still use ChatGPT when I want something more exact.
@faycaltech5922 2 months ago
Can't install Searge LLM. I tried to install llama-cpp as they said on GitHub, but I always get an error in the Manager and when I start ComfyUI.
@pixaroma 2 months ago
Some people still have that problem. I think it is a dependency issue that needs a certain version, but I have no way to test it since it is different for each system. An alternative is to use Ollama, like in episode 13.
@faycaltech5922 2 months ago
@@pixaroma thanks, i'll check this
@dominiquemaik3415 3 months ago
This error appears for me: DownloadAndLoadFlorence2Model Using `low_cpu_mem_usage=True` or a `device_map` requires Accelerate: `pip install accelerate` Plz help
@pixaroma 3 months ago
You could try what the message recommends and install that package to see if it helps. Open your python_embeded folder, type cmd in the address bar at the top (where the path is), and press Enter; that opens a command window in that exact folder. Paste this command and press Enter, then restart: python.exe -m pip install accelerate Or you can try to fix the dependencies: go to the update folder and run the bat file that has dependencies in its name.
@dominiquemaik3415 3 months ago
@@pixaroma It worked perfectly, thank you very much!!! 🙏🏼
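Both fixes suggested above (upgrading transformers, installing accelerate) depend on which packages the interpreter used by ComfyUI can actually see. A minimal sketch for checking that is shown below, using only the standard library. The helper name `installed_version` is an assumption for this example, and the script has to be run with the same interpreter that ComfyUI uses (for the portable build, the python.exe inside python_embeded), otherwise it reports on a different environment.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Packages mentioned in the error messages in this thread
for name in ("transformers", "accelerate"):
    print(name, installed_version(name) or "not installed")
```

If a package still shows as missing or outdated after running pip, the command was most likely run with a different Python than the one ComfyUI starts with.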
@sinuva 3 months ago
Nice mate, make some inpaint tutorial. Would be nice
@pixaroma 3 months ago
Yeah, I plan to do one maybe this month, right now working on upscaling
@Valket 3 months ago
Bro I am just curious, are you using ai voice over?
@pixaroma 3 months ago
Yes, it is an AI voice generated from my text.
@Valket 3 months ago
@@pixaroma Holy shit I will spend the entire thirty minute just listening to the way you talks rather than what you say just for how realistic it is. Its scary. What tool is that?
@pixaroma 3 months ago
@Valket ElevenLabs
@deeceehawk 3 months ago
Great explanation and workflows! There is a hot comment for both you and the algorithm :-) DEECEEHAWK
@MaxwellSmi41483 20 days ago
Sorry, it's not the Searge LLM node itself; it's the required llama-cpp-python that won't work past Python 3.11.
@pixaroma 20 days ago
Check my other comment; for many people the only solution was the installer I linked. I'm not sure why llama-cpp-python causes so many problems.
@MarekCezaryWojtaszek 12 days ago
@@pixaroma I am getting the following error while trying to install llama_cpp_python: ERROR: llama_cpp_python-0.2.89+cpuavx2-cp311-cp311-win_amd64.whl is not a supported wheel on this platform. This is because the current version of ComfyUI is shipped with Python 3.12.7: ** Python version: 3.12.7 (tags/v3.12.7:0b05ead, Oct 1 2024, 03:06:41) [MSC v.1941 64 bit (AMD64)] So, for now I gave up. I will just use ChatGPT for help with creating good prompts and wait for Searge update to 3.12.
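The "not a supported wheel on this platform" error can be read straight from the wheel file name: the `cp311` part says the wheel was built for CPython 3.11, while the interpreter is 3.12. The sketch below shows a simplified version of that comparison. The helper names are hypothetical, `interpreter_tag` assumes CPython, and pip's real check also looks at the ABI and platform tags, so treat this as an illustration rather than a reimplementation of pip's logic.

```python
import sys


def wheel_python_tag(wheel_filename: str) -> str:
    """Extract the Python tag (for example 'cp311') from a wheel file name.

    Wheel names end with: ...-{python tag}-{abi tag}-{platform tag}.whl
    """
    if not wheel_filename.endswith(".whl"):
        raise ValueError("not a wheel file name")
    parts = wheel_filename[: -len(".whl")].split("-")
    if len(parts) < 5:
        raise ValueError("unexpected wheel file name")
    return parts[-3]


def interpreter_tag() -> str:
    """Return the CPython tag of the running interpreter, for example 'cp312'."""
    return f"cp{sys.version_info.major}{sys.version_info.minor}"


# File name copied from the error message in the comment above
wheel = "llama_cpp_python-0.2.89+cpuavx2-cp311-cp311-win_amd64.whl"
print("wheel built for:", wheel_python_tag(wheel))
print("running on:", interpreter_tag())
print("python tags match:", wheel_python_tag(wheel) == interpreter_tag())
```

On the Python 3.12.7 that ships with the current ComfyUI portable build, the last line reports a mismatch, which is why a wheel built for 3.11 is rejected.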
@hishambarakat8544 2 months ago
love your vids man, is there anyway to like support you? you got a Patreon or smth?
@pixaroma 2 months ago
Thank you ☺️ You can join the membership; there is a Join button on the channel ☺️ Also, under the videos there is Super Thanks, the heart icon with a dollar sign.
@talismanna 3 months ago
Anyone got the gguf working in "extra_model_paths.yaml" ... Nothing works for me. I must keep it in the default path.
@pixaroma 3 months ago
I will do a video tutorial about it, since many people seem to have that problem. It will be ready this week.
@ESheridan 2 months ago
@pixaroma Thank you so much for your detailed explanations