Since I made this video I added a "precise style transfer" node to the IPAdapter. You can use that instead of fiddling with the Mad Scientist. It also works with SD1.5 (to some extent). Also since I've been asked quite a few times now... sorry, we do not have exact data of what each block does. 3 and 6 are pretty strong so it was easy but other layers have also some impact on both the composition and the style. Some seems to effect text, others background, others age. But at the moment it doesn't seem there is a "definitive guide". I would have told you otherwise 😅
@flankechen4 ай бұрын
thanks a lot, so in SD1.5, which block for style which for composition?
@CaraDePatoGameplays4 ай бұрын
This intrigued me, I'm going to do a lot of tests to see what they do besides 3 and 6
@animatedstoriesandpoems3 ай бұрын
Why any of your tutorial never work when we try ??
@godlesschannel7730Күн бұрын
i did some testing but hard to figure out 10 seems like about shadow/contrast 7 shine 5 contour? anybody got somewhere? :D share what we find out pls
@MarcSpctr5 ай бұрын
This guy is literally equivalent to what Piximperfect is to Photoshop. I doubt even the people who worked on SDXL had any idea that this much stuff and control can be gained over the models. Like seriously, wtf ???? Amazing work.
@saschamrose64985 ай бұрын
i would say more like video co pilot is to after effects
@latentvision5 ай бұрын
nnaah I guess that the difference is just that I actually share what I find
@GG-hh1sl5 ай бұрын
@@latentvision lol
@DarioToledo5 ай бұрын
Unm3sh
@rhaedas90855 ай бұрын
@@latentvision Share, and explain. You're like that one teacher that didn't just show you the math formula, but showed why it was important and how to use it practically.
@DarkGrayFantasy4 ай бұрын
As always amazing work Matt3o! For those interested in the Crossattention codes this is what they target: 1) General Structure 2) Color Scheme 3) Composition 4) Lighting and Shadow 5) Texture and Detail 6) Style 7) Depth and Perspective 8) Background and Environment 9) Object Features 10) Motion and Dynamics 11) Emotions and Expressions 12) Contextual Consistency
@stefansotra29344 ай бұрын
Where did you get this info?
@DarkGrayFantasy4 ай бұрын
@@stefansotra2934 Research really, nothing more...
@ceegeevibes13354 ай бұрын
wow cool... thanks!
@walidflux4 ай бұрын
is 12 the 0.0 index ? if there is a more clear description for all these please link it
@831digital5 ай бұрын
Best Comfyui channel on KZbin.
@miguelitohacks4 ай бұрын
x4096 agree
@blender_wiki3 күн бұрын
By far
@TriNguyenKV4 ай бұрын
when it comes to teaching and concise explaining, you are the GOAT!!!! Thank you so much, please keep doing this. Thank you!
@lonelyeyedlad7695 ай бұрын
Great work as usual, M! I am happy to see that the group experimentation with the UNET layers has led to the development of a node that will give us more control over our generations. Thank you for your continued efforts in this field!
@leolis785 ай бұрын
Matteo, your work is amazing! You are our Dr. Brown. Our mad scientist who will give 1.21 Gigawatts to the AI to take us to the future. We love you!!! 😄😄😄
@latentvision5 ай бұрын
just doing my part!
@ooiirraa5 ай бұрын
@@latentvision and we are doing our part loving you and being grateful 🎉
@caseyj7894565 ай бұрын
Yeah you are our mad scientist 😂 ❤ Merci Mateo !
@HiProfileAI4 ай бұрын
I love the idea of target conditioning various layers and being able to direct the layer with this kind of control in the cross attention. Thank you Matteo for you continued work and expertise. You give us a lot to play with and work with. The implications of the kind of control we can have in image creation and manipulation will last for years. Continued blessing to and appreciation to you good sir. 🙏🏾👍🏾
@moviecartoonworld44595 ай бұрын
"While keeping up with the influx of new features is important, I'm reminded again of the value of in-depth understanding of a single function. Thank you as always."
@TheCrash15092 ай бұрын
Thank you so much for these tutorials, I was struggling to understand and use Stable Diffusion, but have made so much progress since trying out ComfyUI paired with your tutorials. Please keep making the content you do, it's the best in class!
@ysy695 ай бұрын
Thank you. Exactly, we become conditioned to chase the new shiny toy rather that fully learning and enjoying the old ones. So much can be done with this, looking forward to...
@dck70485 ай бұрын
Image gen is a tech that seemed science fiction a couple years ago, but to have refined it to the point people in their homes can casually do generations like 7:19 is nothing short of outstanding. Thanks as always.
@urbanthem5 ай бұрын
Thanks a thousand Matteo. Your last statement is something I tell time and time again, we only use so little potential in what's already out there. Brilliantly proving that point.
@euroronaldauyeung86255 ай бұрын
genius hacking of cross attention and perfect explanation of the indexing.
@GG-hh1sl5 ай бұрын
Just found the node today and was wondering about its use - thanks for sharing the knowledge!
@autonomousreviews25215 ай бұрын
Love what you're doing for the community - thank you for your time and for sharing :D
@majic_snap5 ай бұрын
My understanding is that Precise generally weakened the weights of more layers, but style has always been a mystery to neural networks, although you have done so well already. I hope you can bring us more surprises, thank you for your contributions! The name 'Mad Scientist' is simply fantastic
@nerdbg17824 ай бұрын
This builds on your previous experimental node where you asked for some help from the community. Glad to see they helped you decipher the layers
@latentvision4 ай бұрын
not to remove anything from the wonderful community but you've been distracted 😄Style and Composition was released months ago, way before the prompt injection.
@nerdbg17824 ай бұрын
@@latentvision I was speaking about block weights, this one: kzbin.info/www/bejne/haO2hWmEppaBi8ksi=VyhskRDQS5m8JFMX Anyhow, it's nice to see the two combined, regardless of if it is a new feature or not. Good stuff, in either case 🙂
@tsentenari435315 күн бұрын
"all you need is another mad scientist"
@johnsondigitalmedia4 ай бұрын
Awesome work! Do you have the info on the other 10 control index points?
@kenwinne4 ай бұрын
Matteo, thank you for bringing us IPAdapter, which provides a solid ground for us to combat the uncertainty generated by large models. I personally like your explanation of basic theories. Although your course is less than 10 minutes, I have studied it repeatedly for several hours. If you have time to explain in detail the specific functions and applications of the 12 layers of the cross nerve, thank you very much for your efforts, thank you!
@tunexpectedv0id5 ай бұрын
Thanks a lot for this new node, really appreciate it.
@Showdonttell-hq1dk5 ай бұрын
This is so incredibly cool! Thank you very much. I can't even imagine how nerve-wracking and exciting the coding was for this. :)
@SouthbayCreations4 ай бұрын
Great video, thank you! Where can we find this node?
@GoblinWar5 ай бұрын
Cos-XL is so tight, I'm a huge fan
@Billybuckets5 ай бұрын
Until I use this a *lot*, I will have no idea what the different UNet blocks do. Maybe you could put a Note node in the pack that contains an estimation of the relative contribution of each block to style, composition, and anything else that might be useful. A++ work as always. Best SD channel around.
@latentvision4 ай бұрын
unfortunately we don't know exactly what the blocks do
@BubbleVolcano4 ай бұрын
Nice work! ❤It's awesome to see real progress on the U-net layer. But having too many parameters can make it tough to get started, even for someone like me who's been at it for over a year. It's just too challenging for ordinary people. If we change the filling parameter to four simple options like ABCD, it might be easier to promote. Ordinary people aren't into the process; they're all about the end result.
@karlwang48374 ай бұрын
it was amazing ,thank you for the work you have done for the community, i really appreciate it
@mickmack8026Ай бұрын
This is awesome, thank you so much! Could the Mad Scientist also be used with Flux?
@madmushroom86395 ай бұрын
Very cool! Would love to see some coding sessions. Maybe you could explain your code a bit. More info about the vector sizes, layers etc :)
@latentvision4 ай бұрын
I was thinking about that... not sure how much interest there would be on that though
@madmushroom86394 ай бұрын
@@latentvision Yeah maybe, but your "ComfyUI: Advanced Understanding (Part 1)" video actually performed really well I think, where you went into more details. That plus some code examples what is going on behind the scenes with your knowledge would be awesome! Maybe a small poll could show if its worth your time :)
@Archalternative3 ай бұрын
Matteo sei davvero incredibile con il tuo lavoro... 🎉
@GG-hh1sl5 ай бұрын
How about a widget setting in the IpAdapter node, to set the strength of each layer with a short lable of its function?
@latentvision5 ай бұрын
we don't know exactly what is the function of each layer unfortunately
@jibcot85415 ай бұрын
Very cool, I need to play with IPAdapter more often, but I am often too busy just improving prompts and upscale workflows!
@jasonchen11395 ай бұрын
Incredible Content ! your work is undoubted the best !
@openroomxyz5 ай бұрын
Thanks that's cool, amazing findings that will help the comunity
@Arknight-p2l5 ай бұрын
You are a mad scientist haha thank you so much Matteo
@latentvision5 ай бұрын
mad for sure, scientist not so much 😅
@Arknight-p2l5 ай бұрын
@@latentvision haha 😂 keep up the great work I love your content.
@Sedtiny5 ай бұрын
Thank you again, my lord
@latentvision5 ай бұрын
most welcome, my liege
@sephia45834 ай бұрын
Is there any similar way to apply Lora style to only specific layer? Maybe we can apply negative weight for composition layer (e.g. layer 3) and positive weight for style layer (e.g. layer 6)?
@legendaryanime695 ай бұрын
Always waiting for your greate video, that help me alot! Thanks
@divye.ruhela5 ай бұрын
Impeccable naming, we're all a little mad by now 🤣
@dreammaking5165 ай бұрын
Insanely cool, also just realized, you are italian as well😂🔥
@MrGingerSir5 ай бұрын
This is awesome! Are you planning on making a version that works with embeds?
@latentvision5 ай бұрын
why not :)
@MrGingerSir5 ай бұрын
@@latentvision sweet!
@fukong5 ай бұрын
God of IPAdapter
@swannschilling4745 ай бұрын
I'll take the blue pill!! 😁 Thanks so much for this one!! 💊
@rsunghun5 ай бұрын
Absolutely amazing 😮
@jccluaviz5 ай бұрын
Thanks you dr.Matteo. I think i need one of your pills to make my days shine. Again an extraodinary work.
@Firespark815 ай бұрын
This is awesome! ty!
@TheFountainOfEnlightenment2 ай бұрын
Awesome!, the best tuts I've seen. Thanks
@alxleiva4 ай бұрын
You called that node based on yourself right? You're truly a mad scientist bringing us the best discoveries! Thank you Mateo
@ceegeevibes13354 ай бұрын
love love love this, going MAD!!!!
@YING1805 ай бұрын
so cool and you are our mad scientist
@michail_7775 ай бұрын
And one more question. Where can I find an explanation of the index/Cross Attention?
@noxin74 ай бұрын
Mateo, This is amazing work with the mad scientist node - My only question (not criticism) is if you plan to convert the index:weight string into widgets for ease of use or is there something that prevents that?
@latentvision4 ай бұрын
yeah I can do that :)
@walidflux5 ай бұрын
Again, blowing minds !!!!
@StudioOCOMA2D3D4 ай бұрын
Very good as always Matteo. Can you explain all the index please? I've noticed only 3: 3: Reference image 5: Composition 6: Style
@abdellahla61595 ай бұрын
Great node, thanks a lot 😁
@jensenkung3 ай бұрын
7:20 my jaw literally drop
@SerginMattos5 ай бұрын
Your work is amazing!
@madmed66723 ай бұрын
you're literally the goat my guy!
@igorkotov89375 ай бұрын
Thank you!
@johnriperti31275 ай бұрын
Thanks Matteo, this is so good!
@nomand5 ай бұрын
incredible. Apart from style and composition, has the community found consensus on what specific qualities of the image other indexes affect?
@latentvision4 ай бұрын
not really unfortunately
@ryanontheinside4 ай бұрын
this is awesome thank you
@krio_gen5 ай бұрын
Unbelievable.
@latentvision5 ай бұрын
believe it!
@krio_gen5 ай бұрын
@@latentvision ))) I dived into it with my head. I feel like a Mad Scientist)
@vf4am23 ай бұрын
This is pretty awesome. Great work! I have a question about the cross attention indexes. Are they tied to output or input blocks in terms of merging? I am wondering if this could help to find the best blocks to merge to for more precision.
@latentvision3 ай бұрын
yeah each index is a block, I use index number instead of block number because it's easier
@vf4am23 ай бұрын
@@latentvision thanks for the follow up. I guess what I was wondering though is with block weights, input and output each have 11 blocks. So if I were to equate the effects of each index to a block to merge to, would index 6 equal output block 6, input block 6 or both?
@latentvision3 ай бұрын
@@vf4am2 index is a reordered list of blocks input > mid > output. ping me on discord if you need a 1:1 relationship
@Nairb9325 ай бұрын
Keep up the good work man
@latentvision5 ай бұрын
I try
@MikeTon4 ай бұрын
Amazing and insightful work! Question wrt to sponsorship, do you have a preference between github vs patreon? I'm getting so much value here that I want to meaningfully support you and will default to github support if there's no preference
@latentvision4 ай бұрын
hey thanks! I don't use patreon because I don't have time to push updates. Either github or paypal at the moment!
@manojkchauhan4 ай бұрын
Hey Matteo, Just finished your ComfyUI tutorial - seriously impressive stuff! 👍❤ Your breakdown of advanced features with practical examples is super motivating. I'm excited to put these into action and unlock the full potential of ComfyUI. Thanks for sharing your knowledge!
@huwhitememes5 ай бұрын
Awesome, Bro
@aidiffuser5 ай бұрын
Hello man, thanks for sharing this amazing improvement on control! Did something change between the style transfer and composition from 2 days ago to this release? I cant seem to reproduce same results :( Or, is there a way to reproduce the exact same layer weights of that previous release within the mad scientist node?
@latentvision4 ай бұрын
no, style and composition should be the same. if you have issues please post an issue on the official repository with a before/after images possibly
@lupinsensei7456Ай бұрын
More than Mad Scientist should be called Great Wizard node.
@nelsonporto5 ай бұрын
GENIUS
@quotesspace17135 ай бұрын
Thanks, that's really cool 🙏🙏. but Is this just for me? I found almost everything too advanced and couldn't understand what's going on, but I would really love to understand it in depth so that I can add my own to it and share. I do have some knowledge on comfyui but this is...
@latentvision5 ай бұрын
check the "basics" series!
@miguelitohacks4 ай бұрын
HOLY SHIT, this is powerful!
@latentvision4 ай бұрын
IKR?!
@ooiirraa5 ай бұрын
Woooow, wow 🎉 you are amazing. This is just soooo cool. Why the negative prompt doesn't go with minus? It would be 3:-2.5, 6:1, and this way all the sintaxis could be consistent everywhere. And people would be able to pass positive and negative as much as they want.
@latentvision4 ай бұрын
I need to think about it, technically you can send a negative value to the positive embeds so it's not that simple
@ooiirraa4 ай бұрын
@@latentvision then it could be a letter like 3:n2.5, 6:1 or 3:2.5n, 6:1 or 3:neg2.5, 6:1 (to make it 100% transparent)
@mycelianotyours19805 ай бұрын
Thank you so much!
@kinai_44144 ай бұрын
Damn that's impressive. Could the same logic be applied to a Lora node in the future ?
@AIFuzz595 ай бұрын
Do you have a list of what the other index layers are? We are experimenting with this now
@latentvision4 ай бұрын
no, it's difficult to undestand. some are subject specific for example (eg: they work with people not with landscapes)
@yvann.mp44 ай бұрын
amazing, thanks a lots
@neofuturist5 ай бұрын
UPDATE ALL THE NODES!!!! thanks Matteo
@ChanhDucTuong16 күн бұрын
This is amazing tyvm. Can we do the same with FLUX?
@nkofr4 ай бұрын
Hi, thanks, wonderful! I just don't understand the point of this custom node having "weight_type" field if we modify the layers' weights in the bottom input field? Is "weight_type" overriden by the values in the input field?
@latentvision4 ай бұрын
"style transfer precise" uses a different strategy to apply the embeds. You need to use it only if you want to do the style transfer thing. If you want to experiment with blocks you can select whatever and it will be overwritten (except again "precise")
@nkofr4 ай бұрын
@@latentvision Thank you Matteo, that's awesome! Grazie
@Mika433445 ай бұрын
W O W!!! AMAZING!
@elegost25704 ай бұрын
@latentvision Is it possible to combine the image to image workflow along with even more control to resemble the input? Aka, control net type of options.
@latentvision4 ай бұрын
yes of course!
@elegost25704 ай бұрын
@@latentvision do you have any pointers in that regard? I’ve tried a few things but keep getting errors :(
@kallamamran5 ай бұрын
Wow... You should make the layers as weight handles and call the layers for what they are :D
@michail_7775 ай бұрын
Mateo hi and thank you. I'm using the Mad Scientist node. Thanks for the clarification. I've become more aware of how to use it. I also have one question about the "IPAdapter Encoder" node, it has an input for a mask? The point is that both input image and mask should be connected to this node. When using only the input image in the "IPAdapter Encoder" node, the output image adopts the style/whatever. But when I also connect an input mask (I tried just a colored map, image, half painted image), the IPAdapter Encoder node has no effect on the generated image at all. Could you please explain how to use the mask in the "IPAdapter Encoder" node?
@latentvision5 ай бұрын
I'm sorry I'm not sure I completely understand, maybe join my discord or post a discussion with some screenshots in the IPAdapter repository
@michail_7775 ай бұрын
@@latentvision Yeah, I already wrote to L2 (quick help).
@gammingtoch2592 ай бұрын
Thanks bro! very usefull :D
@pandelik34504 ай бұрын
So basically the selectable weight type options are just preset lists of 12 values for each of the 12 blocks?
@latentvision4 ай бұрын
the weights type do a little mode that that but yeah they play with the blocks weight
@context_eidolon_music5 ай бұрын
Your 666th like is from me. I don't know what I'd do without your brilliant work. Thank you.
@glassmarble9964 ай бұрын
you have so many secrets matteo :D
@calvinherbst3044 ай бұрын
dying to know what the other index blocks are!
@latentvision4 ай бұрын
don't we all?! 😄
@anton53812 ай бұрын
Thanks for the video! But can't make it work.... do everything like you but getting "Error(s) in loading state_dict for Resampler: size mismatch for proj_in.weight: copying a param with shape torch.Size([1280, 1280]) from checkpoint, the shape in current model is torch.Size([1280, 1664])." could you please help me?
@weilinke922410 күн бұрын
😀you are always awesome~
@latentvision5 күн бұрын
no, YOU are!
@GiovaniFerreiraS4 ай бұрын
Is this an evolution on the prompt block by block thing? I remember you saying on that video that nothing stopped you from using images.
@latentvision4 ай бұрын
the technology is the same but technically we did this before the prompt injection. Visual embeddings are easier to evaluate
@chriswendler54642 ай бұрын
Is there a research paper that we could cite already referring to these layers as composition/style layers?
@latentvision2 ай бұрын
not that I know of
@chriswendler54642 ай бұрын
I will cite your video for now
@pixelcounter5065 ай бұрын
One proposal and one remark: You should name it "clever" instead of mad!^^ Your last words are very well spoken. I have problems to keep track of all the new nodes and developments regarding time and depth of understanding and using. There isn't even enough time to carefully read through manuals, test nodes, combine new workflows based on what we already have. The big players are on the run, too. See all these new announcements what their model brings for the benefit of the world (and for their pockets). [irony off]. Nevertheless, interesting, but challenging times ! And thank you for your contributions, Matteo, always appreciated!
@2shinrei4 ай бұрын
🤯
@lucagenovese72074 ай бұрын
07:20 quella roba è fucking insane.
@bgmspot72425 ай бұрын
Nice❤❤
@davidsmith-lv4kq4 ай бұрын
in the video , at the part where u describe layer 3 , 2.5 as being a negative value, what makes the postive value become negative ?
@PCO774 ай бұрын
if weight_type == "style transfer precise": if layers == 11 and t_idx == 3: uncond = cond cond = cond * 0 The number itself is not converted to a negative number. uncond is set to cond and cond is then zeroed out. You're then increasing the strength of uncond (negative) with higher values in layer 3.