What is Prompt Tuning?

  Рет қаралды 164,936

IBM Technology

IBM Technology

11 ай бұрын

Explore watsonx → ibm.biz/BdvxRp
Prompt tuning is an efficient, low-cost way of adapting an AI foundation model to new downstream tasks without retraining the model and updating its weights. In this video, Martin Keen discusses three options for tailoring a pre-trained LLM for specialization, including: fine tuning, prompt engineering, and prompt tuning ... and contemplates a future career as a prompt engineer.
Get started for free on IBM Cloud → ibm.biz/sign-up-now
Subscribe to see more videos like this in the future → ibm.biz/subscribe-now
#ai #watsonx #llm

Пікірлер: 53
@dominikzmudziak8340
@dominikzmudziak8340 2 ай бұрын
Im stunned how Martin is able to write backwards on this board so efficiently
@pradachan
@pradachan 2 ай бұрын
they just mirror the whole recording
@Gordin508
@Gordin508 11 ай бұрын
Really like these summarization videos on this channel. While they do not go into depth, I appreciate the overarching concepts being outlined and put into context in a clean way without throwing overly specific stuff in the mix.
@johndong4754
@johndong4754 6 ай бұрын
Which channels would you recommend that go into more depth?
@dharamindia563
@dharamindia563 7 ай бұрын
Excellent broad explanation of complex AI topics. One can then deep dive once a basic understanding is achieved ! Thank you
@maxjesch
@maxjesch 11 ай бұрын
So how do I get to those "soft prompts"? Do you have to use prelabeled examples for that?
@WeiweiCheng
@WeiweiCheng 6 ай бұрын
Awesome content. Thanks for uploading. It's great that the video calls out the differences between soft prompting and hard prompting. While soft prompts offer more opportunities for performance tuning, practitioners often face the following issues: - Choosing between hard prompting with a more advanced, but closed, LLM versus soft prompting with an open-sourced LLM that is typically inferior in performance. - Soft prompting is model dependent, and hard prompting is less so.
@SCP-GPT
@SCP-GPT 9 ай бұрын
You should make a guide on FlowGPT / Poe that delves into operators, delimiters, markdown, formatting, and syntax. I've been experimenting on these sites for a while, and the things they can do with prompts are mind-blowing.
@XavierPerales-zm4xx
@XavierPerales-zm4xx 4 ай бұрын
Excellent job explaining key AI terms!
@datagovernor
@datagovernor 9 ай бұрын
More important question, what type of smart/whiteboard are you using?? I love it!
@IBMTechnology
@IBMTechnology 9 ай бұрын
See ibm.biz/write-backwards
@uniqueavi91
@uniqueavi91 8 ай бұрын
crisp and informative
@RobertoNascimento-kw6gy
@RobertoNascimento-kw6gy 2 ай бұрын
Excelente video, bom trabalho
@johndevan3505
@johndevan3505 6 ай бұрын
A lot to unpack here. Great job explaining. I have one question about the difference between incontext learning and prompt tuning with hard prompts. Are they synonymous?
@Asgardinho
@Asgardinho 6 ай бұрын
how do you get the AI to generate that tunable soft prompt?
@scifithoughts3611
@scifithoughts3611 5 ай бұрын
Could you explain labeling done in fine tuning and prompt tuning?
@azadehesmaeili4402
@azadehesmaeili4402 5 ай бұрын
Could you please outline the advantages and disadvantages of fine-tuning versus prompting in the context of large language models?
@apoorvvallabh2976
@apoorvvallabh2976 7 ай бұрын
What data set for supervised learning is used in prompt tuning
@yt-sh
@yt-sh 11 ай бұрын
funny & informative 👏👏👏
@neail5466
@neail5466 11 ай бұрын
Could you please explain a little detail about the strings of numbers how those are indexed? Are those some sort of abstraction that we fully understand! Very informative lecture is this one... Probably everyone should have a little expertise in prompt engineering skill in near future.
@Chris-se3nc
@Chris-se3nc 10 ай бұрын
There are other embedding models that can take strings of concepts and transform them into embedding vectors (string of numbers). You can store those in a number of vector databases.
@8eck
@8eck 10 ай бұрын
This that soft prompt is basically a trainable parameters, which also undergoing backpropagation and its weights are updated? Just like LoRA method, where you attach new trainable parameters to the model and train only those new parameters.
@mikegioia9289
@mikegioia9289 8 ай бұрын
How do you discover the correct soft prompts?
@marc-oliviergiguere3290
@marc-oliviergiguere3290 6 ай бұрын
Very concise and information, but tell me, what technology do you use to write backwards so fast? Do you flip the board in post-production?
@IBMTechnology
@IBMTechnology 6 ай бұрын
Yes, see ibm.biz/write-backwards for details
@user-mn6bb6gi6v
@user-mn6bb6gi6v 11 ай бұрын
Hi, nice talk by the way, but what about some examples of soft turning, i understand is human unreadable, but how exactly you achieve that ? by writing some code ? extra tools ? plugins ? thanks a lot for your reply :)
@sheepcraft7555
@sheepcraft7555 9 ай бұрын
These are learnable parameters added on top the base language models. This is called soft tuning one of the example is prefix tuning. These parameters are learned.
@TimProvencio
@TimProvencio 7 ай бұрын
Does anyone know how they do these videos where it appears that they are writing on the screen. That is so neat!
@IBMTechnology
@IBMTechnology 7 ай бұрын
See ibm.biz/write-backwards
@user-mo7yq9ks1g
@user-mo7yq9ks1g 6 ай бұрын
What is unfancy design prompt?
@mohslimani5716
@mohslimani5716 11 ай бұрын
Thanks for the explanation, but still how could someone succeed in prompt engineering practically
@itdataandprocessanalysis3202
@itdataandprocessanalysis3202 11 ай бұрын
A joke by ChatGPT: Why did the Large Language Model (LLM) turn down a job as a DJ? Because it thought "Prompt Tuning" meant it would have to constantly change the music!
@arpitqw1
@arpitqw1 5 ай бұрын
not fully understood except- prompt tuning-prompt engineering- hard tuning-soft tuning. :P
@manojr4598
@manojr4598 11 ай бұрын
We are trying to create a chatbot using OpenAI API and the response should be limited to the specific topic and it should not respond to the user queries which are not related to the topic. What is the best way to achieve this ? Prompt engineering or prompt tuning ?
@indianmanhere
@indianmanhere 10 ай бұрын
Fine tuning
@fredrikt6980
@fredrikt6980 12 күн бұрын
Really like all of Martins videos but this one only explains what prompt-tuning is not.
@darkashes9953
@darkashes9953 11 ай бұрын
IBM could go for the plunge and make a Quantum computer with 10 million Quantum computer chips with 1000 Qubits and optical circuits instead of just one chip.
@rajucmita
@rajucmita 6 ай бұрын
As a newbee how come I be pro in propmt engineering
@badlaamaurukehu
@badlaamaurukehu 6 ай бұрын
Nomenclature is it's own problem.
@rongarza9488
@rongarza9488 4 ай бұрын
I learned Python in 2 months, great language. Then, I learned the SQLs that Python plays well with. Then, it hit me: AI is doing most of this work! So what is there for me and you to do? "My career may be over before it's begun". Yes, indeed UNLESS we can start using Python for regular business processing, like Accounts Receivable/Payable, Inventory Management, Order Processing, etc. In other words, we can't all be doing AI, especially when it, itself, is doing AI, cheaper, faster, and better.
@avinashpradhan5030
@avinashpradhan5030 3 ай бұрын
🙂
@kaiskermani3724
@kaiskermani3724 Ай бұрын
"A string of numbers is worth a thousand words" tf does that even mean?
@samgoodwin89
@samgoodwin89 7 ай бұрын
Is he writing backwards
@IBMTechnology
@IBMTechnology 7 ай бұрын
See ibm.biz/write-backwards
@NK-ju6ns
@NK-ju6ns 3 ай бұрын
Soft peompring is confusing
@DK-ox7ze
@DK-ox7ze 7 ай бұрын
This is too abstract. Some concrete examples would have helped.
@generichuman_
@generichuman_ 6 ай бұрын
Wow, you managed to make an 8 minute video on prompt tuning without actually talking about what it is or how one would even begin to implement it. All I gleaned from this is that it has something to do with embeddings... Do better IBM...
@scifithoughts3611
@scifithoughts3611 5 ай бұрын
I agree it’s a little obscure. I gave this a second watch through because your comment made me realize that I too wasn’t clear. Here is what I’ve noted: First step: Model creation: A model is created by training it from tons of data (very expensive to do) Because a model alone doesn’t work consistently at this point (racist, errors, hallucinations, toxic,…) it needs more work to be ready for the public. To make it ready one of the three strategies are used: fine tuning, prompt engineering, or prompt tuning with soft prompts. (All three could be used as well, I’ve read papers about such cases.) Fine tuning : Give you have a model, now you create examples about the domain the LLM will represent. The examples are labeled to help the model know what’s is going on. This strategy is labor intensive. (Labeling is another whole area to read up on.) Prompt engineering: Humans design prompts in a human language (explain to the model how to behave). Example: when I tell you a word in English, you respond with the word in French. Prompt tuning using soft prompts: Soft prompts are created by the AI using fine tuning data. These prompts are encoded (not human readable) into a vector. The above is the first six minutes of the video. Next the lecturer show these three applications by adding them to the box picture. This is confusing because it seems like he is applying all three strategies but then concludes that prompt tuning gets the best results. So I guess he is saying use prompt tuning. Since AiML is a new field, I think people will be applying many different strategies in order to get their models to work properly. And this is just scratching the surface. Every few months, people will come up with other strategies that improve the situation. 10 years from now a bunch of these strategies will be discarded and there will be other new ones. The field of ML is defining their design patterns. Pattern books will be written as solutions mature. Prompt engineering and prompt tuning are the two patterns he talks about. I hope that helps. Thinking this through has certainly helped me so thanks for the prompt. 😊
@chavruta2000
@chavruta2000 4 ай бұрын
yes. this is incredibly generic and communicates very little considering this is supposed to be from a communication theory expert.
@RajatKumar-oy9mw
@RajatKumar-oy9mw 3 ай бұрын
Totally agrees..
@bongimusprime7981
@bongimusprime7981 11 ай бұрын
ChatGPT is not an LLM lol
@fabrzy3784
@fabrzy3784 11 ай бұрын
yes it is
4 Methods of Prompt Engineering
12:42
IBM Technology
Рет қаралды 103 М.
Why Large Language Models Hallucinate
9:38
IBM Technology
Рет қаралды 167 М.
Can you beat this impossible game?
00:13
LOL
Рет қаралды 43 МЛН
格斗裁判暴力执法!#fighting #shorts
00:15
武林之巅
Рет қаралды 89 МЛН
Be kind🤝
00:22
ISSEI / いっせい
Рет қаралды 20 МЛН
Fine-tuning Large Language Models (LLMs) | w/ Example Code
28:18
Shaw Talebi
Рет қаралды 237 М.
Hypnotized AI and Large Language Model Security
13:22
IBM Technology
Рет қаралды 7 М.
Prompt Engineering Tutorial - Master ChatGPT and LLM Responses
41:36
freeCodeCamp.org
Рет қаралды 1,3 МЛН
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
What Makes Large Language Models Expensive?
19:20
IBM Technology
Рет қаралды 60 М.
Don't Use ChatGPT Until You Watch This Video
13:40
Leila Gharani
Рет қаралды 1,5 МЛН
Your understanding of evolution is incomplete. Here's why
14:21
The most important AI trends in 2024
9:35
IBM Technology
Рет қаралды 201 М.
Fine-tuning LLMs with PEFT and LoRA
15:35
Sam Witteveen
Рет қаралды 110 М.
AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"
23:47
Can you beat this impossible game?
00:13
LOL
Рет қаралды 43 МЛН