11:55 "As manufacturers of a machine that guesses entire paragraphs, we doubt that it will be possible for an attacker to guess an entire paragraph."
@KernelGhost2 ай бұрын
It is both unexpected and fascinating that token lengths alone can be utilised to determine the assistant response text with such accuracy.
@jimmy00014 күн бұрын
I love the attack vector, great discovery and extremely well demo'd and presentation. One of the cleanest and concise talks, kudos to the presenters as well, very well spoken and no fluff.
@TheJogug3 ай бұрын
Interesting idea and execution. The accuracy of Predicting First Sentence: 55% and Predicting Entire Text: 38% seems high. The sample space of the dataset prompts probably has a huge impact on these numbers.
@ChiefMasterGuru3 ай бұрын
Christ the background noise is unbearable
@DrZbo3 ай бұрын
Add it to the list of incredible talks that are fucked by poor sound
@gavinknight85603 ай бұрын
Probably the closing session in an adjacent room… defcon is a boisterous affair.
@Arisekiwi3 ай бұрын
Could be that their breaking into a system for one of the challenges that happen
@LolWutMikehSM2 ай бұрын
Consider subtitles
@ChiefMasterGuru2 ай бұрын
@@LolWutMikehSM or maybe the event can consider doing the bare minimum audio set up lmao
@shrimpkins2 ай бұрын
My goodness, what you fid'na tell me next, Liz Lemon? That some employees at AI companies might have full unencrypted access to my convos with their products? Nah, nothing to worry about there--that guy from Amazon said a long time ago that Alexa doesn't listen to anything in the room until you say "Hey, Alexa!" and I believe him. If you can't trust a company with a moral mission statement, good God man, who can you trust?!?
@xj0ex392 ай бұрын
Sam Altman has already scanned you and your entire families optics.
@SarahKchannel3 ай бұрын
if you decode the encrypted tokens, from known text, you will get a very high confidence level on thr result. That data you can use as labeled training data. Which you can use to reverse the encryption keys used. From there there is no more guessing.
@blkauxpro3 ай бұрын
Who mixed this FOH setup? It's awful. This is riddled with easily-filterable ambient noise - and from the 2nd stage?! Next Defcon call a pro to run your board and rack. I'm available.
@marco2083 ай бұрын
It's like they're watching a sports game next to this space. Keep your crowd under control if there is no isolation. Almost as if it's on purpose.
@pelic96083 ай бұрын
I really want to see that other talk now. Sounds like they revived Jeopardy. 😄
@royweiss13 ай бұрын
@@pelic9608it was the closing ceremony 😅
@AlecArmbruster3 ай бұрын
If DEFCON hired an actual professional to run their AV, then it wouldn’t be DEFCON.
@nxxxxzn3 ай бұрын
dude, be thankful there's audio at all instead of a fart on the left chan at -90dbfs
@j_t_eklund2 ай бұрын
The biggest problem is they don't employ people with the right mentality for hacking their stuff... So they never detect shit about anything possible to exploit.
@pneuma333 ай бұрын
outstanding work and very scary stuff.
@elvinaguero46512 ай бұрын
What a great work and Collaborations.
@marco2083 ай бұрын
Nice work. I like human understandable attacks. Takes some out of the box thinking to get to this.
@willhatch77212 ай бұрын
Reminds me of how they cracked enigma
@xj0ex392 ай бұрын
Was my magnum opus.
@petevenuti73552 ай бұрын
I'd have to check out what is going on next door! I wonder if the speaker was thinking that too?
@DreadFox_official3 ай бұрын
ohhh that's very interesting!!!!
@PassionforSpace2 ай бұрын
It is clear that the token-length is a vulnerabillty that needs to be addressed.I am wondering,why not encrypt the length? I mean,why not change the length into something that would make it impossible to guess the word:If a word has a token-length of 6-make it something else, if eaves dropping is happening.
@77rdcasa2 ай бұрын
Maybe, but itis probably more complexity. I like you're looking for solution, but the direction of all controllers has decided say to society, can't choose back in time. The mobility of social status are only B and C. The class A Justin making adjustments between themselves with the new commodity. Good luck 4civilization! I'm sorry if It's not understandable.
@legoguy217Ай бұрын
~50% accuracy isn't that bad, but not exactly a stat I'd be scared of. Given a set of known/trained tokens the chances of the guess being wrong is literally a coin flip. God forbid it starts responding in Spanish, then you'll need to train a model for each language.
@thomass94572 ай бұрын
How accurate is the attack if the user is speaking to a custom persona, not the default?
@yahmaar2 ай бұрын
Thank you for sharing
@RoughGanome2 ай бұрын
Great talk
@mechadense3 ай бұрын
14:06 second part
@j_t_eklund2 ай бұрын
This is why recall is bad..
@recklessroges3 ай бұрын
Nice CV application.
@JoshtheFifith2 ай бұрын
endlech a heimishe guy at defcon
@brianhirt50273 ай бұрын
So they used how much LLM crunchpower training LLM's to decode LLM's? How much power & water got blown up so they could build this training model? Where is the point of vulnerbility? Shared networks?
@FireStormOOO_3 ай бұрын
They covered this, about 2 days/$200 of compute on Azure. So almost nothing by AI standards.
@lunafoxfire3 ай бұрын
brian hirt more like brain hurt
@cit01103 ай бұрын
@@lunafoxfirefr😂😂
@brianhirt50273 ай бұрын
@@FireStormOOO_ That seems a little sus. You know how large a dataset is required just to give the models a starting point.
@brianhirt50273 ай бұрын
@@lunafoxfire sounds like you're ready for your big move to middleschool next school year, kiddo. Go find somewhere else to play now. The *adults* are talking.