Discussion · bonfire.cafe

Google won’t fix ‘ASCII smuggling’ hack in Gemini AI

‘the issue can only result in social engineering’

https://www.youtube.com/watch?v=Yr8ENG1y5Cw&list=UU9rJrMVgcXTfa8xuMnbhAEA - video
https://pivottoai.libsyn.com/20251011-google-wont-fix-ascii-smuggling-hack-in-gemini-ai - podcast

time: 3 min 47 sec

con man holds up two empty hands, and picks pocket of the other guy with a third hand

David Chisnall (*Now with 50% more sarcasm!*)

@david_chisnall@infosec.exchange replied · 2 months ago

@davidgerard I really don’t understand this. Unlike most prompt injection issues (which are intrinsic to how the models work internally) this one is entirely fixable at the tokenisation stage. The only explanation that makes sense to me is that they don’t want politicians to say ‘you fixed these prompt injection attacks, so you must be able to fix all of them and we are going to mandate in AI safety laws that models are not subject to prompt injection and that sellers are liable for all damage done as a result’. Which would, of course, kill their ability to sell these things.

And even that, I can see backfiring because it’s easy for sanity advocates to argue that Google is unwilling to fix this and so regulation is necessary, and then later point out that regulations typically don’t name explicit exploit techniques and so the thing that requires liability is the broad category of prompt injection.

SpaceLifeForm

@SpaceLifeForm@infosec.exchange replied · 2 months ago

@davidgerard

Interesting. I got s SMS spam from Gemini recently. I can not delete it. If I try, it just nags me to sign up. I takes over the UI. So, it remains to be forever ignored.

#UI #UX #SocialEngineering

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances

Bonfire social · 1.0.1-alpha.8 no JS en

Automatic federation enabled