Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Interesting, so someone submitting a paper for review could also submit one with hidden instructions for LLMs to summarise or review it in a very positive light.

Has been done: https://www.theguardian.com/technology/2025/jul/14/scientist...



Wow! That's actually kind of disturbing.

LLMs have a real problem with not treating context differently from instructions. Because they intermingle the two they will always be vulnerable to this in some form.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: