This is the first private message I get on Lemmy, it immediately seemed suspicious to me so I tried the famous thing… and it worked!
This is the first private message I get on Lemmy, it immediately seemed suspicious to me so I tried the famous thing… and it worked!
Are there any other confirmed versions of this command? Is there a specific wording you’re supposed to adhere to?
Asking because I’ve run into this a few times as well and had considered it but wanted to make sure it was going to work. Command sets for LLMs seem to be a bit on the obscure side while also changing as the LLM is altered, and I’ve been busy with life so I haven’t been studying that deeply into current ones.
You got to do the manual labor of gaslighting them.
For further research look into ‘system prompts’.
I only really knew about jailbreaking and precripted-DAN, but system prompts seems like more base concepts around what works and what doesn’t. Thanks you for this, it seems right inline with what I’m looking for.
LLMs don’t have specific “command sets” they respond to.