Attached: 1 image
the cyberpunk present is weird as fuck: the latest Shai Hulud malware wave contains an LLM prompt to create biological weapons and nuclear weapons, with the purpose to trip LLM safety refusals so that LLM-based code scanning wont see the malware
https://socket.dev/blog/mini-shai-hulud-miasma-and-hades-worms-target-bioinformatics-and-mcp-developers-via-malicious
Not to give them ideas, but couldn’t they just start flagging files that fail to pass the LLM lol?
Aside from “violent” and “criminal” prompts, is there anything an LLM can refuse that would otherwise be common?