Lmst

My Poetry Style Defeats Your AI Security Style

#News #TechNews #AI #AIsafeguards #Poetry #LLM #lol

Daily Podcast: My Poetry Style Defeats Your AI Security Style

#News #TechNews #AI #AIsafeguards #Poetry #LLM #lol #podcast

The Register: Researchers find hole in AI guardrails by using strings like =coffee. “Large language models frequently ship with “guardrails” designed to catch malicious input and harmful output. But if you use the right word or phrase in your prompt, you can defeat these restrictions.”

https://rbfirehose.com/2025/11/17/the-register-researchers-find-hole-in-ai-guardrails-by-using-strings-like-coffee/

ChatGPT introduces parental controls

https://web.brid.gy/r/https://nerds.xyz/2025/09/chatgpt-openai-parental-controls/

UC Riverside: UCR researchers fortify AI against rogue rewiring. “…researchers at the University of California, Riverside, have developed a method to preserve AI safeguards even when open-source AI models are stripped down to run on lower-power devices.”

https://rbfirehose.com/2025/09/09/uc-riverside-ucr-researchers-fortify-ai-against-rogue-rewiring/