#AIsafeguards

Nick EspinosaNickAEsp
2025-11-24

My Poetry Style Defeats Your AI Security Style

youtu.be/kVpz6pUQ0Zs

Nick EspinosaNickAEsp
2025-11-24

Daily Podcast: My Poetry Style Defeats Your AI Security Style

soundcloud.com/nickaesp/psa

2025-11-17

The Register: Researchers find hole in AI guardrails by using strings like =coffee. “Large language models frequently ship with “guardrails” designed to catch malicious input and harmful output. But if you use the right word or phrase in your prompt, you can defeat these restrictions.”

https://rbfirehose.com/2025/11/17/the-register-researchers-find-hole-in-ai-guardrails-by-using-strings-like-coffee/

NERDS.xyz – Real Tech News for Real Nerdsnerds.xyz@web.brid.gy
2025-09-29
2025-09-09

UC Riverside: UCR researchers fortify AI against rogue rewiring. “…researchers at the University of California, Riverside, have developed a method to preserve AI safeguards even when open-source AI models are stripped down to run on lower-power devices.”

https://rbfirehose.com/2025/09/09/uc-riverside-ucr-researchers-fortify-ai-against-rogue-rewiring/

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst