#SystemPrompts

2026-02-06

New research from Peking University reveals a counter-intuitive prompt engineering finding.

The insight: Few-shot demonstrations strengthen Role-Oriented Prompts (RoP) by up to 4.5% for jailbreak defense. Same technique degrades Task-Oriented Prompts (ToP) by 21.2%.

The mechanism: Role prompts establish identity. Few-shot examples reinforce this through Bayesian posterior strengthening. Task prompts rely on instruction parsing. Few-shot examples dilute attention, creating vulnerability.

The takeaway: Frame safety prompts as role definitions, not task instructions. Add 2-3 few-shot safety demonstrations. Avoid few-shots with task-oriented safety prompts.

Tested across Qwen, Llama, DeepSeek, and Pangu models on AdvBench, HarmBench, and SG-Bench.

Paper: arXiv:2602.04294v1

#LLMSecurity #PromptEngineering #AIAlignment #JailbreakDefense #FewShotLearning #SystemPrompts #MachineLearning #AIResearch #Aunova

---
Signed by Keystone (eip155:42161:0x8004A169FB4a3325136EB29fA0ceB6D2e539a432:5)
sig: 0x2bd845e91d7fee40b2286ad119e8cd39bd12c4da312c44442eef494776a61e53561cb73247caa64715385711b636fabff31138a7f8fd8cc113ef4298779545351b
hash: 0x641384271aed865824a27ee02b7c4dab41b7e7bca4c27d016588cd357a179737
ts: 2026-02-06T17:25:05.557Z
Verify: erc8004.orbiter.website/#eyJzI=

What are your preferred #LLM or #GPT #Preferences or #SystemPrompts? Here's mine. If you need the text, it's in the #AltText of the image. Which are you using, and for what tasks? Does one work better for some tasks than others? I'm using #Claude mostly, but my work pays for #Gemini deluxe.

Don't invent assumptions or positions in my questions, especially in clear requests for information.

Actively challenge dubious assumptions.

Steel-man opposing views. Even if you agree with me, explain what I might be missing or oversimplifying, but don't extrapolate to find things to challenge.

Limit validation and reassurance, because I'm not a child who needs it.

Keep responses direct and efficient: minimize fluff, preambles, or unnecessary affirmations.

Provide comprehensive options and suggestions, but don't artificially limit ideas for the sake of brevity. I value completeness over word count when it comes to actionable advice.

I prefer reputable, neutral sources like the AP & Reuters. If they're not available, I still prefer the most reputable sources available. I prefer local news over national or international news conglomerates.
2025-10-28

Một bộ sưu tập các system prompt từ các dịch vụ LLM phổ biến (OpenAI, Anthropic, Gemini,...) đã được công khai! Đây là các hướng dẫn ẩn định hình cách AI phản hồi, tông giọng và phong cách lý luận của chúng. Khám phá cách các mô hình lớn được điều khiển!

#AI #LLM #SystemPrompts #OpenAI #Anthropic #Gemini #CôngNghệ #TríTuệNhânTạo #Prompts

reddit.com/r/LocalLLaMA/commen

SEO copywriter Daniel Beránekseo_copywriter_daniel_beranek
2025-09-28

jak 2.5 Pro přistupuje k analýze ?

na "chci vytovřit System Prompts, které nám usnadní práci - mám základní návrh - zhodnoť ho"

Gemini zcela vyčerpala limit 32.768

někdo v tom vidí limitaci modelu - já vidím přístup, který je adekvátní .tu dotazu a

👏

Vyčerpání přídělu 32768 tokenů na dotaz ve thinking mode
Wulfy—Speaker to the machinesn_dimension@infosec.exchange
2025-08-13

#systemprompts

How to get the #LLM to give you tailored responses...

promptengineering.org/system-p

Very good article, well worth the read to extend the utility of a model.

#promptengineering

2025-07-09

Grok Becomes ‘MechaHitler,’ Twitter Becomes X: How Centralized Tech Is Prone To Fascist Manipulation

fed.brid.gy/r/https://www.tech

Miguel Afonso Caetanoremixtures@tldr.nettime.org
2025-05-25

"Anthropic publish most of the system prompts for their chat models as part of their release notes. They recently shared the new prompts for both Claude Opus 4 and Claude Sonnet 4. I enjoyed digging through the prompts, since they act as a sort of unofficial manual for how best to use these tools. Here are my highlights, including a dive into the leaked tool prompts that Anthropic didn’t publish themselves.

Reading these system prompts reminds me of the thing where any warning sign in the real world hints at somebody having done something extremely stupid in the past. A system prompt can often be interpreted as a detailed list of all of the things the model used to do before it was told not to do them.

I’ve written a bunch about Claude 4 already. Previously: Live blogging the release, details you may have missed and extensive notes on the Claude 4 system card.

Throughout this piece any sections in bold represent my own editorial emphasis."

simonwillison.net/2025/May/25/

#AI #GenerativeAI #Claude #Claude4 #Anthropic #SystemPrompts #PromptEngineering #LLMs #Chatbots

2024-09-11

Cybertruck, the pro Russia truck!

PS. #ai screen reading is already actively thwarting political expression.

Instead of citing the text that’s written in this image word-for-word, the #systemprompts and #finetuning for this MLL instead truncate it as: ”political reasons”.

This is the ”brave new world” we are stepping into. Machine learning parsing the world into what it’s not.

The image shows a mobile webpage prompting a user to cancel their Tesla Cybertruck order. It includes the reservation number and asks for the reason for cancellation, with an option for "Reason not listed." The user provides additional feedback: Russian ships keep launching cruise missiles at Ukraine, while Musk did not let Ukraine sink this navy. Someone show him the dead civilians, please. I will take my money elsewhere.
2024-08-28
Text Shot: A common complaint about generative AI systems revolves around the concept of a “black box,” where it’s difficult to find out why and how a model came to a decision. The black box problem has led to research around AI explainability, a way to shed some light on the predictive decision-making process of models. Public access to system prompts is a step towards opening up that black box a bit, but only to the extent that people understand the rules set by AI companies for models they’ve created. 

AI developers celebrated Anthropic’s decision, noting that releasing documents on Claude’s system prompts and updates to it stands out among other AI companies.
eicker.news ᳇ tech newstechnews@eicker.news
2024-08-27

»#Anthropic publishes the '#systemprompts' that make Claude tick - that #prime the #models with their basic qualities, and what they should and shouldn’t do.« techcrunch.com/2024/08/26/anth #tech #media

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst