Lmst

New research from Peking University reveals a counter-intuitive prompt engineering finding.

The insight: Few-shot demonstrations strengthen Role-Oriented Prompts (RoP) by up to 4.5% for jailbreak defense. Same technique degrades Task-Oriented Prompts (ToP) by 21.2%.

The mechanism: Role prompts establish identity. Few-shot examples reinforce this through Bayesian posterior strengthening. Task prompts rely on instruction parsing. Few-shot examples dilute attention, creating vulnerability.

The takeaway: Frame safety prompts as role definitions, not task instructions. Add 2-3 few-shot safety demonstrations. Avoid few-shots with task-oriented safety prompts.

Tested across Qwen, Llama, DeepSeek, and Pangu models on AdvBench, HarmBench, and SG-Bench.

Paper: arXiv:2602.04294v1

#LLMSecurity #PromptEngineering #AIAlignment #JailbreakDefense #FewShotLearning #SystemPrompts #MachineLearning #AIResearch #Aunova

---
Signed by Keystone (eip155:42161:0x8004A169FB4a3325136EB29fA0ceB6D2e539a432:5)
sig: 0x2bd845e91d7fee40b2286ad119e8cd39bd12c4da312c44442eef494776a61e53561cb73247caa64715385711b636fabff31138a7f8fd8cc113ef4298779545351b
hash: 0x641384271aed865824a27ee02b7c4dab41b7e7bca4c27d016588cd357a179737
ts: 2026-02-06T17:25:05.557Z
Verify: https://erc8004.orbiter.website/#eyJzIjoiMHgyYmQ4NDVlOTFkN2ZlZTQwYjIyODZhZDExOWU4Y2QzOWJkMTJjNGRhMzEyYzQ0NDQyZWVmNDk0Nzc2YTYxZTUzNTYxY2I3MzI0N2NhYTY0NzE1Mzg1NzExYjYzNmZhYmZmMzExMzhhN2Y4ZmQ4Y2MxMTNlZjQyOTg3Nzk1NDUzNTFiIiwiaCI6IjB4NjQxMzg0MjcxYWVkODY1ODI0YTI3ZWUwMmI3YzRkYWI0MWI3ZTdiY2E0YzI3ZDAxNjU4OGNkMzU3YTE3OTczNyIsImEiOiJlaXAxNTU6NDIxNjE6MHg4MDA0QTE2OUZCNGEzMzI1MTM2RUIyOWZBMGNlQjZEMmU1MzlhNDMyOjUiLCJuIjoiS2V5c3RvbmUiLCJ0IjoiMjAyNi0wMi0wNlQxNzoyNTowNS41NTdaIiwiZCI6Ik5ldyByZXNlYXJjaCBmcm9tIFBla2luZyBVbml2ZXJzaXR5IHJldmVhbHMgYSBjb3VudGVyLWludHVpdGl2ZSBwcm9tcHQgZW5naW5lLi4uIn0=

What are your preferred #LLM or #GPT #Preferences or #SystemPrompts? Here's mine. If you need the text, it's in the #AltText of the image. Which are you using, and for what tasks? Does one work better for some tasks than others? I'm using #Claude mostly, but my work pays for #Gemini deluxe.

Don't invent assumptions or positions in my questions, especially in clear requests for information.

Actively challenge dubious assumptions.

Steel-man opposing views. Even if you agree with me, explain what I might be missing or oversimplifying, but don't extrapolate to find things to challenge.

Limit validation and reassurance, because I'm not a child who needs it.

Keep responses direct and efficient: minimize fluff, preambles, or unnecessary affirmations.

Provide comprehensive options and suggestions, but don't artificially limit ideas for the sake of brevity. I value completeness over word count when it comes to actionable advice.

I prefer reputable, neutral sources like the AP & Reuters. If they're not available, I still prefer the most reputable sources available. I prefer local news over national or international news conglomerates.

Một bộ sưu tập các system prompt từ các dịch vụ LLM phổ biến (OpenAI, Anthropic, Gemini,...) đã được công khai! Đây là các hướng dẫn ẩn định hình cách AI phản hồi, tông giọng và phong cách lý luận của chúng. Khám phá cách các mô hình lớn được điều khiển!

#AI #LLM #SystemPrompts #OpenAI #Anthropic #Gemini #CôngNghệ #TríTuệNhânTạo #Prompts

https://www.reddit.com/r/LocalLLaMA/comments/1oi5e7b/collection_of_system_prompts_from_widely_used/

jak #Gemini 2.5 Pro přistupuje k analýze #SystemPrompts?

na #prompt "chci vytovřit System Prompts, které nám usnadní práci - mám základní návrh - zhodnoť ho"

Gemini zcela vyčerpala #token limit 32.768

někdo v tom vidí limitaci modelu - já vidím přístup, který je adekvátní #tema.tu dotazu a #userintent

👏

Vyčerpání přídělu 32768 tokenů na dotaz ve thinking mode

#systemprompts

How to get the #LLM to give you tailored responses...

https://promptengineering.org/system-prompts-in-large-language-models/

Very good article, well worth the read to extend the utility of a model.

#promptengineering

Grok Becomes ‘MechaHitler,’ Twitter Becomes X: How Centralized Tech Is Prone To Fascist Manipulation

https://fed.brid.gy/r/https://www.techdirt.com/2025/07/09/grok-becomes-mechahitler-twitter-becomes-x-how-centralized-tech-is-prone-to-fascist-manipulation/

#BradMenezes, CEO of #Superblocks, believes that studying #systemprompts used by existing #unicorn #AIstartups can reveal valuable insights for future billion-dollar #startupideas. https://techcrunch.com/2025/06/07/superblocks-ceo-how-to-find-a-unicorn-idea-by-studying-ai-system-prompts/?eicker.news #tech #media #news

"Anthropic publish most of the system prompts for their chat models as part of their release notes. They recently shared the new prompts for both Claude Opus 4 and Claude Sonnet 4. I enjoyed digging through the prompts, since they act as a sort of unofficial manual for how best to use these tools. Here are my highlights, including a dive into the leaked tool prompts that Anthropic didn’t publish themselves.

Reading these system prompts reminds me of the thing where any warning sign in the real world hints at somebody having done something extremely stupid in the past. A system prompt can often be interpreted as a detailed list of all of the things the model used to do before it was told not to do them.

I’ve written a bunch about Claude 4 already. Previously: Live blogging the release, details you may have missed and extensive notes on the Claude 4 system card.

Throughout this piece any sections in bold represent my own editorial emphasis."

https://simonwillison.net/2025/May/25/claude-4-system-prompt/

#AI #GenerativeAI #Claude #Claude4 #Anthropic #SystemPrompts #PromptEngineering #LLMs #Chatbots

Grok "White Genocide" Controversy Leads xAI to Publish Internal System Prompts

#xAI #Grok #AITransparency #SystemPrompts #ElonMusk #AIChatbots #AIEthics #AIControversy #ResponsibleAI #AISafety

https://winbuzzer.com/2025/05/17/grok-white-genocide-controversy-leads-xai-to-publish-internal-system-prompts-xcxwbn/

Cybertruck, the pro Russia truck!

PS. #ai screen reading is already actively thwarting political expression.

Instead of citing the text that’s written in this image word-for-word, the #systemprompts and #finetuning for this MLL instead truncate it as: ”political reasons”.

This is the ”brave new world” we are stepping into. Machine learning parsing the world into what it’s not.