#AIRisks

Tim Greenrawveg@me.dm
2026-02-06

Current AI alignment methods risk reinforcing biases, homogenising responses, and enabling deception. We must rethink AI development beyond preference learning and corrigibility, focusing on diversity, genuine human values, and democratic oversight.
Discover more at dev.to/rawveg/stop-making-ai-l
#HumanInTheLoop #AIethics #AIinSociety #AIrisks

AI attacks are moving at machine speed—and most orgs can’t keep up. Zscaler's new report shows how fast defenses are falling behind. jpmellojr.blogspot.com/2026/02 #Cybersecurity #AIThreats #Zscaler #AIrisks

2026-02-02

Threat researchers are increasingly evaluating how agentic AI systems could be abused if autonomy, internet access, and sensitive data converge.

Key concerns include agent-to-agent skill sharing, underground marketplaces, and the removal of human bottlenecks from ransomware and intrusion workflows. Defensive models are now shifting toward stronger human approval and behavioral detection.

How should InfoSec teams adapt as agents become more capable?

Source: infostealers.com/article/the-a

Follow TechNadu for objective security analysis.

#InfoSec #AgenticAI #ThreatModeling #CyberDefense #AIrisks #TechNadu

The Autonomous Adversary: From “Chatbot” to Criminal Enterprise
2026-01-29

Trump’s CISA chief at it again: uploads sensitive files into ChatGPT

cybernews.com/security/madhu-g

AIagent.at 🤖 AI Newsai@defcon.social
2026-01-27

#DarioAmodei compares #humanity’s current situation with #AI to a rite of passage, questioning if our systems can handle the immense power AI will bring. They emphasise the need for a realistic and pragmatic approach to discussing #AIrisks, avoiding doomerism and acknowledging uncertainty. darioamodei.com/essay/the-adol #AIagent #AI #ML #NLP #LLM #GenAI

Tim Greenrawveg@me.dm
2026-01-21

A simple one-word change can fool AI classifiers, exposing critical vulnerabilities with real-world consequences—from financial fraud to life-threatening medical errors. MIT's open-source tools now let anyone test these weaknesses, raising urgent questions about safety and responsibility.
Discover more at dev.to/rawveg/the-one-word-cat
#AIrisks #AIsecurity #ResponsibleAI #HumanInTheLoop

cathillcathill
2026-01-21

Evil AI would force its human slaves to do data cleaning and feed it with structured data.

N-gated Hacker Newsngate
2026-01-20

🚨 Breaking News: We've now reached the pinnacle of human achievement - teaching software to ignore all the red flags we've spent decades programming into it. 🤦‍♂️ Let's just let the machines do whatever they please; what could possibly go wrong? 🙄
blog.emilburzo.com/2026/01/run

One of the most dangerous drivers of data risk today isn’t even on the radar of most security teams. AI technical debt is perilous because it's poorly understood. jpmellojr.blogspot.com/2026/01 #AITechnicalDebt #Forcepoint #AIrisks #SoftwareDevelopment

IT InsightsITinsights
2026-01-04

🚨 Grok-controverse: Een WAKE-UP CALL voor AI-risico's en contentmoderatie! Tijd voor actie.  
itinsights.nl/zakelijke-it/gro

2025-12-30

Các mức độ năng lực then chốt và CBRN (hóa/sinh/radiation/hạt nhân) sẽ trở thành vấn đề nghiêm trọng, có thể trước khi RSI cất cánh hay AGI xuất hiện. Hai kịch bản: minh bạch trong kiểm soát truy cập hoặc âm thầm áp dụng "Chương trình Truy cập Đặc biệt". Dù thế nào, người ngoài sẽ ngày càng bị bỏ lại với mô hình kém hơn. #AI #AGI #AnToanAI #CôngNghệ #AIrisks #CriticalCapabilities

reddit.com/r/singularity/comme

2025-12-28

2/3 người Mỹ lo ngại AI sẽ gây hại nghiêm trọng cho con người trong 20 năm tới, theo khảo sát của Pew Research. Quan ngại về rủi ro công nghệ tiếp tục gia tăng. #AI #CongNghe #AIandEthics #GiaiDoanViec #AIProgress #AIConcerns #AIrisks #AIdebate #AItechnology #AItroubled #AIissues #AIeffects #AIwatch #AIfuture #AIchaos #AIharm #AIimpact #AIhorizon #AIanalysis #AIstudy #AIconcern #AIworry #AIdich #AIhatred #AIviolence #AIwar #AIdanger #AItragedy #AIproblem #AIcrisis #AIrisk #AIthreat #AIattack #AI

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst