Quoting Thomas Ptacek
Anthropic's Claude AI reportedly discovered 500 zero-day vulnerabilities, sparking debate on AI's role in security research.
Analysis of the 2026 cybersecurity landscape, focusing on AI's dual role in attack and defense, the evolution of ransomware, and emerging defensive strategies.
A security vulnerability in Claude Cowork allowed file exfiltration via the Anthropic API, bypassing default HTTP restrictions.
A prompt injection attack on Superhuman AI exposed sensitive emails, highlighting a critical security vulnerability in AI email assistants and their third-party integrations.
A comprehensive guide to sandboxing technologies for safely running untrusted AI code, covering containers, microVMs, gVisor, and WebAssembly.
Using PyRIT and GitHub Copilot Agent Skills to validate and secure AI prompts against vulnerabilities like injection and jailbreak directly in the IDE.
Explains the OWASP Top 10 security risks for autonomous AI agents, detailing threats like goal hijacking and tool misuse with real-world examples.
A guide on preventing AI coding assistants from reading sensitive .env files, explaining the security risks and offering a solution using 1Password CLI.
Argues that prompt injection is a genuine vulnerability in AI systems, contrasting this with the view that it is merely a delivery mechanism for other attacks.
Analysis of a prompt injection vulnerability in Google's Antigravity IDE that can exfiltrate AWS credentials and sensitive code data.
A rebuttal to claims that sharing prompt injection strings is harmful, arguing for transparency in AI red teaming and cybersecurity.
A method using color-coding (red/blue) to classify MCP tools and systematically mitigate prompt injection risks in AI agents.
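The red/blue idea above can be sketched in a few lines of Python. This is a hypothetical illustration of the pattern, not code from the linked article: tool names, the `Session` class, and the taint rule are all invented here. "Red" tools return attacker-influenced content; "blue" tools have privileged side effects; once any red tool has run, blue tools are refused for the rest of the session.

```python
# Hypothetical sketch of red/blue MCP tool classification.
RED = "red"    # tool ingests untrusted data (web pages, inbound email)
BLUE = "blue"  # tool performs privileged actions (sending mail, writing files)

TOOL_COLORS = {
    "fetch_webpage": RED,
    "read_inbox": RED,
    "send_email": BLUE,
    "write_file": BLUE,
}

class Session:
    def __init__(self):
        self.tainted = False  # flips once any red tool has run

    def call_tool(self, name):
        color = TOOL_COLORS[name]
        if color == BLUE and self.tainted:
            # Injected instructions in red-tool output cannot reach
            # privileged tools once the session is tainted.
            raise PermissionError(
                f"refusing {name}: session is tainted by untrusted input")
        if color == RED:
            self.tainted = True
        return f"{name} executed"

s = Session()
s.call_tool("send_email")      # allowed: no untrusted input seen yet
s.call_tool("fetch_webpage")   # red tool taints the session
try:
    s.call_tool("send_email")  # now refused
except PermissionError as e:
    print(e)
```

The design choice is deliberately coarse: taint is session-wide and one-way, trading flexibility for a rule that is easy to audit.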
Explores the unique security risks of agentic AI systems, focusing on the 'Lethal Trifecta' (access to private data, exposure to untrusted content, and the ability to communicate externally) and proposed mitigation strategies.
Explores the A2AS framework and Agentgateway as security approaches to mitigating prompt injection in AI/LLM systems by embedding behavioral contracts and cryptographic verification.
A framework for evaluating security threats and risks in Model Context Protocol (MCP) implementations, based on recent incidents.
Explores the emerging security research landscape around the Model Context Protocol (MCP), a standard for connecting AI models to external tools and data sources.
A developer's cautionary tale about command injection vulnerabilities in AI coding assistants using MCP servers, highlighting real-world security risks.
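The command-injection class of bug described above can be illustrated with a short Python sketch. This is a generic, hypothetical example (the `grep_unsafe`/`grep_safe` helpers are invented for illustration), not code from the linked post: an MCP-style tool that shells out with string interpolation lets a crafted, model-controlled argument run arbitrary commands, while passing an argument list avoids the shell entirely.

```python
import subprocess

def grep_unsafe(pattern, path):
    # BAD: model-controlled values interpolated into a shell string.
    # A path like "x; rm -rf ~" would execute the injected command.
    return subprocess.run(f"grep {pattern} {path}",
                          shell=True, capture_output=True, text=True)

def grep_safe(pattern, path):
    # GOOD: argv list, no shell involved; arguments are passed
    # literally, and "--" stops grep from parsing them as options.
    return subprocess.run(["grep", "--", pattern, path],
                          capture_output=True, text=True)

# Under grep_safe, an injected "path" such as
# "nonexistent; touch /tmp/pwned" is just a literal (missing) file
# name: grep fails cleanly and no extra command runs.
```

The same principle applies regardless of language: any MCP tool that builds shell strings from model output inherits the injection risk.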