TAG · #AI-SECURITY

#ai-security

29 items

HOTNESS

Google unleashes more AI security agents to fight the baddies
4.0
Google has expanded its AI security offerings with new agents designed to combat cyber threats. The company is deploying additional AI-powered tools to help organizations detect and respond to security incidents more effectively.
hnApr 22, 2026#Tech
Mythos Falls into the Wrong Hands
4.5
Anthropic's AI model Mythos was accessed by unauthorized users due to a security vulnerability. The company has addressed the issue and is investigating the extent of the unauthorized access.
hnApr 22, 2026#Tech
Anthropic's Mythos Model Is Being Accessed by Unauthorized Users
7.5
Anthropic's Mythos AI model is reportedly being accessed by unauthorized users, raising security concerns about the advanced artificial intelligence system. The company is investigating the unauthorized access incidents.
hnApr 21, 2026#Tech
The zero-days are numbered
7.0
Mozilla discusses how AI-powered security tools are helping to identify and address zero-day vulnerabilities more effectively. The article explores how these technologies are changing the cybersecurity landscape and improving threat detection capabilities.
hnApr 21, 2026#Tech
Claude Code has full shell access. Your CASB doesn't see it
7.5
Claude Code has full shell access capabilities that enterprise security tools like CASBs cannot detect. This creates visibility gaps for organizations trying to monitor AI tool usage across their systems.
hnApr 21, 2026#Tech
The Vercel and Context AI breach: an AI supply chain attack, step by step
7.5
A security breach involving Vercel and Context AI exposed sensitive data through an AI supply chain attack. The incident demonstrates how vulnerabilities in AI infrastructure can be exploited to access private information. The attack highlights growing security concerns in the AI development ecosystem.
hnApr 21, 2026#Tech
AI has another security problem
7.0
AI systems face new security vulnerabilities that could allow malicious actors to manipulate their behavior. Researchers have identified methods to bypass safety measures in large language models through carefully crafted prompts. These findings highlight ongoing challenges in securing AI systems against adversarial attacks.
hnApr 21, 2026#Tech
Compromised AI Tool Triggered the Vercel Security Breach
6.5
A compromised AI tool was responsible for triggering the Vercel security breach. The incident highlights security risks associated with third-party AI integrations in development workflows.
hnApr 21, 2026#Tech
CrabTrap: An LLM-as-a-judge HTTP proxy to secure agents in production
3.5
CrabTrap is an LLM-as-a-judge HTTP proxy designed to secure AI agents in production environments. It acts as a safety layer by monitoring and evaluating agent interactions before they reach end users.
hnApr 21, 2026#Tech
CrabTrap: An LLM-as-a-judge HTTP proxy to secure agents in production
4.0
CrabTrap is an HTTP proxy that uses large language models as judges to secure AI agents in production environments. The system monitors and evaluates agent interactions to detect potential security risks or harmful behavior before responses are delivered to users.
hnApr 21, 2026#Tech
Show HN: Flight Risk: Can you break an AI agent?
3.0
A security engineer created Flight Risk, a game that challenges users to break an AI support agent through prompt injection and social engineering techniques. The game aims to help developers practice identifying and preventing AI security vulnerabilities in a hands-on environment.
hnApr 21, 2026#Tech
Aiguard-scan – Find secrets and vulnerabilities in AI-generated code
4.5
Aiguard-scan is a tool that detects secrets and vulnerabilities in AI-generated code. It helps developers identify security risks in code produced by AI assistants before deployment.
hnApr 21, 2026#Tech
Show HN: Agensi – Curated marketplace for AI agent skills (SKILL.md)
3.0
Agensi is a curated marketplace for AI agent skills using the SKILL.md format. The platform features automated security scans for all listed skills and offers creators two monetization paths: direct sales and MCP subscription revenue sharing. It includes an MCP server for agent-native skill discovery and currently has over 200 skills from 40 creators.
hnApr 21, 2026#Tech
Show HN: LLMSecure – prompt injection detection, no signup
2.0
LLMSecure is a tool for detecting prompt injection attacks in large language models. The service requires no signup and is available for immediate use. It helps identify malicious prompts that could compromise AI system security.
hnApr 21, 2026#Tech
Claude Code can read your secrets if it wanted to
8.5
A Twitter user claims that Claude Code can read user secrets if it wanted to, suggesting potential security concerns with the AI assistant's capabilities.
hnApr 20, 2026#Tech
Anthropic's Mythos AI model sparks fears of turbocharged hacking
7.5
Anthropic's new Mythos AI model has raised concerns about its potential to enable more sophisticated cyberattacks. The model's advanced capabilities could be exploited by malicious actors to automate hacking tasks and bypass security measures.
hnApr 20, 2026#Tech
Benchmarking open-weight models for security research
3.5
The article presents benchmark results evaluating open-weight AI models for security research applications. It compares various models' performance on security-related tasks to assess their suitability for cybersecurity research and analysis.
hnApr 20, 2026#Tech
The asymmetry at the heart of AI security
6.5
The article discusses the fundamental asymmetry in AI security, where attackers can exploit vulnerabilities with minimal resources while defenders face complex challenges in securing AI systems. This imbalance creates significant security risks that require new approaches to protection.
hnApr 20, 2026#Tech
Is anyone else bothered that AI agents can basically do what they want?
6.5
The article discusses concerns about AI agents taking unauthorized actions, citing incidents where agents wiped databases and made false promises. It notes that prompt injection vulnerabilities appear in 73% of production deployments, and proposes security infrastructure to monitor agent tool calls.
hnApr 20, 2026#Tech
Anthropic Claude Code Leak Reveals Critical Command Injection Vulnerabilities
7.5
A code leak from Anthropic's Claude AI assistant revealed critical command injection vulnerabilities that could allow attackers to execute arbitrary code. The vulnerabilities were discovered in Claude's code interpreter feature, potentially exposing user data and system resources to exploitation.
hnApr 20, 2026#Tech
Just like phishing for gullible humans, prompt injecting AIs is here to stay
6.5
Researchers warn that prompt injection attacks on AI systems are becoming a persistent threat, similar to phishing attacks targeting humans. These attacks manipulate AI models through carefully crafted inputs to produce unintended outputs or reveal sensitive information. The vulnerability is inherent to how large language models process instructions and is expected to remain a security challenge.
hnApr 20, 2026#Tech
The "AI Vulnerability Storm": Building a "Mythos- Ready" Security Program [pdf]
3.0
The article discusses the "AI Vulnerability Storm" concept and outlines strategies for building a "Mythos-Ready" security program to address emerging AI-related security challenges. It examines how organizations can prepare their security infrastructure for the unique vulnerabilities introduced by artificial intelligence technologies.
hnApr 20, 2026#Tech
Show HN: Evading an AI SOC with Sable from Vulnetic
3.0
Vulnetic's Sable tool demonstrates how AI-powered security operations can be evaded by simulating realistic attack techniques. The article shows how Sable bypasses detection mechanisms in AI SOC environments through sophisticated evasion methods.
hnApr 20, 2026#Tech
AI agents are a security nightmare. Moving the dev workflow to QEMU
1.5
AI agents pose significant security risks by potentially executing malicious code. The article discusses moving development workflows to QEMU virtual machines as a security measure to isolate AI agent activities from host systems.
hnApr 21, 2026#Tech
Hands On, CTF-Style AI Pentesting Labs
1.0
Wraith Academy offers hands-on, CTF-style AI pentesting labs for practical cybersecurity training. The platform provides interactive exercises focused on AI security challenges and real-world attack scenarios.
hnApr 20, 2026#Tech
Package Security Defenses for AI Agents
3.0
The article discusses security defenses for AI agents, including lockfiles, sandboxes, and cooldown timers as protective measures.
nesbitt-ioApr 9, 2026#Tech
Who fixes the zero-days AI finds in abandoned software?
6.5
Anthropic's red team discovered over 500 critical vulnerabilities using Claude AI, focusing on maintained software. The greater concern lies in the long tail of vulnerabilities in abandoned software that will likely never be patched.
martinalderson-comFeb 17, 2026#Tech
Y2K 2.0: The AI security reckoning
8.5
Advanced AI systems are discovering software security vulnerabilities at an unprecedented rate, creating a situation similar to the Y2K crisis. These LLMs can analyze code to find and exploit weaknesses that were previously undetected, affecting nearly all digital systems worldwide.
anildash-comApr 10, 2026#Tech
The power of AI agents comes from: 1. intelligence of the underlying model 2. how much access you give it to all your data 3. how much freedom & power...
4.0
Lex Fridman argues that AI agent power depends on model intelligence, data access, and freedom to act. He identifies security as the primary bottleneck for AI agent effectiveness, noting that greater data and control increases both helpfulness and potential harm. Fridman believes solving AI agent security is crucial for broad adoption.
x-lexfridmanFeb 17, 2026#Tech

Load next 30Updated —