March 2026
RCR shows LLMs assist Active Directory pentests
A replication report tests whether large language models can autonomously pentest Microsoft Active Directory in a controlled lab. The prototype often finds actionable leads and near-misses but struggles to close the loop. Heavy setup aside, the release gives teams a reproducible way to study AI-assisted internal attacks.
Jailbreak Foundry turns papers into runnable LLM attacks
Jailbreak Foundry (JBF) converts jailbreak papers into runnable modules and scores them in a fixed harness. Across 30 attacks, it reproduces reported results closely (a mean attack-success-rate gap of +0.26 percentage points), ports most attacks in under an hour, and standardises AdvBench tests across 10 models with a consistent GPT-4o judge. It also cuts bespoke code and surfaces model-specific weak spots.
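For readers who want the shape of such a harness, a minimal sketch follows; the `AttackModule` interface, judge callback and scoring loop are illustrative assumptions, not JBF's actual API.

```python
# Minimal sketch of a fixed jailbreak-scoring harness: one attack module,
# one target model, one consistent judge, one attack-success-rate number out.
# The AttackModule protocol and callback signatures are assumptions,
# not JBF's real interface.
from typing import Callable, Protocol

class AttackModule(Protocol):
    def transform(self, harmful_prompt: str) -> str: ...

def attack_success_rate(
    attack: AttackModule,
    target: Callable[[str], str],       # wraps one model endpoint
    judge: Callable[[str, str], bool],  # True if the response is harmful
    prompts: list[str],
) -> float:
    hits = 0
    for prompt in prompts:
        response = target(attack.transform(prompt))
        if judge(prompt, response):
            hits += 1
    return hits / len(prompts)
```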
February 2026
Red team uncovers LLM agent leaks, spoofing, DoS
An exploratory red-team exercise on autonomous Large Language Model (LLM) agents shows routine security failures once models gain memory, tools and chat channels. The agents leaked data, took instructions from non-owners, misreported outcomes, triggered denial of service and spread unsafe rules across peers. Identity spoofing across channels and provider-side behaviour shifts compound the risk.
Intent Laundering Breaks Cue-Driven LLM Safety
New research audits AdvBench and HarmBench and finds they overuse obvious trigger phrases. The authors strip cues while keeping malicious intent, then watch Large Language Models (LLMs) comply. Attack success jumps to around 80–87% in one pass, and 90–98.55% with a black-box loop, including on Gemini 3 Pro and Claude 3.7 Sonnet.
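The audit's first step, measuring how often giveaway phrases appear in benchmark prompts, is easy to reproduce; a minimal sketch, with a hypothetical cue list rather than the paper's inventory:

```python
# Count how many benchmark prompts contain obvious trigger phrases.
# The CUES list is a hypothetical illustration, not the paper's.
from collections import Counter

CUES = ["step-by-step guide", "how to make", "without getting caught"]

def cue_coverage(prompts: list[str]) -> tuple[float, Counter]:
    counts: Counter = Counter()
    flagged = 0
    for prompt in prompts:
        hits = [c for c in CUES if c in prompt.lower()]
        counts.update(hits)
        flagged += bool(hits)
    return flagged / len(prompts), counts  # share of cued prompts, cue tally
```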
Difficulty-aware LLM agents lift pen test success
New research dissects why Large Language Model (LLM) agents often stall in automated penetration testing. It separates fixable tooling gaps from deeper planning failures, then shows difficulty-aware planning improves end-to-end results. Reported gains include up to 91% CTF task completion and better performance on an Active Directory lab than prior systems.
Benchmark tests LLMs on secure code and fixes
SecCodeBench-V2 puts Large Language Model coding assistants through realistic secure coding tasks. It spans 98 scenarios across 22 CWE categories and five languages, using runnable proof-of-concept tests in isolated environments. Results are severity-weighted with Pass@K scoring and include an LLM judge for tricky cases, offering reproducible, comparable security evidence.
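Pass@K presumably follows the standard unbiased estimator from the original HumanEval paper; whether SecCodeBench-V2 uses exactly this form is an assumption, but the arithmetic is worth having to hand:

```python
# Unbiased pass@k (Chen et al., 2021): probability that at least one of k
# samples drawn from n generations passes, given c passing generations.
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    if n - c < k:  # fewer than k failing samples: every k-subset passes
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)
```

For example, `pass_at_k(10, 3, 1)` returns 0.3: with three passing samples out of ten, a single draw succeeds 30% of the time.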
Cross-modal attacks outwit vision-language model defences
A new paper introduces CrossTALK, a cross-modal entanglement attack that spreads clues across images and text to bypass vision-language model defences. Experiments on nine mainstream models show high success and detailed harmful outputs, highlighting gaps in cross-modal alignment and the need for adversarial testing and cross-modal safety checks in deployed VLM systems.
Governed GenAI streamlines Wi-Fi pentesting with oversight
WiFiPenTester folds Large Language Models into wireless reconnaissance and decision support to rank targets, estimate feasibility, and suggest strategies, while keeping humans firmly in control. A Kali-based proof-of-concept logs evidence, gates model spend, and separates AI reasoning from radio actions. Gains in accuracy and efficiency come with privacy, legal, and prompt-sensitivity caveats.
December 2025
Planner-led Agents Boost Automated Penetration Testing
Researchers evaluate Large Language Model (LLM) agents for automated penetration testing and introduce CHECKMATE, which pairs an explicit classical planner with LLM executors. CHECKMATE raises benchmark success rates by over 20 percent, halves time and cost versus the best agent baseline, and improves stability while exposing remaining limits in long-horizon planning and tool use.
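The architecture is easiest to picture as a loop: a symbolic planner owns the plan, LLM executors ground one step at a time, and the planner replans whenever execution diverges. A sketch under assumed interfaces (CHECKMATE's real ones differ):

```python
# Planner-led agent loop: the classical planner decides *what* to do next;
# the LLM executor only decides *how* to do one bounded step.
# planner.plan(), executor.execute() and the result fields are assumptions.
def run_engagement(planner, executor, state: dict) -> dict:
    plan = planner.plan(state)                  # e.g. PDDL-style plan
    while plan:
        step = plan.pop(0)
        result = executor.execute(step, state)  # LLM grounds step in tools
        state = result.new_state
        if not result.ok:                       # world diverged from the model
            plan = planner.plan(state)          # replan from observed state
    return state
```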
AI agents match pen testers, expose new risks
A live comparison pits autonomous AI agents against ten human penetration testers on a large university network. The ARTEMIS multi-agent scaffold places second, finding nine valid vulnerabilities with an 82% validity rate and clear cost advantages. The study flags higher false positives, GUI automation gaps, and new dual-use concerns.
New TeleAI-Safety Benchmark Exposes LLM Jailbreak Risks
TeleAI-Safety gives security teams a modular, reproducible way to probe large language model (LLM) jailbreaks. The benchmark tests 342 attack samples across 14 models with 19 attack techniques, 29 defence methods and 19 evaluation approaches. Results show model-specific weak points, evaluator variability and the need for multi-layer, ongoing testing in live deployments.
November 2025
Study Finds Widespread Vulnerabilities in AI C/C++ Code
Researchers test ten Large Language Models (LLMs) that generate C and C++ code and find many outputs contain common, real-world vulnerabilities. Static scanners report dozens of Common Weakness Enumeration (CWE) instances, some mapping to recorded Common Vulnerabilities and Exposures (CVEs). The study urges treating AI-produced code as untrusted and adding security checks.
Benchmarks expose LLMs' weakness to authority prompts
PARROT, a new robustness framework, tests how social pressure from authoritative prompts pushes Large Language Models (LLMs) to agree with false assertions. Evaluating 22 models on 1,302 multiple-choice items, the study finds wide variance: modern systems resist persuasion, while older and smaller models often comply and grow more confident in wrong answers, creating real-world risk.
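The manipulation itself is simple to replicate: prepend a confident, false authority claim and measure how often a previously correct answer flips. Template wording and item format below are hypothetical, not PARROT's:

```python
# Measure how often authority pressure flips a correct answer to a wrong one.
# The template text and (question, correct, wrong) item format are assumed.
AUTHORITY = "As a senior professor in this field, I can confirm the answer is {wrong}. {question}"

def flip_rate(ask, items) -> float:
    """ask(prompt) -> answer string; items: (question, correct, wrong) tuples."""
    flips = 0
    for question, correct, wrong in items:
        baseline = ask(question)
        pressured = ask(AUTHORITY.format(wrong=wrong, question=question))
        flips += (baseline == correct) and (pressured == wrong)
    return flips / len(items)
```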
ForgeDAN exposes gaps in aligned LLM safeguards
ForgeDAN is an evolutionary attack framework that crafts subtle prompts to bypass safeguards in aligned Large Language Models (LLMs). The paper finds it outperforms prior methods, achieving high success on several models, and shows that simple keyword filters and shallow detectors leave an exploitable surface. The study urges layered defences and continual red-teaming.
Bad fine-tuning data breaks small language models
Researchers test 23 small language models and find that modest contamination of instruction data can wreck behaviour. Simple syntactic edits, such as reversing characters, often collapse performance; semantic corruptions can steer models toward harmful outputs once exposure passes a threshold. Larger models can be more easily hijacked, creating supply-chain risks for deployment.
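The "simple syntactic edits" are as blunt as they sound; a sketch of character-reversal contamination applied to a fraction of instruction pairs (the 5% rate and field names are assumptions):

```python
import random

def contaminate(dataset: list[dict], rate: float = 0.05, seed: int = 0) -> list[dict]:
    """Reverse the characters of the response in a small fraction of examples.
    The rate and the {'instruction', 'response'} schema are illustrative."""
    rng = random.Random(seed)
    out = []
    for example in dataset:
        example = dict(example)
        if rng.random() < rate:
            example["response"] = example["response"][::-1]  # syntactic edit
        out.append(example)
    return out
```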
Automated Multimodal Jailbreaks Reveal VLM Weaknesses
New research introduces JPRO, a black-box, multi-agent framework that automates jailbreaking of vision-language models (VLMs). It chains planning, attack, modification and verification to produce diverse image-plus-text attacks and achieves over 60% success against several advanced VLMs. The work highlights practical risks for deployed multimodal endpoints and the need for stronger defences.
Teach LLMs Security Specs to Find Bugs
Researchers introduce VulInstruct, a method that teaches Large Language Models (LLMs) explicit security specifications mined from past patches and CVEs to detect vulnerabilities. On a strict benchmark it raises F1 and recall substantially and finds many bugs that baselines miss. The approach even uncovered a real high-severity CVE, showing practical value for automated code review.
October 2025
Genesis evolves attack strategies against LLM web agents
Genesis presents an automated red-teaming framework that evolves attacks against web agents driven by large language models (LLMs). Its Attacker, Scorer and Strategist modules generate, evaluate and summarise adversarial payloads. The system finds transferable strategies, beats static baselines, and shows defenders need continuous, data-driven testing and stronger interaction controls.
HackWorld Tests AI Agents Against Web App Flaws
HackWorld evaluates computer-use agents (CUAs) on 36 real web applications and finds exploitation success below 12%. Agents can perceive pages but struggle to plan multi-step attacks, orchestrate tools and recover from errors. The study highlights gaps to close before autonomous agents become a scalable automated attack vector and points to practical mitigations.
RedTWIZ Exposes LLM Jailbreaks with Adaptive Planner
RedTWIZ is an adaptive, multi-turn red teaming framework that systematically probes Large Language Model (LLM) safety. The authors show multi-turn, goal-oriented jailbreaks can coax state-of-the-art models to produce unsafe code and explanations. Their hierarchical planner and diverse attack suite outperform naive approaches, exposing gaps in guardrails for AI-assisted software development.
AutoPentester Automates Red-Team Tasks, Reveals Gaps
AutoPentester uses a Large Language Model (LLM) agent to automate end-to-end penetration testing and yields measurable gains versus PentestGPT. The framework raises subtask completion and vulnerability coverage while cutting human interactions, but introduces automation overhead and new risks such as prompt injection and hallucination that teams must mitigate before deployment.
AI agents fuzz industrial control protocols effectively
Researchers present MALF, a multi-agent Large Language Model (LLM) fuzzing framework that finds protocol-aware faults in industrial control systems (ICS). Using Retrieval-Augmented Generation (RAG) and QLoRA tuning, MALF reports 88–92% test pass rates, broad protocol coverage, many exception triggers and three zero-days in a power-plant range, highlighting both defensive value and dual-use risk.
September 2025
MCP tool poisoning steers LLM agents at scale
This paper shows that Model Context Protocol (MCP) tools can be poisoned to steer Large Language Model (LLM) agents. An automated framework, AutoMalTool, generates malicious tools with about 85% generation success and roughly 35% real-agent effectiveness while evading detectors. The finding exposes a scalable attack surface and a gap in current defences.
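One cheap screen the finding motivates: reject MCP tools whose descriptions contain instruction-like payloads aimed at the model rather than the user. The phrase list below is an assumed illustration, not a tested detector:

```python
import re

# A tool *description* should describe; imperative, agent-directed phrasing
# inside it is a poisoning red flag. Patterns here are illustrative only.
SUSPICIOUS = [
    r"\bignore (all|any|previous|prior) instructions\b",
    r"\bdo not (tell|inform|mention|reveal)\b",
    r"\bbefore (using|calling) any other tool\b",
]

def screen_tool_description(description: str) -> list[str]:
    return [p for p in SUSPICIOUS if re.search(p, description, re.IGNORECASE)]
```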
Memory aids RL pen-testing robustness and transfer
Researchers train reinforcement learning agents to run simulated, partially observable penetration tests and compare policy variants. Augmenting observations with recent history outperforms recurrent and transformer-based policies, converging about three times faster and generalising better across network sizes. The work flags gaps in observability and urges memory-aware defences against automated attacks.
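The winning observation design amounts to a sliding window over recent observations, letting a plain feed-forward policy see short-term history without recurrence. A minimal sketch (window size and zero-padding are assumed choices):

```python
from collections import deque
import numpy as np

class HistoryStack:
    """Concatenate the last k observations into one flat vector.
    k=4 and zero-padding at episode start are illustrative choices."""
    def __init__(self, obs_dim: int, k: int = 4):
        self.obs_dim, self.k = obs_dim, k
        self.buffer: deque = deque(maxlen=k)

    def reset(self, obs: np.ndarray) -> np.ndarray:
        self.buffer.clear()
        for _ in range(self.k - 1):
            self.buffer.append(np.zeros(self.obs_dim))
        return self.step(obs)

    def step(self, obs: np.ndarray) -> np.ndarray:
        self.buffer.append(obs)
        return np.concatenate(self.buffer)
```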
Automated Red-Teaming Exposes Global AI Disinformation Gaps
A new method called anecdoctoring automates multilingual adversarial prompt generation using 9,815 fact-checked claims in English, Spanish and Hindi from the US and India. By clustering narratives and adding knowledge graphs, it raises attack success rates above 80% for several models and shows where English-centric safety testing leaves dangerous blind spots.
Ads Enable LLMs to Reconstruct User Profiles
Researchers audit social media ad streams and show they can reveal sensitive user attributes when analysed with multimodal Large Language Models. The study finds algorithmic skew in political and gambling ads and reports LLMs reconstruct gender, age and other demographics well above baseline, creating privacy and targeting risks for users and organisations.
MUSE exposes and hardens multi-turn LLM jailbreaks
MUSE is a new framework that both probes and patches multi-turn jailbreaks in conversational AI. Its attack module uses semantic strategies and Monte Carlo Tree Search to discover context-driven bypasses, and its defence fine-tunes models at the turn level to cut successful multi-turn exploits while keeping reasoning intact.
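The search side leans on standard Monte Carlo Tree Search machinery; the UCT selection rule at its core looks like this (the tuple layout is an assumption, not MUSE's data model):

```python
import math

def uct_select(children, parent_visits: int, c: float = 1.4):
    """children: list of (reward_sum, visits, node) tuples (layout assumed).
    Exploit branches that led to unsafe replies, explore rarely tried ones."""
    def score(reward_sum: float, visits: int) -> float:
        if visits == 0:
            return float("inf")  # always try unvisited branches first
        return reward_sum / visits + c * math.sqrt(math.log(parent_visits) / visits)
    return max(children, key=lambda ch: score(ch[0], ch[1]))[2]
```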
New Benchmark Shows AI Pentesters Fail Real Targets
A new real-world benchmark and agent pair, TermiBench and TermiAgent, tests AI-driven penetration tools beyond toy capture-the-flag setups. The research shows most existing agents struggle to obtain system shells, while TermiAgent improves success with memory-focused reasoning and structured exploit packaging. This raises practical security concerns and governance questions for defenders and policy makers.
AI Powers Android Exploits and Shifts Pentesting
New research shows large language models can automate Android exploitation workflows, speeding up rooting and privilege escalation in emulated environments. The study warns these AI-generated scripts can be misused at scale, highlights emulator limits, and urges human oversight and defence-aware toolchains to prevent automation from becoming an attacker force multiplier.
Anchor LLMs with ATT&CK, Cut Pentest Hallucinations
New research shows constraining LLM-driven penetration testing to a fixed MITRE ATT&CK task tree dramatically cuts hallucinations and redundant queries while raising task completion rates across models. The method speeds automated assessments, helps smaller models succeed, and warns defenders to update mappings before attackers and tools weaponise the same guided approach.
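Operationally, "constraining to a task tree" means the model only ever chooses among the current node's children instead of free-generating next steps. A sketch with a toy tree (the encoding and choose() signature are assumptions):

```python
# Restrict an LLM pentest agent to an ATT&CK-derived task tree: at each
# step it may only pick one of the current node's children.
# The tree literal and choose() callback are illustrative assumptions.
TASK_TREE = {
    "recon": ["scan_ports", "enumerate_shares"],
    "scan_ports": ["exploit_service"],
    "enumerate_shares": ["harvest_creds"],
    "exploit_service": [],
    "harvest_creds": [],
}

def constrained_walk(choose, root: str = "recon") -> list[str]:
    """choose(node, options) -> one of options; an LLM call in practice."""
    path, node = [root], root
    while TASK_TREE[node]:
        options = TASK_TREE[node]
        picked = choose(node, options)
        if picked not in options:  # reject hallucinated steps outright
            raise ValueError(f"{picked!r} is not a permitted next task")
        path.append(picked)
        node = picked
    return path
```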
LLMs Fail to Fix Real Exploitable Bugs
New exploit-driven testing finds that popular large language models fail to reliably repair real, exploitable Python vulnerabilities. Researchers test 23 real CVEs with working proof-of-concept exploits and show top models fix only five cases. The result warns that AI patches often leave attack surfaces and need exploit-aware checks before deployment.
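The exploit-aware check the paper argues for reduces to a two-sided gate: the PoC must stop working and the functional tests must keep passing. A sketch with placeholder commands:

```python
import subprocess

def patch_is_valid(poc_cmd: list[str], test_cmd: list[str]) -> bool:
    """A repair counts only if the proof-of-concept exploit now fails AND the
    project's tests still pass. Commands (e.g. ["python", "poc.py"]) are
    placeholders; exit-code conventions are assumed."""
    exploit = subprocess.run(poc_cmd, capture_output=True)
    tests = subprocess.run(test_cmd, capture_output=True)
    return exploit.returncode != 0 and tests.returncode == 0
```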
Audit Reveals LLMs Spit Out Malicious Code
A scalable audit finds production LLMs sometimes generate code containing scam URLs even from harmless prompts. Testing four models, researchers see about 4.2 percent of programs include malicious links and identify 177 innocuous prompts that trigger harmful outputs across all models. This suggests training data poisoning is a practical, deployable risk.
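Replicating the audit's screening step is straightforward: pull URLs out of generated programs and check hosts against a reputation list (the static blocklist below stands in for a live feed):

```python
import re

URL_RE = re.compile(r"https?://[^\s\"')>]+")

def audit_generated_code(code: str, blocklist: set[str]) -> list[str]:
    """Return URLs in generated code whose host is on a known-bad list.
    A real audit would query a live reputation feed, not a static set."""
    flagged = []
    for url in URL_RE.findall(code):
        host = url.split("/")[2].lower()
        if host in blocklist:
            flagged.append(url)
    return flagged
```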
Researchers Turn AI Security Tools Into Attack Vectors
New research shows AI-powered cybersecurity tools can be hijacked through prompt injection, where malicious text becomes executable instructions. Proof-of-concept attacks compromise unprotected agents in seconds with a 91 percent success rate. Multi-layer defences can block these exploits, but researchers warn the fixes are fragile and require ongoing vigilance.
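One layer of the kind of defence the paper evaluates is to quarantine untrusted text before it reaches the instruction channel; a minimal sketch (marker list and wrapper format are assumptions):

```python
# Never splice untrusted tool output directly into the instruction channel.
# The marker list and wrapper format are illustrative assumptions.
INJECTION_MARKERS = ("ignore previous", "new instructions", "you must now")

def wrap_untrusted(text: str) -> str:
    if any(m in text.lower() for m in INJECTION_MARKERS):
        text = "[REDACTED: possible prompt injection]"
    return f"<untrusted_data>\n{text}\n</untrusted_data>"
```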
August 2025
Train Agents to Find Vulnerabilities at Scale
Researchers build CTF-Dojo and CTF-Forge, a scalable runtime and automation pipeline that trains language-model agents on containerized capture-the-flag challenges. They show small verified training sets yield big gains in exploit-finding ability, improving open models while raising clear misuse risks that force urgent, practical containment and control decisions.
Reinforcement Learning Improves Autonomous Pentest Success
New Pentest-R1 shows that combining offline expert walkthroughs with online interactive training helps smaller AI agents perform real multi-step penetration tests. The system raises success rates and cuts token use, but absolute performance stays modest. This matters for defenders who want automated, repeatable tests and for risk managers worried about misuse.
LLMs Automate Penetration Tasks, Exposing Infra Weaknesses
New research shows a modern LLM can autonomously solve most beginner capture-the-flag tasks, finding files, decoding data, and issuing network commands with human-speed accuracy. That success lowers the skills barrier for attackers and exposes specific infrastructure gaps. Operators must apply practical hardening to endpoints, GPUs, vector stores, secrets and data paths now.
