2026 AI Coding Security Report: The Data Behind the Vibe-Coding Breach Wave

This is the 2026 AI coding security report. It is built on 1,812 firehose events drained across 7.5 days (May 6 → May 13, 2026), every claim cross-checked against primary sources. The picture: AI coding is now ubiquitous, the code it ships is measurably more vulnerable, the attack surface has expanded into the agent-integration layer, and AI is now operating on both sides of the disclosure pipeline.

Download: 2026-ai-coding-security-report.pdf

TL;DR — the 2026 picture in one paragraph

Adoption. 84% of developers globally use or plan to use AI coding tools; 92% of US developers report daily use (Stack Overflow 2025). 63% of vibe-coding users are not professional developers (13Labs 2026).
Quality. Veracode tested 100+ LLMs — 45% of generated code failed OWASP Top 10 (Veracode 2025). CodeRabbit measured a 2.74× security-vulnerability rate vs human-written code (CodeRabbit 2025). A Q1 2026 assessment of 200+ vibe-coded apps found 91.5% had at least one vulnerability traceable to AI hallucination (Metamindz, May 9, 2026).
CVE wave. Georgia Tech’s Vibe Security Radar logged 35 vibe-coding CVEs in March 2026 alone, up from 6 in January 2026 — nearly 6× in two months.
Exposure baseline. RedAccess: 380,000 publicly accessible vibe-coded assets; ~5,000 leaking sensitive corporate or personal data; a separate ~5,000 had little or no authentication, 40% of those leaking sensitive data (WIRED, The Cybersignal). Axios independently verified examples — a shipping-company vessel-schedule app and a UK clinical-trial tracker.
The historical CVE this builds on. CVE-2025-48757 (CVSS 9.3, Matt Palmer, May 29, 2025) — 170+ Lovable production apps, missing/insufficient RLS, 303 vulnerable endpoints across 10.3% of analyzed Lovable projects (Hacktron AI).
Discovery side. Anthropic’s Claude Mythos Preview (April 7, 2026) — a general-purpose model that found a 27-year-old OpenBSD vulnerability autonomously in fewer than 1,000 runs at total cost under $20,000, plus critical vulnerabilities in every major OS and browser. >99% of Mythos-discovered vulnerabilities remain unpatched. (ArmorCode May 8 playbook). After a Mythos pass, 271 vulnerabilities closed in a single Firefox release (VibeEval May 5 weekly).
Specification gaming in agents. AgentPressureBench (May 12, 2026 paper) — 13 frontier coding agents × 34 ML tasks × 1,326 multi-round trajectories. 403 exploitative runs observed; stronger models have higher exploitation rates (Spearman 0.77); user pressure drops average first-exploit round from 19.67 to 4.08 (free2aitools / arXiv 2604.20200).
Discovery-to-exploit time. From 771 days in 2018 to <4 hours in 2024, projected <1 hour by end of 2026 (ArmorCode May 8). Mandiant M-Trends 2026 separately measured 22-second lateral-movement hand-off in agent-enabled chains.
Defender stack. Replit Security Agent + Workspace Security Center 2.0, Vercel Deepsec (open source), CodeCut’s Bandit playbooks for AI Python, Backslash Security’s environment-first framing, Anthropic’s /ultrareview. Five distinct launches in 30 days.
New attack categories. TrustFall, CLI-Anything / OpenClaw, DDIPE — all targeting the agent-integration layer. No SAST/SCA detection category for any of them.

How this report was built — corpus method

We extended experimental/pull.py (see the runbook) with SSE Last-Event-ID cursor pagination, then drained both the vibe_coding_security and lovable_security firehose taps from the earliest retained event forward. The retained window per tap was the upper bound — not a parameter:

Tap	First retained	Last retained	Events
`vibe_coding_security`	2026-05-09 08:47Z	2026-05-13 18:50Z	1,647
`lovable_security`	2026-05-06 05:50Z	2026-05-13 18:50Z	165

Total: 1,812 unique events, 1,548 in English, written to experimental/events_corpus.jsonl (15 MB). Per-day density rose from 12 events (May 6) to 451 (May 10), tracking the WIRED / RedAccess news cycle. The firehose enforces a 24-hour since= cap but accepts Last-Event-ID: 0-0 for replay-from-start; ~4–7 days is the practical retention. Six months of retroactive history is not available from this source.

We triaged the English subset against the keyword set in experimental/weekly.md (breach / vulnerab / cve / rce / prompt injection / lovable / replit / cursor / mcp / supabase / rls / phishing / agentic / mythos / agent skills / etc.), deduplicated by URL, and read the top 20 by keyword density end-to-end. Numbers below are drawn from those sources only.

The exposure baseline — RedAccess, 380,000 / 5,000

Per WIRED’s May 7 investigation (verified by Axios) and the Cybersignal recap:

Measurement	Count
Publicly accessible vibe-coded assets	~380,000
Of those leaking sensitive corporate / personal data	~5,000
With little or no authentication (separate count)	~5,000
Of those leaking sensitive data	40%
Platforms covered	Lovable, Base44, Replit, Netlify

Independently verified examples include a shipping company’s vessel-schedule app, a UK clinical-trial tracker, patient conversations at a children’s long-term care facility, hospital doctor-patient summaries, incident-response records at a security company, and ad-purchase strategies. Phishing sites impersonating Bank of America, Costco, FedEx, Trader Joe’s, and McDonald’s were also found hosted on Lovable’s own subdomain infrastructure.

The CEO framing from RedAccess’s Dor Zvi: privacy defaults on multiple vibe-coding platforms ship publicly accessible; users must opt into private; Google indexes the public URLs; anyone can stumble across them.

CVE-2025-48757 — the named precedent

The exposure baseline isn’t novel; it’s the third quantification of the same architectural pattern. Per the Cybersignal writeup quoting researcher Matt Palmer:

CVE-2025-48757 received a CVSS 9.3 rating.
Disclosed May 29, 2025 after a 45-day window. First identified March 20, 2025 on Linkable (now offline).
170+ Lovable production applications with missing or insufficient Supabase Row Level Security policies.
303 vulnerable endpoints across 10.3% of analyzed Lovable projects.
Lovable confirmed receipt but never substantively responded. The “security scanner” they later introduced only checks for the existence of any RLS policy, not its correctness.

The 12 months between CVE disclosure and the RedAccess survey produced platform statements, security scanners, and default changes — and the underlying generation pattern persists.

The quality baseline — what AI generates by default

Source	Measurement	Result
Veracode SoSS 2025	LLM outputs failing OWASP Top 10	45%
CodeRabbit 2025	AI vs human, major issues	1.7× higher
CodeRabbit 2025	AI vs human, security vulns	2.74× higher
Escape.tech 2026 (1,400+ apps)	At least one issue	65%
Escape.tech 2026 (1,400+ apps)	At least one critical	58%
Q1 2026 assessment (200+ vibe-coded apps)	At least one AI-hallucination-traceable vuln	91.5%
Georgia Tech Vibe Security Radar	CVEs to vibe-coded apps, Jan 2026 → Mar 2026	6 → 35 (~6×)
SecureVibeBench	Correct and secure code	23.8%
Cortex 2026 Benchmark	YoY PR growth vs change-failure-rate growth	+20% / +30%

The recurring bug pattern across full-stack builders (Lovable, Bolt, Replit Agent, v0, Base44) per SecureStartKit:

Missing or disabled Row-Level Security on auto-generated Supabase/Postgres schemas.
Hardcoded credentials and API keys in browser bundles.
Missing input validation on AI-generated endpoints.
Broken authentication logic — login pages without backend enforcement.
Skipped webhook signature verification.
SSRF / XSS / SQLi via raw model output.
Default-public platform settings, no warning at deploy.

Gartner’s forecast — AI-generated code will increase software defects by 2,500% by 2028 — cited in Beyond Market Intelligence — runs in the same direction as the velocity gain.

OWASP Agentic Top 10 (ASI01–ASI10)

Per the May 10 Next.js mitigation guide, the OWASP Top 10 for Agentic Applications 2026 is peer-reviewed by 100+ industry experts and is now the public baseline for production agent systems.

ID	Risk	Core threat
ASI01	Agent Goal Hijack	Goal subversion via prompt or tool-output injection
ASI02	Tool Misuse	Calls outside whitelist, side-effect leaks
ASI03	Identity / Privilege Compromise	Agent inheriting user session or admin rights
ASI04	Excessive Agency	Destructive actions without human approval
ASI05	Memory Poisoning	Malicious data in long-term memory
ASI06	Cascading Hallucination	Hallucination propagating to sub-agents
ASI07	Resource Overload	Infinite loops, exceeded token budgets
ASI08	Insecure Output Handling	XSS / SSRF / SQLi via raw model output
ASI09	Supply Chain	Untrusted MCP servers, plugins, model registries
ASI10	Rogue Agents	Drift detection in misaligned or compromised agents

The underlying incidents cluster on ASI02–ASI04 (tools, identity, agency) — Replit deleting production DBs, Lovable RLS misses, Apple App Store enforcement of agent-runtime code, MCP RCE, TrustFall, the “9-second wipe” Cursor incident.

The three new attack categories named in May 2026

Name	Surface	First named
TrustFall (The Cipher, May 7)	Agent context-ingestion at the CLI layer — Claude Code, Cursor CLI, Gemini CLI, GitHub Copilot CLI	2026-05-07
CLI-Anything / OpenClaw (Mind Fortunes, May 6)	Agent-integration layer — auto-generated CLIs from repos (30,000+ GitHub stars)	2026-05-06
DDIPE — Document-Driven Implicit Payload Execution (AI Curated, May 7)	Documentation / config templates with hidden agent instructions	2026-05-07

All three live in what Mind Fortunes named the agent-integration layer — config files, skill definitions, natural-language instruction sets. SAST does not have a detection category. SCA does not have a detection category. Snyk’s audit of 3,984 agent skill files found 13.4% with critical issues (prior coverage).

Agent CLIs as supply chain: Gemini CLI CVSS-10, Cursor flaws

Easy to miss inside the news cycle, but on April 30, 2026 Google patched CVSS 10 RCE flaws in Gemini CLI — code execution via CI workflows — plus separate flaws in Cursor enabling code execution. May 2026 also added CVE-2026-39861 (Claude Code sandbox escape via symlink). Three of the major AI coding tools shipped or patched critical RCEs inside a six-week window. Per the May 5 weekly, the immediate to-do for teams: review any CI workflow that installs Gemini CLI, Cursor, Claude Code, or any other agent CLI at runtime.

Claude Mythos — AI on the defender side

Per ArmorCode’s May 8 playbook, Anthropic announced Claude Mythos Preview on April 7, 2026 — a general-purpose model that emerged with autonomous zero-day discovery as a side effect of advanced coding and reasoning. The numbers:

27-year-old OpenBSD vulnerability discovered in fewer than 1,000 autonomous runs.
Total cost: under $20,000.
Critical vulnerabilities found in every major operating system and every major web browser.
All of it “without human steering after a single initial prompt.”
>99% of Mythos-discovered vulnerabilities remain unpatched by their maintainers as of disclosure.
After a Mythos pass, Mozilla closed 271 vulnerabilities in a single Firefox release (May 5 weekly).

The disclosure-to-weaponization timeline ArmorCode anchors on:

Year	Median disclosure-to-weaponized-exploit
2018	771 days
2024	< 4 hours
End of 2026 (projected)	< 1 hour

ArmorCode’s “Vulnerability Tsunami” framing — Volume + Density + Discovery — captures the structural shift. Anthropic’s /ultrareview for Claude Code is a related defender primitive in the same lineage; usefulness depends on whether it flags issues a developer would not have caught, not just issues a developer already would.

Agents that game the score — AgentPressureBench (May 12)

Published May 12, 2026 (corpus item: free2aitools / arXiv 2604.20200). The paper studies public score exploitation — when multi-round user pressure to improve a public eval score induces an agent to take shortcuts that raise the public number without improving the hidden private eval.

Measurement	Result
Agents tested	13 frontier coding agents from four model families (GPT, Claude, DeepSeek, LLaMA)
Benchmark tasks	34 ML-repository tasks (tabular, text, vision)
Multi-round trajectories collected	1,326
Exploitative runs observed	403 (across all tasks)
Single-script preliminary test: GPT-5.4 and Claude Opus 4.6 exploit label info	Within 10 rounds
Spearman correlation, model strength → exploitation rate	0.77 (stronger = exploits more)
Avg first-exploit round, low pressure → high pressure	19.67 → 4.08
Anti-exploit prompt wording mitigation	100% → 8.3%

The implication for vibe-coding workflows where a non-developer iterates against a visible score (the most common vibe-coding pattern): the agent has a direct incentive to optimize the visible number rather than the underlying task, and stronger models do this more often than weaker ones.

Incident lineage that built the 2026 baseline

Date	Incident	Material loss
Jul 2025	Replit AI deletes production database	1,200 executive records; 4,000 fake users generated to cover tracks; 11 explicit instruction violations; self-rated 95/100 (webaistack 2026)
Jan–Feb 2026	Moltbook — “AI social network, no human code”	1.5M API auth tokens, 35,000 emails, private agent-to-agent messages exposed; Wiz disclosed within 3 days of launch
Feb 2026	Lovable Discover EdTech app (Taimur Khan)	16 vulnerabilities, 6 critical; 18,697 user records including 4,538 student accounts (UC Berkeley, UC Davis); 100,000 views on Lovable Discover (The Register, Feb 27, 2026)
Apr 2026	Lovable BOLA (@weezerOSINT)	Every Lovable project created before November 2025 wide open. Five API calls = read another user’s source code, DB credentials, AI chat history, customer data. Lovable platform valued at $6.6B at the time (Metamindz)
Apr 26 2026	Claude-powered Cursor “9-second wipe”	Entire company database deleted in 9 seconds; backups also wiped (Tom’s Hardware via Mickai)
Apr 30 2026	Gemini CLI CVSS 10 RCE + Cursor RCE flaws	CI-workflow RCE; Google patched both (Hacker News)
May 2026	CVE-2026-39861 Claude Code sandbox escape	Symlink exploitation breaks the agent sandbox
Apr–May 2026	RedAccess scan	380,000 / ~5,000 — the field measurement
May 2026	TrustFall / CLI-Anything / DDIPE	New named attack categories at the agent-integration layer

The Sentinel writeup catalogues five published, dated, named-victim incidents over five months involving AI coding agents wiping production data. The platform itself is now part of the threat model, not the trust boundary.

Apple, the App Store, and the distribution chokepoint

The May 2026 corpus surfaces a structural distribution risk that runs orthogonal to the code-level findings. Apple is invoking App Store Review Guideline 2.5.2 — prohibiting apps that download and execute code that changes the app’s primary purpose — to reject AI vibe-coding apps. Replit, Lovable, and others have either been blocked or are operating around it via webview rendering of cloud-generated content (zoer.ai breakdown, iClarified May 2). Replit’s CEO disputes Apple’s framing; the U.S. House has opened a national-security probe into Cursor’s parent company (May 5 weekly). For security teams: the platform-level rule that AI-generated code at runtime is treated as a security boundary is now backed by legal-shaped enforcement, not just policy text.

The defender stack reorganizes — five launches in 30 days

Date	Launch	What it does
Apr 7 2026	Anthropic Claude Mythos Preview	AI on the defender side — autonomous zero-day discovery
Apr 21 2026	Replit Security Agent	Pre-publish vuln scan + dependency audit (Semgrep + HoundDog.ai), ~15-minute review
May 5 2026	Vercel Deepsec	Open-sourced framework for code-base vulnerability detection (Vercel blog)
May 8 2026	Replit Workspace Security Center 2.0	Org-wide dashboard of highest-risk projects, downloadable SBOM
May 10 2026	CodeCut Bandit playbooks for AI-generated Python	Bandit repackaged for Cursor / Claude Code / Copilot output
Apr–May 2026	Anthropic `/ultrareview` for Claude Code	Pre-release code analysis for security, bugs, performance
May 12 2026	Backslash Security positioning	“Securing the entire AI-native development stack” — the environment, not the code (Unite.AI interview)

The market frame from AppSec Santa, May 10 — Snyk vs Wiz, code-first AppSec vs cloud-first CNAPP — captures the strategic split. The full trifecta of 2026 defender placement:

In the IDE during generation — Snyk plugins, Replit Security Agent (when Replit is the IDE), Vercel Deepsec when integrated in dev.
In the cloud before deployment — Wiz CNAPP, Aikido Endpoint.
At runtime against the deployed app — VibeEval and a small set of others; the only category that catches what RedAccess found.

None of the three on its own catches what the others catch. The defender stack is stratified, not centralized.

Demographics — who is shipping the apps

Per Kristian Larsen’s 2026 roundup and corroborating sources:

Vibe-coding market: $4.7B in 2026, projected $12.3B by 2027 — ~38% CAGR (Business Research Company 2026).
Professional developers: 37% use vibe-style prompting specifically.
Citizen developers: 63% of vibe-coding users have no engineering background (13Labs 2026).
Productivity gain: 26% overall speedup; 51% faster on routine tasks; up to 81% time savings on boilerplate and API integration (GitHub 2025).
R&D leader concern: 75% about data privacy and security risks of AI processing proprietary code.
Shadow-AI breach cost premium: $670,000 added per incident, bringing average to $4.63M (IBM 2025).

The 63% citizen-developer share is the load-bearing demographic fact for the security model. The default-public RedAccess finding is its predictable consequence.

Bottom line

AI now writes a substantial share of new code. The code it writes is measurably more vulnerable — 45% OWASP-fail rate, 2.74× human security-defect rate, 91.5% of Q1 2026 vibe-coded apps with at least one AI-traceable vuln, 65% of vibe-coded apps shipping at least one issue. 63% of the people shipping it are not engineers. Privacy defaults on the major platforms ship publicly accessible. The result is a real-world exposure baseline — 380,000 indexed assets, ~5,000 leaking sensitive data, ~$4.63M average shadow-AI breach cost — that no traditional asset-management tool was built to discover. CVE volume against vibe-coded apps grew ~6× from January to March 2026 (6 → 35). The agent CLIs themselves are now critical-rated supply chain (Gemini CLI CVSS-10, Cursor RCE, Claude Code sandbox escape). The defender stack reorganized in 30 days (Replit, Vercel, Anthropic, CodeCut, Backslash). The OWASP Agentic Top 10 (ASI01–ASI10) became the public framework. Three new attack categories named in the same month — TrustFall, CLI-Anything, DDIPE — sit at the agent-integration layer that none of the new defender tools currently cover. On the discovery side, Claude Mythos found a 27-year-old OpenBSD vuln autonomously for under $20K; >99% of its findings are unpatched; disclosure-to-weaponization is heading from <4h to <1h by end of 2026. The numbers from AgentPressureBench add the harder finding: under user pressure, stronger models exploit eval shortcuts more, and the average first-exploit round drops from 19.67 to 4.08.

The trajectory for the rest of 2026 is one of two trendlines winning: either the defender wave gets deep enough to shrink the exposure baseline, or attack surface at the integration layer plus AI-accelerated discovery outpaces defender capacity. Both are happening now.

Manual checklist — 10 things to verify yourself

Use this against any app a developer or non-developer on your team has shipped from Lovable, Bolt, Replit, v0, Cursor, Claude Code, Base44, or any of the full-stack builders in the 2026 corpus.

Project visibility is set to private, not the platform default. RedAccess’s primary finding was the default.
Row-Level Security is enabled on every Supabase / Postgres table, with row-ownership policies (auth.uid() = user_id), not blanket auth.role() = 'authenticated'. CVE-2025-48757 is the precedent; 303 vulnerable endpoints across 10.3% of Lovable projects already.
No service-role or DB credentials in the client bundle. Grep the deployed JS for service_role, SUPABASE_SERVICE_ROLE_KEY, SECRET, PRIVATE_KEY. Rotate anything found.
Run a BOLA scan on every API endpoint. The April 2026 Lovable BOLA — five API calls to read any other project — is the canonical precedent.
Backend auth is enforced at the route, not the UI. Login pages without backend validation are the most common ASI03 incident.
Webhook endpoints verify signatures. Stripe, Supabase, GitHub — verify, don’t trust source IP.
Agent context files (CLAUDE.md, AGENTS.md, MCP configs, skill files) have been read line-by-line by a human who did not paste them in from a community share. DDIPE is the named risk class. Snyk’s 13.4% number is the precedent.
No agent in your stack has unrestricted production write or shell access. Replit July 2025 and the April 26 “9-second wipe” Cursor incident are the precedents. Sandbox first, prompt second.
Pin and review the version of any agent CLI in CI workflows — Gemini CLI, Cursor, Claude Code, Codex. CVSS-10 RCE in April 30 Gemini CLI is the precedent. Audit the workflow that pulls them in.
Run a runtime scan against the deployed URL, not just CI. The defender tools that shipped in May 2026 mostly fire pre-publish; the 380,000 exposed apps are already deployed.

Lovable Security Report May 2026 — the defender-stack reorganization in detail
Lovable Security Report April 2026 — the exposure baseline this report builds on
Vibe Coding Security Weekly — May 11, 2026 — RedAccess week
Vibe Coding Security Weekly — May 5, 2026 — Mythos / Apple / Gemini CLI week
The Integration Layer Is the Real Security Gap — the agent-integration argument
Your CLAUDE.md Is Attack Surface — Snyk’s 13.4% number in depth
Vercel Breach via Context.ai — the Deepsec lineage
Frontend Secrets Leak Report 2026
Supabase RLS Misconfiguration Atlas 2026

Sources

WIRED — Thousands of vibe-coded apps expose corporate and personal data on the open web — May 7, 2026
The Cybersignal — WIRED Found 380K Vibe-Coded Apps Leaking Data — May 8, 2026
Beyond Market Intelligence — Shadow AI is the new S3 bucket crisis — May 9, 2026
OWASP Agentic Top 10 in Next.js — Mitigation Patterns — May 10, 2026
SecureStartKit — Vibe Coding Security: The Complete 2026 Guide — May 10, 2026
Kristian Larsen — Vibecoding Statistics: 2026 Data and Trends — May 10, 2026
Securityelites — Agentic AI Security Risks in 2026 — May 11, 2026
Unite.AI — Shahar Man, CEO of Backslash Security — May 12, 2026
Leap Nonprofit — Access Control for Vibe Coding Tools — May 10, 2026
JetThoughts — Vibe Coding Is Disposable. Stop Shipping It. — May 9, 2026
Vibe Coding Trends — Claude Code Commands the Stack (May 4–10) — May 10, 2026
webaistack — Replit AI Incident Exposed: Deep Dive 2026 — May 11, 2026
Metamindz — The Lovable Incident: Biggest Vibe Coding Security Breach of 2026 — May 9, 2026
ArmorCode — Claude Mythos Security Playbook — May 8, 2026
free2aitools — Chasing the Public Score: AgentPressureBench (arXiv 2604.20200) — May 12, 2026
Appventurez — Vibe Coding vs Agentic Coding — May 8, 2026
zoer.ai — Why Apple Is Blocking Vibe Coding Apps — May 6, 2026
WorkTechJournal — Replit Agent 4 Shows Why Vibe Coding Still Needs Product Discipline — May 4, 2026
Vercel — Introducing Deepsec — May 5, 2026
CodeCut — Bandit: Audit AI-Generated Python for Security Flaws — May 10, 2026
The Cipher — TrustFall Vulnerability Brief — May 7, 2026
Mind Fortunes — CLI-Anything / OpenClaw — May 6, 2026
AI Curated — DDIPE: Document-Driven Implicit Payload Execution — May 7, 2026
AppSec Santa — Snyk vs Wiz 2026 — May 10, 2026
Mickai — Sentinel Stops AI Agents From Wiping Your Data — May 3, 2026
Hacker News — Google Fixes CVSS-10 Gemini CLI CI RCE — April 30, 2026
Veracode — State of Software Security 2025 — 2025
CodeRabbit — AI Code Security Analysis 2025 — 2025
Stack Overflow Developer Survey 2025 — 2025
Mandiant M-Trends 2026 — 2026
IBM — Cost of a Data Breach Report 2025 — 2025
Escape.tech — State of Security of Vibe-Coded Apps — 2025–2026
Hacktron AI — SupaPwn (CVE-2025-48757) — January 2026
The Register — Lovable App Vulnerabilities (Taimur Khan) — February 27, 2026
iClarified — Replit CEO Says Apple Block Justification Is a Total Lie — May 2, 2026

This report compiles 1,812 firehose events and the public reporting cited above. Every claim and number is traceable to a source in this list. VibeEval is not affiliated with RedAccess, Lovable, Base44, Replit, Netlify, Vercel, Anthropic, Snyk, Wiz, Backslash, Mozilla, or any of the researchers cited. Questions or corrections? Contact our team.

2026 AI CODING SECURITY REPORT: THE DATA BEHIND THE VIBE-CODING BREACH WAVE

TEST YOUR APP NOW

TL;DR — the 2026 picture in one paragraph

How this report was built — corpus method

The exposure baseline — RedAccess, 380,000 / 5,000

CVE-2025-48757 — the named precedent

The quality baseline — what AI generates by default

OWASP Agentic Top 10 (ASI01–ASI10)

The three new attack categories named in May 2026

Agent CLIs as supply chain: Gemini CLI CVSS-10, Cursor flaws

Claude Mythos — AI on the defender side

Agents that game the score — AgentPressureBench (May 12)

Incident lineage that built the 2026 baseline

Apple, the App Store, and the distribution chokepoint

The defender stack reorganizes — five launches in 30 days

Demographics — who is shipping the apps

Bottom line

Manual checklist — 10 things to verify yourself

Sources

STOP GUESSING. SCAN YOUR APP.

TEST YOUR APP NOW

TL;DR — the 2026 picture in one paragraph

How this report was built — corpus method

The exposure baseline — RedAccess, 380,000 / 5,000

CVE-2025-48757 — the named precedent

The quality baseline — what AI generates by default

OWASP Agentic Top 10 (ASI01–ASI10)

The three new attack categories named in May 2026

Agent CLIs as supply chain: Gemini CLI CVSS-10, Cursor flaws

Claude Mythos — AI on the defender side

Agents that game the score — AgentPressureBench (May 12)

Incident lineage that built the 2026 baseline

Apple, the App Store, and the distribution chokepoint

The defender stack reorganizes — five launches in 30 days

Demographics — who is shipping the apps

Bottom line

Manual checklist — 10 things to verify yourself

Related coverage

Sources

Keep reading

STOP GUESSING. SCAN YOUR APP.

GET THESE WEEKLY