PATTERNS WE KEEP FINDING
The same handful of bugs keeps showing up across Lovable, Bolt, Cursor, Replit, and V0 apps. Not a corpus claim — a pattern we can show you, on a live URL, right now.
There is a difference between telling you a bug is common and showing you the bug, on a URL, with a curl command you can run yourself. Most security blogs do the first. This series does the second.
Every article points at a scenario on gapbench.vibe-eval.com — a public security benchmark we operate, currently 104 scenarios. Hit the URL, see the bug, run our scanner, see the finding. No corpus claims, no anonymous client name-drops, no “we scanned 1,500 apps” handwaving. The pattern, the live demo, the detection.
Start here
- Why we built gapbench, and why every heuristic scanner needs a ref0 — the manifesto. Read this first if you want the reasoning behind the whole series.
- False positives and the ref0 control — how we calibrate. The methodology behind every detection.
Auth
- BOLA in AI-generated CRUD — the missing ownership check
- JWT alg=none is not dead — your AI-generated auth might be running it
- Mass assignment — when the AI lets the user set is_admin: true
- Magic links, OTP, and password resets — the auth flows AI generators get half right
- Cookie scope, TLS downgrade, OAuth Referer leak, PKCE downgrade — the ‘almost secure’ auth gaps
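The ownership gap behind the first bullet fits in a few lines. A minimal sketch, assuming a hypothetical order store and handler names — not code from any real scenario:

```python
# Hypothetical data store standing in for a CRUD app's orders table.
ORDERS = {
    1: {"owner": "alice", "total": 42},
    2: {"owner": "bob", "total": 99},
}

def get_order_vulnerable(order_id, current_user):
    # The BOLA shape: the record is looked up by id alone, so any
    # authenticated user can read any other user's order.
    return ORDERS.get(order_id)

def get_order_fixed(order_id, current_user):
    order = ORDERS.get(order_id)
    # The one-line fix: verify the record actually belongs to the caller.
    if order is None or order["owner"] != current_user:
        return None
    return order
```

The vulnerable version is what id-based autocomplete tends to produce; the ownership check is the part that has to come from the spec, not the pattern.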
Trust boundaries
- SSRF, open redirects, and OAuth redirect_uri — the URL-trust trifecta
- CORS = * with credentials = true — the misconfiguration that voids your CSRF protection
- WebSocket origin, DNS rebinding, audit log tamper — the trust-boundary trio
- Stripe trust on the wrong side — webhook signatures skipped, paid-flag tampering
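The webhook-signature bullet comes down to one HMAC comparison. A minimal sketch of the core step — Stripe's real `Stripe-Signature` scheme also includes a timestamp to stop replays, and the secret below is fabricated:

```python
import hashlib
import hmac

def verify_webhook(payload: bytes, signature_hex: str, secret: bytes) -> bool:
    # Recompute the HMAC over the raw request body and compare in
    # constant time. Skipping this check is the "trust on the wrong
    # side" bug: anyone who can reach the endpoint can mark an order paid.
    expected = hmac.new(secret, payload, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, signature_hex)

secret = b"whsec_example"  # hypothetical secret, not a real key
body = b'{"event": "payment_intent.succeeded", "paid": true}'
good_sig = hmac.new(secret, body, hashlib.sha256).hexdigest()
```

Note the comparison runs over the raw bytes, before any JSON parsing — re-serialized bodies rarely match the signed payload byte for byte.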
Data exposure
- The Supabase service-role key in your frontend bundle
- Source maps and .git in production — Next.js leaks you didn’t know you shipped
- GraphQL introspection on, Swagger with bearer prefilled, gRPC reflection — your API docs are an attack surface
Infrastructure
- Naked databases on the public internet — Postgres, Redis, Mongo with no auth
- S3 public buckets, subdomain takeover, GCP metadata SSRF — the cloud misconfigurations LLMs autocomplete
- Hosting panel bypass, staging env exposed, internal tools without login — the ‘internal’ surface that isn’t
- Poisoned CI actions, leaked Terraform state, Docker registry creds — supply chain in the GitHub Actions era
- Request smuggling, CRLF splitting, cache poisoning — the proxy-layer attacks AI generators are blind to
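The CRLF-splitting bullet is the simplest of the proxy-layer trio to show in code. A minimal sketch, with a hypothetical redirect handler — real frameworks sit between you and the wire, but the failure mode is the same:

```python
def build_redirect_vulnerable(location: str) -> bytes:
    # Interpolating user input straight into a header lets an embedded
    # \r\n terminate the header and start a new one (or a smuggled body).
    return f"HTTP/1.1 302 Found\r\nLocation: {location}\r\n\r\n".encode()

def sanitize_header_value(value: str) -> str:
    # The usual fix: reject CR/LF before the value reaches the wire.
    if "\r" in value or "\n" in value:
        raise ValueError("CR/LF in header value")
    return value

payload = "https://example.com/\r\nSet-Cookie: session=attacker"
```

With the payload above, the vulnerable builder emits an attacker-controlled `Set-Cookie` header; the sanitized path refuses it outright.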
Frontend / JS
- Prototype pollution, DOM clobbering, postMessage without origin — JS-only attacks AI rarely guards against
- LLM-rendered HTML and Markdown — the XSS vector your AI feature shipped with
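The LLM-rendered-HTML bullet has a one-function fix: treat model output as untrusted user input. A minimal Python sketch of the principle — the rendering helpers are hypothetical, and real apps would escape (or sanitize) in the frontend templating layer:

```python
import html

def render_llm_output_vulnerable(model_text: str) -> str:
    # Dropping model output into an innerHTML-style template executes any
    # markup the model (or a prompt-injected document) happens to emit.
    return f"<div class='answer'>{model_text}</div>"

def render_llm_output_escaped(model_text: str) -> str:
    # Escape before rendering, exactly as you would for user input.
    return f"<div class='answer'>{html.escape(model_text)}</div>"

injected = 'Sure! <img src=x onerror="alert(1)">'
```

If you need the model's Markdown rendered rather than escaped, run the generated HTML through an allowlist sanitizer instead — escaping is the floor, not the only option.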
Agents and LLMs
- MCP servers without auth — the prompt that ran rm -rf
- RAG poisoning via public uploads — the knowledge base attack surface
- Indirect prompt injection and tool-output loops — when the model trusts what it just printed
Injections and primitives
- Zip-slip, unrestricted upload, SVG XXE — the file-upload trio (and the cousin attacks)
- Insecure deserialization, LDAP/XPath/MIME injection — the long-tail injections AI still produces
- ReDoS and weak randomness — when the autocomplete picks the wrong primitive
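The zip-slip member of the file-upload trio has a canonical containment check. A minimal sketch, assuming a hypothetical extraction helper — real extractors also have to handle symlinked entries:

```python
import os

def safe_extract_path(dest_dir: str, member_name: str) -> str:
    # Zip-slip: an archive entry named "../../etc/cron.d/x" escapes the
    # destination unless the resolved path is checked against the root.
    dest_root = os.path.realpath(dest_dir)
    target = os.path.realpath(os.path.join(dest_root, member_name))
    if os.path.commonpath([dest_root, target]) != dest_root:
        raise ValueError(f"blocked path traversal: {member_name!r}")
    return target
```

Resolving with `realpath` before the comparison is the load-bearing step; string-prefix checks on the unresolved path are exactly the shortcut that autocomplete reaches for and that `..` segments defeat.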
How to use this series
Every article follows the same loose shape — pattern, demo URL, why the AI does it, how we catch it, what to do. We deliberately avoid a rigid template; each piece reads like a conversation with someone who has seen the bug too many times. The structure is the URL: gapbench.vibe-eval.com/site/&lt;scenario&gt;/ is up right now. Hit it, the bug is real, and the finding reproduces.
If you’ve read one of the data studies and want the anatomy, this is where the anatomy lives. If you’ve read a case study and want to verify the pattern is real, this is where you verify it.
RUN THE SAME SCAN WE RUN
Point VibeEval at your app. The detections described in this series are the same ones that fire.