Data-driven insights on flaky tests, AI testing, test maintenance, and the future of QA.
53% of AI-generated code contains security vulnerabilities. Iterative AI revision makes it worse. If AI can't write secure code, can it write meaningful tests?
Read post →Most self-healing tools are backup selectors in a priority queue. When a selector breaks because the feature broke, it 'heals' and hides the bug.
Read post →Average postmortem: 3-7 action items. Under 40% completed in 90 days. 60% never completed. 'Add a regression test' always loses to feature work.
Read post →Multiple tools are moving from CSS/XPath selectors to the browser's accessibility tree. MCP is becoming the standard way AI agents interact with browsers.
Read post →Hasura deleted 95%, Nubank deleted all, another team deleted half. Same pattern: flaky tests weren't catching bugs. Deleting without safety nets differs.
Read post →The scripting part of QA is exactly what AI automates. The judgment and strategy part isn't. But most QA roles are still defined around execution, not strategy.
Read post →Compliance platforms automate 80% of infra monitoring, but app-level testing evidence is still manual screenshots. 2 engineers, 2 weeks, every audit.
Read post →Currents.dev analyzed 400M test records: Playwright flake rate 0.72% vs Cypress 0.83%. But Cypress masks race conditions, so week one is terrifying.
Read post →55% of teams spend 20+ hrs/week maintaining e2e tests. One fintech: 23 hrs/week just updating tests for UI changes. Hasura deleted 95% of theirs.
Read post →53% of AI-generated code has security vulns. After 5 rounds of GPT-4o fixes, code had 37% MORE vulns. Creation is accelerating faster than verification.
Read post →Why tests pass locally but fail in CI: timing/race (40%), environment (30%), resource contention (15%), test isolation (15%). Full taxonomy.
Read post →79% cite AI as the most impactful testing tech. 46% distrust AI testing accuracy. 30% of AI testing projects get abandoned after POC.
Read post →Google: 84% of CI pass-to-fail transitions are flaky. Buildkite: 800M seconds of flaky re-runs per month. One team deleted half their tests, bugs went down.
Read post →