How We Used Agentic AI to Fix Kong Gateway's Flakiest Tests
Each change to Kong Gateway's codebase triggers a comprehensive test suite that runs more than 17,000 * 2 = 34,000 test cases among the two primary architectures (x86 and ARM) we support. This process takes about 23.5 hours on a single machine. But we don't wait that long. A large fleet of machines runs the suite in parallel, and we shard the work aggressively so each commit finishes in a fraction of that time. That setup works well, right up until flaky tests get involved.