Autonomous QA
Point it at your web app. It logs in as every user, tests every page, captures every API call, and tells you what broke.
Preflight
1/29 · Sonnet 4.6
Before anything runs, mentu checks that your dev server is up, the browser is reachable, and your test users can log in. If any of these fail, the run stops here. Not three steps later.
46s · $0.14
Scaffold Data
2/29 · Sonnet 4.6
The agent creates test data in your database. Suppliers, tenders, purchase orders, invoices. Then it queries every table to verify counts match. If the data isn't right, nothing downstream will be either.
5m6s · $0.74
Scaffold Verify
3/29 · skipped
This step already passed in a prior run. The engine skips it. No reason to verify data that hasn't changed.
0s · $0
Login
2 steps · parallel
Two agents launch at the same time. One logs in as a buyer, the other as a supplier. Each types credentials into a real browser, submits the form, and extracts the auth token. Both write reports.
2m10s · $0.56 · 2/2 passed
Browser Tests
15 steps · parallel burst
The big one. 15 agents run at once, each testing a different page. Every agent logs in, navigates, records all network traffic, checks the DOM, counts errors, takes a screenshot, and writes a report. Dashboard, invoices, analytics, settings -- every page your users touch.
16m9s · $4.20 · 15/15 passed
Detail + Fault Injection
6 steps · parallel
Now mentu breaks things on purpose. It injects server errors, timeouts, and permission denials on critical pages. The test: does your app show an error message, or does it crash?
5m33s · $1.95 · 6/6 passed
API Error Analysis
27/29 · Sonnet 4.6
Every browser test captured its network traffic. This step reads all of it and looks for any API call that returned an error. Zero errors across 149 calls.
32s · $0.11
Auth Cross-Audit
28/29 · Sonnet 4.6
Did any auth token leak between buyer and supplier sessions? This step reads every report and checks. Org IDs, JWT consistency, request authentication -- across both portals, across 149 API calls.
1m12s · $0.38
Result
29 steps complete
Done. 28 of 29 steps passed. The engine saves two patterns to the recipe library for future runs. Every report, screenshot, and network capture is on disk.
Wave 1/9 · $0.14
Key numbers
See also
- Circuit Breaker -- what happens when an agent gets stuck
- Workflow Patterns -- how recipes compose
- Local Inference -- same recipe, $0, local models