Allow the audit user agent
If your WAF blocks unknown bots, allow the audit user agent for public HTML, robots.txt, sitemap.xml, and llms.txt.
User-Agent: layzr.ai-agentic-audit/1.0
The free sitemap and llms.txt generators need to fetch public pages and root files. If your host returns a challenge, 403, 429, or 503, use this checklist before rerunning the tool.
If your WAF blocks unknown bots, allow the audit user agent for public HTML, robots.txt, sitemap.xml, and llms.txt.
User-Agent: layzr.ai-agentic-audit/1.0
Security challenges are useful for forms and private routes, but they often block crawler-facing files that should stay public.
/robots.txt /sitemap.xml /llms.txt /llms-full.txt
Cloudflare, Vercel, and other edge providers can challenge automated requests before your app receives them.
Look for bot fight mode, WAF rules, rate limits, and security checkpoints.
When crawlers cannot execute your app, static root files are the most reliable discovery path.
https://example.com/robots.txt https://example.com/sitemap.xml https://example.com/llms.txt
What happened
The generators stop when they cannot safely read the site. We do not try to bypass bot protection, solve challenges, or store blocked responses.