Discover public URLs from an existing sitemap or a homepage crawl, then export clean sitemap XML.
No login and no database. Results stay in this browser tab until you close it.
What happens when you generate
We read robots.txt, common sitemap paths, or homepage links.
We keep same-site pages and skip assets, feeds, admin paths, and files.
Each URL gets a standard loc entry and a fresh lastmod value.
Publish the result as /sitemap.xml and reference it in robots.txt.
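As a rough illustration of the export step, the sketch below builds a urlset document with one loc and one lastmod per URL. It is not the generator's actual code: the function name build_sitemap and the choice to stamp every entry with today's date are assumptions made for the example.

```python
from datetime import date
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls, lastmod=None):
    """Return sitemap XML text for the given URLs (hypothetical helper)."""
    # Assumption: "fresh lastmod" is modeled as today's date in ISO format.
    lastmod = lastmod or date.today().isoformat()
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as "&" inside text nodes.
        ET.SubElement(entry, "loc").text = url
        ET.SubElement(entry, "lastmod").text = lastmod
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


if __name__ == "__main__":
    print(build_sitemap(["https://example.com/", "https://example.com/pricing"]))
```

Once the file is live at /sitemap.xml, a robots.txt line of the form "Sitemap: https://example.com/sitemap.xml" points crawlers to it (example.com stands in for your own domain).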
The generator favors safe, same-origin URLs and gives you a file you can publish without an account.
Uses your current sitemap when one is already discoverable.
Falls back to internal links when no sitemap is present.
Skips off-site links, feeds, assets, admin endpoints, and API files.
Exports a standard urlset sitemap search engines understand.
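The skip rules above can be pictured as a single predicate applied to each discovered link. This is a minimal sketch under stated assumptions: the extension and path-prefix lists are illustrative guesses, not the generator's real rules, and the host comparison is kept exact for brevity.

```python
import posixpath
from urllib.parse import urlsplit

# Assumed examples of asset, feed, and file extensions to skip.
SKIP_EXTENSIONS = {
    ".css", ".js", ".png", ".jpg", ".jpeg", ".gif", ".svg",
    ".ico", ".pdf", ".zip", ".xml", ".json",
}
# Assumed examples of admin, API, and feed path prefixes to skip.
SKIP_PREFIXES = ("/admin", "/wp-admin", "/api", "/feed")


def is_candidate(url, site_host):
    """Return True if url looks like a public page on site_host (hypothetical helper)."""
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https"):
        return False  # mailto:, tel:, javascript: and similar
    if (parts.hostname or "") != site_host:
        return False  # off-site link
    path = parts.path.lower() or "/"
    # Match whole path segments so "/administrators" is not treated as "/admin".
    if any(path == p or path.startswith(p + "/") for p in SKIP_PREFIXES):
        return False
    return posixpath.splitext(path)[1] not in SKIP_EXTENSIONS
```

For example, is_candidate("https://example.com/pricing", "example.com") is accepted, while a link to /logo.png or /api/v1/users on the same host is rejected.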
Why it matters
If a sitemap already exists, use it as the source of truth. If it does not, the homepage crawl gives you a compact first draft.
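One way to check whether a sitemap already exists is to read the Sitemap lines in robots.txt and then try a few well-known paths. The sketch below only computes the candidate URLs from robots.txt text you have already fetched. The function name and the fallback path list are assumptions for illustration, not the tool's documented behavior.

```python
from urllib.parse import urljoin

# Assumed fallback locations; real sites may use other paths.
COMMON_SITEMAP_PATHS = ("/sitemap.xml", "/sitemap_index.xml")


def sitemap_candidates(base_url, robots_txt):
    """List sitemap URLs to try, declared ones first (hypothetical helper)."""
    declared = []
    for line in robots_txt.splitlines():
        # Drop trailing comments, then split "Field: value" on the first colon.
        line = line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap" and value.strip():
            declared.append(value.strip())
    fallbacks = [urljoin(base_url, path) for path in COMMON_SITEMAP_PATHS]
    # dict.fromkeys removes duplicates while preserving order.
    return list(dict.fromkeys(declared + fallbacks))
```

If none of the candidates returns a usable sitemap, the homepage crawl takes over as the source of URLs.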
The homepage is listed first, then the remaining same-site URLs are sorted by path depth, shallowest first.
Discovery treats www and non-www host variants as the same site when filtering URLs.
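The ordering and host matching described here can be sketched as follows. The helper names are hypothetical, and the alphabetical tie-break between URLs of equal depth is an assumption, since the page does not say how ties are resolved.

```python
from urllib.parse import urlsplit


def site_key(url):
    """Lowercased host with a leading "www." removed (hypothetical helper)."""
    host = (urlsplit(url).hostname or "").lower()
    return host[4:] if host.startswith("www.") else host


def path_depth(url):
    """Number of non-empty path segments; the homepage has depth 0."""
    return len([seg for seg in urlsplit(url).path.split("/") if seg])


def order_urls(urls, homepage):
    """Keep URLs on the homepage's site and sort them shallowest first."""
    key = site_key(homepage)
    same_site = {u for u in urls if site_key(u) == key}
    # Assumption: ties at the same depth are broken alphabetically.
    return sorted(same_site, key=lambda u: (path_depth(u), u))
```

Because the homepage has depth 0, it naturally sorts ahead of every other page, and a link to www.example.com is kept when the homepage is example.com.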
After publishing the sitemap, generate an llms.txt file from those same pages.
No login. Paste a domain, review the URLs, then copy or download the XML.