About the scanner bot
User-Agent
Our scanner identifies itself as:
CanAISeeBot/1.0 (+https://canaisee.com/about-bot)What it does
When a visitor enters your URL, our scanner fetches a small set of files from your site: the canonical URL, /robots.txt, /llms.txt, /ai.txt, /sitemap.xml, /.well-known/mcp.json, /.well-known/agent-card.json, and an optional .md mirror. For one check, it additionally loads the canonical URL in a headless Chromium instance to compare the rendered text to the plain HTML.
Rate and politeness
The scanner honors robots.txt. Each scan produces at most a handful of requests against your origin, and we cache results for several hours so repeated shares of a scorecard don't retrigger fetches.
Blocking the scanner
Add this to your robots.txt to decline:
User-agent: CanAISeeBot
Disallow: /Takedown for a specific scan
If a scorecard permalink exists for your site and you want it removed from the public archive, email takedowns@canaisee.com with the URL.