canaisee

About the scanner bot

User-Agent

Our scanner identifies itself as:

CanAISeeBot/1.0 (+https://canaisee.com/about-bot)

What it does

When a visitor enters your URL, our scanner fetches a small set of files from your site: the canonical URL, /robots.txt, /llms.txt, /ai.txt, /sitemap.xml, /.well-known/mcp.json, /.well-known/agent-card.json, and an optional .md mirror. For one check, it additionally loads the canonical URL in a headless Chromium instance to compare the rendered text to the plain HTML.

Rate and politeness

The scanner honors robots.txt. Each scan produces at most a handful of requests against your origin, and we cache results for several hours so repeated shares of a scorecard don't retrigger fetches.

Blocking the scanner

Add this to your robots.txt to decline:

User-agent: CanAISeeBot
Disallow: /

Takedown for a specific scan

If a scorecard permalink exists for your site and you want it removed from the public archive, email takedowns@canaisee.com with the URL.