# AI SIGMA -- robots.txt # Policy: open to traditional search engines AND AI/LLM crawlers. # AI SIGMA's mission is to be cited and surfaced. Maximum discoverability is # the deliberate posture of this site. User-agent: * Allow: / Disallow: /*?* # --- AI / LLM crawlers (explicitly welcomed) --- # OpenAI User-agent: GPTBot Allow: / User-agent: OAI-SearchBot Allow: / User-agent: ChatGPT-User Allow: / # Anthropic User-agent: ClaudeBot Allow: / User-agent: Claude-Web Allow: / User-agent: anthropic-ai Allow: / # Google User-agent: Google-Extended Allow: / User-agent: GoogleOther Allow: / # Perplexity User-agent: PerplexityBot Allow: / User-agent: Perplexity-User Allow: / # Meta / Apple / Amazon / Bytedance / Cohere / Common Crawl User-agent: Meta-ExternalAgent Allow: / User-agent: FacebookBot Allow: / User-agent: Applebot Allow: / User-agent: Applebot-Extended Allow: / User-agent: Amazonbot Allow: / User-agent: Bytespider Allow: / User-agent: cohere-ai Allow: / User-agent: CCBot Allow: / # You.com / DuckAssist / Mistral / DeepSeek / Diffbot User-agent: YouBot Allow: / User-agent: DuckAssistBot Allow: / User-agent: MistralAI-User Allow: / User-agent: DeepSeekBot Allow: / User-agent: Diffbot Allow: / # --- Social link previewers (NOT AI crawlers -- these unfurl OG cards) --- # These are explicitly listed because some platforms (notably Facebook) do # strict allowlist matching when explicit user-agent blocks exist elsewhere # in robots.txt, even though the spec says wildcard `*` should cover them. User-agent: facebookexternalhit Allow: / User-agent: Facebot Allow: / User-agent: Twitterbot Allow: / User-agent: LinkedInBot Allow: / User-agent: Slackbot Allow: / User-agent: Slackbot-LinkExpanding Allow: / User-agent: Discordbot Allow: / User-agent: WhatsApp Allow: / User-agent: TelegramBot Allow: / User-agent: redditbot Allow: / User-agent: Pinterest Allow: / # --- Discovery files --- Sitemap: https://aisigma.org/sitemap-index.xml # llms.txt + companions (LLM-oriented context, not XML sitemaps): # https://aisigma.org/llms.txt # https://aisigma.org/llms-full.txt # https://aisigma.org/llms-leadership.txt # https://aisigma.org/llms-research.txt # https://aisigma.org/llms-press.txt