# Gojo Ethiopian Restaurant Robots.txt # https://www.gojoethiopianrestaurant.ca # Main directives for all bots User-agent: * Allow: / Disallow: /components/ Disallow: /config/ Disallow: /search/ Disallow: /account/ Disallow: /api/ Disallow: /static/ Disallow: /*?*author=* Disallow: /*?*tag=* Disallow: /*?*month=* Disallow: /*?*view=* Disallow: /*?*format=* # Block legal pages from all crawlers Disallow: /privacy-policy.html Disallow: /terms-of-use.html # Search engine crawlers — block legal pages and prevent llms.txt from being indexed. # Note: the `Noindex:` directive in robots.txt was deprecated by Google in Sept 2019 and is # ignored. We use `Disallow: /llms.txt` to block crawling, and keep `Noindex:` as a legacy # hint for crawlers that may still honour it (e.g. some Bing/Yandex variants). # For a guaranteed noindex on the .txt response, also serve llms.txt with the # `X-Robots-Tag: noindex` HTTP header at the web-server / CDN level. User-agent: Googlebot Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /llms.txt Noindex: /llms.txt User-agent: Bingbot Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /llms.txt Noindex: /llms.txt User-agent: Yandex Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /llms.txt Noindex: /llms.txt User-agent: DuckDuckBot Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /llms.txt Noindex: /llms.txt # AI Bot specific directives - allow access to llms.txt for grounding User-agent: GPTBot Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: ChatGPT-User Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: CCBot Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: anthropic-ai Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: Claude-Web Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: ClaudeBot Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: cohere-ai Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: Google-Extended Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: PerplexityBot Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: Applebot-Extended Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ User-agent: meta-externalagent Allow: /llms.txt Allow: / Disallow: /privacy-policy.html Disallow: /terms-of-use.html Disallow: /components/ # Google Ads Bot User-agent: AdsBot-Google User-agent: AdsBot-Google-Mobile User-agent: AdsBot-Google-Mobile-Apps Allow: / # Crawl delay for heavy bots User-agent: Baiduspider Crawl-delay: 10 User-agent: Sogou Crawl-delay: 10 # Sitemap location Sitemap: https://www.gojoethiopianrestaurant.ca/sitemap.xml # LLMs.txt location for AI agents # https://www.gojoethiopianrestaurant.ca/llms.txt