← Home

Frequently asked questions about AI search visibility

Twenty straight answers to the questions we hear every week about getting cited by ChatGPT, Claude, Gemini, and Perplexity.

How do I show up in ChatGPT?

Allow OpenAI's crawlers (GPTBot, OAI-SearchBot, ChatGPT-User) in robots.txt, publish an llms.txt, write each page to answer one specific question in the first paragraph, add FAQ and HowTo schema, and earn citations from third-party sources like Reddit, Wikipedia, and industry publications. The free analyzer on the home page checks all of these signals in under thirty seconds.

How do I show up in Google Gemini?

Allow Google-Extended in robots.txt (this is a separate consent from regular Googlebot), keep your site indexed in Google Search, add Article and FAQPage schema, and make sure your YouTube videos have full transcripts because Gemini pulls heavily from them. If you block Google-Extended you will appear in Google Search but not in Gemini's AI answers. Full playbook in our Gemini guide.

How do I show up in Claude?

Allow ClaudeBot and Claude-Web in robots.txt, earn mentions on authoritative sources Claude already trusts (Wikipedia, official documentation, well-known publications), write long-form structured content with clear citations, and add Organization and Person schema so Anthropic's models can verify who is speaking. Full playbook in our Claude guide.

Is ranking in ChatGPT the same as ranking in Google?

No. Google ranks pages and shows them in a list. ChatGPT picks sentences and stitches together an answer with citations. The work overlaps — clean technical SEO and quality content help both — but generative engines weight clarity and third-party corroboration more than backlinks, and they reward question-shaped headings far more than Google does.

Does ChatGPT actually read my website?

Yes, in two ways. GPTBot crawls sites for model training, and OAI-SearchBot fetches pages live when a ChatGPT user asks a question that requires fresh information. If either is blocked in your robots.txt, you are invisible to ChatGPT for that purpose.

What is llms.txt?

llms.txt is a proposed standard (championed by Jeremy Howard and adopted by Anthropic, Mistral, and a growing list of others) for telling large language models which parts of your site matter. It is a markdown file at the root of your domain that lists your most important pages with short, plain-text summaries. ChatGPT, Claude, and Gemini already use it as a hint when present.

How is llms.txt different from robots.txt or sitemap.xml?

robots.txt controls crawler access — yes or no, per user-agent. sitemap.xml lists every URL on your site so search engines can discover them. llms.txt is a human-and-LLM-readable summary of your most important pages, formatted as markdown with short descriptions. Think of it as a TL;DR for AI models.

How long does it take to start showing up in ChatGPT?

Live web search results (OAI-SearchBot, Perplexity, Gemini's AI Overviews) can pick up changes within days. Showing up inside a model's baked-in knowledge — without a live search — takes longer, typically one to two training refresh cycles, which is roughly a few months for the major models.

Do I need to use FAQ schema or HowTo schema?

Strongly recommended. FAQPage and HowTo schema are the structured-data formats generative models lift most often. They give the model a clean, unambiguous source of question-answer pairs and step-by-step instructions, which is exactly what it needs to produce an answer.

Which AI crawlers should I allow in robots.txt?

At minimum: GPTBot (OpenAI training), OAI-SearchBot (ChatGPT live web search), ChatGPT-User (user-triggered fetches), ClaudeBot and Claude-Web (Anthropic), Google-Extended (Google's Gemini training), PerplexityBot, CCBot (Common Crawl, used by many models), and Applebot-Extended. The analyzer flags any that are blocked on your site.

Should I block AI bots to protect my content?

It is a real tradeoff. Blocking means your content cannot be used for training, but it also means you will not appear in those models' answers. For most businesses, the visibility upside is much larger than the training concern — you are already public on the web. For publishers with paid content, the calculus is different.

Does Cloudflare block AI bots by default?

In mid-2024 Cloudflare started shipping AI-bot blocking enabled by default for some plans. If your site is behind Cloudflare and you have not checked recently, log in to the dashboard, find the AI bots section, and make sure the bots you want to allow are explicitly permitted. This is one of the most common "invisible to AI" causes we see.

How do I know if I am actually being cited by ChatGPT?

Three signals to watch. (1) Check your server logs for hits from OAI-SearchBot or ChatGPT-User — those are live citation fetches. (2) Set up Bing Webmaster Tools, because OpenAI uses Bing's index for parts of ChatGPT Search. (3) Manually query ChatGPT (with web search on) for the topics you should rank on and see if your domain appears in the citations.

What is the single most important thing I can do?

Make sure your robots.txt explicitly allows GPTBot, OAI-SearchBot, ClaudeBot, Google-Extended, and PerplexityBot. About 40% of small business sites we audit have at least one of these blocked, usually unintentionally. Fixing this is free and takes five minutes.

Does the analyzer store my URL or my data?

No. The analyzer fetches the URL you submit, scores it, and returns the report. The URL and the report are not stored, sold, or used for training. See the privacy policy for full detail.

Is the analyzer really free?

Yes. Free, no signup, no rate limit on normal use. We pay for it because it is a lead source for our paid SEO service at GoRankPilot — the better the free tool, the more visitors learn about us.

Why does my site score low even though it ranks on Google?

Google SEO and AI-search visibility overlap but are not the same. Common reasons a Google-ranked site scores low here: AI bots blocked in robots.txt, no llms.txt, no FAQ schema, no answer-first content structure, or content hidden behind JavaScript that AI crawlers cannot render.

Does the analyzer work on sites behind a login?

No. The analyzer fetches your page anonymously the way a real AI crawler would. Pages behind login walls cannot be scored — but they also cannot be cited by ChatGPT, which is the point.

What if my site uses Next.js, Webflow, Shopify, or WordPress?

All work fine. The analyzer cares about the rendered HTML, not the platform. If your platform requires JavaScript to render the main content, you should ship server-side rendering or pre-rendering — most AI crawlers do not execute JavaScript.

Will this tool fix my site for me?

The analyzer tells you what is wrong. If you want it actually fixed and monitored over time, that is what GoRankPilot does — done-for-you SEO content that ships robots.txt, schema, llms.txt, FAQ pages, and weekly blog posts to your site on autopilot.

Still have a question? Email hello@webair.ai and we'll add it here if it's useful for other readers.