NumberSleven · Informational
How Perplexity and ChatGPT Decide What to Cite — A Practical Guide for Businesses
Citation vs ranking — a fundamental difference
In traditional search, your goal is to rank in position 1–3 for target keywords. In AI search, your goal is to be cited in the synthesised answer. These are different objectives that require different strategies.
Ranking is about winning a competition among pages. Citation is about being the clearest, most unambiguous, most authoritative source for a specific claim or answer. You can rank well and never be cited by AI engines. You can have a modest backlink profile and be cited consistently if your content is structured correctly.
The entity resolution step
Before an AI engine cites a business, it resolves the entity — it determines who you are, whether you’re the same entity mentioned elsewhere on the web, and what category you belong to. This resolution happens through several signals working together.
Your Organisation schema with sameAs links to your LinkedIn, Google Business Profile, and Crunchbase (if applicable) creates a consistent entity record that AI engines can match across sources. Without these links, your business may appear as separate entities in different contexts — “NumberSleven” and “NumberSleven FZ LLC” may be treated as two different things.
Entity disambiguation is the first thing a GEO audit checks. Businesses with strong entity consistency get cited more reliably than businesses with inconsistent names, descriptions, and category signals across their digital presence.
What Perplexity specifically looks for
Perplexity crawls the web in near-real-time (PerplexityBot) and synthesises answers from current sources. Its citation preferences show a strong bias toward pages with clear structured data, explicit service/product definitions, and FAQ content that directly answers the query.
For a query like “AI automation consultant Dubai,” Perplexity will typically cite 3–5 sources. The selection criteria, based on observable citation patterns, weight: page specificity (a service page beats a homepage), structured data presence, explicit mention of the query location in content and schema, and whether the page answers likely follow-up questions within the same content.
What ChatGPT cites and when
ChatGPT with Browse (GPTBot) has different citation behaviour from Perplexity. It tends to synthesise more heavily from its training data for established topics, and relies on real-time crawling for recent or specific queries. For local business queries in the GCC — where training data coverage is thinner than for US or UK markets — real-time crawling matters more, which means structured data and robots.txt configuration for GPTBot are especially important.
The practical implication for UAE businesses: explicitly allowing GPTBot in your robots.txt and ensuring your service pages have clear JSON-LD schema is more impactful here than it would be for a London-based business with extensive training data coverage.
The FAQ effect
FAQPage schema is disproportionately influential in AI citation decisions. When your website contains an explicit question and a direct, factual answer — marked up in FAQPage schema — AI engines often use that content directly when answering similar queries. This is the closest thing to a reliable shortcut in GEO optimisation.
The questions that perform best in FAQs for citation purposes are the exact questions your target customers ask before buying — not the questions you want to answer, but the questions they actually have. These come from sales call notes, support tickets, LinkedIn DMs, and competitor review pages.
Ready to start?
Book a Discovery Sprint
45 minutes. Fixed fee. Written summary within 48 hours. Credited toward any package if you proceed.