{"id":207395,"date":"2025-12-10T12:04:35","date_gmt":"2025-12-10T12:04:35","guid":{"rendered":"https:\/\/www.similarweb.com\/blog\/?p=207395"},"modified":"2026-03-01T13:24:42","modified_gmt":"2026-03-01T13:24:42","slug":"llms-txt","status":"publish","type":"post","link":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/","title":{"rendered":"What Is Llms.txt? Reality vs. Hype"},"content":{"rendered":"<p>In the early days of the web, search engines had a fundamental need: a standard way to <b>determine where to crawl<\/b>. That\u2019s why robots.txt became so important.<\/p>\n<p>One small text file at the root of a site provided crawlers with a predictable, machine-readable set of rules, and over time, it evolved into an unofficial but widely respected standard.<\/p>\n<p>We\u2019re in a similar transition point now, with one main difference: The consumer of your content is often <b>a large language model (LLM)<\/b>, not just a classic search engine bot.<\/p>\n<p>Instead of just indexing pages and ranking blue links, Gen AI engines <b>synthesize answers<\/b> from multiple sources.<\/p>\n<p>At the same time, they have very different constraints: limited context windows, difficulty dealing with complex HTML, and (in many implementations) no persistent, full-site index in the traditional sense.<\/p>\n<p>That\u2019s the context behind <b>llms.txt<\/b>: a proposal for a root-level, Markdown-based file that doesn\u2019t tell bots where <b>not<\/b> to go (like robots.txt), but instead tells AI systems <b>which pages you consider most important and how to interpret them.<\/b><\/p>\n<p>Over the last year, discussion around llms.txt has split into camps, each with a different POV about the importance of this new file. However, it often seems like everyone is ignoring the real question SEOs should be asking about llms.txt:<\/p>\n<p><strong>\u201cGiven limited time and resources, does llms.txt deserve a place in our SEO and AI visibility strategy? And if so, where and how?\u201d<\/strong><\/p>\n<p>This article is my data-informed answer and opinion on:<\/p>\n<ul>\n<li aria-level=\"1\">What llms.txt actually is (and what it isn\u2019t).<\/li>\n<li aria-level=\"1\">How it fits with <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/robots-txt\/\">robots.txt<\/a> and <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/sitemaps\/\">sitemap.xml<\/a>.<\/li>\n<li aria-level=\"1\">What real-world logs and public statements tell us so far.<\/li>\n<li aria-level=\"1\">Where it genuinely adds value (and where it doesn\u2019t).<\/li>\n<li aria-level=\"1\">How I\u2019d decide whether to implement it.<\/li>\n<\/ul>\n<p>Let&#8217;s dive in.<\/p>\n<h2>1. What is llms.txt? The short, non-hype definition<\/h2>\n<p>llms.txt is a small <b>Markdown file<\/b> served at https:\/\/yourdomain.com\/llms.txt that lists a <b>curated set of your most important pages<\/b>, each with a short description, in a tightly defined format.<br \/>\n<img decoding=\"async\" class=\"alignnone size-full wp-image-207406\" src=\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-definition.png\" alt=\"Llms.txt Definition\" width=\"1200\" height=\"628\" srcset=\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-definition.png 1200w, https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-definition-300x157.png 300w, https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-definition-1024x536.png 1024w, https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-definition-768x402.png 768w\" sizes=\"(max-width: 1200px) 100vw, 1200px\" \/><\/p>\n<p>The llms.txt file is designed primarily for <b>inference time <\/b>(i.e., when an LLM or agent is actively answering a user\u2019s question), not for training or broad web indexing.<\/p>\n<p>If you like analogies:<\/p>\n<ul>\n<li aria-level=\"1\">robots.txt: \u201cHere is where you <b>can\u2019t<\/b> crawl.\u201d<\/li>\n<li aria-level=\"1\">sitemap.xml: \u201cHere is everything you <b>could<\/b> crawl.\u201d<\/li>\n<li aria-level=\"1\">llms.txt: \u201cIf you\u2019re trying to answer questions about us, <b>start with these pages<\/b>.\u201d<\/li>\n<\/ul>\n<p>That last line is the key.<\/p>\n<p>As SEOs, we\u2019re used to influencing discovery and indexing. Llms.txt tries to influence <b>prioritization<\/b> under a hard constraint: an LLM can\u2019t load and understand your entire site on every query. You can think of it as a type of &#8220;crawl budget&#8221;.<\/p>\n<h3>1.1. How llms.txt is structured, and why that matters<\/h3>\n<p>The <a href=\"https:\/\/llmstxt.org\/\">official spec<\/a> (proposed by Jeremy Howard\/Answer.AI) defines a strict, simple structure:<\/p>\n<ol>\n<li aria-level=\"1\"><b>H1 title (#)<\/b>\n<ol>\n<li aria-level=\"1\">The name of your site (the only required element).<\/li>\n<li aria-level=\"2\">For an LLM, this is the top-level entity label: \u201cEverything below is about <i>this<\/i>.\u201d<\/li>\n<\/ol>\n<\/li>\n<li aria-level=\"1\"><b>Blockquote summary (&gt;)<\/b>\n<ol>\n<li aria-level=\"1\">A one\/two-sentence summary of what you do and who you serve.<\/li>\n<li aria-level=\"2\">Gives models immediate context even if they only skim the file.<\/li>\n<\/ol>\n<\/li>\n<li aria-level=\"1\"><b>Optional explanatory paragraphs<\/b>\n<ol>\n<li aria-level=\"1\">Used to clarify scope or versioning (e.g., \u201cThis file covers v2 of our public API docs only\u201d).<\/li>\n<li aria-level=\"2\">Helpful when your docs, products, or sites have multiple generations or audiences.<\/li>\n<\/ol>\n<\/li>\n<li aria-level=\"1\"><b>H2 sections (##) that group links by theme<\/b>\n<ol>\n<li aria-level=\"1\">For example: ## Documentation, ## Guides, ## Product, ## Support.<\/li>\n<li aria-level=\"2\">Mirrors how LLMs tend to reason in topical \u201cbuckets,\u201d not in random URL lists.<\/li>\n<\/ol>\n<\/li>\n<li aria-level=\"1\"><b>Bulleted lists under each H2 with Markdown links<\/b>\n<ol>\n<li aria-level=\"1\">Each bullet looks like:<br \/>\n&#8211; [API reference](https:\/\/example.com\/docs\/api): Endpoints, parameters, and examples<\/li>\n<li aria-level=\"1\">That combination of <b>anchor text<\/b>, <b>URL<\/b>, and <b>short factual description<\/b> gives the model strong hints about what each page is for and when to use it.<\/li>\n<\/ol>\n<\/li>\n<li aria-level=\"1\"><b>Optional ## Optional section<\/b>\n<ol>\n<li aria-level=\"1\"><b><\/b>A reserved section name. Links here are explicitly lower priority.<\/li>\n<li aria-level=\"1\">Tools are free to drop these first if the context is tight.<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<h4>Example of a simple llm.txt file:<\/h4>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-207404\" src=\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-example-llmstxt.png\" alt=\"Llms.txt syntax example\" width=\"897\" height=\"513\" srcset=\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-example-llmstxt.png 897w, https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-example-llmstxt-300x172.png 300w, https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-example-llmstxt-768x439.png 768w\" sizes=\"(max-width: 897px) 100vw, 897px\" \/><\/p>\n<p>From an SEO and <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/answer-engine-optimization\/\">AEO<\/a> point of view, this structure matters because it forces you to:<\/p>\n<ul>\n<li aria-level=\"1\"><b>Identify your real \u201csource of truth\u201d pages<\/b>, instead of assuming AI will magically find them.<\/li>\n<li aria-level=\"1\"><b>Describe pages in concise, factual language<\/b>, which is exactly the kind of text LLMs are good at classifying and routing with.<\/li>\n<\/ul>\n<p>You\u2019re not \u201coptimizing the llms.txt file\u201d for rankings, you\u2019re making your <b>content model legible to machines<\/b>.<\/p>\n<h3>1.2. llms-full.txt and .md page variants<\/h3>\n<p>Around this spec, a small ecosystem has emerged:<\/p>\n<ul>\n<li aria-level=\"1\"><b>llms-full.txt<\/b>\n<ul>\n<li aria-level=\"2\">A single Markdown file containing the full text of your documentation corpus.<\/li>\n<li aria-level=\"2\">Used as a convenient ingestion endpoint for AI tools and coding agents.<\/li>\n<li aria-level=\"2\">Docs platforms (like Mintlify) auto-generate this so agents can pull an entire docs set from one URL.<\/li>\n<\/ul>\n<\/li>\n<li aria-level=\"1\"><b>.md versions of docs pages<\/b>\n<ul>\n<li aria-level=\"2\">Each docs URL also has a Markdown variant, typically by appending .md:\n<ul>\n<li aria-level=\"3\">https:\/\/example.com\/docs\/api \u2192 HTML for humans<\/li>\n<li aria-level=\"3\">https:\/\/example.com\/docs\/api.md \u2192 Markdown for AI<\/li>\n<\/ul>\n<\/li>\n<li aria-level=\"2\">The llms.txt spec encourages this pattern. nbdev-based projects under Answer.AI already generate Markdown docs by default.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><strong>Why this matters:<\/strong><\/p>\n<ul>\n<li aria-level=\"1\">Markdown strips layout noise (nav, ads, tracking junk) that wastes tokens.<\/li>\n<li aria-level=\"1\">That reduces <b>cost<\/b> and <b>truncation risk<\/b>, which is essential when agents operate inside finite context windows.<\/li>\n<\/ul>\n<p>If you\u2019re a <b>dev platform<\/b> or <b>docs-heavy product<\/b>, this ecosystem already improves agent reliability and developer experience, regardless of whether Google or OpenAI ever officially \u201cuse\u201d llms.txt.<\/p>\n<h2>2. llms.txt vs robots.txt vs sitemap.xml<\/h2>\n<p>Because llms.txt lives at the domain root and ends in .txt, it\u2019s easy for stakeholders to assume it works like robots.txt. It doesn\u2019t.<\/p>\n<h3>2.1. Three files, three different jobs<\/h3>\n<p>In plain language:<\/p>\n<ul>\n<li aria-level=\"1\"><b>robots.txt: access control<\/b>\n<ul>\n<li aria-level=\"2\">\u201cWhere can crawlers go?\u201d<\/li>\n<li aria-level=\"2\">Manages what is and isn\u2019t crawlable (including some AI-training opt-outs).<\/li>\n<\/ul>\n<\/li>\n<li aria-level=\"1\"><b>sitemap.xml: discovery and coverage<\/b>\n<ul>\n<li aria-level=\"2\">\u201cWhat exists on this site?\u201d<\/li>\n<li aria-level=\"2\">Lists indexable URLs plus metadata like last-modified dates.<\/li>\n<\/ul>\n<\/li>\n<li aria-level=\"1\"><b>llms.txt: curation and interpretation<\/b>\n<ul>\n<li aria-level=\"2\">\u201cIf you only have room for a subset of pages, which ones matter most, and what are they for?\u201d<\/li>\n<li aria-level=\"2\">Doesn\u2019t block anything, and it doesn\u2019t list everything. It highlights a small set of high-signal pages and labels them clearly.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>The spec explicitly states that llms.txt <b>coexists with <\/b>(not replaces) robots.txt and sitemap.xml.<\/p>\n<h3>2.2. Why the distinction matters<\/h3>\n<p>Mis-framing llms.txt leads to bad expectations:<\/p>\n<ul>\n<li aria-level=\"1\">It <b>does not<\/b> control crawling or training behavior.<\/li>\n<li aria-level=\"1\">It <b>does not<\/b> override robots.txt or meta directives.<\/li>\n<li aria-level=\"1\">It <b>does not<\/b> serve as an \u201cAI XML sitemap\u201d.<\/li>\n<\/ul>\n<p>From a strategy perspective:<\/p>\n<ul>\n<li aria-level=\"1\"><b>robots.txt<\/b> is your <b>guardrail<\/b>.<\/li>\n<li aria-level=\"1\"><b>sitemap.xml<\/b> is your <b>catalog<\/b>.<\/li>\n<li aria-level=\"1\"><b>llms.txt<\/b> is your <b>curated reading list with notes, written for AI<\/b>.<\/li>\n<\/ul>\n<p>If you treat llms.txt like a second sitemap and dump 200 URLs into it, you\u2019ve missed the point, and you\u2019re actively making life harder for any agent that tries to use it.<\/p>\n<h2>3. What problem is llms.txt actually trying to solve?<\/h2>\n<p>To decide if llms.txt is worth your attention, you need to be clear on the friction it addresses.<\/p>\n<h3>3.1. How LLM-based systems \u201csee\u201d your site<\/h3>\n<p>Classic search engine crawlers:<\/p>\n<ul>\n<li aria-level=\"1\">Continuously crawl and update large slices of the web.<\/li>\n<li aria-level=\"1\">Render JavaScript where needed.<\/li>\n<li aria-level=\"1\">Maintain long-lived indexes used for ranking.<\/li>\n<\/ul>\n<p>Many LLM-based systems, especially in \u201cbrowsing\u201d modes or agent setups:<\/p>\n<ul>\n<li aria-level=\"1\">Fetch content at <b>query time<\/b>, not as part of an ongoing crawl.<\/li>\n<li aria-level=\"1\">Often, generative engines <b>don\u2019t execute complex JavaScript<\/b>, meaning JS-only content is invisible to them.<\/li>\n<li aria-level=\"1\">Operate within strict <b>token windows<\/b>: they may only \u201csee\u201d a fraction of the HTML they fetch.<\/li>\n<\/ul>\n<p>The result:<\/p>\n<ul>\n<li aria-level=\"1\">Navigation, cookie banners, and clutter often appear before your core content.<\/li>\n<li aria-level=\"1\">Long, dense pages can get <b>truncated<\/b>, cutting off key sections.<\/li>\n<li aria-level=\"1\">Important docs hidden behind JS-heavy nav structures are poorly understood or missed.<\/li>\n<\/ul>\n<p>Similarweb\u2019s <a href=\"https:\/\/www.similarweb.com\/corp\/2025-generative-ai-landscape\/\">research on GenAI<\/a> and publishers shows how these dynamics play out: AI Overviews increasingly <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/search-intent\/\">satisfy user intent<\/a> on the SERP, driving \u201czero-click\u201d behavior and reducing visits to the underlying sites.<\/p>\n<p>That same \u201canswer-first\u201d pattern applies when LLMs browse your site directly: they\u2019re trying to get in, grab exactly what they need, and get out. Quickly.<\/p>\n<h3>3.2. Typical failure modes from an AI visibility standpoint<\/h3>\n<p>Because of those constraints, we see failure patterns like:<\/p>\n<ul>\n<li aria-level=\"1\">AI answers that rely on old blog posts rather than updated ones.<\/li>\n<li aria-level=\"1\">Third-party explainers outranking your <b>own product guides<\/b> as the cited source.<\/li>\n<li aria-level=\"1\">Misstated <b>pricing, limits, or policies<\/b> because the canonical page is long, salesy, or buried.<\/li>\n<li aria-level=\"1\">Incomplete or hallucinated API behavior because reference docs are fragmented or noisy.<\/li>\n<\/ul>\n<p>Traditional SEO is mostly about being <b>discoverable and indexable<\/b>. In the AI era, we also need to be <b>legible and prioritized<\/b> within narrow context windows.<\/p>\n<h3>3.3. llms.txt as a targeted mitigation<\/h3>\n<p>Llms.txt doesn\u2019t pretend to fix everything. But it does attack one narrow yet important question:<\/p>\n<p>If a model can only look at a handful of your pages, how do you help it pick the <b>right<\/b> ones and understand what each is for?<\/p>\n<p>It does that by:<\/p>\n<ul>\n<li aria-level=\"1\">Offering a <b>curated list of high-value URLs<\/b>.<\/li>\n<li aria-level=\"1\">Presenting them in <b>Markdown<\/b> to minimize layout noise.<\/li>\n<li aria-level=\"1\">Including short, <b>explicit descriptions<\/b> (e.g., \u201cAPI v2 reference,\u201d \u201ccurrent self-serve pricing,\u201d \u201cgetting started guide\u201d).<\/li>\n<\/ul>\n<p>From a <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/what-is-geo\/\">GEO<\/a>\/AEO perspective, llms.txt is less about chasing a ranking factor and more about increasing the odds that, if AI tools use your site, they <b>start with the right content<\/b>.<\/p>\n    <div class=\"post-banner post-banner--base\">\n        <div class=\"post-banner__wrapper\">\n            <div class=\"post-banner__text\">\n                                    <p class=\"post-banner__title\">Boost Your AI Visibility<\/p>\n                                    <p class=\"post-banner__subtitle\">Increase your visibility vs. competitors with ease.<\/p>\n                                <div class=\"post-banner__button-wrapper\">\n                                            <a class=\"swui-button swui-button--solid swui-button--primary post-banner__button js-post-banner\"\n                           href=\"https:\/\/account.similarweb.com\/journey\/registration\"\n                           data-disable-dynamic-tracking\n                        >Try Similarweb Now<\/a>\n                                    <\/div>\n            <\/div>\n                    <\/div>\n    <\/div>\n\n<h2>4. How the SEO industry actually sees llms.txt<\/h2>\n<p>If you read across SEO blogs, dev docs, and product updates, four distinct positions show up.<\/p>\n<h3>4.1. The skeptics: \u201cNot worth it (yet)\u201d.<\/h3>\n<p>This point of view is driven by the current adoption reality:<\/p>\n<ul>\n<li aria-level=\"1\">Googlers have publicly stated that <a href=\"https:\/\/bsky.app\/profile\/johnmu.com\/post\/3lrshm4gggs2v\">no Gen AI system currently uses llms.txt<\/a>, and compared it to the <a href=\"https:\/\/www.searchenginejournal.com\/google-says-llms-txt-comparable-to-keywords-meta-tag\/544804\/\">old keywords meta tag<\/a>: something SEOs might obsess over, but that search engines <b>ignore<\/b>.<br \/>\n<img decoding=\"async\" class=\"alignnone size-full wp-image-207396\" src=\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-noai.png\" alt=\"John Muller quote on llms.txt\" width=\"611\" height=\"182\" srcset=\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-noai.png 611w, https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-noai-300x89.png 300w\" sizes=\"(max-width: 611px) 100vw, 611px\" \/><\/li>\n<li aria-level=\"1\">Early log-file analyses show <b>very few consumer-facing LLM crawlers<\/b> (GPTBot, Google-Extended, PerplexityBot, ClaudeBot) requesting \/llms.txt at any scale.<\/li>\n<\/ul>\n<p>From that vantage point:<\/p>\n<ul>\n<li aria-level=\"1\">There\u2019s <b>no evidence<\/b> that llms.txt improves rankings, <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/ai-overviews\/\">AI Overviews<\/a> presence, or traffic.<\/li>\n<li aria-level=\"1\">No major LLM vendor has said \u201cwe treat this as a signal.\u201d<\/li>\n<\/ul>\n<h4>My SEO take:<\/h4>\n<p>If you\u2019re still dealing with crawl issues, thin content, weak website infrastructure, or fragile <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/technical-seo\/\">technical SEO<\/a>, this camp is right: llms.txt does not belong anywhere near the top of your backlog.<\/p>\n<h3>4.2. The pragmatic futurists: \u201cCheap insurance\u201d.<\/h3>\n<p>This group agrees the impact is unproven, but looks at the cost differently:<\/p>\n<ul>\n<li aria-level=\"1\">A simple, 5-15 URL llms.txt file takes <b>under an hour<\/b> to draft.<\/li>\n<li aria-level=\"1\">Updating it a couple of times a year is negligible compared with most content or dev projects.<\/li>\n<li aria-level=\"1\">It\u2019s essentially an <b>option<\/b>: low downside, potential upside if\/when adoption grows.<\/li>\n<\/ul>\n<p>They also value the internal exercise:<\/p>\n<ul>\n<li aria-level=\"1\">To write a good llms.txt, you must agree on your <b>canonical, source-of-truth pages<\/b>.<\/li>\n<li aria-level=\"1\">That often surfaces outdated docs, overlapping content, or internal misalignment (which are issues you should fix anyway).<\/li>\n<\/ul>\n<h4>My SEO take:<\/h4>\n<p>If your fundamentals are in good shape and you already care about AI visibility and Generative Engine Optimization, I\u2019m aligned with this camp.<\/p>\n<p>Think of llms.txt as <b>\u201cfuture insurance + content clarity,\u201d<\/b> not as a lever you report on quarterly.<\/p>\n<h3>4.3. Docs &amp; agent champions: \u201cIt\u2019s useful right now\u201d.<\/h3>\n<p>In developer ecosystems, the discussion is much less theoretical.<\/p>\n<ul>\n<li aria-level=\"1\">Docs platforms generate llms.txt, llms-full, and .md exports out of the box to help <b>coding agents and AI tools<\/b> ingest docs.<\/li>\n<li aria-level=\"1\">Anthropic and others highlight LLM-friendly Markdown docs as a best practice for tools and agents to consume.<\/li>\n<li aria-level=\"1\">Benchmarks from teams working on code assistants show that AI agents guided by llms.txt-structured docs often <b>outperform<\/b> those that rely solely on semantic search across unstructured HTML. This is another proof that <a href=\"https:\/\/www.similarweb.com\/blog\/daas\/data-basics\/ai-agents-crash\/\">AI agents are only as good as the data integrated with them<\/a>.<\/li>\n<\/ul>\n<p>Here, the upside is tangible:<\/p>\n<ul>\n<li aria-level=\"1\">Better agent reasoning over your docs.<\/li>\n<li aria-level=\"1\">Lower token usage and cost when <a href=\"https:\/\/www.similarweb.com\/corp\/ai\/mcp\/\">AI agents<\/a> fetch Markdown instead of full HTML.<\/li>\n<li aria-level=\"1\">Fewer AI-related support tickets because \u201cthe AI\u201d is finally reading the right docs.<\/li>\n<\/ul>\n<h4>My SEO take:<\/h4>\n<p>If your product is <b>developer-first<\/b> or heavily <b>API-based<\/b>, I\u2019d treat llms.txt and its ecosystem as a <b>DX\/docs requirement<\/b>, not an SEO experiment.<\/p>\n<p>The value shows up in developer adoption and retention, even if it never shows up in \u201corganic sessions\u201d reports.<\/p>\n<h3>4.4. AI SEO\/GEO enthusiasts: \u201cBe the answer\u201d.<\/h3>\n<p>This camp is focused on the bigger shift from clicks to <b>answers<\/b>.<\/p>\n<ul>\n<li aria-level=\"1\">Similarweb\u2019s GenAI research shows AI chatbot traffic growing fast: in June 2025, major AI platforms generated <b>over 1.1 billion referral visits<\/b>, up 357% year-over-year.<\/li>\n<li aria-level=\"1\">At the same time, Similarweb\u2019s reports on publishers show substantial <b>traffic drops for many news sites<\/b> as Google\u2019s AI Overviews satisfy more queries without clicks.<\/li>\n<\/ul>\n<p>In that world, GEO (Generative Engine Optimization) is about:<\/p>\n<ul>\n<li aria-level=\"1\">Being <b>selected and cited<\/b> inside AI answers, not just ranking in classic SERPs.<\/li>\n<li aria-level=\"1\">Making sure generative AI engines describe your brand accurately when they answer in your space.<\/li>\n<\/ul>\n<p>For this group, llms.txt is one more <b>supporting signal<\/b>:<\/p>\n<ul>\n<li aria-level=\"1\">It doesn\u2019t replace schema, <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/internal-links\/\">internal links<\/a>, or entity work.<\/li>\n<li aria-level=\"1\">It does give you a clean way to say, \u201cThese are our authoritative answers.\u201d<\/li>\n<\/ul>\n<h4>My SEO take:<\/h4>\n<p>Strategically, I agree with the GEO direction. Tactically, llms.txt is <b>a small, aligned tactic<\/b>, not the core of your AI optimization strategy.<\/p>\n<p>Strong, straightforward, authoritative content still does most of the heavy lifting.<\/p>\n<h2>5. What the data says (so far)<\/h2>\n<p>Strip out the hype, and you get a reasonably consistent picture.<\/p>\n<h3>5.1. Adoption is growing, but still niche<\/h3>\n<ul>\n<li aria-level=\"1\">The llms.txt spec has stable documentation and a growing tool ecosystem (CLI tools, plugins for popular doc generators, CMS integrations).<\/li>\n<li aria-level=\"1\">Adoption is heavily concentrated in <b>developer tools, SaaS docs, AI-aware agencies, and early GEO experiments<\/b>.<\/li>\n<\/ul>\n<p>Relative to the entire web, it\u2019s still an early adopter pattern, not a mainstream standard.<\/p>\n<h3>5.2. For classic SEO and AI Overviews, llms.txt is neutral today<\/h3>\n<p>Across public statements and independent experiments:<\/p>\n<ul>\n<li aria-level=\"1\">There is <b>no evidence<\/b> that llms.txt:\n<ul>\n<li aria-level=\"2\">Improves organic rankings<\/li>\n<li aria-level=\"2\">Increases AI Overview inclusion<\/li>\n<li aria-level=\"2\">Moves traditional SEO KPIs in a repeatable way<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>Today, classic ranking systems still respond to technical health, relevance, authority, and user signals. Not to llms.txt.<\/p>\n<h3>5.3. For agents, tools, and AI-native docs, it already has jobs<\/h3>\n<p>On the other hand:<\/p>\n<ul>\n<li aria-level=\"1\">Tool builders and docs platforms are already using llms.txt and llms-full.txt as <b>ingestion endpoints<\/b> for LLM-based tools and <a href=\"https:\/\/www.similarweb.com\/corp\/ai\/mcp\/\">MCP servers<\/a>.<\/li>\n<li aria-level=\"1\">That doesn\u2019t register as \u201cmore organic traffic,\u201d but it does show up as:\n<ul>\n<li aria-level=\"2\">Better agent-driven onboarding<\/li>\n<li aria-level=\"2\">More accurate AI-generated examples<\/li>\n<li aria-level=\"2\">Less friction when developers use AI assistants<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>If your product\u2019s success depends on developers understanding your docs via AI tools, this matters.<\/p>\n<h2>6. Should you implement llms.txt? A practical framework<\/h2>\n<p>With so many pros and cons to llms.txt, SEOs need to weigh the benefits it can bring to their site vs. the potential time waste of a 0-impact project. We don\u2019t always have all the resources we need to carry out SEO tasks, so why add tasks that don\u2019t result in more traffic or revenue for us?<\/p>\n<p>Let\u2019s get concrete, here\u2019s how I\u2019d prioritize it:<\/p>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-207411\" src=\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-implementation-priorities.png\" alt=\"Llms.txt implementation priorities\" width=\"1200\" height=\"628\" srcset=\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-implementation-priorities.png 1200w, https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-implementation-priorities-300x157.png 300w, https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-implementation-priorities-1024x536.png 1024w, https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt-implementation-priorities-768x402.png 768w\" sizes=\"(max-width: 1200px) 100vw, 1200px\" \/><\/p>\n<h3>6.1. High priority: dev platforms and docs-heavy products<\/h3>\n<p>You should <b>strongly consider<\/b> llms.txt (plus llms-full and .md docs) if:<\/p>\n<ul>\n<li aria-level=\"1\">You\u2019re API-first or developer-first.<\/li>\n<li aria-level=\"1\">You maintain a substantial public docs site.<\/li>\n<li aria-level=\"1\">Your users already rely on:\n<ul>\n<li aria-level=\"2\">IDE assistants<\/li>\n<li aria-level=\"2\">Embedded AI in docs<\/li>\n<li aria-level=\"2\">Agents that fetch docs via HTTP<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>In that context:<\/p>\n<ul>\n<li aria-level=\"1\">llms.txt is part of building <b>AI-native documentation<\/b>, not a speculative traffic play.<\/li>\n<li aria-level=\"1\">It\u2019s aligned with how LLM-aware dev tooling is evolving.<\/li>\n<\/ul>\n<h4>My SEO take:<\/h4>\n<p>Own this as a <b>product\/docs initiative<\/b>, with the SEO team as a stakeholder.<\/p>\n<p>Measure success via developer outcomes (time-to-first-success, reduced support burden), not ranking charts.<\/p>\n<h3>6.2. Medium priority: mature, content-rich sites exploring GEO<\/h3>\n<p>You should <b>consider a lightweight llms.txt<\/b> if:<\/p>\n<ul>\n<li aria-level=\"1\">Your technical SEO and content fundamentals are solid.<\/li>\n<li aria-level=\"1\">You have clear \u201c<a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/content-marketing\/pillar-page\/\">pillar<\/a>\u201d content and stable product docs.<\/li>\n<li aria-level=\"1\">You\u2019re actively tracking AI behavior:\n<ul>\n<li aria-level=\"2\">Using Similarweb\u2019s <b>AI traffic tracker<\/b> ( in the <a href=\"https:\/\/www.similarweb.com\/corp\/search\/gen-ai-intelligence\/\"><b>AI Search Intelligence<\/b><\/a> suite) to see which chatbots send traffic to which URLs.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>Here, a small llms.txt is:<\/p>\n<ul>\n<li aria-level=\"1\">A <b>low-cost experiment<\/b>.<\/li>\n<li aria-level=\"1\">A forcing function to clarify your <b>5-15 actual source-of-truth pages<\/b>.<\/li>\n<li aria-level=\"1\">A nice complement to your <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/aeo-vs-geo\/\">GEO\/AEO<\/a> initiatives.<\/li>\n<\/ul>\n<h4>My SEO take:<\/h4>\n<p>Invest ~45-60 minutes to create a curated llms.txt, then review it a few times a year alongside your normal content audits. Don\u2019t sell it internally as a \u201cgrowth lever\u201d, present it as <b>readiness + modeling clarity<\/b>.<\/p>\n<h3>6.3. Low priority: sites still fighting the basics<\/h3>\n<p>You should <b>not prioritize<\/b> llms.txt yet if:<\/p>\n<ul>\n<li aria-level=\"1\">You have unresolved crawlability, indexability, or speed issues.<\/li>\n<li aria-level=\"1\">Your content is thin, outdated, or poorly structured.<\/li>\n<li aria-level=\"1\">Your <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/site-architecture-for-seo\/\">website structure<\/a> and internal links make it hard for humans and search bots to navigate.<\/li>\n<\/ul>\n<p>In that world:<\/p>\n<ul>\n<li aria-level=\"1\">Fixing fundamentals will significantly move your metrics.<\/li>\n<li aria-level=\"1\">llms.txt almost certainly won\u2019t, at least not in a measurable way.<\/li>\n<\/ul>\n<h4>My SEO take:<\/h4>\n<p>Keep llms.txt in your <b>20% \u201cexperimental\u201d bucket<\/b> for later. Get the 80% core SEO work stable first.<\/p>\n<h2>7. How to create a proper llms.txt (without overdoing it)<\/h2>\n<p>If you\u2019ve decided to test llms.txt, here\u2019s a pragmatic way to do it.<\/p>\n<h3>7.1. Step 1: Decide scope and owner<\/h3>\n<p>First, define <b>what<\/b> the file describes:<\/p>\n<ul>\n<li aria-level=\"1\">Entire site<\/li>\n<li aria-level=\"1\">Just docs<\/li>\n<li aria-level=\"1\">Just one product or subdomain<\/li>\n<\/ul>\n<p>For most SaaS and dev companies, starting with <b>docs only<\/b> is realistic and high impact.<\/p>\n<p>Then assign ownership:<\/p>\n<ul>\n<li aria-level=\"1\"><b>Content SEO<\/b>: curate URLs and descriptions.<\/li>\n<li aria-level=\"1\"><b>Engineering\/DevOps<\/b>: deploy the file to \/llms.txt and add the X-Robots-Tag: noindex header.<\/li>\n<\/ul>\n<p>If no one owns it, it will drift out of date (which is worse than not having it).<\/p>\n<h3>7.2. Step 2: Inventory your \u201cAI-worthy\u201d pages<\/h3>\n<p>This is where the real thinking happens. Ask yourself: If an AI could only look at 5-15 URLs, which ones would we trust to represent us?<\/p>\n<h4>Pages to include in llms.txt<\/h4>\n<ul>\n<li aria-level=\"1\">Pillar guides and onboarding hubs<\/li>\n<li aria-level=\"1\">API references and core SDK docs<\/li>\n<li aria-level=\"1\">Evergreen \u201cWhat is X?\u201d or \u201cHow to do Y with [Brand]\u201d explainers<\/li>\n<li aria-level=\"1\">High-value FAQs and troubleshooting hubs<\/li>\n<li aria-level=\"1\">Stable pricing and policy pages<\/li>\n<\/ul>\n<h4>Pages to exclude from llms.txt<\/h4>\n<ul>\n<li aria-level=\"1\">Thin campaign LPs or short-lived promos<\/li>\n<li aria-level=\"1\">Extremely salesy pages with little factual content<\/li>\n<li aria-level=\"1\">Purely navigational or legal boilerplate<\/li>\n<\/ul>\n<p>This exercise itself is valuable: it often exposes gaps and inconsistencies in your content strategy.<\/p>\n<h3>7.3. Step 3: Draft the file in Markdown (keep it factual and concise)<\/h3>\n<p>Using the spec, you might end up with something like:<\/p>\n<p><span style=\"font-size: 10pt;\"><em><code># YourBrand<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>&gt; YourBrand is a [short, factual description: what you do, for whom, and in what use cases].<\/code><\/em><\/span><\/p>\n<p>This file provides a curated guide to our most important public resources for large language models and AI assistants.<\/p>\n<p><span style=\"font-size: 10pt;\"><em><code>## Documentation<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>- [Getting started](https:\/\/example.com\/docs\/getting-started): Introductory guide for new users<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>- [API reference](https:\/\/example.com\/docs\/api): Endpoints, parameters, and usage examples<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>- [Authentication](https:\/\/example.com\/docs\/auth): How to authenticate and manage API keys<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>## Product<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>- [Product overview](https:\/\/example.com\/product): Features, plans, and core use cases<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>- [Pricing](https:\/\/example.com\/pricing): Current pricing tiers and billing details<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>## Support &amp; FAQ<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>- [FAQ](https:\/\/example.com\/faq): Answers to common setup and account questions<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>- [Status](https:\/\/status.example.com): Live service status and incident history<\/code><\/em><\/span><br \/>\n<span style=\"font-size: 10pt;\"><em><code>## Optional<\/code><\/em><\/span><br \/>\n<em><code><span style=\"font-size: 10pt;\">- [About](https:\/\/example.com\/about): Company background and team<\/span><\/code><\/em><\/p>\n<p><strong>Why this works:<\/strong><\/p>\n<ul>\n<li aria-level=\"1\">Link text like \u201cAPI reference\u201d or \u201cPricing\u201d makes intent obvious.<\/li>\n<li aria-level=\"1\">Short descriptions tell an LLM which URLs are relevant for which topic.<\/li>\n<li aria-level=\"1\">H2 sections mirror how models <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/content-chunking\/\">chunk and reason about related content<\/a>.<\/li>\n<\/ul>\n<p>Think of it as <b>internal linking + schema for AI<\/b>, written in Markdown.<\/p>\n<h3>7.4. Step 4: Deploy at \/llms.txt and control indexing<\/h3>\n<p>Implementation basics:<\/p>\n<ul>\n<li aria-level=\"1\">Serve the file at https:\/\/yourdomain.com\/llms.txt.<\/li>\n<li aria-level=\"1\">If your CMS can\u2019t place it at the root folder, use a clean <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/301-redirect\/\">301 redirect<\/a> from \/llms.txt to its actual location.<\/li>\n<li aria-level=\"1\">Open it in a browser to verify it renders as plain text.<\/li>\n<\/ul>\n<p>If you don\u2019t want the file itself to appear in search results, configure your server to send an X-Robots-Tag: noindex header for that path.<\/p>\n<p>You want AI engines and tools to <b>easily find the file<\/b>. You don\u2019t need it cluttering up SERPs.<\/p>\n<h3>7.5. Step 5: Set a light maintenance rhythm<\/h3>\n<p>llms.txt should evolve with your site:<\/p>\n<ul>\n<li aria-level=\"1\">Review when you launch a major new product or docs area.<\/li>\n<li aria-level=\"1\">Update when you deprecate or rewrite <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/canonical-tag-tips\/\">canonical pages<\/a>.<\/li>\n<li aria-level=\"1\">Do a quick quarterly check:\n<ul>\n<li aria-level=\"2\">Are all URLs live?<\/li>\n<li aria-level=\"2\">Do descriptions still match reality?<\/li>\n<li aria-level=\"2\">Are we missing obvious \u201csources of truth\u201d?<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>If you can\u2019t commit to basic maintenance, it\u2019s better to wait than to ship stale guidance.<\/p>\n<h2>8. Monitoring and learning from llms.txt<\/h2>\n<p>You won\u2019t (yet) see a neat \u201cllms.txt \u2192 traffic spike\u201d pattern in your dashboards, but you can use data to learn whether it\u2019s being touched and whether AI visibility is changing.<\/p>\n<h3>8.1. What to monitor<\/h3>\n<h4>1. Server and log-file data<\/h4>\n<ul>\n<li aria-level=\"1\">Check logs for hits to \/llms.txt and \/llms-full.txt by AI or agent user agents.<\/li>\n<li aria-level=\"1\">Over time, this tells you whether any engines, tools, or agents start relying on the file.<\/li>\n<\/ul>\n<p>If you want to do this at scale, you don\u2019t have to write your own log parser. <a href=\"https:\/\/www.similarweb.com\/corp\/search\/site-audit\/\"><b>Similarweb\u2019s Site Audit tool<\/b><\/a> integrates log-file summary data from log analyzers like Logz.io and other tools, so you can overlay bot behavior with crawl and technical insights instead of treating logs as a separate, one-off project.<\/p>\n<h4>2. AI chatbot traffic<\/h4>\n<ul>\n<li aria-level=\"1\">Use <b>Similarweb\u2019s <\/b><a href=\"https:\/\/www.similarweb.com\/corp\/search\/gen-ai-intelligence\/ai-chatbot-traffic\/\"><b>AI Traffic tool<\/b><\/a> to see:\n<ul>\n<li aria-level=\"2\">Which AI chatbots (ChatGPT, Gemini, Perplexity, etc.) send traffic to your site<\/li>\n<li aria-level=\"2\">Which pages receive AI-originated visits<\/li>\n<\/ul>\n<\/li>\n<li aria-level=\"1\">That helps you understand whether the URLs you surfaced in llms.txt are actually part of AI-driven sessions.<\/li>\n<\/ul>\n<h4>3. Developer and support feedback (for dev products)<\/h4>\n<ul>\n<li aria-level=\"1\">Track whether AI-assisted onboarding feels more accurate or requires fewer escalations after you rework docs for LLM legibility and expose them via llms.txt.<\/li>\n<\/ul>\n<h3>8.2. What not to expect (for now)<\/h3>\n<p>Be realistic:<\/p>\n<ul>\n<li aria-level=\"1\">Don\u2019t expect immediate ranking lifts, AI Overview inclusion boosts, or a clean \u201cbefore\/after llms.txt\u201d traffic graph.<\/li>\n<li aria-level=\"1\">Any impacts will likely be <b>indirect<\/b>, via better AI understanding and behavior, not because llms.txt has become a first-class ranking signal.<\/li>\n<\/ul>\n<p>That\u2019s why I recommend positioning llms.txt internally as a <b>forward-looking optimization and clarity tool<\/b>, not a primary SEO KPI lever.<\/p>\n<h2>9. Common llms.txt mistakes to avoid<\/h2>\n<p>If you do implement llms.txt, avoid these traps:<\/p>\n<ol>\n<li aria-level=\"1\"><b>Treating it as a ranking factor<\/b><br \/>\nThere\u2019s no evidence that it impacts rankings or AI Overviews today. Don\u2019t oversell it.<\/li>\n<li aria-level=\"1\"><b>Turning it into a mini-sitemap<\/b><br \/>\nDumping dozens or hundreds of URLs defeats the point. The value lies in <b>selective curation<\/b>.<\/li>\n<li aria-level=\"1\"><b>Letting it go stale<\/b><br \/>\nIf your canonical URLs change and llms.txt still points at old pages, you\u2019re undermining your own intent.<\/li>\n<li aria-level=\"1\"><b>Using it instead of robots.txt for control<\/b><br \/>\nLlms.txt doesn\u2019t block crawling or training. Use robots.txt and meta directives for that.<\/li>\n<li aria-level=\"1\"><b>Ignoring content quality<\/b><br \/>\nA beautifully structured llms.txt that points to vague, shallow, or confusing content does nothing. AI systems still prefer strong, well-structured content, just like users.<\/li>\n<\/ol>\n<p>Treat llms.txt like early <a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/schema-markup\/\">schema markup<\/a> or early XML sitemaps: worth testing <b>after<\/b> the fundamentals are in place, not instead of them.<\/p>\n<h2>10. The bigger picture: AI visibility beyond llms.txt<\/h2>\n<p>llms.txt is interesting because it sits exactly where:<\/p>\n<ul>\n<li aria-level=\"1\"><b>User behavior<\/b> is shifting from clicking links to getting <b>direct answers<\/b> in AI engines.<\/li>\n<li aria-level=\"1\"><b>Machines<\/b> increasingly need clean, structured, high-signal content to make those answers accurate.<\/li>\n<\/ul>\n<p>Whether llms.txt becomes widely adopted or not, the direction is clear:<\/p>\n<ul>\n<li aria-level=\"1\">AI engines reward content that is:\n<ul>\n<li aria-level=\"2\">Well-structured and chunked<\/li>\n<li aria-level=\"2\">Factually strong and current<\/li>\n<li aria-level=\"2\">Easy to interpret without relying on heavy layout or client-side scripts<\/li>\n<\/ul>\n<\/li>\n<li aria-level=\"1\">Search behavior is moving toward:\n<ul>\n<li aria-level=\"2\">AI chatbots and GenAI experiences<\/li>\n<li aria-level=\"2\">\u201c<a href=\"https:\/\/www.similarweb.com\/blog\/marketing\/seo\/zero-click-searches\/\">Zero-click<\/a>\u201d answers where the AI becomes the primary interface<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>In this \u201cnew world\u201d, the SEO job shifts from \u201cget us ranked\u201d to \u201cmake sure we are the trusted, quoted source when AI systems answer questions in our space\u201d.<\/p>\n<p>That\u2019s essentially Generative Engine Optimization (GEO): structuring clear, factual, self-contained content so answer engines choose you when they assemble responses.<\/p>\n<p>llms.txt can support that, but only as a thin layer on top of a solid content strategy, strong entities and structured data, clean technical foundations, and a real view of how AI-driven traffic already behaves (which you can track with <a href=\"https:\/\/www.similarweb.com\/corp\/search\/gen-ai-intelligence\/ai-brand-visibility\/\">Similarweb\u2019s GenAI visibility<\/a> and AI Chatbot Traffic tools).<\/p>\n<p>In practice, llms.txt makes the most sense for dev and docs-heavy products, is a low-cost experiment for mature sites, and should sit behind core SEO work for everyone else.<\/p>\n<p>Don\u2019t build your AI optimization strategy around llms.txt. Build it around <b>clarity, authority, and structure<\/b>, then use llms.txt as one more small, aligned step in that direction.<\/p>\n    <div class=\"post-banner post-banner--base\">\n        <div class=\"post-banner__wrapper\">\n            <div class=\"post-banner__text\">\n                                    <p class=\"post-banner__title\">Boost Your AI Visibility<\/p>\n                                    <p class=\"post-banner__subtitle\">Increase your visibility vs. competitors with ease.<\/p>\n                                <div class=\"post-banner__button-wrapper\">\n                                            <a class=\"swui-button swui-button--solid swui-button--primary post-banner__button js-post-banner\"\n                           href=\"https:\/\/account.similarweb.com\/journey\/registration\"\n                           data-disable-dynamic-tracking\n                        >Try Similarweb Now<\/a>\n                                    <\/div>\n            <\/div>\n                    <\/div>\n    <\/div>\n\n<h2>FAQs<\/h2>\n<p><b> What is llms.txt in simple terms?<\/b><\/p>\n<p>Llms.txt is a Markdown file at your domain root that lists a small set of your most important pages for AI systems, each with a short description. It\u2019s meant to guide LLMs and agents toward your \u201csource of truth\u201d content when they answer questions about your brand.<\/p>\n<p><b> Does llms.txt improve SEO rankings or AI Overviews today?<\/b><\/p>\n<p>No. Right now, llms.txt is not a ranking factor and doesn\u2019t directly influence AI Overviews. It\u2019s best treated as future-facing documentation and agent support, not as a way to boost <a href=\"https:\/\/www.similarweb.com\/website\/\">website traffic<\/a>.<\/p>\n<p><b> Do ChatGPT, Gemini, or Claude actually use llms.txt?<\/b><\/p>\n<p>Publicly, major consumer assistants have not confirmed using llms.txt as a standard input for answers. However, some documentation platforms and agent frameworks already rely on llms.txt, llms-full.txt, and Markdown docs to power dev tools and coding assistants, so it\u2019s gaining traction in those ecosystems.<\/p>\n<p><b> How is llms.txt different from robots.txt?<\/b><\/p>\n<p>Robots.txt controls where crawlers can go on a website. Llms.txt doesn\u2019t block anything, but simply highlights a curated set of URLs and explains what they\u2019re for, helping AI systems prioritize the right content.<\/p>\n<p><b> Is llms.txt just a sitemap for AI?<\/b><\/p>\n<p>Not exactly. A sitemap lists many or all indexable pages for discovery. llms.txt lists only a handful of high-value, canonical URLs and labels them with concise descriptions.<\/p>\n<p><b> Who should implement llms.txt first?<\/b><\/p>\n<p>Prioritize it if you\u2019re an API-first\/developer platform. Nice-to-have if you\u2019re a mature brand with solid SEO basics and you\u2019re already investing in GEO\/AEO. Skip for now if you still have core SEO issues.<\/p>\n<p><b> How does llms.txt relate to Generative Engine Optimization (GEO)?<\/b><\/p>\n<p>GEO is about being selected and cited inside AI answers, not just ranking in SERPs. llms.txt supports GEO by giving AI systems a clear list of the pages you consider authoritative. However, it\u2019s a supporting tactic; strong, structured, trustworthy content is still the core of any GEO strategy.<\/p>\n<p><b> How can I see if AI chatbots are sending traffic to my site?<\/b><\/p>\n<p>You can use Similarweb\u2019s GenAI Intelligence and AI Chatbot Traffic capabilities to see which AI chatbots (like ChatGPT, Gemini, Perplexity, and others) refer traffic to your site and which pages they hit. That helps you connect llms.txt and AI legibility work to real-world behavior.<\/p>\n<p><b> How many URLs should I include in llms.txt?<\/b><\/p>\n<p>For most sites, 5-15 URLs is ideal. Focus on pages that clearly explain what you do, how to use your product, your pricing and policies, and your core docs or FAQs. If a page wouldn\u2019t make sense as a standalone \u201csource\u201d in an AI answer, it probably doesn\u2019t belong in llms.txt.<\/p>\n<p><b> Can llms.txt hurt my SEO if I get it wrong?<\/b><\/p>\n<p>Used correctly, llms.txt shouldn\u2019t hurt traditional SEO at all. It doesn\u2019t change how classic search crawlers index your site, and you can mark the file itself noindex. The real risk is strategic: if you point AI tools at outdated or non-canonical pages, you might reinforce the wrong messages. That\u2019s why light, ongoing maintenance is essential once you ship it.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In the early days of the web, search engines had a fundamental need: a standard way to determine where to crawl. That\u2019s why robots.txt became so important. One small text file at the root of a site provided crawlers with a predictable, machine-readable set of rules, and over time, it evolved into an unofficial but [&hellip;]<\/p>\n","protected":false},"author":267,"featured_media":207397,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[8793,2803,6345],"tags":[],"class_list":["post-207395","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-geo","category-marketing","category-seo"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>What Is Llms.txt? Reality vs. Hype | Similarweb<\/title>\n<meta name=\"description\" content=\"Learn what llms.txt really does, when it\u2019s worth implementing, and how to build it to support AI search, GEO, and LLM visibility, without wasting dev time.\" \/>\n<meta name=\"robots\" content=\"max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Limor Barenholtz\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"25 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/\"},\"author\":{\"name\":\"Limor Barenholtz\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/#\/schema\/person\/f5af6d4b51d6efd23587c423306e7130\"},\"headline\":\"What Is Llms.txt? Reality vs. Hype\",\"datePublished\":\"2025-12-10T12:04:35+00:00\",\"dateModified\":\"2026-03-01T13:24:42+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/\"},\"wordCount\":4634,\"publisher\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt.png\",\"articleSection\":[\"Generative Engine Optimization\",\"Marketing\",\"SEO\"],\"inLanguage\":\"\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/\",\"url\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/\",\"name\":\"What Is Llms.txt? Reality vs. Hype | Similarweb\",\"isPartOf\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt.png\",\"datePublished\":\"2025-12-10T12:04:35+00:00\",\"dateModified\":\"2026-03-01T13:24:42+00:00\",\"description\":\"Learn what llms.txt really does, when it\u2019s worth implementing, and how to build it to support AI search, GEO, and LLM visibility, without wasting dev time.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#breadcrumb\"},\"inLanguage\":\"\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#primaryimage\",\"url\":\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt.png\",\"contentUrl\":\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt.png\",\"width\":1536,\"height\":1024,\"caption\":\"Llms.txt? Hype vs. Reality\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.similarweb.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"What Is Llms.txt? Reality vs. Hype\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/#website\",\"url\":\"https:\/\/www.similarweb.com\/blog\/\",\"name\":\"Similarweb\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.similarweb.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/#organization\",\"name\":\"Similarweb\",\"url\":\"https:\/\/www.similarweb.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2021\/03\/1587374135933.png\",\"contentUrl\":\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2021\/03\/1587374135933.png\",\"width\":200,\"height\":200,\"caption\":\"Similarweb\"},\"image\":{\"@id\":\"https:\/\/www.similarweb.com\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/Similarweb\",\"https:\/\/x.com\/Similarweb\",\"https:\/\/www.youtube.com\/channel\/UCVCI01HR6iB4AA4ChW08cvQ\",\"https:\/\/www.instagram.com\/similarwebinsights\/\",\"https:\/\/www.linkedin.com\/company\/similarweb\",\"https:\/\/en.wikipedia.org\/wiki\/Similarweb\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/#\/schema\/person\/f5af6d4b51d6efd23587c423306e7130\",\"name\":\"Limor Barenholtz\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"\",\"@id\":\"https:\/\/www.similarweb.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2022\/04\/2021-06-19_21-17-02.png\",\"contentUrl\":\"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2022\/04\/2021-06-19_21-17-02.png\",\"caption\":\"Limor Barenholtz\"},\"description\":\"Limor is the SEO &amp; GEO Director at Similarweb, and has been practicing SEO for over 20 years. Limor specializes in all aspects of SEO. She loves the challenges traditional SEO and AI SEO present, clean codes makes her smile &amp; creating scalable SEO &amp; GEO strategies makes her happy. As the former Head of SEO at McCann Tel Aviv, Limor has vast experience in both local &amp; international SEO for major brands &amp; publishers, as well as in planning &amp; integration with multi-channel campaigns. Limor is a speaker at BrightonSEO, co-author of the \\\"In-house SEO Book\\\", and a judge at The Drum Awards &amp; Global Search Awards. When she's not doing SEO, Limor loves playing Poker and handball, listening to music, and being with her baby twins - Naomi &amp; Maayan.\",\"sameAs\":[\"https:\/\/www.similarweb.com\/\",\"https:\/\/www.instagram.com\/lemlem4\/\",\"https:\/\/www.linkedin.com\/in\/lem44\/\",\"https:\/\/x.com\/https:\/\/x.com\/lem4\",\"https:\/\/www.youtube.com\/user\/Similarweb\"],\"url\":\"https:\/\/www.similarweb.com\/blog\/author\/limor-barenholtz\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"What Is Llms.txt? Reality vs. Hype | Similarweb","description":"Learn what llms.txt really does, when it\u2019s worth implementing, and how to build it to support AI search, GEO, and LLM visibility, without wasting dev time.","robots":{"max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/","twitter_misc":{"Written by":"Limor Barenholtz","Est. reading time":"25 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#article","isPartOf":{"@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/"},"author":{"name":"Limor Barenholtz","@id":"https:\/\/www.similarweb.com\/blog\/#\/schema\/person\/f5af6d4b51d6efd23587c423306e7130"},"headline":"What Is Llms.txt? Reality vs. Hype","datePublished":"2025-12-10T12:04:35+00:00","dateModified":"2026-03-01T13:24:42+00:00","mainEntityOfPage":{"@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/"},"wordCount":4634,"publisher":{"@id":"https:\/\/www.similarweb.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#primaryimage"},"thumbnailUrl":"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt.png","articleSection":["Generative Engine Optimization","Marketing","SEO"],"inLanguage":""},{"@type":"WebPage","@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/","url":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/","name":"What Is Llms.txt? Reality vs. Hype | Similarweb","isPartOf":{"@id":"https:\/\/www.similarweb.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#primaryimage"},"image":{"@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#primaryimage"},"thumbnailUrl":"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt.png","datePublished":"2025-12-10T12:04:35+00:00","dateModified":"2026-03-01T13:24:42+00:00","description":"Learn what llms.txt really does, when it\u2019s worth implementing, and how to build it to support AI search, GEO, and LLM visibility, without wasting dev time.","breadcrumb":{"@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#breadcrumb"},"inLanguage":"","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/"]}]},{"@type":"ImageObject","inLanguage":"","@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#primaryimage","url":"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt.png","contentUrl":"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2025\/12\/attachment-llms-txt.png","width":1536,"height":1024,"caption":"Llms.txt? Hype vs. Reality"},{"@type":"BreadcrumbList","@id":"https:\/\/www.similarweb.com\/blog\/marketing\/geo\/llms-txt\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.similarweb.com\/"},{"@type":"ListItem","position":2,"name":"What Is Llms.txt? Reality vs. Hype"}]},{"@type":"WebSite","@id":"https:\/\/www.similarweb.com\/blog\/#website","url":"https:\/\/www.similarweb.com\/blog\/","name":"Similarweb","description":"","publisher":{"@id":"https:\/\/www.similarweb.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.similarweb.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":""},{"@type":"Organization","@id":"https:\/\/www.similarweb.com\/blog\/#organization","name":"Similarweb","url":"https:\/\/www.similarweb.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"","@id":"https:\/\/www.similarweb.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2021\/03\/1587374135933.png","contentUrl":"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2021\/03\/1587374135933.png","width":200,"height":200,"caption":"Similarweb"},"image":{"@id":"https:\/\/www.similarweb.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/Similarweb","https:\/\/x.com\/Similarweb","https:\/\/www.youtube.com\/channel\/UCVCI01HR6iB4AA4ChW08cvQ","https:\/\/www.instagram.com\/similarwebinsights\/","https:\/\/www.linkedin.com\/company\/similarweb","https:\/\/en.wikipedia.org\/wiki\/Similarweb"]},{"@type":"Person","@id":"https:\/\/www.similarweb.com\/blog\/#\/schema\/person\/f5af6d4b51d6efd23587c423306e7130","name":"Limor Barenholtz","image":{"@type":"ImageObject","inLanguage":"","@id":"https:\/\/www.similarweb.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2022\/04\/2021-06-19_21-17-02.png","contentUrl":"https:\/\/www.similarweb.com\/blog\/wp-content\/uploads\/2022\/04\/2021-06-19_21-17-02.png","caption":"Limor Barenholtz"},"description":"Limor is the SEO &amp; GEO Director at Similarweb, and has been practicing SEO for over 20 years. Limor specializes in all aspects of SEO. She loves the challenges traditional SEO and AI SEO present, clean codes makes her smile &amp; creating scalable SEO &amp; GEO strategies makes her happy. As the former Head of SEO at McCann Tel Aviv, Limor has vast experience in both local &amp; international SEO for major brands &amp; publishers, as well as in planning &amp; integration with multi-channel campaigns. Limor is a speaker at BrightonSEO, co-author of the \"In-house SEO Book\", and a judge at The Drum Awards &amp; Global Search Awards. When she's not doing SEO, Limor loves playing Poker and handball, listening to music, and being with her baby twins - Naomi &amp; Maayan.","sameAs":["https:\/\/www.similarweb.com\/","https:\/\/www.instagram.com\/lemlem4\/","https:\/\/www.linkedin.com\/in\/lem44\/","https:\/\/x.com\/https:\/\/x.com\/lem4","https:\/\/www.youtube.com\/user\/Similarweb"],"url":"https:\/\/www.similarweb.com\/blog\/author\/limor-barenholtz\/"}]}},"lang":"en","translations":{"en":207395},"pll_sync_post":[],"_links":{"self":[{"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/posts\/207395","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/users\/267"}],"replies":[{"embeddable":true,"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/comments?post=207395"}],"version-history":[{"count":3,"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/posts\/207395\/revisions"}],"predecessor-version":[{"id":208661,"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/posts\/207395\/revisions\/208661"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/media\/207397"}],"wp:attachment":[{"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/media?parent=207395"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/categories?post=207395"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.similarweb.com\/blog\/wp-json\/wp\/v2\/tags?post=207395"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}