Skip to content

Instantly share code, notes, and snippets.

@henu-wang
Created March 31, 2026 10:19
Show Gist options
  • Select an option

  • Save henu-wang/84044e9863486d656637f7c270446681 to your computer and use it in GitHub Desktop.

Select an option

Save henu-wang/84044e9863486d656637f7c270446681 to your computer and use it in GitHub Desktop.
Why your website doesn't show up in ChatGPT, Perplexity, or Claude — and how to fix it in 30 minutes

Why Your Website Doesn't Show Up in ChatGPT (And How to Fix It)

If you've searched for your brand or industry on ChatGPT, Perplexity, or Claude and your website never appears in the answers — you're not alone. I've scanned 1,000+ websites and found that most sites fail at least one critical AI search readiness check.

Here's the diagnostic checklist I use:

Step 1: Check if AI crawlers can access your site

# Check your robots.txt for AI crawler blocks
curl -s https://yoursite.com/robots.txt | grep -i -E "gptbot|claudebot|perplexitybot|google-extended|bytespider"

If you see Disallow: / for any of these, that's your problem. ~30% of websites block AI crawlers without knowing it.

Fix: Add explicit Allow directives

# Add to your robots.txt
User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: Google-Extended
Allow: /

Step 2: Check if your content is server-rendered

# See what AI crawlers actually see (no JavaScript execution)
curl -s https://yoursite.com | head -100

If your main content isn't in the raw HTML output, AI crawlers probably see a blank page. You need server-side rendering (SSR).

Step 3: Check if you have llms.txt

curl -s -o /dev/null -w "%{http_code}" https://yoursite.com/llms.txt

If this returns 404, you're missing an easy win. llms.txt tells AI systems what your site is about and where to find key content.

Quick llms.txt template

# Your Site Name

> One sentence describing what your site does.

## Key Pages
- [Product](https://yoursite.com/product): What you offer
- [Pricing](https://yoursite.com/pricing): Plans and pricing
- [Docs](https://yoursite.com/docs): Documentation and guides
- [Blog](https://yoursite.com/blog): Latest articles and insights

Step 4: Check structured data

# Check for JSON-LD structured data
curl -s https://yoursite.com | grep -c "application/ld+json"

If the count is 0, you have no structured data. AI search engines use this to understand your content type (article, FAQ, how-to, product, etc.).

Step 5: Automated full audit

Instead of checking each signal manually, you can run a free scan at GEOScore AI — it checks all 9 AI search readiness signals and gives you a 0-100 score with specific fix recommendations. Takes ~60 seconds, no signup.

Common scores I see

Site Type Typical Score Main Issues
WordPress blogs 40-55 Missing llms.txt, weak structured data
React/Next.js SPAs 20-40 Client-side rendering, no SSR
Shopify stores 35-50 Limited robots.txt control, thin content
Static sites (Hugo, Jekyll) 50-70 Usually good! Fast, server-rendered, clean HTML
Enterprise sites 25-45 Complex JS, aggressive security headers

Created by William Wang, founder of GEOScore AI

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment