Kaistone.ai Research

AI Browser Capability Comparison

How do major AI assistants retrieve, read, and interact with web pages? Based on 95 controlled-browser tests with direct-origin server evidence — not just what the models claim.

📊 Last updated: 2026-06-28 · 5 AI clients · 19 test prompts each · Read the cross-client finding

95

Total Tests

5

AI Clients

19

Prompts Each

55

Confirmed Hits

Tested AI Clients

Claude

Anthropic · claude.ai

Hit rate 19/19 (100%)

User-agent Claude-User/1.0

Retrieval Direct fetch (web_fetch)

Tier tested Free (incognito)

Prompt framing AEO required

Gemini

Google · gemini.google.com

Hit rate 19/19 (100%)

User-agent Google

Retrieval Search index dep.

Tier tested Free

Prompt framing Direct

ChatGPT

OpenAI · chatgpt.com

Hit rate 17/19 (89%)

User-agent ChatGPT-User/1.0

Retrieval Direct fetch

Tier tested Free

Prompt framing Direct

Perplexity

Perplexity AI · perplexity.ai

Hit rate 0/19 (0%)

User-agent — (no hits)

Retrieval None observed

Tier tested Free (incognito)

Prompt framing Direct

Copilot / Bing

Microsoft · copilot.microsoft.com

Hit rate 0/19 (0%)

User-agent — (no hits)

Retrieval Search-index-gated

Tier tested Free (temp. chat)

Prompt framing Direct

Legend

✓ Confirmed — direct-origin evidence

◐ Partial — works with caveats

✕ Not observed — no evidence

— Not tested for this client

Capability Matrix

Each cell is backed by controlled lab tests with direct-origin server evidence.

Capability	Claude	Gemini	ChatGPT	Perplexity	Copilot
Fetches target URL	✓	✓	◐ 89%	✕	✕
Reads visible HTML text	✓	✓	✓	—	—
Reads JS-rendered content	✕	✕	✕	—	—
Reads image alt text	✓	✓	✓	—	—
Reads image pixels	✕	✕	✕	—	—
Follows visible links (depth-1)	✓	◐ claimed	✕	—	—
Exposes hidden/comment hrefs	✕	✕	✕	—	—
Fetches subresources (CSS, JS, fonts)	✕	✕	✕	—	—
Executes JavaScript	✕	✕	✕	—	—
Loads tracking pixels	✕	✕	✕	—	—
Respects robots.txt	✕ fetched	✕ fetched	✕ fetched	—	—
Respects meta noindex	✕ fetched	✕ fetched	✕ fetched	—	—
Reads consent banners	✓	✓	✓	—	—
Interacts with consent	✕	✕	✕	—	—
Finds sitemap-only pages	✓	✓	✓	—	—
Finds robots-only pages	✓	✓	✓	—	—

"Fetched" means the AI retrieved the page despite the directive — none of the tested clients respected robots.txt or meta noindex. "— (not tested)" means the client never successfully fetched any URL, so downstream capabilities could not be measured.

Key Observations

Two-tier retrieval split

Claude, Gemini, and ChatGPT reliably fetch target URLs. Perplexity and Copilot/Bing reliably cannot or do not. This split was consistent across all 95 tests.

HTML-only retrieval

Even when AI clients fetch successfully, none execute JavaScript, load tracking pixels, fetch subresources, or perform browser-equivalent rendering. Retrieval is page-text/HTML only.

No directive compliance

No AI client respected or referenced robots.txt or meta noindex directives when fetching target URLs. Claude and Gemini fetched noindex pages without acknowledging the directive.

Gemini's search index dependency

Gemini depends on Google Search index availability rather than direct URL fetching. Pages not in the index return NOT_IN_SEARCH_INDEX errors, even for robots-allowed URLs.

Claude's prompt framing requirement

Claude refused measurement-framed prompts but successfully fetched the same URLs when reframed as site-owner AEO/readability work. This is a prompt-framing dependency, not a retrieval limitation.

ChatGPT guardrail limitations

ChatGPT's 2/19 no-hits were a URL-safety guardrail (p08) and a fetch-depth limitation (p17) — not systematic retrieval failures. All other 17 tests produced confirmed hits.

Methodology & Limitations

Method

Each test was run from a prepared browser-task artifact in a fresh AI-client chat. The lab server independently logged all incoming requests with full headers, timing, IP, DNS, and user-agent. After each run, model answers were correlated with direct-origin events by prompt code, source prompt ID, and bounded timestamp windows.

Limitations

• Single lab origin (ai-crawler-lab.kaistone.ai) — behavior may differ for larger sites
• One account per client; different account states could produce different results
• Perplexity and Copilot/Bing tested on free/basic tiers; paid tiers might behave differently
• Tests run 2026-06-26 through 2026-06-28; AI products update frequently
• Claude required AEO/readability prompt framing; measurement framing was refused

Read Finding 013 Read Finding 007 Read Finding 006 Read Finding 014 Read Finding 015 Read Finding 016 Read Finding 017 Read Finding 018 Research Methodology