Findings
Reproducible observations from the lab.
Finding 001: AI bot identification requires multi-signal evidence
Baseline method for combining user agent, network, resource, JavaScript, and timing signals.
Finding 002: OpenRouter claims need direct lab evidence
OpenRouter responses can claim retrieval without matching origin hits, so direct lab evidence is the stronger source.
Finding 003: per-model OpenRouter probe attribution
Per-model URLs and structured test metadata make future origin hits attributable to a specific model attempt.
Finding 004: OpenClaw web fetch reading baseline
A lightweight fetch-and-read surface sees server-rendered text, but not JavaScript-rendered content or PNG image text.
Finding 005: local HTTP directive compliance baseline
A policy-naive local HTTP client fetches all directive fixtures while preserving robots.txt and robots meta evidence for later comparisons.
Finding 006: ChatGPT manual dual-stack reading baseline
ChatGPT needed standard dual-stack HTTPS reachability, then behaved like an HTML retriever: one page request per prompt, no JavaScript, no image fetch, no hidden href exposure, and no depth-link follow.