Web index

How AI-readable
is the open web?

Lyrenth crawls and audits the open web for the signals that matter to AI agents reading it: structured-data coverage, render-mode mix, heading hygiene, content density, and access friction. The counters below are live; the seven signal aggregates recompute about every 10 minutes.

Indexed documents

1,010,611,358

Total pages currently in the canonical index

Audited pages

1,009,133,198

99.9% of indexed pages audited

Avg AI Readiness

5.6/ 10

Weighted mean across seven content signals

Per-signal coverage

What the audit measures

Each audited page is scored on seven content-derived signals. Below is the corpus-wide mean for each one: how the open web performs against AI-agent-friendly hygiene.

Structured data20% weight20%

Whether the page carries JSON-LD blocks (Article, Product, FAQPage…). Structured data lets agents extract facts without parsing prose.

Heading hygiene15% weight70%

Single H1, monotonic descent through H2 / H3. Predictable structure makes a page easier to skim and section.

Static renderability20% weight50%

Share of pages served without a headless-Chromium escalation. Static-renderable pages cost less and never arrive empty to first-pass scrapers.

Content density20% weight55%

Ratio of meaningful markdown to raw HTML. High density means most of the page is content, not nav / chrome / ads.

Title & description10% weight67%

Non-empty, sensible-length title and description that are not generic placeholders.

Open access10% weight100.0%

Share of indexed pages that reach readers (and agents) without paywall or login-wall markers. Paywalled pages are indexed and scored too, so this is a measured rate across the corpus, not a definition.

Content depth5% weight82%

Whether the page has enough words to be substantive on its own. Sub-stub pages get partial credit; pages with no body fail outright.

How to read this

What these numbers mean

For AI agent builders

A high site-wide structured-data percentage means agents can extract facts cheaply. Low static-renderability or density means more of your input tokens pay for headless rendering and chrome, not content. Lyrenth normalizes these into a single AIDocument shape regardless of how the source page is built.

Read the quickstart →

For site owners

The signals above are exactly what Lyrenth measures per page on your verified domains. Verify a domain to see your own AI Readiness score on the dashboard, broken down page by page.

Verify a domain →

Methodology: every audited page is scored on seven content signals, each 0.0 to 1.0. Per-page scores roll up to a domain average and a corpus-wide mean. Raw page content stays in our private index; only aggregate counts and means are public. Counters are live; signal aggregates recompute about every 10 minutes.

How AI-readableis the open web?

What the audit measures

What these numbers mean

For AI agent builders

For site owners

How AI-readable
is the open web?