LIVE OBSERVATORY · UPDATED WEEKLY

How Well Does Your
Agent Read the Web?

An open benchmark measuring token efficiency across 44 real websites. Fewer tokens means faster agents, lower costs, and more context for reasoning.

v0.5.1 · 37 sites · Run Jun 29, 2026 · Δ avg +0.0x vs 0.5.1

Avg Compression

Peak Compression

Sites Benchmarked

Top Compression Leaders

Teal = SOM tokens · Dark = raw HTML tokens · Lower SOM is better

Full Benchmark Results

All 44 sites · Click headers to sort · Search to filter

SITE	CATEGORY	HTML TOKENS	SOM TOKENS	COMPRESSION ▼
cloud.google.com	SaaS & Cloud	877K	6K	136.2x
nytimes.com	News & Media	497K	4K	114.8x
arstechnica.com	News & Media	140K	1K	108.2x
techcrunch.com	News & Media	145K	1K	101.9x
kubernetes.io/docs	Dev Tools	125K	1K	101.7x
linear.app	SaaS & Cloud	923K	11K	83.3x
figma.com	SaaS & Cloud	522K	9K	56.9x
stripe.com/docs	SaaS & Cloud	365K	7K	53.9x
tailwindcss.com	SaaS & Cloud	396K	8K	46.8x
nodejs.org	General	184K	5K	35.5x
wired.com	News & Media	562K	16K	34.3x
typescriptlang.org	Dev Tools	103K	4K	22.9x
aws.amazon.com	SaaS & Cloud	109K	5K	20.3x
vercel.com	SaaS & Cloud	206K	11K	18.6x
nextjs.org	Dev Tools	104K	6K	18.3x
theguardian.com	News & Media	484K	29K	16.7x
bbc.com/news	News & Media	147K	10K	14.1x
azure.microsoft.com	News & Media	159K	15K	10.5x
docker.com	SaaS & Cloud	126K	12K	10.4x
github.com/plasmate-labs/plasmate	Dev Tools	180K	19K	9.3x
angular.dev	Dev Tools	33K	4K	7.5x
en.wikipedia.org/wiki/Rust_(programming_language)	General	190K	28K	6.9x
vuejs.org	Dev Tools	35K	9K	3.9x
getbootstrap.com	Dev Tools	29K	10K	3.0x
developer.mozilla.org/en-US/docs/Web/JavaScript	Dev Tools	54K	22K	2.4x
svelte.dev	Dev Tools	38K	18K	2.1x
lobste.rs	General	18K	9K	1.9x
docs.rs	Dev Tools	5K	4K	1.2x
rust-lang.org	Dev Tools	5K	5K	1.0x
pypi.org	Dev Tools	6K	7K	0.9x
news.ycombinator.com	General	12K	15K	0.8x
jsonplaceholder.typicode.com	General	2K	3K	0.8x
postgresql.org	Dev Tools	7K	9K	0.7x
python.org	General	9K	14K	0.6x
example.com	General	162	357	0.5x
crates.io	General	70	300	0.2x
producthunt.com	General	3K	23K	0.1x

Category Breakdown

Average compression ratio by site category

SaaS & Cloud

~47x

cloud.google.com, linear.app, figma.com, vercel.com, stripe.com, tailwindcss.com

News & Media

~41x

nytimes.com, wired.com, bbc.com, guardian.com

Dev Tools & Docs

~15x

nodejs.org, typescriptlang.org, react.dev, nextjs.org

Static & Minimal

~0.7x

example.com, crates.io, pypi.org, Hacker News

Browse by Category

Deep-dive into vertical-specific benchmark data

Developer Documentation

26.0xavg compression

10 sites measured

View leaderboard →

Cost at Scale

Estimate daily savings switching from raw HTML to SOM

pages/day1,000

100100K

HTML Daily Cost

$99.54

33,181,000 tokens

SOM Daily Cost

$5.69

1,895,000 tokens

You save

$93.86/day

31,286,000 tokens saved @ $3/MTok

Methodology

Plasmate version	0.5.1
HTML baseline	curl -sL (raw HTTP, no rendering)
Token counter	tiktoken cl100k_base (GPT-4 tokenizer)
Date	Jun 29, 2026
Platform	Linux x86_64
Sites	37 attempted, 37 successful, 0 failed (anti-bot)
Source	github.com/plasmate-labs/plasmate-benchmarks

SOM is defined by the open SOMspec specification.

Contribute

Add your own sites to the benchmark:

git clone https://github.com/plasmate-labs/plasmate-benchmarks
# Add your URL to urls.txt
# Run: ./run-benchmark.sh
# Submit a PR with your results

Source: github.com/plasmate-labs/plasmate-benchmarks

Observatory Vision

Re-run weekly against the latest Plasmate release. Watch the GitHub repo for update notifications. Track how the web is changing for AI agents. Which sites are improving their agent-friendliness. Which are getting worse. Results follow the WebTaskBench Protocol v1.0 — a reproducible methodology open to third-party submissions.

★ Watch on GitHub for weekly updates

Badges & Certifications

For SOM compliance scoring, badges, and certifications, see somordom.com — the community's SOM compliance tool.

How Well Does YourAgent Read the Web?

Top Compression Leaders

Full Benchmark Results

Category Breakdown

Browse by Category

Cost at Scale

Methodology

Contribute

Observatory Vision

Badges & Certifications

How Well Does Your
Agent Read the Web?