///WebTaskBench
WEBTASKBENCH / v1.0 / APR 2026

How Well Does Your
Agent Read the Web?

An open benchmark measuring token efficiency across 44 real websites. Fewer tokens means faster agents, lower costs, and more context for reasoning.

0
Sites Benchmarked
0x
Avg Compression
0M
Tokens Saved
0/44
SOM Wins

Top Compression Leaders

Teal = SOM tokens · Dark = raw HTML tokens · Lower SOM is better

Full Benchmark Results

All 44 sites · Click headers to sort · Search to filter

>
SITEHTML TOKENSSOM TOKENSCOMPRESSION
cloud.google.com759K6K117.9x
nytimes.com532K5K110.3x
linear.app885K11K81.5x
stripe.com/docs346K6K54.3x
figma.com537K11K49.0x
tailwindcss.com418K9K47.3x
nodejs.org188K5K39.1x
vercel.com376K11K33.4x
wired.com455K18K25.6x
typescriptlang.org103K4K23.9x
mongodb.com300K13K23.8x
aws.amazon.com106K5K21.9x
nextjs.org123K6K21.3x
theguardian.com459K26K17.4x
bbc.com/news131K10K13.2x
azure.microsoft.com164K14K11.4x
react.dev107K10K11.1x
arstechnica.com149K15K9.7x
docker.com115K12K9.2x
ycombinator.com96K11K8.8x
github.com/plasmate-labs/plasmate157K18K8.5x
notion.so75K9K7.9x
angular.dev32K4K7.4x
en.wikipedia.org/wiki/Rust_(programming_language)189K27K7.0x
httpbin.org3K6864.3x
docs.github.com29K7K4.1x
vuejs.org34K9K3.9x
getbootstrap.com29K10K3.0x
medium.com4K1K2.9x
kubernetes.io/docs123K48K2.5x
svelte.dev38K18K2.1x
lobste.rs19K9K2.1x
developer.mozilla.org/en-US/docs/Web/JavaScript52K27K1.9x
npmjs.com4K3K1.3x
docs.rs5K4K1.2x
rust-lang.org5K5K1.0x
pypi.org6K6K0.9x
news.ycombinator.com12K14K0.8x
jsonplaceholder.typicode.com2K3K0.7x
python.org9K14K0.6x
postgresql.org6K9K0.6x
example.com1523310.4x
crates.io713720.1x
producthunt.com2K29K0.0x

Category Breakdown

Average compression ratio by site category

SaaS & Cloud
~47x
cloud.google.com, linear.app, figma.com, vercel.com, stripe.com, tailwindcss.com
News & Media
~41x
nytimes.com, wired.com, bbc.com, guardian.com
Dev Tools & Docs
~15x
nodejs.org, typescriptlang.org, react.dev, nextjs.org
Static & Minimal
~0.7x
example.com, crates.io, pypi.org, Hacker News

Cost at Scale

Estimate daily savings switching from raw HTML to SOM

100100K
HTML Daily Cost
$99.54
33,181,000 tokens
SOM Daily Cost
$5.69
1,895,000 tokens
You save
$93.86/day
31,286,000 tokens saved @ $3/MTok

Methodology

Plasmate version0.3.0
HTML baselinecurl -sL (raw HTTP, no rendering)
Token countertiktoken cl100k_base (GPT-4 tokenizer)
DateApril 1, 2026
PlatformLinux x86_64
Sites51 attempted, 44 successful, 7 failed (anti-bot)
Sourcegithub.com/plasmate-labs/plasmate-benchmarks

Observatory Vision

This benchmark will be re-run weekly. Track how the web is changing for AI agents. Which sites are improving their agent-friendliness. Which are getting worse.

Contribute

Add your own sites to the benchmark:

git clone https://github.com/plasmate-labs/plasmate-benchmarks
# Add your URL to urls.txt
# Run: ./run-benchmark.sh
# Submit a PR with your results

Source: github.com/plasmate-labs/plasmate-benchmarks

Badges & Certifications

For SOM compliance scoring, badges, and certifications, see somordom.com — the community's SOM compliance tool.