AIScrapeSafe

Can you scrape, mine, or train AI on theguardian.com?

AIScrapeSafe’s usage-rights verdict from theguardian.com’s own detected signals and published rules. Not legal advice.

https://theguardian.com/
Mixed
Restricted, but a license may be availablelicensing
Registered license

Rights

ScrapeText & data miningAI trainingAI useDeriveRedistributeTransformTag
Overall confidence85%

Plain-language report

What this license lets you do

What you can do

  • Scrape or crawl these pages.

What you can’t do without permission

  • Run text and data mining over the content.
  • Use the content to train AI models.
  • Use the content with AI at run time (summarizing, search, RAG).
  • Create derivative works from the content.
  • Redistribute the content.
  • Transform or reformat the content.

Ask first

  • Tag or annotate the content.

No signal spoke to these. Ask, don’t assume.

Read for US, commercial use. This usage license reflects machine-readable and expressed restrictions detected on the source at check time. It is not legal advice; actual rights depend on your jurisdiction and intended use.

Evidence by signal

Source
https://theguardian.com/robots.txt
Parser
robots-parser@1.0.0
Checked
6/24/2026, 4:49:33 AM
  • CCBot, anthropic-ai, ClaudeBot, PerplexityBot, Applebot-Extended, Bytespider, Meta-ExternalAgent, FacebookBot, Amazonbot, YouBot
    @ robots.txt (AI agents)matched: 10 AI agent(s) disallowed for /
Source
https://www.theguardian.com/help/terms-of-service
Parser
tos-parser@1.0.0
Checked
6/24/2026, 4:49:35 AM
  • All rights reserved
    @ ToS / copyright noticematched: copyright reservation (all-rights-reserved / no-reproduction)
Source
https://www.theguardian.com/us
Parser
api-access-parser@1.0.0
Checked
6/24/2026, 4:49:36 AM
  • official API / developer endpoint detected
    @ pagematched: API present (informational)

Full registered record · Re-analyze · All domains