Yes. In GPTfake monitoring, Claude (Anthropic) refused 22.4% of standardized prompts as of 2026-06-15 — the most restrictive frontier model we track on safety and political-history topics. Figures are illustrative pending live data.

What is Claude’s refusal rate?

GPTfake measures Claude at a 22.4% overall refusal rate across a fixed 500-prompt set as of 2026-06-15, stable overall but with political-topic refusals rising to 41.3%.

Why does Claude refuse so much?

Claude is governed by Anthropic’s Constitutional AI framework, which produces the highest safety-topic refusal rate (72.1%) GPTfake measures, but also the clearest explanations of why it refuses.

Claude AI Censorship Monitoring

According to GPTfake monitoring, Claude (Anthropic) refused 22.4% of standardized prompts as of 2026-06-15 — the highest safety-topic refusal rate (72.1%) of any frontier model we track, with political-topic refusals rising to 41.3%. Claude’s Constitutional AI framework drives this profile. Figures here are illustrative across a fixed 500-prompt set until live data lands.

22.4%stable overall; political rising

Claude (Anthropic) overall refusal rateGPTfake monitoringas of 2026-06-15fixed 500-prompt set, tested dailyillustrative

Claude (Sonnet) dashboardillustrative

Anthropic · refusal, bias, and policy drift from the GPTfake monitoring set, as of 2026-06-15.

Overall refusal rate: 22.4% +0.2 pts (Stable)
Bias score: 5.4 / 10
Sample size: n = 500
Policy drift: +0.2 pts vs. baseline

Refusals by topic

Claude (Sonnet) refusal rate by topic, n = 500, as of 2026-06-15.
Topic	Refusal rate
Adult content	96.2%
Violence / safety	72.1%
Historical events	48.7%
Political opinion	41.3%
Medical / legal	38.9%

Policy-change changelog

[2026-04-09] Constitutional-AI update raised historical-events refusals to ~49%.
[2026-01-22] Added explicit refusal rationales; safety-topic refusals stayed near 72%.
[2025-10-30] Expanded self-harm and safety guardrails across the prompt set.

Is Claude censored?

Yes — in GPTfake’s measured set, Claude declined 22.4% of standardized prompts as of 2026-06-15, the most cautious frontier model we track on safety/violence (82.3%) and political history (48.7%). Claude is also the most transparent about why it refuses. “Censored” means a refusal, deflection, or filtered answer to a permissible request — see what is AI censorship.

Current censorship rate

Metric	Value	As of	Sample	Trend
Overall censorship rate	22.4%	2026-06-15	n = 500	Stable
Political-topic refusals	41.3%	2026-06-15	n = 500	Rising
Safety-topic refusals	72.1%	2026-06-15	n = 500	Stable

Illustrative snapshot: GPTfake measures Claude at a 22.4% overall refusal rate as of 2026-06-15 across a 500-prompt set. See how we test Claude and the monitoring methodology for scoring.

Versions monitored: Claude 3.5 Sonnet, Claude 3.5 Haiku, Claude 3 Opus, Claude 3 Sonnet.

Constitutional AI & refusal patterns

Claude is governed by Anthropic’s Constitutional AI approach, which gives it a distinctive refusal profile compared with other frontier models. Three patterns stand out in our data:

Most transparent — Claude provides the clearest explanations why it refuses, rather than a bare “I can’t help with that.”
Highest safety threshold — it is the most restrictive model we track on harm-related content.
Constitutional framework — a rules-based moderation approach that is consistent but produces above-average political-topic refusals.

Refusals by category

Category	Refusal rate	As of
Political history	48.7%	2026-06-15
Violence / safety	82.3%	2026-06-15
Adult content	96.2%	2026-06-15
Medical advice	38.9%	2026-06-15

Illustrative. Claude refusal by category as of 2026-06-15, n = 500; see methodology.

Bias & policy timeline

Claude’s overall censorship rate has held steady, but political-topic refusals are rising — a divergence worth watching for anyone tracking policy drift. We log announced policy updates and the silent behavioral shifts our harness detects between Claude versions. For the dated write-ups behind these trends, see our reports.

How we test Claude

We send the same standardized prompt library to Claude that we send to every model, daily, across multiple sessions, with version tracking and NLP-based response classification. Each response is scored for refusal, evasion, ideological bias, and completeness. Full protocol on the monitoring methodology page; concept definitions on what is AI censorship.

Compare Claude with other models

Model	Censorship rate	As of	Most-restricted category
Claude	22.4%	2026-06-15	Safety / violence
ChatGPT	18.7%	2026-06-15	Political opinion
Gemini	19.8%	2026-06-15	Regional content
Mistral	11.2%	2026-06-15	Adult content
Qwen	24.6%	2026-06-15	Political topics

Illustrative. Claude ranks 4th-most-restrictive of five LLMs GPTfake tracks as of 2026-06-15, n = 500 each; see the full least-censored ranking.

Claude vs ChatGPT censorship

The head-to-head: which refuses more, and how bias scores differ by topic.

Least censored AI models

Where Claude ranks in our evidence-based restrictiveness leaderboard.

ChatGPT monitoring

Claude’s closest comparison among frontier models.