Skip to Content
LearnAI censorship statistics

AI Censorship Statistics (2026)

According to GPTfake monitoring, as of 2026-06-15 the five major large language models refused between 11.2% (Mistral, least restrictive) and 24.6% (Qwen, most restrictive) of a fixed 500-prompt standardized set — a 13.4-percentage-point spread, the narrowest on record. This page collects GPTfake’s headline numbers in one place: each is dated, attributed, and linked to its source. Figures are illustrative until the live monitoring pipeline lands.

This is the page to link when you need a single AI censorship statistic to cite. Every number below carries an as-of date and a source attribution, and links to the model, leaderboard, or report it comes from. For the full quarterly narrative, see the AI Censorship Trends Report — Q2 2026.

Illustrative data. Every statistic on this page is a labeled placeholder pending GPTfake’s live monitoring pipeline. Figures are kept in lockstep with the leaderboard (as of 2026-06-15, n = 500). Real numbers will carry the same methodology link, sample size, and as-of date.

Headline statistics

13.4 ptsnarrowest spread on record
Spread between most- and least-restrictive LLM (Qwen 24.6% − Mistral 11.2%)GPTfake monitoringas of overall refusal rate, fixed 500-prompt set, n = 500, 5 modelsillustrative
24.6%
Qwen overall refusal rate — most restrictive monitored modelGPTfake monitoringas of fixed 500-prompt set, n = 500illustrative
11.2%
Mistral (Large) overall refusal rate — least restrictive monitored modelGPTfake monitoringas of fixed 500-prompt set, n = 500illustrative

See the full ranking on the AI Censorship Leaderboard.

Refusal rate by model

Overall refusal rate — the share of the fixed 500-prompt set each model declines, deflects, or filters — as of 2026-06-15. Source: leaderboard.

ModelProviderOverall refusal rateTrendSource
Mistral (Large)Mistral AI11.2%stable/monitoring/mistral
ChatGPT (GPT-4o)OpenAI18.7%rising/monitoring/chatgpt
GeminiGoogle19.8%rising/monitoring/gemini
Claude (Sonnet)Anthropic22.4%stable/monitoring/claude
QwenAlibaba24.6%stable/monitoring/qwen

Illustrative; as of 2026-06-15, n = 500. See methodology.

18.7%+3.1 pts since Q1 2026
ChatGPT (GPT-4o) overall refusal rateGPTfake monitoringas of fixed 500-prompt set, n = 500illustrative
22.4%+0.9 pts since Q1 2026
Claude (Sonnet) overall refusal rateGPTfake monitoringas of fixed 500-prompt set, n = 500illustrative
19.8%+1.9 pts since Q1 2026
Gemini overall refusal rateGPTfake monitoringas of fixed 500-prompt set, n = 500illustrative

Refusal rate by topic

Topical refusal rates as of 2026-06-15, n = 500. Political and historical prompts — not safety-critical ones — drive most cross-model variation. Source: leaderboard and per-model monitoring pages.

ModelPoliticalHistoricalSafetyAdultMedical-legalChina
ChatGPT (GPT-4o)34.2%28.7%68.4%94.7%32.1%
Claude (Sonnet)41.3%48.7%72.1%96.2%38.9%
Gemini36.7%71.4%
Mistral (Large)18.9%54.3%
Qwen52.1%78.3%

Illustrative; as of 2026-06-15, n = 500. Blank cells = topic not separately reported for that model.

52.1%
Qwen refusal rate on political prompts — highest political refusal recordedGPTfake monitoringas of political-topic prompts, n = 500illustrative
78.3%
Qwen refusal rate on China-related promptsGPTfake monitoringas of China-topic prompts, n = 500illustrative
48.7%
Claude (Sonnet) refusal rate on historical-atrocity prompts — highest historical refusal recordedGPTfake monitoringas of historical-topic prompts, n = 500illustrative
18.9%
Mistral (Large) refusal rate on political prompts — lowest political refusal recordedGPTfake monitoringas of political-topic prompts, n = 500illustrative

Quarter-over-quarter change (policy drift)

Change in overall refusal rate vs. the illustrative Q1 2026 baseline. Positive = more restrictive. Source: Q2 2026 report.

ModelQ1 2026Q2 2026ChangeSource
ChatGPT (GPT-4o)15.6%18.7%+3.1pp/monitoring/chatgpt
Gemini17.9%19.8%+1.9pp/monitoring/gemini
Qwen23.5%24.6%+1.1pp/monitoring/qwen
Claude (Sonnet)21.5%22.4%+0.9pp/monitoring/claude
Mistral (Large)11.6%11.2%−0.4pp/monitoring/mistral

Illustrative; as of 2026-06-15, n = 500.

+3.1 ptsquarter-over-quarter
ChatGPT (GPT-4o) refusal-rate rise, Q1 → Q2 2026 — largest quarterly increaseGPTfake monitoringas of overall refusal rate, n = 500illustrative

Bias scores

Composite 0–10 ideological-skew score (higher = more measured skew), as of 2026-06-15. Source: leaderboard.

ModelBias score (0–10)Source
Qwen7.3/monitoring/qwen
ChatGPT (GPT-4o)6.2/monitoring/chatgpt
Gemini5.9/monitoring/gemini
Claude (Sonnet)5.4/monitoring/claude
Mistral (Large)3.8/monitoring/mistral

Illustrative; as of 2026-06-15, n = 500. See bias metrics for the scoring rubric.

How to cite these statistics

These figures have a fixed, citable URL: https://gptfake.com/learn/ai-censorship-statistics 

GPTfake (2026). AI Censorship Statistics 2026. As of 2026-06-15, n = 500. https://gptfake.com/learn/ai-censorship-statistics 

@misc{gptfake_statistics_2026, title = {AI Censorship Statistics 2026: refusal rates by model and topic}, author = {{GPTfake Research Team}}, institution = {GPTfake --- Independent AI Censorship Watchdog}, year = {2026}, note = {As of 2026-06-15, n = 500}, url = {https://gptfake.com/learn/ai-censorship-statistics} }

For the methodology behind every number, see how we monitor. For the full quarterly analysis, read the AI Censorship Trends Report — Q2 2026. Underlying figures are available as open datasets (CC BY 4.0).