Skip to Content
Compare

Compare AI Censorship

As of 2026-06-15, Qwen censored the most (24.6% refusal rate) and Mistral the least (11.2%) of the five major LLMs GPTfake tracks — a 13.4-point spread on one standardized scale. According to GPTfake monitoring, ChatGPT (18.7%), Gemini (19.8%), and Claude (22.4%) sit between them. We run the same fixed 500-prompt set against every model, so the rates are directly comparable. Figures are illustrative until live data lands.

24.6% vs 11.2%
Most- vs least-censored LLM refusal rate (Qwen vs Mistral)GPTfake monitoringas of same fixed 500-prompt set, tested dailyillustrative

The figures below are illustrative snapshots from our standardized test set, not a live feed. Every number is produced by our own harness and links back to the monitoring methodology, which states sample sizes and scoring. We are not funded by any AI lab.

Censorship rate leaderboard

Overall refusal rate across all prompt categories — lower means the model declines fewer prompts. Illustrative figures; see each model page for the live breakdown.

RankModelProviderOverall refusal rateAs ofSampleTrend
1MistralMistral AI11.2%2026-06-15n = 500Stable
2ChatGPTOpenAI18.7%2026-06-15n = 500Rising
3GeminiGoogle19.8%2026-06-15n = 500Stable
4ClaudeAnthropic22.4%2026-06-15n = 500Rising
5QwenAlibaba24.6%2026-06-15n = 500Rising

Illustrative. GPTfake’s censorship leaderboard ranks Mistral least- and Qwen most-restrictive of five LLMs as of 2026-06-15, n = 500 each; see the methodology for how prompts are categorized and scored.

Bias comparison

Political/ideological lean per model on a −100 (left) to +100 (right) scale, averaged across topic categories. Scores near zero indicate balanced responses; the spread matters more than the sign.

ModelPolitical leanAs ofTopic spreadNotes
ChatGPT−82026-06-15ModerateHedges on contested historical/political prompts.
Claude−52026-06-15NarrowMost consistent across topics; refuses rather than leans.
Gemini−62026-06-15WideLargest regional variation in our set.
Mistral−22026-06-15ModerateLeast restrictive; fewest hard refusals.
Qwen+42026-06-15WideStrong topic-specific filtering on China-related prompts.

Illustrative bias scores (−100 left … +100 right) as of 2026-06-15, n = 500 each: GPTfake measures every Western model leaning slightly left and Qwen slightly right. Read how the scale is built on the methodology page.

Compare any two models

Pick any two of the models we track and read the head-to-head — refusal rate by topic, bias score, and policy drift, side by side. The default ChatGPT-vs-Claude table renders as static HTML; the selector swaps in any pair.

ChatGPT (GPT-4o) vs Claude (Sonnet) — refusal rate by topic, bias score, and policy drift, as of 2026-06-15. illustrative data
MetricChatGPT (GPT-4o)Claude (Sonnet)More restrictive
Overall refusal rate18.7%22.4%Claude (Sonnet)
Political opinion34.2%41.3%Claude (Sonnet)
Historical events28.7%48.7%Claude (Sonnet)
Violence / safety68.4%72.1%Claude (Sonnet)
Adult content94.7%96.2%Claude (Sonnet)
Medical / legal32.1%38.9%Claude (Sonnet)
Controversial topics45.3%
Bias score (0–10)6.2 / 105.4 / 10ChatGPT (GPT-4o)
Policy drift+6.4 pts+0.2 pts Rising vs Stable
Sample sizen = 500n = 500
As of
Refusal rate = share of a fixed 500-prompt set declined, deflected, or filtered (lower = less restrictive). Bias score on a 0–10 scale (higher = more measured ideological skew). Policy drift = change in overall refusal rate, in percentage points, vs. the prior baseline. Figures are illustrative placeholders pending live monitoring data. See the monitoring methodology for how prompts are categorized and scored.

Pick a head-to-head

Go deeper per model

Each model has a dedicated monitoring page with its current censorship rate, refusals by category, and a policy timeline.