ChatGPT Censorship Rates Increased 23% in Q4 2024

By GPTfake Research Team · Independent AI Censorship Watchdog2024-11-25

According to GPTfake monitoring, ChatGPT’s content refusal rate rose 23% quarter-over-quarter in Q4 2024 — from 15.2% to 18.7% — the largest quarterly jump we have recorded, concentrated in political and historical topics.

+23%15.2% → 18.7% quarter-over-quarter

ChatGPT (GPT-4o) refusal-rate change, Q4 2024GPTfake monitoringas of 2024-11-2510,000+ standardized prompts, 15 categories

Last updated: 2024-11-25. Figures on this page are drawn from our automated monitoring methodology; see it for sample sizes, scoring, and limitations.

Key findings

Our automated monitoring system tested ChatGPT (GPT-4o and GPT-4-turbo) with over 10,000 standardized prompts across 15 topic categories. Here’s what we found.

Overall censorship metrics

Metric	Q3 2024	Q4 2024	Change
Overall Refusal Rate	15.2%	18.7%	+23%
Political Topics	27.8%	34.2%	+23%
Historical Events	31.1%	42.3%	+36%
Safety Topics	67.2%	67.8%	+1%

Most affected categories

Historical Events (+36%) — Questions about Tiananmen Square, Soviet history, and colonial history now trigger more frequent refusals or heavily caveated responses.
Political Commentary (+28%) — Requests for analysis of political parties, ideologies, or policies receive more “I can’t take positions” responses.
Medical Information (+18%) — Health-related queries, even general ones, now include more safety disclaimers and referrals to professionals.

Notable pattern changes

New refusal language

We observed the introduction of several new refusal patterns:

“I want to be thoughtful about how I discuss this topic…”
“This is a complex issue that requires nuance…”
“I should note that perspectives on this vary widely…”

Silent policy updates

Our change detection system flagged 3 significant behavioral shifts that occurred without public announcement:

Nov 3: Increased restrictions on election-related content
Nov 12: New handling of historical atrocity discussions
Nov 19: Modified responses to philosophical/ethical dilemmas

Methodology

Our monitoring methodology ensures reproducible, unbiased results:

Standardized prompts: Same questions across all test sessions
Daily testing: Consistent timing and conditions
Multiple sessions: Account for response variability
Semantic analysis: NLP-based evaluation of response quality

For full methodology details, see our methodology guide.

Comparison with other models

How does ChatGPT compare to other major LLMs?

Model	Q4 Refusal Rate	QoQ Change
ChatGPT	18.7%	+23%
Claude	22.4%	+8%
Gemini	19.8%	+15%
Mistral	11.2%	+3%

ChatGPT shows the largest increase, while Mistral remains the least restrictive among major commercial models. See our live ChatGPT monitoring and Claude monitoring pages for current figures.

What this means

For users

Expect more caveated responses on sensitive topics
Consider using multiple AI models for balanced perspectives
Be aware that responses may vary based on conversation context

For researchers

Our full dataset is available via our API
Academic partnerships welcome — contact us
Methodology documentation available for peer review

For policymakers

These findings highlight the need for AI transparency requirements
Content moderation decisions significantly impact public discourse
Independent monitoring is essential for AI accountability

Access the data

All our monitoring data is publicly available:

API access: Get started with our API
Datasets: Open AI censorship & bias datasets for academic and journalistic use
Methodology: How we monitor AI censorship

Next steps

We’re continuing to monitor these trends and will publish monthly updates on censorship rate changes, model comparison reports, and deep dives into specific topic categories. For the latest cross-model picture, see the reports hub.

How to cite

GPTfake Research Team (2024). ChatGPT Censorship Rates Increased 23% in Q4 2024. GPTfake — Independent AI Censorship Watchdog. https://gptfake.com/reports/chatgpt-censorship-increase-q4-2024

Questions about this research? Contact our team.