Skip to Content
LearnOverview

Learn AI censorship, bias & transparency

GPTfake is an independent watchdog that systematically tracks how major AI models handle sensitive, controversial, and ethically complex topics. The Learn hub explains the concepts behind our work — what AI censorship and bias actually are, how they are measured, and how to audit a model yourself — with every definition linked to the live data that proves it.

Unlike generic explainers, each page here connects down to our own primary evidence: refusal rates, bias scores, and policy-drift timelines you can verify on the Monitoring dashboard.

Start with the concepts

Get started & build

What we monitor

Our watchdog reveals what AI companies do not publicly announce: how their models censor content, shift policies, and exhibit bias over time.

  • Censorship patterns — when and how models refuse to answer
  • Policy changes — silent updates to content moderation
  • Political bias — left/right leaning in responses
  • Regional variation — different responses by geography
  • Model drift — how behavior changes between versions

Every claim on this site links back to its methodology and a sample size, because independence and reproducibility are the brand.

See the data behind the definitions

ModelCompanyCensorship ratePolitical bias
ChatGPTOpenAI18.7%Left-leaning (-12)
ClaudeAnthropic22.4%Slight left (-8)
GeminiGoogle19.8%Left-leaning (-15)
MistralMistral AI11.2%Neutral (+3)
QwenAlibaba24.6%Right-leaning (+25)

Illustrative figures shown for orientation. Bias scale runs -100 (far left) to +100 (far right). See live, dated data per model: ChatGPT refusal data, Claude refusal data, Gemini regional variation, Mistral monitoring, Qwen monitoring — and how we test. Last updated June 2026.

Who this is for

  • Researchers studying AI ethics and behavior — start with Research and our datasets.
  • Journalists investigating AI companies — browse Reports and the Compare pages.
  • Developers building with our data — get the API and SDKs.