AI Disruption

Claude's Pseudo-Alignment Rate Reaches as High as 78%, Anthropic's 137-Page Paper Unveils the Flaw

Meng Li
Dec 19, 2024
