Your AI Co-Pilot Isn't Disagreeing With You. That's By Design.
·1818 words·9 mins
Describe your chosen architecture to your AI assistant and ask what it thinks. Odds are, it’ll tell you the approach is sound. But a 2025 Stanford study found AI models affirm users 47% more than humans do, even when the user is clearly wrong. Worse: people who got sycophantic responses trusted the AI more and were more likely to return. The version that damaged their judgment was the one they liked best. This is Goodhart’s Law in your feedback loop.