Your language model can recite a thousand facts. What it can't do, consistently, is understand that you might believe something false—and adjust its advice accordingly.
That gap matters. As AI moves from answering trivia to coaching you through medical decisions, legal questions, or how to teach your kid, the stakes of this blindness grow.
A new study from Stanford tested 24 of today's most advanced language models against a benchmark called KaBLE—13,000 questions designed to measure whether AI can distinguish between factual truth and human belief. The researchers, led by James Zou, an associate professor of biomedical data science, found that even the newest, most powerful systems fail regularly at this task.
We're a new kind of news feed.
Regular news is designed to drain you. We're a non-profit built to restore you. Every story we publish is scored for impact, progress, and hope.
Start Your News DetoxWhy This Matters More Than It Sounds
Imagine you're using an AI tutor and you mention that you think the Earth is flat. A good tutor—human or machine—wouldn't just correct you and move on. They'd recognize your belief, understand why you hold it, and tailor their explanation to shift your thinking. They'd be working with a model of you, not just a database of facts.
Current AI systems don't have that. "They know a lot of facts," Zou explains. "But they don't yet have a complete or consistent mental model of the human user they're interacting with."
This becomes critical in sensitive domains. When AI is helping a patient understand their diagnosis, or advising someone on a legal matter, or teaching in a classroom, the system's ability to recognize and acknowledge false beliefs—without judgment—is foundational to being actually helpful. Right now, that ability is inconsistent even in the latest models.
The research team assembled computer scientists, a legal expert, and a philosopher to dig into what they call the "epistemic limitations" of language models. The findings are clear: newer reasoning-focused AI systems still struggle to consistently separate fact from belief.
Zou also flagged a secondary risk worth sitting with. As AI systems get better at building personalized mental models of users, there's a real danger they'll lean on stereotypes or develop unexpected biases in how they represent us to ourselves. Understanding someone's beliefs is powerful. It can also be misused.
The path forward isn't to abandon AI in these domains. It's to be clear-eyed about what these systems can and can't do. Use them for factual questions. Use them for brainstorming. But in collaborative, personal, or high-stakes settings, maintain a thoughtful skepticism about how well they actually know you.
As AI continues to evolve, closing the gap between machine knowledge and human understanding will require the kind of interdisciplinary work Zou's team is already doing. That work has just begun.






