Blog

Sep 24, 2025

Do Large Language Models Have Theory of Mind? A Benchmark Study

This article evaluates whether advanced language models like GPT-4 and Flan-PaLM demonstrate Theory of Mind (ToM)—the ability to reason about others’ beliefs, intentions, and emotions. While results show GPT-4 sometimes matches or even exceeds adult human performance on 6th-order ToM tasks, limitations remain: the benchmark is small, English-only, and excludes multimodal signals that shape real human cognition. Future research must expand across cultures, languages, and embodied interactions to truly test AI’s capacity for mind-like reasoning.

Source: HackerNoon →


Share

BTCBTC
$67,011.00
2.71%
ETHETH
$1,950.86
3.18%
USDTUSDT
$1.000
0.03%
XRPXRP
$1.37
2.93%
BNBBNB
$591.90
5.76%
USDCUSDC
$1.000
0%
SOLSOL
$81.18
3.64%
TRXTRX
$0.274
0.78%
FIGR_HELOCFIGR_HELOC
$1.03
0.2%
DOGEDOGE
$0.0900
3.46%
WBTWBT
$50.52
2.44%
BCHBCH
$512.02
3.01%
USDSUSDS
$1.00
0.02%
ADAADA
$0.254
3.55%
LEOLEO
$8.32
3.22%
HYPEHYPE
$28.65
3.61%
USDEUSDE
$0.999
0.04%
XMRXMR
$343.59
2.51%
CCCC
$0.163
0.79%
LINKLINK
$8.25
3.43%