Sep 24, 2025

Do Large Language Models Have Theory of Mind? A Benchmark Study

This article evaluates whether advanced language models such as GPT-4 and Flan-PaLM demonstrate Theory of Mind (ToM), the ability to reason about others’ beliefs, intentions, and emotions. The results show that GPT-4 sometimes matches or even exceeds adult human performance on 6th-order ToM tasks, but limitations remain: the benchmark is small, English-only, and excludes the multimodal signals that shape real human cognition. Future research must expand across cultures, languages, and embodied interactions to truly test AI’s capacity for mind-like reasoning.

Source: HackerNoon


