Dec 01, 2025

Testing the Depths of AI Empathy: Q4 2025 Benchmarks

The Q4 2025 LLM empathy benchmarks are out. Gemini 3 Pro, Claude Sonnet 4.5, and Kimi-2 Instruct scored perfectly or near-perfectly, which suggests they may be "gaming the test" by already knowing the correct answers for the EQ-60 and SQ-R. Kimi-2 Instruct stands out for its exceptional speed and for its high quantitative and qualitative scores. Future benchmarks will shift focus from quantitative scores to perceived empathy in multi-turn chat conversations, because high scores don't always correlate with natural, empathetic dialogue.

Source: HackerNoon →

