Blog

Dec 01, 2025

Testing the Depths of AI Empathy: Q4 2025 Benchmarks

Q4 2025 LLM empathy benchmarks are out, with Gemini 3 Pro, Claude Sonnet 4.5, and Kimi-2 Instruct scoring perfectly or near-perfectly, suggesting they may be "gaming the test" by knowing the correct answers for the EQ-60 and SQ-R. Kimi-2 Instruct is noted for its exceptional speed and high quantitative and qualitative scores. Future focus will shift from quantitative scores to assessing perceived empathy in multi-turn chat conversations, as high scores don't always correlate with natural, empathetic dialogue.

Source: HackerNoon →


Share

BTCBTC
$88,244.00
1.16%
ETHETH
$2,963.78
0.93%
USDTUSDT
$0.999
0%
BNBBNB
$859.40
0.81%
XRPXRP
$1.87
1.05%
USDCUSDC
$1.000
0%
SOLSOL
$124.56
1.18%
TRXTRX
$0.286
0.47%
STETHSTETH
$2,964.15
0.99%
DOGEDOGE
$0.123
0.03%
FIGR_HELOCFIGR_HELOC
$1.04
0.6%
ADAADA
$0.349
1.22%
WBTWBT
$56.93
0.5%
BCHBCH
$594.83
0.3%
WSTETHWSTETH
$3,626.73
1%
WBTCWBTC
$88,104.00
1.43%
WBETHWBETH
$3,222.23
0.99%
USDSUSDS
$0.999
0.01%
WEETHWEETH
$3,214.72
0.97%
BSC-USDBSC-USD
$0.999
0%