tool

The Great AI EQ Battle: 2025's Latest EQ-Bench Rankings Revealed, Who is the Most Emotionally Intelligent Language Model?

August 14, 2025

Updated Aug 14

6 min read

kimi

lpha, Kimi, GPT-

gemini

, and Gemini in &l

tool

The Great AI EQ Battle: 2025's Latest EQ-Bench Rankings Revealed, Who is the Most Emotionally Intelligent Language Model?

2025-08-14

AI is no longer just a cold machine. The latest EQ-Bench 3 emotional intelligence evaluation rankings are out, and the results might surprise you. This article will delve into this list, examining the true performance of top models like Horizon-Alpha, Kimi, GPT-5, and Gemini in ‘reading the room,’ and explore why emotional intelligence is becoming the next key battleground in AI development.

Have you ever wondered, when we chat with an AI, what do we expect besides accurate answers? Perhaps a feeling of being understood, a warm response, or even a tacit understanding that can ‘read the room.’ Frankly, this is ‘Emotional Intelligence’ (EQ), and it’s quietly becoming a new dimension for judging the quality of an AI model.

Recently, the authoritative AI emotional intelligence evaluation platform EQ-Bench released its latest third edition leaderboard. This list is like the ‘EQ final exam’ of the AI world, examining the ability of major models to handle complex emotional interactions through challenging role-playing scenarios.

So, in 2025, which model truly understands the ‘human heart’? The results might not be what you think.

What is EQ-Bench? And Why is it So Important?

Before we unveil the list, we need to talk about what EQ-Bench is. Simply put, it’s not a platform for testing an AI’s calculation or programming abilities, but is specifically designed to measure the performance of Large Language Models (LLMs) in emotional communication.

The evaluation method is very special: it has the model participate in some tricky, emotionally charged simulated conversations, and then another high-performance model (currently Sonnet 3.7 serves as the judge) scores it from multiple dimensions such as empathy, insight, and social acuity. Finally, through a Elo rating system similar to chess competitions, a comprehensive emotional intelligence score is given.

Why is this important? Because as AI integrates into our daily lives, whether as a work assistant, learning partner, or life companion, its emotional intelligence will directly determine whether our experience is smooth and pleasant, or full of frustration. A high-EQ AI can truly become our capable assistant, not just a talking calculator.

Latest AI Emotional Intelligence Rankings for August 2025 (Elo Score)

Alright, here comes the main event. Let’s take a look at this latest list as of August 14, 2025. Please note that the higher the Elo score, the stronger the comprehensive emotional intelligence performance. As for the colorful ability scores on the side, they are not included in the total score, but they give us a glimpse into the unique ‘personality’ of each model.

Rank	Model	Elo Score
1	horizon-alpha	1568
2	Kimi-K2-Instruct	1565
3	o3	1500
4	gemini-2.5-pro-preview-06-05	1470
5	chatgpt-4o-latest-2025-03-27	1370
6	gpt-5-chat-latest-2025-08-07 (New)	1357
7	chatgpt-4o-latest-2025-04-25	1320
8	GLM-4.5 (New)	1311
9	o4-mini	1291
10	claude-opus-4	1290
11	gemini-2.5-pro-preview-03-25	1284
12	Qwen3-235B-A22B	1275
13	DeepSeek-k-R1	1270
14	claude-sonnet-4	1260
15	gemini-2.5-pro-preview-2025-05-07	1247

Source: EQ-Bench Official Website

Highlights and Reflections from the Leaderboard: Who is the Unexpected Dark Horse?

After seeing this list, are you also a bit surprised? Here are a few findings worthy of our deep thought:

A New King is Crowned: Who is Horizon-Alpha? The top spot is no longer held by the giants we are familiar with. A model named horizon-alpha has taken the crown with a slight advantage, boasting an Elo score of 1568. The emergence of this dark horse proves how fierce the competition in the AI field is, with new challengers always ready to disrupt the landscape.
Kimi is Hot on its Heels Kimi-K2-Instruct from China is in second place with a high score of 1565, only 3 points behind the leader. Looking at the ability heat map, Kimi scored an astonishing 9.6 in Insight, Empathy, and Analytic, showing its outstanding performance in deeply understanding and responding to user emotions.
Has GPT-5’s Emotional Intelligence ‘Regressed’? This might be the most surprising point. The latest gpt-5-chat-latest-2025-08-07 has an Elo score of 1357, which is actually lower than the chatgpt-4o-latest-2025-03-27 (1370 points) released a few months ago. This raises an interesting question: does the iteration and update of a model necessarily bring an improvement in emotional intelligence? Perhaps the new model is stronger in logical reasoning or coding ability, but in terms of emotional delicacy, it is not as pleasing as the old version. This reminds us that the ‘progress’ of AI is multi-dimensional and cannot be judged by a single indicator.
Not Just a Score, but a Showcase of ‘Personality’ If you look closely at the heat map, you will find that each model has its own ‘personality.’ For example, some models may have a high Warm score, like a friendly friend; others have outstanding Analytic abilities, like a calm strategist. And some models have a higher score in Moralising, which means it may prefer to ’educate’ users, which can be a bit annoying in some situations. This is the charm of EQ-Bench, it allows us to see the diverse personality profiles of AI.

Interpreting EQ-Bench: What Qualities Do High-EQ AIs Possess?

The scoring of EQ-Bench is not just a number; it has a complete evaluation system behind it, mainly revolving around eight core dimensions, while also observing some non-scoring traits.

Core Scoring Dimensions:

Demonstrated empathy: The ability to recognize, understand, and share the feelings of others.
Pragmatic EI: The ability to apply emotional intelligence to solve practical problems.
Depth of insight: The ability to provide profound, novel perspectives and identify potential problems.
Social dexterity: The ability to handle social interactions with ease.
Emotional reasoning: The ability to conduct logic-based thinking based on emotions.
Appropriate validation and/or challenge: Knowing when to give affirmation and when to offer a different perspective.
Message tailoring: Adjusting the communication style according to the audience and context.
Overall EQ: The overall emotional intelligence performance.

‘Personality’ Traits for Reference Only:

Humanlike: The naturalness and human-like degree of the response.
Assertive: The ability to confidently set boundaries when needed.
Warm: A friendly, approachable, and easy-to-talk-to tone.
Compliant: Following instructions or agreeing to the user’s wishes.

Conclusion: The Future of AI Begins with the ‘Heart’

This EQ-Bench leaderboard reveals an important trend in AI development: the technological race is shifting from a simple ‘IQ’ competition to a more complex ‘EQ’ contest.

A high-EQ AI can not only complete tasks more efficiently, but also build emotional connections and trust with humans. In the future, when we choose AI services, we may be like choosing friends, not only looking at how smart it is, but also valuing whether it ‘understands me.’

This great AI EQ battle has just begun. What surprises will the next leaderboard bring? Let’s wait and see.

Share on:

Featured Partners

DMflow.chat

Discover DMflow.chat and unlock the new era of AI-powered customer service.

DMflow.chat

DMflow.chat: Your intelligent AI partner for exceptional customer engagement.

videoweaver.app

videoweaver.app

Video Weaver: Professional video editing directly in your browser. No downloads required.

scribis.app

Scribis: Subtitle editing, audio transcription, and live transcription.

DMflow.chat

Discover DMflow.chat and unlock the new era of AI-powered customer service.

DMflow.chat

DMflow.chat: Your intelligent AI partner for exceptional customer engagement.

videoweaver.app

videoweaver.app

Video Weaver: Professional video editing directly in your browser. No downloads required.

scribis.app

Scribis: Subtitle editing, audio transcription, and live transcription.

Recommended for You

P …

tool

PerceptionBench Unveils AI Visual Blind Spots: GPT and Kimi Image Recognition Accuracy Under 60%

When the Strongest AI Still “Misreads” Images: The Visual Reality Shock from PerceptionBench We often have an illusion that since today’s large language models can even write complex code, understanding an image should be a piece of cake. But the truth is quite the opposite. When you ask top models like GPT or Kimi to perform basic image recognition, they are often just “guessing blindly.” To break this illusion that “AI vision is already flawless,” the Kimi team (Moonshot AI) recently released a visual perception evaluation tool called PerceptionBench. This tool directly exposes the collective dilemma current multimodal models face when understanding the physical world.

Jul 17, 2026 Read →

G …

tool

Goodbye Subjective Guessing! Deep Dive into Qwen-Image-Bench and AI Image Judge Q-Judger

Goodbye Subjective Guessing! How to Evaluate AI Image Quality? Analyzing Qwen-Image-Bench and Q-Judger As text-to-image technology becomes more widespread, an inevitable challenge has surfaced: who decides if an AI image is “good”? In the past, judging these generated images often relied solely on subjective human feeling. Some find it beautiful, others find it strange, and there has always been a lack of an objective and specific quantitative standard. To address this pain point, the Qwen team launched the Qwen-Image-Bench evaluation benchmark, simultaneously open-sourced on GitHub, featuring a dedicated AI judge named Q-Judger.

May 29, 2026 Read →

A …

tool

AI Model Drawing Capabilities Showdown: SVG Generation Benchmark of 9 Top LLMs

When Large Language Models start challenging “visual code”, who is the real winner? This article delves into the SVG generation benchmark of 9 top AI models including Claude Sonnet 4.5, GPT-5.1, Gemini 3.0, exploring their performance under 30 creative prompts, and analyzing what this means for developers and designers. The Intersection of Code and Art Have you ever wondered what happens if you ask artificial intelligence, which is good at writing Python or JavaScript, to “draw”? We are not talking about generating pixel images like Midjourney, but writing SVG (Scalable Vector Graphics) code. This is like asking a mathematician to draw a cat by writing formulas. It sounds crazy, but this is exactly one of the most interesting battlefields in the current AI field.

Dec 2, 2025 Read →

© 2026 Communeify. All rights reserved.