What is Kimi AI? Chinese chatbot beats ChatGPT, Claude on major benchmarks – Crypto News – Crypto News
Connect with us
What is Kimi AI? Chinese chatbot beats ChatGPT, Claude on major benchmarks What is Kimi AI? Chinese chatbot beats ChatGPT, Claude on major benchmarks

Metaverse

What is Kimi AI? Chinese chatbot beats ChatGPT, Claude on major benchmarks – Crypto News

Published

on

​In the beginning of this year, DeepSeek’s viral streak put an end to all notions of America’s unhindered supremacy in the AI race, as the Chinese chatbot showed that it was possible to build similarly powerful models to those used by ChatGPT and Gemini at a fraction of the cost. Since then, there have been a plethora of AI companies that have come up from China, including Alibaba’s Qwen, Baidu’s Ernie, and Moonshot AI’s Kimi.

​The new Chinese models have kept on closing the gap with the Western ones like ChatGPT. However, a new Kimi model has now even dethroned ChatGPT and Anthropic’s Claude on many important benchmarks.

​Kimi K2 Thinking beats ChatGPT and Claude:

​Earlier in the week, Moonshot released its Kimi K2 Thinking model, which goes head-to-head against the likes of GPT-5 and Claude Sonnet 4.5 Thinking. As per the data shared by the company, K2 Thinking defeated both ChatGPT and Claude in the coveted Humanity’s Last Exam benchmark.

​In HLE’s (text only) assessment, Kimi K2 Thinking scored 44.9%, compared to the 41.7% score of GPT-5 (High) and 32% score of Claude Sonnet 4.5 Thinking.

​Similar results were also seen in the BrowseComp benchmark, which analyzes the performance of AI chatbots in agentic web-browsing and search-related tasks, where Kimi K2 Thinking garnered a score of 60.2% compared to GPT-5’s 54.9% and Claude Sonnet 4.5 Thinking’s 24.1%.

​Kimi K2 Thinking, however, still seems to be lagging behind ChatGPT in coding-related tasks, with the new model getting a score of 83.1% in LiveCodeBench V6—a benchmark that measures AI’s performance in competitive programming. In the same benchmark, GPT-5 got a score of 87% and Claude Sonnet 4.5 Thinking got a score of 64%.

​The same performance for K2 Thinking, however, was not replicated in LMArena’s ranking, where Gemini 2.5 Pro still takes the lead while GPT-5 is on the 4th spot.

​How to use Kimi K2 Thinking?

​Kimi K2 Thinking is currently available to use via the Kimi website or app, which is available on both Android and Apple App Stores. A major advantage for users willing to move to Kimi is that the chatbot is available for free with no hard limit in place for casual users. In contrast, OpenAI does not let free or ChatGPT Go users access its GPT-5 (High) model, while free users have a very strict limit of just 10 messages every 5 hours. The San Francisco-based AI startup is also providing its 399/year ChatGPT Go subscription for free in India, but that also comes with a message limit.

Trending