Ranking of Personal User Experiences with Large Language Models

2024 Update (Forgot the Month)

Ranking

GPT-4o: by OpenAI/Microsoft, Creator of ChatGPT
Claude: by Anthropic, excels in programming, especially within Cursor.
Qwen: by Alibaba
Llama: by Meta
Gemini: by Google
Other

Other large models both domestically and internationally have generally average user experiences, including Musk’s Grok, Douyin’s Duaobao, Kimi, and Wenxin Yiyan, which have ordinary computational and comprehension capabilities.

Summary

The paid version of ChatGPT is the best, but it’s costly, and you can’t just pay to use it. You also need to be cautious about IP stability to avoid using dirty lines or frequently changing IPs, which might lead to restrictions.
The paid version of Claude is also good, and registration requires a foreign phone number. The free version is also acceptable.
Tongyi Qianwen is the best in China.

2025 May Update

Time has come to May 2025.

DeepSeek is now well-known with excellent performance. In Chinese environments, if you’re unsure which one to choose, DeepSeek would be a solid option.
Google Gemini performs very well and has made significant progress. You can try it out more, and the official website is https://gemini.google.com/
ChatGPT has been somewhat average. It used to be the leader, but now it’s just still in the top.

2025 November Update

Overseas

Claude’s code capabilities are probably the leading.
Google Gemini also performs very well.
Grok excels in text creation.
ChatGPT is not impressive, but still stands out.

Domestic

Tencent Yuanbao seems to have made little progress and remains at DeepSeek’s level.
Douyin Duaobao has made rapid progress with frequent updates, but the client is likely based on a frontend ecosystem, a browser shell, which causes noticeable lag in single conversations due to excessive data.
Alibaba Qianwen feels very good when used personally, but seems many people don’t know about it? The Qwen series open-source large models have excellent reputation and applications worldwide, worth expecting.

2025 December Update

Overseas

Claude, Gemini, and ChatGPT are in the first tier. Gemini 3 Pro and ChatGPT 5 alternate in programming development fields.
Grok’s strength seems to be text creation?

Domestic

Tencent Yuanbao, which is integrated with DeepSeek, maintains DeepSeek’s level.
Douyin Duaobao was previously highly praised. It handles daily life issues reasonably, but performs poorly on professional technical questions.
Alibaba Qianwen feels very good when used personally, but seems many people don’t know about it? The Qwen series open-source large models have excellent reputation and applications worldwide, worth expecting.

2024 Update (Forgot the Month)

Ranking

Summary

2025 May Update

2025 November Update

Overseas

Domestic

2025 December Update

Overseas

Domestic

相关文章

关注公众号