This will be judged by whether an LLM made by a Chinese organization ties or surpasses the leading LLM from OpenAI, Anthropic, or Google on the leaderboard here.
Additional Information
China is investing heavily in the development of large language models (LLMs) and is home to many AI-oriented companies and LLM applications. Chinese tech giants like Baidu, Alibaba, Tencent, and SenseTime have already released their own generative AI products, and the Chinese government aims to be an AI leader by the 2030s. There is no direct information indicating whether China will match or surpass OpenAI, Anthropic, and Google by the end of 2024, but given the substantial investments and efforts, it's plausible that China will remain competitive in the LLM race.
Meanwhile, OpenAI, Anthropic, and Google have also been progressing rapidly. OpenAI led the initial LLM boom with its GPT-3 model, Anthropic is focusing on making LLMs more transparent, safe, and beneficial, and Google's Pathways language model (PaLM) has more parameters than GPT-3. This suggests that LLM development remains highly competitive.
@Sss19971997 @Ledger wrote "So as soon as Alibaba shows up in the organization column before the first appearance of OpenAI, Google, or Anthropic, then this will resolve yes."
"as soon as" implies that it resolves YES if a Chinese model surpasses one of the others at any point from now until EOY 24.
The criterion is that it surpasses the leading model by OpenAI, Google, or Anthropic.
This is what I see on the leaderboard...
@jacksonpolack the criterion for this is the leading LLM on the leaderboard. The leaderboard even lists the organization if you scroll to the right. So as soon as Alibaba shows up in the organization column before the first appearance of OpenAI, Google, or Anthropic, then this will resolve yes.
@Ledger could you clarify the meaning of "or" here? To resolve YES, must a Chinese LLM surpass the leading LLM from every company out of Google, OpenAI, and Anthropic, or is it enough to merely surpass the leading LLM from one of them?
@Ledger well, if you consider models not on the board, then the best Chinese model is GLM4, which should be better than Claude 1. That would also resolve the market to yes in that sense.
@Sss19971997 It's not obvious when you're writing it, but your comment is also ambiguous.
"Must be higher than any of them" can mean "must be in first place" or can mean "must be higher than any one of them".
The comment I was replying to above makes it sound like it has to be in first place, but isn't definitive. And the market description is just ambiguous.
Unfortunately one must use more precise language for these sorts of things. When you have a particular interpretation in mind, ambiguity like this can be invisible, but it's very much there.
@chrisjbillington yea. The ambiguity is what stopped me from betting all the way to 100%. But I think "or" is quite clear in logic, i.e., any one of the three statements being true will lead to YES.
@Sss19971997 not really, in some contexts they're synonyms.
If I claim that "I can beat any grandmaster at chess", the claim is that I can beat an arbitrary grandmaster, not merely one of my choosing.
But if I say "if I lose to any grandmaster, I'll hang my head in shame", that means I could win against all but one, and still have to hang my head in shame.
It's ambiguous.
Actually I think that second one is still a bit unclear tbh. If you want to communicate either of these things you should word it differently to avoid the ambiguity.
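To make the two readings of "any" concrete, here is a minimal sketch in Python. The ratings and the function names are invented purely for illustration and are not taken from the leaderboard; the point is only that the same numbers give different answers depending on which reading is used.

```python
# Hypothetical ratings, invented for illustration only; not real leaderboard data.
# Each entry is the rating of that lab's best-performing model.
best_rating = {
    "OpenAI": 1250,
    "Anthropic": 1150,
    "Google": 1120,
}
chinese_best = 1160  # hypothetical rating of the best Chinese-made model


def beats_at_least_one(challenger, leaders):
    """Reading 1: ties or surpasses the best model of at least one lab."""
    return any(challenger >= rating for rating in leaders.values())


def beats_every_one(challenger, leaders):
    """Reading 2: ties or surpasses the best model of every lab (first place)."""
    return all(challenger >= rating for rating in leaders.values())


print(beats_at_least_one(chinese_best, best_rating))  # True
print(beats_every_one(chinese_best, best_rating))     # False
```

With these made-up numbers the market would resolve YES under the first reading and NO under the second, which is exactly why the wording matters.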
Since Gemini Ultra is not on the board yet, if some Chinese model surpasses Gemini Pro right now, then it resolves to yes, right?
@HanchiSun I am very confident that GLM 4 is on par with Claude and thus better than Gemini Pro. It now depends on whether LMSYS puts it on the leaderboard before Gemini Ultra.
@Ledger does the "or" mean they have to have a better LLM than one of Anthropic, Google, or OpenAI, or all of?
@RobertCousineau, it would have to be better than the best-performing LLM from any one of the above-mentioned, but not all.