BlockBeats News, November 4th: nof1, an AI research lab focused on financial markets, launched a large AI model trading test called Alpha Arena on October 18th, and its first season has now ended. The test covered six leading large AI models (GPT-5, Gemini 2.5 Pro, Grok-4, Claude Sonnet 4.5, DeepSeek V3.1, and Qwen3 Max); each model received $10,000 in real funds on Hyperliquid and was given the same prompts and input data.
Qwen3 Max and DeepSeek V3.1 ranked first and second with returns of 22.31% and 4.89%, respectively, while the other four models failed to outperform simply holding spot BTC over the same period.




