Qwen3.7-Max Joins the Tracked Frontier
Qwen3.7-Max becomes the fifth tracked frontier model after clearing helloai's admission requirements. The first new-provider entrant brings strong agentic coding performance at roughly half the price of closed leaders.
Qwen3.7-Max from Alibaba has cleared helloai's complete admission bar to become the fifth tracked frontier model and the first from any provider outside Anthropic, Google, xAI, and OpenAI. The release dated May 19 to 21 2026 earned its place only after this week's leaderboard drift analysis confirmed sufficient sustained Arena data together with strong real-agent benchmark results. Its text Elo of approximately 1474 rests on more than 3,750 votes collected as of early June. That score falls within ten points of the previous lowest tracked model at 1484. Set-size rules permit the collection to grow from four to five models when a new-provider entrant meets every hard and soft requirement.
Agentic performance on established benchmarks provided the key differentiation that secured the model's admission to the tracked set. The model scores about 80.4 percent on SWE-bench Verified and 1541 on the WebDev and code arena slices. These figures indicate capability on software engineering and web development tasks. The model has also completed documented autonomous multi-agent runs lasting a full 35 hours.
Pricing and context make the capabilities practical for a broader range of teams than the closed models alone have served. Public API rates stand at approximately $2.50 per million input tokens and $7.50 per million output tokens, with lower prices available on some routers. The context window reaches one million tokens, satisfying the 200,000-token minimum with substantial headroom. This package supplies frontier-adjacent agentic performance at half the price of the closed leaders.
The tracked set now stands at five models rather than four. This expansion follows the explicit set-size rules that allow new providers to join the list once they clear the hard filters on Elo, public API and pricing, provider status, and context size plus the required soft differentiation. No model from any other outside provider has previously satisfied the full list of criteria at once. Developers therefore gain an additional option that has passed the same objective evaluation applied to the original four. The rules treat new providers the same way once the data thresholds are met.
Production teams focused on agentic systems can now consider a lower-cost alternative that still meets the identical admission standards used for the dominant labs. Additional providers will need to demonstrate comparable sustained Arena participation and benchmark strength before they can expand the set further. Over the coming release cycles the set of models that have cleared the full bar may grow beyond the current five. The result is a gradually widening set of verified choices that reduces dependence on any single group of labs while maintaining consistent quality gates.