On HMMT Feb 25, a rigorous reasoning benchmark, Qwen3-Max-Thinking scored 98.0, edging out Gemini 3 Pro (97.5) and ...
While it's not yet clear how practically useful the capability will be for individuals and businesses, the model's "coding with vision" capability makes vibe coding even vibier.