Alibaba rolls out new Qwen3 model featuring hybrid reasoning 

by

Azunta Gaviola

-

10 hours ago

Be part of the forefront of innovation and reshape the future of retail and e-commerce! Making its highly anticipated return, MARKETECH APAC and UpTech Media partners for the Retail & E-Commerce Innovation Marketing & Tech Summit Malaysia 2025, happening on 17 July 2025 at Sheraton Petaling Jaya and for the Retail & E-Commerce Innovation Marketing & Tech Summit: Philippines 2025 on 14 August 2025 at Shangri-La The Fort, Manila. Don’t miss out!

Singapore – Alibaba, a Chinese tech company, has recently introduced the latest addition to its open-source large language model, Qwen3.

Part of this new Qwen3 series includes six dense models and two Mixture-of-Experts (MoE) models. This offers developers the versatility to create advanced applications for mobile devices, smart glasses, autonomous vehicles, robotics, and more.

The complete suite encompasses dense models (0.6B, 1.7B, 4B, 8B, 14B, and 32B parameters) and MoE models (30B with 3B active and 235B with 22B active), which are now open-sourced and ready for worldwide access.

Said offering marks the debut of hybrid reasoning models that combine traditional LLM capabilities and advanced, dynamic reasoning. The Qwen3 models, in particular, can seamlessly switch between thinking mode for complex, multi-step tasks such as mathematics, coding, and logical deduction, and non-thinking mode for fast, general-purpose responses.

With this, developers using the Qwen3 API also gain precise control over processing time, enabling intelligent output and computational efficiency. The Qwen3-235B-A22B MoE model further reduces deployment costs, highlighting Alibaba’s dedication to providing accessible, high-performance AI.

As it is trained on a massive dataset of 36 trillion tokens, this new offering further brings significant advancement in reasoning, instruction following, tool use, and multilingual tasks.

Among the highlighted capabilities are multilingual mastery, supporting languages and dialects, and leading performance in translation and multilingual instruction-following.

Another feature is its advanced agent integration that natively supports the Model Context Protocol (MCP) and robust function calling, leading open-source models in complex agent-based tasks.

Next is superior reasoning, surpassing previous Qwen models in mathematics, coding, and logical reasoning benchmarks. Lastly, it also offers enhanced human alignment, which delivers more natural creative writing, role-playing, and multi-turn dialogue experiences for more natural, engaging conversations.

Following these advancements in model design, increased training data, and more effective training approaches, Qwen3 models offer services in industry benchmarks like AIME25 (mathematical reasoning), LiveCodeBench, BFCL, and Arena-Hard.

In addition, a four-stage process, which includes CoT cold start, reasoning-based RL, thinking mode fusion, and general RL, was used to build the hybrid reasoning model.

The Qwen model family has gained over 300 million downloads worldwide since its debut. With over 100,000 Qwen-based derivative models created on Hugging Face, Qwen has established itself as one of the leading open-source AI model series globally.

 

Be part of the forefront of innovation and reshape the future of retail and e-commerce! Making its highly anticipated return, MARKETECH APAC and UpTech Media partners for the Retail & E-Commerce Innovation Marketing & Tech Summit Philippines 2025, happening on 14 August 2025 at Shangri-La The Fort, Manila. Don’t miss out!

The NEXT Awards 2025 is here, and we’re seeking the most innovative marketing campaigns from Indonesia, the Philippines, Malaysia, Singapore and Asia Pacific. Submit your entry today and showcase your best work!

Share

RECENT ARTICLES

Mystore to launch smart commerce model for advanced hyperlocal retail
Vietnam Airlines advances digital transformation following expanded partnership with Adyen
Alibaba rolls out new Qwen3 model featuring hybrid reasoning 
Singapore Airlines taps OpenAI’s multimodal AI to enhance virtual assistant efficiency
Rimini Street appoints Joe Locandro as new EVP, CIO
Ellipse 3

RELATED ARTICLES

1_Alibaba introduces latest enhancements to its ‘Qwen 2
1_Alibaba Cloud’s latest AI models, infrastructure tools to empower global developers, harness efficient AI community
Alibaba Cloud introduces Qwen 2
Ellipse 3

FEATURED ARTICLES

UpTech NL Feature Image (1)_11zon
1_UpTech Media, MARKETECH APAC to feature critical industry conversations at recently expanded ‘Retail and E-Commerce Innovation Summit’
EW2025_(UT)Launch Article_Feature Image_11zon

Subscribe to UpTech Media Newsletter

Video Title Here: The Indonesian on-ground activation status

Sed ut perspiciatis unde omnis iste natus error sit voluptatem accusantium doloremque laudantium, totam rem aperiam, eaque ipsa quae ab illo inventore veritatis et quasi architecto beatae vitae dicta sunt explicabo. Nemo enim ipsam voluptatem quia voluptas sit aspernatur aut odit aut fugit, sed quia consequuntur magni dolores eos.

Video Title Here: The Indonesian on-ground activation status

Sed ut perspiciatis unde omnis iste natus error sit voluptatem accusantium doloremque laudantium, totam rem aperiam, eaque ipsa quae ab illo inventore veritatis et quasi architecto beatae vitae dicta sunt explicabo. Nemo enim ipsam voluptatem quia voluptas sit aspernatur aut odit aut fugit, sed quia consequuntur magni dolores eos.

Video Title Here: The Indonesian on-ground activation status

Sed ut perspiciatis unde omnis iste natus error sit voluptatem accusantium doloremque laudantium, totam rem aperiam, eaque ipsa quae ab illo inventore veritatis et quasi architecto beatae vitae dicta sunt explicabo. Nemo enim ipsam voluptatem quia voluptas sit aspernatur aut odit aut fugit, sed quia consequuntur magni dolores eos.