Monday, 9 June 2025
30.7 C
Singapore
34.9 C
Thailand
26.2 C
Indonesia
29.4 C
Philippines

Alibaba reveals Qwen3, a powerful new series of AI models

Alibaba launches Qwen3, a powerful open AI model family with hybrid reasoning and strong performance that rivals Google and Openai.

Chinese tech giant Alibaba has introduced Qwen3, a new family of artificial intelligence models that could rival top names like Google and OpenAI. Launched on April 29, the Qwen3 series offers a wide range of models and is available under an open licence on platforms such as Hugging Face and GitHub.

With models ranging from 0.6 billion to 235 billion parameters, Qwen3 covers a broad scope of problem-solving abilities. In AI, parameters represent how much a model can learn from data, and generally, more parameters mean better performance. According to Alibaba, some of the Qwen3 models even surpass Google’s Gemini 2.5 Pro and OpenAI’s o3-mini on key benchmarks.

A hybrid approach to problem-solving

What makes Qwen3 stand out is its “hybrid” design. This means it can switch between thinking deeply about a task or providing a quick answer, depending on the situation. For example, asking a complex question takes time to “reason” through the answer. For simpler queries, it responds quickly. This feature allows users to control how much computing power—or “thinking budget”—they want the model to use.

“We have seamlessly integrated thinking and non-thinking modes, offering users the flexibility to control the thinking budget,” the Qwen team explained in a blog post. “This design enables users to configure task-specific budgets with greater ease.”

Some Qwen3 models also use a “mixture of experts” (MoE). This method divides tasks into smaller parts and sends them to specialised models, or “experts”, which work together to generate an answer. This approach helps the models work faster and more efficiently.

Strong performance across many tests

The Qwen3 models support 119 languages and were trained using a dataset of nearly 36 trillion tokens. A token is a small piece of data the model uses to learn—around 1 million tokens equal 750,000 words. The training data included everything from textbooks and code snippets to AI-generated content and question-answer pairs.

According to Alibaba, this extensive training has made Qwen3 much more advanced than its predecessor, Qwen2. While none of the models are dramatically better than top offerings from Google or OpenAI, they are considered highly competitive.

One standout model is Qwen-3-235B-A22B, the largest in the Qwen3 family. It slightly outperforms OpenAI’s o3-mini and Google’s Gemini 2.5 Pro on Codeforces, a popular coding competition site. It also scores higher on AIME, a tough maths test, and BFCL, a benchmark for reasoning ability. However, this top-tier model isn’t available to the public for now.

The biggest publicly available model, Qwen3-32B, is still strong. It competes well with other AI tools, including models from DeepSeek, a Chinese AI lab. On several coding tests like LiveCodeBench, Qwen3-32B even beats OpenAI’s o1 model.

A growing role in the open-source AI landscape

Alibaba says Qwen3 does more than solve problems. It’s also good at calling tools, following instructions, and copying data formats. Besides downloading the models, you can access Qwen3 through cloud services like Fireworks AI and Hyperbolic.

Tuhin Srivastava, CEO and co-founder of cloud hosting company Baseten sees Qwen3 as part of a larger trend. “The U.S. is doubling down on restricting sales of chips to China and purchases from China,” he said, “but models like Qwen3 that are state-of-the-art and open … will undoubtedly be used domestically.”

He added that businesses are now combining custom-built tools with ready-made models from companies like Anthropic and OpenAI. With Qwen3, Alibaba is showing that Chinese firms are catching up in AI and setting new standards.

Hot this week

Gamevil: From RPG trailblazer to blockchain pivot in mobile gaming’s shifting landscape

Gamevil’s evolution into Com2uS Holdings shows how mobile gaming giants adapt through acquisitions, platform shifts, and blockchain innovation.

Qualcomm patches major chip flaws as hackers exploit zero-days

Qualcomm fixes three serious zero-day flaws used in hacking campaigns and urges users to install updates from phone makers as soon as possible.

Gundam Base Pop-up World Tour to arrive in Singapore this August

Gundam Base Pop-up World Tour lands in Singapore this August with exclusive kits and merchandise at Jewel Changi.

BlueVoyant adds SBOM capabilities to strengthen third-party cyber risk management

BlueVoyant has added SBOM capabilities to its cyber risk platform, enhancing third-party software monitoring and regulatory compliance.

Updated BMW iX lands in Singapore with fresh look and tech upgrades

The updated BMW iX arrives in Singapore with fresh design touches, new tech, and free charging perks for EV lovers.

Gamevil: From RPG trailblazer to blockchain pivot in mobile gaming’s shifting landscape

Gamevil’s evolution into Com2uS Holdings shows how mobile gaming giants adapt through acquisitions, platform shifts, and blockchain innovation.

Beijing academy introduces ‘RoboBrain’ AI model to power humanoid robots in China

Beijing launches RoboBrain 2.0, a powerful open-source AI to boost China’s growing humanoid robotics industry.

Rokid to launch new AR glasses globally on AliExpress during the 618 summer sale

Chinese AR brand Rokid will launch its new smart glasses globally on AliExpress on June 16, with a US$100 discount during the 618 sale.

Xbox console games now appear in the Xbox PC app — here’s what it means for you

Xbox console games are now showing in the Xbox PC app, hinting at Microsoft’s push to combine PC and console gaming in one place.

Related Articles

Popular Categories

OSZAR »