Your Position Home Stock Market

Alibaba will score again! New model detonates efficiency revolution, stock price soared by more than 7%

① What core technologies has the QwQ-32B model achieved the breakthrough of “small parameters, high performance”?
② Among the 380 billion yuan AI infrastructure investment, which segments may become the focus of resource tilt?

image

Financial Union, March 6 (Editor Hu Jiarong) Alibaba-W (09988.HK) has made a big difference. The company officially released a new generation of reasoning model QwQ-32B. The model has a lightweight architecture of 32 billion parameters and exceeds the industry benchmark of 671 billion parameters in multiple core indicators DeepSeek-R1, triggering a revaluation of the ecological value of Alibaba’s AI in the capital market.

image

Note: Alibaba’s Hong Kong stock performance

As of press time, Alibaba Hong Kong stocks rose 7.08% to close at HK$139.1, while U.S. stocks closed up 8.61% to close at US$141.03 overnight.

Technological innovation: Small models start an efficiency revolution"

The breakthrough of QwQ-32B lies in its pioneering introduction of reinforcement learning (RL) into the small and medium-scale model training system, building three core technical barriers:

Truth-verified RL Framework: Abandon the traditional reward model mechanism and build a dynamic feedback system through a mathematical answer validator and a code execution server. Taking code generation as an example, the system automatically runs test cases to verify the validity of the code, allowing the model to achieve closed-loop evolution of code-testing-optimization in real scenarios.

Two-stage capability transition: Checkpoints are launched based on pre-training models. The first stage focuses on special breakthroughs in mathematical derivation and code generation, and the second stage achieves general capability expansion through multi-task fine-tuning. Test data shows that the accuracy rate of the model in the GSM8K mathematical benchmark test reaches 83.7%, which is 19 percentage points higher than the traditional training method.

Dynamic reasoning agent system: The first environment-aware reasoning mechanism, which can independently call tool chains such as calculators and API interfaces. When solving complex mathematical problems, the model can dynamically decompose the problem, retrieve formula libraries, and cross-verify the results to form a human-like deduction logic.

Commercial implementation: Open source ecosystem resonates with vertical scenarios

Ali also announced that it will open source QwQ-32B to the world using the Apache 2.0 protocol to build a technology-scenario two-wheel drive model:

Enterprise deployment advantages: Support local privatization deployment to meet the data security needs of financial, medical and other industries. Actual measurement shows that the reasoning speed of the model on the domestic Shengteng 910B chip reaches 156 tokens/second, which is 2.3 times faster than that of the model with the same parameter scale.

380 billion investment anchors AI infrastructure

This achievement coincides with Ali’s announcement that it will invest 380 billion yuan in cloud and AI infrastructure in the next three years. Brokers generally believe that this confirms three major strategic paths:

Globalization of computing power networks: Relying on 87 availability zones in 29 regions around the world, we will build a flexible computing power supply network. IDC data shows that Alibaba Cloud’s AI-related revenue has maintained triple-digit growth for the sixth consecutive quarter, and the Q4 computing power leasing business in 2023 will increase by 217% year-on-year.

Scale of open source ecosystem: Tongyi Thousand Questions series derivative models have exceeded 1.2 million downloads on the Hugging Face platform, forming a complete tool chain covering 7B-72B parameters. The Open Source China Research Institute pointed out that QwQ’s RL framework provides a new paradigm for the academic community to train small and medium-sized models.

Diversified scenarios penetration: B-end: Dingle intelligent assistant has served 1.7 million enterprise users, and knowledge base retrieval efficiency has increased by 300%; C-end: Quark APP has transformed into AI life stewards, with the proportion of users born in 2000 exceeding 53%.

Market impact: AI investment logic changes

Guotai Junan calculations show that if the QwQ technology path is popularized, enterprise AI deployment costs can be reduced by 70%-80%. CITIC Construction Investment Wang Xue reminded that it is currently necessary to be vigilant about the risk of switching technical routes, and some targets that rely on stacking parameters are facing valuation restructuring.

Popular Articles