Huawei Releases AI Inference Innovation UCM: High-Throughput, Low-Latency Inference Experience at Lower Per-Token Inference Cost
Xinhuanet, August 12, afternoon — At the 2025 Financial AI Inference Application Landing and Development Forum, Huawei and China UnionPay jointly released UCM (Inference Memory Data Manager), an AI inference innovation aimed at delivering a high-throughput, low-latency inference experience.
AI is developing rapidly in today's digital era. While the wave of large-model training has not subsided, the inference experience has quietly become a key factor in the success of AI applications. According to a white paper released by Zhongxin Jiantou at the 2025 WAIC, the industry's center of gravity is shifting from training to inference at an accelerating pace, which makes the quality of the AI inference experience increasingly important.
The inference experience directly shapes how users perceive an AI system, covering response time, answer accuracy, and the ability to reason over complex contexts. Industry data show that mainstream overseas models already reach single-user output speeds above 200 tokens per second (5 ms latency), while domestic models generally stay below 60 tokens per second (50-100 ms latency). Reconciling inference efficiency with user experience has therefore become a pressing problem.
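For reference, output speed and per-token latency are reciprocal quantities. Under the assumption that the quoted 5 ms refers to the interval between successive output tokens for a single user (the article does not define the metric precisely), it matches the 200 tokens-per-second figure:

```python
# Rough sanity check, assuming the quoted latency is the per-token output interval
# (an assumption; the article does not state which latency metric is meant).
per_token_latency_s = 0.005                  # 5 ms per output token
tokens_per_second = 1 / per_token_latency_s  # = 200 tokens/s
print(tokens_per_second)
```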
According to the introduction at the forum, the newly released UCM (Inference Memory Data Manager) is a caching acceleration suite centered on the KV Cache. It combines multiple caching acceleration algorithms and tools to manage the memory data generated during inference across different storage tiers, expanding the inference context window so as to deliver a high-throughput, low-latency inference experience while reducing the per-token inference cost.
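The announcement does not disclose UCM's internals. As a rough illustration of the general idea of tiered KV-cache management described above, the following minimal Python sketch (all names hypothetical, not Huawei's implementation) keeps the hottest KV-cache blocks in a fast memory tier and spills colder ones to a slower tier instead of discarding them, so a long context does not have to be recomputed:

```python
from collections import OrderedDict

class TieredKVCache:
    """Illustrative sketch only: hot KV blocks stay in a fast tier (e.g. HBM),
    cold blocks are spilled to a slower tier (e.g. DRAM or SSD) and reloaded
    on demand, which lets the usable context exceed fast-memory capacity."""

    def __init__(self, fast_capacity_blocks: int):
        self.fast_capacity = fast_capacity_blocks
        self.fast_tier = OrderedDict()   # block_id -> KV block, in LRU order
        self.slow_tier = {}              # block_id -> KV block

    def put(self, block_id, kv_block):
        """Store a newly generated KV block; evict the least-recently-used
        block to the slow tier if the fast tier is full."""
        self.fast_tier[block_id] = kv_block
        self.fast_tier.move_to_end(block_id)
        while len(self.fast_tier) > self.fast_capacity:
            victim_id, victim = self.fast_tier.popitem(last=False)
            self.slow_tier[victim_id] = victim   # spill instead of discarding

    def get(self, block_id):
        """Fetch a KV block for attention; promote it back to the fast tier
        if it had been spilled."""
        if block_id in self.fast_tier:
            self.fast_tier.move_to_end(block_id)
            return self.fast_tier[block_id]
        kv_block = self.slow_tier.pop(block_id)  # KeyError if truly missing
        self.put(block_id, kv_block)
        return kv_block

if __name__ == "__main__":
    cache = TieredKVCache(fast_capacity_blocks=2)
    for i in range(4):                  # 4 KV blocks, only 2 fit in the fast tier
        cache.put(i, f"kv-block-{i}")
    print(cache.get(0))                 # block 0 was spilled and is reloaded on demand
```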
Responsible Editor: Guo Xue Tong