Huawei Ascend 910C offers 60% of Nvidia H100 performance: Report

Figure 1, view larger image

A new report cites that Huawei Ascend 910C chip offers 60% of Nvidia H100’s performance in DeepSeek AI models. The details arrive as the China-made AI LLMs are gaining popularity in the tech world for their cost-effective and efficient features.


DeepSeek R1 is the latest Chinese AI reasoning model. It performs excellently in tasks like maths, coding, and scientific problem-solving operations. The ultimate technology is already giving a tough spot to U.S. rivals like ChatGPT and Gemini.

Figure 2, view larger image


While DeepSeek R1 has been trained on the Nvidia H100 processor, it uses Huawei Ascend 910C for inference. Now a new report adds more insight to this matter.


The DeepSeek team reportedly verified that Huawei Ascend 910C performance reaches 60% of the Nvidia H100 chips. The Chinese chipset performance is “unexpectedly good” in inference tasks. Besides, the handwritten CUNN kernel and other optimizations can further improve the Ascend chip’s performance.

Figure 3, view larger image


If accurate, it is a great achievement for Huawei and the Chinese chip industry. Regardless of strict restrictions, the OEM managed to develop an improved AI chipset.


Other points in the report highlight that DeepSeek models supported Huawei AI chips since day one. The model converted CUDA (Compute Unified Device Architecture) to CUNN with one line of code.


It obtained higher performance by just simple optimizations, which is another big achievement in the Chinese tech field.


Source - HC 


Signing off

@Rahul S  


@iQOO Connect  @Parakram Hazarika  @NITIN  @Aojesh  @JStreetS  


Figure 4, view larger image


Tech