Industry Expert Blogs

Benefit of pruning and clustering a neural network for before deploying on Arm Ethos-U NPU

arm Blogs - George Gekov, Arm
Jul. 24, 2023

Pruning and clustering are optimization techniques:

Pruning: setting weights to zero
Clustering: grouping weights together into clusters

These techniques modify the weights of a Machine Learning model. In some cases, they enable:

Significant speed-up of the inference execution
Reduction of the memory footprint
Reduction in the overall power consumption of the system

We assume that you can optimize your workload without loss in accuracy and that you target an Arm® Ethos NPU. You can therefore prune and cluster your neural network before using the Vela compiler and deploying it on the Ethos-U hardware. See below for more information on optimizing your workload.

Click here to read more ...

Search Silicon IP

12,000 IP Cores from 400 Vendors

Related Blogs

No portion of this site may be copied, retransmitted, reposted, duplicated or otherwise used without the express written permission of Design And Reuse.

Industry Expert Blogs

Benefit of pruning and clustering a neural network for before deploying on Arm Ethos-U NPU

Search Silicon IP

Related Blogs

Partner with us

List your Products

Design-Reuse.com