We present a new method for structured pruning of neural networks, based on the recently proposed neuron merging trick in which following a pruning operation, the weights of the next layer are suitably modified. By a rigorous mathematical analysis of the neuron merging technique we prove an upper bound on the reconstruction error. This bound defines a new objective function for pruning-and-merging.
Articles
Related Articles
August 1, 2020
Industrial-strength OLTP using main memory and many cores
GaussDB, and its open source version named openGauss, are Huawei's relational database management systems (RDBMS), featuring...
Read More >
1 MIN READING
January 25, 2024
Rotation Invariant Quantization for Model Compression
Post-training Neural Network (NN) model compression is an attractive approach for deploying large, memory-consuming models on...
Read More >
1 MIN READING
May 22, 2019
Time-multiplexed parsing in marking-based network telemetry
Network telemetry is a key capability for managing the health and efficiency of a large-scale network....
Read More >
1 MIN READING