Layer Folding: Neural Network Depth Reduction using Activation Linearization


Amir Ben Dror (Samsung),* Niv Zehngut (Samsung), Avraham Raviv (SIRC), Evgeny Artyomov (Samsung), Ran Vitek (Samsung Israel R&D Center)
The 33rd British Machine Vision Conference

Abstract

Despite the increasing prevalence of deep neural networks, their applicability in resource-constrained devices is limited due to their computational load. While modern devices exhibit a high level of parallelism, real-time latency is still highly dependent on networks' depth. Although recent works show that below a certain depth, the width of shallower networks must grow exponentially, we presume that neural networks typically exceed this minimal depth to accelerate convergence and incrementally increase accuracy. This motivates us to transform pre-trained deep networks that already exploit such advantages into shallower forms. We propose a method that learns whether non-linear activations can be removed, allowing to fold consecutive linear layers into one. We use our method to provide more efficient alternatives to MobileNet and EfficientNet architectures on the ImageNet classification task. We release our code and trained models at https://github.com/LayerFolding/Layer-Folding.

Video



Citation

@inproceedings{Dror_2022_BMVC,
author    = {Amir Ben Dror and Niv Zehngut and Avraham Raviv and Evgeny Artyomov and Ran Vitek},
title     = {Layer Folding: Neural Network Depth Reduction using Activation Linearization},
booktitle = {33rd British Machine Vision Conference 2022, {BMVC} 2022, London, UK, November 21-24, 2022},
publisher = {{BMVA} Press},
year      = {2022},
url       = {https://bmvc2022.mpi-inf.mpg.de/0612.pdf}
}


Copyright © 2022 The British Machine Vision Association and Society for Pattern Recognition
The British Machine Vision Conference is organised by The British Machine Vision Association and Society for Pattern Recognition. The Association is a Company limited by guarantee, No.2543446, and a non-profit-making body, registered in England and Wales as Charity No.1002307 (Registered Office: Dept. of Computer Science, Durham University, South Road, Durham, DH1 3LE, UK).

Imprint | Data Protection