Packing Convolutional Neural Networks in the Frequency Domain

Wang, Yunhe; Xu, Chang; Xu, Chao; Tao, Dacheng

Access status:

Open Access

Field	Value	Language
dc.contributor.author	Wang, Yunhe
dc.contributor.author	Xu, Chang
dc.contributor.author	Xu, Chao
dc.contributor.author	Tao, Dacheng
dc.date.accessioned	2021-12-21T01:37:27Z
dc.date.available	2021-12-21T01:37:27Z
dc.date.issued	2019	en_AU
dc.identifier.uri	https://hdl.handle.net/2123/27249
dc.description.abstract	Deep convolutional neural networks (CNNs) are successfully used in a number of applications. However, their storage and computational requirements have largely prevented their widespread use on mobile devices. Here we present a series of approaches for compressing and speeding up CNNs in the frequency domain, which focuses not only on smaller weights but on all the weights and their underlying connections. By treating convolution filters as images, we decompose their representations in the frequency domain as common parts (i.e., cluster centers) shared by other similar filters and their individual private parts (i.e., individual residuals). A large number of low-energy frequency coefficients in both parts can be discarded to produce high compression without significantly compression romising accuracy. Furthermore, we explore a data-driven method for removing redundancies in both spatial and frequency domains, which allows us to discard more useless weights by keeping similar accuracies. After obtaining the optimal sparse CNN in the frequency domain, we relax the computational burden of convolution operations in CNNs by linearly combining the convolution responses of discrete cosine transform (DCT) bases. The compression and speed-up ratios of the proposed algorithm are thoroughly analyzed and evaluated on benchmark image datasets to demonstrate its superiority over state-of-the-art methods.	en_AU
dc.publisher	IEEE	en_AU
dc.relation.ispartof	IEEE Transactions on Pattern Analysis and Machine Intelligence	en_AU
dc.title	Packing Convolutional Neural Networks in the Frequency Domain	en_AU
dc.type	Article	en_AU
dc.subject.asrc	0801 Artificial Intelligence and Image Processing	en_AU
dc.identifier.doi	10.1109/TPAMI.2018.2857824
dc.type.pubtype	Author accepted manuscript	en_AU
dc.relation.arc	DE180101438
dc.relation.arc	FL-170100117
dc.relation.arc	DP-180103424
dc.relation.arc	LP- 150100671
dc.rights.other	© 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.	en_AU
usyd.faculty	SeS faculties schools::Faculty of Engineering::School of Computer Science	en_AU
workflow.metadata.only	No	en_AU