Show simple item record

FieldValueLanguage
dc.contributor.authorWang, Yunhe
dc.contributor.authorXu, Chang
dc.contributor.authorXu, Chao
dc.contributor.authorTao, Dacheng
dc.date.accessioned2021-12-21T01:37:27Z
dc.date.available2021-12-21T01:37:27Z
dc.date.issued2019en_AU
dc.identifier.urihttps://hdl.handle.net/2123/27249
dc.description.abstractDeep convolutional neural networks (CNNs) are successfully used in a number of applications. However, their storage and computational requirements have largely prevented their widespread use on mobile devices. Here we present a series of approaches for compressing and speeding up CNNs in the frequency domain, which focuses not only on smaller weights but on all the weights and their underlying connections. By treating convolution filters as images, we decompose their representations in the frequency domain as common parts (i.e., cluster centers) shared by other similar filters and their individual private parts (i.e., individual residuals). A large number of low-energy frequency coefficients in both parts can be discarded to produce high compression without significantly compression romising accuracy. Furthermore, we explore a data-driven method for removing redundancies in both spatial and frequency domains, which allows us to discard more useless weights by keeping similar accuracies. After obtaining the optimal sparse CNN in the frequency domain, we relax the computational burden of convolution operations in CNNs by linearly combining the convolution responses of discrete cosine transform (DCT) bases. The compression and speed-up ratios of the proposed algorithm are thoroughly analyzed and evaluated on benchmark image datasets to demonstrate its superiority over state-of-the-art methods.en_AU
dc.publisherIEEEen_AU
dc.relation.ispartofIEEE Transactions on Pattern Analysis and Machine Intelligenceen_AU
dc.titlePacking Convolutional Neural Networks in the Frequency Domainen_AU
dc.typeArticleen_AU
dc.subject.asrc0801 Artificial Intelligence and Image Processingen_AU
dc.identifier.doi10.1109/TPAMI.2018.2857824
dc.type.pubtypeAuthor accepted manuscripten_AU
dc.relation.arcDE180101438
dc.relation.arcFL-170100117
dc.relation.arcDP-180103424
dc.relation.arcLP- 150100671
dc.rights.other© 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_AU
usyd.facultySeS faculties schools::Faculty of Engineering::School of Computer Scienceen_AU
workflow.metadata.onlyNoen_AU


Show simple item record

Associated file/s

Associated collections

Show simple item record

There are no previous versions of the item available.