Matrix Neural Networks

Gao, Junbin; Guo, Yi; Wang, Zhiyong

Permalink

Access status:

Open Access

Type

Working Paper

Author/s

Gao, Junbin
Guo, Yi
Wang, Zhiyong

Abstract

Traditional neural networks assume vectorial inputs as the network is arranged as layers of single line of computing units called neurons. This special structure requires the non-vectorial inputs such as matrices to be converted into vectors. This process can be problematic. Firstly, ...
See moreTraditional neural networks assume vectorial inputs as the network is arranged as layers of single line of computing units called neurons. This special structure requires the non-vectorial inputs such as matrices to be converted into vectors. This process can be problematic. Firstly, the spatial information among elements of the data may be lost during vectorisation. Secondly, the solution space becomes very large which demands very special treatments to the network parameters and high computational cost. To address these issues, we propose matrix neural networks (MatNet), which takes matrices directly as inputs. Each neuron senses summarised information through bilinear mapping from lower layer units in exactly the same way as the classic feed forward neural networks. Under this structure, back prorogation and gradient descent combination can be utilised to obtain network parameters e ciently. Furthermore, it can be conveniently extended for multimodal inputs. We apply MatNet to MNIST handwritten digits classi cation and image super resolution tasks to show its e ectiveness. Without too much tweaking MatNet achieves comparable performance as the state-of-the-art methods in both tasks with considerably reduced complexity.
See less

Date

2016-11-02

Publisher

Business Analytics.

Department, Discipline or Centre

Discipline of Business Analytics

Subjects

Neural Networks
Back Propagation
Machine Learning
Pattern Recognition
Image Super Resolution