Stacked fully convolutional networks with multi-channel learning: application to medical image segmentation

Bi, Lei; Kim, Jinman; Kumar, Ashnil; Fulham, Michael; Feng, Dagan

Permalink

Access status:

Open Access

Type

Article

Author/s

Bi, Lei
Kim, Jinman
Kumar, Ashnil
Fulham, Michael
Feng, Dagan

Abstract

The automated segmentation of regions of interest (ROIs) in medical imaging is the fundamental requirement for the derivation of high-level semantics for image analysis in clinical decision support systems. Traditional segmentation approaches such as region-based depend heavily ...
See moreThe automated segmentation of regions of interest (ROIs) in medical imaging is the fundamental requirement for the derivation of high-level semantics for image analysis in clinical decision support systems. Traditional segmentation approaches such as region-based depend heavily upon hand-crafted features and a priori knowledge of the user. As such, these methods are difficult to adopt within a clinical environment. Recently, methods based on fully convolutional networks (FCN) have achieved great success in the segmentation of general images. FCNs leverage a large labeled dataset to hierarchically learn the features that best correspond to the shallow appearance as well as the deep semantics of the images. However, when applied to medical images, FCNs usually produce coarse ROI detection and poor boundary definitions primarily due to the limited number of labeled training data and limited constraints of label agreement among neighboring similar pixels. In this paper, we propose a new stacked FCN architecture with multi-channel learning (SFCN-ML). We embed the FCN in a stacked architecture to learn the foreground ROI features and background non-ROI features separately and then integrate these different channels to produce the final segmentation result. In contrast to traditional FCN methods, our SFCN-ML architecture enables the visual attributes and semantics derived from both the fore- and background channels to be iteratively learned and inferred. We conducted extensive experiments on three public datasets with a variety of visual challenges. Our results show that our SFCN-ML is more effective and robust than a routine FCN and its variants, and other state-of-the-art methods.
See less

Date

2017-05-04

Publisher

Springer

Funding information

ARC DP140100211

Licence

Other

Rights statement

This is a post-peer-review, pre-copyedit version of an article published in The Visual Computer. The final authenticated version is available online at: https://doi.org/10.1007/s00371-017-1379-4.

Faculty/School

Faculty of Engineering

Citation

Bi, L., Kim, J., Kumar, A. et al. Vis Comput (2017) 33: 1061. https://doi.org/10.1007/s00371-017-1379-4

Subjects

Fully convolutional networks (FCNs), Segmentation, Regions of interest (ROI)