Learning Dense Contextual Features For Semantic Segmentation

Keleş, Hacer YalimLim, Long Ang2021-12-012021-12-012020-06-30http://hdl.handle.net/20.500.12575/76545Semantic segmentation, which is one of the key problems in computer vision, has been applied in various application domains such as autonomous driving, robot navigation, or medical imagery, to name a few. Recently, deep learning, especially deep neural networks, have shown significant performance improvement over conventional semantic segmentation methods. In this paper, we present a novel encoder-decoder type deep neural network-based method, namely XSeNet, that can be trained end-to-end in a supervised manner. We adapt ResNet-50 layers as the encoder and design a cascaded decoder that composes of the stack of the X-Modules, which enables the network to learning dense contextual information and having wider field-of-view. We evaluate our method using CamVid dataset, and experimental results reveal that our method can segment most part of the scene accurately and even outperforms previous state-of-the art methods.enSemantic segmentationDeep learningConvolutional neural networksLearning Dense Contextual Features For Semantic SegmentationArticle62126342618-6462