Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Cross-Attentional Bracket-shaped Convolutional Network for semantic image segmentation

Authors
Hua, Cam-HaoThien Huynh-TheBae, Sung-HoLee, Sungyoung
Issue Date
Oct-2020
Publisher
ELSEVIER SCIENCE INC
Keywords
Semantic image segmentation; Bracket-shaped Convolutional Neural; Network; Cross-Attentional mechanism
Citation
INFORMATION SCIENCES, v.539, pp 277 - 294
Pages
18
Journal Title
INFORMATION SCIENCES
Volume
539
Start Page
277
End Page
294
URI
https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/28328
DOI
10.1016/j.ins.2020.06.023
ISSN
0020-0255
1872-6291
Abstract
As perception-related applications are of great importance in industrial production and daily life nowadays, solutions for understanding given images semantically receive numerous attention from the literature. To this end, significant accomplishments have been reached for such pixel-wise segmentation problem thanks to novel manipulations of integrating global context into local details in convolutional neural networks. However, this strategy in the existing work did not exhaustively exploit middle-level features, which carry reasonable balance between fine-grained and semantic information. Therefore, this paper introduces a Cross-Attentional Bracket-shaped Convolutional Network (CAB-Net) to leverage their contribution to the tournament of constructing pixel-wise labeled map. In concrete, fine-to-coarse feature maps of interest from the backbone network are densely combined by an efficient fusion of channel-wisely and spatially attentional schemes in crossing manner, namely Cross-Attentional Fusion, to embed semantically rich features into finer patterns. Continuously, these newly decoded outputs repeat the same procedure round-by-round until shaping a final feature map having finest resolution for complete scene understanding. Consequently, the proposed CAB-Net achieves competitive mean Intersection of Union performance on PASCAL VOC 2012 (83.6% without MS-COCO pretraining), CamVid (76.4%) and Cityscapes (78.3%) datasets. (C) 2020 Elsevier Inc. All rights reserved.
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE