Hierarchical Multi-Scale Attention for Semantic Segmentation

Tao, Andrew; Sapra, Karan; Catanzaro, Bryan

arXiv:2005.10821 (cs)

[Submitted on 21 May 2020]

Abstract: Multi-scale inference is commonly used to improve the results of semantic segmentation. Multiple image scales are passed through a network and then the results are combined with averaging or max pooling. In this work, we present an attention-based approach to combining multi-scale predictions. We show that predictions at certain scales are better at resolving particular failure modes, and that the network learns to favor those scales for such cases in order to generate better predictions. Our attention mechanism is hierarchical, which enables it to be roughly 4x more memory efficient to train than other recent approaches. In addition to enabling faster training, this allows us to train with larger crop sizes, which leads to greater model accuracy. We demonstrate the results of our method on two datasets: Cityscapes and Mapillary Vistas. For Cityscapes, which has a large number of weakly labelled images, we also leverage auto-labelling to improve generalization. Using our approach, we achieve new state-of-the-art results on both Mapillary (61.1 IOU val) and Cityscapes (85.1 IOU test).
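
To make the contrast in the abstract concrete, the following is a minimal sketch of the three ways of combining multi-scale predictions: averaging, max pooling, and a per-pixel attention weighting applied hierarchically across adjacent scales. It is an illustration under stated assumptions, not the authors' implementation. The function names, the flattened-list representation of a score map, the weighting rule `alpha * low + (1 - alpha) * high`, and the low-to-high chaining order are all hypothetical choices made for this example; the sketch also assumes every prediction has already been resampled to a common resolution and that the attention values are given rather than predicted by a network.

```python
from typing import List

# One class's per-pixel scores, flattened to a list (an assumption of this sketch).
Scores = List[float]


def combine_average(preds: List[Scores]) -> Scores:
    """Baseline: average the predictions from all scales, pixel by pixel."""
    n = len(preds)
    return [sum(vals) / n for vals in zip(*preds)]


def combine_max(preds: List[Scores]) -> Scores:
    """Baseline: take the maximum prediction over scales, pixel by pixel."""
    return [max(vals) for vals in zip(*preds)]


def combine_attention_pair(low: Scores, high: Scores, alpha: Scores) -> Scores:
    """Blend two scales with a per-pixel weight alpha in [0, 1].

    alpha close to 1 trusts the lower-scale prediction at that pixel;
    alpha close to 0 trusts the higher-scale prediction.
    """
    return [a * l + (1.0 - a) * h for l, h, a in zip(low, high, alpha)]


def combine_hierarchical(preds: List[Scores], alphas: List[Scores]) -> Scores:
    """Chain pairwise blends across adjacent scales (hypothetical ordering).

    preds is ordered from lowest to highest scale, and alphas[i] weights the
    running result against preds[i + 1].
    """
    if len(alphas) != len(preds) - 1:
        raise ValueError("need exactly one attention map per adjacent pair")
    result = preds[0]
    for nxt, alpha in zip(preds[1:], alphas):
        result = combine_attention_pair(result, nxt, alpha)
    return result


if __name__ == "__main__":
    low, high = [0.2, 0.8], [0.6, 0.4]
    alpha = [1.0, 0.0]  # pixel 0 trusts the low scale, pixel 1 the high scale
    print(combine_average([low, high]))
    print(combine_max([low, high]))
    print(combine_attention_pair(low, high, alpha))
```

Because each attention map in this sketch only relates one pair of adjacent scales, adding another scale at inference time means adding one more pairwise blend to the chain rather than changing the existing ones.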

Comments:	11 pages, 5 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2005.10821 [cs.CV]
	(or arXiv:2005.10821v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2005.10821

Submission history

From: Andrew Tao
[v1] Thu, 21 May 2020 17:55:59 UTC (6,748 KB)
