| [1] |
Mo Y, Wu Y, Yang X, et al. Review the state-of-the-art technologies of semantic segmentation based on deep learning[J]. Neurocomputing, 2022, 49(3):626-646.
|
| [2] |
于润润, 姜晓燕, 朱凯赢, 等. 基于上下文注意力机制的实时语义分割[J]. 电子科技, 2022, 35(12):57-63.
|
|
Yu Runrun, Jiang Xiaoyan, Zhu Kaiying, et al. Real-time image semantic segmentation based on contextual attention mechanism[J]. Electronic Science and Technology, 2022, 35(12):57-63.
|
| [3] |
Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation[C]. Boston: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015:3431-3440.
|
| [4] |
Badrinarayanan V, Kendall A, Cipolla R. Segnet:A deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12):2481-2495.
doi: 10.1109/TPAMI.2016.2644615
pmid: 28060704
|
| [5] |
Zhao H S, Shi J P, Qi X J, et al. Pyramid scene parsing network[C]. Honolulu: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017:2881-2890.
|
| [6] |
He K M, Zhang X Y, Ren S Q, et al. Deep residual learning for image recognition[C]. Las Vegas: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016:770-778.
|
| [7] |
Chen L C, Papandreou G, Kokkinos I, et al. Deeplab: Semantic image segmentation with deep convolutional nets,atrous convolution, and fully connected CRFS[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 40(4):834-848.
|
| [8] |
Chen L C, Zhu Y, Papandreou G, et al. Encoder-decoder with atrous separable convolution for semantic image se-gmentation[C]. Munich: Proceedings of the European Conference on Computer Vision, 2018:801-818.
|
| [9] |
Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]. Salt Lake City: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018:7132-7141.
|
| [10] |
Fu J, Liu J, Tian H, et al. Dual attention network for scene segmentation[C]. Long Beach: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019:3146-3154.
|
| [11] |
Huang Z, Wang X, Huang L, et al. Ccnet:Criss-cross attention for semantic segmentation[C]. Long Beach: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019:603-612.
|
| [12] |
Li X, Zhong Z, Wu J, et al. Expectation-maximization attention networks for semantic segmentation[C]. Long Beach: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019:9167-9176.
|
| [13] |
Guo M H, Liu Z N, Mu T J, et al. Beyond self-attention: External attention using two linear layers for visual tasks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 45(5):5436-5447.
|
| [14] |
Woo S, Park J, Lee J Y, et al. Cbam:Convolutional block attention module[C]. Munich: Proceedings of the European Conference on Computer Vision, 2018:3-19.
|
| [15] |
Hou Q, Zhou D, Feng J. Coordinate attention for efficient mobile network design[C]. Paris: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021:13713-13722.
|
| [16] |
Maaz M, Shaker A, Cholakkal H, et al. Edgenext:Efficiently amalgamated CNN-transformer architecture for mobile vision applications[C]. Cham: European Conference on Computer Vision, 2022:3-20.
|
| [17] |
Zhao H, Qi X, Shen X, et al. ICNet for real-time semantic segmentation on high-resolution images[C]. Munich: Proceedings of the European Conference on Computer Vision, 2018:405-420.
|
| [18] |
Sandler M, Howard A, Zhu M, et al. MobileNetV2:Inverted residuals and linear bottlenecks[C]. Salt Lake City: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018:4510-4520.
|
| [19] |
Howard A, Sandler M, Chu G, et al. Searching for MobileNetV3[C]. Long Beach: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019:1314-1324.
|
| [20] |
Chollet F. Xception:Deep learning with depthwise separable convolutions[C]. Honolulu: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017:1251-1258.
|
| [21] |
Yu C, Wang J, Peng C, et al. Bisenet: Bilateral segmentation network for real-time semantic segmentation[C]. Munich: Proceedings of the European Conference on Computer Vision, 2018:325-341.
|
| [22] |
Fan M, Lai S, Huang J, et al. Rethinking bisenet for real-time semantic segmentation[C]. Paris: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021:9716-9725.
|
| [23] |
Peng J, Liu Y, Tang S, et al. PP-LiteSeg:A superior real-time semantic segmentation model[EB/OL].(2022-04-06)[2022-11-12]. https://arxiv.org/pdf/2204.02681.pdf.
|
| [24] |
Everingham M, Van Gool L, Williams C K I, et al. The pascal visual object classes challenge[J]. International Journal of Computer Vision, 2010, 88(1):303-338.
|