Journal of Xidian University ›› 2019, Vol. 46 ›› Issue (2): 17-22.doi: 10.19665/j.issn1001-2400.2019.02.004

Previous Articles     Next Articles

Improved method for image caption with global attention mechanism

MA Shulei1,2,ZHANG Guobin2,JIAO Yang1(),SHI Guangming1   

  1. 1. School of Artificial Intelligence, Xidian Univ., Xi’an 710071, China
    2. The 27th Research Institute of China Electronic Technology Group Corporation, Zhengzhou 450047, China
  • Received:2018-09-28 Online:2019-04-20 Published:2019-04-20
  • Contact: Yang JIAO E-mail:yangjiao@stu.xidian.edu.cn

Abstract:

Aiming at the lack of global information in existing attention based image caption methods, we propose an improved image caption method with global attention mechanism. The proposed method mimics the entire human perception process via designing a global feature extraction network to enhance the global context based on visual attention mechanism. This paper compares the proposed method with the existing attention based image caption technique under the same dataset and hyper parameters, and analyzes the influence of introducing the global feature. The results show that our method outperforms the existing technique in objective evaluations with the challenging Chinese caption dataset. In the subjective evaluation, in the meanwhile, the captions generated by the proposed method describes the image more accurately, vividly and diversely, and they are more close to the natural language.

Key words: image caption, attention mechanism, global feature, convolutional neural network, recurrent neural network

CLC Number: 

  • TP37