›› 2016, Vol. 29 ›› Issue (4): 49-.

• Articles • Previous Articles     Next Articles

An Algorithm for Vectoring SMS Based on Word2vec

WANG Guixin,ZHENG Xiaozong,ZHANG Haoran,ZHANG Xiaochuan   

  1. (1.School of Software Engineering,Chongqing Institute of Engineering,Chongqing 402260,China;
    2.School of Computer Science,Chongqing University of Technology,Chongqing,400054,China)
  • Online:2016-04-15 Published:2016-04-26

Abstract:

This paper proposes a new method of feature extraction of SMS for better spam message filtering.The method uses the latest results and tools of Word2vec based on deep learning theory.With the content and structure characteristics of Chinese short messages in mind,an algorithm of Vectoring SMS is designed based on this tool.The algorithm can effectively match each text message with a vector.The classification's experiments on the spam messages are carried out using the proposed algorithm on the deep belief networks.The results show that the performance of the proposed algorithm is improved by 5% compared with the previously reported results.

Key words: deep belief nets;deep learning;short messages;vectoring

CLC Number: 

  • TP29