Electronic Science and Technology ›› 2021, Vol. 34 ›› Issue (7): 50-55.doi: 10.16180/j.cnki.issn1007-7820.2021.07.009

Previous Articles     Next Articles

Chinese Microblog Polarity Classification Based on Hownet and PMI

HAO Miao,CHEN Linqiang   

  1. Computer and Software School,Hangzhou Dianzi University,Hangzhou 310018,China
  • Received:2020-03-10 Online:2021-07-15 Published:2021-07-05
  • Supported by:
    National University Student Innovation and Entrepreneurship Training Project(201610336013)

Abstract:

To solve the problem of Chinese microblog sentiment classification, a sentiment classification method combining PMI and Hownet based on microblog data is proposed. Through the research on the short and novel features of microblog data, a method of dictionary merging is proposed.The existing dictionaries are merged according to Hownet word similarity, and PMI is used to perform sentiment classification of online words. The network sentiment words are added to construct sentiment dictionary that adapts to the features of microblog text, and sentiment classification models are trained based on the new dictionary combined with supervised learning methods. The experimental results show that using this method for sentiment analysis can effectively identify the impact of new internet words on sentiment analysis, with an accuracy rate of 78.3%. In the sentiment analysis of microblog containing new words on the Internet, the accuracy rate is higher than that of only using dictionaries or supervised learning.

Key words: sentiment dictionary, microblog text classification, supervised learning, sentiment analysis, Hownet similarity, PMI, opinion mining, benchmark words

CLC Number: 

  • TP391