Junhao Zhou, Yue Lu, Hong-Ning Dai, Hao Wang, Hong Xiao
IEEE Access, vol. 7, pp. 38856-38866, 2019
Publication year: 2019

Abstract

Sentiment analysis on Chinese microblogs has received extensive attention recently. Most previous studies focus on identifying sentiment orientation by encoding as many word properties as possible, but they fail to consider contextual features (e.g., the long-range dependencies of words), which are essential in sentiment analysis. In this paper, we propose a Chinese sentiment analysis method that incorporates a word2vec model and a stacked bidirectional long short-term memory (Stacked Bi-LSTM) model. We first employ the word2vec model to capture the semantic features of words and transform words into high-dimensional word vectors. We evaluate the performance of two typical word2vec models: continuous bag-of-words (CBOW) and skip-gram. We then use the Stacked Bi-LSTM model to extract features from the sequential word vectors. Next, we apply a binary softmax classifier to predict the sentiment orientation from the semantic and contextual features. We also conduct extensive experiments on a real dataset collected from Weibo (i.e., one of the most popular Chinese microblogs). The experimental results show that our proposed approach achieves better performance than other machine-learning models.
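
The difference between the two word2vec variants named in the abstract can be illustrated with a minimal sketch of how each one forms training examples. This is not the paper's released code: the function names, the window size, and the sample tokens are assumptions chosen for illustration, and the input is assumed to be already word-segmented.

```python
from typing import List, Tuple


def cbow_pairs(tokens: List[str], window: int = 2) -> List[Tuple[List[str], str]]:
    """CBOW examples: predict the center word from its surrounding context."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        # Context = up to `window` words on each side of the center word.
        context = tokens[lo:i] + tokens[i + 1:i + 1 + window]
        if context:
            pairs.append((context, center))
    return pairs


def skipgram_pairs(tokens: List[str], window: int = 2) -> List[Tuple[str, str]]:
    """Skip-gram examples: predict each context word from the center word."""
    pairs = []
    for context, center in cbow_pairs(tokens, window):
        pairs.extend((center, word) for word in context)
    return pairs


# Hypothetical pre-segmented microblog post ("the weather is nice today").
tokens = ["今天", "天气", "很", "好"]

for context, center in cbow_pairs(tokens, window=1):
    print("CBOW      ", context, "->", center)

for center, word in skipgram_pairs(tokens, window=1):
    print("skip-gram ", center, "->", word)
```

CBOW yields one example per position with the whole context as input, whereas skip-gram yields one example per (center, context word) pair, so the same sentence produces more skip-gram examples than CBOW examples.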

Keywords

  • Long short-term memory (LSTM)
  • Stacked bi-directional LSTM
  • Sentiment analysis
  • Continuous bag-of-words
  • Chinese microblog
  • Contextual features

Bibtex

@ARTICLE{YLu:IEEEAccess19, 
	author={Junhao Zhou and Yue Lu and Hong-Ning Dai and Hao Wang and Hong Xiao}, 
	journal={IEEE Access}, 
	title={Sentiment Analysis of Chinese Microblog Based on Stacked Bidirectional LSTM}, 
	year={2019}, 
	volume={7}, 
	number={}, 
	pages={38856-38866}, 
	doi={10.1109/ACCESS.2019.2905048}, 
}

Note: We have released the dataset, the program code, and the trained models on GitHub. Please refer to this link to download them.
