Journal of Data and Information Science ›› 2019, Vol. 4 ›› Issue (1): 89-113.doi: 10.2478/jdis-2019-0005

• Research Paper • Previous Articles    

Sentiment Analysis of Japanese Tourism Online Reviews

Chuanming Yu1,Xingyu Zhu1,Bolin Feng1,Lin Cai1,Lu An2†()   

  1. 1School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan 430073, China
    2School of Information Management, Wuhan University, Wuhan 430072, China
  • Received:2018-10-15 Revised:2018-12-25 Online:2019-01-31 Published:2019-01-31
  • Contact: Lu An


Purpose: Online reviews on tourism attractions provide important references for potential tourists to choose tourism spots. The main goal of this study is conducting sentiment analysis to facilitate users comprehending the large scale of the reviews, based on the comments about Chinese attractions from Japanese tourism website 4Travel.

Design/methodology/approach: Different statistics- and rule-based methods are used to analyze the sentiment of the reviews. Three groups of novel statistics-based methods combining feature selection functions and the traditional term frequency-inverse document frequency (TF-IDF) method are proposed. We also make seven groups of different rules-based methods. The macro-average and micro-average values for the best classification results of the methods are calculated respectively and the performance of the methods are shown.

Findings: We compare the statistics-based and rule-based methods separately and compare the overall performance of the two method. According to the results, it is concluded that the combination of feature selection functions and weightings can strongly improve the overall performance. The emotional vocabulary in the field of tourism (EVT), kaomojis, negative and transitional words can notably improve the performance in all of three categories. The rule-based methods outperform the statistics-based ones with a narrow advantage.

Research limitation: Two limitations can be addressed: 1) the empirical studies to verify the validity of the proposed methods are only conducted on Japanese languages; and 2) the deep learning technology is not been incorporated in the methods.

Practical implications: The results help to elucidate the intrinsic characteristics of the Japanese language and the influence on sentiment analysis. These findings also provide practical usage guidelines within the field of sentiment analysis of Japanese online tourism reviews.Originality/value: Our research is of practicability. Currently, there are no studies that focus on the sentiment analysis of Japanese reviews about Chinese attractions.

Key words: Sentiment analysis, Japanese reviews, Rule-based methods, Statistics-based methods, Tourism reviews