›› 2018, Vol. 30 ›› Issue (8): 126-137.

Previous Articles     Next Articles

Forecasting Tourist Volume Based on Clustering Method with Screening Keywords of Search Engine Data

Zhang Lingling1,2, Zhang Xiao1,3, Cui Yiwen1,2   

  1. 1. School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190;
    2. Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Sciences, Beijing 100190;
    3. Sino-Danish Center for Education and Research, University of Chinese Academy of Sciences, Beijing 100190
  • Received:2016-01-13 Online:2018-08-28 Published:2018-08-31

Abstract:

As one of the main sources of non-trade foreign exchange earnings, tourism industry has been developing rapidly, and its traffic forecasting is an important part of marketing and operations. However, the general prediction methods based on the Bureau of Statistics data, which are released with a lag, cannot reflect the latest trends of the tourism market. Therefore, the forecasting model based on search engine data and historical traffic statistics data is proposed in this paper, to explore the relationship between search engine data and tourism market forecast passenger traffic. Keywords related to the fluctuation of predictor variables are selected through clustering method to synthesize keywords indexes, so that the effective information between search data and tourism market trends can be further complementary. Then, historical statistics data and synthetic keyword indexes are used to establish autoregressive lag model. This model proves to be more accurate in comparison with the methods based merely on either search indexes or historical data. This paper provides a new method of forecasting tourist volume for tourism business management.

Key words: search engine data, keyword index, clustering method, forecasting tourist volume