Enhancing inflation nowcasting with online search data: a random forest application for Colombia

Borradores de Economia
Number: 
1318
Published: 
Authors:
Felipe Roldán-Ferrína,
Classification JEL: 
C14, C53, E17, E31, E37
Keywords: 
Inflation, nowcasting, Forecasting, Random Forest, Google Trends, Machine learning

The most recent

Jaime Alfredo Bonet-Moron, Jaime Andrés Collazos-Rodríguez, Karen Astrid Rubio-Ramírez, Adolfo Ramírez-Moreno, Andrés Felipe Parra-Solano
Julián Alonso Cárdenas-Cárdenas, Deicy Johana Cristiano-Botia, Eliana Rocío González-Molano, Carlos Alfonso Huertas-Campos
Luis E. Arango, Juan José Ospina-Tejeiro, Fernando Arias-Rodríguez, Oscar Iván Ávila-Montealegre, Jaime Andrés Collazos-Rodríguez, Diana M. Cortázar Gómez, Juan Pablo Cote-Barón, Julio Escobar-Potes, Aarón Levi Garavito-Acosta, Franky Juliano Galeano-Ramírez, Eliana Rocío González-Molano, Maria Camila Gomez Cardona, Anderson Grajales, David Camilo López-Valenzuela, Wilmer Martinez-Rivera, Nicolás Martínez-Cortés, Rocío Clara Alexandra Mora-Quiñones, Sara Naranjo-Saldarriaga, Antonio Orozco, Daniel Parra-Amado, Julián Pérez-Amaya, José Pulido, Karen L. Pulido-Mahecha, Carolina Ramírez-Rodríguez, Sergio Restrepo Ángel, José Vicente Romero-Chamorro, Nicol Valeria Rodríguez-Rodríguez, Norberto Rodríguez-Niño, Diego Hernán Rodríguez-Hernández, Carlos D. Rojas-Martínez, Johana Andrea Sanabria-Domínguez, Diego Vásquez-Escobar

This paper evaluates the predictive capacity of a machine learning model based on Random Forests (RF), combined with Google Trends (GT) data, for nowcasting monthly inflation in Colombia. The proposed RF-GT model is trained using historical inflation data, macroeconomic indicators, and internet search activity. After optimizing the model’s hyperparameters through time series cross-validation, we assess its out-of-sample performance over the period 2023–2024. The results are benchmarked against traditional approaches, including SARIMA, Ridge, and Lasso regressions, as well as professional forecasts from the Banco de la República’s monthly survey of financial analysts (MES). In terms of forecast accuracy, the RF-GT model consistently outperforms the statistical models and performs comparably to the analysts’ median forecast, while offering the additional advantage of producing predictions approximately one and a half weeks earlier. These findings highlight the practical value of integrating alternative data sources and machine learning techniques into the inflation monitoring toolkit of emerging economies.

The results highlight the usefulness of combining machine learning techniques with alternative sources of information to generate timely forecasts that are comparable to those of the market.