Keywords: machine learning, ensemble models, Tikhonov regularization, least squares method, model accuracy, model stability, railway transport
Model and method for evaluating the weighting coefficients of an ensemble machine learning model in the task of forecasting railway freight rates
UDC 004.852
DOI: 10.26102/2310-6018/2026.52.1.005
This article evaluates an ensemble machine learning model developed for forecasting rail freight rates. The empirical base is three years of Russian Railways data comprising approximately 50 million freight shipment records, which provides a representative sample and captures industry-specific data diversity. The ensemble combines the Random Forest, XGBoost, LightGBM, and CatBoost algorithms, with a meta-level implemented as a multiple linear regression. The weighting coefficients are calculated with the ordinary least squares method and with Tikhonov regularization; the regularization stabilizes the solution and reduces the impact of correlated outputs from the base models. Computational experiments show that combining heterogeneous models into an ensemble improves forecasting accuracy compared with the individual algorithms: the mean absolute error (MAE) decreased by 7–13%, the mean absolute percentage error (MAPE) decreased by 6–12%, and the coefficient of determination (R²) increased to 0.942. The ensemble's stability was also assessed with a sliding window method, which identified the forecasting horizons over which the results remain stable. An extended analysis of the ensemble's behavior under varying input features showed that the model is robust to moderate data distortions and yields highly reproducible calculations. The results confirm the practical significance of the proposed approach for transport analytics, transportation planning, and the development of economically sound pricing policies.
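The weighting step described in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: it assumes the meta-level has no intercept, that Tikhonov regularization uses the identity matrix (the ridge form), and that the weights come from the closed-form normal equations w = (PᵀP + αI)⁻¹Pᵀy, where P holds one column of predictions per base model. The function name `ensemble_weights`, the value of `alpha`, and the synthetic data standing in for the four base-model outputs are all hypothetical.

```python
import numpy as np


def ensemble_weights(P, y, alpha=0.0):
    """Solve min_w ||P w - y||^2 + alpha * ||w||^2 via the normal equations.

    P     : (n_samples, n_models) matrix of base-model predictions
    y     : (n_samples,) vector of observed targets
    alpha : regularization strength; 0 gives ordinary least squares,
            alpha > 0 gives Tikhonov (ridge) regularization
    """
    P = np.asarray(P, dtype=float)
    y = np.asarray(y, dtype=float)
    n_models = P.shape[1]
    A = P.T @ P + alpha * np.eye(n_models)  # regularized Gram matrix
    b = P.T @ y
    return np.linalg.solve(A, b)


# Synthetic stand-in data (assumption): four base models whose predictions
# are the true target plus independent noise, hence strongly correlated.
rng = np.random.default_rng(0)
n = 500
y = rng.normal(100.0, 20.0, size=n)
P = np.column_stack(
    [y + rng.normal(0.0, s, size=n) for s in (4.0, 4.5, 5.0, 5.5)]
)

w_ols = ensemble_weights(P, y, alpha=0.0)   # ordinary least squares weights
w_tik = ensemble_weights(P, y, alpha=10.0)  # Tikhonov-regularized weights

y_hat = P @ w_tik                           # ensemble forecast
mae = np.mean(np.abs(y - y_hat))            # mean absolute error

print("OLS weights:     ", np.round(w_ols, 4))
print("Tikhonov weights:", np.round(w_tik, 4))
print("Ensemble MAE:    ", round(float(mae), 4))
```

When the base-model outputs are highly correlated, PᵀP is close to singular and the least squares weights become sensitive to small changes in the data. Adding αI bounds the smallest eigenvalue away from zero, which is the stabilizing effect the abstract refers to. In practice α would be chosen by validation rather than fixed as here.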
For citation: Bukharova K.A. Model and method for evaluating the weighting coefficients of an ensemble machine learning model in the task of forecasting railway freight rates. Modeling, Optimization and Information Technology. 2026;14(1). URL: https://moitvivt.ru/ru/journal/pdf?id=2123 DOI: 10.26102/2310-6018/2026.52.1.005 (In Russ.).
Received 06.11.2025
Revised 29.12.2025
Accepted 13.01.2026