Модель и метод оценки весовых коэффициентов ансамбля машинного обучения в задаче прогнозирования провозной платы железнодорожных грузовых перевозок
Работая с сайтом, я даю свое согласие на использование файлов cookie. Это необходимо для нормального функционирования сайта, показа целевой рекламы и анализа трафика. Статистика использования сайта обрабатывается системой Яндекс.Метрика
Научный журнал Моделирование, оптимизация и информационные технологииThe scientific journal Modeling, Optimization and Information Technology
Online media
issn 2310-6018

Model and method for evaluating the weighting coefficients of an ensemble machine learning model in the task of forecasting railway freight rates

Bukharova K.A. 

UDC 004.852
DOI: 10.26102/2310-6018/2026.52.1.005

  • Abstract
  • List of references
  • About authors

This article examines the effectiveness of a developed ensemble machine learning model for forecasting rail freight rates. Russian Railways data for a three-year period, comprising approximately 50 million freight shipment records, serves as the empirical base. This dataset ensures a representative sample and accounts for industry-specific data diversity. An ensemble model is developed using the Random Forest, XGBoost, LightGBM, and CatBoost algorithms, with a meta-level implemented as a multivariate linear regression. The ordinary least squares method and Tikhonov regularization are used to calculate the weighting coefficients. This approach stabilizes the solution and reduces the impact of correlated outputs from the base models. Results of computational experiments have shown that combining heterogeneous models into an ensemble improves forecasting accuracy compared to individual algorithms. The average absolute error decreased by 7–13 %, and the average absolute percentage error by 6–12 %, while the determination coefficient increased to 0.942. Additionally, the ensemble's stability was assessed using a sliding window method, which allowed us to determine forecasting horizons that maintain stable results. An extended analysis of the ensemble's behavior with varying input features showed that the model is robust to moderate data distortions and maintains high calculation reproducibility. The obtained results confirm the practical significance of the proposed approach for transport analytics, transportation planning, and the development of economically sound pricing policies.

1. Zagidullin R., Khaybullin A. Traffic Congestion Forecasting Using Machine Learning Methods. Transportation and Information Technologies in Russia. 2025;15(2):202–216. (In Russ.). https://doi.org/10.12731/2227-930X-2025-15-2-347

2. Pobirchenko V.R., Pobirchenko V.V. Teoreticheskie aspekty modelirovaniya sotsial'no-ekonomicheskogo razvitiya regiona. Uchenye zapiski Krymskogo federal'nogo universiteta imeni V.I. Vernadskogo. Ekonomika i upravlenie. 2025;11(2):170–182. (In Russ.).

3. Vegera Z.G. The Use of Generative Artificial Intelligence (AI) to Analyze Educational Data and Predict Student Academic Performance. Education Management Review. 2024;14(8-1):116–125. (In Russ.). https://doi.org/10.25726/j2473-1350-7803-t

4. Zhou Zh.-H. Ensemble Methods: Foundations and Algorithms. New York: Chapman & Hall/CRC; 2012. 236 p. https://doi.org/10.1201/b12207

5. Hyndman R.J., Athanasopoulos G. Forecasting: Principles and Practice. Melbourne: OTexts; 2021. 442 p.

6. Hastie T., Tibshirani R., Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York: Springer; 2009. 745 p. https://doi.org/10.1007/978-0-387-84858-7

7. Zhang Yu., Ma J., Liang Sh., Li X., Liu J. A Stacking Ensemble Algorithm for Improving the Biases of Forest Aboveground Biomass Estimations from Multiple Remotely Sensed Datasets. GIScience & Remote Sensing. 2022;59(1):234–249. https://doi.org/10.1080/15481603.2021.2023842

8. Rukomin M.A. Review of Ensemble Models in Predictive Analytics and Their Comparison with Traditional Machine Learning Approaches. Science Bulletin. 2025;1(8):368–373. (In Russ.).

9. Tikhonov A.N., Arsenin V.Ya. Metody resheniya nekorrektnykh zadach. Moscow: Nauka; 1979. 288 p. (In Russ.).

10. Kharitonova A.E. Forecasting the Tax Burden of Agricultural Enterprises by Machine Learning Methods. Taxes and Taxation. 2023;(4):28–38. (In Russ.). https://doi.org/10.7256/2454-065X.2023.4.43917

Bukharova Ksenya Alekseevna

Emperor Alexander I St. Petersburg State Transport University

St. Petersburg, Russian Federation

Keywords: machine learning, ensemble models, tikhonov regularization, least squares method, model accuracy, model stability, railway transport

For citation: Bukharova K.A. Model and method for evaluating the weighting coefficients of an ensemble machine learning model in the task of forecasting railway freight rates. Modeling, Optimization and Information Technology. 2026;14(1). URL: https://moitvivt.ru/ru/journal/pdf?id=2123 DOI: 10.26102/2310-6018/2026.52.1.005 (In Russ).

5

Full text in PDF

Received 06.11.2025

Revised 29.12.2025

Accepted 13.01.2026