Keywords: phishing, cybersecurity, machine learning, multilayer perceptron, random forest, URL classification, phishing detection, data protection
The development of a phishing attack protection system using software-hardware implementation of machine learning methods
UDC 004.056.53
DOI: 10.26102/2310-6018/2024.47.4.033
Due to the constant evolution of phishing attacks, traditional protection methods, such as URL filtering and user training, have become insufficiently effective. The article examines modern methods of detecting phishing attacks using machine learning algorithms aimed at improving the accuracy and efficiency of URL classification. The developed system employs a multilayer perceptron for automatic URL analysis and classification of links as either phishing or legitimate. Creating a high-quality, representative dataset containing both phishing and legitimate links is one of the key stages in model development. The focus is on analyzing URL addresses based on 30 key features, including URL length, SSL certificate presence, and IP address usage. The model testing results demonstrated high accuracy, significantly surpassing the results of traditional filtering methods. The developed software, implemented in Python with TensorFlow and Scikit-Learn libraries, proved highly effective in real-world conditions, ensuring high accuracy, recall, and F1 score. The results confirm that machine learning enhances the efficiency and accuracy of phishing detection compared to traditional methods.
1. Karpova N.E., Voskanyan I.I. Threat of social engineering and phishing in modern information security. Digital Technology Security. 2024;(2):69–78. (In Russ.). https://doi.org/10.17212/2782-2230-2024-2-69-78
2. Duo W., Zhou M., Abusorrah A. A Survey of Cyber Attacks on Cyber Physical Systems: Recent Advances and Challenges. IEEE/CAA Journal of Automatica Sinica. 2022;9(5):784–800. https://doi.org/10.1109/JAS.2022.105548
3. Lukmanova K.A., Kartak V.M. Recognition of phishing links using machine learning methods. Digital Technology Security. 2024;(3):9–20. (In Russ.). https://doi.org/10.17212/2782-2230-2024-3-9-20
4. Hussein S.K., Wahaballah A., Alosaimi A. Detecting Phishing Websites Using Natural Language Processing. International Journal of Computer Engineering in Research Trends. 2021;8(12):220–227.
5. Kutlyev D.Z., Shmanina A.V. Ispol'zovanie algoritmov mashinnogo obucheniya dlya zashchity ot URL-fishinga. In: Mavlyutovskie chteniya: Materialy XV Vserossiiskoi molodezhnoi nauchnoi konferentsii: in 7 volumes: Volume 4, 26–28 October 2021, Ufa, Russia. Ufa: Ufa State Aviation Technical University; 2021. pp. 430–435. (In Russ.).
6. Bahnsen A.C., Bohorquez E.C., Villegas S., Vargas J., González F.A. Classifying phishing URLs using recurrent neural networks. In: 2017 APWG Symposium on Electronic Crime Research (eCrime), 25–27 April 2017, Scottsdale, USA. IEEE; 2017. pp. 1–8. https://doi.org/10.1109/ECRIME.2017.7945048
7. Sahingoz O.K., Buber E., Demir O., Diri B. Machine learning based phishing detection from URLs. Expert Systems with Applications. 2019;117:345–357. https://doi.org/10.1016/j.eswa.2018.09.029
8. Artyushkina E.S., Andiryakova O.O., Tyurina D.A. Using machine learning methods in analyzing network traffic and malicious software. Industrial Economics. 2023;(4):12–15. (In Russ.).
9. Ma J., Saul L.K., Savage S., Voelker G.M. Beyond blacklists: learning to detect malicious web sites from suspicious URLs. In: KDD '09: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 28 June 2009 – 1 July 2009, Paris, France. New York: Association for Computing Machinery; 2009. pp. 1245–1254. https://doi.org/10.1145/1557019.1557153
10. Dutta A.K. Detecting phishing websites using machine learning technique. PLoS ONE. 2021;16(10). https://doi.org/10.1371/journal.pone.0258361
11. Saheed Y.K., Arowolo M.O. Efficient Cyber Attack Detection on the Internet of Medical Things-Smart Environment Based on Deep Recurrent Neural Network and Machine Learning Algorithms. IEEE Access. 2021;9:161546–161554. https://doi.org/10.1109/ACCESS.2021.3128837
Keywords: phishing, cybersecurity, machine learning, multilayer perceptron, random forest, URL classification, phishing detection, data protection
For citation: Lukmanova K.A., Kartak V.M. The development of a phishing attack protection system using software-hardware implementation of machine learning methods. Modeling, Optimization and Information Technology. 2024;12(4). URL: https://moitvivt.ru/ru/journal/pdf?id=1738 DOI: 10.26102/2310-6018/2024.47.4.033 (In Russ).
Received 09.11.2024
Revised 13.12.2024
Accepted 18.12.2024