Keywords: unmanned aerial vehicle, acoustic signals, acoustic features, spectral analysis, machine learning
Selection of acoustic features in unmanned aerial vehicle detection tasks
UDC 004.62:623.76
DOI: 10.26102/2310-6018/2025.50.3.007
With the increasing number of incidents involving the unauthorized use of unmanned aerial vehicles (UAVs), the development of effective methods for their automatic detection has become increasingly relevant. This article provides a concise overview of current approaches to UAV detection, with particular emphasis on acoustic monitoring methods, which offer several advantages over radio-frequency and visual systems. The main acoustic features used for recognizing drone sound signals are examined, along with techniques for extracting these features using open-source libraries such as Librosa and Essentia. To evaluate the effectiveness of various features, a balanced dataset was compiled and utilized, containing audio recordings of drones and background noise. A multi-stage feature selection methodology was tested using the Feature-engine library, including the removal of constant and duplicate features, correlation analysis, and feature importance assessment. As a result, a subset of 53 acoustic features was obtained, providing a balance between UAV detection accuracy and computational cost. The mathematical foundations of spectral feature extraction are described, including different types of spectrograms (mel-, bark-, and gammatone-spectrograms), as well as vector and scalar acoustic features. The results presented can be used to develop automatic UAV acoustic detection systems based on machine learning methods.
1. Seidaliyeva U., Ilipbayeva L., Taissariyeva K., Smailov N., Matson E.T. Advances and Challenges in Drone Detection and Classification Techniques: A State-of-the-Art Review. Sensors. 2023;24(1). https://doi.org/10.3390/s24010125
2. Lee H., Han S., Byeon J.-I., et al. CNN-Based UAV Detection and Classification Using Sensor Fusion. IEEE Access. 2023;11:68791–68808. https://doi.org/10.1109/ACCESS.2023.3293124
3. Tejera-Berengue D., Zhu-Zhou F., Utrilla-Manso M., Gil-Pita R., Rosa-Zurera M. Analysis of Distance and Environmental Impact on UAV Acoustic Detection. Electronics. 2024;13(3). https://doi.org/10.3390/electronics13030643
4. Patel K., Ramirez L., Canales D., Rojas E. Unmanned Aerial Vehicles Detection Using Acoustics and Quantum Signal Processing. In: 2024 AIAA Science and Technology Forum and Exposition, 08–12 January 2024, Orlando, FL, USA. American Institute of Aeronautics and Astronautics; 2024. https://doi.org/10.2514/6.2024-1740
5. Taha B., Shoufan A. Machine Learning-Based Drone Detection and Classification: State-of-the-Art in Research. IEEE Access. 2019;7:138669–138682. https://doi.org/10.1109/ACCESS.2019.2942944
6. Najafi Ja., Mirzakuchaki S., Shamaghdari S. Autonomous Drone Detection and Classification Using Computer Vision and Prony Algorithm-Based Frequency Feature Extraction. Journal of Intelligent & Robotic Systems. 2025;111(1). https://doi.org/10.1007/s10846-024-02216-x
7. Zhang Yi.D., Xiang X., Li Yi, Chen G. Enhanced Micro-Doppler Feature Analysis for Drone Detection. In: 2021 IEEE Radar Conference (RadarConf21), 07–14 May 2021, Atlanta, GA, USA. IEEE; 2021. P. 1–4. https://doi.org/10.1109/RadarConf2147009.2021.9455228
8. Souli N., Theodorou I., Kolios P., Ellinas G. Detection and Tracking of Rogue UASs Using a Novel Real-Time Passive Radar System. In: 2022 International Conference on Unmanned Aircraft Systems (ICUAS), 21–24 June 2022, Dubrovnik, Croatia. IEEE; 2022. P. 576–582. https://doi.org/10.1109/ICUAS54217.2022.9836054
9. McCoy J., Rawat D.B. Optimized Machine Learning Based Multimodal UAV Detection Using Ensemble Stacking. In: 2024 IEEE 6th International Conference on Cognitive Machine Intelligence (CogMI), 28–31 October 2024, Washington, DC, USA. IEEE; 2024. P. 40–49. https://doi.org/10.1109/CogMI62246.2024.00016
10. Zahid Rao A., Shahid Siddique S., Danish Mujib M., Abul Hasan M., Alokaily A.O., Tahira T. Sensor Fusion and Machine Learning for Seated Movement Detection with Trunk Orthosis. IEEE Access. 2024;12:41676–41687. https://doi.org/10.1109/ACCESS.2024.3377111
11. Wang Ye, Chen Yu., Choi J., Kuo C.-C.J. Towards Visible and Thermal Drone Monitoring with Convolutional Neural Networks. APSIPA Transactions on Signal and Information Processing. 2019;8(1). https://doi.org/10.1017/ATSIP.2018.30
12. Guo Ju., Ahmad I., Chang K. Classification, Positioning, and Tracking of Drones by HMM Using Acoustic Circular Microphone Array Beamforming. EURASIP Journal on Wireless Communications and Networking. 2020;2020(1). https://doi.org/10.1186/s13638-019-1632-9
13. Diao Yu., Zhang Yi., Zhao G., Khamis M. Drone Authentication via Acoustic Fingerprint. In: ACSAC '22: Proceedings of the 38th Annual Computer Security Applications Conference, 05–09 December 2022, Austin, TX, USA. New York: Association for Computing Machinery; 2022. P. 658–668. https://doi.org/10.1145/3564625.3564653
14. Deleforge A., Carlo D.D., Strauss M., Serizel R., Marcenaro L. Audio-Based Search and Rescue with a Drone: Highlights from the IEEE Signal Processing Cup 2019 Student Competition. IEEE Signal Processing Magazine. 2019;36(5):138–144. https://doi.org/10.1109/MSP.2019.2924687
15. Marple S.L., Jr. Digital Spectral Analysis. Mineola, New York: Dover Publications; 2019. 432 p.
16. Haykin S., Liu K.J.R. Handbook on Array Processing and Sensor Networks. Hoboken: John Wiley & Sons; 2009. 924 p.
17. Flanagan J.L. Speech Analysis Synthesis and Perception. Berlin, Heidelberg: Springer; 1972. 446 p. https://doi.org/10.1007/978-3-662-01562-9
18. O’Shaughnessy D. Speech Communication: Human and Machine. Reading: Addison-Wesley; 1990. 548 p.
19. Traunmüller H. Analytical Expressions for the Tonotopic Sensory Scale. The Journal of the Acoustical Society of America. 1990;88(1):97–100.
20. Van Gisbergen J.A.M., Grashuis J.L., Johannesma P.I.M., Vendrik A.J.H. Neurons in the Cochlear Nucleus Investigated with Tone and Noise Stimuli. Experimental Brain Research. 1975;23(4):387–406. https://doi.org/10.1007/BF00238022
21. Davis S., Mermelstein P. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing. 1980;28(4):357–366. https://doi.org/10.1109/TASSP.1980.1163420
22. Xu M., Duan L.-Yu, Cai J., Chia L.-T., Xu Ch., Tian Q. HMM-Based Audio Keyword Generation. In: Advances in Multimedia Information Processing – PCM 2004: 5th Pacific Rim Conference on Multimedia: Proceedings: Part III, 30 November – 03 December 2004, Tokyo, Japan. Berlin, Heidelberg: Springer; 2004. P. 566–574. https://doi.org/10.1007/978-3-540-30543-9_71
23. Qi J., Wang D., Xu J., Tejedor J. Bottleneck Features Based on Gammatone Frequency Cepstral Coefficients. In: INTERSPEECH 2013: 14th Annual Conference of the International Speech Communication Association, 25–29 August 2013, Lyon, France. ISCA; 2013. P. 1751–1755. https://doi.org/10.21437/Interspeech.2013-435
24. Bartsch M.A., Wakefield G.H. Audio Thumbnailing of Popular Music Using Chroma-Based Representations. IEEE Transactions on Multimedia. 2005;7(1):96–104. https://doi.org/10.1109/TMM.2004.840597
25. Müller M., Kurth F., Clausen M. Audio Matching via Chroma-Based Statistical Features. In: ISMIR 2005: 6th International Conference on Music Information Retrieval: Proceedings, 11–15 September 2005, London, UK. 2005. P. 288–295. https://doi.org/10.5281/zenodo.1416799
26. Jiang D.-N., Lu L., Zhang H.-J., Tao J.-H., Cai L.-H. Music Type Classification by Spectral Contrast Feature. In: IEEE International Conference on Multimedia and Expo: Proceedings, 26–29 August 2002, Lausanne, Switzerland. IEEE; 2002. P. 113–116. https://doi.org/10.1109/ICME.2002.1035731
27. De Cheveigné A., Kawahara H. YIN, a Fundamental Frequency Estimator for Speech and Music. The Journal of the Acoustical Society of America. 2002;111(4):1917–1930.
28. Klapuri A. Qualitative and Quantitative Aspects in the Design of Periodicity Estimation Algorithms. In: 2000 10th European Signal Processing Conference, 04–08 September 2000, Tampere, Finland. IEEE; 2000. P. 1–4.
Keywords: unmanned aerial vehicle, acoustic signals, acoustic features, spectral analysis, machine learning
For citation: Prozorov D.E., Byzov V.A., Myshkin R.E. Selection of acoustic features in unmanned aerial vehicle detection tasks. Modeling, Optimization and Information Technology. 2025;13(3). URL: https://moitvivt.ru/ru/journal/pdf?id=1955 DOI: 10.26102/2310-6018/2025.50.3.007 (In Russ).
Received 13.05.2025
Revised 17.06.2025
Accepted 23.06.2025