YOLOXpress: A lightweight real-time unmanned aerial vehicle detection algorithm

The widespread use of drones has made drone detection a critical factor in various fields, particularly in security and defense. However, this task presents unique challenges due to the high speed, small size, and ability of drones to blend into their surroundings, which can hinder detection effectiveness. This paper introduces enhancements to the You Only Look Once (YOLO)-v8 model to improve real-time drone detection capabilities, especially when deployed on resource-constrained devices. We propose an improved model called YOLOXpress, which optimizes both processing speed and model size while maintaining an acceptable level of accuracy. By replacing the Cross-Stage Feature Fusion modules in the Backbone and Neck with Re-parameterization Convolution and RepC3 modules, we significantly reduced the number of computations, achieving a 12.25% increase in processing speed (frames per second) and a 69.96% reduction in model size. Although there was a 6% decrease in average accuracy compared to the original YOLO-v8 model, YOLOXpress remained effective for real-time drone detection. Experiments conducted on the TIB-Net dataset confirmed that this model is highly suitable for deployment on resource-limited devices, such as compact embedded systems.
- Al-lQubaydhi, N., Alenezi, A., Alanazi, T., Senyor, A., Alanezi, N., Alotaibi, B., Alotaibi, M., Razaque, A., & Hariri, S. (2024). Deep Learning for Unmanned Aerial Vehicles Detection: A Review. *Computer Science Review*, 51, 100614.
- Alsanad, H.R., Sadik, A.Z., Ucan, O.N., Ilyas, M., & Bayat, O. (2022). YOLO-V3 Based Real-Time Drone Detection Algorithm. *Multimedia Tools and Applications*, 81(18), 26185–26198.
- Bochkovskiy, A., Wang, C.Y., & Liao, H.Y.M. (2020). Yolov4: Optimal Speed and Accuracy of Object Detection. [arXiv Preprint] arXiv:2004.10934.
- Dosovitskiy, A. (2020). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. [arXiv Preprint] arXiv:2010.11929.
- Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., & Adam, H. (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications.[arXiv Preprint] arXiv:1704.04861, p. 126.
- Kim, J.H., Kim, N., & Won, C.S. (2023). High-Speed Drone Detection Based on Yolo-v8. In: *ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)*.
- Lee, Y., Hwang, J.W., Lee, S., Bae, Y., & Park, J. (2019). An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object Detection. In: *2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)*.
- Li, C., Li, L., Jiang, H., Weng, K., Geng, Y., Li, L., Ke, Z., Li, Q., Cheng, M., Nie, W., Li, Y., Zhang, B., Liang, Y., Zhou, L., Xu, X., Chu, X., Wei, X., & Wei, X. (2022). YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications. [arXiv Preprint] arXiv:2205.03199.
- Liu, Y., Liu, D., Wang, B., & Chen, B. (2022). Mob-YOLO: A Lightweight UAV Object Detection Method. In: *2022 International Conference on Sensing, Measurement and Data Analytics in the Era of Artificial Intelligence (ICSMD)*.
- Long, X., Deng, K., Wang, G., Zhang, Y., Dang, Q., Gao, Y., Shen, H., Ren, J., Han, S., & Ding, E. (2020). PP-YOLO: An Effective and Efficient Implementation of Object Detector. [arXiv Preprint] arXiv:2007.12099.
- Lv, Y., Ai, Z., Chen, M., Gong, X., Wang, Y., & Lu, Z. (2022). High-Resolution Drone Detection Based on Background Difference and SAG-Yolov5s. *Sensors (Basel)*, 22(15), 5825. https://doi.org/10.3390/s22155825
- Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., & Chen, L.C. (2018). MobileNetV2: Inverted Residuals and Linear Bottlenecks. In: *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition*.
- Sun, H., Yang, J., Shen, J., Liang, D., Ning-Zhong, L., & Zhou, H. (2020). TIB-Net: Drone Detection Network with Tiny Iterative Backbone. *IEEE Access*, 8, 130697–130707. https://doi.org/10.1109/ACCESS.2020.3009518
- Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., & Rabinovich, A. (2015). Going Deeper with Convolutions. In: *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition*.
- Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., & Wojna, Z. (2016). Rethinking the Inception Architecture for Computer Vision. In: *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
- Terven, J., Córdova-Esparza, D.M., & Romero-González, J.A. (2023). A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas. Machine Learning and Knowledge Extraction, 5(4), 1680–1716.
- Wei, T., Tian, Y., & Chen, C.W. (2021). Rethinking Convolution: Towards an Optimal Efficiency. In: *Under Review as a Conference Paper at ICLR*.
- Wu, T., & Dong, Y. (2023). YOLO-SE: Improved YOLOv8 for Remote Sensing Object Detection and Recognition. *Applied Sciences*, 13(24), 12977.
- Yılmaz, H.B., & Oruç, F. (2024). Drone Detection Performance Evaluation via Real Experiments with Additional Synthetic Darkness. *Gazi University Journal of Science Part A: Engineering and Innovation*, 11(3), 546–562. https://doi.org/10.54287/gujsa.1526979
- Zamri, F.N.M., Gunawan, T.S., Yusoff, S.H., Alzahrani, A.A., Bramantoro, A., & Kartiwi, M. (2024). Enhanced Small Drone Detection Using Optimized YOLOv8 with Attention Mechanisms. *IEEE Access*, 12, 90629–90643. https://doi.org/10.1109/ACCESS.2024.3420730
- Zhai, X., Huang, Z., Li, T., Liu, H., & Wang, S. (2023). YOLO-Drone: An Optimized YOLOv8 Network for Tiny UAV Object Detection. *Electronics*, 12(17), 3664. https://doi.org/10.3390/electronics12173664
- Zhu, X., Lyu, S., Wang, X., & Zhao, Q. (2021). TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-Captured Scenarios. In: *2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).