Deep Learning-based Obstacle Detection for Human Interaction Robots: A Review

Farhana Ahmed; Nor Hidayati  Abdul Aziz; Rosli  Besar; Saad Salam; Md. Abdullah  Man

doi:10.33093/ijoras.2025.7.3.10

PDF

Published: Dec 10, 2025

DOI: https://doi.org/10.33093/ijoras.2025.7.3.10

Keywords:

Obstacle Detection, Deep Learning, Power Consumption, Computational Efficiency, CNN-Based Method, Deployment Feasibility, Lightweight Model

Farhana Ahmed

Faculty of Engineering and Technology, Multimedia University (Malaysia)

Nor Hidayati Abdul Aziz

Faculty of Engineering and Technology, Multimedia University (Malaysia)

https://orcid.org/0000-0001-7995-4912

Rosli Besar

Faculty of Engineering and Technology, Multimedia University (Malaysia)

Saad Salam

Mechanical Engineering Department, Chittagong University of Engineering and Technology (Bangladesh)

Md. Abdullah Man

TM R&D (Malaysia)

Abstract

Obstacle detection is the foundation of autonomous robotics, enabling robots to perceive and understand the world around them to move safely. Deep learning has emerged as one of the driving forces in today’s research, with various algorithms employed for learning and making effective decisions based on vast and complex datasets. In recent years, numerous deep learning methods have been developed and studied to detect obstacles. This paper provides an end-to-end overview of over 40 state-of-the-art deep learning models (from 50 papers) for obstacle detection in human-interacting robots, with a focus on deployment viability, real-time running, and energy efficiency. We also delve into the architecture of deep learning, highlight key challenges in real-world deployment, offer a comparative analysis of basic and advanced deep learning approaches, and examine the trade-offs between accuracy, speed, and power consumption, providing insights into practical considerations. This review categorizes obstacle detection techniques into two groups: Core CNN-based methods and Advanced Deep Learning Methods. Comparisons were made between these two groups, concentrating on computational requirements, deployment feasibility, and hardware configuration. Several key findings emerged. It was determined that models with high accuracy were computationally expensive and unsuitable for embedded deployment. While some models experience accuracy-speed trade-offs, others are limited by hardware constraints and power limitations. Finally, this review concludes with a structured discussion of real-world deployment considerations, prioritizing model efficiency, scalability, and potential future research directions in deep learning-based obstacle detection.

Manuscript received: 30 Jun 2025 | Revised: 28 Jul 2025 | Accepted: 11 Aug 2025 | Published: 30 Nov 2025

How to Cite

Ahmed, F., Abdul Aziz, N. H. ., Besar, R. ., Salam, S., & Man, M. A. . (2025). Deep Learning-based Obstacle Detection for Human Interaction Robots: A Review. International Journal on Robotics, Automation and Sciences, 7(3), 75–86. https://doi.org/10.33093/ijoras.2025.7.3.10

Issue

Vol. 7 No. 3 (2025): International Journal on Robotics, Automation and Sciences

Section

NexSymp2025 (Science & Technology)

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

References

E.M.G.N.V. Cruz, S. Oliveira and A. Correia, “Robotics Applications in the Hospital Domain: A Literature Review,” Applied System Innovation 2024, Vol. 7, Page 125, vol. 7, no. 6, p. 125, 2024.

DOI: https://doi.org/10.3390/asi7060125

H.A. Berkers, S. Rispens and P.M. Le Blanc, “The role of robotization in work design: a comparative case study among logistic warehouses,” International Journal of Human Resource Management, vol. 34, no. 9, pp. 1852–1875, 2023.

DOI: https://doi.org/10.1080/09585192.2022.2043925

D. Ristic-Durrant, M. Franke and K. Michels, “A Review of Vision-Based On-Board Obstacle Detection and Distance Estimation in Railways,” Sensors 2021, Vol. 21, Page 3452, vol. 21, no. 10, p. 3452, 2021.

DOI: https://doi.org/10.3390/s21103452

N.S. Ahmad, N.L. Boon and P. Goh, “Multi-sensor obstacle detection system via model-based state-feedback control in smart cane design for the visually challenged,” IEEE Access, vol. 6, pp. 64182–64192, 2018.

DOI: https://doi.org/10.1109/ACCESS.2018.2878423

A.B. Atitallah, Y. Said, M.A.B. Atitallah, M. Albekairi, K. Kaaniche and S. Boubaker, “An effective obstacle detection system using deep learning advantages to aid blind and visually impaired navigation,” Ain Shams Engineering Journal, vol. 15, no. 2, 2024.

DOI: https://doi.org/10.1016/j.asej.2023.102387

L. Liu et al., “Deep Learning for Generic Object Detection: A Survey,” International Journal of Computer Vision, vol. 128, no. 2, pp. 261–318, 2020.

DOI: https://doi.org/10.1007/s11263-019-01247-4

F. Shao et al., “Deep Learning for Weakly-Supervised Object Detection and Localization: A Survey,” Neurocomputing, vol. 496, pp. 192–207, 2022.

DOI: https://doi.org/10.1016/J.NEUCOM.2022.01.095

G. Li et al., “Implicit Feature Contrastive Learning for Few-Shot Object Detection,” Computers, Materials and Continua, vol. 84, no. 1, pp. 1615–1632, 2025.

DOI: https://doi.org/10.32604/CMC.2025.063109

L. Cao and X. Zhu, “An autonomous service mobile robot for indoor environments,” Proceedings of 2020 Asia-Pacific Conference on Image Processing, Electronics and Computers(IPEC) 2020, pp. 8–15, 2020.

DOI: https://doi.org/10.1109/IPEC49694.2020.9115180

A. Younis, L. Shixin, J.N. Shelembi and Z. Hai, “Real-time object detection using pre-trained deep learning models mobilenet- SSD,” ACM International Conference Proceeding Series, pp. 44–48, 2020.

DOI: https://doi.org/10.1145/3379247.3379264

M. Afif, R. Ayachi, Y. Said and M. Atri, “Deep embedded lightweight CNN network for indoor objects detection on FPGA,” Journal of Parallel and Distributed Computing, vol. 201, p. 105085, 2025.

DOI: https://doi.org/10.1016/J.JPDC.2025.105085

C. Lin, Y. Cheng, X. Wang, J. Yuan and G. Wang, “Transformer-Based Dual-Channel Self-Attention for UUV Autonomous Collision Avoidance,” IEEE Transactions on Intelligent Vehicles, vol. 8, no. 3, pp. 2319–2331, 2023.

DOI: https://doi.org/10.1109/TIV.2023.3245615

U. Aulia, I. Hasanuddin, M. Dirhamsyah, and N. Nasaruddin, “A new CNN-BASED object detection system for autonomous mobile robots based on real-world vehicle datasets,” Heliyon, vol. 10, no. 15, 2024.

DOI: https://doi.org/10.1016/j.heliyon.2024.e35247

P. Agrawal et al., “YOLO Algorithm Implementation for Real Time Object Detection and Tracking,” 2022 IEEE Students Conference on Engineering and Systems, 2022.

DOI: https://doi.org/10.1109/SCES55490.2022.9887678

J.H. Kim, N. Kim, Y.W. Park and C.S. Won, “Object Detection and Classification Based on YOLO-V5 with Improved Maritime Dataset,” Journal of Marine Science and Engineering 2022, Vol. 10, Page 377, vol. 10, no. 3, p. 377, 2022.

DOI: https://doi.org/10.3390/JMSE10030377

Z. Zhao, J. Kang, Z. Sun, T. Ye and B. Wu, “A real-time and high-accuracy railway obstacle detection method using lightweight CNN and improved transformer,” Measurement (Lond), vol. 238, 2024.

DOI:https://doi.org/10.1016/j.measurement.2024.115380

A. Jenefa et al., “Real-Time Rail Safety: A Deep Convolutional Neural Network Approach for Obstacle Detection on Tracks,” ICSPC 2023 - 4th International Conference on Signal Processing and Communication, pp. 101–105, 2023.

DOI:https://doi.org/10.1109/ICSPC57692.2023.10125284

M. Afif, Y. Said, R. Ayachi, and M. Hleili, “An End-to-End Object Detection System in Indoor Environments Using Lightweight Neural Network,” Traitement du Signal, vol. 41, no. 5, pp. 2711–2719, 2024.

DOI: https://doi.org/10.18280/ts.410544

Y. Lv, Y. Fang, W. Chi, G. Chen and L. Sun, “Object Detection for Sweeping Robots in Home Scenes (ODSR-IHS): A Novel Benchmark Dataset,” IEEE Access, vol. 9, pp. 17820–17828, 2021.

DOI: https://doi.org/10.1109/ACCESS.2021.3053546

L. Guan, L. Jia, Z. Xie and C. Yin, “A Lightweight Framework for Obstacle Detection in the Railway Image Based on Fast Region Proposal and Improved YOLO-Tiny Network,” IEEE Transactions on Instrumentation and Measurement, vol. 71, 2022.

DOI: https://doi.org/10.1109/TIM.2022.3150584

T. Gong and Y. Ma, “PSO-based lightweight neural architecture search for object detection,” Swarm and Evolutionary Computation, vol. 90, 2024.

DOI: https://doi.org/10.1016/j.swevo.2024.101684

H. Lokhande and S. R. Ganorkar, “Object detection in video surveillance using MobileNetV2 on resource-constrained low-power edge devices,” Bulletin of Electrical Engineering and Informatics, vol. 14, no. 1, pp. 357–365, 2025.

DOI: https://doi.org/10.11591/eei.v14i1.8131

J. Du, S. Zhao, C. Shang, and Y. Chen, “Applying Image Analysis to Build a Lightweight System for Blind Obstacles Detecting of Intelligent Wheelchairs,” Electronics (Switzerland), vol. 12, no. 21, 2023.

DOI: https://doi.org/10.3390/electronics12214472

A. Rana and K.K. Kim, “NAS-OD: Neural Architecture Search for Object Detection,” 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024, 2024.

DOI: https://doi.org/10.1109/ICEIC61013.2024.10457265

L. Chen, Q. Ding, Q. Zou, Z. Chen and L. Li, “DenseLightNet: A Light-Weight Vehicle Detection Network for Autonomous Driving,” IEEE Transactions on Industrial Electronics, vol. 67, no. 12, pp. 10600–10609, 2020.

DOI: https://doi.org/10.1109/TIE.2019.2962413

H. Zhang, C. Lu, and E. Chen, “Obstacle detection: improved YOLOX-S based on swin transformer-tiny,” Optoelectronics Letters, vol. 19, no. 11, pp. 698–704, 2023.

DOI: https://doi.org/10.1007/s11801-023-3018-9

M. Kang, W. Lee, K. Hwang and Y. Yoon, “Vision Transformer for Detecting Critical Situations and Extracting Functional Scenario for Automated Vehicle Safety Assessment,” Sustainability (Switzerland), vol. 14, no. 15, 2022.

DOI: https://doi.org/10.3390/su14159680

Y. Qin, D. He, Z. Jin, Y. Chen and S. Shan, “An Improved Deep Learning Algorithm for Obstacle Detection in Complex Rail Transit Environments,” IEEE Sensors Journal, vol. 24, no. 3, pp. 4011–4022, 2024.

DOI: https://doi.org/10.1109/JSEN.2023.3340688

M.R. Abdurrahman, H. Al-Aziz, F.A. Zayn, M.A. Purnomo, and H.A. Santoso, “Development of Robot Feature for Stunting Analysis Using Long-Short Term Memory (LSTM) Algorithm,” Journal of Informatics and Web Engineering, vol. 3, no. 3, pp. 164–175, 2024,

DOI: https://doi.org/10.33093/jiwe.2024.3.3.10

R. Fang and C. Cai, “Computer vision based obstacle detection and target tracking for autonomous vehicles,” MATEC Web of Conferences, vol. 336, p. 07004, 2021.

DOI: https://doi.org/10.1051/matecconf/202133607004

A. Masoumian, D.G.F. Marei, S. Abdulwahab, J. Cristiano, D. Puig and H.A. Rashwan, “Absolute Distance Prediction Based on Deep Learning Object Detection and Monocular Depth Estimation Models,” Frontiers in Artificial Intelligence and Applications, pp. 325–334, 2021.

DOI: https://doi.org/10.3233/FAIA210151

X. Chen, Y. Liu, and K. Achuthan, “WODIS: Water Obstacle Detection Network Based on Image Segmentation for Autonomous Surface Vehicles in Maritime Environments,” IEEE Transactions on Instrumentation and Measurement, vol. 70, 2021.

DOI: https://doi.org/10.1109/TIM.2021.3092070

H. Kim et al., “Vision-Based Real-Time Obstacle Segmentation Algorithm for Autonomous Surface Vehicle,” IEEE Access, vol. 7, pp. 179420–179428, 2019.

DOI: https://doi.org/10.1109/ACCESS.2019.2959312

P.S. Perumal et al., “LaneScanNET: A deep-learning approach for simultaneous detection of obstacle-lane states for autonomous driving systems,” Expert Systems with Applications, vol. 233, 2023.

DOI: https://doi.org/10.1016/j.eswa.2023.120970

Y. Xie et al., “SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection,” arXiv, 2023.

DOI: https://doi.org/10.48550/arXiv.2304.14340

T. Liu, S. Du, C. Liang, B. Zhang, and R. Feng, “A Novel Multi-Sensor Fusion Based Object Detection and Recognition Algorithm for Intelligent Assisted Driving,” IEEE Access, vol. 9, pp. 81564–81574, 2021.

DOI: https://doi.org/10.1109/ACCESS.2021.3083503

C. Zhang et al., “Robust-FusionNet: Deep Multimodal Sensor Fusion for 3-D Object Detection Under Severe Weather Conditions,” IEEE Transactions on Instrumentation and Measurement, vol. 71, 2022.

DOI: https://doi.org/10.1109/TIM.2022.3191724

H. Xiang, R. Xu and J. Ma, “HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative perception with vision transformer,” Proceedings of the IEEE International Conference on Computer Vision, pp. 284–295, 2023.

DOI: https://doi.org/10.1109/ICCV51070.2023.00033

Y. He et al., “BiViT: Extremely Compressed Binary Vision Transformer,” Proceedings of the IEEE International Conference on Computer Vision, pp. 5628–5640, Nov. 2022.

DOI: https://doi.org/10.1109/ICCV51070.2023.00520.

R. Xu, C.J. Chen, Z. Tu, and M.H. Yang, “V2X-ViTv2: Improved Vision Transformers for Vehicle-to-Everything Cooperative Perception,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 1, pp. 650–662, 2025.

DOI: https://doi.org/10.1109/TPAMI.2024.3479222

D. Minott, S. Siddiqui and R.J. Haddad, “Benchmarking Edge AI Platforms: Performance Analysis of NVIDIA Jetson and Raspberry Pi 5 with Coral TPU,” Conference Proceedings - IEEE SOUTHEASTCON, pp. 1384–1389, 2025. DOI:https://doi.org/10.1109/SoutheastCon56624.2025.10971592

S. Tennekoon, N. Wedasingha, A. Welhenge, N. Abhayasinghe and I. Murray Am, “Advancing Object Detection: A Narrative Review of Evolving Techniques and Their Navigation Applications,” IEEE Access, vol. 13, pp. 50534–50555, 2025.

DOI: https://doi.org/10.1109/ACCESS.2025.3551686

I. Atik, “Deep Learning in Military Object Detection: An Example of the Yolo-Nas Model,” 8th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2024 - Proceedings, 2024.

DOI: https://doi.org/10.1109/ISAS64331.2024.10845459

R. Varghese and M. Sambath, “A Comprehensive Review On Two-Stage Object Detection Algorithms,” iQ-CCHESS 2023 - 2023 IEEE International Conference on Quantum Technologies, Communications, Computing, Hardware and Embedded Systems Security, 2023. DOI:https://doi.org/10.1109/IQCCHESS56596.2023.10391506

J.G. Min, D. Kam, Y. Byun, G. Park and Y. Lee, “Energy-Efficient RISC-V-Based Vector Processor for Cache-Aware Structurally-Pruned Transformers,” Proceedings of the International Symposium on Low Power Electronics and Design, vol. 2023-August, 2023. DOI:https://doi.org/10.1109/ISLPED58423.2023.10244508

Z. Ouardirhi, S.A. Mahmoudi and M. Zbakh, “Enhancing Object Detection in Smart Video Surveillance: A Survey of Occlusion-Handling Approaches,” Electronics 2024, Vol. 13, Page 541, vol. 13, no. 3, p. 541, 2024.

DOI: https://doi.org/10.3390/ELECTRONICS13030541

Y. Chen and W. Zhou, “Hybrid-Attention Network for RGB-D Salient Object Detection,” Applied Sciences 2020, Vol. 10, Applied Sciences, vol. 10, no. 17, p. 5806, 2020.

DOI: https://doi.org/10.3390/APP10175806

Z. Chen, Z. Ding, X. Zhang, X. Zhang and T. Qin, “Improving Out-of-Distribution Generalization in SAR Image Scene Classification with Limited Training Samples,” Remote Sensing 2023, vol. 15, no. 24, p. 5761, 2023.

DOI: https://doi.org/10.3390/RS15245761

H. Mousazadeh et al., “Ships and Offshore Structures Dynamic and static object detection and tracking in an autonomous surface vehicle Dynamic and static object detection and tracking in an autonomous surface vehicle,” Ocean Engineering, vol. 159, pp. 56-65, 2018.

DOI: https://doi.org/10.1016/j.oceaneng.2018.04.018

S. Feng, B. Sebastian and P. Ben-Tzvi, “A collision avoidance method based on deep reinforcement learning,” Robotics, vol. 10, no. 2, 2021.

DOI: https://doi.org/10.3390/robotics10020073

Article Sidebar

Main Article Content

Abstract

Article Details

References