Progress in Object Detection: An In-Depth Analysis of Methods and Use Cases
##plugins.themes.bootstrap3.article.main##
Object detection, a fundamental task in computer vision, involves identifying and localizing objects within images or videos. This paper provides a comprehensive review of traditional and deep learning-based object detection techniques and their applications, challenges, and future directions. We first discuss traditional object detection methods, which rely on handcrafted features and classical machine learning algorithms. We then explore the advancements brought by deep learning, including convolutional neural networks (CNNs) and transformer-based architectures, which have significantly improved the accuracy and efficiency of object detection tasks. A thorough comparison and evaluation of different object detection techniques are presented, considering performance metrics, speed, and robustness to object size, orientation, and occlusion variations. We also examine the diverse applications of object detection across various domains, such as robotics, autonomous vehicles, surveillance, medical imaging, and augmented reality. We outline open challenges and future research directions, emphasizing the need to combine object detection with other tasks, develop few-shot and zero-shot learning approaches, and address issues related to fairness, accountability, and transparency. This paper aims to comprehensively review the most prominent object detection techniques, their evolution, and their applications in diverse domains. We discussed traditional methods and recent deep learning-based approaches, emphasizing their strengths and limitations.
Downloads
Download data is not yet available.
References
-
Wen L, Cheng Y, Fang Y, Li X. A comprehensive survey of oriented object detection in remote sensing images. Expert Syst Appl, 2023; 224: 119960.
Google Scholar
1
-
Hu D, Li S, Wang M. Object detection in hospital facilities: A comprehensive dataset and performance evaluation. Eng Appl Artif Intell, 2023; 123: 106223. doi: 10.1016/J.ENGAPPAI.2023.106223.
Google Scholar
2
-
Zhang J, Xia K, Huang Z, Wang S, Akindele RG. ETAM: Ensemble transformer with attention modules for detection of small objects. Expert Syst Appl, 2023; 224: 119997.
Google Scholar
3
-
Sai Prasanna GV, Pavani K, Singh MK. Spliced images detection by using Viola-Jones algorithms method. Mater Today Proc, 2022; 51: 924?927.
Google Scholar
4
-
Kordelas G, Daras P. Viewpoint independent object recognition in cluttered scenes exploiting ray-triangle intersection and SIFT algorithms. Pattern Recognit, 2010; 43(11): 3833?3845.
Google Scholar
5
-
Lowe DG. Object recognition from local scale-invariant features. In Proceedings of 7th IEEE international conference on computer vision, 1999; 2: 1150-1157.
Google Scholar
6
-
Feng C, Cao Z, Xiao Y, Fang Z, Zhou JT. Multi-spectral template matching based object detection in a few-shot learning manner. Inf Sci (N Y), 2023; 624: 20?36.
Google Scholar
7
-
Zhao X, Cheah CC. BIM-based indoor mobile robot initialization for construction automation using object detection. Autom Constr, 2023; 146: 104647.
Google Scholar
8
-
Choi JD, Kim MY. A sensor fusion system with thermal infrared camera and LiDAR for autonomous vehicles and deep learning based object detection. ICT Express, 2022.
Google Scholar
9
-
Srinivas K, Singh L, Chavva SR, Dappuri B, Chandrasekaran S, Qamar S. Multi-modal cyber security based object detection by classification using deep learning and background suppression techniques. Computers and Electrical Engineering, 2022; 103: 108333.
Google Scholar
10
-
Salman ME, ?akirsoy ?akar G, Azimjonov J, K?sem M, Cedi?mo?lu ?H. Automated prostate cancer grading and diagnosis system using deep learning-based Yolo object detection algorithm. Expert Syst Appl, 2022; 201: 117148.
Google Scholar
11
-
Napier T, Lee I. Using mobile-based augmented reality and object detection for real-time Abalone growth monitoring. Comput Electron Agric, 2023; 207: 107744.
Google Scholar
12
-
Roy AM, Bhaduri J, Kumar T, Raj K. WilDect-YOLO: An efficient and robust computer vision-based accurate object localization model for automated endangered wildlife detection. Ecol Inform, 2023; 75: 101919.
Google Scholar
13
-
Santra B, Mukherjee DP. A comprehensive survey on computer vision based approaches for automatic identification of products in retail store. Image Vis Comput, 2019; 86: 45?63.
Google Scholar
14
-
Wang H, Peng Z, Liu J, Zheng YP, Liao B, Wang Y. Feature points detection and tracking based on SIFT combining with KLT method. International Conference on Optical Instruments and Technology; 2009.
Google Scholar
15
-
Micheal A. Comparative analysis of SIFT and SURF on KLT tracker for UAV applications. [Internet]. 2017. Retrieved from: ieeexplore.ieee.org.
Google Scholar
16
-
Song P, Li P, Dai L, Wang T, Chen Z. Boosting R-CNN: Reweighting R-CNN samples by RPN?s error for underwater object detection. Neurocomputing, 2023; 530: 150?164.
Google Scholar
17
-
Wang Z, Ling Y, Wang X, Meng D, Nie L, An G, Wang X. An improved Faster R-CNN model for multi-object tomato maturity detection in complex scenarios. Ecological Informatics, 2022; 72: 101886. https://doi.org/10.1016/j.ecoinf.2022.101886.
Google Scholar
18
-
Mani VRS, Saravanaselvan A, Arumugam N. Performance comparison of CNN, QNN and BNN deep neural networks for real-time object detection using ZYNQ FPGA node. Microelectronics J, 2022; 119: 105319.
Google Scholar
19
-
Yi D, Su J, Chen WH. Probabilistic faster R-CNN with stochastic region proposing: Towards object detection and recognition in remote sensing imagery. Neurocomputing, 2021; 459: 290?301.
Google Scholar
20
-
Ren S, He K, Girshick R, Sun J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. arXiv (Cornell University), 2015. https://doi.org/10.48550/arxiv.1506.01497.
Google Scholar
21
-
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C, Berg AC. SSD: Single Shot MultiBox Detector. In Lecture Notes in Computer Science, 2016: 21?37. https://doi.org/10.1007/978-3-319-46448-0_2.
Google Scholar
22
-
Gu G, Gan S, Deng J, Du Y, Qiu Z, Liu J, Liu C, Zhao J. Automated diatom detection in forensic drowning diagnosis using a single shot multibox detector with plump receptive field. Applied Soft Computing, 2022; 122: 108885. https://doi.org/10.1016/j.asoc.2022.108885.
Google Scholar
23
-
Redmon J, Divvala S, Girshick R, Farhadi A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016: 779-788.
Google Scholar
24
-
Lin T-Y, Goyal P, Girshick R, He K, Doll?r P. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision, 2017: 2980-2988.
Google Scholar
25
-
Chen S, Zhao J, Zhou Y, Wang H, Yao R, Li-Xu Z, Xue Y. Info-FPN: An Informative Feature Pyramid Network for object detection in remote sensing images. Expert Systems With Applications, 2023; 214: 119132. https://doi.org/10.1016/j.eswa.2022.119132.
Google Scholar
26
-
Li S, Zeng J, Liu Y, Cui X, Liu J, Huang T, Xu J. CLS-DETR: A DETR-series object detection network using classification information to accelerate convergence. Pattern Recognition Letters, 2023; 165: 168?175. https://doi.org/10.1016/j.patrec.2022.12.016.
Google Scholar
27
-
Zhao H, Wang J, Dai D, Lin S, Chen Z. D-NMS: A dynamic NMS network for general object detection. Neurocomputing, 2022; 512: 225?234. https://doi.org/10.1016/j.neucom.2022.09.080.
Google Scholar
28