Call for Paper - January 2024 Edition
IJCA solicits original research papers for the January 2024 Edition. Last date of manuscript submission is December 20, 2023. Read More

Object Detection in Video Frames using Deep Learning

International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2022
Krishna Kumar, Krishan Kumar, C.L.P. Gupta

Krishna Kumar, Krishan Kumar and C L P Gupta. Object Detection in Video Frames using Deep Learning. International Journal of Computer Applications 183(51):33-39, February 2022. BibTeX

	author = {Krishna Kumar and Krishan Kumar and C.L.P. Gupta},
	title = {Object Detection in Video Frames using Deep Learning},
	journal = {International Journal of Computer Applications},
	issue_date = {February 2022},
	volume = {183},
	number = {51},
	month = {Feb},
	year = {2022},
	issn = {0975-8887},
	pages = {33-39},
	numpages = {7},
	url = {},
	doi = {10.5120/ijca2022921930},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}


The object detection based on deep learning is an important application like scene understanding, video surveillance, robotics, self-driving systems etc. in deep learning method which is eminent by its strong capability of feature learning and feature representation compared with the traditional object detection methods. With the rapid development in deep learning, more powerful tools, able to learn semantic, high-level, deeper features, are introduced to address the problems existing in traditional architectures. These models perform differently in network architecture, training strategy and optimization function. This paper introduces the classical methods for object detection and illustrates the relation and difference between the classical methods and the deep learning methods for object detection. Moreover, it introduces the appearance of the object detection based on deep learning elaborates the most typical methods nowadays via deep learning. The paper focuses on the framework design and the working principle of the models and examines the model performance in the real-time environment and hence for the accuracy of object detection. Furthermore, a survey of several specific tasks including salient object detection features, face detection and pedestrian detection has also been briefly discussed. Finally, the main challenges in object detection using deep learning and some solutions for reference has been discussed.


  1. Shine L, Jiji CV (2020) Automated detection of helmet on motorcyclists from traffic surveillance videos: a comparative analysis using hand-crafted features and CNN. Multimed Tools Appl. https :// 2-020-08627 -w
  2. Liu J, Yang Y, Lv S, Wang J, Chen H et al (2019) Attention-based BiGRU-CNN for Chinese question classification. J Ambient Intell Humaniz Comput. https :// 2-019-01344 -9
  3. Cao D, Zhu M, Gao L et al (2019) An image caption method based on object detection. Multimed Tools Appl 78(24):35329–35350.
  4. Patel, D., & Gautam, P. K. (2015). A Review Paper on Object Detection for Improve the Classification Accuracy and Robustness using different Techniques. International Journal of Computer Applications, vol. 112, no 11, pp- 975, 8887.
  5. Papageorgiou, C. P., Oren, M., &Poggio, T. (1998). A general framework for object detection. In Sixth International Conference on Computer Vision (IEEE Cat. No. 98CH36271), IEEE, vol 5, no.2, pp. 555-562).
  6. Schapire, R. E. (2003). The boosting approach to machine learning: An overview. In Nonlinear estimation and classification, Springer, New York, NY, vol 2, no.3, pp. 149-171.
  7. Parekh, H. S., Thakore, D. G., &Jaliya, U. K. (2014). A survey on object detection and tracking methods. International Journal of Innovative Research in Computer and Communication Engineering, vol2, no. 2, pp. 2970-2979.
  8. Awan, S. (2014). Object Class Recognition Using Global Shape Descriptors in 3D (Doctoral dissertation).,vol 2, no.1, pp. 345- 389.
  9. Mashak, S. V., Hosseini, B., Mokji, M., & Abu-Bakar, S. A. R. (2010). Background subtraction for object detection under varying environments. In 2010 International Conference of Soft Computing and Pattern Recognition IEEE, vol 3, no.6, pp. 123- 126.
  10. Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, vol2, no. 4, pp. 580-587.
  11. Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, vol 3, no.2, pp. 580-587.
  12. Ce Li, Yachao Zhang and YanyunQu, “Object Detection Based on Deep Learning of Small Samples,”
  13. International Conference, pp.1-6, March 2018.
  14. Cong Tang, YunsongFeng, Xing Yang, Chao Zheng and Yuanpu Zhou, “The Object Detection Based on Deep Learning,” International Conference, pp.1-6, 2017.
  15. Christian Szegedy, Alexander Toshev and DumitruErhan, “Deep Neural Networks for Object Detection,” IEEE, pp.1-9, 2007.
  16. Xiaogang Wang, “Deep Learning in Object Recognition, Detection, and Segmentation,” IEEE, pp.1-40, Apr. 2014.
  17. Shuai Zhang, Chong Wang and Shing-Chow Chan, “New Object Detection, Tracking, and Recognition Approaches for Video Surveillance Over Camera Network,” IEEE SENSORS JOURNAL, vol. 15, no.69, pp. 1-13, May 2015.
  18. Xiao Ma, Ke Zhou and JiangfengZheng, “Photo- Realistic Face Age Progression/Regression Using a
  19. Single Generative Adversarial Network,” Neurocomputing, Elsevier B.V., pp.1-16, July 2019.
  20. R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. CVPR, 2014. 1, 2
  21. Zhong-Qiu Zhao, PengZheng, Shou-taoXu and Xindong Wu, “Object Detection with Deep Learning: A Review,” IEEE, pp.1-21, 2019.
  22. Sandeep Kumar, AmanBalyan and ManviChawla, “Object Detection and Recognition in Images,” IJEDR, pp.1-6, 201


Object detection, deep learning, framework design, model performance.