Deep learning based object recognition in video sequences

A, Ashwith; Nasreen, Dr. Azra; Sethuram, Anurag; G, Dr. Shobha; S. Iyengar, Dr. S.

doi:https://dx.doi.org/10.12785/ijcds/11014

Journals About us Ethics and Policies Objectives Values Contact us

UOB Journals
→
02. International Journal of Computing and Digital Systems
→
Volume 11
→
Issue 01
→
View Item

dc.contributor.author	A, Ashwith
dc.contributor.author	Nasreen, Dr. Azra
dc.contributor.author	Sethuram, Anurag
dc.contributor.author	G, Dr. Shobha
dc.contributor.author	S. Iyengar, Dr. S.
dc.date.accessioned	2021-08-18T22:15:47Z
dc.date.available	2021-08-18T22:15:47Z
dc.date.issued	2021-08-19
dc.identifier.issn	2210-142X
dc.identifier.uri	https://journal.uob.edu.bh:443/handle/123456789/4448
dc.description.abstract	Object recognition and tracking in videos is a field of research with extremely high potential and utility. Application of Machine Learning in this field is relatively recent with many algorithms being proposed to do so. Detecting and tracking objects in videos using TensorFlow (CNN) is a relatively recent approach. This paper proposes a methodology to detect and track a specific class of object in a given video using Convolutional Neural Networks (CNNs). The CNNs used for this project are the TensorFlow SSD Model and the TensorFlow Inception Model. The use case of airplane detection was incorporated as a test subject in this project although the concept is widely extensible to any class of object per the application. Object recognition has important applications in physical security systems including intrusion detection. For the SSD Model, images of planes were downloaded and were annotated using bounding boxes to identify regions of interest. Images were split into training and test sets, after which TensorFlow specific records were generated for training and test sets. For the Inception model, the last layer of the Neural Network was trained with multiple images of Planes and Random images to essentially obtain a classifier for classifying Planes and No-Planes. The model was tested and compared with the SSD model on multiple criteria. The TensorFlow SSD model was accurate, generating crisp bounding boxes with relatively high accuracy. The number of false positives and false negatives were very low. The TensorFlow Inception Model had a higher accuracy than the TensorFlow SSD model in terms of false positives and false negatives. However, the model does not display bounding boxes since the model isn’t meant to find the region of interest. Both are good offline models for a specific class of object recognition. The GPU clearly outperformed CPU in Training and Testing by a wide margin. The TensorFlow Inception model is suitable to extract frames in which specific object is present if the position of the object is not of importance. The SSD model is suitable if the specific object needs to be detected with its position in a video frame	en_US
dc.language.iso	en	en_US
dc.publisher	University of Bahrain	en_US
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	TensorFlow	en_US
dc.subject	Object Recognition	en_US
dc.subject	Bounding Boxes	en_US
dc.subject	CNN	en_US
dc.subject	Neural Network	en_US
dc.subject	SSD	en_US
dc.subject	Inception	en_US
dc.subject	Plane	en_US
dc.subject	CPU	en_US
dc.subject	GPU	en_US
dc.title	Deep learning based object recognition in video sequences	en_US
dc.identifier.doi	https://dx.doi.org/10.12785/ijcds/11014
dc.contributor.authorcountry	India	en_US
dc.contributor.authorcountry	India	en_US
dc.contributor.authorcountry	India	en_US
dc.contributor.authorcountry	India	en_US
dc.contributor.authorcountry	Miami	en_US
dc.contributor.authoraffiliation	Department of Computer Science and Engineering, RV College of Engineeing, Bangalore	en_US
dc.contributor.authoraffiliation	Department of Computer Science and Engineering, RV College of Engineeing, Bangalore	en_US
dc.contributor.authoraffiliation	Department of Computer Science and Engineering, RV College of Engineeing, Bangalore	en_US
dc.contributor.authoraffiliation	Department of Computer Science and Engineering, RV College of Engineeing, Bangalore	en_US
dc.contributor.authoraffiliation	Ryder Professor of Computer Science and Director of the School of Computing and Information Sciences, Florida International University (FIU)	en_US
dc.source.title	International Journal Of Computing and Digital System	en_US
dc.abbreviatedsourcetitle	IJCDS	en_US