University of Bahrain
Scientific Journals

Deep learning based object recognition in video sequences

Show simple item record

dc.contributor.author A, Ashwith
dc.contributor.author Nasreen, Dr. Azra
dc.contributor.author Sethuram, Anurag
dc.contributor.author G, Dr. Shobha
dc.contributor.author S. Iyengar, Dr. S.
dc.date.accessioned 2021-08-18T22:15:47Z
dc.date.available 2021-08-18T22:15:47Z
dc.date.issued 2021-08-19
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/4448
dc.description.abstract Object recognition and tracking in videos is a field of research with extremely high potential and utility. Application of Machine Learning in this field is relatively recent with many algorithms being proposed to do so. Detecting and tracking objects in videos using TensorFlow (CNN) is a relatively recent approach. This paper proposes a methodology to detect and track a specific class of object in a given video using Convolutional Neural Networks (CNNs). The CNNs used for this project are the TensorFlow SSD Model and the TensorFlow Inception Model. The use case of airplane detection was incorporated as a test subject in this project although the concept is widely extensible to any class of object per the application. Object recognition has important applications in physical security systems including intrusion detection. For the SSD Model, images of planes were downloaded and were annotated using bounding boxes to identify regions of interest. Images were split into training and test sets, after which TensorFlow specific records were generated for training and test sets. For the Inception model, the last layer of the Neural Network was trained with multiple images of Planes and Random images to essentially obtain a classifier for classifying Planes and No-Planes. The model was tested and compared with the SSD model on multiple criteria. The TensorFlow SSD model was accurate, generating crisp bounding boxes with relatively high accuracy. The number of false positives and false negatives were very low. The TensorFlow Inception Model had a higher accuracy than the TensorFlow SSD model in terms of false positives and false negatives. However, the model does not display bounding boxes since the model isn’t meant to find the region of interest. Both are good offline models for a specific class of object recognition. The GPU clearly outperformed CPU in Training and Testing by a wide margin. The TensorFlow Inception model is suitable to extract frames in which specific object is present if the position of the object is not of importance. The SSD model is suitable if the specific object needs to be detected with its position in a video frame en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.rights Attribution-NonCommercial-NoDerivatives 4.0 International *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/4.0/ *
dc.subject TensorFlow en_US
dc.subject Object Recognition en_US
dc.subject Bounding Boxes en_US
dc.subject CNN en_US
dc.subject Neural Network en_US
dc.subject SSD en_US
dc.subject Inception en_US
dc.subject Plane en_US
dc.subject CPU en_US
dc.subject GPU en_US
dc.title Deep learning based object recognition in video sequences en_US
dc.identifier.doi https://dx.doi.org/10.12785/ijcds/11014
dc.contributor.authorcountry India en_US
dc.contributor.authorcountry India en_US
dc.contributor.authorcountry India en_US
dc.contributor.authorcountry India en_US
dc.contributor.authorcountry Miami en_US
dc.contributor.authoraffiliation Department of Computer Science and Engineering, RV College of Engineeing, Bangalore en_US
dc.contributor.authoraffiliation Department of Computer Science and Engineering, RV College of Engineeing, Bangalore en_US
dc.contributor.authoraffiliation Department of Computer Science and Engineering, RV College of Engineeing, Bangalore en_US
dc.contributor.authoraffiliation Department of Computer Science and Engineering, RV College of Engineeing, Bangalore en_US
dc.contributor.authoraffiliation Ryder Professor of Computer Science and Director of the School of Computing and Information Sciences, Florida International University (FIU) en_US
dc.source.title International Journal Of Computing and Digital System en_US
dc.abbreviatedsourcetitle IJCDS en_US


Files in this item

The following license files are associated with this item:

This item appears in the following Issue(s)

Show simple item record

Attribution-NonCommercial-NoDerivatives 4.0 International Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International

All Journals


Advanced Search

Browse

Administrator Account