Real-World Anomaly Detection in Video Using Spatio-Temporal Features Analysis for Weakly Labelled Data with Auto Label Generation

Rikin Nayak; Jitendra P. Chaudhari

doi:10.32985/ijeces.14.5.8

Authors

Rikin J. Nayak V T Patel Dept of E & C Engg, Chandubhai S Patel Institute of Technology, Charotar University of Science and Technology, Changa, Ta-Petlad, Anand, Gujarat 388421, India https://orcid.org/0000-0001-9869-6197
Jitendra P. Chaudhari Charusat Space Research and Technology Center, V T Patel Dept of E & C Engg, Chandubhai S Patel Institute of Technology, Charotar University of Science and Technology, Changa, Ta-Petlad, Anand, Gujarat 388421, India

DOI:

https://doi.org/10.32985/ijeces.14.5.8

Keywords:

anomaly detection, spatio-temporal analysis, 3d convolutional neural network, multi-instance learning

Abstract

Detecting anomalies in videos is a complex task due to diverse content, noisy labeling, and a lack of frame-level labeling. To address these challenges in weakly labeled datasets, we propose a novel custom loss function in conjunction with the multi-instance learning (MIL) algorithm. Our approach utilizes the UCF Crime and ShanghaiTech datasets for anomaly detection. The UCF Crime dataset includes labeled videos depicting a range of incidents such as explosions, assaults, and burglaries, while the ShanghaiTech dataset is one of the largest anomaly datasets, with over 400 video clips featuring three different scenes and 130 abnormal events. We generated pseudo labels for videos using the MIL technique to detect frame-level anomalies from video-level annotations, and to train the network to distinguish between normal and abnormal classes. We conducted extensive experiments on the UCF Crime dataset using C3D and I3D features to test our model's performance. For the ShanghaiTech dataset, we used I3D features for training and testing. Our results show that with I3D features, we achieve an 84.6% frame-level AUC score for the UCF Crime dataset and a 92.27% frame-level AUC score for the ShanghaiTech dataset, which are comparable to other methods used for similar datasets.

Real-World Anomaly Detection in Video Using Spatio-Temporal Features Analysis for Weakly Labelled Data with Auto Label Generation

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Information

Make a Submission

JCR Impact factor for 2024

0.9