Shoplifting Detection System

Deep learning-based video classification system for detecting shoplifting behavior in surveillance footage using 3D Convolutional Neural Networks (R3D-18). Analyzes 16-frame video clips at 224x224 resolution for binary classification of shoplifting vs normal activity. Trained on 183 videos (90 normal + 93 shoplifting).

The system processes surveillance video in real-time, extracting temporal features across consecutive frames to understand human actions and identify suspicious shoplifting behavior. The R3D-18 architecture, pretrained on the large-scale Kinetics-400 dataset, provides robust spatiotemporal feature extraction that generalizes well to retail surveillance scenarios.

Key Features

R3D-18 (3D ResNet-18) pretrained on Kinetics-400
16-frame video clip analysis at 224x224
Binary classification (shoplifting vs non-shoplifting)
183-video training dataset (80/20 split)
CUDA GPU acceleration support
Annotated output video with predictions and confidence scores