SSIVD-NET：一种新型的显着超级图像分类和武器暴力检测技术

论文标题

SSIVD-NET：一种新型的显着超级图像分类和武器暴力检测技术

SSIVD-Net: A Novel Salient Super Image Classification & Detection Technique for Weaponized Violence

论文作者

Aremu, Toluwani, Zhiyuan, Li, Alameeri, Reem, Khan, Mustaqeem, Saddik, Abdulmotaleb El

论文摘要

在闭路电视（CCTV）镜头中发现暴力和武器暴力行为需要全面的方法。在这项工作中，我们介绍了\ emph {智能城市CCTV暴力检测（SCVD）}数据集，该数据集是专门为促进监视视频中武器分配的学习而设计的。为了解决分析3D监视视频的暴力识别任务的复杂性，我们提出了一种新型技术，称为\ emph {ssivd-net}（\ textbf {s} alient- \ textbf {s textbf {s} s} s}我们的方法降低了3D视频数据的复杂性，维度和信息丢失，同时通过突出的苏图像表示提高了推论，性能和解释性。考虑到未来派智能城市的可伸缩性和可持续性要求，作者介绍了\ emph {Squient-Clalerifier}，这是一种新颖的体系结构，将内核方法与残留学习策略相结合。我们评估了SSIVD-NET和显着分类器在我们的SCVD数据集上的变化，并针对暴力检测中常见的最新模型（SOTA）模型评估了基准。我们的方法在检测武器化和非武器的暴力实例方面表现出重大改进。通过在暴力检测中推进SOTA，我们的工作提供了适用于现实世界应用的实用和可扩展解决方案。提出的方法不仅解决了CCTV录像中暴力发现的挑战，而且还有助于理解智能监视中的武器分配。最终，我们的研究结果应使更聪明，更安全的城市以及提高公共安全措施。

Detection of violence and weaponized violence in closed-circuit television (CCTV) footage requires a comprehensive approach. In this work, we introduce the \emph{Smart-City CCTV Violence Detection (SCVD)} dataset, specifically designed to facilitate the learning of weapon distribution in surveillance videos. To tackle the complexities of analyzing 3D surveillance video for violence recognition tasks, we propose a novel technique called \emph{SSIVD-Net} (\textbf{S}alient-\textbf{S}uper-\textbf{I}mage for \textbf{V}iolence \textbf{D}etection). Our method reduces 3D video data complexity, dimensionality, and information loss while improving inference, performance, and explainability through salient-super-Image representations. Considering the scalability and sustainability requirements of futuristic smart cities, the authors introduce the \emph{Salient-Classifier}, a novel architecture combining a kernelized approach with a residual learning strategy. We evaluate variations of SSIVD-Net and Salient Classifier on our SCVD dataset and benchmark against state-of-the-art (SOTA) models commonly employed in violence detection. Our approach exhibits significant improvements in detecting both weaponized and non-weaponized violence instances. By advancing the SOTA in violence detection, our work offers a practical and scalable solution suitable for real-world applications. The proposed methodology not only addresses the challenges of violence detection in CCTV footage but also contributes to the understanding of weapon distribution in smart surveillance. Ultimately, our research findings should enable smarter and more secure cities, as well as enhance public safety measures.

下载PDF全文

下载文献需遵守相关版权规定

论文标题