Received May 28, 2020, accepted June 22, 2020, date of publication June 29, 2020, date of current version July 7, 2020. Digital Object Identifier 10.1109/ACCESS.2020.3005633
Simultaneous and Spatiotemporal Detection of Different Levels of Activity in Multidimensional Data

NATALIE M. SOMMER 1 (Member, IEEE), SENEM VELIPASALAR 1 (Senior Member, IEEE), LEANNE HIRSHFIELD 2, YANTAO LU 1 (Student Member, IEEE), AND BURAK KAKILLIOGLU 1 (Student Member, IEEE)

1 Department of Electrical Engineering and Computer Science, Syracuse University, Syracuse, NY 13244, USA
2 Institute of Cognitive Science, University of Colorado at Boulder, Boulder, CO 80309, USA

Corresponding author: Natalie M. Sommer (nmsommer@syr.edu)

This work was supported in part by the National Science Foundation (NSF) under Grant 1739748 and Grant 1816732, and in part by the Advanced Research Projects Agency-Energy (ARPA-E), U.S. Department of Energy, under Award DE-AR0000940.
ABSTRACT In this work, we present a novel and promising approach to autonomously detect different levels of simultaneous and spatiotemporal activity in multidimensional data. We introduce a new multilabeling technique, which assigns different labels to different regions of interest in the data, and thus incorporates the spatial aspect. Each label is built to describe the level of activity/motion to be monitored in the spatial location that it represents, in contrast to existing approaches providing only a binary result as the presence or absence of activity. This novel Spatially and Motion-Level Descriptive (SMLD) labeling schema is combined with a Convolutional Long Short Term Memory-based network for classification to capture different levels of activity both spatially and temporally without the use of any foreground or object detection. The proposed approach can be applied to various types of spatiotemporal data captured for completely different application domains. In this paper, it was evaluated on video data as well as respiratory sound data. Metrics commonly associated with multilabeling, namely Hamming Loss and Subset Accuracy, as well as confusion matrix-based measurements, are used to evaluate performance. Promising testing results are achieved with an overall Hamming Loss for video datasets close to 0.05, Subset Accuracy close to 80% and confusion matrix-based metrics above 0.9. In addition, our proposed approach's ability in detecting frequent motion patterns based on predicted spatiotemporal activity levels is discussed. Encouraging results have been obtained on the respiratory sound dataset as well, while detecting abnormalities in different parts of the lungs. The experimental results demonstrate that the proposed approach can be applied to various types of spatiotemporal data captured for different application domains.

INDEX TERMS LSTM, convolutional LSTM, machine learning, multilabeling, activity detection, spatiotemporal, label imbalance, subset accuracy, Hamming Loss, precision, recall, F-score.
I. INTRODUCTION
Establishing a way of detecting simultaneous activity in different regions of multidimensional data with spatiotemporal characteristics can be beneficial for a wide range of application domains. For example, it can provide improvements in autonomous surveillance systems. Let us consider maritime surveillance for national security [32], [33]. Being able to detect a higher level of activity in regions of interest in SAR (Synthetic Aperture Radar) imagery may trigger other parts of the surveillance system or prompt an alert for human supervision. In general, integrating an algorithm which can automate the process of surveillance by simultaneously detecting anomalies in different regions of interest captured through video can enhance security and provide peace of mind. In a survey of recent research related to intelligent surveillance monitoring techniques, Sreenu and Durai [34] review different methods to discern unusual motion in crowd analysis.
Jiang et al. [35] focus primarily on detecting anomalies in vehicular traffic at an intersection and consider both single object anomalies and co-occurring anomalies. The last two examples represent models which are trained to specifically detect either pedestrians, in the first case, or vehicles, in the second case. Our model was not built to detect the motion of specific objects, but general activity. That is why our method can be applied to domains beyond video surveillance, such as Neuroscience and other computer vision applications. For the former, we can gain a better understanding of the concurrent use of different functional cognitive brain regions, which can help us to understand the types of cognitive load experienced by the user of an adaptive system. For example, an aircraft's autopilot could provide the pilot with a visual overlay to support them in flight if it is determined that they have a high working memory load. However, the same system may choose to provide this information through the auditory channel via a speaker in the cockpit if it is determined that the pilot is experiencing both high working memory load and high visual perceptual load [45], [46]. For the latter, detecting simultaneous and various activity levels in different parts of video frames can lead to an improvement in event annotation [47], [48]. Ballan et al. [47] emphasize the importance of detecting multiple events happening at different times and locations throughout various types of videos as key to video annotation. Zhao et al. [48] use concurrent group activity classification to achieve their best results in annotating mobile videos. Similarly, Liu et al. [60] support the necessity of simultaneous event detection in surveillance videos through the use of multilabeling. In this paper, a novel and promising approach to detecting different levels of simultaneous and spatiotemporal activity in multidimensional data will be presented through the use of a multilabeling technique initially inspired by León's research [12], wherein EEG data is used to look at levels of activation in different regions of the brain simultaneously. The goal was to help people with disabilities (often paralyzed) to control their environment simply by thinking about moving fingers on one or both hands. Most publications on spatiotemporal tracking from videos from the last five years do not use this type of approach [36]–[40]. The use of multiple labels to represent multiple actions has grown in popularity due to the interest in detecting and recognizing simultaneous activity in videos. For example, concurrent action recognition in hockey videos [1] could indicate that a 'Play', 'Face Off' and 'Fight' took place at the same time. Similarly, the ability to tag multiple facial expressions in videos can be accomplished using multilabels to detect emotions, a crucial component of HCI [2]. However, in all of the aforementioned references, an action or a combination of actions is assigned to a label, which in turn is given a binary assignment representing its absence (0) or its presence (1). In addition to the fact that our interest lies in monitoring non-binary levels of movement in multidimensional data, the essence of our multilabel technique is also unique from a spatial perspective. Previous work using multilabels to classify multidimensional data, while also interested in recognizing simultaneous actions, focused on describing specific types of actions, scenes and objects.
In addition, the multilabels used in previous work do not offer information about the locations of the actions, i.e., they do not address the spatial aspect. For example, Monfort et al. [7] add multilabels to their Moments in Time Dataset to be able to assign a set of distinct actions concurrently to three-second videos. Similarly, Yeung et al. [8] use multilabels to add detail to the description of human actions. For example, an extra label makes it possible to distinguish a video of two people sitting and talking (multilabel: Sit, Talk) versus two people standing and talking (multilabel: Stand, Talk). Ray et al. [9] expand their multilabels to include descriptions of background scenes and objects. In all three of these multilabeled datasets, labels are chosen from a pool of choices to describe the events in the entire frame of a video. As mentioned above, these labels do not provide location information. The contributions of the proposed work include the following: i) we incorporate a novel spatial aspect to our multilabel by assigning each label to a different region of interest. To the best of our knowledge, this is a unique way of handling the description of concurrent actions within a spatial context; ii) our proposed approach is able to detect different levels of activity, in contrast to existing approaches providing only a binary result as the presence or absence indicator [1], [2], [12]. Instead of simply assigning a 0 (absence of motion) or a 1 (presence of motion) to each label, we can assign, without loss of generality, a 0 (absence of motion), 1 (low level of motion) or 2 (high level of motion) to describe the level of motion in different regions of interest at a certain moment in time; iii) our approach is not specific to or fine-tuned for one type of application, but instead is suitable for different types of spatiotemporal data, ranging from videos to brain signals to lung sounds captured over time. To demonstrate this, we evaluated our approach on video data as well as on respiratory sounds, more specifically detecting the presence/absence of wheezing and crackling sounds in the lungs. Also, in aerial maritime surveillance, it is advantageous to be able to detect concurrent anomalies in different regions of interest such as a coastal area. Moreover, the ability to not only detect presence, but also describe the level of activity in a region offers an additional facet to the alert that such a schema could provide in detecting motion. Therefore, designing a labeling technique that provides the option of assigning a non-binary level of activity to each label is necessary.
FIGURE 1. Examples of Maritime Surveillance Regions of Interest and Presence of Concurrent Activity.
For example, in Fig. 1, we offer two possible maritime surveillance scenarios.
On the left, we consider two regions of interest to monitor activity and, on the right, the number increases to six to accommodate the complexity of the coastal area. In both cases, the use of a multilabeling schema which allows for the concurrent detection of different levels of activity (i.e., number of ships) in separate regions of interest would add to the ability of a surveillance system to identify specific patterns of activity. For example, concurrent activity or events at specific ports (designated as regions of interest) could indicate that some form of nefarious trafficking is occurring, and the Coast Guard could be notified. To achieve spatially and motion-level descriptive classification outputs, we employ a deep learning approach, more specifically a convolutional and recurrent Long Short Term Memory (LSTM) network, as the core of our model. The need for object detection or region proposal generation is avoided with the proposed method. In summary, we propose an unconventional, though promising, method of using multilabels to detect different levels of spatiotemporal motion in multidimensional data. In this paper, without loss of generality, we will apply our proposed approach to video data as an example. As mentioned above, the approach we propose can be applied to different types of spatiotemporal data captured for completely different application domains. In the context of videos, our multilabel vector represents distinct spatial regions (i.e., regions of interest) of an image frame, unlike common uses of the multilabel vector as a representation of the presence or absence of multiple descriptors of interest [3]–[5]. Each region is assigned one of three possible levels of activity to train and validate our deep learning model. We also address label imbalance, which is a common problem that is magnified in multilabeling, as testing results are presented. The remainder of this paper is organized as follows: The related work is discussed in Section II. The proposed approach and the network model are presented in Section III. The datasets, labeling structure and the evaluation criteria are described in Section IV. The experimental results are presented in Section V. An example of applying our method to another type of spatiotemporal data along with a comparison to published research is discussed in Section VI. The paper is then concluded in Section VII.

II. RELATED WORK
The advent of multilabeling is often linked to the realization that a single category is insufficient for describing multimedia resources [6]. In particular, the need to assign simultaneous categories to videos is evidenced in the complexity of our environment, and plays an important role in artificial intelligence [7]–[9]. A conventional multiclass classification algorithm assumes that there is only one label to be used in the description of an event. Since the choice for the label is from a set of multiple descriptors (classes), the term multiclass is used [10]. Oftentimes, multilabels are treated as subsets of multiclass labeling [3]–[5]. When it comes to dealing with label imbalance in the multilabel domain, there are two generally accepted approaches.
In one case, the model is adapted to minimize the impact of the imbalance and, in the other case, the dataset is adjusted to minimize the labeling skew. One method that is commonly used when adapting the model is that the multilabel vector is split up into smaller sets of labels, each with its own classifier [11]–[13]. Another method involves assigning a new label to each multilabel and using multiclass classification [14], [15], commonly referred to as the Label Powerset transformation. The disadvantage of both methods resides with the loss of information about the correlation between the individual labels. The importance of gaining a better understanding of label correlation is emphasized by Zhang and Zhou [31]. Finally, another method in this category considers the occurrences of the classes for each label. The classification algorithm then assigns class weights to offset the skew between the minority and majority classes [1], [6]. The second approach to seeking a balanced set of labels is to make changes to the dataset itself. This includes using resampling techniques. Two common approaches use random undersampling and random oversampling. Applying these approaches to multilabels becomes quite complicated. Charte et al. [16] test out these two resampling techniques. The best result was obtained by randomly oversampling each label separately within the multilabels. The multilabels were assigned to various datasets with domains ranging from text to images as a preprocessing step before applying the data to different types of algorithms. Charte et al. [17] also take it one step further by proposing a SMOTE (Synthetic Minority Oversampling Technique) that is specific to multilabeled datasets (MLSMOTE). MLSMOTE is shown to outperform the other resampling techniques. Liu and Tsoumakas [18] improve the SMOTE resampling technique by considering the local distributions of labels when determining the minority class, as opposed to the global distribution of labels. This algorithm, called MLSOL, shows promising results, although it has yet to be applied to videos. Even though the outcome of using these resampling techniques leads to better balanced labels and data, there are certain disadvantages worth mentioning. In the case of undersampling, important data may be overlooked in the training process of the model. As for synthetic oversampling, creating artificial multilabels and data is quite intricate. This challenge is augmented with raw video data. Since videos are spatiotemporal in nature, we look to an algorithm which can handle both characteristics in our model design. There has been research conducted on spatiotemporal action detection. Zhu et al. [36] use a hidden two-stream network, which combines a spatial stream CNN with a MotionNet/temporal stream CNN to recognize human activity in geo-tagged videos. Similarly, Weinzaepfel et al. [37] make use of static and motion CNN features to detect actions in space and time. Also, temporal localization is accomplished with a sliding window. On the other hand, Saha et al. [39] and Yang et al. [40] use action tubes to track the actions detected by their networks. These works focus mostly on object tracking and rely on feature extraction, object detection or region proposal generation.
In our research, we are interested in detecting various levels of activity simultaneously in different spatial locations. As we have seen with the aforementioned papers, along with others [19], [20], the world of computer vision has greatly benefited from CNNs. Their application to multilabeled datasets is also common [1], [21], [22]. We also employ CNNs as part of our design. Furthermore, to address the issue of label dependencies inherent to multilabels, and the temporal information contained in videos, a combination of CNN and RNN is appropriate [22]–[25], [38]. According to Wang et al. [22], the RNN framework can focus on the corresponding image regions when predicting different labels, which is very similar to a human's multilabel classification process. This quality of an RNN makes it highly suitable as part of our deep learning process. Additionally, according to Medel and Savakis [23], the Convolutional LSTM (ConvLSTM) is deemed to be effective in modeling and predicting video sequences. Although CNNs are not developed with temporal features in mind, integrating them as part of an LSTM cell addresses this limitation. Deep learning allows the network to learn which features are important. ConvLSTM is able to temporally propagate spatial characteristics, as supported by Shi et al. [27] and Zapata-Impata et al. [49]. One of the papers reviewed by Sreenu and Durai [34] uses the ConvLSTM as the deep learning algorithm to discern unusual motion in crowd analysis [51]. However, unlike our approach to be described in the next section, the pedestrian trajectories of video inputs are represented with displacement vectors and used in training the model in lieu of the video frames. This necessitates the use of an encoder and a decoder.

III. PROPOSED METHOD
In our case, we will use a multilabel vector to represent distinct spatial regions. The spatial regions correspond to the regions of interest in the data. We determine the number of regions and the number of labels depending on the application and desired spatial resolution. For instance, in the case of video surveillance, regions of interest may depend on points of entry into the area captured by the camera. Without loss of generality, based on the motion to be detected in our video datasets, as will be illustrated in Sec. V, we visually determined that assigning four regions of interest to our video frames would be appropriate. Additionally, if the goal was to monitor the activity in a finer or coarser resolution, then the number of regions and, consequently, labels would have been adjusted. In completing our labeling schema, each region is assigned one of three possible motion level descriptors, as will be presented in Sec. IV. Accordingly, we could say that each label conforms to a multiclass description system. However, in order to avoid confusion, we will refer to our data labeling schema as Spatially and Motion-Level Descriptive (SMLD) multilabeling.
The spatially descriptive multilabels provide the ability to recognize the presence of concurrent actions in different regions of a frame/image, and the motion level identification offers a non-binary, multi-level descriptor for each label. It is important to note that the classification of the level of motion is performed independently for each region of interest. Therefore, viewing the results concurrently (i.e., in parallel) or sequentially would result in the same outcome. Although our SMLD labels offer a novel way of describing different levels of activity in spatiotemporal multidimensional data, they too, like most multilabeling schemas, are more prone to label imbalance. As noted by Dekel and Shamir [3], learning from imbalanced data is a problem intrinsically linked to multilabel classification. Several techniques to offset this problem have been proposed in the past several years [13], [15], [17]. We have developed a network model that employs ConvLSTM in order to address the spatial and temporal properties of the videos. Our labeling structure, which will be further discussed below, was designed to represent different activity levels at different spatial locations simultaneously. ConvLSTM-based networks have been used successfully in different applications, such as weather forecasting, robotic grasp slippage detection and understanding crowd behavior [27], [49], [51]. The ConvLSTM structure [50] is based on an LSTM cell with convolutional operators replacing the typical matrix multiplications. It is able to make use of a CNN's ability to extract spatial features and the LSTM's ability to extract temporal information, while avoiding the 'vanishing gradient' problem of the RNN. Unlike a standard cascaded CNN-LSTM structure, it accepts multidimensional inputs, making it suitable for spatiotemporal data applications. The ConvLSTM uses 3D tensors to learn spatial and temporal features simultaneously. The inner workings of the ConvLSTM structure are shown in Fig. 2.
FIGURE 2. ConvLSTM Structure [50].
The details of our network model, which employs a Convolutional LSTM structure and our SMLD labeling schema, are shown in Fig. 3. The parameter values have been obtained after extensive evaluation. Our model was compiled using binary cross-entropy loss and RMSprop optimization. Binary cross-entropy was a suitable choice, since the values used in our multilabeling schema are one-hot encoded. Our empirical studies have shown better convergence with RMSprop optimization than with Adam optimization. We use early stopping, which monitors the validation loss and automatically stops the training if the validation loss has not decreased for two epochs. After training, the proposed model is tested on unseen videos.
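The training setup above can be summarized in code. The following is a minimal sketch, assuming a Keras/TensorFlow implementation; the 15-frame input window, the 108 × 192 × 3 frame size, the binary cross-entropy loss, the RMSprop optimizer and the two-epoch early-stopping patience come from the text, while the layer sizes and filter counts are illustrative placeholders rather than the exact configuration of Fig. 3.

```python
# Minimal sketch of the ConvLSTM-based classifier described in the text.
# Layer widths are assumptions; only the input shape, loss, optimizer and
# early-stopping policy are taken from the paper.
import tensorflow as tf
from tensorflow.keras import layers, models

NUM_FRAMES, HEIGHT, WIDTH, CHANNELS = 15, 108, 192, 3
NUM_QUADRANTS, NUM_LEVELS = 4, 3          # four regions of interest, levels 0/1/2

inputs = tf.keras.Input(shape=(NUM_FRAMES, HEIGHT, WIDTH, CHANNELS))
x = layers.ConvLSTM2D(32, kernel_size=(3, 3), padding="same",
                      return_sequences=False)(inputs)   # spatiotemporal features
x = layers.BatchNormalization()(x)
x = layers.MaxPooling2D(pool_size=(4, 4))(x)
x = layers.Flatten()(x)
# One-hot encoded SMLD multilabel: 4 quadrants x 3 activity levels,
# trained with binary cross-entropy as stated in the text.
outputs = layers.Dense(NUM_QUADRANTS * NUM_LEVELS, activation="sigmoid")(x)

model = models.Model(inputs, outputs)
model.compile(optimizer="rmsprop", loss="binary_crossentropy")

# Early stopping on the validation loss with a patience of two epochs.
early_stop = tf.keras.callbacks.EarlyStopping(monitor="val_loss", patience=2)
# model.fit(train_generator, validation_data=val_generator,
#           epochs=50, callbacks=[early_stop])
```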
FIGURE 3. Proposed Model Structure.
A. DETECTING MOTION PATTERNS
With our proposed approach, in addition to detecting different levels of activity in separate spatial locations, it is also possible to use these predicted levels of activity to detect motion patterns from one quadrant to another. We have developed a new method that uses our detected levels of activity to autonomously detect inter-quadrant motion patterns. Therefore, another advantage of our proposed approach is that it provides the ability to detect motion patterns or frequent paths without relying on background subtraction, object detection or region proposal generation. In order to detect the most frequent paths of motion from video datasets, we implemented a pattern detection script that can capture inter-quadrant motion by only using the predicted activity levels in different quadrants of the testing videos, as sketched below.
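The paper does not list the exact rules of this script, so the following is only a minimal sketch of the idea: a transition from a source quadrant to a destination quadrant is reported when non-zero predicted activity in the source is followed, within a few seconds, by non-zero predicted activity in the destination. All function and variable names are illustrative.

```python
# Sketch of inter-quadrant pattern detection from predicted activity levels only.
from typing import Dict, List, Tuple

def detect_transition(levels: Dict[str, List[int]],
                      src: str, dst: str,
                      max_gap: int = 5) -> List[Tuple[int, int]]:
    """levels maps a quadrant name ('TL', 'TR', 'BL', 'BR') to its per-second
    predicted activity levels. Returns (start, end) second pairs where motion
    in `src` is followed by motion in `dst` within `max_gap` seconds."""
    transitions = []
    for t, level in enumerate(levels[src]):
        if level == 0:
            continue
        # look ahead for activity appearing in the destination quadrant
        for dt in range(1, max_gap + 1):
            if t + dt < len(levels[dst]) and levels[dst][t + dt] > 0:
                transitions.append((t, t + dt))
                break
    return transitions

# Example: a car moving from the bottom-right to the top-right quadrant.
predicted = {"TL": [0] * 10, "TR": [0, 0, 0, 0, 1, 1, 0, 0, 0, 0],
             "BL": [0] * 10, "BR": [0, 1, 1, 0, 0, 0, 0, 0, 0, 0]}
print(detect_transition(predicted, "BR", "TR"))
```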
IV. DATASETS, PROPOSED MULTILABELING STRUCTURE AND EVALUATION CRITERIA

In this paper, without loss of generality, we have used two different sets of hand-annotated video data, obtained from open-access surveillance video datasets, as an example scenario to evaluate our proposed approach. Again, without loss of generality, we considered each video frame to have four quadrants. Each quadrant was represented by a label in our multilabels. The decision to look at four regions of interest in our video frames suited our goal of being able to correctly classify the activity level in specific areas of a parking lot video, and then support these results by applying our model to another set of surveillance videos. The choice of the number of regions of interest is dependent on the information to be gathered from the spatiotemporal data. Depending on the spatial resolution and type of spatiotemporal data, the number of regions can be modified accordingly. If, for example, we want to focus on smaller regions, and monitor the activity in a finer resolution, then the number of regions can be increased.
However, if a smaller number of regions is appropriate, as later shown with the respiratory data in Sec. VI, our SMLD multilabeling schema can be adjusted accordingly. For the first two datasets, we employ the SMLD multilabeling schema, described below, for each second of video, which is then fed to a 'doubly deep' [38] machine learning algorithm. After learning and capturing the spatial and temporal information contained in the training videos, our classifier is then tested on unseen video sequences from these two different sets of video data, and results are compared to ground-truth labels. The datasets were prepared for training and validation by reading in the videos (frame rate = 30 fps), and resizing the frames to 192 × 108 × 3 pixels to reduce the memory requirements. Pixel values were normalized during this process. For temporal analysis via ConvLSTM, optimal results have been obtained with groups of 15 frames and a sliding window size of five frames. Each group of 15 frames was then assigned the multilabel of the middle frame in the group, i.e., the multilabel of the eighth frame in this case. Our multilabeled data was used for training using a batch generator to accommodate its size. Training was based on 80% of the data, and the remaining 20% was used for validation. We have used different subsets of videos from the VIRAT Video Dataset [26] to perform two sets of experiments, and evaluate the performance of the proposed work. The description of the video datasets, the details of labeling, and the discussion of the results will be provided in detail in Sec. V. Considering the fact that one possible application area for our proposed approach is autonomous surveillance, with videos being the multidimensional data, we have tested our spatiotemporal classification method as a first-level alert indicator. We have used a set of parking lot videos and another set of videos providing the view of a public outdoor space.
These videos, from the VIRAT Video Dataset [26], were captured by a static camera. The types of motion in these videos included cars entering and leaving the parking lot, people walking in groups and separately, trees swaying with the wind, moving shadows and glare. For our analysis, without loss of generality, we considered all the video frames to have four regions of interest, namely Top Left (TL), Top Right (TR), Bottom Left (BL) and Bottom Right (BR). Four quadrants were chosen for illustration, and if finer/coarser resolution is desired for spatial regions, this number can be increased/decreased. We then used labels of '0', '1' and '2' to describe three different levels of motion in each quadrant. The definition for each level of motion that was used in our labeling schema is shown in Table 1.

TABLE 1. Three different levels of activity are to be detected in each quadrant.
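The levels in Table 1 can be read as a simple thresholding rule on how much of a quadrant is covered by motion; Sec. V-B mentions a level-'2' threshold of 25% of the quadrant for the first dataset and 20% for the second. The snippet below only codifies that rule for illustration (labels in this work were assigned by hand), and the function name and exact rule are assumptions.

```python
# Hypothetical codification of the Table 1 levels, assuming the level is set
# by the fraction of a quadrant's area covered by motion.
def activity_level(motion_fraction: float, high_threshold: float = 0.25) -> int:
    """Map the fraction of a quadrant covered by motion to a level 0/1/2."""
    if motion_fraction == 0.0:
        return 0          # no motion
    if motion_fraction < high_threshold:
        return 1          # low level of motion
    return 2              # high level of motion
```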
Each second of video was hand-labeled and given a unique SMLD multilabel assignment with four labels, which represented the quadrants of the video frames. This SMLD multilabel was then assigned to all 30 frames within this second of video. Before training and validating our model, the video frames were grouped in chunks of 15 frames with a sliding window size of five frames and assigned a final multilabel, corresponding to the middle (eighth) frame of the 15-frame group. The optimization of our empirical results led us to choose groups of 15 frames for temporal processing. This is also supported by the research of Sager and Hoff [52], who demonstrate that using chunk sizes of approximately half of a video's frame rate leads to the best activity detection accuracy. Similarly, Sozykin et al. [1] also use a chunk size of 15 frames with a sliding window of 5 frames to classify activity in hockey videos. This process aids in increasing the number of training samples and improves continuity in tracking motion. For each set of surveillance videos, testing was performed on unseen videos from the same dataset. After testing, predictions were made for each group of frames. These predictions were a set of probabilities for each activity level assigned to the respective quadrants. The activity level with the highest prediction probability was chosen to be part of the resulting multilabel. Then, to manage the additional labels generated by the sliding window, the multilabel that occurred the most often defined the final assignment for the corresponding second of video. These final multilabels were then compared to ground truth labels.
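A minimal sketch of the grouping and label-aggregation steps described above, assuming the frames and per-frame multilabels are already available as NumPy arrays; array and function names are illustrative.

```python
import numpy as np
from collections import Counter

def make_chunks(frames: np.ndarray, labels: np.ndarray,
                chunk: int = 15, stride: int = 5):
    """frames: (num_frames, H, W, 3) normalized frames of one video.
    labels: (num_frames, 4) per-frame SMLD multilabel (one level per quadrant).
    Returns the chunked inputs and the label of each chunk's middle frame."""
    xs, ys = [], []
    for start in range(0, len(frames) - chunk + 1, stride):
        xs.append(frames[start:start + chunk])
        ys.append(labels[start + chunk // 2])   # the eighth frame of a 15-frame chunk
    return np.stack(xs), np.stack(ys)

def second_multilabel(chunk_predictions) -> tuple:
    """Majority vote over the multilabels predicted for the chunks that overlap
    a given second of video, yielding the final per-second assignment."""
    return Counter(map(tuple, chunk_predictions)).most_common(1)[0][0]
```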
A. EVALUATION CRITERIA

The analysis of the results of multilabel classification is more complex than that of single-label classification, and a standard set of metrics is yet to be established. Multilabel classification results can be instance-based or label-based. In other words, when a metric is based on an instance, we look at the entire quadruple frame-wise multilabel assigned to a second of testing video. In essence, we look at the labels concurrently. On the other hand, a metric that is label-based will consider each single quadrant-wise label in the testing results separately. In this case, the labels are viewed sequentially. Therefore, our evaluation criteria will include frame-wise quadruple label classification metrics, such as Hamming Loss and Subset Accuracy, along with single quadrant-based classification metrics, which include Micro-averaged Precision, Micro-averaged Recall and Micro-averaged F-score [29], [30], [41]. Pereira et al. [41] recommend using Hamming Loss as a measure due to its popularity in multilabel research and its ability to report a classifier's overall prediction error.

1) HAMMING LOSS
We adapted the conventional binary-based Hamming Loss computation to take into account the three possible levels of motion by comparing the predicted value for a label during an instance (a second) with the ground truth value at that time. Average Hamming Loss was calculated as shown in Eq. (1), where $S$ is the number of seconds in a testing video, and $p_{i,j}$ and $g_{i,j}$ represent the predicted label and the ground truth label, respectively. As seen in this equation, discrepancies were assigned a '1' when the predicted activity level for each label within a multilabel did not match the ground truth activity level. These values were then added and averaged over the product of the number of labels (4) in a multilabel and the time span of the video in seconds ($S$).

$$\frac{1}{4S}\sum_{i=1}^{S}\sum_{j=1}^{4}\big[\mathrm{if}(p_{i,j}=g_{i,j},\,0,\,1)\big] \tag{1}$$
2) SUBSET ACCURACY
Another instance-wise metric (i.e., a frame-based quadruple label metric in our case), presented by Pereira et al. [41], is Subset Accuracy. This measure considers how often the predicted activity levels of the SMLD multilabel matched the ground truth activity levels for each second. Despite the fact that there is no way of discerning a mismatch for 1, 2, 3 or all 4 quadrants, this measure still provides useful information in assessing the quality of the classifier. Subset Accuracy was calculated as shown in Eq. (2), where $S$ is the number of seconds in a testing video, and $P_i$ and $G_i$ represent the 4-quadrant prediction label and the 4-quadrant ground truth label, respectively.

$$\frac{1}{S}\sum_{i=1}^{S}\big[\mathrm{if}(P_i=G_i,\,1,\,0)\big] \tag{2}$$
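A minimal sketch of Eqs. (1) and (2), computed from per-second predicted and ground-truth SMLD multilabels; the array and function names are illustrative.

```python
import numpy as np

def hamming_loss(pred: np.ndarray, gt: np.ndarray) -> float:
    """pred, gt: (S, 4) arrays of activity levels (0, 1 or 2) per second.
    Average fraction of individual quadrant labels that do not match (Eq. 1)."""
    return float(np.mean(pred != gt))

def subset_accuracy(pred: np.ndarray, gt: np.ndarray) -> float:
    """Fraction of seconds in which all four quadrant labels match (Eq. 2)."""
    return float(np.mean(np.all(pred == gt, axis=1)))
```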
3) MICRO-AVERAGED PRECISION, RECALL AND F-SCORE
Shifting our focus to label-based (i.e., single quadrant-based) metrics, we also looked at Micro-averaged Precision, Recall and F-Score, as suggested by Pereira et al. [41].
The Spearman correlation coefficient values between this set of metrics and Subset Accuracy are low, and the Spearman correlation coefficient values between this set of metrics and Hamming Loss are even lower. This is why this group of metrics provides a complete set of performance criteria. The micro-averaging metrics adapt well to the multiclass nature of our labels, since we deal with three different motion level assignments: 0, 1 and 2. For example, instead of averaging out the calculated precision/recall for each activity level ("class"), we consider the overall precision/recall for all three activity levels ("classes") at once. The calculations of Micro-Averaged Precision ($\mu AP$) and Micro-Averaged Recall ($\mu AR$) are shown in Eq. (3) and Eq. (4), respectively. In these equations, $TP_i$, $FP_i$ and $FN_i$ represent the number of True Positives, the number of False Positives and the number of False Negatives for activity level $i$, respectively, where $i \in \{0, 1, 2\}$.

$$\mu AP = \frac{TP_0 + TP_1 + TP_2}{TP_0 + TP_1 + TP_2 + FP_0 + FP_1 + FP_2} \tag{3}$$

$$\mu AR = \frac{TP_0 + TP_1 + TP_2}{TP_0 + TP_1 + TP_2 + FN_0 + FN_1 + FN_2} \tag{4}$$

The harmonic mean of Micro-Averaged Precision and Micro-Averaged Recall is the Micro-Averaged F-score.
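These label-based metrics can also be obtained with standard library routines by flattening the per-quadrant labels; the sketch below uses scikit-learn, and the authors' own implementation is not shown in the paper.

```python
# Micro-averaged precision, recall and F-score over the three activity levels,
# computed on the flattened single-quadrant labels.
from sklearn.metrics import precision_score, recall_score, f1_score

def micro_metrics(pred, gt):
    """pred, gt: (S, 4) NumPy arrays of per-quadrant activity levels (0, 1, 2)."""
    p, g = pred.ravel(), gt.ravel()        # treat every quadrant label separately
    return (precision_score(g, p, average="micro"),
            recall_score(g, p, average="micro"),
            f1_score(g, p, average="micro"))
```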
V. EXPERIMENTAL RESULTS
First, without loss of generality, we evaluated our proposed approach on different video data as an example case. The results of this evaluation are detailed in this section. Then, to demonstrate that our proposed approach can be applied to various types of spatiotemporal data captured for completely different application domains, we also evaluated it on respiratory sounds, more specifically detecting the presence/absence of wheezing and crackling sounds in the lungs. We provide results of this analysis together with a discussion in Sec. VI.
FIGURE 4. Four quadrants used to assign labels for the first set of videos.
A. FIRST SET OF EXPERIMENTS
We used a parking lot video for this set of experiments. Our labeling schema can best be described with an illustration. Suppose that Fig. 4 is the representation of what we see for one second of video.
TABLE 2. Example of our Spatially and Motion Level Descriptive (SMLD) Multilabel Schema.
TABLE 3. Initial Activity Level Distribution for the First Dataset.
In the Top Left quadrant, there are two people walking (activity level = 1); there is no motion in the Bottom Left quadrant (activity level = 0); a car is moving in the Bottom Right quadrant (activity level = 1); and trees are swaying in the Top Right quadrant (activity level = 2). The SMLD multilabel that would be assigned to all 30 frames within this second of video (determined based on majority) is shown in Table 2. Initially, our dataset included 30 videos. The number of instances for the different classes/labels for these 30 videos is shown in Table 3. As can be seen, there is some skew in the distribution, causing activity level '0' to be considerably more prominent in the TL, BL and BR quadrants. In order to eliminate the problems inherently associated with imbalanced data, we sought to obtain a more uniform class distribution, and have more balanced data. At first, we tried to balance our labels using class weight assignments in our model. However, this did not provide satisfactory results. Then, we devised a custom dataset augmentation technique. We wanted to avoid undersampling or creating synthetic samples because of the disadvantages mentioned in Section II. Looking at our labels, we noticed that some videos had significantly more activity level '1' assignments, which is one of the underrepresented levels as can be seen in Table 3. Thus, we augmented our dataset with modified/replicated versions of these videos in order to make use of these labels for balancing purposes. More specifically, the new videos were generated by flipping the individual quadrants either horizontally or vertically. Therefore, each video in the augmented dataset was unique. An example of a video in which each quadrant has been flipped vertically is shown in Fig. 5. Similarly, an example showing a video generated by flipping the quadrants in the horizontal direction is shown in Fig. 6. This way, six additional videos were generated, bringing the total size of the dataset to 36 videos, totaling close to 49 minutes in length. All of these videos were manually labeled, and have been used for training and validation. After this customized data augmentation, we successfully obtained a more balanced dataset with a better distribution of the number of label instances, as shown in Table 4. After testing our proposed model and obtaining predicted labels, we calculated the average frame-wise quadruple-label Hamming Loss using Eq. (1).
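A minimal sketch of the quadrant-wise flipping used above for label balancing: each quadrant of every frame is flipped independently, either horizontally or vertically, producing a new unique video whose per-quadrant activity labels are unchanged. Names are illustrative.

```python
import numpy as np

def flip_quadrants(frame: np.ndarray, axis: int = 1) -> np.ndarray:
    """frame: (H, W, 3). axis=1 flips each quadrant horizontally, axis=0
    vertically. Quadrant boundaries stay fixed, so the SMLD multilabel of
    the frame is preserved."""
    h2, w2 = frame.shape[0] // 2, frame.shape[1] // 2
    out = frame.copy()
    for rows in (slice(0, h2), slice(h2, None)):
        for cols in (slice(0, w2), slice(w2, None)):
            out[rows, cols] = np.flip(frame[rows, cols], axis=axis)
    return out
```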
TABLE 6. Average Subset Accuracy Per Testing Video for the First Dataset.
FIGURE 5. An example showing four vertically rotated quadrants for generating a new video for balancing label distribution.
TABLE 7. Single Quadrant-Based Label Metrics for the First Dataset.
FIGURE 6. An example showing four horizontally rotated quadrants for generating a new video for balancing label distribution.
TABLE 4. Activity Level Label Distribution after Video Data Augmentation.
TABLE 5. Average Hamming Loss Per Testing Video for the First Dataset.
These results are shown in Table 5. These Hamming Loss values indicate that, overall, our classifier made prediction errors about 5% of the time. The lower this value, the better the classifier is at predicting the correct motion level at each quadrant. It is difficult, and would not be commensurate, to compare these values to previous work [42], [43], since our experimental set-up and labeling schema are quite unique. For example, Dai et al. [42] use a concept-oriented multilabel for classification, which stores the physical attributes and design features of an object, e.g., the shape, color and style of a smart watch, to offer comprehensive product descriptive data. Trochidis et al. [43] use a multilabel to represent different emotions that a music piece might evoke. The type of information that these multilabels carry is unlike the combined spatial and temporal characteristics of our SMLD multilabels. Nonetheless, we can conclude that these low Hamming Loss results are promising. We then calculated the average frame-wise quadruple-label Subset Accuracy by using Eq. (2). The results are shown in Table 6. It should be noted that Subset Accuracy is quite unforgiving in that a small classification error for a 4-quadrant label will cause the entire predicted multilabel to be incorrect. This is because this metric does not distinguish between a single quadrant label mismatch and multiple quadrant label mismatches. If all 4 quadrant labels in an instance match the corresponding ground truth multilabel, the subset accuracy is set to 1; otherwise, it is set to 0. That is why an average Subset Accuracy for all testing videos that is close to 80% can be considered a good result. It means that the algorithm is capable of predicting the correct 4-quadrant set of activity levels 80% of the time.
The set of metrics calculated for single quadrant-based labeling, namely the Micro-Averaged Precision, Micro-Averaged Recall, and Micro-Averaged F-score values, are shown in Table 7. Considering that the effects of data imbalance have been minimized in our training and validation data via data augmentation, our results are very promising and a good representation of our proposed SMLD model's ability to successfully predict different levels of simultaneous activity/motion in different spatial locations. This is exhibited with the confusion matrix parameters presented in Table 7. All four quadrants showcase robust values for our testing videos. This is also supported by our frame-wise quadruple label-based Hamming Loss and Subset Accuracy results shown in Tables 5 and 6, respectively. It should be noted that this algorithm was not designed for the purpose of 'object tracking', since the predicted activity levels produced by our approach do not distinguish between different types of objects or sources of motion. However, we have observed that, with the SMLD labels, it is possible to detect certain singular and distinct motions as an added benefit. These motion patterns can be detected by looking at the sequence of predicted activity levels in adjacent quadrants. The most frequent motion patterns or common trajectories in the first set of videos included cars entering the TL quadrant from the top and driving to the BL quadrant, cars driving from the bottom of the BR quadrant to the TR quadrant, and people walking from the BL quadrant to the TL quadrant. We tested the ability of our proposed approach in detecting these motion patterns. By using the predicted motion levels at each quadrant over time, for all five test videos, we were able to successfully detect the aforementioned inter-quadrant motions simultaneously.
TABLE 8. Start and end times of the detected inter-quadrant motions from the test video of the first set of videos.
Specifically, the TL to BL motion and the BR to TR motion were detected 80% and 100% of the time, respectively. The accuracy of detecting BL to TL motion was 66.7%. Thus, the overall weighted accuracy for motion pattern detection is 83.3% for these three dominant trajectories/paths. This indicates that trajectory detection is possible with the predicted levels of activity, without relying on object detection or tracking, when the inter-quadrant motion is distinct and not simultaneous. This accuracy was affected by classification errors during testing, and by interference from additional intra-quadrant motions. We have also performed an experiment to investigate the ability of our proposed SMLD labeling approach in detecting a trajectory of activity across several quadrants (more than two). This would be possible by extending trajectory detection to less common inter-quadrant motion patterns. For this experiment, we also considered motion from the TR quadrant to the TL quadrant, an infrequent motion in our first set of testing videos. Adding this motion pattern to the more common BR to TR motion would enable us to detect motion from the BR quadrant up to the TL quadrant. This is still a challenging problem, since the quadrants of interest are not void of intra-quadrant motion interference, and our algorithm does not differentiate between different types and sources of motion within a quadrant. As an example, based on the predicted levels of motion at different quadrants, we were able to detect a motion pattern from the BR quadrant to the TL quadrant in one of the test videos. By monitoring the predicted level of motion in the initial quadrant followed by consecutive motion in the subsequent quadrant, we cascaded the two inter-quadrant motions (BR to TR followed by TR to TL) in order to detect a trajectory from BR to TL. A comparison of the timestamps in our test video with those produced by our pattern detection script showed that the proposed approach was able to follow this type of trajectory of a car (activity level = 1). Table 8 shows the timestamps detected for the BR to TR and the TR to TL motions. We then observed these moments in time in this test video to visualize the significance of our results. We present snapshots from the test video at the three main timestamps in Figures 7, 8 and 9. In Fig. 7, a car enters the BR quadrant at the 66th second. This correlates with our first detected start time for the BR quadrant seen in Table 8. At the 71st second of the video (Fig. 8), the car leaves the TR quadrant. Finally, we see the car leaving through the TL quadrant at the 76th second (Fig. 9). The predicted activity levels in the BR, TR and TL quadrants, which were used to detect this pattern, are shown in Table 9.
FIGURE 7. Snapshot from the test video for BR to TR motion. A car enters at the 66th second.
FIGURE 8. Snapshot from the test video for BR to TR motion. A car leaves the TR quadrant.
FIGURE 9. Snapshot from the test video for TR to TL motion. A car leaves the TL quadrant.

TABLE 9. Activity Levels in Regions of Interest to Illustrate Detected Multi-Quadrant Trajectory for the First Dataset.
Looking at the highlighted '1' activity levels, which follow a sequential pattern, we can follow the trajectory of the car as it travels from the BR quadrant to the TL quadrant while crossing through the TR quadrant. As mentioned above, our algorithm is designed to detect different levels of simultaneous motion in the quadrants, without distinguishing the source of motion. The other, non-highlighted activity levels in Table 9 correspond to different sources of motion. For instance, at the 66th second, a person is walking in the TL quadrant, and at the 74th second, the red parked car's hazard lights are turned on in the BR quadrant.
B. SECOND SET OF EXPERIMENTS
In order to further demonstrate the effectiveness of our proposed SMLD model in detecting different levels of activity, which happen simultaneously at different spatial locations in a multidimensional signal, we evaluated it on another group of videos from the VIRAT Video Dataset [26]. In these videos, a static camera provides the view of a public outdoor space. The types of activity in these videos include the motion of trees, plants and patio umbrellas with the wind, people walking in groups and separately, and moving shadows, as shown in Fig. 10.
TABLE 11. Activity Level Label Distribution for the Second Dataset.
TABLE 12. Average Hamming Loss Per Testing Video for the Second Dataset.
FIGURE 10. Four quadrants used to set up labels for the second set of videos.
In this case, the threshold for level '2' activity was set slightly lower (to 20%) than for the first dataset (which used 25%). Since the motion in this dataset does not include larger objects such as vehicles, this adjustment to the threshold was necessary to be able to better represent an activity level of '2'. For example, referring to Fig. 10, the bottom left quadrant shows no activity, so it would be assigned a '0'. The motions of individuals in the bottom right and top right quadrants would earn them labels of '1' (since the motion covers less than 20% of the respective quadrants). In the top left quadrant, a group of people are walking and the trees are swaying at the same time, resulting in a label of '2' for that quadrant. Putting these together, the multilabel for Fig. 10 is shown in Table 10.

TABLE 10. Example of our Spatially and Motion Level Descriptive (SMLD) Multilabel Schema for the Second Dataset.
Initially, this second set of data included 23 videos. Similar to the experiment above, we augmented the data with quadrant-wise flipped versions of these videos for label balancing purposes. In other words, the new videos were generated by flipping the individual quadrants either horizontally or vertically, as described before. Therefore, each video in the augmented dataset was unique. This way, five additional videos were generated, bringing the total size of the dataset to 28 videos, totaling close to 28 minutes in length. All of these videos were manually labeled, and have been used for training and validation. Table 11 shows the distribution of the number of label instances for the second set of videos after label balancing via video data augmentation. The set of videos was prepared for training and validation the same way as the first set of videos, except that we had to account for a frame rate of 24 fps (instead of 30 fps as was the case for the first set). That is why the frames were grouped in chunks of 12 frames with a sliding window of 4 frames. The process of training and validation was identical to the one described above. Testing was also performed on five unseen test videos taken by the same camera.
TABLE 13. Average Subset Accuracy Per Testing Video for the Second Dataset.
TABLE 14. Quadrant-Based Metrics for the Second Dataset.
Our results for this experiment are presented in Tables 12, 13 and 14. Tables 12 and 13 show the frame-wise quadruple label-based Hamming Loss and the average Subset Accuracy, respectively. The single quadrant-wise label metrics are presented in Table 14. Applied to this second dataset, our classifier made prediction errors about 4% of the time, on average. Our overall Subset Accuracy was close to 83% and, again, the Micro-Averaged Precision and Recall values were above 0.9. We can therefore conclude that our proposed method offers a promising model by providing a different approach to multilabeling videos. Frequent paths in the second dataset included people walking from the right of the TR quadrant to the TL quadrant, as well as up and down the staircase on the left of the frames (i.e., motion between the TL and BL quadrants).
Although our algorithm was not designed to support object tracking, by using the predicted motion levels at each quadrant over time for all five test videos, we obtained promising results in monitoring the aforementioned inter-quadrant motions simultaneously. Specifically, the TL to BL and BL to TL motions were detected 75% and 100% of the time, respectively. The accuracy of detecting TR to TL motion was 87.5%. Thus, the overall weighted accuracy for motion pattern detection is 86.7% for these three dominant trajectories/paths. This indicates that trajectory detection is possible with our proposed approach and the predicted levels of activity for the second dataset as well. This was accomplished simply through the detection of activity levels in groups of quadrants, again without relying on any kind of object detection or tracking. As an example, based on the predicted levels of motion at different quadrants, we were able to detect a motion pattern from the TR quadrant to the TL quadrant in one of the test videos. By monitoring the detected level of motion in the initial quadrant followed by consecutive motion in the subsequent quadrant, a pattern was detected. A comparison of the timestamps in our test video with those produced by our pattern detection script showed that the proposed approach was able to follow this type of trajectory of a person walking (activity level = 1). Table 15 shows the timestamps detected for the TR to TL motion in this example. We then observed these moments in time in this test video to visualize the significance of our results. We present snapshots from the test video at the two timestamps in Figures 11 and 12. In Fig. 11, a person is seen in the TR quadrant at the 53rd second. This correlates with our detected start time for the TR quadrant seen in Table 15. At the 69th second of the video (Fig. 12), the person is on the way out of the TL quadrant, headed to the BL quadrant. The video ends at that point, so further motion cannot be detected.

TABLE 15. Start and end times of the detected inter-quadrant motion from the test video of the second set of videos.
FIGURE 11. Snapshot from the test video for the beginning of the TR to TL motion. A person enters at the 53rd second.
VI. DISCUSSION
FIGURE 12. Snapshot from the test video for the end of the TR to TL motion. A person leaves at the 69th second.
In order to show that our proposed approach can be applied to different types of spatiotemporal data, and to illustrate the strength of our approach in providing multilabels together with location information, we have performed an experiment with respiratory sounds. Due to the current world health crisis caused by COVID-19, we thought it would be timely to assess how our approach could be used to detect sounds associated with lung disease. Using Kaggle's Respiratory Sound Database [56], we set up our multilabels to detect the presence or absence of simultaneous wheezing and crackling sounds in the posterior left and right parts of the lungs. The simultaneous presence of these sounds in different parts of the lungs could be used to diagnose serious lung disease. Since we were interested in classifying an overall lung function, we chose two main regions of interest within the dataset. This choice supported our goal of evaluating the performance of our SMLD multilabeling and proposed approach on a different type of spatiotemporal data (other than videos) without addressing the medical significance of these sounds. Since our study only focused on the presence or absence of wheezing and crackling sounds, our multilabels became binary in nature. The Posterior Left (PL) and Posterior Right (PR) lung sound clips were mixed together and labeled with sets of four multilabels representing PL crackling, PL wheezing, PR crackling and PR wheezing. The labels for each side of the lungs were provided in the database. As supported by Lee et al. [57], we used the raw waveform data instead of converting it to a spectrogram, as is commonly done when classifying signals in the audio domain [58], [59]. Lee et al.'s research shows that performance is not degraded by using raw waveform data. Training and validation were performed on a balanced label set. We compiled our classification results for comparison with the resulting F-scores from Kaggle's Notebook: CNN Detection of wheezes and crackles [58]. The predictions for our labels were generated independently, which means that the results would be the same were we to consider our labels concurrently or sequentially. We were able to improve the F-scores in all categories: absence of crackling/wheezing, presence of a crackling sound, presence of a wheezing sound, and presence of both crackling and wheezing. It is important to note that our experiment is based on a subset of the overall respiratory sounds dataset, with a focus on PL and PR lung sound clips. Also, the method presented in this Kaggle study differs from ours, since a single multiclass label is used instead of multilabeling and the algorithm is built out of a cascade of CNNs without any RNNs. The results showing the improvement achieved by our method are presented in Table 16.
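As a small illustration of how the SMLD schema reduces to binary labels in this setting, the sketch below assembles the four-element multilabel used above from per-region crackle and wheeze flags; the dataset provides such flags per breathing cycle, but the function name and exact annotation handling here are assumptions.

```python
# Hypothetical helper: build the [PL crackling, PL wheezing, PR crackling,
# PR wheezing] multilabel from per-region annotations.
def respiratory_multilabel(pl_crackle: bool, pl_wheeze: bool,
                           pr_crackle: bool, pr_wheeze: bool) -> list:
    """Return the binary SMLD multilabel for the two lung regions."""
    return [int(pl_crackle), int(pl_wheeze), int(pr_crackle), int(pr_wheeze)]

# Example: crackling in the posterior left lung, wheezing in the posterior right.
print(respiratory_multilabel(True, False, False, True))   # [1, 0, 0, 1]
```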
TABLE 16. A Comparison of F-Scores in Detecting Respiratory Sounds.
These results further strengthen the validity of our approach. Although our labeling technique and algorithm are unique, we were able to provide a commensurate comparison with the results presented in Kaggle. As we have shown, detecting two different types of patterns using lung sound clips was accomplished successfully with our approach. When using the respiratory signals as input to our model, we treated the data with labels representing two spatial groups (PL and PR). The choice of the number of regions of interest is dependent on the information to be gathered from the spatiotemporal data. The improvement in classifying a certain type of sound or a combination of sounds inspires us to further explore this as a non-intrusive way of diagnosing lung disease in future research. Finally, the context of the spatiotemporal data is important in determining the significance of the correlations between regions of interest. For example, monitoring the path of motion is important in surveillance videos for security reasons. Another very important application domain, which would be well suited to this type of analysis, is Neuroscience. If a stimulus elicits activity in different parts of the brain, it can be assumed that these activities are correlated, and different parts of the brain can be stimulated simultaneously and/or sequentially. The latter will also be addressed in future research.
We have presented a novel and promising approach to detecting different levels of simultaneous and spatiotemporal activity in multidimensional data through the use of a new multilabeling technique. In this paper, without loss of generality, we have applied our proposed approach to video data as an example, and will explore other domains in our future work. The success of the proposed approach, as supported by a broad range of robust metric values, is encouraging for applying it to a variety of multidimensional datasets with spatiotemporal characteristics. Moreover, the ability to map out trajectories without relying on object detection and tracking is an advantageous and preferable outcome. We also showed that our labeling technique and classification algorithm can produce successful classification results when applied to respiratory sounds.

Our future intention is to continue our study of classifying respiratory sounds by expanding our use of Kaggle's Respiratory Sound Database [56]. Additional sound clips captured from other parts of the lungs would be added, with a corresponding increase in the number of labels in our multilabels. Another form of spatiotemporal data to which we intend to apply our method is brain signals captured through non-invasive brain measurement devices. The nature of brain data offers the spatiotemporal structure needed for the multilabel classifier outlined in this paper. Separating the brain activity readings into separate regions of interest, as was done via four quadrants for the video datasets and two regions for the respiratory sound dataset, will enable us to use our novel multilabeling system on specific functional brain regions of interest.

ACKNOWLEDGMENT
The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.

REFERENCES
[1] K. Sozykin, S. Protasov, A. Khan, R. Hussain, and J. Lee, "Multilabel class-imbalanced action recognition in hockey videos via 3D convolutional neural networks," in Proc. 19th IEEE/ACIS Int. Conf. Softw. Eng., Artif. Intell., Netw. Parallel/Distributed Comput. (SNPD), Jun. 2018, pp. 146–151.
[2] L. Constantine, G. Badaro, H. Hajj, W. El-Hajj, L. Nachman, M. BenSaleh, and A. Obeid, "A framework for emotion recognition from human computer interaction in natural setting," in Proc. 22nd ACM SIGKDD Conf. Knowl. Discovery Data Mining, 2016, pp. 1–10.
[3] O. Dekel and O. Shamir, "Multiclass-multilabel classification with more classes than examples," in Proc. 13th Int. Conf. Artif. Intell. Statist., 2010, pp. 137–144.
[4] P. Mehta, A. Y. Lee, C. Lee, M. Balazinska, and A. Rokem, "Multilabel multiclass classification of OCT images augmented with age, gender and visual acuity data," BioRxiv, Cold Spring Harbor Lab., New York, NY, USA, Tech. Rep., 2018, pp. 316–349.
[5] C. Mercan, S. Aksoy, E. Mercan, L. G. Shapiro, D. L. Weaver, and J. G. Elmore, "Multi-instance multi-label learning for multi-class classification of whole slide breast histopathology images," IEEE Trans. Med. Imag., vol. 37, no. 1, pp. 316–325, Jan. 2018.
[6] F. Herrera, F. Charte, A. J. Rivera, and M. J. del Jesus, Multilabel Classification. Cham, Switzerland: Springer, 2016, pp. 17–31.
[7] M. Monfort, K. Ramakrishnan, D. Gutfreund, and A. Oliva, "A large scale multi-label action dataset for video understanding," in Proc. Conf. Cogn. Comput. Neurosci., 2018, doi: 10.32470/CCN.2018.1137-0.
[8] S. Yeung, O. Russakovsky, N. Jin, M. Andriluka, G. Mori, and L. Fei-Fei, "Every moment counts: Dense detailed labeling of actions in complex videos," Int. J. Comput. Vis., vol. 126, nos. 2–4, pp. 375–389, Apr. 2018.
[9] J. Ray, H. Wang, D. Tran, Y. Wang, M. Feiszli, L. Torresani, and M. Paluri, "Scenes-objects-actions: A multi-task, multi-label video dataset," in Proc. Eur. Conf. Comput. Vis. (ECCV), 2018, pp. 635–651.
[10] M. Hoai, Z.-Z. Lan, and F. De la Torre, "Joint segmentation and classification of human actions in video," in Proc. CVPR, Jun. 2011, pp. 3265–3272.
[11] Z. A. Daniels and D. N. Metaxas, "Addressing imbalance in multi-label classification using structured Hellinger forests," in Proc. 31st AAAI Conf. Artif. Intell., 2017, pp. 1–7.
[12] C. L. León, "Multilabel classification of EEG-based combined motor imageries implemented for the 3D control of a robotic arm, version 1," Univ. Lorraine, Nancy, France, Tech. Rep. 2017LORR0016, 2017. [Online]. Available: https://tel.archives-ouvertes.fr/tel-01549139
[13] G. Tsoumakas, E. Mencía, I. Katakis, S. Park, and J. Fürnkranz, "On the combination of two decompositive multi-label classification methods," in Proc. Workshop Preference Learn. ECML PKDD, vol. 9, 2009, pp. 114–133.
[14] J. Read, A. Puurula, and A. Bifet, "Multi-label classification with metalabels," in Proc. IEEE Int. Conf. Data Mining, Dec. 2014, pp. 941–946.
[15] J. D. C. Júnior, E. de Faria Paiva, J. de Andrade Silva, and R. Cerri, "Label powerset for multi-label data streams classification with concept drift," in Proc. 5th Symp. Knowl. Discovery, Mining Learn., 2017, pp. 97–104.
[16] F. Charte, A. J. Rivera, M. J. del Jesus, and F. Herrera, "Addressing imbalance in multilabel classification: Measures and random resampling algorithms," Neurocomputing, vol. 163, pp. 3–16, Sep. 2015.
[17] F. Charte, A. J. Rivera, M. J. del Jesus, and F. Herrera, "MLSMOTE: Approaching imbalanced multilabel learning through synthetic instance generation," Knowl.-Based Syst., vol. 89, pp. 385–397, Nov. 2015.
[18] B. Liu and G. Tsoumakas, "Synthetic oversampling of multi-label data based on local label distribution," 2019, arXiv:1905.00609. [Online]. Available: http://arxiv.org/abs/1905.00609
[19] A. Shustanov and P. Yakimov, "CNN design for real-time traffic sign recognition," Procedia Eng., vol. 201, pp. 718–725, Jan. 2017.
[20] M. S. Rad, A. von Kaenel, A. Droux, F. Tieche, N. Ouerhani, H. K. Ekenel, and J.-P. Thiran, "A computer vision system to localize and classify wastes on the streets," in Proc. Int. Conf. Comput. Vis. Syst. Cham, Switzerland: Springer, 2017, pp. 195–204.
[21] Y. Wei, W. Xia, J. Huang, B. Ni, J. Dong, Y. Zhao, and S. Yan, "CNN: Single-label to multi-label," 2014, arXiv:1406.5726. [Online]. Available: http://arxiv.org/abs/1406.5726
[22] J. Wang, Y. Yang, J. Mao, Z. Huang, C. Huang, and W. Xu, "CNN-RNN: A unified framework for multi-label image classification," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2016, pp. 2285–2294.
[23] J. R. Medel and A. Savakis, "Anomaly detection in video using predictive convolutional long short-term memory networks," 2016, arXiv:1612.00390. [Online]. Available: http://arxiv.org/abs/1612.00390
[24] R. Krishna, K. Hata, F. Ren, L. Fei-Fei, and J. C. Niebles, "Dense-captioning events in videos," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), Oct. 2017, pp. 706–715.
[25] W.-S. Chu, F. De la Torre, and J. F. Cohn, "Learning spatial and temporal cues for multi-label facial action unit detection," in Proc. 12th IEEE Int. Conf. Automat. Face Gesture Recognit. (FG), May 2017, pp. 25–32.
[26] The VIRAT Video Dataset. Accessed: Feb. 21, 2020. [Online]. Available: http://www.viratdata.org/
[27] X. Shi, Z. Chen, H. Wang, D.-Y. Yeung, W.-K. Wong, and W.-C. Woo, "Convolutional LSTM network: A machine learning approach for precipitation nowcasting," in Proc. Adv. Neural Inf. Process. Syst., 2015, pp. 802–810.
[28] C. Xu, D. Chai, J. He, X. Zhang, and S. Duan, "InnoHAR: A deep neural network for complex human activity recognition," IEEE Access, vol. 7, pp. 9893–9902, 2019.
[29] G. Tsoumakas and I. Katakis, "Multi-label classification: An overview," Int. J. Data Warehousing Mining, vol. 3, no. 3, pp. 1–13, 2007.
[30] X.-Z. Wu and Z.-H. Zhou, "A unified view of multi-label performance measures," in Proc. 34th Int. Conf. Mach. Learn., vol. 70, 2017, pp. 3780–3788.
[31] M.-L. Zhang and Z.-H. Zhou, "A review on multi-label learning algorithms," IEEE Trans. Knowl. Data Eng., vol. 26, no. 8, pp. 1819–1837, Aug. 2014.
[32] M. Guerriero, P. Willett, S. Coraluppi, and C. Carthel, "Radar/AIS data fusion and SAR tasking for maritime surveillance," in Proc. 11th Int. Conf. Inf. Fusion, 2008, pp. 1–5.
[33] A. Renga, M. D. Graziano, M. D'Errico, A. Moccia, and A. Cecchini, "SAR-based sea traffic monitoring: A reliable approach for maritime surveillance," Proc. SPIE, vol. 8179, Oct. 2011, Art. no. 81790B.
[34] G. Sreenu and M. A. S. Durai, "Intelligent video surveillance: A review through deep learning techniques for crowd analysis," J. Big Data, vol. 6, Jun. 2019, Art. no. 48.
[35] F. Jiang, J. Yuan, S. A. Tsaftaris, and A. K. Katsaggelos, "Anomalous video event detection using spatiotemporal context," Comput. Vis. Image Understand., vol. 115, no. 3, pp. 323–333, 2011.
[36] Y. Zhu, S. Liu, and S. Newsam, "Large-scale mapping of human activity using geo-tagged videos," in Proc. 25th ACM SIGSPATIAL Int. Conf. Adv. Geograph. Inf. Syst., Nov. 2017, pp. 1–4.
[37] P. Weinzaepfel, Z. Harchaoui, and C. Schmid, "Learning to track for spatio-temporal action localization," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), Dec. 2015, pp. 3164–3172.
[38] J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, T. Darrell, and K. Saenko, "Long-term recurrent convolutional networks for visual recognition and description," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2015, pp. 2625–2634.
[39] S. Saha, G. Singh, M. Sapienza, P. H. S. Torr, and F. Cuzzolin, "Deep learning for detecting multiple space-time action tubes in videos," 2016, arXiv:1608.01529. [Online]. Available: http://arxiv.org/abs/1608.01529
[40] Z. Yang, J. Gao, and R. Nevatia, "Spatio-temporal action detection with cascade proposal and location anticipation," 2017, arXiv:1708.00042. [Online]. Available: http://arxiv.org/abs/1708.00042
[41] R. B. Pereira, A. Plastino, B. Zadrozny, and L. H. C. Merschmann, "Correlation analysis of performance measures for multi-label classification," Inf. Process. Manage., vol. 54, no. 3, pp. 359–369, May 2018.
[42] Y. Dai, Y. Li, and S.-T. Li, "Multi-label learning for concept-oriented labels of product image data," Image Vis. Comput., vol. 93, Jan. 2020, Art. no. 103821.
[43] K. Trohidis, G. Tsoumakas, G. Kalliris, and I. P. Vlahavas, "Multi-label classification of music into emotions," in Proc. ISMIR, vol. 8, 2008, pp. 325–330.
[44] J. Nam, E. L. Mencía, H. J. Kim, and J. Fürnkranz, "Maximizing subset accuracy with recurrent neural networks in multi-label classification," in Proc. Adv. Neural Inf. Process. Syst., 2017, pp. 5413–5423.
[45] S. L. Franconeri, G. A. Alvarez, and P. Cavanagh, "Flexible cognitive resources: Competitive content maps for attention and memory," Trends Cognit. Sci., vol. 17, no. 3, pp. 134–141, Mar. 2013.
[46] Z. Yuan and X. Lin, "Using fNIRS to identify the brain activation and networks associated with English versus Chinese simultaneous interpreting," Proc. SPIE, vol. 10864, Mar. 2019, Art. no. 108640O.
[47] L. Ballan, M. Bertini, A. Del Bimbo, L. Seidenari, and G. Serra, "Event detection and recognition for semantic annotation of video," Multimedia Tools Appl., vol. 51, no. 1, pp. 279–302, Jan. 2011.
[48] C. Zhao, J. Wang, J. Li, and H. Lu, "Automatic group activity annotation for mobile videos," Multimedia Syst., vol. 23, no. 6, pp. 667–677, Nov. 2017.
[49] B. Zapata-Impata, P. Gil, and F. Torres, "Learning spatio temporal tactile features with a ConvLSTM for the direction of slip detection," Sensors, vol. 19, no. 3, p. 523, Jan. 2019.
[50] C. Olah. (2015). Understanding LSTM Networks. [Online]. Available: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
[51] Y. Li, "A deep spatiotemporal perspective for understanding crowd behavior," IEEE Trans. Multimedia, vol. 20, no. 12, pp. 3289–3297, Dec. 2018.
[52] H. Sager and W. Hoff, "Pedestrian detection in low resolution videos," in Proc. IEEE Winter Conf. Appl. Comput. Vis., Mar. 2014, pp. 668–673.
[53] M. Zhai, L. Chen, J. Li, M. Khodabandeh, and G. Mori, "Object detection in surveillance video from dense trajectories," in Proc. 14th IAPR Int. Conf. Mach. Vis. Appl. (MVA), May 2015, pp. 535–538.
[54] D. Bandara, S. Velipasalar, S. Bratt, and L. Hirshfield, "Building predictive models of emotion with functional near-infrared spectroscopy," Int. J. Hum.-Comput. Stud., vol. 110, pp. 75–85, Feb. 2018.
[55] D. Bandara, L. Hirshfield, and S. Velipasalar, "Classification of affect using deep learning on brain blood flow data," J. Near Infr. Spectrosc., vol. 27, no. 3, pp. 206–219, Jun. 2019.
[56] Respiratory Sound Database. Accessed: Mar. 23, 2020. [Online]. Available: https://www.kaggle.com/vbookshelf/respiratorysound-database
[57] J. Lee, J. Park, K. L. Kim, and J. Nam, "SampleCNN: End-to-end deep convolutional neural networks using very small filters for music classification," Appl. Sci., vol. 8, no. 1, p. 150, 2018.
[58] CNN: Detection of Wheezes and Crackles. Accessed: Mar. 23, 2020. [Online]. Available: https://www.kaggle.com/eatmygoose/cnn-detectionof-wheezes-andcrackles
[59] M. Aykanat, Ö. Kılıç, B. Kurt, and S. Saryal, "Classification of lung sounds using convolutional neural networks," EURASIP J. Image Video Process., vol. 2017, Sep. 2017, Art. no. 65.
[60] A.-A. Liu, Z. Shao, Y. Wong, J. Li, Y.-T. Su, and M. Kankanhalli, "LSTM-based multi-label video event detection," Multimedia Tools Appl., vol. 78, no. 1, pp. 677–695, Jan. 2019.
NATALIE M. SOMMER (Member, IEEE) received the B.S. and M.S. degrees in electrical engineering from Union College, Schenectady, NY, in 1991. She is currently pursuing the Ph.D. degree with the Department of Electrical Engineering and Computer Science, Syracuse University. Her research interests include deep machine learning methods for activity recognition in video data and fNIRS brain signals.
SENEM VELIPASALAR (Senior Member, IEEE) received the B.S. degree in electrical and electronic engineering from Bogazici University, Istanbul, Turkey, in 1999, the M.S. degree in electrical sciences and computer engineering from Brown University, Providence, RI, USA, in 2001, and the M.A. and Ph.D. degrees in electrical engineering from Princeton University, Princeton, NJ, USA, in 2004 and 2007, respectively. From 2001 to 2005, she was with the Exploratory Computer Vision Group, IBM T. J. Watson Research Center, NY, USA. From 2007 to 2011, she was an Assistant Professor with the Department of Electrical Engineering, University of Nebraska–Lincoln, Lincoln. She is currently an Associate Professor with the Department of Electrical Engineering and Computer Science, Syracuse University. Her research interests include mobile camera applications, wireless embedded smart cameras, multi-camera tracking and surveillance systems, and automatic event detection from videos. She received the Faculty Early Career Development Award (CAREER) from the National Science Foundation, in 2011. She was a recipient of the 2014 Excellence in Graduate Education Faculty Recognition Award. She is a coauthor of the article that received the 2017 IEEE Green Communications and Computing Technical Committee Best Journal Paper Award. She received the Best Student Paper Award at the IEEE International Conference on Multimedia and Expo, in 2006. She was a recipient of the EPSCoR First Award, the IBM Patent Application Award, and the Princeton and Brown University Graduate Fellowships. She is a member of the Editorial Board of the IEEE TRANSACTIONS ON IMAGE PROCESSING and the Journal of Signal Processing Systems (Springer).

LEANNE HIRSHFIELD received the bachelor's degree from Hamilton College, the M.S. degree from the Colorado School of Mines, and the Ph.D. degree from Tufts University, in 2009, all in computer science. She is currently an Associate Research Professor with the University of Colorado's Institute of Cognitive Science, where she directs the research efforts of the System Human Interaction with NIRS and EEG (SHINE) Laboratory. Her specialty is in the areas of human–computer interaction (HCI) and machine learning (ML), and she has a wealth of experience designing and evaluating various user interfaces. Her research in the SHINE Laboratory explores the use of non-invasive brain measurement to passively classify user states in order to enhance usability testing and adaptive system design. Her research interests include the use of a relatively new, non-invasive brain imaging device called functional near-infrared spectroscopy (fNIRS). The fNIRS device is safe, portable, robust to noise, and can be implemented wirelessly, making it ideal for research in HCI. While much of her research focuses on fNIRS, she also works extensively with EEG, galvanic skin response sensors, and eye-tracking devices in her laboratory.
YANTAO LU (Student Member, IEEE) received the B.S. degree in electrical engineering from Xi'an Jiaotong University, Shaanxi, China, in 2013, and the M.S. degree in electrical engineering from Syracuse University, Syracuse, NY, USA, in 2015, where he is currently pursuing the Ph.D. degree with the Department of Electrical Engineering and Computer Science. His research interests include activity recognition from wearable cameras and object detection using neural networks.
BURAK KAKILLIOGLU (Student Member, IEEE) received the B.S. degree in electrical and electronics engineering from Bilkent University, Ankara, Turkey, in 2015. He is currently pursuing the Ph.D. degree with the Department of Electrical Engineering and Computer Science, Syracuse University, Syracuse, NY, USA. In 2019, he was a Research Intern with the Computer Vision Technologies Group, SRI International, Princeton, NJ, USA. His research interests include 3D data understanding, computer vision, machine learning, and embedded systems.