Frota rewbenio paa 2012 a unifying methodology

Pattern Anal Applic DOI 10.1007/s10044-011-0265-3

THEORETICAL ADVANCES

A unifying methodology for the evaluation of neural network models on novelty detection tasks Guilherme A. Barreto • Rewbenio A. Frota

Received: 27 April 2010 / Accepted: 31 December 2011 Ó Springer-Verlag London Limited 2012

Abstract An important issue in data analysis and pattern classification is the detection of anomalous observations and its influence on the classifier’s performance. In this paper, we introduce a novel methodology to systematically compare the performance of neural network (NN) methods applied to novelty detection problems. Initially, we describe the most common NN-based novelty detection techniques. Then we generalize to the supervised case, a recently proposed unsupervised novelty detection method for computing reliable decision thresholds. We illustrate how to use the proposed methodology to evaluate the performances of supervised and unsupervised NN-based novelty detectors on a real-world benchmarking data set, assessing their sensitivity to training parameters, such as data scaling, number of neurons, training epochs and size of the training set. Keywords Novelty detection Self-organizing maps Multilayer neural networks Bootstrap Decision intervals

As such, it has been the focus of increasing attention in many pattern recognition applications whose success depends on building a reliable model for the data, such as machine monitoring [23, 31, 44], image processing [34], remote sensing [54], medical diagnosis [30, 42], mobile robotics [36, 47], multimedia applications [9, 40], computer network security [13, 48, 53], homeland security [4], telecommunications [7, 15], time series data analysis [6, 37, 51], among others. This interest is in part due to the fact that, for a wide range of real-world problems it is crucial to be able to detect patterns that do not match well with the stored data representation. Several neural, system-theoretic, statistical and hybrid approaches to novelty detection have been proposed over the years, but it is becoming usual that the formulation of novelty detection tasks as one of the following pattern classification problems: –

1 Introduction Novelty detection is the problem of reporting the occurrence of novel events or data. Due to the wide range of applicability across disciplines in engineering and science, novelty detection can also be called anomaly detection, intruder detection, fault detection or even outlier detection. G. A. Barreto (&) R. A. Frota Department of Teleinformatics Engineering, Federal University of Ceara´, Fortaleza, CE, Brazil e-mail: guilherme@deti.ufc.br R. A. Frota e-mail: rewbenio@uol.com.br

–

Single-class: The data available for learning a representation of the expected behavior of the system of interest comprised only one class of data vectors, usually representing normal activity of the system. This type of data is also referred to as positive examples. The goal is to indicate if a given input vector corresponds to normal or abnormal behavior. Multi-class: The training set contains data vectors of different classes. The data should be representative of positive (normal) and negative (abnormal) behavior, to build an overall representation of the known system behavior, even (and specially) in faulty operation [3]. The goal is to classify the input vector into one or none of the existing classes.

Thus, the design of novelty detectors can be generally stated as the task in which a description of what is already known about the system is learned by fitting a set of normal

123

Turn static files into dynamic content formats.

Create a flipbook