Font Size: a A A

A Family of Joint Sparse PCA Algorithms for Anomaly Localization in Network Data Streams

Posted on:2013-01-17Degree:M.SType:Thesis
University:University of KansasCandidate:Jiang, RuoyiFull Text:PDF
GTID:2458390008982244Subject:Engineering
Abstract/Summary:
Determining anomalies in data streams that are collected and transformed from various types of networks has recently attracted significant research interest. Principal Component Analysis (PCA) is arguably the most widely applied unsupervised anomaly detection technique for networked data streams due to its simplicity and efficiency. However, none of existing PCA based approaches addresses the problem of identifying the sources that contribute most to the observed anomaly, or anomaly localization. In this paper, we first proposed a novel joint sparse PCA method to perform anomaly detection and localization for network data streams. Our key observation is that we can detect anomalies and localize anomalous sources by identifying a low dimensional abnormal subspace that captures the abnormal behavior of data. To better capture the sources of anomalies, we incorporated the structure of the network stream data in our anomaly localization framework. Also, an extended version of PCA, multidimensional KLE, was introduced to stabilize the localization performance. We performed comprehensive experimental studies on four real-world data sets from different application domains and compared our proposed techniques with several state-of-the-arts. Our experimental studies demonstrate the utility of the proposed methods.
Keywords/Search Tags:Data, PCA, Anomaly localization, Network
Related items