Latency is an ever-increasing component of data access costs, which in turn are often the bottleneck in modern high-performance systems. The ability to predict future data accesses is essential to any attempt to address this problem, and we present a novel model for gathering and utilizing data access predictions. Prior attempts to utilize access predictions have taken the form of a single predictive engine that preemptively fetches data. We offer a more powerful model that separates access prediction from the data retrieval mechanism. Predictions are made on a per-file basis and used to provide a minimal amount of additional metadata, which in turn is used by a grouping mechanism to automatically associate related items. This approach allows truly opportunistic use of predictive information, with few of the timing restrictions of prior approaches. Our research covers access prediction, grouping based on predictions, and a discussion of predictability and its meaning in the context of I/O behavior.

We present two predictors: Noah, named for its prediction of pairs, and Recent Popularity, a majority-voting mechanism. We distinguish the goal of predicting the most events accurately (general accuracy) from the goal of offering the most accurate predictions (specific accuracy). Both predictors can trade the number of events predicted for accuracy. Trace-based evaluation demonstrates that their error rates can be reduced to less than 2% for more than 60% of all access requests. Predictions are used to provide a minimal amount of per-file additional metadata, which is then used separately by our grouping mechanism.

To demonstrate the usefulness of grouping, we present the aggregating cache, which manages distributed file system caches based on groups built from our successor predictions. Trace-driven results demonstrate that grouping can reduce LRU demand fetches by 50% to 60%.
When we consider the effects of intervening caches, we observe dramatic gains for our predictive cache. Our treatment includes information-theoretic results that justify our approach, as well as graphical explanations of the effects of caches on workload predictability (cache-frequency plots) and of relative predictor performance (rank-difference plots).
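To make the majority-voting idea concrete, the following Python is an illustrative sketch of a per-file successor predictor that votes over each file's recent successors. The class name, the parameters `j` and `k`, and the deque-based history are our assumptions for illustration, not the implementation evaluated above; raising `j` (or shrinking `k`) is one way to trade the number of events predicted for accuracy.

```python
from collections import Counter, defaultdict, deque

class RecentPopularitySketch:
    """Illustrative majority-voting successor predictor.

    For each file we keep the last k observed successors and predict a
    successor only when one candidate appears at least j times in that
    window; otherwise we decline to predict.
    """

    def __init__(self, j=3, k=4):
        self.j, self.k = j, k
        # file name -> sliding window of its last k successors
        self.history = defaultdict(lambda: deque(maxlen=self.k))
        self.last_access = None

    def access(self, name):
        """Record an access; return a successor prediction or None."""
        if self.last_access is not None:
            # name is the observed successor of the previous access
            self.history[self.last_access].append(name)
        self.last_access = name
        votes = Counter(self.history[name])
        if votes:
            candidate, count = votes.most_common(1)[0]
            if count >= self.j:  # only predict when confidence is high
                return candidate
        return None
```

With `j=3`, the predictor stays silent on the access stream `a, b, a, b, a, b` and only starts predicting `b` as the successor of `a` once `b` has followed `a` three times.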
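To illustrate how grouping can cut demand fetches, here is a minimal sketch of a group-aware LRU cache: on a demand miss it also pulls in the missed file's predicted group, so later accesses to related files hit without further demand fetches. The class, its group table, and the counting of demand fetches are hypothetical simplifications, not the aggregating cache itself.

```python
from collections import OrderedDict

class GroupLRUCacheSketch:
    """Illustrative LRU cache that prefetches a file's group on a miss."""

    def __init__(self, capacity, groups):
        self.capacity = capacity
        self.groups = groups          # name -> list of related names (assumed given)
        self.cache = OrderedDict()    # insertion/recency order: oldest first
        self.demand_fetches = 0

    def _insert(self, name):
        self.cache[name] = True
        self.cache.move_to_end(name)  # mark as most recently used
        while len(self.cache) > self.capacity:
            self.cache.popitem(last=False)  # evict least recently used

    def access(self, name):
        if name in self.cache:
            self.cache.move_to_end(name)
            return 'hit'
        self.demand_fetches += 1      # only the missed file costs a demand fetch
        self._insert(name)
        for member in self.groups.get(name, []):  # prefetch the whole group
            if member not in self.cache:
                self._insert(member)
        return 'miss'
```

Given a group `{'a': ['b', 'c']}`, a single demand fetch of `a` brings in `b` and `c` as well, so the subsequent accesses to `b` and `c` hit in the cache.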