Constraint-based clustering procedure for data envelopment analysis

Posted on:2006-10-31

Degree:Ph.D

Type:Dissertation

University:North Dakota State University

Candidate:Majadat, Hassan Mohammad

Full Text:PDF

GTID:1458390008470582

Subject:Computer Science

Abstract/Summary:

This dissertation integrates two important fields of information technology, data mining and data envelopment analysis (DEA), to provide a new tool for measuring the performance of decision making units (DMU). The DEA is a powerful performance measurement methodology for assessing the relative efficiency of DMUs. This methodology determines the efficient and inefficient DMUs in order to gain valuable information for making further improvements such as identifying the savings in expenditures and the best suitable way to distribute services which will eventually improve the productivity of the entire system. There are two typical assumptions in the DEA: (1) the DEA assumes that all DMUs are homogenous and identical in their operations, and (2) the DEA is deterministic and that leads to inaccurate efficiency assessment in the presence of outliers or unusual observations.; Many investigations have dealt with the DEA models, but few have focused on heterogonous DMUs, outlier detection, and scalability over large datasets. In this dissertation, a comprehensive model is presented. We introduce a new constraint-based clustering method for early detection of outliers to evaluate the performance scores of non-homogenous DMUs. In this method, DMUs dissimilar to the DMU under evaluation are labeled as outliers and are excluded from the analysis. This work removes the extra effort needed to predefine the dissimilarity parameters or the number of DMUs to be excluded.; Experimental results of our approach show big improvements in assessing the transportation system funding for school districts in the state of North Dakota. An extensive analysis is provided to show the characteristics of our method and how it compares with different models in terms of the quality of results. The performance of these school districts is measured several times using different economical models to get the most suitable view of the situation.; The dissertation starts with the investigation of the parametric and non-parametric performance measurements along with advantages and shortcomings of these metrics. Then, a detailed analysis of outlier detection algorithms in data mining is provided. Finally, a method called the clustering-based DEA is developed.

Keywords/Search Tags:

DEA, Data, Method

Related items

1	Data-driven Based Controller Parameters Tuning For Wire Bonders
2	H-KTT Clustering Method And Its Applications In Analyzing Large-Scale AMI Data
3	Data-Driven Based Controller Parameters Tuning For Wire Bonders
4	Research And Application Of Imbalanced Data Processing Algorithm
5	Research On Data Quick Unloading Method Based On IRIG106 Chapter 10 Standard
6	Application Of Geometric Iteration Method In Fitting Discrete Data Points
7	Synthesis Of Data-driven Controller Based On Multirate Sampled-data System
8	The Compile Method Of XML And The Research Of Developing Toolsthe Method To Deal With The Separation Of The Data Content And The Data Presentation By Using XML In Wed-based Applications
9	The Class-Mean Method And Its Extensions To Handling Incomplete Data In Data Mining
10	Research On Classification Algorithm Of Medical Diagnostic Data Based On Kernel Method