A parameter selection framework for semi-supervised clustering algorithms

Posted on: 2014-12-16
Degree: M.S
Type: Thesis
University: University of Alberta (Canada)
Candidate: Pourrajabi, Mojgan
Full Text: PDF
GTID: 2458390005992374
Subject: Computer Science
Abstract/Summary:
Many clustering techniques require parameter settings, and depending on an algorithm's sensitivity to a parameter, the choice of its value can be very important. Several approaches have been proposed to find the "best" value of the clustering parameter for different unsupervised clustering methods.

We introduce a general method, denoted as "Cross-validation framework for finding clustering parameters" (CVCP). Given a data set, CVCP selects the "best" parameter value for a semi-supervised clustering method based on the constraints or labels that are available as input to that method. CVCP is evaluated on selecting the "best" value of k for a semi-supervised k-means-based clustering algorithm and the "best" value of MinPts for a semi-supervised density-based clustering algorithm. Our experimental results show that using the framework to select parameters can significantly improve the expected performance of a semi-supervised clustering method in settings where appropriate parameter values would otherwise have to be "guessed".
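
The abstract does not spell out how CVCP works internally, so the following is only a minimal sketch of the general idea (cross-validating a clustering parameter against held-out labels), not the thesis's implementation. Everything specific here is an assumption made for illustration: the function names `seeded_kmeans`, `pairwise_agreement` and `select_parameter` are hypothetical, a toy seeded k-means stands in for the semi-supervised clustering method, the score is a simple pairwise agreement between held-out labels and cluster assignments, and the data are three synthetic blobs.

```python
import random
from itertools import combinations

def dist2(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean(pts):
    return tuple(sum(c) / len(pts) for c in zip(*pts))

def seeded_kmeans(points, train_labels, k, rng, iters=25):
    """Toy stand-in for a semi-supervised method (assumption, not from the thesis):
    centroids start at the means of the labelled training groups."""
    groups = {}
    for i, lab in train_labels.items():
        groups.setdefault(lab, []).append(points[i])
    centroids = [mean(g) for _, g in sorted(groups.items())][:k]
    while len(centroids) < k:          # more clusters than known classes
        centroids.append(rng.choice(points))
    assign = [0] * len(points)
    for _ in range(iters):
        assign = [min(range(k), key=lambda c: dist2(p, centroids[c]))
                  for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:
                centroids[c] = mean(members)
    return assign

def pairwise_agreement(assign, test_labels):
    """Fraction of held-out pairs whose same/different-cluster status
    matches their same/different-label status (assumed score)."""
    pairs = list(combinations(sorted(test_labels), 2))
    ok = sum((assign[i] == assign[j]) == (test_labels[i] == test_labels[j])
             for i, j in pairs)
    return ok / len(pairs)

def select_parameter(points, labels, candidates, n_folds=3, seed=0):
    """Return the candidate with the best mean held-out score, plus all scores."""
    rng = random.Random(seed)
    idx = sorted(labels)
    # Deterministic interleaved folds; a fuller version would shuffle or stratify.
    folds = [idx[f::n_folds] for f in range(n_folds)]
    scores = {}
    for k in candidates:
        fold_scores = []
        for fold in folds:
            test = {i: labels[i] for i in fold}
            train = {i: labels[i] for i in idx if i not in test}
            assign = seeded_kmeans(points, train, k, rng)
            fold_scores.append(pairwise_agreement(assign, test))
        scores[k] = sum(fold_scores) / n_folds
    return max(candidates, key=lambda k: scores[k]), scores

# Synthetic example: three well-separated blobs, four labelled objects per blob.
offsets = [(0, 0), (1, 0), (0, 1), (1, 1), (0.5, 0.5), (0.2, 0.8)]
centres = {"a": (0, 0), "b": (10, 0), "c": (0, 10)}
points, truth = [], []
for name, (cx, cy) in centres.items():
    for dx, dy in offsets:
        points.append((cx + dx, cy + dy))
        truth.append(name)
labels = {i: truth[i] for i in range(len(points)) if i % 6 < 4}

best_k, scores = select_parameter(points, labels, candidates=[2, 3, 4])
print(best_k, scores)
```

In this sketch the labelled objects are never used for both guiding the clustering and scoring it within the same fold, which is the point of cross-validating the parameter. The same loop would apply to a MinPts-style parameter by swapping in a density-based method for `seeded_kmeans`.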
Keywords/Search Tags: Clustering, Parameter, Value, Framework