Multi-Objective Clustering Ensemble Based On Fast Nondominated Sorting Genetic Algorithm

Posted on:2023-05-28

Degree:Master

Type:Thesis

Country:China

Candidate:X Li

Full Text:PDF

GTID:2558307073983229

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Mining valuable information from vast amounts of data with rich structures and characteristics generated in the Internet era has become a hot and difficult area of research in machine learning today.During this period,clustering ensemble techniques have become the focus of research and are used in various industries due to their superior performance and unsupervised nature.Many clustering ensemble techniques are generally oriented towards the design of integration strategies based on the base clustering results,which makes it difficult to obtain better results when the base clustering results are generally poor.This thesis applies the idea of using both sample data and base clustering results in the clustering ensemble process to the proposed model and solves it by a genetic algorithm.The Fast Nondominated Sorting Genetic Algorithm for Multi-Objective Clustering Ensemble(NSGAMCE)is first proposed in this thesis.The model is designed to produce a multi-objective formulation set of consistent objective functions for the sample data and the base clustering results respectively,aiming to produce consensus guidance on both levels of the optimisation objective during the integrated optimisation solution.The model first transforms and defines the clustering ensemble task in the genetic algorithm,then proposes a reduction coding strategy and adaptive variation probability to solve the dimensional catastrophe and local search problems encountered by the genetic algorithm in the clustering ensemble solution process,and finally uses the genetic algorithm to solve the objective formula set.In this thesis,we also propose a Pairwise Constraints Guide Fast Nondominated Sorting Genetic Algorithm for Multi-Objective Clustering Ensemble Algorithm(pc NSGAMCE),which adheres to the idea of fusing sample data with base clustering results.Finally,the pairwise constraint information is incorporated into the clustering ensemble multi-objective formula set to guide the genetic algorithm to iteratively solve the problem.Finally,the two algorithms proposed in the thesis are experimented on public datasets.During the experiments,the classical and frontier clustering ensemble algorithms are selected for comparison,and the results are evaluated using accuracy,purity and normalized mutual information as evaluation metrics.The experimental results show that the two algorithms proposed in this thesis outperform other algorithms.

Keywords/Search Tags:

multi-objective clustering ensemble, genetic algorithm, pairwise constraints, label reduction coding, adaptive probabilistic, nondominated sorting

PDF Full Text Request

Related items

1	Elitist Nondominated Sorting Genetic Algorithm And Its Application
2	Application Of BP Neural Network Based Genetic Algorithm In Multi-Objective Optimizing The Drugs Component
3	Research On Semi-supervised Selective Clustering Ensemble
4	Semi-Supervised Dimensionality Reduction And Ensemble Learning For Multi-label Classification
5	Multi-objective Optimization Anycast Routing Algorithm
6	A Study On Semi-supervised Leaning Based On Genetic Algorithm
7	Research And Application Of Hardwaresoftware Partitioning Algorithm For Hybrid NSGA-Ⅱ And DE
8	The Research And Application Of Improved Adaptive Non-dominated Sorting Genetic Algorithm In Multi Objective Of Job Shop
9	Research And Application Of New Non-Dominated Individual Sorting In Multi-objective Evolutionary Algorithms
10	Research On Multi-Objective Clustering Ensemble And Its Application