Research And Parallelization Of Community Detection Algorithm Based On Label Propagation Algorithm

Posted on:2019-04-12

Degree:Master

Type:Thesis

Country:China

Candidate:M L Yue

Full Text:PDF

GTID:2370330545470256

Subject:Computer Science and Technology

Abstract/Summary:

With the continuous development of social networks, community detection has become an important research hotspot in the field of complex networks. A complete network consists of several communities: connections between nodes within a community are relatively dense, while connections between nodes in different communities are relatively sparse. The label propagation algorithm (LPA) is a well-regarded community detection algorithm, and its linear time complexity is a great advantage. Despite these advantages, LPA has obvious shortcomings. Because labels are selected randomly, LPA cannot guarantee consistent results across runs. In addition, after repeated iterations, large communities may swallow small ones. To address these problems, two improved algorithms are proposed on the basis of LPA. The specific research results are as follows:

(1) Optimization and improvement of LPA. Since LPA does not contain any parameters, we mainly optimize label propagation and label updating. In PSLPA (Probability and Similarity based Label Propagation Algorithm), we combine the probability of label propagation with the similarity between nodes; moreover, an adaptive label selection is used to update node labels during label propagation. In WRWLPA (Weight and Random Walk based Label Propagation Algorithm), we propose a new similarity calculation method that combines weight and random walk; weight and similarity are then used to update labels in the label propagation stage. Both algorithms perform well in accuracy and stability.

(2) Parallelization. We parallelized both of the algorithms above using the GraphX module on the Spark platform. Each algorithm is recast as an iterative computation on the network graph, which is transformed through the existing API. For the label propagation process, a custom function is implemented to complete the parallelization of the algorithm. The parallel algorithms show high accuracy and stability on datasets of different scales.
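
For orientation, the baseline LPA that the abstract starts from can be sketched roughly as follows. This is a minimal Python sketch of the standard asynchronous label propagation scheme with random tie-breaking, not the thesis's PSLPA or WRWLPA and not its GraphX implementation. The function name `label_propagation`, the adjacency-dict input format, the `seed` and `max_iter` parameters, and the toy graph are all assumptions made for illustration.

```python
import random
from collections import Counter


def label_propagation(adj, seed=None, max_iter=100):
    """Baseline asynchronous LPA on an undirected graph (illustrative sketch).

    adj: dict mapping each node to a list of its neighbours (assumed format).
    Returns a dict mapping each node to its final community label.
    """
    rng = random.Random(seed)
    labels = {node: node for node in adj}  # every node starts as its own community
    nodes = list(adj)

    for _ in range(max_iter):
        rng.shuffle(nodes)  # visit nodes in a random order each sweep
        changed = False
        for node in nodes:
            if not adj[node]:
                continue  # isolated nodes keep their own label
            counts = Counter(labels[nb] for nb in adj[node])
            best = max(counts.values())
            candidates = [lab for lab, c in counts.items() if c == best]
            if labels[node] in candidates:
                continue  # current label is already among the most frequent
            labels[node] = rng.choice(candidates)  # random tie-break
            changed = True
        if not changed:
            break  # every node holds a most-frequent neighbour label
    return labels


# Hypothetical toy graph: two triangles joined by a single edge.
graph = {
    0: [1, 2], 1: [0, 2], 2: [0, 1, 3],
    3: [2, 4, 5], 4: [3, 5], 5: [3, 4],
}
communities = label_propagation(graph, seed=42)
```

Running the sketch with different `seed` values may return different partitions of the same graph, which is the inconsistency the abstract attributes to random label selection.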

Keywords/Search Tags:

Community Detection, Propagation Probability, Similarity, Parallelization

Related items

1	Research On Community Detection Algorithm Based On Node Similarity
2	Research On Community Detection In Complex Network Based On Label Propagation
3	Community Detection Algorithm Based On Seed Expansion And Its Parallelization
4	Research On Overlapping Community Detection Algorithms Based On Influence Propagation
5	Similarity-based Complex Network Community Detection Research
6	Research On Overlapping Community Detection And Community Evolution Analysis Method Based On Dynamic Social Network
7	Research On Community Detection Algorithm Based On Node Importance And Similarity
8	Research On Overlapping Community Detection Algorithm Based On SLPA
9	Research On Community Detection Algorithms In Complex Networks
10	Research On Complex Network Community Detection Algorithm Based On Affinity Propagation