Research On The Key Technology Of Heterogeneous Network Alignment

Posted on:2022-07-05

Degree:Master

Type:Thesis

Country:China

Candidate:Z X Ren

Full Text:PDF

GTID:2518306542483624

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

With the rapid development of science and technology,the Internet has gradually become an indispensable tool in people's daily lives.It is a powerful graph structure that can naturally capture the relationship between objects in massive data.In order to obtain more hidden information about the object,many graph mining tasks need to combine multiple network data for analysis.However,the primary problem faced by data aggregation in heterogeneous networks is "how to align objects of different platforms".Many downstream tasks are constructed after objects are aligned.For example,pattern matching of protein networks,user identification in social networks,cross-domain personalized recommendation and social link prediction,etc.Therefore,network alignment is of great significance for a wide range of applications.This paper has done the following two tasks for the alignment of heterogeneous networks:(1)At present,the solutions to most heterogeneous network alignment problems can be summarized into the following three steps: feature extraction,model construction and node matching.The first step is to extract the available information from the node's attribute information and network structure as the features of the node,which identifies the identity of the node.In the second step,statistical or vectorized representation of the features extracted in the previous step is used as input for the model construction stage.Then,a supervised or unsupervised model is trained according to the availability of anchor node pairs(ie,pre-known alignment node pairs).Finally,the trained model is used to predict whether the identities of the two nodes match.Currently,most network alignment methods only focus on the local structure or global structure of nodes in the feature extraction stage.This feature extraction method causes the lack of node information to a certain extent.Therefore,this paper proposes a multi-scale modeling mechanism,which not only builds the features of nodes on the global structure,but also strengthens the expressive ability of node information from the local structure,and enhances the richness of node features.At the same time,in the model training stage,we often face problems of high computational complexity.In order to reduce the computational complexity of the model training process,this paper introduces an implicit matrix decomposition method based on QR decomposition,which uses a small number of nodes to achieve the ability to express all nodes,which greatly reduces the computational complexity.(2)The implicit matrix factorization method often needs to filter out a small number of nodes,and use these nodes to establish an association relationship with all nodes.These selected small numbers of nodes are called landmark nodes.At present,models that use implicit matrix factorization to solve network alignment problems often use random sampling technology to select landmark nodes.This screening method treats all nodes equally,ignoring the particularity and importance of landmark nodes.Therefore,this paper proposes a sampling strategy based on node importance.According to the importance of nodes,this strategy selects the most representative nodes from all nodes as landmark nodes,and then builds their association with global nodes.The heterogeneous network alignment algorithm proposed in this paper can be widely used in different fields,such as social network user matching,link prediction,user portrait,friend recommendation,protein analysis,pattern recognition and so on in practical problems.In order to verify the effectiveness of the two algorithms proposed in this article,a large number of experiments were done on three different types,different scales,and different sparsity data sets,and three different evaluation indicators were used to compare the model from the perspective of prediction and ranking to evaluate.This kind of evaluation index evaluates the model from the perspective of prediction and ranking.Experimental results show that the two network alignment algorithms proposed in this paper have better performance.

Keywords/Search Tags:

node embedding, heterogeneous network alignment, matrix decomposition, unsupervised learning

PDF Full Text Request

Related items

1	The Study Of Heterogeneous Social Network Alignment Based On Network Representation Learning
2	Node Representation Learning Based Cross Network Node Correlation
3	Research On User Alignment Technology Based On Joint Embedding Of Heterogeneous Networks
4	Unsupervised Relation Extraction Based On Matrix Factorization
5	Research And Application Of Matrix Factorization Algorithms For Multi-source Heterogeneous Data
6	Research On Recommendation Technology Based On Heterogeneous Information Network Embedding
7	Personalized Research Paper Recommendation Based On Heterogeneous Network Embedding
8	Salient Object Detection Based On Unsupervised Learning
9	Research On Node User Alignment Model And Application Of E-commerce System On Cross Bookstore Platform
10	Research On Attributed Network Embedding Methods Based On Matrix Factorization