Modular Feature Matching Method Of High Dimensional Omics Data Based On Network Analysis

Posted on:2022-06-25

Degree:Master

Type:Thesis

Country:China

Candidate:J Huang

Full Text:PDF

GTID:2504306569980799

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Accurately identifying the interactions between genomic factors and the response of cancer drugs plays important roles in drug discovery,drug repositioning and cancer treatment.A number of studies revealed that interactions between genes and drugs were ‘many-genes-tomany drugs’ interactions,i.e.common modules,opposed to ‘one-gene-to-one-drug’interactions.Such modules fully explain the interactions between complex biological regulatory mechanisms and drugs.Therefore,this thesis makes a review of the existing research algorithms and proposes a new modular feature matching method of high dimensional omics data based on network analysis.First,this article introduces the significance of research and application value of common module identification.After that,from the perspective of machine learning,it provides a detailed evaluation of three types state-of art common module identification methods,including methods based on non-negative matrix factorization,partial least squares and network analysis.Subsequently,in view of the shortcomings of these methods,this paper proposes a new high-dimensional omics data modular feature matching model based on network analysis and gives a solution to the model.The model uses high-order similarity tensor,hypergraph prior knowledge network constraints and sparse constraints to jointly optimize obtaining common modules.Finally,the experimental results and analysis are carried out using experiments on simulated data and real world data.Through the comparison of two sets of experiments on noise contamination and outlier interference,it is proved that the model proposed in this paper has good properties in both scenarios.The real experiment use 2091 gene expression data and 101 drug response data on 392 cell lines.After comparing with a number of cutting-edge methods for biological validation,it is proved that the method proposed in this article has good properties and the output results have biological significance.The main contributions of the model proposed in this paper are: 1)Using high-order similarity tensor,which reflecting the many-to-many relationship weakens the interference of noise and outliers in the input data.2)The acquisition of the common module is integrated into the iterative optimization of the objective function,which solves the shortcomings of the decoupling strategy of the current methods.3)Use hypergraph to fuse multiple prior knowledge network,which improves the effect of prior knowledge constraint compared with a single prior knowledge network.

Keywords/Search Tags:

Gene-drug interactions, Common module, Non-negative matrix factorization, Partial least squares, Network analysis

PDF Full Text Request

Related items

1	Two Types Of Gene-drug Co-module Identify Algorithms
2	Feature Extraction Of Cancer Gene Expression Data Based On Non-negative Matrix Factorization
3	Tumor Dna Microarray Data Classification Based On Non-negative Matrix Factorization
4	Research On Prediction Methods Of Disease-related MiRNAs Based On Non-negative Matrix Factorization
5	Algorithm For Layer-specific Module Detection In Multi-layer Cancer Networks
6	Collaborative Matrix Factorization For Predicting Drug-Target Interactions
7	Study On The Brain Network Of Epilepsy Based On Non-Negative Matrix Factorization And Non-Negative Tensor Decomposition
8	Non-negative Matrix Factorization Algorithm To Deal With The Cancer Gene Expression Data
9	Multimodal Medical Image Fusion Based On Non-negative Matrix Factorization
10	Lung Data Processing Based On Non-negative Matrix Factorization