| The incidence and mortality of cancer are increasing, and the key to its prevention and treatment is early diagnosis and early treatment. However, because cancer is so complex, elucidating its mechanisms remains a difficult problem worldwide. Cancer research based on big data is currently one of the important approaches to revealing these mechanisms. In this study, we used data from TCGA covering 11 cancer types: breast invasive carcinoma, colon adenocarcinoma, colorectal adenocarcinoma, head and neck squamous cell carcinoma, kidney renal clear cell carcinoma, kidney renal papillary cell carcinoma, liver hepatocellular carcinoma, lung adenocarcinoma, lung squamous cell carcinoma, stomach adenocarcinoma, and thyroid carcinoma. The genome mutation data, the transcriptome data, and the integrated data of these cancer patients were analyzed separately. The results are as follows. We first analyzed the cancer genome and found that the causes of the high heterogeneity of cancer cell populations in advanced cancer may differ from those in early cancer. We then studied the changes in gene co-expression between normal and tumor tissues and found that the co-expression correlation between transcription factors and their target genes was generally lower in cancer tissues. We also identified universal and cancer-specific transcription factor-target gene pairs that may have important effects on cancer. Some of these gene pairs, such as XRCC5-XRCC6, have been reported to be associated with the progression of multiple cancers. Others have not previously been reported in the literature: IRF1-PSMB9 may be involved in the development of lung and renal cancers, and CENPA-NEK2 may be associated with the development of renal, gastric, and thymic cancer. These genes therefore have great potential to become new cancer markers or targets for cancer therapy. By integrating genome mutation data with differential expression data from the transcriptome, we identified candidate cancer driver genes. For example, ASB10 may be a driver gene for breast invasive carcinoma, while CD180 may be a driver gene for colorectal cancer. Finally, we propose a possible mechanism for the changes in gene co-expression in cancer tissues: the general decline in co-expression may arise because some gene mutations disrupt the original gene network. In this study, we explored the occurrence and development of cancer from a pan-cancer perspective and identified new candidate markers for pan-cancer diagnosis and treatment, which may provide an important theoretical basis for the future prevention and treatment of cancer. |