Font Size: a A A

The Maximal Frequent Pattern Mining Of Dna Sequence

Posted on:2011-03-03Degree:MasterType:Thesis
Country:ChinaCandidate:S BaiFull Text:PDF
GTID:2198330332473851Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Bioinformatics is mathematics, statistics, life sciences, computer science and information science and other disciplines that constitute the compound subject, is currently a hot research subjects. Data mining is realized through computer statistics and artificial intelligence techniques in some of the algorithms, mainly for the data from the mass of various laws to find out the hidden, computer data analysis is currently the most effective technology. Biological data of biological sequence data as one of the most important data, the data mining in biological sequence data analysis and processing is currently the researchers are most concerned about the areas of research. Functional identification of sequence elements and found that the relationship between the sequence of biological sequence data mining is the most important task. Biological sequence pattern mining as a biological sequence data mining of the most important research, interpretation of the discovery of functional elements sequence plays a vital role in its function. The DNA sequence data of biological sequence data is the most important one. Data mining technology in DNA sequence data analysis of DNA sequence data processing is a major attempt.At present, how to develop the design of effective DNA sequence analysis of the data mining algorithm is the most important DNA sequences, we can consider the following two aspects:first, the combination of biology background and knowledge of related areas designed for mining algorithm, so dig out the biological interpretation of the results of a more sufficient to meet the needs of the practical application of biology; Second, to fully consider the sequence of DNA sequence data different from the general characteristics of the data, on this basis, DNA sequences designed for data mining algorithms to improve the efficiency of algorithms.To improve the biological efficiency of sequential pattern mining algorithms and performance analysis of this paper, the existing sequential pattern mining algorithm is practical and efficiency of the algorithm. Some problems on the algorithm, combined with the characteristics of DNA sequences, a new DNA sequence pattern mining algorithm-JMPS, experiments show that the algorithm is feasible and effective.
Keywords/Search Tags:bioinformatics, data mining, biological sequence data mining, sequential pattern, DNA sequence
PDF Full Text Request
Related items