
Study On And Establishment Of Protein Sequence GO Annotation Database

Posted on: 2009-01-22 | Degree: Master | Type: Thesis
Country: China | Candidate: W Yang | Full Text: PDF
GTID: 2178360278964185 | Subject: Computer application technology
Abstract/Summary:
Bioinformatics databases, which store the source data of bioinformatics together with various kinds of annotation results, help biologists analyze and explore the biological meaning of that data and are therefore of great importance. However, building database application systems around the functions and characteristics of bioinformatics analysis and transaction processing has received little attention in the computer industry. Reports from internationally renowned biological institutions mostly emphasize only their data collection methods and the bioinformatic significance of the data. Few documents address the storage model of the database system or its adaptability to data updates and changes. To better support research and development of bioinformatics database systems, a sound approach is to take a specific large-scale biological information analysis process, analyze and abstract from it the characteristics and functional requirements of a bioinformatics database system, and then develop the corresponding strategies and technologies. Accordingly, after summarizing the characteristics of bioinformatics databases, this thesis establishes a protein sequence GO annotation database system. By analyzing the annotation system's processing flow, its original data, and its results, the database requirements are obtained, including the corresponding transaction processing requirements. A storage model is established with the annotation tool at its core, and its storage functions are validated and tested against the requirements and the business model analysis. Separate storage of sequences, XML representation of tree data structures, and MySQL data storage techniques are used as database optimization strategies, improving the efficiency of data import and retrieval.
An automatic data update mechanism is established for the database system, so that its data can be kept synchronized with international public data. Because bioinformatics databases are similar to one another, the protein sequence annotation database also lays a solid foundation for building larger-scale genome databases...
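The two storage ideas named in the abstract, keeping sequences in their own table and expressing tree-structured annotation data as XML, can be illustrated with a minimal sketch. This is not the thesis's actual schema: the table names, column names, and helper functions below are hypothetical, the accession used in any call is up to the reader, and Python's built-in sqlite3 module stands in for MySQL so the example stays self-contained.

```python
import sqlite3
import xml.etree.ElementTree as ET


def build_go_tree_xml(term, children_of, names):
    """Recursively encode a GO term and its descendants as nested XML elements."""
    node = ET.Element("term", id=term, name=names[term])
    for child in children_of.get(term, []):
        node.append(build_go_tree_xml(child, children_of, names))
    return node


def create_schema(conn):
    """Hypothetical schema: sequence residues live apart from protein metadata."""
    conn.executescript(
        """
        CREATE TABLE protein (
            id INTEGER PRIMARY KEY,
            accession TEXT UNIQUE NOT NULL,
            description TEXT
        );
        CREATE TABLE protein_sequence (
            protein_id INTEGER PRIMARY KEY REFERENCES protein(id),
            residues TEXT NOT NULL
        );
        CREATE TABLE go_annotation (
            protein_id INTEGER PRIMARY KEY REFERENCES protein(id),
            go_tree_xml TEXT NOT NULL
        );
        """
    )


def insert_protein(conn, accession, description, residues, go_tree):
    """Insert metadata, sequence, and the XML-serialized GO tree for one protein."""
    cur = conn.execute(
        "INSERT INTO protein (accession, description) VALUES (?, ?)",
        (accession, description),
    )
    protein_id = cur.lastrowid
    conn.execute(
        "INSERT INTO protein_sequence (protein_id, residues) VALUES (?, ?)",
        (protein_id, residues),
    )
    conn.execute(
        "INSERT INTO go_annotation (protein_id, go_tree_xml) VALUES (?, ?)",
        (protein_id, ET.tostring(go_tree, encoding="unicode")),
    )
    return protein_id


def go_ids_for(conn, accession):
    """Return the GO ids annotated to a protein without reading its sequence."""
    row = conn.execute(
        "SELECT a.go_tree_xml FROM go_annotation a "
        "JOIN protein p ON p.id = a.protein_id WHERE p.accession = ?",
        (accession,),
    ).fetchone()
    if row is None:
        return []
    # iter() walks the tree depth-first, starting with the root term itself.
    return [el.get("id") for el in ET.fromstring(row[0]).iter("term")]
```

In this sketch, annotation lookups never touch the potentially long residue strings, which is one plausible reason separating sequence storage could speed up retrieval; whether the thesis achieves its gains this way is not stated in the abstract.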
Keywords/Search Tags: bioinformatics database, requirement analysis, tree data structure, database version updating