Font Size: a A A

XML-Based Storage And Query Optimization Of Biologic Information

Posted on:2006-05-02Degree:MasterType:Thesis
Country:ChinaCandidate:S Y CengFull Text:PDF
GTID:2178360185463487Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
Bioinformatics is an emerging research field, which is developing rapidly. Biological data produced by experiments is very huge. It becomes an important subject how to collect, store, analyze and publish biological data. Today, biological data is generally expressed by XML, and improving the performance of XML query on biological data is especially important.We have developed a biological data management platform MyWorkSpace, which deals with the gathering, storing, management, analysis and release of the biological data in the HLPP project. The XML representation of original biological data is implementated in the data transformation module of MyWorkSpace. MyWorkSpace log management system was used to manage the biological data's gathering and storing, and the privilege management ensures the security of the log information. The web release of the biological data offers convenient access and quick queries by the users, and its privilege management ensures the security of the biological experimental data. The web query of the biological experimental data offers a query method that locates the batch data for the users. In order to improve the query performance on the biological data, this thesis analyzes the characteristics of the biological data, and proposes a path index method based on "Lucene", and presents the path expression optimization method based on schema and an optimization method on the query algebra of XQuery. We tested the aforementioned query methods respectively, and did performance evaluation.
Keywords/Search Tags:XML, Biological Data, XQuery, Index, Path Expression
PDF Full Text Request
Related items