Research And Implementation Of Partial Indexing Mechanism In Personal Dataspace Management System

Posted on: 2013-12-10  Degree: Master  Type: Thesis
Country: China  Candidate: X J Wang  Full Text: PDF
GTID: 2248330371977824  Subject: Computer technology
Abstract/Summary:
With the development of internet technology, the data we face is no longer purely structured; it exhibits new features: content, diversity, and sharing. Traditional data management technology cannot handle these features well. As a new data management technology, Dataspace can satisfy increasingly complex data management needs because of its loose data model and its pay-as-you-go integration method, which gradually forms the mode of the Dataspace. This paper studies the partial index mechanism in a personal Dataspace management system. Our main contributions can be summarized as follows: (1) This paper proposes a partial index based on the vector space model in Dataspace and applies automatic abstracting in the partial index mechanism. Because we use a partial index rather than full-text indexing, we reduce the cost of establishing and maintaining the index. (2) In the process of extracting the abstract, this paper transforms text into the vector space model and uses tf*idf and an improved method of reducing the feature set to cluster. This transforms complex problems into vector calculations and obtains a better response time. This paper compares the recall ratio and precision of the semantic-based index and the index based on VSM (vector space model) to demonstrate the feasibility and validity of the VSM-based index...
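The idea of a partial index built on tf*idf weights can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the abstract describes extracting an abstract by clustering, whereas this sketch substitutes a much simpler stand-in, keeping only the k highest-weighted terms of each document. All names here (tokenize, tfidf_vectors, build_partial_index, search, and the sample documents) are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(token_lists):
    """Return one {term: tf*idf} dict per tokenized document."""
    n = len(token_lists)
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in token_lists for term in set(tokens))
    vectors = []
    for tokens in token_lists:
        tf = Counter(tokens)
        vectors.append(
            {t: (c / len(tokens)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return vectors


def build_partial_index(docs, k=2):
    """Index only the k highest-weighted terms of each document.

    Assumption: top-k term selection stands in for the automatic
    abstracting step; it is not the clustering method of the thesis.
    """
    names = list(docs)
    vectors = tfidf_vectors([tokenize(docs[name]) for name in names])
    index = defaultdict(dict)
    for name, vec in zip(names, vectors):
        # Sort by descending weight, then by term for a stable order.
        top = sorted(vec.items(), key=lambda kv: (-kv[1], kv[0]))[:k]
        for term, weight in top:
            index[term][name] = weight
    return dict(index)


def search(index, query):
    """Rank documents by dot product with a binary query vector."""
    scores = defaultdict(float)
    for term in set(tokenize(query)):
        for name, weight in index.get(term, {}).items():
            scores[name] += weight
    return [name for name, _ in sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))]


# Hypothetical sample documents, for illustration only.
docs = {
    "a": "dataspace index index query",
    "b": "music photo album photo",
    "c": "dataspace email email contact",
}
index = build_partial_index(docs, k=2)
print(sorted(index))
print(search(index, "index query"))
```

In this sketch the term "dataspace" appears in two of the three documents, so its tf*idf weight is low and it never enters the partial index. That is the trade-off the abstract points to: the index is cheaper to build and maintain than a full-text index, but queries on terms that were dropped return nothing, which is why recall and precision need to be measured.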
Keywords/Search Tags: Dataspace, partial index, vector space model, automatic abstracting