Font Size: a A A

Research On Named Entity Intergration In Dataspace

Posted on:2013-08-26Degree:MasterType:Thesis
Country:ChinaCandidate:J H WangFull Text:PDF
GTID:2248330392450545Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
The appeareance of the computer and the development of network technologymake people can share information conveniently, and the rapid development ofmodern information technology makes it become more frequently. What peopleface is no longer a stable of information, but massive data with rapidly growing.Due to derive from different sources, the data, with different forms, couldn’t beused in an efficiently way. In order to make full use of the existing data resourcesand reduce the Repetition work of data collection, pepole need a way to intergratedata from multiple distributed autonomous sources with heterogeneous data. Thisneeds us to establish a universal and feasible distributed integrated method forheterogeneous data sources.From the file system to the database technology, andto the development of the integration technology, they still can not satisfypeople’s requirements for the big data management. A new big data managementconcept, called Dataspace, is now coming up.As a kind of heterogeneous data management Technology, dataspace managesdata in a Pay-as-you-go way. Essentially, great deals of data are the descriptiveinformations of objects in the real world. That is to say, all the data are exactlythe descriptive informations to the named entities in data sources. The purpose of user accessing these sources is mainly to querying the entities informations. Thus,with the extraction and integration of named entity and its descriptioninformations in dataspace, users can manage and get access to their data moreeffective.According to the characteristics of the data, this paper propses a method fornamed entity integration in dataspace. And through the extraction of data space ofdata source contains named entity and its integrated model, provides the functionbased on entity data inquires. The main contributions of my research are in thefollowing aspects:1) Put forward an integration model for named entity and its description of theinformation in dataspace;2) Propose the mapping and intergration methods of heterogeneous data sourcesand named entity in dataspace system;3) Propose a method of entity resolution for dataspaces.4) Based on the achievement of our research team’s research, this articleimplements a dataspace prototype system.
Keywords/Search Tags:Dataspace, Named Entity, Integration
PDF Full Text Request
Related items