Font Size: a A A

An algebraic foundation for automatic semantic data integration on the hidden Web

Posted on:2010-02-19Degree:Ph.DType:Thesis
University:Wayne State UniversityCandidate:Hosain, Md. ShazzadFull Text:PDF
GTID:2448390002972785Subject:Computer Science
Abstract/Summary:
Semantic integration of the hidden Web is an emerging area of research where traditional assumptions about schema do not always hold and semantic heterogeneity poses serious challenge. Constant changes, conflicts and sheer size in the world of hidden Web demand integration techniques that rely on autonomous detection and resolution heterogeneity, correspondence establishment and information extraction strategies. First it needs to automate those techniques and then to integrate those techniques or sub-systems automatically into a single system. Though many such sub-systems have been automated, to our knowledge, there is no integrated framework for combining those technologies automatically. Our idea is to exploit the flexibility and strengths of a declarative language and the first step of such a language is to give an algebraic foundation that takes various integration techniques into consideration. In this thesis, we present an algebraic language, called Integra, as a foundation for an SQL like query language such as BioFlow for the integration of Life Sciences data on the hidden Web. The algebra presented here assumes that all web pages can be thought of as traditional relations and the integration techniques can be considered as user defined functions. These assumptions make it possible for us to extend the traditional relational algebra to include integration primitives such that a database with traditional relations reduces to a special case in our model. The algebra relies on a schema matching function mu, a key discovery function k, a wrapper or extraction function eta and two new operators link and combine that embody the well known concepts of horizontal and vertical integration.
Keywords/Search Tags:Integration, Hidden web, Algebraic, Foundation, Traditional
Related items