An efficient algorithm for joining large XML documents

Posted on:2004-07-01

Degree:M.Sc

Type:Thesis

University:University of Guelph (Canada)

Candidate:Zhou, Wei

Full Text:PDF

GTID:2468390011962904

Subject:Computer Science

Abstract/Summary:

XML is becoming the major markup language in developing heterogeneous distributed databases. Data from different sources can be encoded as XML documents and processed together. Join is one of the most important database operations for processing data together. XML documents have special features that make them different from relational data. Most join techniques developed for relational databases cannot be directly adopted for processing XML data. Efficient join algorithms are needed for building high performance XML databases. This thesis describes an efficient algorithm for joining large XML documents. This algorithm scans the data only one or two times. It creates a set of supporting structures then performs join in main memory or by direct disk access. It does not require any existing index structures, and is not dependent on the support from database software (e.g. an RDBMS).

Keywords/Search Tags:

XML, Data, Join, Efficient, Algorithm

Related items

1	An efficient algorithm for joining large XML documents
2	Efficient Similarity Join Over Probabilistic Data Streams Based On Earth Mover's Distance
3	Efficient join processing in spatial database systems
4	Design And Optimize Big-Data Join Algorithms Using MapReduce
5	Energy-Efficient Join Algorithms In Multi-Memory Hierarchies
6	Complete overflow management algorithm to join large relational tables
7	Research And Implementation Of Efficient Log Information Extraction Platform Based On Flink
8	Efficient Star Join For Column-Oriented Data Store In The MAP Reduce Environment
9	Research And Implementation Of High Efficiency Set Similarity Join Algorithm Based On Overlap Similarity
10	Implementation And Evaluation Of Big Data Parallel Join Algorithms