Based On The Attribute Table Of Rdf Data Storage System Research

Posted on:2014-02-15

Degree:Master

Type:Thesis

Country:China

Candidate:C K Tao

Full Text:PDF

GTID:2248330395495514

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Semantic web is an extension to current World Wide Web. By including semantic content in web pages, machines can understand and process the knowledge inside. RDF is a knowledge representation model used in semantic web. It is widely adopted as semantic web develops. With the increasing amount of RDF data, efficient storage of large scale RDF data becomes challaging. This paper investigates the storage technologies of RDF property table, especially property selection algorithms and dynamic adjustment algorithms.Previous researches indicate that different RDF datasets and workloads require different storage strategies. No existing RDF storage method can perform well in all scenarios. Property table are considered promising because they can change property table schemas according to workloads. A property selection algorithm must be assigned when using property table. Most existing methods are applying algorithms designed for other fields, such as Apriori algorithm used in data mining, vertical partitioning algorithm used in distributed database systems, etc. This paper proposes a new property selection algorithm designated for RDF data. The new algorithm can not only select property based on workloads, but also reduces the join operations.On the basis of property selection algorithm, this paper designs property table adjustment strategies that can make use of the latest query information while the system is running. Because adjustment operations of property tables often have high costs, most existing algorithms create the property tables offline. This paper proposes an algorithm that can judge the system’s load level. The algorithm uses the idea of PID controller. By measuring the requests and responses, it can qualitatively tell whether the system is idle or not. Moreover, this paper proposes an incremental schema adjustment algorithm. The change is made at the attribute level and only when the system is idle. In this way the impact of property table dynamic adjustment can be reduced.Finally, this paper adds the property table stores functionality to Jena, an open source semantic toolkit. The query processing module in Jena is modified to redirect possible data accesses to property tables. To evaluate the performance of property table stores in real application environments, this paper also adds user number simulation to SP2Bench SPARQL benchmark tool. Experiments on the modified Jena and SP2Bench show that the new property table property selection algorithm and adjustment timing algorithm can significantly improve query performance.

Keywords/Search Tags:

RDF data, property table, storage system, PID controller, Jena

PDF Full Text Request

Related items

1	The Design Of Portable Digital Storage Oscilloscope Table Based On ARM
2	Research On Multi-Tenant Data Storage Mechanism Based On Universal Table In SaaS
3	Research On Multi-tenant Data Storage Mode Of SaaS Application Based On Universal Table
4	Research On Multi-tenant Data Storage Mechanism Based On Universal Table
5	Study On Multi-Tenant Data Storage And Data Migration On Basic-Table Combined With Extension-Table Schema
6	Extraction From The Web Table Of Contents Based On Ontology And Implementation
7	Research On Storage Optimization For OpenFlow Flow Table Of Data Plane In Software Defined Networking
8	Research And Design Of Secure Storage And Retrieval System For Medical Big Data
9	Table-mapping cone-beam X-ray tomography algorithm and two-photon data storage system
10	Research And Design Network Data Transfer Controller Based On SOC