Streaming knowledge bases

Posted on:2008-04-26

Degree:M.S

Type:Thesis

University:University of Maryland, Baltimore County

Candidate:Walavalkar, Onkar B

GTID:2448390005952470

Subject:Computer Science

Abstract/Summary:

A knowledge base can be thought of as a special kind of database for knowledge management: it provides a way to collect knowledge and retrieve it efficiently. With the growing deployment of sensors, we encounter many scenarios in which data flows constantly between sensors and applications. Both the volume of data produced and the rate of the data flow are large. In such scenarios, knowledge extraction boils down to finding useful information, i.e., detecting events of interest. Typical use cases where event detection is of paramount importance are surveillance, tracking, telecommunications data management, disease outbreak detection, and environmental monitoring. Many streaming database applications have been built to deal with these dynamic environments. Examples of query processors based on adaptive data flow are TelegraphCQ and the Aurora project.

With the emergence of the Semantic Web, we now have a universal medium for exchanging data, information, and knowledge. RDF graphs denote the relations and interactions between different entities or resources, and several popular, uniform data interchange formats have been developed to support them. Knowledge extraction in the Semantic Web amounts to carrying out inference on such RDF graphs. Existing tools such as Jena and Sesame are used for this task.

The Semantic Web can be used effectively to convert raw data generated in a pervasive environment into knowledge by means of semantic markup. We can use formats recommended by the Semantic Web, such as RDF/XML or N-Triples, to express streaming facts. In a pervasive environment, reasoning over such streaming facts becomes a challenging task. Existing reasoners load the whole RDF graph into main memory and run queries against it. This approach takes a considerable amount of time and is of little use for real-time reasoning in streaming scenarios.

We combine continuous query processors with Semantic Web techniques to build an "rdfs:subClassOf" reasoner that can deal with streaming data. Given an ontology, we pre-compute the transitive closure of all classes under the "rdfs:subClassOf" relationship and store the class-subclass relationships in a database table. At run time, we need only query the database to identify subclass events of the event of concern. Many applications already describe data in RDF-compatible formats. We feed streams of such RDF data to our query processor and carry out real-time "rdfs:subClassOf" reasoning on them.
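The pre-computation described in the abstract can be illustrated with a minimal sketch. This is not the thesis implementation. It assumes an in-memory SQLite table in place of the continuous query processor, and the table name `subclass_closure`, the `ex:` class names, and the helper functions are all hypothetical, chosen only for illustration.

```python
import sqlite3

RDF_TYPE = "rdf:type"

# Hypothetical ontology: direct (subclass, superclass) rdfs:subClassOf edges.
EDGES = [
    ("ex:ForestFire", "ex:Fire"),
    ("ex:Fire", "ex:Hazard"),
    ("ex:Flood", "ex:Hazard"),
]

# Hypothetical stream of (subject, predicate, object) facts.
STREAM = [
    ("ex:e1", RDF_TYPE, "ex:ForestFire"),
    ("ex:e2", RDF_TYPE, "ex:Flood"),
    ("ex:e3", RDF_TYPE, "ex:Sensor"),
    ("ex:e4", RDF_TYPE, "ex:Fire"),
]


def transitive_closure(direct_edges):
    """Return every (subclass, superclass) pair implied by the direct edges."""
    parents = {}
    for sub, sup in direct_edges:
        parents.setdefault(sub, set()).add(sup)

    closure = set()
    for start in parents:
        stack = list(parents[start])
        seen = set()
        while stack:
            sup = stack.pop()
            if sup in seen:  # also guards against cycles in the ontology
                continue
            seen.add(sup)
            closure.add((start, sup))
            stack.extend(parents.get(sup, ()))
    return closure


def build_table(conn, direct_edges):
    """Store the pre-computed closure in a database table (done once, offline)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS subclass_closure ("
        "subclass TEXT NOT NULL, superclass TEXT NOT NULL, "
        "PRIMARY KEY (subclass, superclass))"
    )
    conn.executemany(
        "INSERT OR IGNORE INTO subclass_closure VALUES (?, ?)",
        sorted(transitive_closure(direct_edges)),
    )
    conn.commit()


def matches(conn, event_class, class_of_interest):
    """True if event_class is the class of interest or one of its subclasses."""
    if event_class == class_of_interest:
        return True
    row = conn.execute(
        "SELECT 1 FROM subclass_closure WHERE subclass = ? AND superclass = ?",
        (event_class, class_of_interest),
    ).fetchone()
    return row is not None


def filter_stream(conn, triples, class_of_interest):
    """Yield subjects of incoming rdf:type facts that match the class of interest."""
    for subject, predicate, obj in triples:
        if predicate == RDF_TYPE and matches(conn, obj, class_of_interest):
            yield subject


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    build_table(conn, EDGES)
    print(list(filter_stream(conn, STREAM, "ex:Fire")))  # ['ex:e1', 'ex:e4']
```

Because the closure is materialized ahead of time, each incoming fact costs a single indexed lookup, and the RDF graph never has to be loaded into main memory at run time.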

Keywords/Search Tags:

Data, RDF, Streaming, Semantic web, Query

Related items

1	Top-k Query Processing Techniques Over Streaming Data
2	The Research On Structured Query Generation Framework Based On Semantic Query Graph
3	Research On Ontology-based Semantic Query Techniques
4	Linked Data Based Semantic Query System Of Traditional Chinese Medicine
5	Research On Streaming Data Event Acquisition Method Based On Multi-granular Top-k Query
6	Study On Methods To Real-time Query For Streaming Data
7	Research On Distributed RDF Query Processing
8	Semantic Query Processing Over Linked Data Knowledge Bases
9	Semantic EMR Data SPARQL Query Optimization Mechanisms
10	Design And Implementation Of Semantic Query For Structure Engineering Experiment Linked Data