Research on the Construction and Application of a Digital Rights Repository Based on Lucene

Posted on: 2014-02-28
Degree: Master
Type: Thesis
Country: China
Candidate: Y H Han
GTID: 2248330395998608
Subject: Computer application technology
Abstract/Summary:
With the rapid development of the Internet, the ways in which information is stored and transmitted have changed dramatically. Because digital resources spread easily, they bring great convenience but also pose unprecedented challenges for digital rights management. First, traditional storage methods for the copyright records of digital publishing resources cannot keep up with the growing volume of digital resources. Second, quickly finding useful information within a vast number of digital works is becoming increasingly difficult. It is therefore necessary to build a mass-storage resource repository that provides storage and fast search services for large numbers of digital works.

To store massive numbers of digital works, this thesis uses the Hadoop HDFS distributed file system. For fast search of resource information, it first uses Lucene full-text search to build an index over feature information and to search the index files, and then uses Elasticsearch to split the Lucene index into shards that are stored and searched in a distributed manner. Finally, the search engine offers users a friendly interface with the following functions: digital works information management, copyright information management, index creation, and fast search.

The main difficulties and innovations of this thesis are as follows: analyzing the characteristics of digital copyright resources and designing a cloud storage scheme that is easy to expand, highly fault-tolerant, and able to support massive data sets; studying Lucene and designing a full-text search solution for the metadata of digital rights resources; and using Elasticsearch to shard the repository's index files, achieving distributed indexing and distributed search for the digital copyright repository.

The main result of this thesis is a system that uses the Hadoop HDFS distributed file system to store and retrieve massive collections of works, and uses Lucene full-text indexing together with Elasticsearch sharding to achieve efficient indexing and fast retrieval. The outcome is an efficient, distributed digital rights repository that ensures safe and reliable storage of vast numbers of digital works, promotes the integration of resources across the digital copyright industries, reduces the cost of distributing digital works, and provides underlying support for the registration, recording, search, and detection of digital copyright information.
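The indexing and sharding idea described in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's implementation and does not use the Lucene or Elasticsearch APIs: the class names `Shard` and `ShardedIndex`, the metadata fields, and the CRC32-based routing are all assumptions chosen for illustration. Each shard holds a small inverted index (term to document IDs), documents are routed to a shard by hashing their ID, and a query is sent to every shard and the results are merged.

```python
import re
import zlib
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


class Shard:
    """One index shard: a toy inverted index mapping term -> set of doc IDs."""

    def __init__(self):
        self.postings = defaultdict(set)
        self.docs = {}

    def index(self, doc_id, metadata):
        # Store the metadata and add every term of every field to the postings.
        self.docs[doc_id] = metadata
        for value in metadata.values():
            for term in tokenize(str(value)):
                self.postings[term].add(doc_id)

    def search(self, query):
        # AND semantics: a document must contain every query term.
        terms = tokenize(query)
        if not terms:
            return set()
        result = set(self.postings.get(terms[0], set()))
        for term in terms[1:]:
            result &= self.postings.get(term, set())
        return result


class ShardedIndex:
    """Hypothetical coordinator: routes documents to shards, fans out queries."""

    def __init__(self, num_shards=3):
        self.shards = [Shard() for _ in range(num_shards)]

    def _route(self, doc_id):
        # Assumption: CRC32 modulo shard count stands in for a real routing hash.
        return zlib.crc32(doc_id.encode("utf-8")) % len(self.shards)

    def index(self, doc_id, metadata):
        self.shards[self._route(doc_id)].index(doc_id, metadata)

    def search(self, query):
        # Query every shard and merge the partial results.
        hits = set()
        for shard in self.shards:
            hits |= shard.search(query)
        return sorted(hits)
```

For example, after indexing a few works with `idx.index("w1", {"title": "Digital copyright law"})`, a call such as `idx.search("digital")` returns the sorted IDs of all works whose metadata contains that term, regardless of which shard holds them. A production system would add relevance scoring, replication, and persistent storage, none of which this sketch attempts.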
Keywords/Search Tags: digital rights repository, mass storage, inverted index, search engines, distributed search