Font Size: a A A

Research Of Digital Library Architecture Based On Hadoop

Posted on:2013-06-18Degree:MasterType:Thesis
Country:ChinaCandidate:X S LiuFull Text:PDF
GTID:2268330398498874Subject:Information Science
Abstract/Summary:PDF Full Text Request
The appearance of digital library brings people with great improvement, and it stores books information in the computer digitally and communicates through the computer network, making books information resource sharing, which has played a very important role in each field of people’s life. In the past decade, the construction work of the digital library has gotten some achievements, and to a certain extent it has meet people’s needs for personalized and more knowledgeable literature information, and it makes information storage space considerably reduced, information retrieval more convenient, the remote transmission of information reached and the purpose of information sharing achieved. With the rapid development of the computer network, Internet, information digitization and information storage technology, information resource grows rapidly, and more and more information is stored. However, along with the increase of digital books information, digital library turns out a series of problems in terms of storage, retrieval, security, system maintenance and so on, which lead to the bottleneck in the development of digital library.This paper analyzes the problems of the digital library, and the cloud computing system framework is discussed deeply, and the construction thoughts of the digital library based on hadoop is put forward. Hadoop is the open source implement frame of cloud computing, and Google proposed the programming ideas of GFS and mapreduce, which greatly improved the process of mass data information. Aiming at Google’s GFS and mapreduce, Apache Open Source organization developed Hadoop, a distributed computing open source framework which is Java implementation of mapreduce in essence, and it allows program automatic distribution to an oversized cluster composed with ordinary machines and implement concurrently. This paper makes a deep research to Hadoop architecture, and analyzes the implementation mechanism of Hadoop. On that basis, the digital library system based on Hadoop is designed, and some of the main funptional modules are realized. Finally, this paper introduces the construction of Hadoop’s experimental environment in detail, and analyzes mass data processing with Hadoop.
Keywords/Search Tags:Cloud Computing, Digital Library, Hadoop, Distributed File System, Distributed Computing
PDF Full Text Request
Related items