
Research And Implement Of Document Management System Based On Hadoop Storage

Posted on: 2014-04-09
Degree: Master
Type: Thesis
Country: China
Candidate: D Y Ma
Full Text: PDF
GTID: 2268330422963245
Subject: Communication and Information System
Abstract/Summary:
With the rapid development of Internet and cloud storage technology, methods of file management have changed significantly, which has driven the rapid growth of network drives. Network drives have changed how people manage files, but document management systems for the mass data of small and medium-sized enterprises have lagged behind, which highlights the need for a suitable document management system.

Based on an analysis of the functional and performance requirements of a document management system, this thesis designs the overall framework of the system, an open package structure, a file upload scheme, and an optimization scheme for user information management, and then implements each of them. In the overall scheme, the overall functional structure is designed, and the servers are deployed according to that structure. In the open package structure, the bottom-level interfaces are exposed as shared resources that can be called remotely. This solution also packages the files, which makes the system more convenient to load. For file upload, the thesis formulates a method of dynamic file segmentation and implements multi-threaded upload. An incremental algorithm is applied to resume transmission from the breakpoint, which improves the efficiency of file upload. Combined with a message queue management service, offline file transfer to the Hadoop distributed storage server is implemented. The thesis also establishes a two-stage index between user and file information, converts structured data into semi-structured tree-directory data for storage, and implements maintenance and analysis of the semi-structured XML data.

The present study designs a Hadoop-based document management system that combines the advantages of an open structure, efficient file uploading, and faster querying of user information. As verified by functional and performance testing, the document management system meets the goal of managing mass data documents for small and medium-sized enterprises.
Keywords/Search Tags: Document management, Hadoop storage, Segmentation upload, XML data storage
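
The abstract does not give the details of the segmentation or resume algorithms, so the following is only a minimal sketch of the general idea: split a file into segments whose size depends on the file size, upload the segments from several threads, and on a retry send only the segments that have not yet arrived. All names here (`choose_segment_size`, `plan_segments`, `FakeStore`, `upload`) and the sizing rule are hypothetical, and an in-memory object stands in for the storage server; none of this is taken from the thesis itself.

```python
import math
import threading
from concurrent.futures import ThreadPoolExecutor


def choose_segment_size(file_size, min_size=4, max_segments=8):
    """Assumed sizing rule: split into at most max_segments parts,
    but never use segments smaller than min_size bytes."""
    return max(min_size, math.ceil(file_size / max_segments))


def plan_segments(file_size, segment_size):
    """Return a list of (index, offset, length) tuples covering the file."""
    return [
        (i, off, min(segment_size, file_size - off))
        for i, off in enumerate(range(0, file_size, segment_size))
    ]


class FakeStore:
    """In-memory stand-in for the storage server (hypothetical).
    Segments listed in fail_on fail once, to simulate a broken transfer."""

    def __init__(self, fail_on=()):
        self.parts = {}
        self.fail_on = set(fail_on)
        self.lock = threading.Lock()

    def put(self, index, data):
        with self.lock:
            if index in self.fail_on:
                self.fail_on.discard(index)
                raise IOError(f"simulated failure on segment {index}")
            self.parts[index] = data

    def assemble(self):
        """Concatenate the stored segments in index order."""
        return b"".join(self.parts[i] for i in sorted(self.parts))


def upload(data, store, done, workers=4):
    """Upload every segment whose index is not in `done`, using a thread
    pool. Returns `done` updated with the segments that succeeded."""
    segments = plan_segments(len(data), choose_segment_size(len(data)))
    pending = [seg for seg in segments if seg[0] not in done]

    def send(seg):
        index, off, length = seg
        try:
            store.put(index, data[off:off + length])
            return index
        except IOError:
            return None  # leave this segment for a later retry

    with ThreadPoolExecutor(max_workers=workers) as pool:
        for index in pool.map(send, pending):
            if index is not None:
                done.add(index)
    return done
```

Calling `upload(data, store, set())` once and then calling it again with the returned set resumes the transfer: the second call skips every segment already recorded as done. A real system would persist that record and talk to an actual storage service rather than `FakeStore`.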