On The Standardized Meta-data Conversion For Chinese Ancient Books: Research And Applications

Posted on: 2012-02-29
Degree: Master
Type: Thesis
Country: China
Candidate: J X Rao
GTID: 2218330362956289
Subject: Communication and Information System
Abstract/Summary:
The traditional approach to surveying ancient books consumes a great deal of human and material resources. It records information mainly in traditional carriers such as CNMARC (China Machine-Readable Catalogue), ancient book census forms, declarations of national precious ancient books, and Access databases. Because local conditions differ and are uneven, the same data is often recorded inconsistently. In addition, the traditional carriers are themselves coarse-grained, so exchanging data between formats is difficult. As a result, ancient book data is very inconvenient to disseminate, view, compile statistics on, and retrieve.

This paper studies the standardized metadata conversion of Chinese ancient books, starting from an analysis of these problems. After designing and implementing the standardized metadata conversion software, we test it in detail and discuss prospects for the digitization of Chinese ancient books.

Within the Digital Service Platform of Ancient Book project, work that was previously done offline moves online through Web technology. By modeling the ancient book metadata, adopting object-oriented design, and storing the data in a relational database, the ancient book data becomes fine-grained metadata. The 380,000 existing CNMARC records of the National Library are parsed and converted into digital ancient book metadata. Using POI, 2,500 existing ancient book census forms and 100 declarations of national precious ancient books are parsed and converted into digital ancient book metadata as well. By connecting to the Access database via JDBC (Java Database Connectivity), 1,800 Access records are also converted into digital ancient book metadata.

In addition, with Lucene indexing 300,000 ancient book records for retrieval, the average retrieval time is less than 1 second, and PDF ancient book catalogues are generated automatically through computer statistics and scheduling. Both features are based on the ancient book metadata model.

Through this series of studies, the ancient book data is converted into a uniform format of fine-grained metadata, which is easier to present in diverse forms and very convenient for statistics. The application of Web technology saves a large amount of resources and makes the Chinese ancient book metadata very convenient to disseminate and view. Because a mature indexing technology is used, searching is fast and meets the needs of practical applications.
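To make the CNMARC parsing step concrete, the following is a minimal sketch, not the thesis software, whose design the abstract does not describe. It assumes the CNMARC records are stored in the ISO 2709 exchange format (a 24-byte leader, a directory of 12-byte entries, then the field data) and that the title proper sits in field 200, subfield $a, as in UNIMARC-style catalogues. The class `MarcSketch` and its methods `parseRecord`, `subfield`, and `buildRecord` are hypothetical names introduced only for illustration, and the sample record is made up.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper (not from the thesis): reads one ISO 2709 record.
final class MarcSketch {
    static final char SUBFIELD = '\u001F';   // subfield delimiter
    static final char FIELD_END = '\u001E';  // field terminator
    static final char RECORD_END = '\u001D'; // record terminator

    /** Parses one record into tag -> raw field values (indicators included). */
    static Map<String, List<String>> parseRecord(byte[] record) {
        String leader = new String(record, 0, 24, StandardCharsets.US_ASCII);
        int base = Integer.parseInt(leader.substring(12, 17)); // base address of data
        Map<String, List<String>> fields = new LinkedHashMap<>();
        // The directory starts right after the leader and ends with a field terminator.
        for (int pos = 24; record[pos] != FIELD_END; pos += 12) {
            String entry = new String(record, pos, 12, StandardCharsets.US_ASCII);
            String tag = entry.substring(0, 3);
            int length = Integer.parseInt(entry.substring(3, 7));  // bytes, incl. terminator
            int start = Integer.parseInt(entry.substring(7, 12));  // offset from base
            String data = new String(record, base + start, length - 1, StandardCharsets.UTF_8);
            fields.computeIfAbsent(tag, k -> new ArrayList<>()).add(data);
        }
        return fields;
    }

    /** Returns the first value of the given subfield code, or null if absent. */
    static String subfield(String field, char code) {
        String[] parts = field.split(String.valueOf(SUBFIELD));
        for (int i = 1; i < parts.length; i++) { // parts[0] holds the two indicators
            if (!parts[i].isEmpty() && parts[i].charAt(0) == code) {
                return parts[i].substring(1);
            }
        }
        return null;
    }

    /** Builds a small record from {tag, data} pairs so the parser can be tried out. */
    static byte[] buildRecord(String[][] tagAndData) {
        StringBuilder directory = new StringBuilder();
        ByteArrayOutputStream body = new ByteArrayOutputStream();
        for (String[] f : tagAndData) {
            byte[] bytes = (f[1] + FIELD_END).getBytes(StandardCharsets.UTF_8);
            directory.append(f[0]).append(String.format("%04d%05d", bytes.length, body.size()));
            body.write(bytes, 0, bytes.length);
        }
        directory.append(FIELD_END);
        int base = 24 + directory.length();
        int total = base + body.size() + 1;
        String leader = String.format("%05dnam  22%05d   450 ", total, base);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] head = (leader + directory).getBytes(StandardCharsets.US_ASCII);
        out.write(head, 0, head.length);
        out.write(body.toByteArray(), 0, body.size());
        out.write(RECORD_END);
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // Made-up sample: control field 001 and a title field 200 with $a and $f.
        byte[] record = buildRecord(new String[][] {
            {"001", "012345"},
            {"200", "1 " + SUBFIELD + "aChu ci" + SUBFIELD + "fWang Yi"}
        });
        Map<String, List<String>> fields = parseRecord(record);
        System.out.println(subfield(fields.get("200").get(0), 'a'));
    }
}
```

Working on bytes rather than characters matters here because ISO 2709 directory lengths and offsets are byte counts, so multi-byte Chinese text would shift character-based offsets. A real converter would also need to handle the character encoding actually used by the source records, which this sketch simply assumes to be UTF-8.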
Keywords/Search Tags:Digitization of Ancient Book, Ancient Book Meta-data, Meta-data Conversion, CNMARC(China Machine-Readable Catalogue)