Font Size: a A A

The Design And Implementation Of Chinese Chronicle Literature System In CADAL

Posted on:2012-06-03Degree:MasterType:Thesis
Country:ChinaCandidate:Z C YeFull Text:PDF
GTID:2178330332976265Subject:Computer applications
Abstract/Summary:PDF Full Text Request
China has a long history of civilization for thousands of years, which has generated great number of ancient, modern and contemporary literature. But, with the time passing by, a lot of famous works are lost in the history. In addition, handling literature in such a great number is really a difficult and complex problem, either in further protection or in repair and research. Thus makes study literary with the help of information technique even more urgent.CADAL Chinese Chronicle Literature System aims at constructing a research information system in Chinese literature history, which deems finding potential value of Chinese literature, history, geographical information and other related digital resource in CADAL as its basic foundation. Compared with other traditional literary and historical system, it does not just simply digitize the resources. Instead, it creates the relation between these resources and increases the utilization rate, with the techniques of database, geographic information, multimedia processing, semantic web, mass storage.We mainly do work in three aspects in building CADAL Chinese Chronicle literature system. Firstly, we divided the system into three application platforms:the basic literary and historical information retrieval platform, the historical and geographical information platform and,multimedia application platform by analyzing the actual needs and resources in CADAL, then do the design work for every module. Secondly, we finish key technical design in detail, which includes completing data extraction and reconstruction from OCR database by the means of web crawler and regular express, building full text search engine with Lucene framework, constructing geographical information platform with CAMAME and GIS data in shapefile format and multimedia application platform with CCA and other multimedia feature retrieval technique. In this paper, we give detailed description in these three aspects.
Keywords/Search Tags:Digital Library, Information Extraction, Machine Learning, GIS
PDF Full Text Request
Related items