Study On Two Key Techniques In Archive Digitalization

Posted on:2008-05-25

Degree:Master

Type:Thesis

Country:China

Candidate:F Zhang

Full Text:PDF

GTID:2178360272967820

Subject:Systems Engineering

Abstract/Summary:

PDF Full Text Request

In recent years, archive processing technology has been developing unimaginably towards the direction of digital, informational and networking at very fast speed. Traditional paper-based archival processing methods to some extent limit the sharing of files and information inquiries,large qualities of archive brings about new challenge to the trend. Focused on two key techniques in archive information process: symbol recognition and search mapping, a comprehensive and in-depth discuss was made in this thesis from three aspects as theoretical foundation, application methods, and analysis simulation.Symbol recognition is the base and core of the entire process. In traditional barcode-based information recognition application, massive archive files burden archive workers as well as barcode attaching is a rather complex work and error prone. Meanwhile, barcode undermine the original appearance of the files. Symbol recognition technique made full use of pattern classification and neural network as core technique, file scanning image processing technique as the basic principle, symbol as separator between two files, manual preprocessing to guarantee correspondence, which successfully replaced original barcode, lowed down redundancy in archive database, improved efficiency of the inquiries, and brought out considerable convenience to the following step: search mapping.Search mapping is the goal and end-result of the whole process. Traditional paper-based archival retrieval method is no doubt of low efficiency. When facing large amounts of unrelated data, like Internet Web information retrieval, archive information retrieval is also more and more challenged. Apply modern internet retrieval technique into archive information retrieval; make full use of text mining as basis, brought forward the concept of correlation degree between archives, which makes automatic clustering between archives possible to follow. Meanwhile, I use PageRank algorithm of search engine Google for reference and provide different ranks of priority in face of the users, which thereby means"full, accurate, fast"searching goal is coming true, and it's also a successful application of network searching technique into archive information retrieving.Via modeling and simulation to the application technique, real archive data was made used of as training samples, testing result was exported, integrated evaluation indicators of this system was also established, which facilitated optimizing the system. In the end, a full summarization and conclusion of the key techniques was made, mentioned where should be ameliorated, and a solid foundation for the next step: establishing distributed sharing archive information platform was in the meantime constituted.

Keywords/Search Tags:

symbol recognition, search mapping, pattern classifying, text mining

PDF Full Text Request

Related items

1	Web Concept Mining Based On Text Layer Model
2	Classifying maritime near-miss and injury report using text mining
3	Research And Realization Of Topic Extraction Based On Text Mining
4	Research Of Text Mining And Application In Topic Search
5	Mining Users' Interests Based On Search Logs
6	Research On The Evolution Process Of Additive Manufacturing Technology Based On Text Mining And Pattern Recognition
7	Classifying and searching hidden-web text databases
8	System Design Of Real-time RGB Data Comparison TV Symbol Recognition Based On Matlab
9	The Research Of The Gradual Chinese Text Classifying Technology
10	Web Text Mining Research Based On Subject-oriented Search Engine