Font Size: a A A

Key Techniques Of Digital Content Across The Terminal Publishing

Posted on:2014-01-13Degree:MasterType:Thesis
Country:ChinaCandidate:L ChangFull Text:PDF
GTID:2268330401489031Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
In today’s publishing industry, digital publishing develops the most rapidly and somecross-terminal publishing have become newly digital publishing formats, such as mobilephone publishing, moving publishing and so on. By comparison with traditional publishing,digital publishing has excellent quick query, mass storage, cheap cost, easy editing, andmore environmental protection, etc. As a result, more and more presses focus on thetechnology research and market development of digital publishing.PDF (Portable Document Format) is able to reproduce characters, colors and images ofthe manuscript’s and adopt industry standard compression algorithms which are easy tostore and transmit. Besides, PDF also includes the location information of importantstructures. For the above features, PDF has become one of the mainstream formats forsaving the electronic document in most presses at home and abroad. However, the PDF filelays emphasis on the description of the print format of files and do not describe the datastructure of the document content. So PDF cannot rebuild the document contentdynamically according to the size of the screen of the terminal equipment. Nowadays, akind of reading ways depending on the tablet PCs and smart phones has become themainstream one, which increases the technical demand on publishing the digital contentacross the terminal. Therefore, the digital content managing way based on PDF becomesthe bottleneck of publishing digital content across the terminal.XML,a data exchange standard, which is recommended by W3C, is a cross-platform,content-dependent technology in the Internet environment and also a content-oriented fileformat. So it is able to compensate for the deficiencies of the PDF file format in thesemantic description. Using XML as a document save format for the press makes digitalcontent publish across the terminal possible. XML describes the electronic documentinformation by structured way and can calculate the layout according to the giveninformation of the format when outputs. What’s more, XML can dynamically generate thelayout which fits the size of the current terminal screen. So XML is a tool which is suitableto describe file structure and content. In order to make use of documents across-terminaldigital content publish more effectively, it is necessary to convert PDF document into XMLdocument. Around the issues encountered when the press publishes digital contents across theterminal, this thesis focuses on the research of PDF document information extraction andcross-terminal adaptive restructuring techniques. In this thesis, an information extractionmethod for the test, pictures and vectors in the PDF document and layout structure analysisapproach are proposed. It publishes the digital content through terminal adaptivereorganization algorithm. Finally, on the basis of the algorithm, we design a system topublish digital contents across the terminal and apply it to the practical work in the press.
Keywords/Search Tags:PDF, XML, cross terminal adaptive recombination, layout informationextraction
PDF Full Text Request
Related items