Extraction And Addition Of Text Information Of Print Data Based On The Print Command

Posted on: 2016-11-08
Degree: Master
Type: Thesis
Country: China
Candidate: P R Li
Full Text: PDF
GTID: 2308330473958512
Subject: Computer technology
Abstract/Summary:
With the spread of the Internet, the concept of big data emerged, and with it a variety of data collection methods. Beyond common approaches such as web crawlers and filtration cards, demand for collecting data from printers is also growing. Data collection is the foundation of big data analysis. It first appeared in automatic control and environmental monitoring in the industrial era, later extended to the field of electronic evidence, and now, as a basic step of data analysis, occupies a pivotal position in the Internet field. Internet data come from many sources, including client records, system logs, network traffic monitoring, e-mail messages, hard disk files, browser cache data, chat records and so on. Demand for printer data collection arose relatively late. It came with Internet payment, and especially after the O2O (online-to-offline) concept became popular, many stores began to adopt online payment as a business strategy. For stores where customers consume first and pay afterwards, access to the customer's consumption information is essential. To add payment capabilities to an existing customer management system, the only practical starting point is the bill-printing step: integrating with the management systems themselves is too complicated, so consumption information can only be obtained by analyzing the printed customer bill (the print buffer file), since there are far fewer kinds of printers than kinds of management systems.
Therefore, from the perspective of general-purpose software, consumer information can be collected starting from the printer, and translating the various print commands becomes increasingly urgent. Starting from the printing mechanism of the Windows operating system, this thesis takes as its research objects the standard Windows spool file (EMF) and the printer-specific spool file (using PostScript print commands as an example). Building on existing work on command parsing, it proposes an EMF file parsing algorithm based on DRAW 16, as well as an algorithm for converting standard text or pictures into PostScript print commands. For the problems encountered in EMF vector character recognition, specific solutions are put forward, including the selection of the training set and test set, vector feature extraction, and the recognition of multi-character records. Converting standard text or pictures into print commands involves several issues, such as PostScript coordinate transformation, resolution setting and character creation. This work helps to effectively solve the translation of, and addition to, this type of printer command.
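The PostScript coordinate transformation and resolution setting mentioned above can be illustrated with a minimal sketch. It is not the algorithm proposed in the thesis. It assumes the source position is given in device pixels with the origin at the top-left (as in Windows GDI), and relies only on the default PostScript user space, which uses units of 1/72 inch with the origin at the bottom-left. The function names, the 300 dpi default, the US Letter page height (792 pt) and the Courier font are hypothetical choices made for illustration.

```python
def pixel_to_ps_points(x_px, y_px, dpi, page_height_pt):
    """Map a top-left-origin pixel position to PostScript points.

    Default PostScript user space has 72 units per inch and a
    bottom-left origin, so x is scaled and y is scaled and flipped.
    """
    x_pt = x_px * 72.0 / dpi
    y_pt = page_height_pt - y_px * 72.0 / dpi
    return x_pt, y_pt


def escape_ps_string(text):
    """Escape the characters that are special inside a PostScript (...) string."""
    return text.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")


def text_to_postscript(text, x_px, y_px, dpi=300, page_height_pt=792.0,
                       font="Courier", size_pt=10):
    """Return PostScript operators that place `text` at the given pixel position.

    Simplification (an assumption of this sketch): the converted point is
    used as the text baseline origin; font metrics are not taken into account.
    """
    x_pt, y_pt = pixel_to_ps_points(x_px, y_px, dpi, page_height_pt)
    return (
        f"/{font} findfont {size_pt} scalefont setfont\n"
        f"{x_pt:.2f} {y_pt:.2f} moveto\n"
        f"({escape_ps_string(text)}) show\n"
    )


if __name__ == "__main__":
    # One inch from the left and top edges of a Letter page at 300 dpi.
    print(text_to_postscript("Total: 42", 300, 300))
```

Appending text to an existing print job would additionally require inserting these operators before the page's `showpage`, which depends on how the driver structures its output and is not covered here.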
Keywords/Search Tags: PostScript, EMF, Spooling, Print Order