Font Size: a A A

The Design And Implementation Of Plugin For Web Simplified Andtraditional Forms Of Chinese Characters Conversion

Posted on:2015-01-13Degree:MasterType:Thesis
Country:ChinaCandidate:R J YanFull Text:PDF
GTID:2268330428982825Subject:Computer technology
Abstract/Summary:PDF Full Text Request
Chinese language has thousands of years of history, it is not only the Chinese cultural heritage, it is the communication tools between the Chinese and the world. While Hong Kong and Taiwan and mainland Chinese still use different systems due to these historical reasons, which caused a huge obstacle for both sides of the three places on communication and exchange of information. With the development of network technology, the Internet browser must be become a software in order to build a better platform for the exchange of information.This paper achieved Simplified and Traditional Chinese conversion in the form of web browser plug-in. Since the IE browser has a high market share, so we choice IE10which is now the most popular.Common web Chinese encoding include GB2312, GBK, UTF-8, BIG-5, mostly in mainland China is based on GB2312, GBK, UTF-8, while Taiwan, Hong Kong and some overseas Chinese communities is more use BIG-5. Which, GB2312character set only contains simplified, GBK and UTF-8can be displayed simultaneously simplified and traditional Chinese characters, BIG-5character set only contains traditional Chinese characters. Based on the research and analysis of various types of encoding, web page Chinese Simplified and Traditional conversion concreted into two categories:Simplified and Traditional Chinese within the same kind of coding and conversion between different encoding.Encoding between different character are not identical, the two-way translation index must be established to achieve data conversion between different character. Have to use some existing transcoding and Simplified and Traditional conversion tools to query and batch conversion.Taking into account of the conversion efficiency problem, this paper use the separate way for storing simplified coding and Traditional coding, and the use of efficient hashing algorithm to find a replacement. After the plug-in registered, the conversion way is chosen when users browse the Web pages to be displayed, the system will automatically crawl the web content of the document, identifying the page encoding, self-judgment Simplified and Traditional conversion program to convert, and then return the translated pages finally.
Keywords/Search Tags:Web page, Simplified and Traditional, Coding, Plug-in
PDF Full Text Request
Related items