Font Size: a A A

Design And Implementation Of Document Analysis And Translation Engine Of WEB Page

Posted on:2010-12-21Degree:MasterType:Thesis
Country:ChinaCandidate:Z H ZhaoFull Text:PDF
GTID:2248330395462532Subject:Software engineering
Abstract/Summary:PDF Full Text Request
With the background study of Network Machine Translation System and HTML as study object, this paper has carried out the thorough research and discussion for the Network Machine Translation oriented document analysis processing technique. This paper makes analysis for HTML form and introduces the mechanism of knowledge base into the Network Machine Translation, Proposes an analysis method based on the Self learning and rules. Grounded on this new idea, HTML Machine Translation System is designed and implemented.HTML Machine Translation System include three mode:HTML Analysis Module, Translation Engine Module, HTML Revert Module。HTML Analysis Module gain text and tags through analysis HTML document, and hand down text to Translation Engine Module which convert text into translation, Translation HTML is generate by combine translation and tags in HTML Revert Module.Translation Engine is basis of HTML Translation System, the quality of translation is depends on translation analysis is right or not. According to one existing typical Translation Engine, This paper is describing a process of Translation Engine.This system is base on knowledge base which is made up of HTML tags, not only can modularize this system and improve system’s portability, but also will enhance performance of the system through manage this base. Practice shows that, the introduction of Knowledge base increase applicability and expansion for the Machine Translation.Via the constantly test and improve to lots of HTML document, this system achieve the anticipate results, and proved that the WEB document analysis method is correct, practical and feasible.
Keywords/Search Tags:WEB Page Analysis, Network Machine Translation, Knowledge Base
PDF Full Text Request
Related items