
Research on a Detection Algorithm for Mathematical Formulas in MathML

Posted on: 2017-03-14
Degree: Master
Type: Thesis
Country: China
Candidate: J N Guo
GTID: 2308330485456851
Subject: Operational Research and Cybernetics
Abstract/Summary:
With the rapid development of the Internet, the exchange of academic research results has become increasingly digital and networked, and resources are shared ever more widely. At the same time, academic plagiarism is on the rise, so research has focused on techniques for detecting duplication in academic papers. Great progress has been made in detecting duplicated text, and such tools are now widely used, but the detection of mathematical formulas is still at an exploratory stage. In academic papers, especially in science and engineering, the key ideas are often expressed in mathematical formulas. Research on formula detection is therefore of great significance for preventing academic plagiarism.

This paper proposes a mathematical formula detection algorithm for MathML. First, mathematical formulas are located in the document according to the coding characteristics of MathML, a tree representation of each formula is generated from its encoding, and the tree is normalized using a rule base and the AC algorithm. Second, a preorder traversal of the tree normalizes the variable names, and a level-order traversal yields the structure code. The formula database is then searched for a table named by that structure code. If the table exists, the tree is traversed to obtain its preorder and postorder traversal sequences. Finally, the table is searched for records whose traversal sequences match those of the query formula; if such records exist, the corresponding document information is displayed.

The experimental results show that the algorithm detects mathematical formulas encoded in MathML. It handles formulas that share the same structure as well as formulas that share the same semantics, and it is fast and practical.
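The abstract does not give the thesis's actual encoding, so the following is only a minimal Python sketch of the general pipeline it describes: parse MathML into a tree, rename variables in preorder, derive a structure code by level-order traversal, and compare preorder and postorder sequences. Everything specific here is an assumption made for illustration: the function names, the `v0, v1, ...` renaming scheme, the "tag name plus child count" structure code, and the in-memory dictionary that stands in for the formula database and its per-structure tables. The rule-base and AC normalization step is not reproduced.

```python
import xml.etree.ElementTree as ET
from collections import deque


def local(tag):
    """Strip an XML namespace prefix such as '{http://...}mi' down to 'mi'."""
    return tag.split("}", 1)[-1]


def normalize_variables(root):
    """Rename each distinct <mi> identifier to v0, v1, ... in preorder.

    Hypothetical scheme: the thesis's own renaming rules are not given.
    """
    names = {}
    for node in root.iter():  # Element.iter() visits nodes in preorder
        if local(node.tag) == "mi":
            text = (node.text or "").strip()
            if text not in names:
                names[text] = f"v{len(names)}"
            node.text = names[text]
    return root


def structure_code(root):
    """Level-order traversal; each node contributes its tag and child count.

    Assumed encoding, chosen only so that equal codes imply equal tree shape.
    """
    parts = []
    queue = deque([root])
    while queue:
        node = queue.popleft()
        parts.append(f"{local(node.tag)}{len(node)}")
        queue.extend(node)  # enqueue children left to right
    return "_".join(parts)


def label(node):
    """Node label used in traversal sequences: tag, plus text for leaves."""
    text = (node.text or "").strip()
    return f"{local(node.tag)}:{text}" if text else local(node.tag)


def preorder(node):
    out = [label(node)]
    for child in node:
        out.extend(preorder(child))
    return out


def postorder(node):
    out = []
    for child in node:
        out.extend(postorder(child))
    out.append(label(node))
    return out


def fingerprint(mathml):
    """Return (structure code, preorder sequence, postorder sequence)."""
    root = normalize_variables(ET.fromstring(mathml))
    return structure_code(root), tuple(preorder(root)), tuple(postorder(root))


def add_formula(index, mathml, doc_info):
    """Store doc_info under the 'table' named by the formula's structure code."""
    code, pre, post = fingerprint(mathml)
    index.setdefault(code, {}).setdefault((pre, post), []).append(doc_info)


def lookup(index, mathml):
    """Return document info for stored formulas that match the query."""
    code, pre, post = fingerprint(mathml)
    return index.get(code, {}).get((pre, post), [])


# Usage: x^2 + y and a^2 + b differ only in variable names, so they match.
stored = "<math><mrow><msup><mi>x</mi><mn>2</mn></msup><mo>+</mo><mi>y</mi></mrow></math>"
query = "<math><mrow><msup><mi>a</mi><mn>2</mn></msup><mo>+</mo><mi>b</mi></mrow></math>"
index = {}
add_formula(index, stored, "doc-001")  # "doc-001" is a made-up document id
print(lookup(index, query))
```

Looking up by structure code first narrows the search to formulas with the same tree shape, so the more expensive sequence comparison runs only within that group. Formulas that are semantically equal but structurally different would match only after the normalization step that this sketch omits.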
Keywords/Search Tags: MathML, mathematical formula, duplicate checking detection, structure code, tree, XML