Research And Implementation Technology Based On Document Copy Detection

Posted on: 2011-06-24
Degree: Master
Type: Thesis
Country: China
Candidate: W Sun
Full Text: PDF
GTID: 2178330332962390
Subject: Computer software and theory
Abstract/Summary:
Document copy detection determines whether a document has been copied from one or more other documents. "Copy" here covers not only full copying but also partial copying, exchange copying (parts swapped around), similar descriptions, and so on. After analyzing how plagiarism occurs and reviewing the known algorithms, this thesis proposes two frameworks, the Paper Similar Evaluation Frame (PSEF) and SSEF, together with the "Parallel Similar Model (PSM)" algorithm, which is an implementation of these frameworks. PSM addresses moved copying, which is difficult to detect with the basic cosine algorithm. Word segmentation is used to split a paper, which resolves the boundary problem that occurs in COPS, and comparison is used instead of a fixed number, which resolves the problem of manually set thresholds. Word segmentation is also used to clean sentences, making detection more accurate. The frameworks are implemented in the Java programming language using the known LCS algorithm. The results of the related experiments show that these methods produce the expected results.
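The abstract does not give the details of PSM or of how LCS is applied, so the following is only a minimal sketch of the general idea of scoring two passages with LCS. It assumes that LCS means longest common subsequence over word tokens. The class name LcsSimilarity, the whitespace tokenizer (a stand-in for real word segmentation), and the normalisation by the shorter passage are all hypothetical choices made for illustration, not the thesis's method.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class LcsSimilarity {

    // Stand-in for word segmentation (assumption): lowercase and split on whitespace.
    static List<String> tokenize(String text) {
        String trimmed = text.trim().toLowerCase();
        if (trimmed.isEmpty()) {
            return Collections.emptyList();
        }
        return Arrays.asList(trimmed.split("\\s+"));
    }

    // Length of the longest common subsequence of two token lists,
    // computed with the standard dynamic-programming table.
    static int lcsLength(List<String> a, List<String> b) {
        int[][] dp = new int[a.size() + 1][b.size() + 1];
        for (int i = 1; i <= a.size(); i++) {
            for (int j = 1; j <= b.size(); j++) {
                if (a.get(i - 1).equals(b.get(j - 1))) {
                    dp[i][j] = dp[i - 1][j - 1] + 1;
                } else {
                    dp[i][j] = Math.max(dp[i - 1][j], dp[i][j - 1]);
                }
            }
        }
        return dp[a.size()][b.size()];
    }

    // Hypothetical score in [0, 1]: LCS length divided by the length
    // of the shorter token list. Returns 0 if either list is empty.
    static double similarity(List<String> a, List<String> b) {
        int shorter = Math.min(a.size(), b.size());
        if (shorter == 0) {
            return 0.0;
        }
        return (double) lcsLength(a, b) / shorter;
    }

    public static void main(String[] args) {
        List<String> original = tokenize("the quick brown fox jumps");
        List<String> suspect = tokenize("the brown fox quickly jumps");
        System.out.println("LCS length: " + lcsLength(original, suspect));
        System.out.println("Similarity: " + similarity(original, suspect));
    }
}
```

Because a subsequence must preserve word order, a plain LCS score like this one drops when copied text is moved to a different position, which is the kind of case the abstract says PSM is designed to handle.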
Keywords/Search Tags:Copy Detection, PSEF, SSEF, LCS, JAVA