Font Size: a A A

Research And Implement Of The Computer-Aided Copy Detection System For Document

Posted on:2009-11-23Degree:MasterType:Thesis
Country:ChinaCandidate:X L YanFull Text:PDF
GTID:2178360245965496Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
As the digital resources of network is becoming richer and the change of way people store and get information caused by network, the digital documents get more and more easy to obtain and the duplication of documents becomes more and more easy. In recent years, the plagiarism of academic paper has been found in the press repeatedly; the redundant homepage in Internet has increased day by day which cut retrieval efficiency and brought difficulty to the user.The technology of documents copy detection has been put forward to prevent illegal copy and spread of digital documents, which is used in intellectual property protection and information retrieval. It can prevent plagiarism incidents and improve the Internet retrieval efficiency which was a hot point in data security research in recent years.Copy detection for documents is to judge whether the given document plagiarizes contents of other documents in the database, which plagiarism occurs in some ways, such as by duplicating partial or total document contents and using different words or sentences to express the same meaning of the texts of pervious documents in the database.Firstly, this paper introduces background, basic concept, domestic and foreign research situation, application domain and scientific significance of the technology of documents copy detection. Then it analyses the functions and characteristics of the existing system, and explores the technologies and the characteristics of the technologies to build a system, such as XML, ASP.NET, ADO.NET and SQL Server. The idea is proposed to establish computer-aided detection system for documents copy based on the B/S three tiers of the concept.Secondly, this paper has designed the copy computer-aided detection system's architecture and the database, the user registration the module, the documents upload module, the documents detection module, the system administration module.This system uses SQL Server 2005 as the database server, use XML to express the document file, use the ADO.NET module of ASP.NET to visit database, use Internet Information Server 5.1 as the Web server, uses C# to compile the procedure of web server, the client side visits this system with the web browser.Again, this paper introduces each concrete functional module realization of this system in detail, including the user registration, uploading for the documents, detection for the documents, documents management for the users, system's basic establishment management, the user management, the system's documents management.Above all, the computer-aided copy detection system for Chinese and English documents based on sentence is designed and implemented, the online documents copy computer-aided detection service is provided for the user, a lot of experiments are taken and it testifies that the system is so practical that it will greatly favor the users in the future.
Keywords/Search Tags:copy detection, text blocks, similarity, ASP.NET
PDF Full Text Request
Related items