Font Size: a A A

Rules And Relevance Based Twitter Comment Spam Detection System And Implementation

Posted on:2015-04-08Degree:MasterType:Thesis
Country:ChinaCandidate:X Y LiuFull Text:PDF
GTID:2308330482456949Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Micro-blog use has become a very important source of information or work or entertainment or promotion or looking for in life, with the use micro-blog expansion micro-blog comments more exciting, even its brilliant reply as a bright spot. Read the comments to find bright spot has become a new interest in modern times. Rubbish micro-blog comments make comments on the image greatly. The computer is so advanced today this problem is of course not beat the computer up to the people. The corresponding computer program to solve such problems also appeared. In summary comment spam rules and then the correlation judgment trend by micro-blog Comments classification system of B/S structure of the development of the web also produced. Sina micro-blog is especially development platform in micro-blog several big platform, API interface technology is mature to become a lot of people love most.The system consists of sina micro-blog data platform API interface to download the micro-blog comment data as basic data as the experimental samples. Garbage micro-blog review classification system developed in the import. Set the initial sample set parameters. The data as a classification system through the sample database filtration formation classification, at the same time classification review enhances the training database of maturity. The use of the neural network and the theory of data mining in the filtering process. Through long-term summary of the rules to judge the comment classification. The characteristic of this system in the system to produce the comment spam also mining the formation of new information to build this system through correlation and data sample library. This process is known as the training sample library. When the sample library approach maturity classification results with tightening required classification results. The program also involves can be switched manually selected features because the system is advanced and the need for human intervention, is the so-called artificial intelligence and artificial are inseparable, the system log function for the normal operation of the system to escort.The system development language Java, realize the webpage interface using JSP technology. Java is the biggest advantage of cross platform ability system is stable. In micro-blog review classification system for large data processing in a stable code platform is rigid premise. Using JS and CSS to beautify the page effect. SQL Server 2000 for data storage database, SQL Server 2000 database technology is mature, patch perfect. The system uses the BS structure is rapidly developing today through a WEB browser can use system reduces the installation steps and can be used in the network whenever and wherever possible.
Keywords/Search Tags:micro-blog, micro-blog review, classification, rules, related degree
PDF Full Text Request
Related items