Study On Information Extraction Of AQSIQ Internet Public Opinion Supervisory System

Posted on: 2012-10-04
Degree: Master
Type: Thesis
Country: China
Candidate: H N Tian
GTID: 2178330335960427
Subject: Computer Science and Technology
Abstract/Summary:
This thesis designs and implements the information extraction component of the Internet Public Opinion Supervisory System of the General Administration of Quality Supervision, Inspection and Quarantine of the People's Republic of China (AQSIQ). The system meets users' needs for comprehensive monitoring of public opinion information and provides intelligent, personalized, and diversified monitoring services to all users. The main work completed is as follows:
(1) Requirements analysis and system design. The system is divided into four levels: information extraction, user, database, and system administration. The information extraction level processes web page information; the user level provides system services to users; the database level stores and maintains the data those services need; and the system administration level manages keywords, websites, and related settings.
(2) Web page processing. The thesis uses web page preprocessing and web page analysis techniques to process pages. The HtmlParser tool and regular expressions remove redundant information from each page. A main text extraction algorithm based on text blocks then analyzes the page document and extracts the title, abstract, and content of the page. Keywords and websites are extracted by regular expression matching of the extracted content against the data stored in the database. The system management module is implemented with JSP, JavaScript, and Ajax.
(3) Testing. The thesis designed test cases for the system and completed the tests.
The test results show that the system's functions basically meet users' needs and that the system is stable and usable. The thesis studies information extraction and system management for the Internet Public Opinion Supervisory System in depth; it implements extraction of the main text, abstract, keywords, and websites of web pages, together with system management, and provides users with a more complete information extraction service. The research results have some theoretical significance and practical value.
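The extraction steps described in the abstract (removing redundant markup, locating the main text by text blocks, and matching stored keywords with regular expressions) can be illustrated with a minimal sketch. This is not the thesis's implementation: the thesis used the Java HtmlParser tool, while the sketch below uses only Python's standard `re` module. The function names, the `block_size` and `threshold` parameters, the rule for taking the abstract as the first characters of the content, and the keyword list standing in for database rows are all assumptions made for illustration.

```python
import re


def strip_markup(html):
    """Drop scripts, styles, comments, and tags; return one stripped text line per source line."""
    html = re.sub(r"(?is)<(script|style)\b.*?</\1\s*>", "", html)
    html = re.sub(r"(?s)<!--.*?-->", "", html)
    html = re.sub(r"<[^>]+>", "", html)
    return [line.strip() for line in html.splitlines()]


def extract_main_text(lines, block_size=3, threshold=40):
    """Return the longest run of lines whose surrounding text blocks are dense.

    block_len[i] is the character count of lines i .. i+block_size-1.
    A region starts where block_len reaches `threshold` and ends where it
    falls to zero. Both parameters are assumed values, not from the thesis.
    """
    block_len = [sum(len(s) for s in lines[i:i + block_size])
                 for i in range(len(lines))]

    def size(region):
        return sum(len(s) for s in region)

    best, current = [], []
    for i, line in enumerate(lines):
        if current:
            if block_len[i] == 0:        # text density dropped to zero
                if size(current) > size(best):
                    best = current
                current = []
            else:
                current.append(line)
        elif block_len[i] >= threshold:  # dense block begins here
            current = [line]
    if size(current) > size(best):       # region that runs to the last line
        best = current
    return "\n".join(s for s in best if s)


def match_keywords(text, keywords):
    """Return the keywords (e.g. rows loaded from a database) that occur in text."""
    return [k for k in keywords
            if re.search(re.escape(k), text, re.IGNORECASE)]


def process_page(html, keywords, abstract_len=60):
    """Extract title, content, a naive abstract, and matched keywords from one page."""
    title = re.search(r"(?is)<title[^>]*>(.*?)</title>", html)
    content = extract_main_text(strip_markup(html))
    return {
        "title": title.group(1).strip() if title else "",
        "content": content,
        "abstract": content[:abstract_len],  # assumption: leading characters
        "keywords": match_keywords(content, keywords),
    }
```

Calling `process_page(html, ["milk powder", "recall"])` on a fetched page returns a dictionary with the four extracted fields. The block-based step relies on the observation that navigation links and footers form short, sparse lines, while article text forms a dense run of long lines; real pages would likely need tuned parameters and a proper HTML parser rather than regular expressions alone.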
Keywords/Search Tags:public opinion supervisory, information processing, information extraction, system management