Font Size: a A A

Design And Implementation On The Quality Assessment System Of Uncertain Data

Posted on:2015-10-31Degree:MasterType:Thesis
Country:ChinaCandidate:C Q ZhaoFull Text:PDF
GTID:2308330482452687Subject:Computer technology
Abstract/Summary:PDF Full Text Request
Uncertain data is widely used in RFID network, market analysis and other applications. Although the data is very rich, the information is scarce. The reason is the lack of effective data analysis and data of poor quality, such as data duplication, deletion of data, inconsistent of data, incomplete data and so on, which causes the problem, data can not be effectively utilized. The data quality assessment is a basic condition for solving data quality problems. At present, although the data quality assessment has gaind widespread attention and research of scholars, how to assess the quality of uncertain data effectively needs further researchThis thesis describes a practical application of uncertain data analysis-the quality assessment system of uncertain data. The system uses JAVA language for development and uses MyEclipse as the development tool; development of database uses standard SQL language and MySQL as the database management system. System includes user management, data document management, log management, data quality assessment and interface module. The system realizes the effective management of user, data document and log, realizes the assessment of uncertain density, answer decisiveness, duplication and completement dimension of uncertain data, and forms the log of data quality assessment. The storage of the data involved in this system uses the Hibernate framework of MyEclipse.The system uses the hql statement to operate the databases, In the process of duplication assessment, the block technique is used to improve the efficiency of dumplication assessment.In this thesis, we first introduce the status and related content of uncertain data research. Then, in order to clear the function and feasibility of the system, we analyze the system demands from three aspects:demand, feasibility and data sources. Then, we carry out the overall design and module design of the system. Because the design of some modules in the system is complex and there are many operations, we carry out a detail design and implementation for the system. According to the test, we assess fuctions and performances of the system. Finally, we summarizes the characteristics and shortcomings, and pointed out the direction of future work...
Keywords/Search Tags:uncertain data, answer decisiveness, integrity, duplicate, uncertain density
PDF Full Text Request
Related items