Font Size: a A A

The Research And Realization Of Data Quality Control In Citation Search

Posted on:2013-06-30Degree:MasterType:Thesis
Country:ChinaCandidate:D HaoFull Text:PDF
GTID:2268330398998793Subject:Management Science and Engineering
Abstract/Summary:PDF Full Text Request
As a proof of representing actual strength of experts or team provided by authority organization, citation reports is important for scientific researchers and scientific research teams in title evaluation, award declaration and applying funds. Its accuracy is not only the most direct supportive for experts or teams, but for also related to the justice and judgment of auditing process in project application, award declaration and title evaluation etc. However, in the citation searching, the quality of retrieval data waiting for being searched directly determines the accuracy and reliability of search results, the quality of which has an effect on authorization of citation reports:that is nonstandard data providing and result processing not only leading to searching without result or citation records with incomplete, but also costs a great lot of manpower, materials&financials to reducing the omission and correcting the error. Therefore, data quality control in citation search has especially important meaning and the research of data quality control in citation search is essential.On the analysis of artificial searching flowchart, many data quality problems in automatic process of citation searching were discussed and critical data were mainly studied, including retrieval data supported by users and waiting for being searched、 the results of citation searching and data quality control problems. Finally, our data quality control methods were validated in theory and practice through by setting up data quality control measures which adapted to automatic process of citation searching. The main contents included are as follows:According to definition of data quality, the control standard in citation searching was defined based on the existed content of data quality.According to the automation of citation searching, an automatic searching flowchart suitable for citation searching was designed, which reference for artificial searching flowchart and data quality control standard. There are three parts in this automatic searching flowchart, that are automatic in embody searching, automatic in citation searching and citation information catching, automatic in report generation.According to data quality control in citation searching, through analyzing various data quality problems and their effects that may confronted in automatic searching flowchart of citation reports, lots of control Measures were adopted, such as, which guarantee the high quality of source data used in producing citation reports.According to validity problem of data quality control measures, automation software of citation searching was designed and implemented,meantime,four aspects tests were done including integration testing、merging module testing、errorcited confirmed module testing and self-citation/other-citation module testing, which prove the values of data quality control methods in this paper in theory and practice.According to prospects of data quality control, some ideas of perfecting and expanding a step further were put forward in the end of this paper, which we hopes may bring some new insight in other novelty search process.
Keywords/Search Tags:Data Quality, Data Quality Control, Citation Searching
PDF Full Text Request
Related items