Font Size: a A A

Research On The Factors Affecting The Quality Of Network Survey Data In Big Data Environment

Posted on:2018-01-11Degree:MasterType:Thesis
Country:ChinaCandidate:C ZhangFull Text:PDF
GTID:2358330515478775Subject:Library and Information Science
Abstract/Summary:PDF Full Text Request
Network survey is an emerging approach in the survey industry and is a means of investigating specific network users.These groups often have a high level of information collection and payment.They are a combination of traditional survey techniques and modern network technologies,product.With the development of the Internet,the popularity of the network continues to improve,but also in the update and development at the same time,the use of network surveys to collect data is also more and more widely,survey applications are also innovative.However,due to the openness of the network itself,non-security,there are many new problems such as network coverage and network control.These problems make it impossible for investigators to guarantee the quality of network survey data and become an obstacle to control the quality of network survey data.Based on the research results of quality management theory and data error theory,this paper combines the technical and non-technical factors to carry out comprehensive quantitative research.According to the research status quo at home and abroad,combined with the characteristics of network survey and large data,the factors influencing the quality of network survey data are preliminarily analyzed.From the technical and non-technical aspects,the paper analyzes the situation of the network investigation work.The problem is analyzed in depth,and the concrete factors such as accuracy,timeliness,linkability,comprehensibility,acquisition and validity are obtained,and 13 specific influencing factors are obtained.Combined with the data quality management theory,from the accuracy,timeliness,applicability of the three dimensions of indicators to reflect the data quality standards.The explanatory variables and explanatory variables of the data model of the network survey are determined,and the index system of the whole model is constructed.Through a variety of technical means,access to a large number of network survey data,the use of large data analysis methods,the data for sorting,processing,classification,quantification,and then,according to the established model of the relevant factors and variables to standardize Processing,and established a multiple linear regression model.Using R language programming and calculation,the data for statistical analysis.The results show that the four factors influencing the quality of the network survey data are the key factors in the quality of the network survey data,and then the results show that the sample size,geographical distribution dispersion,the difficulty degree of the questionnaire and whether or not to obtain the complete investigation data are the key factors influencing the quality of the network survey data.Combined with these four factors,in the actual investigation work,to carry out the investigation of the investigators,not only to strengthen the training of the investigation,improve their work experience and knowledge reserves,but also should continue to update the survey The means and methods,and constantly in the investigation work to innovate,can guarantee that in the continuous development of science and technology,data quality can continue to improve.
Keywords/Search Tags:network survey, data quality, influencing factors, multiple linear regression
PDF Full Text Request
Related items