| Environmental big data management is an important management measure for environmental protection organizations to do a good job in business supervision,which is of great significance to environmental protection business growth and environmental pollution prevention and control.As the central content of data management,environmental protection big data platform will continue to provide data driving force for environmental protection business.Some enterprises with pollution sources are involved in more pollution emissions,and the emission situation is urgently concerned by the society and the state.In the key prevention and control of pollution sources,a large number of old monitoring data and real-time monitoring data are accumulated.These data are integrated in the later stage.How to effectively process,statistically analyze and mine the data rules,summarize the data situation for the pollution source supervisors to check and learn from,and show the pollution situation and the improvement effect of the environmental protection measures on the pollution situation,Ultimately,it is of great significance to the country,enterprises and the public.This paper describes the definition of big data,data processing definition,data maturity model definition and data quality definition,as well as the data of environmental protection business,including some pollution source emission enterprises,pollution source treatment enterprises,pollution source emission data,environmental monitoring data,environmental laws and regulations,environmental standard documents,etc.In recent years,domestic air pollution has attracted much attention.People pay more attention to the weather conditions such as haze.Haze affects people’s travel,work and daily activities.Therefore,this paper studies more on air data quality.Secondly,the environmental protection big data quality evaluation system is also introduced,including data maturity model management,data quality management,data evaluation system,data effectiveness and other key points.For the abnormal data,incomplete data and duplicate data that often appear in data quality,this paper puts forward the solutions such as average method and minimum limit substitution method.For the quality of monitoring data required by monitoring in different cycles,it is necessary to judge the effectiveness,and the data within the overall time period can be used only when 75%of the data is valid. |