Improving The Accuracy Of Ozone Prediction Based On Machine Learning In China

Posted on:2024-04-28

Degree:Master

Type:Thesis

Country:China

Candidate:K L Xiong

Full Text:PDF

GTID:2531307106475394

Subject:Resources and environment

Abstract/Summary:

PDF Full Text Request

Severe near-surface ozone（O₃）pollution poses a significant threat to residential health,ecosystems,climate change,vegetation and buildings.Accurate O₃ predictions can better assess its impact on public health and help develop effective prevention and control measures.Data from ground-based stations are the most accurate,but the number of ground-based stations is small and unevenly distributed.The simulations from the air quality model provide complete spatial and temporal coverage,but there are large bias between the simulations and the observations.Firstly,the bias and influencing factors of the Community Air Quality Model（CMAQ）simulations were analysed and a bias correction model was constructed based on the Random Forest（RF）algorithm.The RF model successfully captured the non-linear relationship between O₃ and its influencing factors.The standard mean bias（NMB）of hourly O₃concentration（O₃-1h）,the daily maximum 8h O₃（O₃-Max8h）and the daily maximum 1h O₃（O₃-Max1h）decreased from 15.8%,20%and 17%to-0.5%,0.8%and 0.1%,respectively,and the correlation coefficient（R）improved from 0.78,0.90 and 0.89 to 0.94,0.95 and 0.94,respectively.The causes of the bias in CMAQ simulated O₃ were also explored.For O₃-1h,the bias of nitrogen dioxide（NO₂）may be the main cause.For O₃-Max8h and O₃-Max1h,the observations are the main cause of the bias.Two multi-source data prediction models based on the Light GBM algorithm were then constructed to improve the accuracy of the CMAQ model for O₃-Max8h.The first model uses pollutant concentrations simulated by CMAQ,meteorological data simulated by the Weather Research and Forecasting Mode（WRF）and latitude and longitude data as input variables（named LGBR）,while the other model uses the same setup but uses the O₃-Max8h provided by the China High Air Pollutants（CHAP）dataset as an additional input variable（named LGBR＿CHAP）.The results showed that the root mean square error（RMSE）and mean bias（MB）of the LGBR model（LGBR＿CHAP model）were reduced by 3.15μg/m³ and 2.07μg/m³ at the daily scale（5.61μg/m³ and 4.18μg/m³）,respectively,compared to the original CMAQ model.At the monthly scale,the R of the CMAQ model was improved from 0.2 to 0.91 to 0.4 to 0.92（0.5 to 0.94）after optimization of the LGBR（LGBR＿CHAP）model.Spatially,the O₃-Max8h simulated by the CMAQ model performed better in East China but worse in West China.After optimisation of the LGBR and LGBR＿CHAP models,the CMAQ model national station-averaged R improves from 0.77 to0.83 and 0.88,respectively.the LGBR and LGBR＿CHAP models have successfully captured the spatial and temporal patterns of O₃-Max8h.Overall,both the LGBR and LGBR＿CHAP models perform better than the original CAMQ model,but the LGBR＿CHAP model has better predictive power than the LGBR model.Therefore,the LGBR＿CHAP model was used to predict O₃-Max8h for the whole country.the LGBR＿CHAP model successfully predicted O₃-Max8h data with high resolution（10km×10km）and full coverage（100%）.

Keywords/Search Tags:

Machine learning algorithms, Air quality models, Ozone, Spatiotemporal distribution

PDF Full Text Request

Related items

1	Application Of SVR Nonlinear Integrated Models Based On Different Intelligent Optimization Algorithms For Ozone Prediction
2	Study On Distribution Of Ground-level Ozone In China Based On Machine Learning Approaches
3	PM_2.5 Spatiotemporal Distribution Using Improved LUR Model And Its Relationship Between Land Use
4	Study Of Statistical Methods Based On Machine Learning Algorithms For Air Quality Forecasting In Lanzhou
5	Remote Sensing Estimation And Spatiotemporal Characteristics Analysis Of Near-surface Ozone Concentration In China Based On Ensemble Learning
6	Spatiotemporal Evolution And Health Risk Assessment Of Ozone Pollution In China
7	Research On Machine Learning Algorithms For Tea Blending Process
8	Research On The Spatiotemporal Distribution Characteristics And Influencing Factors Of Ozone Pollution In China
9	Comparison Of Inversion Results On Water Color Remote Sensing Based On Traditional Empirical Models And Machine Learning Models
10	Chemical Process Monitoring Method Based On Spatiotemporal Sequence Predictive Neural Networks