Study On TOC Prediction Method Of Shale Based On Machine Learning

Posted on:2024-05-02

Degree:Master

Type:Thesis

Country:China

Candidate:J T Sun

Full Text:PDF

GTID:2530306920462514

Subject:Geological engineering

Abstract/Summary:

Total organic carbon(TOC)content is a key parameter for screening potential source rocks and sweet spots of shale oil and gas.The traditional methods of measuring and predicting TOC content in shale,such as carbon and sulfur analysis experiment,have some problems,such as high prediction cost,long time consuming and discontinuous results,while the empirical mathematical model prediction rule has some problems,such as general prediction accuracy,poor generalization and weak applicability.How to predict TOC content with low cost,high efficiency and high precision has become an important task in shale oil and gas exploration and development.As a key means of big data analysis and mining,machine learning has been applied in many fields,such as biology,chemical industry,medicine,transportation,finance,industrial manufacturing and so on,and has achieved good application effects.However,in the field of oil and gas exploration and development,the application of machine learning is still very short,and the application effect is not clear.In this paper,multiple types of logging data and known TOC content data are taken as the breakthrough point,Random forest(RF),support vector regression(SVR)and XGBoost machine learning algorithm are used to establish TOC content prediction model,to realize the continuity and high precision prediction of TOC content in shale,and the prediction performance is compared systematically.First,a decision tree algorithm is used to determine the optimal set of logging parameters from a total of 15 commonly used logging features.Three machine learning algorithms,including Random Forest(RF),support vector regression(SVR)and XGBoost,were used for hyperparameter optimization,and then trained and tested to build a predictive TOC mode A total of 816 data points of well logs and TOC content from five different shale formations were then used to train and test the three models.Finally,these three models are used to predict TOC content data in Shahejie shale that models have not seen before.The results showed that RF had the best prediction effect on TOC content,R~2=0.9141,RMSE=0.329,MAE=0.252,followed by XGBoost,and SVR had the lowest prediction accuracy.Nevertheless,these three models outperform the traditional Schmoker gamma ray logarithm method,multiple linear regression method andΔlgR method,which verify the reliability of the above three machine learning methods.

Keywords/Search Tags:

TOC, Random forest, Support vector machine, XGBoost, Organic-rich shale

Related items

1	Research On Prediction Method Of Total Organic Carbon In Shale Based On Machine Learning
2	Qualitative Logging Identification Of Water-flooded Zones In F Block Based On Random Forest
3	Stock Price Prediction Based On Investor Sentimen
4	Application Of Artificial Intelligence In Navigation Positioning
5	Evaluation Model Of Random Forest And Support Vector Machine For Landslide Prone Along Mountain Road
6	Eukaryotic Gene Promoter Recognition Based On Optimized Support Vector Machine
7	Prediction Research Of Protein-Protein Interaction Based On Ensemble Of Support Vector Machine And Random Forest
8	Research On Multi-Factor Quantitative Stock Selection Model Based On Random Forest-Support Vector Machine Algorithm
9	Personal Credit Evaluation Model Based On SVM And XGBoost
10	The Application Of Machine Learning Models To A Share