Leveraging Machine Learning For Lithology Discrimination
[Figure: Lift Curve (Siltstone). Vertical axis: Lift; horizontal axis: Percentage of Sample (0% to 100%).]
Abstract:
This report presents a comprehensive study that employs various machine-learning techniques to develop a discriminative
model for lithology classification. The study utilized a substantial dataset containing key petrophysical attributes, including
PE, DT, GR, NPHI, RHOB, RT10_l10, RT20_l10, RT30_l10, RT60_l10, and RT90_l10, to distinguish between five distinct
Introduction
The classification of lithologies in the field of geology and petrophysics is a fundamental task that has traditionally relied on
expert interpretation. However, the application of machine learning and artificial intelligence techniques has shown
promising results in automating and enhancing this process. In this study, we explore the application of various machine-
1. Logistic Regression
2. Random Forests
3. K Nearest Neighbors
4. Support Vector Machine (SVM)
5. Discriminant Analysis
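To make one of the listed techniques concrete, the following is a minimal sketch of K Nearest Neighbors classification in plain Python. It is not the implementation used in the study: the function name `knn_predict`, the two-feature layout, the numeric values, and the "Sandstone" label are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Euclidean distance from the query to every training sample
    distances = [(math.dist(x, query), label) for x, label in zip(train_X, train_y)]
    distances.sort(key=lambda pair: pair[0])
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Hypothetical, already-scaled two-feature samples (for example GR and RHOB).
# These numbers are illustrative and are NOT taken from the study's dataset.
train_X = [(0.90, 0.20), (0.80, 0.30), (0.85, 0.25),
           (0.10, 0.70), (0.20, 0.80), (0.15, 0.75)]
train_y = ["Siltstone", "Siltstone", "Siltstone",
           "Sandstone", "Sandstone", "Sandstone"]

predicted = knn_predict(train_X, train_y, query=(0.82, 0.28), k=3)
print(predicted)
```

Because the method is distance based, logs with very different numeric ranges would normally be scaled before use; whether the study did so is not stated.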
values (RT10_l10, RT20_l10, RT30_l10, RT60_l10, RT90_l10). The data was randomly split into two samples: 80% of the
observations were used for model training, and the remaining 20% were reserved for model validation.
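The random 80/20 split described above can be sketched as follows. This is a minimal illustration under stated assumptions: the helper name `train_validation_split`, the fixed seed, and the placeholder row indices are hypothetical, and the tool actually used for the split in the study is not stated.

```python
import random

def train_validation_split(rows, train_fraction=0.8, seed=0):
    """Shuffle the rows and split them into training and validation samples."""
    shuffled = list(rows)                      # copy so the input is not modified
    random.Random(seed).shuffle(shuffled)      # seeded shuffle for reproducibility
    cut = int(round(len(shuffled) * train_fraction))
    return shuffled[:cut], shuffled[cut:]

# Placeholder observations: row indices standing in for well-log records
rows = list(range(1000))
train_rows, validation_rows = train_validation_split(rows)
print(len(train_rows), len(validation_rows))
```

Fixing the seed makes the split repeatable, which helps when several models are compared on the same validation sample.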
Results:
Five machine learning techniques were applied to the lithological dataset. Accuracy in the summary table ranged from 0.372
(Discriminant Analysis) to 1.000 (Random Forests), with Logistic Regression (0.995) and K Nearest Neighbors (0.977) also
distinguishing the various lithologies reliably. Clear patterns emerged, indicating the potential benefits of artificial
Discussion:
The results indicate that the Random Forests model outperforms the other techniques, achieving an accuracy of 1.000 with
no misclassified observations (0 of 3,601). This suggests that Random Forests are well-suited for lithology discrimination based on
Forests, in particular, demonstrate the capability to automate the classification of lithologies with a high degree of accuracy.
As the field of petrophysics and geology continues to evolve, these techniques offer a valuable tool for efficient lithological
analysis.
Some obtained results
Dataset for training
Dataset for testing
Modeling Results
Variable Observations Obs. with missing data Obs. without missing data Minimum Maximum Mean Std. deviation
Performance metrics     Logistic regression  Random forests  K Nearest Neighbors  SVM    Discriminant Analysis
Accuracy                0.995                1.000           0.977                0.690  0.372
Precision               0.954                1.000           0.784                0.471  0.198
Recall                  0.956                1.000           0.700                0.440  0.197
Correct classification  3583                 3601            3519                 2486   1341
Misclassification       18                   0               82                   1115   2260
F-score                 0.955                1.000           0.739                0.455  0.198
Summary table: The best model, according to the Misclassification statistic computed on the validation sample, is Random
forests (0 misclassified observations); it is colored in green in the original table.
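The relationship between the rows of the summary table can be checked with a short sketch. It assumes that accuracy is the share of correctly classified observations and that the F-score is the harmonic mean of the reported precision and recall. The report does not state how precision and recall were averaged across lithologies, so recomputing from the rounded values may differ from the table in the last digit. The function names are hypothetical.

```python
def accuracy(correct, misclassified):
    """Share of observations that were classified correctly."""
    return correct / (correct + misclassified)

def f_score(precision, recall):
    """Harmonic mean of precision and recall (F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# (correct, misclassified) counts copied from the summary table
counts = {
    "Logistic regression": (3583, 18),
    "Random forests": (3601, 0),
    "K Nearest Neighbors": (3519, 82),
    "SVM": (2486, 1115),
    "Discriminant Analysis": (1341, 2260),
}

accuracies = {name: round(accuracy(c, m), 3) for name, (c, m) in counts.items()}

# Best model = fewest misclassified observations
best_model = min(counts, key=lambda name: counts[name][1])

# F-score recomputed from the reported precision and recall of logistic regression
logistic_f = round(f_score(0.954, 0.956), 3)

print(accuracies)
print(best_model, logistic_f)
```

Every model's counts sum to 3,601 observations, which is the sample the accuracies are computed on.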
Logistic regression
Classification for the training sample (Variable Lithology):
Classification for the validation sample (Variable Lithology):
Random forests
Misclassification rate:
hesitate to contact us via email. Additionally, if you would like to receive the Excel
sheet containing the results we obtained, kindly provide us with your email address, and we will promptly send it to you.