Applsci 12 11704 v2
Applsci 12 11704 v2
sciences
Article
Drilling Parameters Optimization for Horizontal Wells Based
on a Multiobjective Genetic Algorithm to Improve the Rate of
Penetration and Reduce Drill String Drag
Chuanzhen Zang 1 , Zongyu Lu 1 , Shanlin Ye 2 , Xinniu Xu 1 , Chuanming Xi 1 , Xianzhi Song 2, *, Yong Guo 1
and Tao Pan 2
1 Research Institute of Engineering Technology, PetroChina Xinjiang Oilfield Company, Karamay 834002, China
2 School of Petroleum Engineering, China University of Petroleum (Beijing), Beijing 102249, China
* Correspondence: [email protected]; Tel.: +86-010-89732176
Abstract: With the development of China’s oil and gas exploration and development to complex oil
and gas fields, the drilling efficiency and safety of complex formations with large hardness and strong
abrasiveness have become increasingly significant. Optimizing drilling parameters is an effective
means to increase the rate of penetration (ROP) and improve drilling efficiency. However, traditional
drilling parameter optimization methods with only a single objective of increasing the ROP lack
consideration of the drill string’s drag which may also be increased when drilling parameters change.
When drilling a horizontal well, increased drag can reduce drilling efficiency. Aiming at this problem,
this paper uses the logging data of the oil field as the data source, establishes an intelligent ROP
prediction model through the random forest algorithm, and calculates the string drag using the
“hard-string” model. Finally, the nondominant sorting genetic algorithm-II (NSGA-II), which is a
domination-based multiobjective optimization algorithm, is used to optimize the drilling parameters
Citation: Zang, C.; Lu, Z.; Ye, S.; Xu, to increase the ROP and reduce the drag at the same time. The optimized drilling parameters guide
X.; Xi, C.; Song, X.; Guo, Y.; Pan, T. the drilling operations. We used the proposed method to optimize the parameters during the drilling
Drilling Parameters Optimization for of a new horizontal well. The results show that the ROP of the horizontal section of the new well
Horizontal Wells Based on a increases by 10.3%, and the drag reduces by 4.5% on average compared with the adjacent well.
Multiobjective Genetic Algorithm to
Improve the Rate of Penetration and Keywords: drilling parameters optimization; big data; ROP; string drag
Reduce Drill String Drag. Appl. Sci.
2022, 12, 11704. https://doi.org/
10.3390/app122211704
optimization objectives at the same time for the synergistic optimization of drilling pa-
rameters, as shown in Table 1. These works achieved better results than single objective
optimization. It can be seen that the method of multiobjective optimization of drilling
parameters considering the restriction effect between objectives is more scientific.
Work 1: Merge and clean data from different sources, including mud logging, well log-
ging, and bit parameters data. Clean data can improve the prediction accuracy of models;
Work 2: Establish the ROP prediction models based on SVM, random forest, and BP
neural network algorithms, and compare the prediction effect of the models;
Work 3: Establish drag calculation model by “hard-string” drag calculation model;
Work 4: The ROP prediction model and drag calculation model are used as the
objective functions, and the weight on bit (WOB), revolutions per minute (RPM), and pump
flow rate (GPM) are optimized by NSGA-II multiobjective optimization algorithm.
2.1. Data
ROP is the result of engineering factors and formation factors. To make our intelligent
ROP prediction model take into account engineering and formation factors, we collected
bit data, mud logging data, and well logging data from six wells in the oil field and then
merged the data existing in different documents into a data table by depth. The table
contains a total of 22,317 pieces of data, and each piece contains 15 features, as shown
in Table 2.
With the digital transformation of the oil field, MWD and other measurement tools
are widely used. The oil field recorded and stored a large amount of drilling and logging
data. Nowadays, many data-driven ROP models have been proposed. Researchers used
the random forest regression method (RF) [20,21], gradient boosting tree [22], BP neural
network [23–25], Support Vector Machine Regression (SVM) [26,27], and other algorithms
to establish the ROP prediction models. These ROP prediction models have higher accuracy
and timeliness than traditional ROP models.
Appl. Sci. 2022, 12, 11704 4 of 14
In this work, we establish three drill-rate prediction models based on SVM, RF, and BP
neural network algorithms and compare their prediction accuracy within the same dataset.
The principles of the three algorithms are below.
(1) SVM algorithm
Support vector machine (SVM) is a machine learning method based on statistical
learning theory. For regression problems, conventional regression models usually calculate
the loss directly on the basis of the difference between the predicted values of the model
and the true values, and the loss is 0 if and only if the predicted value of the model is
exactly equal to the ground truth. The loss is 0 only if the predicted value of the model is
exactly equal to the true values. However, the SVM regression model assumes that an error
ε between the predicted value and the true value can be tolerated. The loss is calculated
only if the absolute value of the difference between the predicted value and the true value
Appl. Sci. 2022, 12, x FOR PEER REVIEW 5 of 15
is greater than 2ε, as shown in Figure 2. The training problem of the SVR model is an
optimization problem under a given objective function and constraint conditions.
The constraint
The constraint is as follows:
is as follows:
i i i
|| f ( xi||) f||(xiy)||− y = ε + ξi (2) (2)
where w is weight,
where b is the
w is weight, b isbias, C is the
the bias, C ispenalty factor,
the penalty isε the
factor, error
is the tolerance
error inter-
tolerance interval, and
val, and i slack variable.
ξ i is the is the slack variable.
(2) (2) Random
Random forest forest algorithm
algorithm
Random forestforest
Random is a classical machine
is a classical learninglearning
machine algorithm that has been
algorithm widely
that has beenused in used in
widely
the industry since it
the industry wasitfirst
since wasproposed by Breiman
first proposed in 2001 [28].
by Breiman In this
in 2001 algorithm,
[28]. decision decision
In this algorithm,
treestrees
are selected as weak
are selected as learners. Decision
weak learners. trees cantrees
Decision be classified into classification
can be classified trees
into classification trees
and regression trees. In this paper, we established the ROP regression prediction model,
and regression trees. In this paper, we established the ROP regression prediction model, so
so the regression decision tree is selected as the weak learner. The schematic diagram of
the regression decision tree is selected as the weak learner. The schematic diagram of the
the random forest algorithm is shown in Figure 3.
random forest algorithm is shown in Figure 3.
(2) Random forest algorithm
Random forest is a classical machine learning algorithm that has been widely used in
the industry since it was first proposed by Breiman in 2001 [28]. In this algorithm, decision
trees are selected as weak learners. Decision trees can be classified into classification trees
and regression trees. In this paper, we established the ROP regression prediction model,
Appl. Sci. 2022, 12, 11704 5 of 14
so the regression decision tree is selected as the weak learner. The schematic diagram of
the random forest algorithm is shown in Figure 3.
The
Figure3.3.The
Figure random
random forest
forest algorithm.
algorithm.
When training the binary decision tree model, we need to consider how to select
the cut variables and cut points and how to measure the goodness of a cut variable and
cut point. The impurity of the cut nodes is generally used to measure the goodness. In
Breiman’s work (Equations (3) and (4)) [28], the impurity is calculated as follows:
nl nr
G ( xi , vi ) = H ( Xl ) + H ( Xr ) (3)
Ns Ns
where xi is a cut variable, vi is a cut point, nl , nr , and NS are the number of samples of the
left child node, the number of samples of the right child node, and the number of samples
of the parent node, Xl , Xr is the set of samples of the left and right child nodes, and H(X) is
the impurity function. The regression task generally uses the squared mean error (MSE)
as the impurity function. In this study, the random forest algorithm was used to establish
the ROP regression model, so MSE was chosen as the impurity function, and the impurity
formula was calculated as follows:
1
H ( Xm ) =
Nm ∑i∈ Nm (y − y)2 (4)
d(− F ) dk
= − EIk b b − qcosα ∓ µ1 ·µt (5)
ds ds
dM Do
= µ1 · µ t · (6)
ds 2
where in the “∓”, the “−” stands for pulling up the string, the ‘’+” stands for pressing
down the string; F is the axial pressure on the drill string N, s is the depth of the well, m; q
is the gravity of the drill string per unit length, N/m; α is the inclination angle of the well,
rad; EI is the bending stiffness of the drill string, N m2 ; nt is the contact drag force between
the drill string and the well wall, N/m; µ1 is the axial drag coefficient of the drill string; k b
is the wellbore axis curvature, m−1 .
The borehole curvature k b is calculated as:
s
2 2
dα dϕ
Kb = + sin2 α (7)
ds ds
d2 k b q dα
A = EI 2
+ k b F − k n (−k b MT + EI ·k b k n ) + sinα (9)
ds k b ds
d dk q dϕ 2
B= (−k b MT + EI ·k b k n ) + EI ·k n b − sin α (10)
ds ds k b ds
!
sinα dα d2 ϕ dϕ d2 α 1 dα 2
dϕ
kn = 2 2
− 2
+ cosα· 2 +1 (11)
kb ds ds ds ds k b ds ds
We use the finite difference method to solve for the forces on the entire drill string.
The drill string can be divided into several small drill string sections, as shown in Figure 5.
The forces on the small drill string sections can be obtained by the following equations,
with the pressure as positive:
1 2 2
Fi = Fi+1 + EIi (Kbi
2
− Kbi +1 ) + (− qi cosαi ∓ µ1 · µti ) ∆si (12)
sorting. The crowding distance is assigned to the individuals belonging to the same
nondominant layer. According to the crowding distance value between individuals, N
individuals are selected to form a new population. The next generation population M is
obtained by genetic operation. The above process is repeated until the termination9 condition
Appl. Sci. 2022, 12, x FOR PEER REVIEW of 15
is reached. The program flow chart of the NSGA-II algorithm is shown in Figure 6.
The machines that record data in the field take one piece of data every second. The
ground engineering parameters remain in a relatively stable range in a short period of
Figure
Figure 7.7.ROP
ROP data collected
data collectedfromfrom
the oil
thefield.
oilitfield.
time (5 s, 10 s). In such a short time interval, can be considered that the formation has not
changed. When drilling into a uniform formation, the drilling parameters and formation
shouldThe machines
not change muchthat record
in a short data
time, and in
the the
ROPfield
shouldtake one pieceinof
be maintained datarange.
a small every second. T
If abnormal data points occur in a small time window, the data variance will
ground engineering parameters remain in a relatively stable range in a short period increase.
time (5 s, 10 s). In such a short time interval, it can be considered that the formation has n
Appl. Sci. 2022, 12, 11704 9 of 14
The machines that record data in the field take one piece of data every second. The
Appl. Sci. 2022, 12, x FOR PEER REVIEW
ground engineering parameters remain in a relatively stable range in a short period of
time (5 s, 10 s). In such a short time interval, it can be considered that the formation
has not changed. When drilling into a uniform formation, the drilling parameters and
formation should not change much in a short time, and the ROP should be maintained in
a smallBased
range. on
If abnormal data pointswe
this regulation, occur in a small
select a datatime window, the
window withdata variance
a fixed time ste
will increase.
the variance
Based on thisofregulation,
the engineering
we select a data in thewith
data window window. When
a fixed time step new data appear
and calculate
if the
the variance
variance of the data
of the engineering datain
in the window
the window. Whenchanges
new datain a range
appear in the exceeding
window, th
if the variance of the data in the window changes in a range exceeding
new data are judged as abnormal data and vice versa as normal data. Thi the set value, the
new data are judged as abnormal data and vice versa as normal data. This outlier detection
tion process
process was programmed
was programmed in python, andin good
python, andresults
detection good were
detection
achievedresults
on the were a
collected
collected data,
data, as shown
as shown in Figurein8.Figure 8.
Figure 9.
Figure 9. The
ThePearson
Pearsoncorrelation
correlationcoefficient
coefficient between
between each
each parameter.
parameter.
100% n ŷi − yi
n ∑ i =1
MAPE = (15)
yi
Model Prediction: The test set was imported into the trained model, and the predic-
tion results were output.Model Prediction: The test set was imported into the trained model, and the prediction
The prediction results of the test set are shown in Figure 10.
results were output. The prediction results of the test set are shown in Figure 10.
R PEER REVIEW 12 of 15
(a) (b)
(c) (d)
Figure 10. The prediction effect
Figure andprediction
10. The MAPE comparison. (a) comparison.
effect and MAPE The prediction results
(a) The of SVR
prediction model;
results (b)model;
of SVR
The prediction results (b)
of RF
Themodel; (c)results
prediction The ofprediction results
RF model; (c) of BP network
The prediction model;
results of BP (d)
network Comparison
model; (d) Comparison
of models accuracy. of models accuracy.
The random forest ROP model performs best in both test and training sets. Therefore,
The random forest ROP model
we choose performs
to use the random best
forestin both
ROP test andmodel
prediction training sets.
as one Therefore,
of the multiobjective
we choose to use theoptimization
random forest ROP prediction model as one of the multiobjective
functions.
optimization functions.
3.4. Drag Calculation Model
The above Equations (5)–(13) can be used to calculate the drag of the drill string, which
3.4. Drag Calculation isModel
used as another optimization function of the multiobjective optimization algorithm. The
The above Equations
equations(5)–(13) can beusing
were calculated used
theto calculate
finite differencethe dragand
method ofprogrammed
the drill string,
through the
which is used as another optimization function of the multiobjective optimization on
python computer language. Figure 11 shows the calculated drag distribution the drill
algo-
string of well A:
rithm. The equations were calculated using the finite difference method and programmed
through the python computer language. Figure 11 shows the calculated drag distribution
on the drill string of well A:
we choose to use the random forest ROP prediction model as one of the multiobjective
optimization functions.
Figure 11. The calculated drag distribution on the drill string of well A.
Figure 11. The calculated drag distribution on the drill string of well A.
3.5. Multiobjective Collaborative Optimization Model
3.5. Multiobjective Collaborative Optimization Model
The random forest rate of the penetration prediction model and drag calculation
modelThearerandom forest
used as the rate of the
optimization penetration
functions prediction
of NSGA-II, model and
and the maximum ROPdrag
andcalculation model
are used as
minimum the
drag areoptimization functions
used as the optimization of NSGA-II,
objectives. and
Giving the the maximum
optimization ROP and minimum
boundary
of drilling
drag parameters
are used as theand setting the hyperparameters
optimization of NSGA-II,
objectives. Giving the maximum ROP
the optimization boundary of drilling
parameters and setting the hyperparameters of NSGA-II, the maximum ROP and minimum
drag within the parameter boundary can be obtained using NSGA-II, as well as their
corresponding parameter combinations (WOB, RPM, and GPM). The hyperparameters are
selected in Table 5.
We used the model to provide drilling parameters for another neighboring well under
drilling. Table 6 shows the drilling parameters before and after the optimization of the
well’s five test points, as well as the resulting ROP increase and drag reduction. Compared
with the neighboring well, the average ROP in the horizontal section of the new well is
increased by 10.3% on average, and the average drag of the drill string is reduced by 4.5%.
The final test results show that the optimized parameters achieve the purpose of increasing
ROP and reducing drag.
Table 6. The ROP increase and drag reduction effect at test points.
4. Conclusions
In this paper, the random forest algorithm is used to establish a data-driven ROP
prediction model of horizontal wells. The MSE of the training set is 5%, and the test set is
9%, which can accurately predict the ROP of horizontal wells.
The random forest ROP model and the “hard-string” drag calculation model are used
as the optimization function, and the maximum ROP and minimum drag are used as the
optimization objectives. In the drilling process, the NSGA-II multiobjective optimization
model is used to optimize the drilling parameters of the horizontal well. The oil field
experiment results show that the multiobjective optimization model for drilling parameters
is reliable. Compared with the neighboring well, the ROP of the test well is increased
by 10.3%, and the drag is reduced by 4.5% on average. The drilling time is effectively
shortened.
However, the training of the ROP prediction models is purely data driven. Similar to
other machine learning models, the interpretability of the model is low, and the reasons
why the optimized parameters can improve ROP and reduce drill string drag could not
be well explained. Improving the interpretability of data-driven models is an important
direction for future research.
Author Contributions: Conceptualization, C.Z.; methodology, C.Z. and Z.L.; investigation, C.X.;
data curation, S.Y. and X.X.; writing—original draft preparation, T.P.; writing—review and editing,
Y.G.; supervision, X.S. All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: The data are not publicly available because they involve information
on Chinese oil fields and need to be kept confidential.
Acknowledgments: The author would like to thank the academic salon of the High-Pressure Water Jet
Drilling and Completion Laboratory of China University of Petroleum (Beijing) and Zhaopeng Zhu.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Mo, H.; Shi, Z.; Hao, S.; Li, Q. Analysis and Application of Treatment Techniques in Horizontal Directional Drilling Borehole
Accident. Procedia Earth Planet 2011, 3, 273–279. [CrossRef]
2. Sha, L. Drilling Parameters Optimization Technology Status and Development Trend. China Pet. Mach. 2016, 44, 29–33. [CrossRef]
3. Chen, X.; Gao, D.; Guo, B.; Feng, Y. Real-time optimization of drilling parameters based on mechanical specific energy for rotating
drilling with positive displacement motor in the hard formation. J. Nat. Gas Sci. Eng. 2016, 35, 686–694. [CrossRef]
4. Bahari, A.; Seyed, A.B. Drilling cost optimization in a hydrocarbon field by combination of comparative and mathematical
methods. Pet. Sci. 2009, 6, 451–463. [CrossRef]
5. Hankins, D.; Salehi, S.; Saleh, F.K. An integrated approach for drilling optimization using advanced drilling optimizer. J. Pet. Eng.
2015, 2015, 281276. [CrossRef]
6. Hegde, C.; Daigle, H.; Gray, K.E. Performance comparison of algorithms for real-time rate-of-penetration optimization in drilling
using data-driven models. SPE J. 2018, 23, 1706–1722. [CrossRef]
7. Gray, K.E.; Hegde, C. Use of machine learning and data analytic to increase drilling efficiency for nearby wells. J. Nat. Gas Sci. Eng.
2017, 40, 327–335. [CrossRef]
8. Momeni, M.; Hosseini, S.J.; Ridha, S.; Laruccia, M.B. An optimum drill bit selection technique using artificial neural networks and
genetic algorithms to increase the rate of penetration. J. Eng. Sci. 2018, 13, 361–372.
9. Abughaban, M.; Alshaarawi, A.; Meng, C. Optimization of drilling performance based on an intelligent drilling advisory system.
In Proceedings of the International Petroleum Technology Conference, Beijing, China, 26–28 March 2019. [CrossRef]
10. Payette, G.S.; Spivey, B.J.; Wang, L.; Bailey, J.R. Real-time well-site based surveillance and optimization platform fordrilling:
Technology, basic workflows and field results. In Proceedings of the SPE/IADC Drilling Conference and Exhibition, The Hague,
The Netherlands, 14–16 March 2017. [CrossRef]
11. Gidh, Y.; Purwanto, A.; Bits, S. Artificial Neural Network Drilling Parameter Optimization System Improves ROP by Predict-
ing/Managing Bit Wear. In Proceedings of the SPE Intelligent Energy International, Utrecht, The Netherlands, 27–29 March 2012.
[CrossRef]
Appl. Sci. 2022, 12, 11704 14 of 14
12. Guria, C.; Goli, K.K.; Pathak, A.K. Multi-objective optimization of oil well drilling using elitist non-dominated sorting genetic
algorithm. Pet. Sci. 2014, 11, 97–110. [CrossRef]
13. Ammar, A.; Mahmoud, A.; Beshir, M. Hybrid data driven drilling and rate of penetration optimization. J. Pet. Sci. Eng. 2021, 200, 108075.
[CrossRef]
14. Omojuwa, E.; Osianya, S.; Ahmed, R. Influence of Dynamic Drilling Parameters on Axial Load and Torque Transfer in Extended-
Reach Horizontal Wells. In Proceedings of the SPE Annual Technical Conference and Exhibition, Amsterdam, The Netherlands,
27–29 October 2014. [CrossRef]
15. Wang, Z.; Wang, X.; Chen, L. Status and Prospect of Technologies to Reduce Cost and Increase Efficiency for Drilling in Bohai
Oilfield. Xinjiang Oil Gas 2022, 18, 66–72.
16. Bingham, M.G. A New Approach to Interpreting Rock Drillability; Petroleum Publishing Company: Tulsa, OK, USA, 1965.
17. Bourgoyne, A.; Young, F.S. A multiple regression approach to optimal drilling and abnormal pressure detection. Soc. Pet. Eng. J.
1974, 14, 371–384. [CrossRef]
18. Hareland, G.; Rampersad, P. Drag-bit model including wear. In Proceedings of the SPE Latin America/Caribbean Petroleum
Engineering Conference, Buenos Aires, Argentina, 27–29 April 1994. [CrossRef]
19. Al-abduljabbar, A. A robust rate of penetration model for carbonate formation. J. Energy Resour. Technol. 2019, 141, 042903.
[CrossRef]
20. Hegde, C.; Wallace, S.; Gray, K. Using trees, bagging, and random forests to predict rate of penetration. In Proceedings of the SPE
Middle East Intelligent Oil and Gas Conference and Exhibition, Abu Dhabi, UAE, 15–16 September 2015. [CrossRef]
21. Chiranth, H.; Hugh, D.; Harry, M. Analysis of rate of penetration (ROP) prediction in drilling using physics-based and data-driven
models. J. Pet. Sci. Eng. 2017, 159, 295–306. [CrossRef]
22. Su, X.; Sun, J.; Gao, X.; Wang, M. Prediction method of Drilling rate penetration based on GBDT algorithm. Comput. Appl. Softw.
2019, 36, 87–92.
23. Ashrafi, S.B.; Anemangely, M.; Sabah, M.; Ameri, M.J. Application of hybrid artificial neural networks for predicting rate of
penetration (ROP): A case study from Marun oil field. J. Petrol. Sci. Eng. 2019, 175, 604–623. [CrossRef]
24. Soares, C.; Gray, K. Real-time predictive capabilities of analytical and machine learning rate of penetration (ROP) models.
J. Pet. Sci. Eng. 2019, 172, 934–959. [CrossRef]
25. Hassan, A.; Elkatatny, S.; Al-Majed, A. Coupling rate of penetration and mechanical specific energy to Improve the efficiency of
drilling gas wells. J. Nat. Gas Sci. Eng. 2020, 83, 103558. [CrossRef]
26. Song, X.; Pei, Z.; Wang, P.; Zhang, G. Intelligent Prediction for Rate of Penetration Based on Support Vector Machine Regression.
Xinjiang Oil Gas 2022, 18, 14–20.
27. Ahmed, O.S.; Adeniran, A.A.; Samsuri, A. Computational intelligence based prediction of drilling rate of penetration: A
comparative study. J. Pet. Sci. Eng. 2019, 172, 1–12. [CrossRef]
28. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [CrossRef]
29. Zhu, S.; Song, X.; Li, G. Intelligent real-time drag and torque analysis and sticking trend prediction of drill string. Oil Drill. Prod. Technol.
2021, 43, 428–435. [CrossRef]
30. Ho, H.-S. An Improved Modeling Program for Computing the Torque and Drag in Directional and Deep Wells. In Proceedings of
the SPE Annual Technical Conference and Exhibition, Houston, TX, USA, 2–5 October 1988. [CrossRef]
31. Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T.A.M.T. A fast and elitist multi-objective genetic algorithm: NSGA-II. IEEE Trans.
Evol. Comput. 2002, 6, 182–197. [CrossRef]
32. Boslaugh, S. Statistics in a Nutshell: A Desktop Quick Reference; O’Reilly Media Inc.: Boston, MA, USA, 2012.