Artificial Intelligence in Chemical Engineering Solving Problems
Artificial Intelligence in Chemical Engineering Solving Problems
Big data analytics is the journey to turn data into insights for more informed business and
operational decisions. As the chemical engineering community is collecting more data
(volume) from different sources (variety), this journey becomes more challenging in terms of
using the right data and the right tools (analytics) to make the right decisions in real time
(velocity). This article highlights recent big data advancements in five industries, including
chemicals, energy, semiconductors, pharmaceuticals, and food, and then discusses
technical, platform, and culture challenges. To reach the next milestone in multiplying
successes to the enterprise level, government, academia, and industry need to
collaboratively focus on workforce development and innovation.
- ADVANCES IN APPLICATIONS:
Chemical Process Industry The chemical process industry spans a large scale from
commodity chemicals, petrochemicals, refinery products, specialty chemicals, and life
sciences to consumer products. The scale of manufacturing increases from small-scale,
specialized products, such as life sciences or consumer products, to large-scale chemical
and petrochemical production facilities. Modern petrochemical and chemical complexes are
tightly integrated, with many manufacturing units geographically concentrated in one
location. As a result, incremental improvements in energy efficiency, reliability, and safety
would be amplified owing to the economy of scale. A recent survey by
PricewaterhouseCoopers reveals that 88% of chemical industry executives acknowledge
that data analytics will be crucial for retaining the competitive advantage in five years (14).
As a result, big data analytics is expected to be a key growth area in the industry.
Historically, the chemical process industry is one of the earliest adopters of computer-based
control. Safe and efficient plant operation requires constant monitoring of thousands of
process variables. Collection of process data for monitoring and control became routine and
provided a platform for exploration and development of data-driven methods and
applications. Database servers designed specifically for storing process data, such as
OSISoft PITM and AspenTech IP21TM, enabled easy and reliable access to process data by
engineers and researchers. The chemical process industry has been a pioneer in adopting
data-driven tools owing to this advantage (7). This section first highlights the ongoing
data-driven analytics efforts that have already seen success in the industry and then
discusses the emerging technologies that could potentially revolutionize analytics-driven
decision making.
Machine learning (ML) has the potential to overcome the limitations of mechanistic modeling
as ML methods can learn complex behaviors, the model development is cheaper, and it can
be advantageous for optimization. Chemical engineering (CE) already experienced two big
waves of ML applications between the 1980s and 2008, i.e., expert systems and (shallow)
artificial neural networks (ANNs). These waves had limited impact due to several reasons: i)
lack of data, of data accessibility, of computation power, and of programming
environments/paradigms, ii) competing successful emerging technologies for CE, in
particular, mechanistic modeling, optimization, and model predictive control.
Fig.1
Within PSE, various data-driven models have been used for regression and subsequent
process optimization. As illustrated in Fig. 2, the complexity of applied data-driven models
ranges from linear approximations to deep ANNs. For a long time, the literature focused on
linear models to approximate simulation and experimental data . Since the 1990s, shallow
ANNs have been used extensively. Shallow ANNs can theoretically approximate any
nonlinear, smooth function given a sufficient number of neurons to any given positive
accuracy on a training data set. In many PSE applications, ANNs are fitted to complete
processes (black-box approach, e.g.,), or ANNs are combined with mechanistic model
equations (hybrid modeling approach, e.g.,). Subsequently, the obtained process models
can be optimized, e.g., to identify process design. Today, the re-emergence of ML is mostly
driven by deep ANNs and big data. The deep ANNs are believed to become more important
in PSE because abundant data becomes available, e.g., through smart manufacturing, high
throughput experiments, and simulation studies. However, applications of deep ANNs are
still limited in process design in PSE.
Fig. 2
Description:
Overview of data-driven models embedded in optimization problems in CE: linear, convex
region linear surrogate model , nonlinear basis functions, e.g., ALAMO , piecewise
polynomial function, spline function, Gaussian process, support vector machine, ensemble
tree model (e.g., random forest, gradient boosted trees), ANNs with ReLU activation , and
other ANNs with more complex activation functions. Note that the models are ordered by
their estimated ability to learn complex dependencies.
While optimization of problems with linear models can be solved globally on a large scale,
e.g., for structural optimization, linear models cannot learn high-dimensional nonlinear
problems accurately. On the other hand, the consideration of more complex data-driven
models like ANNs and GPs, which could reflect high dimensional nonlinear problems better,
has long been limited to local or stochastic solution approaches. Although some tailor-made
data-driven models can be solved using state-of-the-art global solvers, these are also limited
to low-dimensional problems. A few researchers in CE and ML have developed tailored
optimization approaches for problems with ML models embedded. Mistry et al. proposed a
tailored algorithm for problems with gradient boosted trees embedded. Grimstad and
coworkers proposed an algorithm for the optimization of piecewise polynomial functions and
spline functions . Some previous works also used general-purpose global solvers to solve
optimization problems with complex surrogate models embedded but observed
computational burdens. Recently, Schweidtmann and Mitsos proposed an efficient
reduced-space optimization formulation for global optimization of deep ANNs. Notably, ANNs
with rectified linear unit (ReLU) activations have recently been reformulated as mixed-integer
linear programs (MILPs). In the MILP formulations, binary variables are introduced to divide
the domain of the piecewise linear ReLU activation functions into two linear sub-domains.
Similarly, tree models can be reformulated as MILPs. However, the number of integer
variables and constraints grows linearly with the model complexity (e.g., number of nodes in
the ANN) Overall, optimal decision-making based on data has already seen some work on
optimal process synthesis and optimal process operation. However, there are still severe
limitations in integrating learning and optimization frameworks that exhibit complex ML
models.