Deep learning-RNN
Deep learning-RNN
A novel model for water quality prediction caused by non-point sources pollution
based on deep learning and feature extraction methods,
Journal of Hydrology,
Volume 612, Part A,
2022,
128081,
ISSN 0022-1694,
https://doi.org/10.1016/j.jhydrol.2022.128081.
(https://www.sciencedirect.com/science/article/pii/S0022169422006564)
Abstract: Non-point source (NPS) pollution is an important factor affecting the
quality of water environment. In recent years, a large number of online water
quality monitoring stations have been used to obtain continuous time series water
quality monitoring data. These data provide the necessary basis for the application
of deep learning methods in water quality prediction. However, the prediction
accuracy of traditional deep learning methods is low, especially in predicting the
water quality with NPS pollution. Aiming to address this limitation, a novel deep
learning model named SOD-VGG-LSTM with the simulation-observation difference (SOD)
modular based on physical process, the visual geometry (VGG) modular reflecting
spatial characteristics, and the long short-term memory (LSTM) modular based on
deep learning method was developed to improve the accuracy of the water quality
prediction with NPS pollution. The established model can overcome the problem that
mechanism models can not predict the changes of water quality on the hourly or
minute time scale. The model was applied in Lijiang River watershed. Experimental
results indicated that the proposed model had the highest accuracy in the extreme
value prediction compared with the mechanism model and LSTM model. The maximum
relative errors between the predicted and observed results for DO, CODMn, NH3-N,
and TP were 8.47%, 19.76%, 24.1%, and 35.4%, respectively. The model evaluation
demonstrated thatthe established SOD-VGG-LSTM model achieved superior computational
performance compared to Auto Regression Integreate Moving Average model (ARIMA),
Support Vector Regression model (SVR), and Recurrent Neural Network model (RNN).
The evaluation results showed that SOD-VGG-LSTM achieved 3.2–39.3% higher R2 than
ARIMA, SVR and RNN. The proposed model can provide a new method for water quality
prediction with NPS pollution.
Keywords: Non-point source pollution; Spatial characteristics; LSTM; VGG
Min Zhao, Weizheng Yan, Na Luo, Dongmei Zhi, Zening Fu, Yuhui Du, Shan Yu, Tianzi
Jiang, Vince D. Calhoun, Jing Sui,
An attention-based hybrid deep learning framework integrating brain connectivity
and activity of resting-state functional MRI data,
Medical Image Analysis,
Volume 78,
2022,
102413,
ISSN 1361-8415,
https://doi.org/10.1016/j.media.2022.102413.
(https://www.sciencedirect.com/science/article/pii/S1361841522000652)
Abstract: Functional magnetic resonance imaging (fMRI) as a promising tool to
investigate psychotic disorders can be decomposed into useful imaging features such
as time courses (TCs) of independent components (ICs) and functional network
connectivity (FNC) calculated by TC cross-correlation. TCs reflect the temporal
dynamics of brain activity and the FNC characterizes temporal coherence across
intrinsic brain networks. Both features have been used as input to deep learning
approaches with decent results. However, few studies have tried to leverage their
complementary information to learn optimal representations at multiple facets.
Motivated by this, we proposed a Hybrid Deep Learning Framework integrating brain
Connectivity and Activity (HDLFCA) together by combining convolutional recurrent
neural network (C-RNN) and deep neural network (DNN), aiming to improve
classification accuracy and interpretability simultaneously. Specifically, C-RNNAM
was proposed to extract temporal dynamic dependencies with an attention module (AM)
to automatically learn discriminative knowledge from TC nodes, while DNN was
applied to identify the most group-discriminative FNC patterns with layer-wise
relevance propagation (LRP). Then, both prediction outputs were concatenated to
build a new feature matrix, generating the final decision by logistic regression.
The effectiveness of HDLFCA was validated on both multi-site schizophrenia (SZ, n ∼
1100) and public autism datasets (ABIDE, n ∼ 1522) by outperforming 12 alternative
models at 2.8-8.9% accuracy, including 8 models using either static FNC or TCs and
4 models using dynamic FNC. Appreciable classification accuracy was achieved for HC
vs. SZ (85.3%) and HC vs. Autism (72.4%) respectively. More importantly, the most
group-discriminative brain regions can be easily attributed and visualized,
providing meaningful biological interpretability and highlighting the great
potential of the proposed HDLFCA model in the identification of valid neuroimaging
biomarkers.
Keywords: Attention mechanism; Deep learning; Brain connectivity and activity; fMRI
Qingtai Wang, Hongquan Huang, Xingke Ma, Zhiwen Shen, Chenglin Zhong, Weicheng
Ding, Wei Zhou, Jianbin Zhou,
Trapezoidal pile-up nuclear pulse parameter identification method based on deep
learning transformer model,
Applied Radiation and Isotopes,
Volume 190,
2022,
110515,
ISSN 0969-8043,
https://doi.org/10.1016/j.apradiso.2022.110515.
(https://www.sciencedirect.com/science/article/pii/S0969804322004006)
Abstract: Pile-up between adjacent nuclear pulses is unavoidable in the actual
detection process. Some scholars have tried to apply deep learning techniques to
identify pile-up nuclear pulse parameters. However, traditional deep learning
recurrent neural networks (RNNs) suffer from inefficient pulse recognition and poor
recognition of pile-up nuclear pulses with short intervals between adjacent pulses.
In this paper, a Transformer model with an attention mechanism as the core to
recognize pile-up nuclear pulses is innovatively applied, aiming to provide a more
accurate and efficient method for pile-up nuclear pulse recognition. Thus, it gives
a better help for the spectrum correction with a high count rate.
Keywords: Nuclear pulse; Attention mechanism; Transformer
Masayuki Nigo, Hong Thoai Nga Tran, Ziqian Xie, Han Feng, Bingyu Mao, Laila Rasmy,
Hongyu Miao, Degui Zhi,
PK-RNN-V E: A deep learning model approach to vancomycin therapeutic drug
monitoring using electronic health record data,
Journal of Biomedical Informatics,
Volume 133,
2022,
104166,
ISSN 1532-0464,
https://doi.org/10.1016/j.jbi.2022.104166.
(https://www.sciencedirect.com/science/article/pii/S1532046422001782)
Abstract: Vancomycin is a commonly used antimicrobial in hospitals, and therapeutic
drug monitoring (TDM) is required to optimize its efficacy and avoid toxicities.
Bayesian models are currently recommended to predict the antibiotic levels. These
models, however, although using carefully designed lab observations, were often
developed in limited patient populations. The increasing availability of electronic
health record (EHR) data offers an opportunity to develop TDM models for real-world
patient populations. Here, we present a deep learning-based pharmacokinetic
prediction model for vancomycin (PK-RNN-V E) using a large EHR dataset of 5,483
patients with 55,336 vancomycin administrations. PK-RNN-V E takes the patient’s
real-time sparse and irregular observations and offers dynamic predictions. Our
results show that RNN-PK-V E offers a root mean squared error (RMSE) of 5.39 and
outperforms the traditional Bayesian model (VTDM model) with an RMSE of 6.29. We
believe that PK-RNN-V E can provide a pharmacokinetic model for vancomycin and
other antimicrobials that require TDM.
Keywords: Vancomycin; Recurrent neural network; Pharmacokinetics; Deep learning;
Bayesian model
S. Ali Naqvi, Meagan T.M. King, Trevor J. DeVries, Herman W. Barkema, Rob Deardon,
Data considerations for developing deep learning models for dairy applications: A
simulation study on mastitis detection,
Computers and Electronics in Agriculture,
Volume 196,
2022,
106895,
ISSN 0168-1699,
https://doi.org/10.1016/j.compag.2022.106895.
(https://www.sciencedirect.com/science/article/pii/S0168169922002125)
Abstract: With growing adoption of precision dairy technologies, the use of big
data is becoming increasingly common in the dairy industry. The speed at which data
are generated has led to increased interest in developing detection and predictive
models for animal health and disease events using real time records. When combining
data from multiple sources, statistical methods exist to account for the underlying
heterogeneity in data collected from commercial farms, although its impact on
predictive models is not known. We investigated how 4 different issues commonly
seen in these large datasets impact the performance of deep recurrent neural
networks (RNNs) trained to detect the onset of clinical mastitis (CM) in dairy
cows. Data were simulated by first sampling from real-world data and adding noise,
then defining the association between predictor variables and CM while
incorporating parameters to reflect underlying heterogeneity: 1) random effects to
reflect unmeasured variability at the farm level (3 levels – none, moderate, high);
2) random effects to reflect unmeasured variability at the cow level (3 levels –
none, moderate, high); 3) missed recording of CM cases (3 false-negative rates –
0.10, 0.25, 0.50); and 4) incomplete observations due to certain farms not having a
somatic cell count sensor (SCC data missing vs SCC data included). At baseline
(moderate farm and cow random effects; moderate misclassification; 42% herds with
SCC sensor) the model achieved a sensitivity and specificity of 86% and 90%
respectively. Higher levels of unmeasured variability at the farm and cow levels
resulted in reduced model performance (sensitivity and specificity of 76% and 85%
at the highest levels), indicating that data collection and feature selection
should be informed by previous knowledge of the associations between the outcome
and predictors when possible, and that model performance may be limited when
predictors are selected only from routinely collected data. However, even when 50%
of CM cases were incorrectly recorded as CM-negative, model performance did not
decrease, demonstrating that deep RNNs are robust to the level of misclassification
that would be typically encountered in dairy datasets. RNNs were also able to
accurately detect CM onset even when a highly predictive variable, somatic cell
count, was excluded from training and test data, but the models took longer to
train. The effect of unmeasured variability on model performance demonstrates how
predictors should be selected for RNNs, whereas RNNs appear to be very robust to
misclassification in training data as well as missing variables. Researchers
developing studies using deep learning should therefore focus their attention more
on predictor selection than on reducing or standardizing outcome recording, since
RNNs appear to be robust to the latter, while being more strongly impacted by the
former.
Keywords: Mastitis; Detection; Robotic milking systems; Simulation; Dairy
Run-Ze Xu, Jia-Shun Cao, Jing-Yang Luo, Qian Feng, Bing-Jie Ni, Fang Fang,
Integrating mechanistic and deep learning models for accurately predicting the
enrichment of polyhydroxyalkanoates accumulating bacteria in mixed microbial
cultures,
Bioresource Technology,
Volume 344, Part B,
2022,
126276,
ISSN 0960-8524,
https://doi.org/10.1016/j.biortech.2021.126276.
(https://www.sciencedirect.com/science/article/pii/S0960852421016187)
Abstract: The enrichment of polyhydroxyalkanoates (PHA) accumulating bacteria (PAB)
in mixed microbial cultures (MMC) is extremely difficult to be predicted and
optimized. Here we demonstrate that mechanistic and deep learning models can be
integrated innovatively to accurately predict the dynamic enrichment of PAB. Well-
calibrated activated sludge models (ASM) of the PAB enrichment process provide
time-dependent data under different operating conditions. Recurrent neural network
(RNN) models are trained and tested based on the time-dependent dataset generated
by ASM. The accurate prediction performance is achieved (R2 > 0.991) for three
different PAB enrichment datasets by the optimized RNN model. The optimized RNN
model can also predict the equilibrium concentration of PAB (R2 = 0.944) and
corresponding time, which represents the end of the PAB enrichment process. This
study demonstrates the strength of integrating mechanistic and deep learning models
to predict long-term variations of specific microbes, helping to optimize their
selection process for PHA production.
Keywords: Recurrent neural networks (RNN); Long short-term memory (LSTM); Activated
sludge model (ASM); Polyhydroxyalkanoates (PHA); Microbial enrichment
Xinxing Zhao, Joel Weijia Lai, Andrew Fu Wah Ho, Nan Liu, Marcus Eng Hock Ong, Kang
Hao Cheong,
Predicting hospital emergency department visits with deep learning approaches,
Biocybernetics and Biomedical Engineering,
Volume 42, Issue 3,
2022,
Pages 1051-1065,
ISSN 0208-5216,
https://doi.org/10.1016/j.bbe.2022.07.008.
(https://www.sciencedirect.com/science/article/pii/S0208521622000729)
Abstract: Overcrowding in emergency department (ED) causes lengthy waiting times,
reduces adequate emergency care and increases rate of mortality. Accurate
prediction of daily ED visits and allocating resources in advance is one of the
solutions to ED overcrowding problem. In this paper, a deep stacked architecture is
being proposed and applied to the daily ED visits prediction problem with deep
components such as Long Short Term Memory (LSTM), Gated Recurrent Units (GRU) and
simple Recurrent Neural Network (RNN). The proposed architecture achieves very high
mean accuracy level (94.28–94.59%) in daily ED visits predictions. We have also
compared the performance of this architecture with non-stacked deep models and
traditional prediction models. The results indicate that deep stacked models
outperform (4–7%) the traditional prediction models and other non-stacked deep
learning models (1–2%) in our prediction tasks. The application of deep neural
network in ED visits prediction is novel as this is one of the first studies to
apply a deep stacked architecture in this field. Importantly, our models have
achieved better prediction accuracy (in one case comparable) than the state-of-the-
art in the literature.
Keywords: Deep learning; Deep stacked architecture; ED Prediction; LSTM; RNN; GRU
Phuong Thao Thi Ngo, Mahdi Panahi, Khabat Khosravi, Omid Ghorbanzadeh, Narges
Kariminejad, Artemi Cerda, Saro Lee,
Evaluation of deep learning algorithms for national scale landslide susceptibility
mapping of Iran,
Geoscience Frontiers,
Volume 12, Issue 2,
2021,
Pages 505-519,
ISSN 1674-9871,
https://doi.org/10.1016/j.gsf.2020.06.013.
(https://www.sciencedirect.com/science/article/pii/S1674987120301687)
Abstract: The identification of landslide-prone areas is an essential step in
landslide hazard assessment and mitigation of landslide-related losses. In this
study, we applied two novel deep learning algorithms, the recurrent neural network
(RNN) and convolutional neural network (CNN), for national-scale landslide
susceptibility mapping of Iran. We prepared a dataset comprising 4069 historical
landslide locations and 11 conditioning factors (altitude, slope degree, profile
curvature, distance to river, aspect, plan curvature, distance to road, distance to
fault, rainfall, geology and land-sue) to construct a geospatial database and
divided the data into the training and the testing dataset. We then developed RNN
and CNN algorithms to generate landslide susceptibility maps of Iran using the
training dataset. We calculated the receiver operating characteristic (ROC) curve
and used the area under the curve (AUC) for the quantitative evaluation of the
landslide susceptibility maps using the testing dataset. Better performance in both
the training and testing phases was provided by the RNN algorithm (AUC = 0.88) than
by the CNN algorithm (AUC = 0.85). Finally, we calculated areas of susceptibility
for each province and found that 6% and 14% of the land area of Iran is very highly
and highly susceptible to future landslide events, respectively, with the highest
susceptibility in Chaharmahal and Bakhtiari Province (33.8%). About 31% of cities
of Iran are located in areas with high and very high landslide susceptibility. The
results of the present study will be useful for the development of landslide hazard
mitigation strategies.
Keywords: CNN; RNN; Deep learning; Landslide; Iran
Lulu Qi, Jialuo Du, Yue Sun, Yongzhao Xiong, Xinyao Zhao, Daodong Pan, Yueru Zhi,
Yali Dang, Xinchang Gao,
Umami-MRNN: Deep learning-based prediction of umami peptide using RNN and MLP,
Food Chemistry,
Volume 405, Part A,
2023,
134935,
ISSN 0308-8146,
https://doi.org/10.1016/j.foodchem.2022.134935.
(https://www.sciencedirect.com/science/article/pii/S0308814622028977)
Abstract: Umami components are an important part of food condiments, and the use of
umami peptides in the condiment industry has received great attention. However,
traditional methods for umami peptide identification are time-consuming, labor-
intensive, and difficult to achieve high throughput. Therefore, it is essential to
develop an effective algorithm to identify potential umami peptides. In this study,
we proposed a prediction method for umami peptides called Umami-MRNN. We
constructed a merged model for the Multi-layer Perceptron and Recurrent Neural
Network. We then developed predictors with six feature vectors as the input. We
trained the neural networks using the training dataset and selected hyperparameters
of machine learning models via a 10-fold cross-validation. The independent tests
showed that Umami-MRNN achieved an accuracy of 90.5% and a Matthews correlation
coefficient value of 0.811. To assist the scientific community, we also developed a
publicly accessible web server at https://umami-mrnn.herokuapp.com/.
Keywords: Umami peptides; Deep learning; Multi-layer perceptron; Recurrent neural
network; Predictor
Jingyang Wang, Xiaolei Li, Jiazheng Li, Qiuhong Sun, Haiyao Wang,
NGCU: A New RNN Model for Time-Series Data Prediction,
Big Data Research,
Volume 27,
2022,
100296,
ISSN 2214-5796,
https://doi.org/10.1016/j.bdr.2021.100296.
(https://www.sciencedirect.com/science/article/pii/S2214579621001131)
Abstract: With the rapid development of machine learning, a possibility is provided
for high-precision prediction of time-series. This paper proposes a new unit which
is called New Gate Control Unit (NGCU) based on Recurrent Neural Networks (RNN).
The proposal of NGCU is mainly used for prediction of time-series data. NGCU
alleviates the problems of gradient disappearance and explosion of traditional RNN.
Compared with Long Short-Term Memory (LSTM) and Gate Recurrent Unit (GRU), NGCU
improves not only the computational complexity of gating unit but also the
sensitivity of model learning. To verify the accuracy, efficiency and feasibility
of NGCU, this paper uses RNN, LSTM and GRU to conduct comparative experiments, and
uses three different data of air quality, Hang Seng Index, and gold future price to
prove the generalization of NGCU. Mean Absolute Error (MAE), Mean Squared Error
(MSE), Explained Variance Score (EVS), R2 and training time are used to evaluate
experimental results. Among the three different data prediction results, the R2 of
NGCU is 0.9736, 0.9872, and 0.9231, respectively. And NGCU's MAE, MSE, EVS are also
the best. Compared with LSTM and GRU, NGCU has the least training time, which is
323.5261s, 53.3257s, and 43.4814s respectively.
Keywords: NGCU; RNN; Time-series; Prediction
Mohammad Eslami, Julia A. Kim, Miao Zhang, Michael V. Boland, Mengyu Wang, Dolly S.
Chang, Tobias Elze,
Visual Field Prediction: Evaluating the Clinical Relevance of Deep Learning Models,
Ophthalmology Science,
Volume 3, Issue 1,
2023,
100222,
ISSN 2666-9145,
https://doi.org/10.1016/j.xops.2022.100222.
(https://www.sciencedirect.com/science/article/pii/S2666914522001117)
Abstract: Purpose
Two novel deep learning methods using a convolutional neural network (CNN) and a
recurrent neural network (RNN) have recently been developed to forecast future
visual fields (VFs). Although the original evaluations of these models focused on
overall accuracy, it was not assessed whether they can accurately identify patients
with progressive glaucomatous vision loss to aid clinicians in preventing further
decline. We evaluated these 2 prediction models for potential biases in
overestimating or underestimating VF changes over time.
Design
Retrospective observational cohort study.
Participants
All available and reliable Swedish Interactive Thresholding Algorithm Standard 24-2
VFs from Massachusetts Eye and Ear Glaucoma Service collected between 1999 and 2020
were extracted. Because of the methods’ respective needs, the CNN data set included
54 373 samples from 7472 patients, and the RNN data set included 24 430 samples from
1809 patients.
Methods
The CNN and RNN methods were reimplemented. A fivefold cross-validation procedure
was performed on each model, and pointwise mean absolute error (PMAE) was used to
measure prediction accuracy. Test data were stratified into categories based on the
severity of VF progression to investigate the models’ performances on predicting
worsening cases. The models were additionally compared with a no-change model that
uses the baseline VF (for the CNN) and the last-observed VF (for the RNN) for its
prediction.
Main Outcome Measures
PMAE in predictions.
Results
The overall PMAE 95% confidence intervals were 2.21 to 2.24 decibels (dB) for the
CNN and 2.56 to 2.61 dB for the RNN, which were close to the original studies’
reported values. However, both models exhibited large errors in identifying
patients with worsening VFs and often failed to outperform the no-change model.
Pointwise mean absolute error values were higher in patients with greater changes
in mean sensitivity (for the CNN) and mean total deviation (for the RNN) between
baseline and follow-up VFs.
Conclusions
Although our evaluation confirms the low overall PMAEs reported in the original
studies, our findings also reveal that both models severely underpredict worsening
of VF loss. Because the accurate detection and projection of glaucomatous VF
decline is crucial in ophthalmic clinical practice, we recommend that this
consideration is explicitly taken into account when developing and evaluating
future deep learning models.
Keywords: Deep learning; Artificial intelligence; Glaucoma; Visual fields;
Prediction
Mumtaz Ali, Ramendra Prasad, Yong Xiang, Adarsh Sankaran, Ravinesh C. Deo, Fuyuan
Xiao, Shuyu Zhu,
Advanced extreme learning machines vs. deep learning models for peak wave energy
period forecasting: A case study in Queensland, Australia,
Renewable Energy,
Volume 177,
2021,
Pages 1031-1044,
ISSN 0960-1481,
https://doi.org/10.1016/j.renene.2021.06.052.
(https://www.sciencedirect.com/science/article/pii/S0960148121009186)
Abstract: The peak period of an energy-generating wave is one of the most important
parameters that describe the spectral shape of the oceanic wave, as this indicates
the duration for which the waves prevail with respect to their maximum extractable
energy. In this paper, a half-hourly peak wave energy period (TP) forecast model is
constructed using a suite of statistically significant lagged inputs based on the
partial auto-correlation function with an extreme learning machine model developed
and its predictive utility is benchmarked against deep learning models, i.e.,
convolutional neural network (CNN/CovNet) and recurrent neural network (RNN) models
and other traditional M5tree, Conditional Maximization based Multiple Linear
Regression (MLR-ECM) and MLR models. The objective model (ELM) vs. the comparison
models (CNN, RNN, M5tree, MLR-ECM, and MLR) were trained and validated
independently on the test dataset obtained from coastal zones of eastern Australia
that have a high potential for implementation of wave energy generation systems.
The outcomes ascertain that the ELM model can generate significantly accurate
predictions of the half-hourly peak wave energy period, providing a good level of
accuracy relative to deep learning models in selected coastal study zones. The
study establishes the practical usefulness of the ELM model as being a noteworthy
methodology for the applications in renewable and sustainable energy resource
management systems.
Keywords: Deep learning; RNN; CNN; ELM; Peak wave energy period; Coastal waves
Mohammed Farsi,
Application of ensemble RNN deep neural network to the fall detection through IoT
environment,
Alexandria Engineering Journal,
Volume 60, Issue 1,
2021,
Pages 199-211,
ISSN 1110-0168,
https://doi.org/10.1016/j.aej.2020.06.056.
(https://www.sciencedirect.com/science/article/pii/S1110016820303239)
Abstract: The emerging of new models in machine learning enhances the performance
of algorithms proposed to address several challenging tasks such as object
recognition, classification and identification purpose. Nowadays. the deep learning
algorithms are playing a massive role in accurately addressing complex problems due
to their capability of learning various complex features from data. One limitation
of deep learning is a lack of sufficient data for training. In this study, we
proposed variants of Long Short Term Memory (LSTM) model and ensemble learning
methods such as XGBoost, AdaBoost, Bagging, Stacking and Random forest. The
experimentation is carried out on Time series data generated from the Internet of
Things (IoT) devices. To validate the proposed method, we have used a freely
available dataset on the web namely Smart-Fall datasets. To measure the performance
of the proposed method, we have used standard performance measures namely,
accuracy, precision, recall, f-score, specificity, geometric mean and confusion
matrix. A set of experimental details are carried out on the SmartFall dataset and
the experimental results exhibit that the Random forest algorithm performs better
when compared with a single deep LSTM model and different ensemble techniques.
Keywords: Ensemble methods; Deep learning; Recurrent neural network; Fall
detection; Time series; IoT
Abhijit Singh, Ananya Nath, Shibendu Shekhar Roy, Aditya Kumar Lohar,
Modeling of laser aided direct metal deposition of stainless steel using supervised
deep learning algorithms,
Materials Today: Proceedings,
Volume 62, Part 1,
2022,
Pages 360-366,
ISSN 2214-7853,
https://doi.org/10.1016/j.matpr.2022.03.468.
(https://www.sciencedirect.com/science/article/pii/S2214785322017291)
Abstract: Laser Direct Metal Deposition (LDMD) is a very promising additive
manufacturing methodology which provides metal cladding layer over the substrate
with good corrosion resistance, desired surface finish and very accurate geometry.
In this work, an attempt has been made to develop a forward and reverse model of
laser aided direct metal deposition utilizing deep learning algorithms. Forward
model aims at predicting the deposited bead height and width from the known set of
LDMD process parameters such as laser power, scanning velocity, and powder flow
rate. Reverse modelling shows the effectiveness of each input parameter over the
chosen response factors. Therefore, in the reverse modelling, the optimized input
parameters values have been tried to predict to get the desired deposition
characteristics during experimentation. Three different neural networks based deep
learning architectures, namely, Multi-Layer Feed Forward Neural Network (MLFFNN),
Recurrent Neural Networks (RNN) and Radial Basis Function Neural Network (RBFNN),
have been put forward to develop the said predictive models. The supervised
learning algorithms are used to train the neural networks. The performances of
MLFFNN, RNN and RBFNN models are compared among themselves. The results show that
all the models are capable of making better predictions and the models can be
effectively used in shop floor in optimal selection of most influential parameters
for the desired outputs.
Keywords: Direct metal deposition; Forward modeling; Reverse modeling; Additive
Manufacturing; Supervised learning; Deep learning algorithms
Jihong Deng, Xujie Zhang, Meichen Li, Hui Jiang, Quansheng Chen,
Feasibility study on Raman spectra-based deep learning models for monitoring the
contamination degree and level of aflatoxin B1 in edible oil,
Microchemical Journal,
Volume 180,
2022,
107613,
ISSN 0026-265X,
https://doi.org/10.1016/j.microc.2022.107613.
(https://www.sciencedirect.com/science/article/pii/S0026265X22004416)
Abstract: Edible oil is one of the three major nutritional sources of human body,
and its quality is directly related to human health. This study proposes two novel
deep learning architectures to achieve qualitative and quantitative monitoring of
the degree and level of aflatoxin B1 (AFB1) contamination synthetically of edible
oil in laboratory level. Based on Raman spectra acquired, the structures of a
convolutional neural network (CNN) and a recurrent neural network (RNN) were
designed, respectively; and deep learning models were established to achieve
qualitative identification of the AFB1 contamination degree and quantitative
detection of the AFB1 contamination level of edible oil samples. The results showed
that both the CNN model and the RNN model showed perfect recognition performance
when identifying the contamination degree of edible oil samples by the AFB1, and
their recognition accuracy reached 100% when predicting the independent samples in
the prediction set. The RNN model has better detection performance than the CNN
model when detecting the AFB1 contamination level of edible oil samples. In the
prediction set, the coefficient of determination (RP2) and ration of prediction to
deviation (RPD) of the RNN model were 0.95 and 4.86, respectively. The feasibility
study results demonstrate that deep learning combined with Raman spectroscopy can
achieve high-precision monitoring of edible oil mycotoxins. In addition, deep
learning has a good promising tool in the field of spectral chemometrics analysis.
Keywords: Edible oil; Aflatoxin B1; Raman spectroscopy; Convolutional neural
network; Recurrent neural network
K.E. ArunKumar, Dinesh V. Kalaga, Ch. Mohan Sai Kumar, Masahiro Kawaji, Timothy M
Brenza,
Forecasting of COVID-19 using deep layer Recurrent Neural Networks (RNNs) with
Gated Recurrent Units (GRUs) and Long Short-Term Memory (LSTM) cells,
Chaos, Solitons & Fractals,
Volume 146,
2021,
110861,
ISSN 0960-0779,
https://doi.org/10.1016/j.chaos.2021.110861.
(https://www.sciencedirect.com/science/article/pii/S0960077921002149)
Abstract: In December 2019, first case of the COVID-19 was reported in Wuhan, Hubei
province in China. Soon world health organization has declared contagious
coronavirus disease (a.k.a. COVID-19) as a global pandemic in the month of March
2020. Over the span of eleven months, it has rapidly spread out all over the world
with total confirmed cases of ~ 41.39 M and causing a total fatality of ~1.13 M. At
present, the entire mankind is facing serious threat and it is believed that COVID-
19 may have been around for quite some time. Therefore, it has become imperative to
forecast the global impact of COVID-19 in the near future. The present work
proposes state-of-art deep learning Recurrent Neural Networks (RNN) models to
predict the country-wise cumulative confirmed cases, cumulative recovered cases and
the cumulative fatalities. The Gated Recurrent Units (GRUs) and Long Short-Term
Memory (LSTM) cells along with Recurrent Neural Networks (RNN) were developed to
predict the future trends of the COVID-19. We have used publicly available data
from John Hopkins University's COVID-19 database. In this work, we emphasize the
importance of various factors such as age, preventive measures, and healthcare
facilities, population density, etc. that play vital role in rapid spread of COVID-
19 pandemic. Therefore, our forecasted results are very helpful for countries to
better prepare themselves to control the pandemic.
Keywords: Forecasting COVID-19 pandemic; Time series analysis; Gated Recurrent
Units (GRUs); Long Short-Term Memory (LSTM); Recurrent Neural Networks (RNNs)
Jingqiu Guo, Yangzexi Liu, Qingyan (Ken) Yang, Yibing Wang, Shouen Fang,
GPS-based citywide traffic congestion forecasting using CNN-RNN and C3D hybrid
model,
Transportmetrica A Transport Science,
Volume 17, Issue 2,
2021,
Pages 190-211,
ISSN 2324-9935,
https://doi.org/10.1080/23249935.2020.1745927.
(https://www.sciencedirect.com/science/article/pii/S2324993522001531)
Abstract: ABSTRACT
Traffic congestion prediction in citywide road networks is a challenging research
field in metropolitan transportation operation and management. Recent advances in
GPS technology offer great opportunities to improve upon the limitations on the
availability and quality of traffic data. Motivated by the success of deep neural
networks and considering the spatial dependencies and temporal evolutions of
network traffic, we propose an innovative deep learning-based mapping to cube
architecture for network-wide urban traffic forecasting. Experiments using real
Taxi GPS vehicle trajectory data confirm the accuracy and effectiveness of the
proposed approach combining 3-Dimensional Convolutional Networks (C3D) with
Convolutional Neuron Networks (CNNs) and Recurrent Neuron Networks (RNNs), called
CRC3D as a hybrid method integrating CNN-RNNs and C3Ds. We also compared a variety
of recurrent neural network architectures. Results show that CRC3D succeeds in
inheriting the advantages of C3D and CNN-RNN, and show its consistent and
satisfactory results in urban complex system.
Keywords: Traffic congestion; deep learning; C3D
Mahdi Panahi, Abolfazl Jaafari, Ataollah Shirzadi, Himan Shahabi, Omid Rahmati,
Ebrahim Omidvar, Saro Lee, Dieu Tien Bui,
Deep learning neural networks for spatially explicit prediction of flash flood
probability,
Geoscience Frontiers,
Volume 12, Issue 3,
2021,
101076,
ISSN 1674-9871,
https://doi.org/10.1016/j.gsf.2020.09.007.
(https://www.sciencedirect.com/science/article/pii/S1674987120302085)
Abstract: Flood probability maps are essential for a range of applications,
including land use planning and developing mitigation strategies and early warning
systems. This study describes the potential application of two architectures of
deep learning neural networks, namely convolutional neural networks (CNN) and
recurrent neural networks (RNN), for spatially explicit prediction and mapping of
flash flood probability. To develop and validate the predictive models, a
geospatial database that contained records for the historical flood events and geo-
environmental characteristics of the Golestan Province in northern Iran was
constructed. The step-wise weight assessment ratio analysis (SWARA) was employed to
investigate the spatial interplay between floods and different influencing factors.
The CNN and RNN models were trained using the SWARA weights and validated using the
receiver operating characteristics technique. The results showed that the CNN model
(AUC = 0.832, RMSE = 0.144) performed slightly better than the RNN model
(AUC = 0.814, RMSE = 0.181) in predicting future floods. Further, these models
demonstrated an improved prediction of floods compared to previous studies that
used different models in the same study area. This study showed that the spatially
explicit deep learning neural network models are successful in capturing the
heterogeneity of spatial patterns of flood probability in the Golestan Province,
and the resulting probability maps can be used for the development of mitigation
plans in response to the future floods. The general policy implication of our study
suggests that design, implementation, and verification of flood early warning
systems should be directed to approximately 40% of the land area characterized by
high and very susceptibility to flooding.
Keywords: Spatial modeling; Machine learning; Convolutional neural networks;
Recurrent neural networks; GIS; Iran
Mohammad Masum, M.A. Masud, Muhaiminul Islam Adnan, Hossain Shahriar, Sangil Kim,
Comparative study of a mathematical epidemic model, statistical modeling, and deep
learning for COVID-19 forecasting and management,
Socio-Economic Planning Sciences,
Volume 80,
2022,
101249,
ISSN 0038-0121,
https://doi.org/10.1016/j.seps.2022.101249.
(https://www.sciencedirect.com/science/article/pii/S0038012122000271)
Abstract: The COVID-19 pandemic has caused a global crisis with 47,209,305
confirmed cases and 1,209,505 confirmed deaths worldwide as of November 2, 2020.
Forecasting confirmed cases and understanding the virus dynamics is necessary to
provide valuable insights into the growth of the outbreak and facilitate policy-
making regarding virus containment and utilization of medical resources. In this
study, we applied a mathematical epidemic model (MEM), statistical model, and
recurrent neural network (RNN) variants to forecast the cumulative confirmed cases.
We proposed a reproducible framework for RNN variants that addressed the stochastic
nature of RNN variants leveraging z-score outlier detection. We incorporated
heterogeneity in susceptibility into the MEM considering lockdowns and the dynamic
dependency of the transmission and identification rates which were estimated using
Poisson likelihood fitting. While the experimental results demonstrated the
superiority of RNN variants in forecasting accuracy, the MEM presented
comprehensive insights into the virus spread and potential control strategies.
Keywords: COVID-19 forecasting; Management; Mathematical epidemic model;
Statistical modeling; Deep learning
Muhammad Naveed Akhter, Saad Mekhilef, Hazlie Mokhlis, Raza Ali, Muhammad Usama,
Munir Azam Muhammad, Anis Salwa Mohd Khairuddin,
A hybrid deep learning method for an hour ahead power output forecasting of three
different photovoltaic systems,
Applied Energy,
Volume 307,
2022,
118185,
ISSN 0306-2619,
https://doi.org/10.1016/j.apenergy.2021.118185.
(https://www.sciencedirect.com/science/article/pii/S0306261921014562)
Abstract: The integration of photovoltaic energy into a grid demands accurate power
output forecasting. In this research, an hour ahead prediction of power output is
performed on an annual basis over real data period (2016–2019) for three different
PV systems based on polycrystalline, monocrystalline, and thin-film technologies.
The solar radiation, ambient temperature, module temperature and wind speed are the
considered input parameters, while the power output of each PV system is the output
parameter. A hybrid deep learning (DL) method (SSA-RNN-LSTM) is proposed for an
hour ahead prediction of output power for each PV system. The proposed technique is
compared with GA-RNN-LSTM, PSO-RNN-LSTM and RNN-LSTM. The considered forecasting
accuracy measurement parameters are RMSE, MSE, MAE and coefficient of determination
(R2). The findings elaborate that SSA-RNN-LSTM has shown better forecasting
accuracy with the lowest (RMSE and MSE), highest (R2) and highest convergence speed
compared to other methods. The proposed model has shown testing (RMSE and MAE) of
(19.14% and 21.57%), (15.4% and 10.81%) and (22.9% and 25.2%) lower than RNN-LSTM
for polycrystalline, monocrystalline and thin-film PV systems respectively.
Furthermore, the proposed model is found more robust in predicting the power output
for three different PV systems over four years data period.
Keywords: An hour ahead power output forecasting; Hybrid deep learning; PV systems;
SSA-RNN-LSTM; PSO-RNN-LSTM; GA-RNN-LSTM
Yue Ma, Yingjie Hu, Glenn R. Moncrieff, Jasper A. Slingsby, Adam M. Wilson, Brian
Maitner, Ryan Zhenqi Zhou,
Forecasting vegetation dynamics in an open ecosystem by integrating deep learning
and environmental variables,
International Journal of Applied Earth Observation and Geoinformation,
Volume 114,
2022,
103060,
ISSN 1569-8432,
https://doi.org/10.1016/j.jag.2022.103060.
(https://www.sciencedirect.com/science/article/pii/S1569843222002485)
Abstract: Open (i.e., non-forest) ecosystems, such as savannas, shrublands, and
grasslands, contain over 40 % of the global total ecosystem organic carbon and
harbor a substantial portion of the world’s biodiversity. Accurately forecasting
vegetation dynamics is critical for managing biodiversity, fire, water, and carbon
in these open ecosystems. Unlike forests or other relatively stable ecosystems,
open ecosystems can have dramatically changing vegetation states since they are
prone to natural disturbances, long-term trends, and short-term events.
Consequently, it is challenging to accurately predict vegetation state in this type
of ecosystems. This paper investigates the use of deep learning based approaches
for forecasting vegetation dynamics in an open ecosystem, the fynbos shrublands of
the Cape Floristic Region of South Africa, a global biodiversity hotspot. We
experiment with different deep learning models and examine the ability of thirteen
environmental variables, such as precipitation, fire history, and temperature, to
enhance the forecasting. We find that the ConvLSTM model can forecast vegetation
state more accurately than four other compared baseline approaches. The
environmental variable mean precipitation in July (winter) provides the most
prominent enhancement for forecasting among the tested variables. Finally, we
discuss the pros and cons of using a deep learning based approach for vegetation
forecasting in open ecosystems from a conservation management perspective.
Keywords: Open ecosystems; NDVI; RNN; ConvLSTM; Cape Floristic Region; GeoAI
Amer M. Johri, Krishna V. Singh, Laura E. Mantella, Luca Saba, Aditya Sharma, John
R. Laird, Kumar Utkarsh, Inder M. Singh, Suneet Gupta, Manudeep S. Kalra, Jasjit S.
Suri,
Deep learning artificial intelligence framework for multiclass coronary artery
disease prediction using combination of conventional risk factors, carotid
ultrasound, and intraplaque neovascularization,
Computers in Biology and Medicine,
Volume 150,
2022,
106018,
ISSN 0010-4825,
https://doi.org/10.1016/j.compbiomed.2022.106018.
(https://www.sciencedirect.com/science/article/pii/S0010482522007405)
Abstract: Objective
Cardiovascular disease (CVD) is a major healthcare challenge and therefore early
risk assessment is vital. Previous assessment techniques use either “conventional
CVD risk calculators (CCVRC)” or machine learning (ML) paradigms. These techniques
are ad-hoc, unreliable, not fully automated, and have variabilities. We, therefore,
introduce AtheroEdge-MCDLAI (AE3.0DL) windows-based platform using multiclass Deep
Learning (DL) system.
Methods
Data was collected on 500 patients having both carotid ultrasound and corresponding
coronary angiography scores (CAS), measured as stenosis in coronary arteries and
considered as the gold standard. A total of 39 covariates were used, clubbed into
three clusters, namely (i) Office-based: age, gender, body mass index, smoker,
hypertension, systolic blood pressure, and diastolic blood pressure; (ii)
Laboratory-based: Hyperlipidemia, hemoglobin A1c, and estimated glomerular
filtration rate; and (iii) Carotid ultrasound image phenotypes: maximum plaque
height, total plaque area, and intra-plaque neovascularization. Baseline
characteristics for four classes (target labels) having significant (p < 0.0001)
values were calculated using Chi-square and ANOVA. For handling the cohort's
imbalance in the risk classes, AE3.0DL used the synthetic minority over-sampling
technique (SMOTE). AE3.0DL used Recurrent Neural Network (RNN) and Long Short-Term
Memory (LSTM) DL models and the performance (accuracy and area-under-the-curve) was
computed using 10-fold cross-validation (90% training, 10% testing) frameworks.
AE3.0DL was validated and benchmarked.
Results
The AE3.0DL using RNN and LSTM showed an accuracy and AUC (p < 0.0001) pairs as
(95.00% and 0.98), and (95.34% and 0.99), respectively, and showed an improvement
of 32.93% and 9.94% against CCVRC and ML, respectively. AE3.0DL runs in <1 s.
Conclusion
DL algorithms are a powerful paradigm for coronary artery disease (CAD) risk
prediction and CVD risk stratification.
Keywords: Coronary artery disease prediction; Carotid ultrasound; Artificial
intelligence; Machine learning; Deep learning; Performance evaluation
Mohammad Ehsan Basiri, Shahla Nemati, Moloud Abdar, Erik Cambria, U. Rajendra
Acharya,
ABCDM: An Attention-based Bidirectional CNN-RNN Deep Model for sentiment analysis,
Future Generation Computer Systems,
Volume 115,
2021,
Pages 279-294,
ISSN 0167-739X,
https://doi.org/10.1016/j.future.2020.08.005.
(https://www.sciencedirect.com/science/article/pii/S0167739X20309195)
Abstract: Sentiment analysis has been a hot research topic in natural language
processing and data mining fields in the last decade. Recently, deep neural network
(DNN) models are being applied to sentiment analysis tasks to obtain promising
results. Among various neural architectures applied for sentiment analysis, long
short-term memory (LSTM) models and its variants such as gated recurrent unit (GRU)
have attracted increasing attention. Although these models are capable of
processing sequences of arbitrary length, using them in the feature extraction
layer of a DNN makes the feature space high dimensional. Another drawback of such
models is that they consider different features equally important. To address these
problems, we propose an Attention-based Bidirectional CNN-RNN Deep Model (ABCDM).
By utilizing two independent bidirectional LSTM and GRU layers, ABCDM will extract
both past and future contexts by considering temporal information flow in both
directions. Also, the attention mechanism is applied on the outputs of
bidirectional layers of ABCDM to put more or less emphasis on different words. To
reduce the dimensionality of features and extract position-invariant local
features, ABCDM utilizes convolution and pooling mechanisms. The effectiveness of
ABCDM is evaluated on sentiment polarity detection which is the most common and
essential task of sentiment analysis. Experiments were conducted on five review and
three Twitter datasets. The results of comparing ABCDM with six recently proposed
DNNs for sentiment analysis show that ABCDM achieves state-of-the-art results on
both long review and short tweet polarity classification.
Keywords: Sentiment analysis; Deep learning; Convolutional neural network; Long
short-term memory; Attention mechanism
Xijun Wu, Yudong Niu, Shibo Gao, Zhilei Zhao, Baoran Xu, Renqi Ma, Hailong Liu,
Yungang Zhang,
Identification of antioxidants in edible oil by two-dimensional correlation
spectroscopy combined with deep learning,
LWT,
Volume 162,
2022,
113436,
ISSN 0023-6438,
https://doi.org/10.1016/j.lwt.2022.113436.
(https://www.sciencedirect.com/science/article/pii/S0023643822003711)
Abstract: The quality of edible oil is an essential part of food safety which is
highly concerned by people. In this study, perturbation Raman spectroscopy combined
with deep learning was used to identify antioxidants in edible oils. Convolutional
neural network (CNN) and recurrent neural network (RNN) are two classical network
structures in deep learning. First of all, we explored the identification effect of
antioxidants in edible oils using one-dimensional Raman data combined with one-
dimensional CNN and RNN. At the same time, we also compared the identification
effect of the data set under a single heating time disturbance. Then two-
dimensional correlation spectroscopy combined with a two-dimensional CNN model was
used to identify the types of antioxidants. It was found that the final
classification accuracy reached 97%, which was nearly 10% higher than the one-
dimensional CNN model. This showed that the two-dimensional correlation spectral
analysis based on external disturbance can “amplify” the subtle differences in the
spectral data. In addition, the traditional chemometric method, partial least
squares discriminant analysis (PLS-DA), was used as a control experiment. According
to this study, it can be seen that the perturbation spectrum combined with deep
learning was feasible in the detection of trace substances in edible oils.
Keywords: Edible oil; Antioxidant; Perturbation spectrum; Two-dimensional
correlation spectra; Deep learning
Huai Su, Enrico Zio, Jinjun Zhang, Mingjing Xu, Xueyi Li, Zongjie Zhang,
A hybrid hourly natural gas demand forecasting method based on the integration of
wavelet transform and enhanced Deep-RNN model,
Energy,
Volume 178,
2019,
Pages 585-597,
ISSN 0360-5442,
https://doi.org/10.1016/j.energy.2019.04.167.
(https://www.sciencedirect.com/science/article/pii/S036054421930800X)
Abstract: The rapid development of big data and smart technology in the natural gas
industry requires timely and accurate forecasting of natural gas consumption on
different time horizons. In this work, we propose a robust hybrid hours-ahead gas
consumption method by integrating Wavelet Transform, RNN-structured deep learning
and Genetic Algorithm. The Wavelet Transform is used to reduce the complexity of
the forecasting tasks by decomposing the original series of gas loads into several
sub-components. The RNN-structured deep learning method is built up via combining a
multi-layer Bi-LSTM model and a LSTM model. The multi-layer Bi-LSTM model can
comprehensively capture the features in the sub-components and the LSTM model is
used to forecast the future gas consumption based on these abstracted features. To
enhance the performance of the RNN-structured deep learning model, Genetic
Algorithm is employed to optimize the structure of each layer in the model.
Besides, the dropout technology is applied in this work to overcome the potential
problem of overfitting. In this case study, the effectiveness of the developed
method is verified from multiple perspective, including graphical examination,
mathematical errors analysis and model comparison, on different data sets.
Keywords: Natural gas demand forecasting; Deep learning; Recurrent neural network;
Genetic algorithm; Long short time memory model
Shahid Farah, Wood David A, Nisar Humaira, Zameer Aneela, Eger Steffen,
Short-term multi-hour ahead country-wide wind power prediction for Germany using
gated recurrent unit deep learning,
Renewable and Sustainable Energy Reviews,
Volume 167,
2022,
112700,
ISSN 1364-0321,
https://doi.org/10.1016/j.rser.2022.112700.
(https://www.sciencedirect.com/science/article/pii/S1364032122005895)
Abstract: In recent years, wind power has emerged as an important source of
renewable energy. When onshore and offshore wind farm regions are connected to the
grid for power generation, consistent multi-location short-term wind power
predictions are extremely valuable in terms of assuring the power system's safety,
sustainability, and economic operation. An abrupt variation in wind power
generation influences the efficiency of the regional power grid. This makes
accurate short-term forecasting essential for high-level planning and scheduling of
power grids. To address the issue, this paper presents two variants of recurrent
neural networks (RNN): gated recurrent unit (GRU) and long short-term memory (LSTM)
models considering substantially better prediction accuracy to forecast a country-
wide (Germany) wind power data for daily (t + 1), and multi-step (t + 3, t + 5, and
t + 12) hours ahead. In addition, wind velocities [m/s] measured at heights of 2,
10, and 50-m (above ground level) are exploited as an essential characteristic
among the available input variables and evaluated each feature subset based on four
training divisions (80-20%, 70-30%, 60-40%, and 50-50%) and compared the results
with ARIMA and SVR approaches in the literature. The findings reveal that the RNN-
GRU model not only can achieve higher predicting accuracy but also has a faster
learning speed over long sequences.
Keywords: Wind power prediction; Feature selection; Gated recurrent unit; Long
short term memory; Lagged time-series analysis
Zengwei Yao, Zihao Wang, Weihuang Liu, Yaqian Liu, Jiahui Pan,
Speech emotion recognition using fusion of three multi-task learning-based
classifiers: HSF-DNN, MS-CNN and LLD-RNN,
Speech Communication,
Volume 120,
2020,
Pages 11-19,
ISSN 0167-6393,
https://doi.org/10.1016/j.specom.2020.03.005.
(https://www.sciencedirect.com/science/article/pii/S0167639319300329)
Abstract: Speech emotion recognition plays an increasingly important role in
emotional computing and is still a challenging task due to its complexity. In this
study, we developed a framework integrating three distinctive classifiers: a deep
neural network (DNN), a convolution neural network (CNN), and a recurrent neural
network (RNN). The framework was used for categorical recognition of four discrete
emotions (i.e., angry, happy, neutral and sad). Frame-level low-level descriptors
(LLDs), segment-level mel-spectrograms (MS), and utterance-level outputs of high-
level statistical functions (HSFs) on LLDs were passed to RNN, CNN, and DNN,
separately. Three individual models of LLD-RNN, MS-CNN, and HSF-DNN were obtained.
In the models of MS-CNN and LLD-RNN, the attention mechanism based weighted-pooling
method was utilized to aggregate the CNN and RNN outputs. To effectively utilize
the interdependencies between the two approaches of emotion description (discrete
emotion categories and continuous emotion attributes), a multi-task learning
strategy was implemented in these three models to acquire generalized features by
simultaneously operating classification of discrete categories and regression of
continuous attributes. Finally, a confidence-based fusion strategy was developed to
integrate the power of different classifiers in recognizing different emotional
states. Three experiments on emotion recognition based on the IEMOCAP corpus were
conducted. Our experimental results show that the weighted pooling method based on
attention mechanism endowed the neural networks with the capability to focus on
emotionally salient parts. The generalized features learned in the multi-task
learning helped the neural networks to achieve higher accuracies in the tasks of
emotion classification. Furthermore, our proposed fusion system achieved weighted
accuracy of 57.1% and unweighted accuracy of 58.3%, which were significantly higher
than those of each individual classifier. The effectiveness of the proposed
approach based on classifier fusion was thus validated.
Keywords: Speech emotion recognition; Attention mechanism; Multi-task learning;
Classifier fusion
Mengbo Liu, Shaoming Liao, Yifeng Yang, Yanqing Men, Junzuo He, Yongliang Huang,
Tunnel boring machine vibration-based deep learning for the ground identification
of working faces,
Journal of Rock Mechanics and Geotechnical Engineering,
Volume 13, Issue 6,
2021,
Pages 1340-1357,
ISSN 1674-7755,
https://doi.org/10.1016/j.jrmge.2021.09.004.
(https://www.sciencedirect.com/science/article/pii/S1674775521001323)
Abstract: Tunnel boring machine (TBM) vibration induced by cutting complex ground
contains essential information that can help engineers evaluate the interaction
between a cutterhead and the ground itself. In this study, deep recurrent neural
networks (RNNs) and convolutional neural networks (CNNs) were used for vibration-
based working face ground identification. First, field monitoring was conducted to
obtain the TBM vibration data when tunneling in changing geological conditions,
including mixed-face, homogeneous, and transmission ground. Next, RNNs and CNNs
were utilized to develop vibration-based prediction models, which were then
validated using the testing dataset. The accuracy of the long short-term memory
(LSTM) and bidirectional LSTM (Bi-LSTM) models was approximately 70% with raw data;
however, with instantaneous frequency transmission, the accuracy increased to
approximately 80%. Two types of deep CNNs, GoogLeNet and ResNet, were trained and
tested with time-frequency scalar diagrams from continuous wavelet transformation.
The CNN models, with an accuracy greater than 96%, performed significantly better
than the RNN models. The ResNet-18, with an accuracy of 98.28%, performed the best.
When the sample length was set as the cutterhead rotation period, the deep CNN and
RNN models achieved the highest accuracy while the proposed deep CNN model
simultaneously achieved high prediction accuracy and feedback efficiency. The
proposed model could promptly identify the ground conditions at the working face
without stopping the normal tunneling process, and the TBM working parameters could
be adjusted and optimized in a timely manner based on the predicted results.
Keywords: Deep learning; Transfer learning; Convolutional neural network (CNN);
Recurrent neural network (RNN); Ground detection; Tunnel boring machine (TBM)
vibration; Mixed-face ground
Yiqi Jiang, Chaolin Li, Lu Sun, Dong Guo, Yituo Zhang, Wenhui Wang,
A deep learning algorithm for multi-source data fusion to predict water quality of
urban sewer networks,
Journal of Cleaner Production,
Volume 318,
2021,
128533,
ISSN 0959-6526,
https://doi.org/10.1016/j.jclepro.2021.128533.
(https://www.sciencedirect.com/science/article/pii/S0959652621027426)
Abstract: Point source pollution in urban drainage networks, which is difficult to
monitor and control, has been regarded as an intractable problem. To solve the
problem, key water quality indicators must be tracked in the evaluation and
prediction of sewer water quality. However, some of these important chemical
indicators (e.g. biological oxygen demand (BOD5), chemical oxygen demand (COD),
ammonia nitrogen (NH4+-N), total nitrogen (TN), and total phosphorus (TP)) require
a great deal of time and effort to measure, which will adversely affect the
prediction in a sewage network. Existing statistical methods and machine learning
algorithms cannot effectively solve the detection time problem or provide limited
accuracy. Moreover, the lack of various factors taken into account in these methods
results in unsatisfactory predictive performance. Few studies consider the impact
of urban multi-source data on water quality prediction of sewer networks while
developing statistical methods or machine learning algorithms. To address this
problem, we propose a deep learning approach based on multi-source data fusion.
This approach takes into account the following indicators to comprehensively
analyze and predict drainage water quality: environmental indicators (such as area
and diameter); social indicators (such as population); water quantity indicators
(such as drinking water supply, sewage flow, water velocity, and liquid level); and
easily monitored water quality criteria indicators (such as pH, temperature, and
conductivity). To test the effectiveness of this method, we conducted a case study
in a city in southern China. By comparing this new method with the linear method
(multiple linear regression, MLR) and traditional learning algorithm (multilayer
perception, MLP), it is found that the deep learning algorithm—which includes
recurrent neural network (RNN), long-short term memory (LSTM), and gated recurrent
unit (GRU)—has good predictive performance, in which GRU shows superior ability in
predicting the chemical index of water quality and the learning curve is faster.
The results showed that the GRU achieved 0.82%–5.07% higher R2 than RNN and LSTM,
9.13%–15.03% higher R2 than traditional machine learning algorithms, and 37.26%–
43.38% higher R2 than linear methods.
Keywords: Urban sewer networks; Water quality prediction; Deep learning; Multi-
source data fusion
Liwen Xing, Ningbo Cui, Chunwei Liu, Lu Zhao, Li Guo, Taisheng Du, Cun Zhan,
Zongjun Wu, Shenglin Wen, Shouzheng Jiang,
Estimation of daily apple tree transpiration in the Loess Plateau region of China
using deep learning models,
Agricultural Water Management,
Volume 273,
2022,
107889,
ISSN 0378-3774,
https://doi.org/10.1016/j.agwat.2022.107889.
(https://www.sciencedirect.com/science/article/pii/S037837742200436X)
Abstract: Quantitatively characterizing and accurately predicting plant
transpiration are of great significance, but directly measuring transpiration is
impractical, time-consuming, and labor-intensive. This study compared the
transpiration estimation performance of multiple linear regression (MLR), modified
Jarvis–Stewart (MJS), and Shuttleworth–Wallace (S-W) with deep belief network
(DBN), long short-term memory recurrent neural network (LSTM-RNN), and LSTM-RNN
improved with multiple restricted Boltzmann machines (R-L-RNN) using 31 input
combinations comprising complete subsets of Vapor pressure deficit (VPD), Net solar
radiation (Rn), Average air temperature (Ta), Soil water content (SWC), and Leaf
area index (LAI) observations collected at Wuwei, Changwu and Taigu stations on the
Loess Plateau in China. The results showed that R-L-RNN obtained the most accurate
estimations in the partial canopy stage, dense canopy stage, and whole growth
stage, compared to MLR, MJS, S-W, DBN, and LSTM-RNN. The accuracy of the deep
learning models (DNN) increased exponentially as the number of input variables
increased, and the importance of the input variables followed the orders of: LAI
> VPD > Rn > Ta > SWC in the partial and whole canopy stage, and VPD > Rn > Ta
> LAI > SWC in the dense canopy stage. The apple tree transpiration models were
more accurate in the partial and dense canopy stages than the whole growth stage.
The coefficient of determination and Nash-Sutcliffe efficiency coefficient for the
R-L-RNN model increased by 8.1–13.1% and 11.2–25.4% in the partial canopy stage,
respectively, and by 2.6–6.9% and 14.7–20.1% in the dense canopy stage, whereas the
relative root mean square error decreased by 8.7–28.6% and 17.3–38.2%. Overall, R-
L-RNN is the most recommended model for estimating the apple tree transpiration,
because it is such a simple method that agricultural water managers can easily
determine the water consumption of apple trees using limited accessible
observational data.
Keywords: Hybrid deep learning model; Empirical model; Path analysis; Growth stage;
Input variable
Jiahuan Lu, Rui Xiong, Jinpeng Tian, Chenxu Wang, Chia-Wei Hsu, Nien-Ti Tsou,
Fengchun Sun, Ju Li,
Battery degradation prediction against uncertain future conditions with recurrent
neural network enabled deep learning,
Energy Storage Materials,
Volume 50,
2022,
Pages 139-151,
ISSN 2405-8297,
https://doi.org/10.1016/j.ensm.2022.05.007.
(https://www.sciencedirect.com/science/article/pii/S2405829722002446)
Abstract: Accurate degradation trajectory and future life are the key information
of a new generation of intelligent battery and electrochemical energy storage
systems. It is very challenging to obtain accurate predictions against uncertain
application conditions by using only a few known historical data. In this article,
we extend the widely studied remaining useful life (RUL) prediction to the
prediction of charge and discharge capacity trajectories under both fixed and
random future operating conditions. This is achieved by developing a general deep
learning framework cored by recurrent neural network (RNN) which integrates future
current plan and few early capacity-voltage data as inputs. As a case study, we
have experimented with 77 commercial batteries cycled under fixed and random
operating conditions. We demonstrate that the median root mean square error (RMSE)
of prediction can be within 2.4% for NMC/graphite batteries and 2.3% for
LFP/graphite batteries by using 3.8% of the whole life data only. Compared with the
existing methods, the proposed framework predicts more accurately and has a very
balanced performance for both fixed and random future conditions. This work
highlights the promise of actively forecasting the future of batteries based on
RNN.
Xiangyu Fu, Jiafei Fang, Shilin Xiao, Xiaoyu Liu, Lizhuo Zheng, Weisheng Hu,
Dual-RNN based polar decoder with tanh-modified LLR over FSO turbulence channel,
Optics Communications,
Volume 527,
2023,
128965,
ISSN 0030-4018,
https://doi.org/10.1016/j.optcom.2022.128965.
(https://www.sciencedirect.com/science/article/pii/S0030401822006125)
Abstract: In this paper, the deep learning-based Belief Propagation (BP) decoding
is first investigated in a polar-coded free space optical (FSO) communication
system. On the basis of neural Cyclic redundancy check-Polar Belief Propagation
(NCPBP) decoding scheme, we propose a dual-recurrent neural network (RNN) based
decoder for polar codes, which adopts a new training loss function, RNN structure
and tanh-modified input. The presented decoder exhibits better performance to the
NCPBP decoder under different intensities of turbulence. Furthermore, the decoder
trained under fixed turbulence intensity shows higher turbulence adaptability and
its advantage increases with the intensity of turbulence, reaching a decoding gain
of 1 dB under strong turbulence when Rytov variance is 3.5.
Keywords: Polar codes; Free-space optical communication; Deep learning; Belief
propagation; Recurrent neural network
C. Anuradha, M. Ponnavaikko,
A RNN based offloading scheme to reduce latency and preserve energy using RNNBOS,
Measurement: Sensors,
Volume 24,
2022,
100429,
ISSN 2665-9174,
https://doi.org/10.1016/j.measen.2022.100429.
(https://www.sciencedirect.com/science/article/pii/S2665917422000630)
Abstract: Mobile cloud computing is currently evolving quickly in today's trend and
it provides infinite number of applications to the people those who are using
regularly.MCC means the mobile gadgets are strongly tied up with cloud technology
to execute various application for attaining many tasks. Mobile devices contain
different application according to its own capacity to hold each application. In
which many applications are in need of connecting with cloud storage. A new
proposed technique named RNNBOS (Recurrent Neural Network Based Offloading scheme)
is used to compute calculations in terms of energy source of mobile device along
with active conditions of network, Load computations, delay possibility of request
from device and quantitative amount of data being transferred for this purpose. We
have simulated the above technique using python tool and observed RNN based
offloading scheme is good in execution of application using MCC.
Keywords: Computational offloading; Mobile edge computing; Deep neural network;
Energy consumption and mobile cloud computing
Rajib Ghosh,
A Recurrent Neural Network based deep learning model for offline signature
verification and recognition system,
Expert Systems with Applications,
Volume 168,
2021,
114249,
ISSN 0957-4174,
https://doi.org/10.1016/j.eswa.2020.114249.
(https://www.sciencedirect.com/science/article/pii/S0957417420309659)
Abstract: With the recent advancement in information technology field, the demand
to develop a person authentication system through verifying their offline
signatures is gradually increasing. This type of system may be used to verify
various official documents through verifying the signatures of the concerned
persons present in the documents. This article proposes a Recurrent Neural Network
(RNN), a deep learning network, based method to verify and recognize offline
signatures of different persons. Various structural and directional features have
been extracted locally from each signature sample and the generated feature vectors
have been studied using two different models of RNN—long-short term memory (LSTM)
and bidirectional long–short term memory (BLSTM). The performance of the proposed
system has been tested on six widely used public signature databases—GPDS
synthetic, GPDS-300, MCYT-75, CEDAR, BHSig260 Hindi, and BHSig260 Bengali.
Experiment has also been performed using Convolutional Neural Network (CNN) to have
a comparison with RNN based results. Experimental results demonstrate that the
proposed RNN based signature verification and recognition system is superior over
CNN and also outperforms the existing state-of-the-art results in this regard.
Keywords: Offline signature; Verification and recognition; Deep learning; RNN; LSTM
and BLSTM
Haowei Jiang, Feiwei Qin, Jin Cao, Yong Peng, Yanli Shao,
Recurrent neural network from adder’s perspective: Carry-lookahead RNN,
Neural Networks,
Volume 144,
2021,
Pages 297-306,
ISSN 0893-6080,
https://doi.org/10.1016/j.neunet.2021.08.032.
(https://www.sciencedirect.com/science/article/pii/S0893608021003452)
Abstract: The recurrent network architecture is a widely used model in sequence
modeling, but its serial dependency hinders the computation parallelization, which
makes the operation inefficient. The same problem was encountered in serial adder
at the early stage of digital electronics. In this paper, we discuss the
similarities between recurrent neural network (RNN) and serial adder. Inspired by
carry-lookahead adder, we introduce carry-lookahead module to RNN, which makes it
possible for RNN to run in parallel. Then, we design the method of parallel RNN
computation, and finally Carry-lookahead RNN (CL-RNN) is proposed. CL-RNN takes
advantages in parallelism and flexible receptive field. Through a comprehensive set
of tests, we verify that CL-RNN can perform better than existing typical RNNs in
sequence modeling tasks which are specially designed for RNNs. Code and models are
available at: https://github.com/WinnieJiangHW/Carry-lookahead_RNN.
Keywords: Deep learning; Carry-lookahead; Parallel computation; Sequence modeling
Wendong Ge, Jin-Won Huh, Yu Rang Park, Jae-Ho Lee, Young-Hak Kim, Guohai Zhou,
Alexander Turchin,
Using deep learning with attention mechanism for identification of novel temporal
data patterns for prediction of ICU mortality,
Informatics in Medicine Unlocked,
Volume 29,
2022,
100875,
ISSN 2352-9148,
https://doi.org/10.1016/j.imu.2022.100875.
(https://www.sciencedirect.com/science/article/pii/S2352914822000296)
Abstract: Background and objectives
Changes in a patient's condition over time are a backbone of clinical decision
making. However, most currently used methods for identification of patients in
intensive care units (ICUs) at high risk for death do not make effective use of the
temporal dimension of available data. We therefore conducted a study to determine
whether longitudinal data analysis using recurrent neural networks (RNN) with
attention mechanism can identify novel temporal data patterns predictive of adverse
outcomes.
Methods
We analyzed data on patients admitted to the Medical Intensive Care Unit (MICU) of
Asan Medical Center between 2010 and 2017. Static (demographics, diagnoses,
procedures, medications) and longitudinal (vitals, laboratory tests, Glasgow Coma
Scale) variables were included in the analysis. We used an RNN model with long
short-term memory (RNN-LSTM) with attention mechanism to identify and test novel
data patterns predictive of ICU death. We also compared accuracy of prediction of
ICU mortality between a logistic regression and RNN-LSTM models with and without
attention.
Results
Among 4896 patients admitted to the MICU, 548 (11.19%) died. RNN-LSTM model with
attention identified several high-risk longitudinal variable patterns that were
predictive of ICU mortality in a confirmatory analysis, including sustained low
blood oxygen content (OR 2.33; 95% CI 1.16 to 4.70) and high frequency of serum
sodium measurements (OR 1.27; 95% CI 1.04 to 1.56). RNN-LSTM models with and
without attention achieved numerically, but not statistically significantly higher
c-statistics for prediction of ICU mortality compared to logistic regression.
Conclusions
RNN-LSTM model with attention identified novel temporal data patterns predictive of
ICU mortality. These predictors were both statistically significant and clinically
plausible, likely representing progressive respiratory failure (sustained low
oxygen saturation) and close monitoring of a clinically deteriorating patient
(frequent sodium measurements).
Keywords: Predictive modeling; Temporal data; Deep learning; Recurrent neural
networks; Attention
Mouaad Boulakhbar, Markos Farag, Kawtar Benabdelaziz, Tarik Kousksou, Malika Zazi,
A deep learning approach for prediction of electrical vehicle charging stations
power demand in regulated electricity markets: The case of Morocco,
Cleaner Energy Systems,
Volume 3,
2022,
100039,
ISSN 2772-7831,
https://doi.org/10.1016/j.cles.2022.100039.
(https://www.sciencedirect.com/science/article/pii/S2772783122000371)
Abstract: The transport sector is a prominent source of increasing fuel consumption
and greenhouse gas (GHG) emissions. Electric vehicle (EV) is deemed an appealing
solution for those problems. However, due to the variation in charging demands, the
high penetration of electric vehicles may cause different problems to the
distribution network and its reliability. Therefore, several approaches are
employed to predict the EVs charging demand and avoid the corresponding challenges.
This paper compares the performance of four well-known deep learning models, namely
artificial neural networks (ANN), recurrent neural networks (RNNs), long short-term
memory (LSTM), and gated recurrent units (GRUs), in predicting charging demand for
EV users after a charging session begins. We use a dataset consisting of 2000
observations of charging events collected from two public charging stations in
Morocco. According to numerical data results, the first layer of the GRU regression
approach marginally beats the other three methods in estimating power charging
needs. Specifically, the GRU regression model has an RMSE and MAPE of 1.39% and
0.50% in the training stage and 2.90% and 0.76% in the testing stage, respectively.
These findings can assist the National Office of Electricity and Water in Morocco
in ensuring the reliability of grid utility in the short run and guiding them to
construct additional charging stations in the long run.
Keywords: Load forecasting; Deep learning; Electric vehicles; Regulated electricity
market; Morocco