Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (2,402)

Search Parameters:
Keywords = XGboost

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
12 pages, 2388 KiB  
Article
Analyzing the Relationship Between COVID-19 and Sociodemographic and Environmental Factors: A Case Study in Toronto
by Brian Anlan Yu and Qinmin Vivian Hu
Electronics 2024, 13(22), 4524; https://doi.org/10.3390/electronics13224524 (registering DOI) - 18 Nov 2024
Abstract
COVID-19 has disproportionately impacted communities based on sociodemographic and environmental factors. Previous studies have largely focused on traditional statistical models to investigate these disparities with limited attention to within-city variations. This research addresses this gap by employing advanced machine learning models to predict [...] Read more.
COVID-19 has disproportionately impacted communities based on sociodemographic and environmental factors. Previous studies have largely focused on traditional statistical models to investigate these disparities with limited attention to within-city variations. This research addresses this gap by employing advanced machine learning models to predict COVID-19 case counts at the neighborhood level within Toronto. Using algorithms such as Support Vector Regression, Random Forest, Gradient Boosting, and XGBoost, along with SHAP (SHapley Additive exPlanations) analysis, we identify key factors impacting COVID-19 transmission, including air pollution, socioeconomic status, and racialized group membership. Our results demonstrate that sociodemographic factors significantly influence sporadic cases, while environmental factors, particularly air pollutants, are critical in outbreak cases. This study highlights the value of machine learning in understanding complex interactions between risk factors with implications for targeted public health interventions to mitigate COVID-19 disparities. Full article
Show Figures

Figure 1

26 pages, 1044 KiB  
Article
PredXGBR: A Machine Learning Framework for Short-Term Electrical Load Prediction
by Rifat Zabin, Khandaker Foysal Haque and Ahmed Abdelgawad
Electronics 2024, 13(22), 4521; https://doi.org/10.3390/electronics13224521 (registering DOI) - 18 Nov 2024
Abstract
The growing demand for consumer-end electrical load is driving the need for smarter management of power sector utilities. In today’s technologically advanced society, efficient energy usage is critical, leaving no room for waste. To prevent both electricity shortage and wastage, electrical load forecasting [...] Read more.
The growing demand for consumer-end electrical load is driving the need for smarter management of power sector utilities. In today’s technologically advanced society, efficient energy usage is critical, leaving no room for waste. To prevent both electricity shortage and wastage, electrical load forecasting becomes the most convenient way out. However, the conventional and probabilistic methods are less adaptive to the acute, micro, and unusual changes in the demand trend. With the recent development of artificial intelligence (AI), machine learning (ML) has become the most popular choice due to its higher accuracy based on time-, demand-, and trend-based feature extractions. Thus, we propose an Extreme Gradient Boosting (XGBoost) regression-based model—PredXGBR-1, which employs short-term lag features to predict hourly load demand. The novelty of PredXGBR-1 lies in its focus on short-term lag autocorrelations to enhance adaptability to micro-trends and demand fluctuations. Validation across five datasets, representing electrical load in the eastern and western USA over a 20-year period, shows that PredXGBR-1 outperforms a long-term feature-based XGBoost model, PredXGBR-2, and state-of-the-art recurrent neural network (RNN) and long short-term memory (LSTM) models. Specifically, PredXGBR-1 achieves an mean absolute percentage error (MAPE) between 0.98 and 1.2% and an R2 value of 0.99, significantly surpassing PredXGBR-2’s R2 of 0.61 and delivering up to 86.8% improvement in MAPE compared to LSTM models. These results confirm the superior performance of PredXGBR-1 in accurately forecasting short-term load demand. Full article
Show Figures

Figure 1

25 pages, 14501 KiB  
Article
Root-Zone Salinity in Irrigated Arid Farmland: Revealing Driving Mechanisms of Dynamic Changes in China’s Manas River Basin over 20 Years
by Guang Yang, Xuejin Qiao, Qiang Zuo, Jianchu Shi, Xun Wu and Alon Ben-Gal
Remote Sens. 2024, 16(22), 4294; https://doi.org/10.3390/rs16224294 (registering DOI) - 18 Nov 2024
Viewed by 60
Abstract
The risk of soil salinization is prevalent in arid and semi-arid regions, posing a critical challenge to sustainable agriculture. This study addresses the need for accurate assessment of regional root-zone soil salt content (SSC) and understanding of underlying driving mechanisms, which [...] Read more.
The risk of soil salinization is prevalent in arid and semi-arid regions, posing a critical challenge to sustainable agriculture. This study addresses the need for accurate assessment of regional root-zone soil salt content (SSC) and understanding of underlying driving mechanisms, which are essential for developing effective salinization mitigation and water management strategies. A remote sensing inversion technique, initially proposed to estimate root-zone SSC in cotton fields, was adapted and validated more widely to non-cotton farmlands. Validation results (with a coefficient of determination R2 > 0.53) were obtained using data from a three-year (2020–2022) regional survey conducted in the arid Manas River Basin (MRB), Xinjiang, China. Based on this adapted technique, we analyzed the spatiotemporal distributions of root-zone SSC across all farmlands in MRB from 2001 to 2022. Findings showed that root-zone SSC decreased significantly from 5.47 to 3.77 g kg−1 over the past 20 years but experienced a slight increase of 0.15 g kg1 in recent five years (2017–2022), attributed to cultivated area expansion and reduced irrigation quotas due to local water shortages. The driving mechanisms behind root-zone SSC distributions were analyzed using an approach combined with two machine learning algorithms, eXtreme Gradient Boosting (XGBoost) and SHapley Additive exPlanation (SHAP), to identify influential factors and quantify their impacts. The approach demonstrated high predictive accuracy (R2 = 0.96 ± 0.01, root mean squared error RMSE = 0.19 ± 0.03 g kg1, maximum absolute error MAE = 0.14 ± 0.02 g kg1) in evaluating SSC drivers. Factors such as initial SSC, crop type distribution, duration of film mulched drip irrigation implementation, normalized difference vegetation index (NDVI), irrigation amount, and actual evapotranspiration (ETa), with mean (SHAP value) ≥ 0.02 g kg−1, were found to be more closely correlated with root-zone SSC variations than other factors. Decreased irrigation amount appeared as the primary driver for recent increased root-zone SSC, especially in the mid- and down-stream sections of MRB. Recommendations for secondary soil salinization risk reduction include regulation of the planting structure (crop choice and extent of planting area) and maintenance of a sufficient irrigation amount. Full article
Show Figures

Graphical abstract

31 pages, 6184 KiB  
Article
Optimizing Perceived Jogging Supportiveness for Enhanced Sustainable Greenway Design Based on Computer Vision: Implications of the Nonlinear Influence of Perceptual and Physical Characteristics
by Yuhan Liu, Nuo Xu, Chang Liu, Jiayi Zhao and Yongrong Zheng
Sustainability 2024, 16(22), 10038; https://doi.org/10.3390/su162210038 - 18 Nov 2024
Viewed by 124
Abstract
Active transportation and lifestyles are important components of a sustainable city. Greenways play a crucial role in providing conducive environments for jogging. To investigate the influence of micro-scale characteristics on perceived jogging supportiveness (PJS), 230 video clips of greenways within Fuzhou City were [...] Read more.
Active transportation and lifestyles are important components of a sustainable city. Greenways play a crucial role in providing conducive environments for jogging. To investigate the influence of micro-scale characteristics on perceived jogging supportiveness (PJS), 230 video clips of greenways within Fuzhou City were collected as samples. PJS was evaluated using a Likert scale, perceptual characteristics were assessed through a semantic difference scale, and physical characteristics were computed via semantic segmentation. By employing SHAP values and dependence plots within an XGBoost framework, the findings reveal the following: (1) Regarding perceptual characteristics, continuity, culture, and facility affordance exhibit the highest relative importance to PJS (|SHAP| ≥ 0.1). Continuity, naturalness, and vitality generally have positive impacts on PJS, while disturbance is negative. Facility affordance, scale, culture, openness, and brightness demonstrate more complex nonlinear influences that suggest optimal value ranges. (2) Concerning physical characteristics, fences, motor vehicles, and surface material are deemed most influential (|SHAP| ≥ 0.1). The presence of fences, walls, and construction generally negatively affect PJS, while excessive openness is also unfavorable. Comfortable road surfaces are associated with higher levels of PJS. Natural elements and the presence of people and vehicles have promoting effects up to certain thresholds, but beyond that point, they exert opposite influences. Finally, suggestions for designing greenways that encourage jogging are proposed. This study provides practical references for optimizing greenway design to promote active transportation and lifestyles, reinforcing the contribution of green infrastructure to public health in sustainable cities. Full article
Show Figures

Figure 1

14 pages, 237 KiB  
Article
Predictive Analytics for Thyroid Cancer Recurrence: A Machine Learning Approach
by Elizabeth Clark, Samantha Price, Theresa Lucena, Bailey Haberlein, Abdullah Wahbeh and Raed Seetan
Knowledge 2024, 4(4), 557-570; https://doi.org/10.3390/knowledge4040029 (registering DOI) - 18 Nov 2024
Viewed by 125
Abstract
Differentiated thyroid cancer (DTC), comprising papillary and follicular thyroid cancers, is the most prevalent type of thyroid malignancy. Accurate prediction of DTC is crucial for improving patient outcomes. Machine learning (ML) offers a promising approach to analyze risk factors and predict cancer recurrence. [...] Read more.
Differentiated thyroid cancer (DTC), comprising papillary and follicular thyroid cancers, is the most prevalent type of thyroid malignancy. Accurate prediction of DTC is crucial for improving patient outcomes. Machine learning (ML) offers a promising approach to analyze risk factors and predict cancer recurrence. In this study, we aimed to develop predictive models to identify patients at an elevated risk of DTC recurrence based on 16 risk factors. We developed six ML models and applied them to a DTC dataset. We evaluated the ML models using Synthetic Minority Over-Sampling Technique (SMOTE) and with hyperparameter tuning. We measured the models’ performance using precision, recall, F1 score, and accuracy. Results showed that Random Forest consistently outperformed the other investigated models (KNN, SVM, Decision Tree, AdaBoost, and XGBoost) across all scenarios, demonstrating high accuracy and balanced precision and recall. The application of SMOTE improved model performance, and hyperparameter tuning enhanced overall model effectiveness. Full article
19 pages, 7362 KiB  
Article
Highly Efficient JR Optimization Technique for Solving Prediction Problem of Soil Organic Carbon on Large Scale
by Harsh Vazirani, Xiaofeng Wu, Anurag Srivastava, Debajyoti Dhar and Divyansh Pathak
Sensors 2024, 24(22), 7317; https://doi.org/10.3390/s24227317 (registering DOI) - 15 Nov 2024
Viewed by 245
Abstract
We utilized remote sensing and ground cover data to predict soil organic carbon (SOC) content across a vast geographic region. Employing a combination of machine learning and deep learning techniques, we developed a novel data fusion approach that integrated Digital Elevation Model (DEM) [...] Read more.
We utilized remote sensing and ground cover data to predict soil organic carbon (SOC) content across a vast geographic region. Employing a combination of machine learning and deep learning techniques, we developed a novel data fusion approach that integrated Digital Elevation Model (DEM) data, MODIS satellite imagery, WOSIS soil profile data, and CHELSA environmental data. This combined dataset, named GeoBlendMDWC, was specifically designed for SOC prediction. The primary aim of this research is to develop and evaluate a novel optimization algorithm for accurate SOC prediction by leveraging multi-source environmental data. Specifically, this study aims to (1) create an integrated dataset combining remote sensing and ground data for comprehensive SOC analysis, (2) develop a new optimization technique that enhances both machine learning and deep learning model performance, and (3) evaluate the algorithm’s efficiency and accuracy against established optimization methods like Jaya and GridSearchCV. This study focused on India, Australia, and South Africa, countries known for their significant agricultural activities. We introduced a novel optimization technique for both machine learning and deep neural networks, comparing its performance to established methods like the Jaya optimization technique and GridSearchCV. The models evaluated included XGBoost Regression, LightGBM, Gradient Boosting Regression (GBR), Random Forest Regression, Decision Tree Regression, and a Multilayer Perceptron (MLP) model. Our research demonstrated that the proposed optimization algorithm consistently outperformed existing methods in terms of execution time and performance. It achieved results comparable to GridSearchCV, reaching an R2 of 90.16, which was a significant improvement over the base XGBoost model’s R2 of 79.08. In deep learning optimization, it significantly outperformed the Jaya algorithm, achieving an R2 of 61.34 compared to Jaya’s 30.04. Moreover, it was 20–30 times faster than GridSearchCV. Given its speed and accuracy, this algorithm can be applied to real-time data processing in remote sensing satellites. This advanced methodology will greatly benefit the agriculture and farming sectors by providing precise SOC predictions. Full article
(This article belongs to the Section Remote Sensors)
Show Figures

Figure 1

20 pages, 7030 KiB  
Article
Identification of Exploited Unreliable Account Passwords in the Information Infrastructure Using Machine Learning Methods
by Mikhail Rusanov, Mikhail Babenko, Maria Lapina and Mohammad Sajid
Big Data Cogn. Comput. 2024, 8(11), 159; https://doi.org/10.3390/bdcc8110159 - 15 Nov 2024
Viewed by 300
Abstract
Accounts are an integral part of most modern information systems and provide their owners with the ability to authenticate within the system. This paper presents an analysis of existing methods for detecting simple account passwords in automated systems. Their advantages and disadvantages are [...] Read more.
Accounts are an integral part of most modern information systems and provide their owners with the ability to authenticate within the system. This paper presents an analysis of existing methods for detecting simple account passwords in automated systems. Their advantages and disadvantages are listed. A method was developed to detect simple exploitable passwords that administrators can use to supplement other existing methods to increase the overall security of automated systems against threats from accounts potentially compromised by attackers. The method was based on the analysis of commands executed in automated or manual modes with the indication of credentials in plain text. Minimum password strength requirements are provided based on the security level. A special case was considered in which all passwords analyzed in this way were found explicitly in the system logs. We developed a unified definition of the classification of passwords into simple and strong, and also developed machine learning technology for their classification. The method offers a flexible adaptation to a specific system, taking into account the level of significance of the information being processed and the password policy adopted, expressed in the possibility of retraining the machine learning model. The experimental method using machine learning algorithms, namely the ensemble of decision trees, for classifying passwords into strong and potentially compromised by attackers based on flexible password strength criteria, showed high results. The performance of the method is also compared against other machine learning algorithms, specifically XGBoost, Random Forest, and Naive Bayes. The presented approach also solves the problem of detecting events related to the use and storage of credentials in plain text. We used the dataset of approximately 770,000 passwords, allowing the machine learning model to accurately classify 98% of the passwords by their significance levels. Full article
Show Figures

Figure 1

22 pages, 5382 KiB  
Article
Impact of Feature Selection Techniques on the Performance of Machine Learning Models for Depression Detection Using EEG Data
by Marwa Hassan and Naima Kaabouch
Appl. Sci. 2024, 14(22), 10532; https://doi.org/10.3390/app142210532 - 15 Nov 2024
Viewed by 309
Abstract
Major depressive disorder (MDD) poses a significant challenge in mental healthcare due to difficulties in accurate diagnosis and timely identification. This study explores the potential of machine learning models trained on EEG-based features for depression detection. Six models and six feature selection techniques [...] Read more.
Major depressive disorder (MDD) poses a significant challenge in mental healthcare due to difficulties in accurate diagnosis and timely identification. This study explores the potential of machine learning models trained on EEG-based features for depression detection. Six models and six feature selection techniques were compared, highlighting the crucial role of feature selection in enhancing classifier performance. This study investigates the six feature selection methods: Elastic Net, Mutual Information (MI), Chi-Square, Forward Feature Selection with Stochastic Gradient Descent (FFS-SGD), Support Vector Machine-based Recursive Feature Elimination (SVM-RFE), and Minimal-Redundancy-Maximal-Relevance (mRMR). These methods were combined with six diverse classifiers: Logistic Regression, Support Vector Machine (SVM), Random Forest, Extreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), and Light Gradient Boosting Machine (LightGBM). The results demonstrate the substantial impact of feature selection on model performance. SVM-RFE with SVM achieved the highest accuracy (93.54%) and F1 score (95.29%), followed by Logistic Regression with an accuracy of 92.86% and F1 score of 94.84%. Elastic Net also delivered strong results, with SVM and Logistic Regression both achieving 90.47% accuracy. Other feature selection methods yielded lower performance, emphasizing the importance of selecting appropriate feature selection and machine learning algorithms. These findings suggest that careful selection and application of feature selection techniques can significantly enhance the accuracy of EEG-based depression detection. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

14 pages, 1452 KiB  
Article
Online Prediction and Correction of Static Voltage Stability Index Based on Extreme Gradient Boosting Algorithm
by Huiling Qin, Shuang Li, Juncheng Zhang, Zhi Rao, Chengyu He, Zhijun Chen and Bo Li
Energies 2024, 17(22), 5710; https://doi.org/10.3390/en17225710 - 15 Nov 2024
Viewed by 260
Abstract
With the increasing integration of renewable energy sources into the power grid and the continuous expansion of grid infrastructure, real-time preventive control becomes crucial. This article proposes a real-time prediction and correction method based on the extreme gradient boosting (XGBoost) algorithm. The XGBoost [...] Read more.
With the increasing integration of renewable energy sources into the power grid and the continuous expansion of grid infrastructure, real-time preventive control becomes crucial. This article proposes a real-time prediction and correction method based on the extreme gradient boosting (XGBoost) algorithm. The XGBoost algorithm is utilized to evaluate the real-time stability of grid static voltage, with the voltage stability L-index as the prediction target. A correction model is established with the objective of minimizing correction costs while considering the operational constraints of the grid. When the L-index exceeds the warning value, the XGBoost algorithm can obtain the importance of each feature of the system and calculate the sensitivity approximation of highly important characteristics. The model corrects these characteristics to maintain the system’s operation within a reasonably secure range. The methodology is demonstrated using the IEEE-14 and IEEE-118 systems. The results show that the XGBoost algorithm has higher prediction accuracy and computational efficiency in assessing the static voltage stability of the power grid. It is also shown that the proposed approach has the potential to greatly improve the operational dependability of the power grid. Full article
(This article belongs to the Section F1: Electrical Power System)
Show Figures

Figure 1

23 pages, 22588 KiB  
Article
Monitoring Dissolved Organic Carbon Concentration and Flux in the Qiantang Riverine System Using Sentinel-2 Satellite Images
by Yujia Yan, Xianqiang He, Yan Bai, Jinsong Liu, Palanisamy Shanmugame, Yaqi Zhao, Xuan Zhang, Zhihong Wang, Yifan Zhang and Fang Gong
Remote Sens. 2024, 16(22), 4254; https://doi.org/10.3390/rs16224254 - 15 Nov 2024
Viewed by 386
Abstract
Real-time monitoring of riverine-dissolved organic carbon (DOC) and its controlling factors is critical for formulating strategies regarding the river basin and marginal seas pollution prevention and control. In this study, we established a linear regression formulation that relates the permanganate index (CODMn [...] Read more.
Real-time monitoring of riverine-dissolved organic carbon (DOC) and its controlling factors is critical for formulating strategies regarding the river basin and marginal seas pollution prevention and control. In this study, we established a linear regression formulation that relates the permanganate index (CODMn) to the DOC concentration based on in situ measurements collected on five field surveys in 2023–2024. This regression formulation was used on a large number of data collected from automatic monitoring stations in the Qiantang River area to construct a daily quasi-in situ database of DOC concentration. By combining the quasi-in situ DOC data and Sentinel-2 measurements, an enhanced algorithm for empirical DOC estimation was developed (R2 = 0.66) using the extreme gradient boosting (XGBoost) method and its spatial and temporal variations in the Qiantang River were analyzed from 2016 to 2023. Spatially, the main stream of the Qiantang River exhibited an overall decreasing and increasing trend influenced by population density, economic development, and pollutant discharge in the basin area, and the temporal distribution of DOC was controlled by meteorological conditions. The DOC contents had the highest in summer, primarily due to high rainfall and leaching. The inter-annual variation in DOC concentration was influenced by the total annual runoff volumes, with a minimum level of 2.24 mg L−1 in 2023 and a maximum level of 2.45 mg L−1 in 2019. The monthly DOC fluxes ranged from 6.3 to 13.8 × 104 t, with the highest values coinciding with the maximum river discharge volumes in June and July. The DOC levels in the Qiantang River remained relatively high in recent years (2016–2023). This study enables the concerned stakeholders and researchers to better understand carbon transportation and its dynamics in the Qiantang River and its coastal areas. Full article
Show Figures

Figure 1

27 pages, 3743 KiB  
Article
Performance Analysis and Improvement of Machine Learning with Various Feature Selection Methods for EEG-Based Emotion Classification
by Sherzod Abdumalikov, Jingeun Kim and Yourim Yoon
Appl. Sci. 2024, 14(22), 10511; https://doi.org/10.3390/app142210511 - 14 Nov 2024
Viewed by 601
Abstract
Emotion classification is a challenge in affective computing, with applications ranging from human–computer interaction to mental health monitoring. In this study, the classification of emotional states using electroencephalography (EEG) data were investigated. Specifically, the efficacy of the combination of various feature selection methods [...] Read more.
Emotion classification is a challenge in affective computing, with applications ranging from human–computer interaction to mental health monitoring. In this study, the classification of emotional states using electroencephalography (EEG) data were investigated. Specifically, the efficacy of the combination of various feature selection methods and hyperparameter tuning of machine learning algorithms for accurate and robust emotion recognition was studied. The following feature selection methods were explored: filter (SelectKBest with analysis of variance (ANOVA) F-test), embedded (least absolute shrinkage and selection operator (LASSO) tuned using Bayesian optimization (BO)), and wrapper (genetic algorithm (GA)) methods. We also executed hyperparameter tuning of machine learning algorithms using BO. The performance of each method was assessed. Two different EEG datasets, EEG Emotion and DEAP Dataset, containing 2548 and 160 features, respectively, were evaluated using random forest (RF), logistic regression, XGBoost, and support vector machine (SVM). For both datasets, the experimented three feature selection methods consistently improved the accuracy of the models. For EEG Emotion dataset, RF with LASSO achieved the best result among all the experimented methods increasing the accuracy from 98.78% to 99.39%. In the DEAP dataset experiment, XGBoost with GA showed the best result, increasing the accuracy by 1.59% and 2.84% for valence and arousal. We also show that these results are superior to those by the previous other methods in the literature. Full article
(This article belongs to the Special Issue Advances in Biosignal Processing)
Show Figures

Figure 1

22 pages, 4646 KiB  
Article
Concrete Creep Prediction Based on Improved Machine Learning and Game Theory: Modeling and Analysis Methods
by Wenchao Li, Houmin Li, Cai Liu and Kai Min
Buildings 2024, 14(11), 3627; https://doi.org/10.3390/buildings14113627 - 14 Nov 2024
Viewed by 296
Abstract
Understanding the impact of creep on the long-term mechanical features of concrete is crucial, and constructing an accurate prediction model is the key to exploring the development of concrete creep under long-term loads. Therefore, in this study, three machine learning (ML) models, a [...] Read more.
Understanding the impact of creep on the long-term mechanical features of concrete is crucial, and constructing an accurate prediction model is the key to exploring the development of concrete creep under long-term loads. Therefore, in this study, three machine learning (ML) models, a Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting Machine (XGBoost), are constructed, and the Hybrid Snake Optimization Algorithm (HSOA) is proposed, which can reduce the risk of the ML model falling into the local optimum while improving its prediction performance. Simultaneously, the contributions of the input features are ranked, and the optimal model’s prediction outcomes are explained through SHapley Additive exPlanations (SHAP). The research results show that the optimized SVM, RF, and XGBoost models increase their accuracies on the test set by 9.927%, 9.58%, and 14.1%, respectively, and the XGBoost has the highest precision in forecasting the concrete creep. The verification results of four scenarios confirm that the optimized model can precisely capture the compliance changes in long-term creep, meeting the requirements for forecasting the nature of concrete creep. Full article
(This article belongs to the Section Building Materials, and Repair & Renovation)
Show Figures

Figure 1

11 pages, 1745 KiB  
Article
Improved Cd Detection in Rice Grain Using LIBS with Husk-Based XGBoost Transfer Learning
by Weiping Xie, Jiang Xu, Lin Huang, Yuan Xu, Qi Wan, Yangfan Chen and Mingyin Yao
Agriculture 2024, 14(11), 2053; https://doi.org/10.3390/agriculture14112053 - 14 Nov 2024
Viewed by 269
Abstract
Cadmium (Cd) is a highly toxic metal that is difficult to completely eliminate from soil, despite advancements in modern agricultural and environmental technologies that have successfully reduced Cd levels. However, rice remains a key source of Cd exposure for humans. Even small amounts [...] Read more.
Cadmium (Cd) is a highly toxic metal that is difficult to completely eliminate from soil, despite advancements in modern agricultural and environmental technologies that have successfully reduced Cd levels. However, rice remains a key source of Cd exposure for humans. Even small amounts of Cd absorbed by rice can pose a potential health risk to the human body. Laser-induced breakdown spectroscopy (LIBS) has the advantages of simple sample preparation and fast analysis, which, combined with the transfer learning method, is expected to realize the real-time and rapid detection of low-level heavy metals in rice. In this work, 21 groups of naturally matured rice samples from potentially Cd-contaminated environments were collected. These samples were processed into rice husk, brown rice, and polished rice groups, and the reference Cd content was measured by ICP-MS. The XGBoost algorithm, known for its excellent performance in handling high-dimensional data and nonlinear relationships, was applied to construct both the XGBoost base model and the XGBoost-based transfer learning model to predict Cd content in brown rice and polished rice. By pre-training on rice husk source data, the XGBoost-based transfer learning model can learn from the abundant information available in rice husk to improve Cd quantification in rice grain. For brown rice, the XGBoost base model achieved RC2 of 0.9852 and RP2 of 0.8778, which were improved to 0.9885 and 0.9743, respectively, with the XGBoost-based transfer learning model. In the case of polished rice, the base model achieved RC2 of 0.9838 and RP2 of 0.8683, while the transfer learning model enhanced these to 0.9883 and 0.9699, respectively. The results indicate that the transfer learning method not only improves the detection capability for low Cd content in rice but also provides new insights for food safety detection. Full article
(This article belongs to the Section Digital Agriculture)
Show Figures

Figure 1

24 pages, 4650 KiB  
Article
Passenger Flow Prediction for Rail Transit Stations Based on an Improved SSA-LSTM Model
by Xing Zhao, Chenxi Li, Xueting Zou, Xiwang Du and Ahmed Ismail
Mathematics 2024, 12(22), 3556; https://doi.org/10.3390/math12223556 - 14 Nov 2024
Viewed by 309
Abstract
Accurate and timely passenger flow prediction is important for the successful deployment of rail transit intelligent operation. The Sparrow Search Algorithm (SSA) has been applied to the parameter optimization of a Long-Short-Term Memory (LSTM) model. To solve the inherent weaknesses of SSA, this [...] Read more.
Accurate and timely passenger flow prediction is important for the successful deployment of rail transit intelligent operation. The Sparrow Search Algorithm (SSA) has been applied to the parameter optimization of a Long-Short-Term Memory (LSTM) model. To solve the inherent weaknesses of SSA, this paper proposes an improved SSA-LSTM model with optimization strategies including Tent Map and Levy Flight to practice the short-term prediction of boarding passenger flow at rail transit stations. Aimed at the passenger flow at four rail transit stations in Nanjing, China, it is found that the day of a week and rainfall are the influencing factors with the highest correlation. On this basis, we apply the proposed SSA-LSTM and four baseline models to realize the short-term prediction, and carry out the prediction experiments with different time granularities. According to the experimental results, the proposed SSA-LSTM model has a more effective performance than the Support Vector Regression (SVR) method, the eXtreme Gradient Boosting (XGBoost) model, the traditional LSTM model, and the improved LSTM model with the Whale Optimization Algorithm (WOA-LSTM) in the passenger flow prediction. In addition, for most stations, the prediction accuracy of the proposed SSA-LSTM model is greater at a larger time granularity, but there are still exceptions. Full article
Show Figures

Figure 1

46 pages, 4014 KiB  
Article
Robust Human Activity Recognition for Intelligent Transportation Systems Using Smartphone Sensors: A Position-Independent Approach
by John Benedict Lazaro Bernardo, Attaphongse Taparugssanagorn, Hiroyuki Miyazaki, Bipun Man Pati and Ukesh Thapa
Appl. Sci. 2024, 14(22), 10461; https://doi.org/10.3390/app142210461 - 13 Nov 2024
Viewed by 751
Abstract
This study explores Human Activity Recognition (HAR) using smartphone sensors to address the challenges posed by position-dependent datasets. We propose a position-independent system that leverages data from accelerometers, gyroscopes, linear accelerometers, and gravity sensors collected from smartphones placed either on the chest or [...] Read more.
This study explores Human Activity Recognition (HAR) using smartphone sensors to address the challenges posed by position-dependent datasets. We propose a position-independent system that leverages data from accelerometers, gyroscopes, linear accelerometers, and gravity sensors collected from smartphones placed either on the chest or in the left/right leg pocket. The performance of traditional machine learning algorithms (Decision Trees (DT), K-Nearest Neighbors (KNN), Random Forest (RF), Support Vector Classifier (SVC), and XGBoost) is compared against deep learning models (Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Temporal Convolutional Networks (TCN), and Transformer models) under two sensor configurations. Our findings highlight that the Temporal Convolutional Network (TCN) model consistently outperforms other models, particularly in the four-sensor non-overlapping configuration, achieving the highest accuracy of 97.70%. Deep learning models such as LSTM, GRU, and Transformer also demonstrate strong performance, showcasing their effectiveness in capturing temporal dependencies in HAR tasks. Traditional machine learning models, including RF and XGBoost, provide reasonable performance but do not match the accuracy of deep learning models. Additionally, incorporating data from linear accelerometers and gravity sensors led to slight improvements over using accelerometer and gyroscope data alone. This research enhances the recognition of passenger behaviors for intelligent transportation systems, contributing to more efficient congestion management and emergency response strategies. Full article
Show Figures

Figure 1

Back to TopTop