Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (3,958)

Search Parameters:
Keywords = ensemble method

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
23 pages, 11815 KiB  
Article
Landslide Displacement Prediction Stacking Deep Learning Algorithms: A Case Study of Shengjibao Landslide in the Three Gorges Reservoir Area of China
by Hongwei Jiang, Yunmin Wang, Zizheng Guo, Hao Zhou, Jiayi Wu and Xiaoshuang Li
Water 2024, 16(21), 3141; https://doi.org/10.3390/w16213141 (registering DOI) - 2 Nov 2024
Abstract
Computational models enable accurate, timely prediction of landslides based on the monitoring data on-site as the development of artificial intelligence technology. The most existing prediction methods focus on finding a single prediction algorithm with excellent performance or an integrated and efficient hyperparameter optimization [...] Read more.
Computational models enable accurate, timely prediction of landslides based on the monitoring data on-site as the development of artificial intelligence technology. The most existing prediction methods focus on finding a single prediction algorithm with excellent performance or an integrated and efficient hyperparameter optimization algorithm with a highly accurate regression prediction algorithm. In order to break through the limitation of generalization of prediction models, this paper proposes an ensemble model that combines deep learning algorithms, with a stacking framework optimized with the sliding window method. Multiple deep learning algorithms are set as the first layer of the stacking framework, which is optimized with the sliding window method to avoid confusion in the time order of datasets based on time series analysis. The Shengjibao landslide in the Three Gorges Reservoir is used as a case study. First, the cumulative displacement is decomposed into a trend and a periodic term using a moving average method. A single-factor and a multi-factor superposition model based on multiple deep learning algorithms are used to predict the trend and periodic term of the displacement, respectively. Finally, the predicted values of the trend and periodic terms are added to obtain the total predicted landslide displacement. For monitoring point ZK2-3, the values of RMSE and MAPE of the total displacement prediction with the stacking model are 15.93 mm and 0.54%, and the values of RMSE and MAPE of the best-performing individual deep learning model are 20.00 mm and 0.64%. The results show that the stacking model outperforms other models by combining the advantages of each individual deep learning algorithm. This study provides a framework for integrating landslide displacement prediction models. It can serve as a reference for the geological disaster prediction and the establishment of an early warning system in the Three Gorges Reservoir Area. Full article
(This article belongs to the Section Hydrogeology)
Show Figures

Figure 1

22 pages, 5390 KiB  
Article
Detecting Water Stress in Winter Wheat Based on Multifeature Fusion from UAV Remote Sensing and Stacking Ensemble Learning Method
by He Zhao, Jingjing Wang, Jiali Guo, Xin Hui, Yunling Wang, Dongyu Cai and Haijun Yan
Remote Sens. 2024, 16(21), 4100; https://doi.org/10.3390/rs16214100 (registering DOI) - 2 Nov 2024
Viewed by 68
Abstract
The integration of remote sensing technology and machine learning algorithms represents a new research direction for the rapid and large-scale detection of water stress in modern agricultural crops. However, in solving practical agricultural problems, single machine learning algorithms cannot fully explore the potential [...] Read more.
The integration of remote sensing technology and machine learning algorithms represents a new research direction for the rapid and large-scale detection of water stress in modern agricultural crops. However, in solving practical agricultural problems, single machine learning algorithms cannot fully explore the potential information within the data, lacking stability and accuracy. Stacking ensemble learning (SEL) can combine the advantages of multiple single machine learning algorithms to construct more stable predictive models. In this study, threshold values of stomatal conductance (gs) under different soil water stress indices (SWSIs) were proposed to assist managers in irrigation scheduling. In the present study, six irrigation treatments were established for winter wheat to simulate various soil moisture supply conditions. During the critical growth stages, gs was measured and the SWSI was calculated. A spectral camera mounted on an unmanned aerial vehicle (UAV) captured reflectance images in five bands, from which vegetation indices and texture information were extracted. The results indicated that gs at different growth stages of winter wheat was sensitive to soil moisture supply conditions. The correlation between the gs value and SWSI value was high (R2 > 0.79). Therefore, the gs value threshold can reflect the current soil water stress level. Compared with individual machine learning models, the SEL model exhibited higher prediction accuracy, with R2 increasing by 6.67–17.14%. Using a reserved test set, the SEL model demonstrated excellent performance in various evaluation metrics across different growth stages (R2: 0.69–0.87, RMSE: 0.04–0.08 mol m−2 s−1; NRMSE: 12.3–23.6%, MAE: 0.03–0.06 mol m−2 s−1) and exhibited excellent stability and accuracy. This research can play a significant role in achieving large-scale monitoring of crop growth status through UAV, enabling the real-time capture of winter wheat water deficit changes, and providing technical support for precision irrigation. Full article
20 pages, 10999 KiB  
Article
Stacking Ensemble Technique Using Optimized Machine Learning Models with Boruta–XGBoost Feature Selection for Landslide Susceptibility Mapping: A Case of Kermanshah Province, Iran
by Zeynab Yousefi, Ali Asghar Alesheikh, Ali Jafari, Sara Torktatari and Mohammad Sharif
Information 2024, 15(11), 689; https://doi.org/10.3390/info15110689 (registering DOI) - 2 Nov 2024
Viewed by 97
Abstract
Landslides cause significant human and financial losses in different regions of the world. A high-accuracy landslide susceptibility map (LSM) is required to reduce the adverse effects of landslides. Machine learning (ML) is a robust tool for LSM creation. ML models require large amounts [...] Read more.
Landslides cause significant human and financial losses in different regions of the world. A high-accuracy landslide susceptibility map (LSM) is required to reduce the adverse effects of landslides. Machine learning (ML) is a robust tool for LSM creation. ML models require large amounts of data to predict landslides accurately. This study has developed a stacking ensemble technique based on ML and optimization to enhance the accuracy of an LSM while considering small datasets. The Boruta–XGBoost feature selection was used to determine the optimal combination of features. Then, an intelligent and accurate analysis was performed to prepare the LSM using a dynamic and hybrid approach based on the Adaptive Fuzzy Inference System (ANFIS), Extreme Learning Machine (ELM), Support Vector Regression (SVR), and new optimization algorithms (Ladybug Beetle Optimization [LBO] and Electric Eel Foraging Optimization [EEFO]). After model optimization, a stacking ensemble learning technique was used to weight the models and combine the model outputs to increase the accuracy and reliability of the LSM. The weight combinations of the models were optimized using LBO and EEFO. The Root Mean Square Error (RMSE) and Area Under the Receiver Operating Characteristic Curve (AUC-ROC) parameters were used to assess the performance of these models. A landslide dataset from Kermanshah province, Iran, and 17 influencing factors were used to evaluate the proposed approach. Landslide inventory was 116 points, and the combined Voronoi and entropy method was applied for non-landslide point sampling. The results showed higher accuracy from the stacking ensemble technique with EEFO and LBO algorithms with AUC-ROC values of 94.81% and 94.84% and RMSE values of 0.3146 and 0.3142, respectively. The proposed approach can help managers and planners prepare accurate and reliable LSMs and, as a result, reduce the human and financial losses associated with landslide events. Full article
(This article belongs to the Special Issue Emerging Research in Optimization Algorithms in the Era of Big Data)
Show Figures

Figure 1

21 pages, 3521 KiB  
Article
Assessment of Line Outage Prediction Using Ensemble Learning and Gaussian Processes During Extreme Meteorological Events
by Altan Unlu and Malaquias Peña
Wind 2024, 4(4), 342-362; https://doi.org/10.3390/wind4040017 (registering DOI) - 1 Nov 2024
Viewed by 205
Abstract
Climate change is increasing the occurrence of extreme weather events, such as intense windstorms, with a trend expected to worsen due to global warming. The growing intensity and frequency of these events are causing a significant number of failures in power distribution grids. [...] Read more.
Climate change is increasing the occurrence of extreme weather events, such as intense windstorms, with a trend expected to worsen due to global warming. The growing intensity and frequency of these events are causing a significant number of failures in power distribution grids. However, understanding the nature of extreme wind events and predicting their impact on distribution grids can help and prevent these issues, potentially mitigating their adverse effects. This study analyzes a structured method to predict distribution grid disruptions caused by extreme wind events. The method utilizes Machine Learning (ML) models, including K-Nearest Neighbors (KNN), Random Forest (RF), Support Vector Machine (SVM), Decision Trees (DTs), Gradient Boosting Machine (GBM), Gaussian Process (GP), Deep Neural Network (DNN), and Ensemble Learning which combines RF, SVM and GP to analyze synthetic failure data and predict power grid outages. The study utilized meteorological information, physical fragility curves, and scenario generation for distribution systems. The approach is validated by using five-fold cross-validation on the dataset, demonstrating its effectiveness in enhancing predictive capabilities against extreme wind events. Experimental results showed that the Ensemble Learning, GP, and SVM models outperformed other predictive models in the binary classification task of identifying failures or non-failures, achieving the highest performance metrics. Full article
Show Figures

Figure 1

19 pages, 2535 KiB  
Article
Elegante: A Machine Learning-Based Threads Configuration Tool for SpMV Computations on Shared Memory Architecture
by Muhammad Ahmad, Usman Sardar, Ildar Batyrshin, Muhammad Hasnain, Khan Sajid and Grigori Sidorov
Information 2024, 15(11), 685; https://doi.org/10.3390/info15110685 - 1 Nov 2024
Viewed by 276
Abstract
The sparse matrix–vector product (SpMV) is a fundamental computational kernel utilized in a diverse range of scientific and engineering applications. It is commonly used to solve linear and partial differential equations. The parallel computation of the SpMV product is a challenging task. Existing [...] Read more.
The sparse matrix–vector product (SpMV) is a fundamental computational kernel utilized in a diverse range of scientific and engineering applications. It is commonly used to solve linear and partial differential equations. The parallel computation of the SpMV product is a challenging task. Existing solutions often employ a fixed number of threads assignment to rows based on empirical formulas, leading to sub-optimal configurations and significant performance losses. Elegante, our proposed machine learning-powered tool, utilizes a data-driven approach to identify the optimal thread configuration for SpMV computations within a shared memory architecture. It accomplishes this by predicting the best thread configuration based on the unique sparsity pattern of each sparse matrix. Our approach involves training and testing using various base and ensemble machine learning algorithms such as decision tree, random forest, gradient boosting, logistic regression, and support vector machine. We rigorously experimented with a dataset of nearly 1000+ real-world matrices. These matrices originated from 46 distinct application domains, spanning fields like robotics, power networks, 2D/3D meshing, and computational fluid dynamics. Our proposed methodology achieved 62% of the highest achievable performance and is 7.33 times faster, demonstrating a significant disparity from the default OpenMP configuration policy and traditional practice methods of manually or randomly selecting the number of threads. This work is the first attempt where the structure of the matrix is used to predict the optimal thread configuration for the optimization of parallel SpMV computation in a shared memory environment. Full article
Show Figures

Figure 1

24 pages, 5255 KiB  
Article
Deep Ensemble Remote Sensing Scene Classification via Category Distribution Association
by Zhenxin He, Guoxu Li, Zheng Wang, Guanxiong He, Hao Yan and Rong Wang
Remote Sens. 2024, 16(21), 4084; https://doi.org/10.3390/rs16214084 - 1 Nov 2024
Viewed by 315
Abstract
Recently, deep learning models have been successfully and widely applied in the field of remote sensing scene classification. But, the existing deep models largely overlook the distinct learning difficulties associated with discriminating different pairs of scenes. Consequently, leveraging the relationships within category distributions [...] Read more.
Recently, deep learning models have been successfully and widely applied in the field of remote sensing scene classification. But, the existing deep models largely overlook the distinct learning difficulties associated with discriminating different pairs of scenes. Consequently, leveraging the relationships within category distributions and employing ensemble learning algorithms hold considerable potential in addressing these issues. In this paper, we propose a category-distribution-associated deep ensemble learning model that pays more attention to instances that are difficult to identify between similar scenes. The core idea is to utilize the degree of difficulty between categories to guide model learning, which is primarily divided into two modules: category distribution information extraction and scene classification. This method employs an autoencoder to capture distinct scene distributions within the samples and constructs a similarity matrix based on the discrepancies between distributions. Subsequently, the scene classification module adopts a stacking ensemble framework, where the base layer utilizes various neural networks to capture sample representations from shallow to deep levels. The meta layer incorporates a novel multiclass boosting algorithm that integrates sample distribution and representations of information to discriminate scenes. Exhaustive empirical evaluations on remote sensing scene benchmarks demonstrate the effectiveness and superiority of our proposed method over the state-of-the-art approaches. Full article
Show Figures

Figure 1

16 pages, 4785 KiB  
Article
Hyperspectral Inversion of Soil Cu Content in Agricultural Land Based on Continuous Wavelet Transform and Stacking Ensemble Learning
by Kai Yang, Fan Wu, Hongxu Guo, Dongbin Chen, Yirong Deng, Zaoquan Huang, Cunliang Han, Zhiliang Chen, Rongbo Xiao and Pengcheng Chen
Viewed by 207
Abstract
Heavy metal pollution in agricultural land poses significant threats to both the ecological environment and human health. Therefore, the rapid and accurate prediction of heavy metal content in agricultural soil is crucial for environmental protection and soil remediation. Acknowledging the limitations of traditional [...] Read more.
Heavy metal pollution in agricultural land poses significant threats to both the ecological environment and human health. Therefore, the rapid and accurate prediction of heavy metal content in agricultural soil is crucial for environmental protection and soil remediation. Acknowledging the limitations of traditional single linear or nonlinear machine learning models in terms of prediction accuracy, this study developed an ensemble learning model that integrates multiple linear or nonlinear learning models with a random forest (RF) model to improve both the prediction accuracy and reliability. In this study, we selected a typical copper (Cu) polluted area in the Pearl River Delta of Guangdong Province as the research site and collected Cu content data and indoor soil reflectance spectral data from 269 surface soil samples. First, the soil spectral data were preprocessed using Savitzky–Golay (SG) smoothing, multiplicative scattering correction (MSC), and continuous wavelet transform (CWT) to reduce noise interference. Next, principal components analysis (PCA) was employed to reduce the dimensionality of the preprocessed spectral data, eliminating redundant features and lowering the computational complexity. Finally, based on the dimensionality-reduced data and Cu content, we established a stacked ensemble learning model, where the base models included SVR, PLSR, BPNN, and XGBoost, with RF serving as the meta-model to estimate the soil heavy metal content. To evaluate the performance of the stacking model, we compared its prediction accuracy with that of individual models. The results indicate that, compared to the traditional machine learning models, the prediction accuracy of the stacking model was superior (R2 = 0.77; RMSE = 7.65 mg/kg; RPD = 2.29). This suggests that the integrated algorithm demonstrates a greater robustness and generalization capability. This study presents a method to improve soil heavy metal content estimation using hyperspectral technology, ensuring a robust model that supports policymakers in making informed decisions about land use, agriculture, and environmental protection. Full article
Show Figures

Figure 1

32 pages, 2625 KiB  
Article
Ensemble-Based Machine Learning Algorithm for Loan Default Risk Prediction
by Abisola Akinjole, Olamilekan Shobayo, Jumoke Popoola, Obinna Okoyeigbo and Bayode Ogunleye
Mathematics 2024, 12(21), 3423; https://doi.org/10.3390/math12213423 - 31 Oct 2024
Viewed by 267
Abstract
Predicting credit default risk is important to financial institutions, as accurately predicting the likelihood of a borrower defaulting on their loans will help to reduce financial losses, thereby maintaining profitability and stability. Although machine learning models have been used in assessing large applications [...] Read more.
Predicting credit default risk is important to financial institutions, as accurately predicting the likelihood of a borrower defaulting on their loans will help to reduce financial losses, thereby maintaining profitability and stability. Although machine learning models have been used in assessing large applications with complex attributes for these predictions, there is still a need to identify the most effective techniques for the model development process, including the technique to address the issue of data imbalance. In this research, we conducted a comparative analysis of random forest, decision tree, SVMs (Support Vector Machines), XGBoost (Extreme Gradient Boosting), ADABoost (Adaptive Boosting) and the multi-layered perceptron, to predict credit defaults using loan data from LendingClub. Additionally, XGBoost was used as a framework for testing and evaluating various techniques. Moreover, we applied this XGBoost framework to handle the issue of class imbalance observed, by testing various resampling methods such as Random Over-Sampling (ROS), the Synthetic Minority Over-Sampling Technique (SMOTE), Adaptive Synthetic Sampling (ADASYN), Random Under-Sampling (RUS), and hybrid approaches like the SMOTE with Tomek Links and the SMOTE with Edited Nearest Neighbours (SMOTE + ENNs). The results showed that balanced datasets significantly outperformed the imbalanced dataset, with the SMOTE + ENNs delivering the best overall performance, achieving an accuracy of 90.49%, a precision of 94.61% and a recall of 92.02%. Furthermore, ensemble methods such as voting and stacking were employed to enhance performance further. Our proposed model achieved an accuracy of 93.7%, a precision of 95.6% and a recall of 95.5%, which shows the potential of ensemble methods in improving credit default predictions and can provide lending platforms with the tool to reduce default rates and financial losses. In conclusion, the findings from this study have broader implications for financial institutions, offering a robust approach to risk assessment beyond the LendingClub dataset. Full article
(This article belongs to the Special Issue Data-Driven Approaches in Revenue Management and Pricing Analytics)
21 pages, 1133 KiB  
Article
A Stacking Ensemble Based on Lexicon and Machine Learning Methods for the Sentiment Analysis of Tweets
by Sharaf J. Malebary and Anas W. Abulfaraj
Mathematics 2024, 12(21), 3405; https://doi.org/10.3390/math12213405 - 31 Oct 2024
Viewed by 318
Abstract
Sentiment is employed in various fields, such as collecting web-based opinions for the formulation of governmental policies, measuring employee and customer satisfaction levels in business organizations, and measuring the sentiment of the public in political and security matters. The field has recently faced [...] Read more.
Sentiment is employed in various fields, such as collecting web-based opinions for the formulation of governmental policies, measuring employee and customer satisfaction levels in business organizations, and measuring the sentiment of the public in political and security matters. The field has recently faced new challenges since algorithms must operate with highly unstructured sentiment data from social media. In this study, the authors present a new stacking ensemble method that combines the lexicon-based approach with machine learning algorithms to improve the sentiment analysis of tweets. Due to the complexity of the text with very ill-defined syntactic and grammatical patterns, using lexicon-based techniques to extract sentiment from the content is proposed. On the same note, the contextual and nuanced aspects of sentiment are inferred through machine learning algorithms. A sophisticated bat algorithm that uses an Elman network as a meta-classifier is then employed to classify the extracted features accurately. Substantial evidence from three datasets that are readily available for public analysis re-affirms the improvements this innovative approach brings to sentiment classification. Full article
(This article belongs to the Special Issue Application of Artificial Intelligence in Decision Making)
Show Figures

Figure 1

15 pages, 3375 KiB  
Article
Non-Destructive Prediction of Rosmarinic Acid Content in Basil Plants Using a Portable Hyperspectral Imaging System and Ensemble Learning Algorithms
by Hyo In Yoon, Dahye Ryu, Jai-Eok Park, Ho-Youn Kim, Soo Hyun Park and Jung-Seok Yang
Horticulturae 2024, 10(11), 1156; https://doi.org/10.3390/horticulturae10111156 - 31 Oct 2024
Viewed by 278
Abstract
Rosmarinic acid (RA) is a phenolic antioxidant naturally occurring in the plants of the Lamiaceae family, including basil (Ocimum basilicum L.). Existing analytical methods for determining the RA content in leaves are time-consuming and destructive, posing limitations on quality assessment and control [...] Read more.
Rosmarinic acid (RA) is a phenolic antioxidant naturally occurring in the plants of the Lamiaceae family, including basil (Ocimum basilicum L.). Existing analytical methods for determining the RA content in leaves are time-consuming and destructive, posing limitations on quality assessment and control during cultivation. In this study, we aimed to develop non-destructive prediction models for the RA content in basil plants using a portable hyperspectral imaging (HSI) system and machine learning algorithms. The basil plants were grown in a vertical farm module with controlled environments, and the HSI of the whole plant was captured using a portable HSI camera in the range of 400–850 nm. The average spectra were extracted from the segmented regions of the plants. We employed several spectral data pre-processing methods and ensemble learning algorithms, such as Random Forest, AdaBoost, XGBoost, and LightGBM, to develop the RA prediction model and feature selection based on feature importance. The best RA prediction model was the LightGBM model with feature selection by the AdaBoost algorithm and spectral pre-processing through logarithmic transformation and second derivative. This model performed satisfactorily for practical screening with R2P = 0.81 and RMSEP = 3.92. From in-field HSI data, the developed model successfully estimated and visualized the RA distribution in basil plants growing in the greenhouse. Our findings demonstrate the potential use of a portable HSI system for monitoring and controlling pharmaceutical quality in medicinal plants during cultivation. This non-destructive and rapid method can provide a valuable tool for assessing the quality of RA in basil plants, thereby enhancing the efficiency and accuracy of quality control during the cultivation stage. Full article
Show Figures

Figure 1

18 pages, 1099 KiB  
Article
Enhanced Prediction and Evaluation of Hydraulic Concrete Compressive Strength Using Multiple Soft Computing and Metaheuristic Optimization Algorithms
by Tianyu Li, Xiamin Hu, Tao Li, Jie Liao, Lidan Mei, Huiwen Tian and Jinlong Gu
Buildings 2024, 14(11), 3461; https://doi.org/10.3390/buildings14113461 - 30 Oct 2024
Viewed by 270
Abstract
Concrete is the material of choice for constructing hydraulic structures in water-related buildings, and its mechanical properties are crucial for evaluating the structural damage state. Machine learning models have proven effective in predicting these properties. However, a single machine learning model often suffers [...] Read more.
Concrete is the material of choice for constructing hydraulic structures in water-related buildings, and its mechanical properties are crucial for evaluating the structural damage state. Machine learning models have proven effective in predicting these properties. However, a single machine learning model often suffers from overfitting and low prediction accuracy. To address this issue, this study introduces a novel hybrid method for predicting concrete compressive strength by integrating multiple soft computing algorithms and the stacking ensemble learning strategy. In the initial stage, several classic machine learning models are selected as base models, and the optimal parameters of these models are obtained using the improved metaheuristic-based gray wolf algorithm. In the subsequent stage, the lightweight gradient boosting tree (LightGBM) model and the metaheuristic-based optimization algorithm are combined to integrate information from base models. This process identifies the primary factors affecting concrete compressive strength. The experimental results demonstrate that the hybrid ensemble learning and heuristic optimization algorithm achieve a regression coefficient of 0.9329, a mean absolute error (MAE) of 2.7695, and a mean square error (MSE) of 4.0891. These results indicate superior predictive performance compared to other advanced methods. The proposed method shows potential for application in predicting the service life and assessing the structural damage status of hydraulic concrete structures, suggesting broad prospects. Full article
(This article belongs to the Section Building Structures)
16 pages, 10190 KiB  
Article
Automated Recognition of Submerged Body-like Objects in Sonar Images Using Convolutional Neural Networks
by Yan Zun Nga, Zuhayr Rymansaib, Alfie Anthony Treloar and Alan Hunter
Remote Sens. 2024, 16(21), 4036; https://doi.org/10.3390/rs16214036 - 30 Oct 2024
Viewed by 305
Abstract
The Police Robot for Inspection and Mapping of Underwater Evidence (PRIME) is an uncrewed surface vehicle (USV) currently being developed for underwater search and recovery teams to assist in crime scene investigation. The USV maps underwater scenes using sidescan sonar (SSS). Test exercises [...] Read more.
The Police Robot for Inspection and Mapping of Underwater Evidence (PRIME) is an uncrewed surface vehicle (USV) currently being developed for underwater search and recovery teams to assist in crime scene investigation. The USV maps underwater scenes using sidescan sonar (SSS). Test exercises use a clothed mannequin lying on the seafloor as a target object to evaluate system performance. A robust, automated method for detecting human body-shaped objects is required to maximise operational functionality. The use of a convolutional neural network (CNN) for automatic target recognition (ATR) is proposed. SSS image data acquired from four different locations during previous missions were used to build a dataset consisting of two classes, i.e., a binary classification problem. The target object class consisted of 166 196 × 196 pixel image snippets of the underwater mannequin, whereas the non-target class consisted of 13,054 examples. Due to the large class imbalance in the dataset, CNN models were trained with six different imbalance ratios. Two different pre-trained models (ResNet-50 and Xception) were compared, and trained via transfer learning. This paper presents results from the CNNs and details the training methods used. Larger datasets are shown to improve CNN performance despite class imbalance, achieving average F1 scores of 97% in image classification. Average F1 scores for target vs background classification with unseen data are only 47% but the end result is enhanced by combining multiple weak classification results in an ensemble average. The combined output, represented as a georeferenced heatmap, accurately indicates the target object location with a high detection confidence and one false positive of low confidence. The CNN approach shows improved object detection performance when compared to the currently used ATR method. Full article
(This article belongs to the Special Issue AI-Driven Mapping Using Remote Sensing Data)
Show Figures

Figure 1

31 pages, 2978 KiB  
Article
QSAR Regression Models for Predicting HMG-CoA Reductase Inhibition
by Robert Ancuceanu, Patriciu Constantin Popovici, Doina Drăgănescu, Ștefan Busnatu, Beatrice Elena Lascu and Mihaela Dinu
Pharmaceuticals 2024, 17(11), 1448; https://doi.org/10.3390/ph17111448 - 30 Oct 2024
Viewed by 231
Abstract
Background/Objectives: HMG-CoA reductase is an enzyme that regulates the initial stage of cholesterol synthesis, and its inhibitors are widely used in the treatment of cardiovascular diseases. Methods: We have created a set of quantitative structure-activity relationship (QSAR) models for human HMG-CoA reductase inhibitors [...] Read more.
Background/Objectives: HMG-CoA reductase is an enzyme that regulates the initial stage of cholesterol synthesis, and its inhibitors are widely used in the treatment of cardiovascular diseases. Methods: We have created a set of quantitative structure-activity relationship (QSAR) models for human HMG-CoA reductase inhibitors using nested cross-validation as the primary validation method. To develop the QSAR models, we employed various machine learning regression algorithms, feature selection methods, and fingerprints or descriptor datasets. Results: We built and evaluated a total of 300 models, selecting 21 that demonstrated good performance (coefficient of determination, R2 ≥ 0.70 or concordance correlation coefficient, CCC ≥ 0.85). Six of these top-performing models met both performance criteria and were used to construct five ensemble models. We identified the descriptors most important in explaining HMG-CoA inhibition for each of the six best-performing models. We used the top models to search through over 220,000 chemical compounds from a large database (ZINC 15) for potential new inhibitors. Only a small fraction (237 out of approximately 220,000 compounds) had reliable predictions with mean pIC50 values ≥ 8 (IC50 values ≤ 10 nM). Our svm-based ensemble model predicted IC50 values < 10 nM for roughly 0.08% of the screened compounds. We have also illustrated the potential applications of these QSAR models in understanding the cholesterol-lowering activities of herbal extracts, such as those reported for an extract prepared from the Iris × germanica rhizome. Conclusions: Our QSAR models can accurately predict human HMG-CoA reductase inhibitors, having the potential to accelerate the discovery of novel cholesterol-lowering agents and may also be applied to understand the mechanisms underlying the reported cholesterol-lowering activities of herbal extracts. Full article
(This article belongs to the Section Pharmacology)
Show Figures

Figure 1

21 pages, 1556 KiB  
Article
Deep Learning for Opportunistic Rain Estimation via Satellite Microwave Links
by Giovanni Scognamiglio, Andrea Rucci, Attilio Vaccaro, Elisa Adirosi, Fabiola Sapienza, Filippo Giannetti, Giacomo Bacci, Sabina Angeloni, Luca Baldini, Giacomo Roversi, Alberto Ortolani, Andrea Antonini and Samantha Melani
Sensors 2024, 24(21), 6944; https://doi.org/10.3390/s24216944 - 29 Oct 2024
Viewed by 291
Abstract
Accurate precipitation measurement is critical for managing flood and drought risks. Traditional meteorological tools, such as rain gauges and remote sensors, have limitations in resolution, coverage, and cost-effectiveness. Recently, the opportunistic use of microwave communication signals has been explored to improve precipitation estimation. [...] Read more.
Accurate precipitation measurement is critical for managing flood and drought risks. Traditional meteorological tools, such as rain gauges and remote sensors, have limitations in resolution, coverage, and cost-effectiveness. Recently, the opportunistic use of microwave communication signals has been explored to improve precipitation estimation. While there is growing interest in using satellite-to-earth microwave links (SMLs) for machine learning-based precipitation estimation, direct rainfall estimation from raw signal-to-noise ratio (SNR) data via deep learning remains underexplored. This study investigates a range of machine learning (ML) approaches, including deep learning (DL) models and traditional methods like gradient boosting machine (GBM), for estimating rainfall rates from SNR data collected by interactive satellite receivers. We develop real-time models for rainfall detection and estimation using downlink SNR signals from satellites to user terminals. By leveraging a year-long dataset from multiple locations—including SNR measurements paired with disdrometer and rain-gauge data—we explore and evaluate various ML models. Our final models include ensemble approaches for both rainfall detection and cumulative rainfall estimation. The proposed models provide a reliable solution for estimating precipitation using Earth–satellite microwave links, potentially improving precipitation monitoring. Compared to the state-of-the-art power-law-based models applied to similar datasets reported in the literature, our ML models achieve a 46% reduction in the root mean squared error (RMSE) for event-based cumulative precipitation predictions. Full article
(This article belongs to the Special Issue Remote Sensing in Atmospheric Measurements)
Show Figures

Figure 1

18 pages, 3041 KiB  
Article
A Deep Learning PM2.5 Hybrid Prediction Model Based on Clustering–Secondary Decomposition Strategy
by Tao Zeng, Ruru Liu, Yahui Liu, Jinli Shi, Tao Luo, Yunyun Xi, Shuo Zhao, Chunpeng Chen, Guangrui Pan, Yuming Zhou and Liping Xu
Electronics 2024, 13(21), 4242; https://doi.org/10.3390/electronics13214242 - 29 Oct 2024
Viewed by 327
Abstract
Accurate prediction of PM2.5 concentration is important for pollution control, public health, and ecological protection. However, due to the nonlinear nature of PM2.5 data, the accuracy of existing methods suffers and performs poorly in both short-term and long-term predictions. In this [...] Read more.
Accurate prediction of PM2.5 concentration is important for pollution control, public health, and ecological protection. However, due to the nonlinear nature of PM2.5 data, the accuracy of existing methods suffers and performs poorly in both short-term and long-term predictions. In this study, a deep learning hybrid prediction model based on clustering and quadratic decomposition is proposed. The model utilizes the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) to decompose the PM2.5 sequences into multiple intrinsic modal function components (IMFs), and clusters and re-fuses the subsequences with similar complexity by permutation entropy (PE) and K-means clustering. For the fused high-frequency sequences, a secondary decomposition is performed using the whale optimization algorithm (WOA) optimized variational modal decomposition (VMD). Finally, the nonlinear and temporal features are captured for prediction using the long- and short-term memory neural network (LSTM). Experiments show that this proposed model exhibits good stability and generalization ability. It does not only make accurate predictions in the short term, but also captures the trends in the long-term prediction. There is a significant performance improvement over the baseline models. Further comparisons with existing models outperform the current state-of-the-art models. Full article
Show Figures

Figure 1

Back to TopTop