Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (586)

Search Parameters:
Keywords = local descriptors

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
27 pages, 10246 KiB  
Article
A Novel HPNVD Descriptor for 3D Local Surface Description
by Jiming Sa, Xuecheng Zhang, Yuan Yuan, Yuyan Song, Liwei Ding and Yechen Huang
Mathematics 2025, 13(1), 92; https://doi.org/10.3390/math13010092 - 29 Dec 2024
Viewed by 229
Abstract
Existing methods for 3D local feature description often struggle to achieve a good balance between distinctiveness, robustness, and computational efficiency. To address this challenge, a novel 3D local feature descriptor named Histograms of Projected Normal Vector Distribution (HPNVD) is proposed. The HPNVD descriptor [...] Read more.
Existing methods for 3D local feature description often struggle to achieve a good balance between distinctiveness, robustness, and computational efficiency. To address this challenge, a novel 3D local feature descriptor named Histograms of Projected Normal Vector Distribution (HPNVD) is proposed. The HPNVD descriptor consists of two main components. First, a local reference frame (LRF) is constructed based on the covariance matrix and neighborhood projection to achieve invariance to rigid transformations. Then, the local surface normals are projected onto three coordinate planes within the LRF, which allows for effective encoding of the local shape information. The projection planes are further divided into multiple regions, and a histogram is computed for each plane to generate the final HPNVD descriptor. Experimental results demonstrate that the proposed HPNVD descriptor outperforms state-of-the-art methods in terms of descriptiveness and robustness, while maintaining compact storage and computational efficiency. Moreover, the HPNVD-based point cloud registration algorithm shows excellent performance, further validating the effectiveness of the descriptor. Full article
Show Figures

Figure 1

18 pages, 2966 KiB  
Article
A Survey on the Evaluation of Monosodium Glutamate (MSG) Taste in Austria
by Emilia Iannilli, Emilise Lucerne Pötz and Thomas Hummel
Viewed by 240
Abstract
The umami taste is well validated in Asian culture but remains less recognized and accepted in European cultures despite its presence in natural local products. This study explored the sensory and emotional perceptions of umami in 233 Austrian participants who had lived in [...] Read more.
The umami taste is well validated in Asian culture but remains less recognized and accepted in European cultures despite its presence in natural local products. This study explored the sensory and emotional perceptions of umami in 233 Austrian participants who had lived in Austria for most of their lives. Using blind tasting, participants evaluated monosodium glutamate (MSG) dissolved in water, providing open-ended verbal descriptions, pleasantness ratings, and comparisons to a sodium chloride (NaCl) solution. Discrimination tests excluded MSG ageusia, and basic demographic data were collected. A text semantic-based analysis (TSA) was employed to analyze the emotional valence and descriptive content of participants’ responses. The results showed that MSG was predominantly associated with neutral sentiments across the group, including both female and male subgroups. “Sour” was the most frequent taste descriptor, while “unfamiliar” characterized the perceptual experience. Pleasantness ratings for MSG and NaCl were positively correlated, suggesting that overcoming the unfamiliarity of umami could enhance its acceptance and align it with the pleasantness of salt. These findings advance the understanding of umami sensory perception and its emotional and cultural acceptance in European contexts, offering valuable insights for integrating umami into Western dietary and sensory frameworks. Full article
(This article belongs to the Section Sensory and Consumer Sciences)
Show Figures

Figure 1

25 pages, 9865 KiB  
Article
Dual-Branch Feature Generalization Method for AUV Near-Field Exploration of Hydrothermal Areas
by Yihui Liu, Guofang Chen, Yufei Xu, Lei Wan and Ziyang Zhang
J. Mar. Sci. Eng. 2024, 12(12), 2359; https://doi.org/10.3390/jmse12122359 - 22 Dec 2024
Viewed by 291
Abstract
The simultaneous localization and mapping (SLAM) technique provides long-term near-seafloor navigation for autonomous underwater vehicles (AUVs). However, the stability of descriptors generated by interest point detectors remains a challenge in the hydrothermal environment. This paper proposes a dual-branch feature generalization method, incorporating volumetric [...] Read more.
The simultaneous localization and mapping (SLAM) technique provides long-term near-seafloor navigation for autonomous underwater vehicles (AUVs). However, the stability of descriptors generated by interest point detectors remains a challenge in the hydrothermal environment. This paper proposes a dual-branch feature generalization method, incorporating volumetric density and color distribution for enhanced robustness. The method utilizes shared descriptors and a feature confidence mechanism, combining neural radiance fields with Gaussian splatting models, ensuring fast and accurate feature generalization. The proposed approach improves recall while maintaining matching accuracy, ensuring stability and robustness in feature matching. This method achieves stable and reliable feature matching in a simulated hydrothermal environment. Full article
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

20 pages, 6119 KiB  
Article
A Laser-Based SLAM Algorithm of the Unmanned Surface Vehicle for Accurate Localization and Mapping in an Inland Waterway Scenario
by Yang Wang, Chao Liu, Jiahe Liu, Jinzhe Wang, Jianbin Liu, Kai Zheng and Rencheng Zheng
J. Mar. Sci. Eng. 2024, 12(12), 2311; https://doi.org/10.3390/jmse12122311 - 16 Dec 2024
Viewed by 490
Abstract
It is important to improve the localization accuracy of the unmanned surface vehicle (USV) for ensuring safe navigation in an inland waterway scenario. However, the localization accuracy of the USV is affected by the limited availability of global navigation satellite system signals, the [...] Read more.
It is important to improve the localization accuracy of the unmanned surface vehicle (USV) for ensuring safe navigation in an inland waterway scenario. However, the localization accuracy of the USV is affected by the limited availability of global navigation satellite system signals, the sparsity of feature points, and the high scene similarity in inland waterway scenarios. Therefore, this paper proposes a laser-based simultaneous localization and mapping (SLAM) algorithm for accurate localization and mapping in inland waterway scenarios. Inertial measurement unit (IMU) data are integrated with lidar data to address motion distortion caused by the frequent motion of the USV. Subsequently, a generalized iterative closest point (GICP) algorithm incorporating rejection sampling is integrated to enhance the accuracy of point cloud matching, involving a two-phase filtering process to select key feature points for matching. Additionally, a mixed global descriptor is constructed by combining point cloud intensity and distance information to improve the accuracy of loop closure detection. Experiments are conducted on the USV-Inland datasets to evaluate the performance of the proposed algorithm. The experimental results show that the proposed algorithm generates accurate mapping and significantly improves localization accuracy by 25.6%, 18.5%, and 23.6% compared to A-LOAM, LeGO-LOAM, and ISC-LOAM, respectively. These results demonstrate that the proposed algorithm achieves accurate localization and mapping in an inland waterway scenario. Full article
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

21 pages, 17557 KiB  
Article
Lidar Simultaneous Localization and Mapping Algorithm for Dynamic Scenes
by Peng Ji, Qingsong Xu and Yifan Zhao
World Electr. Veh. J. 2024, 15(12), 567; https://doi.org/10.3390/wevj15120567 - 7 Dec 2024
Viewed by 855
Abstract
To address the issue of significant point cloud ghosting in the construction of high-precision point cloud maps by low-speed intelligent mobile vehicles due to the presence of numerous dynamic obstacles in the environment, which affects the accuracy of map construction, this paper proposes [...] Read more.
To address the issue of significant point cloud ghosting in the construction of high-precision point cloud maps by low-speed intelligent mobile vehicles due to the presence of numerous dynamic obstacles in the environment, which affects the accuracy of map construction, this paper proposes a LiDAR-based Simultaneous Localization and Mapping (SLAM) algorithm tailored for dynamic scenes. The algorithm employs a tightly coupled SLAM framework integrating LiDAR and inertial measurement unit (IMU). In the process of dynamic obstacle removal, the point cloud data is first gridded. To more comprehensively represent the point cloud information, the point cloud within the perception area is linearly discretized by height to obtain the distribution of the point cloud at different height layers, which is then encoded to construct a linear discretized height descriptor for dynamic region extraction. To preserve more static feature points without altering the original point cloud, the Random Sample Consensus (RANSAC) ground fitting algorithm is employed to fit and segment the ground point cloud within the dynamic regions, followed by the removal of dynamic obstacles. Finally, accurate point cloud poses are obtained through static feature matching. The proposed algorithm has been validated using open-source datasets and self-collected campus datasets. The results demonstrate that the algorithm improves dynamic point cloud removal accuracy by 12.3% compared to the ERASOR algorithm and enhances overall mapping and localization accuracy by 8.3% compared to the LIO-SAM algorithm, thereby providing a reliable environmental description for intelligent mobile vehicles. Full article
Show Figures

Figure 1

21 pages, 14612 KiB  
Article
Corrupted Point Cloud Classification Through Deep Learning with Local Feature Descriptor
by Xian Wu, Xueyi Guo, Hang Peng, Bin Su, Sabbir Ahamod and Fenglin Han
Sensors 2024, 24(23), 7749; https://doi.org/10.3390/s24237749 - 4 Dec 2024
Viewed by 471
Abstract
Three-dimensional point cloud recognition is a very fundamental work in fields such as autonomous driving and face recognition. However, in real industrial scenarios, input point cloud data are often accompanied by factors such as occlusion, rotation, and noise. These factors make it challenging [...] Read more.
Three-dimensional point cloud recognition is a very fundamental work in fields such as autonomous driving and face recognition. However, in real industrial scenarios, input point cloud data are often accompanied by factors such as occlusion, rotation, and noise. These factors make it challenging to apply existing point cloud classification algorithms in real industrial scenarios. Currently, most studies enhance model robustness from the perspective of neural network structure. However, researchers have found that simply adjusting the neural network structure has proven insufficient in addressing the decline in accuracy caused by data corruption. In this article, we use local feature descriptors as a preprocessing method to extract features from point cloud data and propose a new neural network architecture aligned with these local features, effectively enhancing performance even in extreme cases of data corruption. In addition, we conducted data augmentation to the 10 intentionally selected categories in ModelNet40. Finally, we conducted multiple experiments, including testing the robustness of the model to occlusion and coordinate transformation and then comparing the model with existing SOTA models. Furthermore, in actual scene experiments, we used depth cameras to capture objects and input the obtained data into the established model. The experimental results show that our model outperforms existing popular algorithms when dealing with corrupted point cloud data. Even when the input point cloud data are affected by occlusion or coordinate transformation, our proposed model can maintain high accuracy. This suggests that our method can alleviate the problem of decreased model accuracy caused by the aforementioned factors. Full article
Show Figures

Figure 1

17 pages, 4865 KiB  
Article
Morpho-Phenological, Chemical, and Genetic Characterization of Italian Maize Landraces from the Lazio Region
by Rita Redaelli, Laura Bassolino, Carlotta Balconi, Irma Terracciano, Alessio Torri, Federica Nicoletti, Gianluca Benedetti, Valentina Iacoponi, Roberto Rea and Paola Taviani
Plants 2024, 13(22), 3249; https://doi.org/10.3390/plants13223249 - 20 Nov 2024
Viewed by 713
Abstract
In the framework of a Collaboration Agreement between CREA and ARSIAL, a morpho-phenological, chemical, and genetic characterization of maize populations native to the Lazio region was carried out. During 2022 and 2023, a set of 50 accessions, belonging both to ARSIAL and CREA [...] Read more.
In the framework of a Collaboration Agreement between CREA and ARSIAL, a morpho-phenological, chemical, and genetic characterization of maize populations native to the Lazio region was carried out. During 2022 and 2023, a set of 50 accessions, belonging both to ARSIAL and CREA maize collections, were multiplied in Bergamo. Morpho-phenological descriptors were recorded in the field: plant height, ear height, and male and female flowering time. The grain chemical composition in terms of protein, lipid, starch, ash and fiber was evaluated by near-infrared spectroscopy (NIRS). A double-digest restriction-site-associated DNA sequencing (ddRADseq) strategy was used to genotype the landraces. The two collections were not significantly different in terms of grain chemical composition. On the other hand, the ARSIAL and CREA germplasm showed a different distribution in the three cluster-based population structure obtained by ddRADseq, which largely corresponded to the distribution map of their collection sites. The materials from the Lazio region maintained by ARSIAL and CREA were revealed to be different. The comparison between the two groups of landraces showed the importance of characterizing germplasm collections to promote the recovery and valorization of local biodiversity. Full article
(This article belongs to the Section Crop Physiology and Crop Production)
Show Figures

Figure 1

17 pages, 4207 KiB  
Article
Deep Multi-Similarity Hashing with Spatial-Enhanced Learning for Remote Sensing Image Retrieval
by Huihui Zhang, Qibing Qin, Meiling Ge and Jianyong Huang
Electronics 2024, 13(22), 4520; https://doi.org/10.3390/electronics13224520 - 18 Nov 2024
Viewed by 643
Abstract
Remote sensing image retrieval (RSIR) plays a crucial role in remote sensing applications, focusing on retrieving a collection of items that closely match a specified query image. Due to the advantages of low storage cost and fast search speed, deep hashing has been [...] Read more.
Remote sensing image retrieval (RSIR) plays a crucial role in remote sensing applications, focusing on retrieving a collection of items that closely match a specified query image. Due to the advantages of low storage cost and fast search speed, deep hashing has been one of the most active research problems in remote sensing image retrieval. However, remote sensing images contain many content-irrelevant backgrounds or noises, and they often lack the ability to capture essential fine-grained features. In addition, existing hash learning often relies on random sampling or semi-hard negative mining strategies to form training batches, which could be overwhelmed by some redundant pairs that slow down the model convergence and compromise the retrieval performance. To solve these problems effectively, a novel Deep Multi-similarity Hashing with Spatial-enhanced Learning, termed DMsH-SL, is proposed to learn compact yet discriminative binary descriptors for remote sensing image retrieval. Specifically, to suppress interfering information and accurately localize the target location, by introducing a spatial enhancement learning mechanism, the spatial group-enhanced hierarchical network is firstly designed to learn the spatial distribution of different semantic sub-features, capturing the noise-robust semantic embedding representation. Furthermore, to fully explore the similarity relationships of data points in the embedding space, the multi-similarity loss is proposed to construct informative and representative training batches, which is based on pairwise mining and weighting to compute the self-similarity and relative similarity of the image pairs, effectively mitigating the effects of redundant and unbalanced pairs. Experimental results on three benchmark datasets validate the superior performance of our approach. Full article
Show Figures

Figure 1

24 pages, 14942 KiB  
Article
The Ground-Penetrating Radar Image Matching Method Based on Central Dense Structure Context Features
by Jie Xu, Qifeng Lai, Dongyan Wei, Xinchun Ji, Ge Shen and Hong Yuan
Remote Sens. 2024, 16(22), 4291; https://doi.org/10.3390/rs16224291 - 18 Nov 2024
Viewed by 670
Abstract
Subsurface structural distribution can be detected using Ground-Penetrating Radar (GPR). The distribution can be considered as road fingerprints for vehicle positioning. Similar to the principle of visual image matching for localization, the position coordinates of the vehicle can be calculated by matching real-time [...] Read more.
Subsurface structural distribution can be detected using Ground-Penetrating Radar (GPR). The distribution can be considered as road fingerprints for vehicle positioning. Similar to the principle of visual image matching for localization, the position coordinates of the vehicle can be calculated by matching real-time GPR images with pre-constructed reference GPR images. However, GPR images, due to their low resolution, cannot extract well-defined geometric features such as corners and lines. Thus, traditional visual image processing algorithms perform inadequately when applied to GPR image matching. To address this issue, this paper innovatively proposes a GPR image matching and localization method based on a novel feature descriptor, termed as central dense structure context (CDSC) features. The algorithm utilizes the strip-like elements in GPR images to improve the accuracy of GPR image matching. First, a CDSC feature descriptor is designed. By applying threshold segmentation and extremum point extraction to the GPR image, stratified strip-like elements and pseudo-corner points are obtained. The pseudo-corner points are treated as the centers, and the surrounding strip-like elements are described in context to form the GPR feature descriptors. Then, based on the feature description method, feature descriptors for both the real-time image and the reference image are calculated separately. By searching for the nearest matching point pairs and removing erroneous pairs, GPR image matching and localization are achieved. The proposed algorithm was evaluated on datasets collected from urban roads and railway tracks, achieving localization errors of 0.06 m (RMSE) and 1.22 m (RMSE), respectively. Compared to the traditional Speeded Up Robust Features (SURF) visual image matching algorithm, localization errors were reduced by 86.6% and 95.7% in urban road and railway track scenarios, respectively. Full article
(This article belongs to the Special Issue Advanced Ground-Penetrating Radar (GPR) Technologies and Applications)
Show Figures

Graphical abstract

18 pages, 2990 KiB  
Article
A GGCM-E Based Semantic Filter and Its Application in VSLAM Systems
by Yuanjie Li, Chunyan Shao and Jiaming Wang
Electronics 2024, 13(22), 4487; https://doi.org/10.3390/electronics13224487 - 15 Nov 2024
Viewed by 424
Abstract
Image matching-based visual simultaneous localization and mapping (vSLAM) extracts low-level pixel features to reconstruct camera trajectories and maps through the epipolar geometry method. However, it fails to achieve correct trajectories and mapping when there are low-quality feature correspondences in several challenging environments. Although [...] Read more.
Image matching-based visual simultaneous localization and mapping (vSLAM) extracts low-level pixel features to reconstruct camera trajectories and maps through the epipolar geometry method. However, it fails to achieve correct trajectories and mapping when there are low-quality feature correspondences in several challenging environments. Although the RANSAC-based framework can enable better results, it is computationally inefficient and unstable in the presence of a large number of outliers. A Faster R-CNN learning-based semantic filter is proposed to explore the semantic information of inliers to remove low-quality correspondences, helping vSLAM localize accurately in our previous work. However, the semantic filter learning method generalizes low precision for low-level and dense texture-rich scenes, leading the semantic filter-based vSLAM to be unstable and have poor geometry estimation. In this paper, a GGCM-E-based semantic filter using YOLOv8 is proposed to address these problems. Firstly, the semantic patches of images are collected from the KITTI dataset, the TUM dataset provided by the Technical University of Munich, and real outdoor scenes. Secondly, the semantic patches are classified by our proposed GGCM-E descriptors to obtain the YOLOv8 neural network training dataset. Finally, several semantic filters for filtering low-level and dense texture-rich scenes are generated and combined into the ORB-SLAM3 system. Extensive experiments show that the semantic filter can detect and classify semantic levels of different scenes effectively, filtering low-level semantic scenes to improve the quality of correspondences, thus achieving accurate and robust trajectory reconstruction and mapping. For the challenging autonomous driving benchmark and real environments, the vSLAM system with respect to the GGCM-E-based semantic filter demonstrates its superiority regarding reducing the 3D position error, such that the absolute trajectory error is reduced by up to approximately 17.44%, showing its promise and good generalization. Full article
(This article belongs to the Special Issue Application of Artificial Intelligence in Robotics)
Show Figures

Figure 1

20 pages, 3018 KiB  
Article
Global Semantic Localization from Abstract Ellipse-Ellipsoid Model and Object-Level Instance Topology
by Heng Wu, Yanjie Liu, Chao Wang and Yanlong Wei
Remote Sens. 2024, 16(22), 4187; https://doi.org/10.3390/rs16224187 - 10 Nov 2024
Viewed by 611
Abstract
Robust and highly accurate localization using a camera is a challenging task when appearance varies significantly. In indoor environments, changes in illumination and object occlusion can have a significant impact on visual localization. In this paper, we propose a visual localization method based [...] Read more.
Robust and highly accurate localization using a camera is a challenging task when appearance varies significantly. In indoor environments, changes in illumination and object occlusion can have a significant impact on visual localization. In this paper, we propose a visual localization method based on an ellipse-ellipsoid model, combined with object-level instance topology and alignment. First, we develop a CNN-based (Convolutional Neural Network) ellipse prediction network, DEllipse-Net, which integrates depth information with RGB data to estimate the projection of ellipsoids onto images. Second, we model environments using 3D (Three-dimensional) ellipsoids, instance topology, and ellipsoid descriptors. Finally, the detected ellipses are aligned with the ellipsoids in the environment through semantic object association, and 6-DoF (Degree of Freedom) pose estimation is performed using the ellipse-ellipsoid model. In the bounding box noise experiment, DEllipse-Net demonstrates higher robustness compared to other methods, achieving the highest prediction accuracy for 11 out of 23 objects in ellipse prediction. In the localization test with 15 pixels of noise, we achieve ATE (Absolute Translation Error) and ARE (Absolute Rotation Error) of 0.077 m and 2.70 in the fr2_desk sequence. Additionally, DEllipse-Net is lightweight and highly portable, with a model size of only 18.6 MB, and a single model can handle all objects. In the object-level instance topology and alignment experiment, our topology and alignment methods significantly enhance the global localization accuracy of the ellipse-ellipsoid model. In experiments involving lighting changes and occlusions, our method achieves more robust global localization compared to the classical bag-of-words based localization method and other ellipse-ellipsoid localization methods. Full article
Show Figures

Figure 1

15 pages, 397 KiB  
Article
Local Directional Difference and Relational Descriptor for Texture Classification
by Weidan Yan and Yongsheng Dong
Mathematics 2024, 12(21), 3432; https://doi.org/10.3390/math12213432 - 1 Nov 2024
Viewed by 730
Abstract
The local binary pattern (LBP) has been widely used for extracting texture features. However, the LBP and most of its variants tend to focus on pixel units within small neighborhoods, neglecting differences in direction and relationships among different directions. To alleviate this issue, [...] Read more.
The local binary pattern (LBP) has been widely used for extracting texture features. However, the LBP and most of its variants tend to focus on pixel units within small neighborhoods, neglecting differences in direction and relationships among different directions. To alleviate this issue, in this paper, we propose a novel local directional difference and relational descriptor (LDDRD) for texture classification. Our proposed LDDRD utilizes information from multiple pixels along the radial direction. Specifically, a directional difference pattern (DDP) is first extracted by performing binary encoding on the differences between the central pixel and multiple neighboring pixels along the radial direction. Furthermore, by taking the central pixel as a reference, we extract the directional relation pattern (DRP) by comparing binary encodings representing different directions. Finally, we fuse the above DDP and DRP to form the LDDRD feature vector. Experimental results on six texture datasets reveal that our proposed LDDRD is effective and outperforms eight representative methods. Full article
(This article belongs to the Special Issue Advances in Computer Vision and Machine Learning, 2nd Edition)
Show Figures

Figure 1

21 pages, 57861 KiB  
Article
Automatic Apple Detection and Counting with AD-YOLO and MR-SORT
by Xueliang Yang, Yapeng Gao, Mengyu Yin and Haifang Li
Sensors 2024, 24(21), 7012; https://doi.org/10.3390/s24217012 - 31 Oct 2024
Viewed by 928
Abstract
In the production management of agriculture, accurate fruit counting plays a vital role in the orchard yield estimation and appropriate production decisions. Although recent tracking-by-detection algorithms have emerged as a promising fruit-counting method, they still cannot completely avoid fruit occlusion and light variations [...] Read more.
In the production management of agriculture, accurate fruit counting plays a vital role in the orchard yield estimation and appropriate production decisions. Although recent tracking-by-detection algorithms have emerged as a promising fruit-counting method, they still cannot completely avoid fruit occlusion and light variations in complex orchard environments, and it is difficult to realize automatic and accurate apple counting. In this paper, a video-based multiple-object tracking method, MR-SORT (Multiple Rematching SORT), is proposed based on the improved YOLOv8 and BoT-SORT. First, we propose the AD-YOLO model, which aims to reduce the number of incorrect detections during object tracking. In the YOLOv8s backbone network, an Omni-dimensional Dynamic Convolution (ODConv) module is used to extract local feature information and enhance the model’s ability better; a Global Attention Mechanism (GAM) is introduced to improve the detection ability of a foreground object (apple) in the whole image; a Soft Spatial Pyramid Pooling Layer (SSPPL) is designed to reduce the feature information dispersion and increase the sensory field of the network. Then, the improved BoT-SORT algorithm is proposed by fusing the verification mechanism, SURF feature descriptors, and the Vector of Local Aggregate Descriptors (VLAD) algorithm, which can match apples more accurately in adjacent video frames and reduce the probability of ID switching in the tracking process. The results show that the mAP metrics of the proposed AD-YOLO model are 3.1% higher than those of the YOLOv8 model, reaching 96.4%. The improved tracking algorithm has 297 fewer ID switches, which is 35.6% less than the original algorithm. The multiple-object tracking accuracy of the improved algorithm reached 85.6%, and the average counting error was reduced to 0.07. The coefficient of determination R2 between the ground truth and the predicted value reached 0.98. The above metrics show that our method can give more accurate counting results for apples and even other types of fruit. Full article
Show Figures

Figure 1

26 pages, 24227 KiB  
Article
A Base-Map-Guided Global Localization Solution for Heterogeneous Robots Using a Co-View Context Descriptor
by Xuzhe Duan, Meng Wu, Chao Xiong, Qingwu Hu and Pengcheng Zhao
Remote Sens. 2024, 16(21), 4027; https://doi.org/10.3390/rs16214027 - 30 Oct 2024
Viewed by 874
Abstract
With the continuous advancement of autonomous driving technology, an increasing number of high-definition (HD) maps have been generated and stored in geospatial databases. These HD maps can provide strong localization support for mobile robots equipped with light detection and ranging (LiDAR) sensors. However, [...] Read more.
With the continuous advancement of autonomous driving technology, an increasing number of high-definition (HD) maps have been generated and stored in geospatial databases. These HD maps can provide strong localization support for mobile robots equipped with light detection and ranging (LiDAR) sensors. However, the global localization of heterogeneous robots under complex environments remains challenging. Most of the existing point cloud global localization methods perform poorly due to the different perspective views of heterogeneous robots. Leveraging existing HD maps, this paper proposes a base-map-guided heterogeneous robots localization solution. A novel co-view context descriptor with rotational invariance is developed to represent the characteristics of heterogeneous point clouds in a unified manner. The pre-set base map is divided into virtual scans, each of which generates a candidate co-view context descriptor. These descriptors are assigned to robots before operations. By matching the query co-view context descriptors of a working robot with the assigned candidate descriptors, the coarse localization is achieved. Finally, the refined localization is done through point cloud registration. The proposed solution can be applied to both single-robot and multi-robot global localization scenarios, especially when communication is impaired. The heterogeneous datasets used for the experiments cover both indoor and outdoor scenarios, utilizing various scanning modes. The average rotation and translation errors are within 1° and 0.30 m, indicating the proposed solution can provide reliable localization support despite communication failures, even across heterogeneous robots. Full article
Show Figures

Figure 1

17 pages, 2991 KiB  
Article
Feature Extraction and Identification of Rheumatoid Nodules Using Advanced Image Processing Techniques
by Azmath Mubeen and Uma N. Dulhare
Rheumato 2024, 4(4), 176-192; https://doi.org/10.3390/rheumato4040014 - 24 Oct 2024
Viewed by 582
Abstract
Background/Objectives: Accurate detection and classification of nodules in medical images, particularly rheumatoid nodules, are critical due to the varying nature of these nodules, where their specific type is often unknown before analysis. This study addresses the challenges of multi-class prediction in nodule detection, [...] Read more.
Background/Objectives: Accurate detection and classification of nodules in medical images, particularly rheumatoid nodules, are critical due to the varying nature of these nodules, where their specific type is often unknown before analysis. This study addresses the challenges of multi-class prediction in nodule detection, with a specific focus on rheumatoid nodules, by employing a comprehensive approach to feature extraction and classification. We utilized a diverse dataset of nodules, including rheumatoid nodules sourced from the DermNet dataset and local rheumatologists. Method: This study integrates 62 features, combining traditional image characteristics with advanced graph-based features derived from a superpixel graph constructed through Delaunay triangulation. The key steps include image preprocessing with anisotropic diffusion and Retinex enhancement, superpixel segmentation using SLIC, and graph-based feature extraction. Texture analysis was performed using Gray-Level Co-occurrence Matrix (GLCM) metrics, while shape analysis was conducted with Fourier descriptors. Vascular pattern recognition, crucial for identifying rheumatoid nodules, was enhanced using the Frangi filter. A Hybrid CNN–Transformer model was employed for feature fusion, and feature selection and hyperparameter tuning were optimized using Gray Wolf Optimization (GWO) and Particle Swarm Optimization (PSO). Feature importance was assessed using SHAP values. Results: The proposed methodology achieved an accuracy of 85%, with a precision of 0.85, a recall of 0.89, and an F1 measure of 0.87, demonstrating the effectiveness of the approach in detecting and classifying rheumatoid nodules in both binary and multi-class classification scenarios. Conclusions: This study presents a robust tool for the detection and classification of nodules, particularly rheumatoid nodules, in medical imaging, offering significant potential for improving diagnostic accuracy and aiding in the early identification of rheumatoid conditions. Full article
Show Figures

Figure 1

Back to TopTop