Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (246)

Search Parameters:
Keywords = Bag-of-Words

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
19 pages, 3143 KiB  
Article
Non-Convex Metric Learning-Based Trajectory Clustering Algorithm
by Xiaoyan Lei and Hongyan Wang
Mathematics 2025, 13(3), 387; https://doi.org/10.3390/math13030387 - 24 Jan 2025
Viewed by 303
Abstract
To address the issue of suboptimal clustering performance arising from the limitations of distance measurement in traditional trajectory clustering methods, this paper presents a novel trajectory clustering strategy that integrates the bag-of-words model with non-convex metric learning. Initially, the strategy extracts motion characteristic [...] Read more.
To address the issue of suboptimal clustering performance arising from the limitations of distance measurement in traditional trajectory clustering methods, this paper presents a novel trajectory clustering strategy that integrates the bag-of-words model with non-convex metric learning. Initially, the strategy extracts motion characteristic parameters from trajectory points. Subsequently, based on the minimum description length criterion, trajectories are segmented into several homogeneous segments, and statistical properties for each segment are computed. A non-convex metric learning mechanism is then introduced to enhance similarity evaluation accuracy. Furthermore, by combining a bag-of-words model with a non-convex metric learning algorithm, segmented trajectory fragments are transformed into fixed-length feature descriptors. Finally, the K-means method and the proposed non-convex metric learning algorithm are utilized to analyze the feature descriptors, and hence, the effective clustering of trajectories can be achieved. Experimental results demonstrate that the proposed method exhibits superior clustering performance compared to the state-of-the-art trajectory clustering approaches. Full article
(This article belongs to the Section E1: Mathematics and Computer Science)
Show Figures

Figure 1

17 pages, 1865 KiB  
Article
Improving Sentiment Analysis Performance on Imbalanced Moroccan Dialect Datasets Using Resample and Feature Extraction Techniques
by Zineb Nassr, Faouzia Benabbou, Nawal Sael and Touria Hamim
Information 2025, 16(1), 39; https://doi.org/10.3390/info16010039 - 10 Jan 2025
Viewed by 554
Abstract
Sentiment analysis is a crucial component of text mining and natural language processing (NLP), involving the evaluation and classification of text data based on its emotional tone, typically categorized as positive, negative, or neutral. While significant research has focused on structured languages like [...] Read more.
Sentiment analysis is a crucial component of text mining and natural language processing (NLP), involving the evaluation and classification of text data based on its emotional tone, typically categorized as positive, negative, or neutral. While significant research has focused on structured languages like English, unstructured languages, such as the Moroccan Dialect (MD), face substantial resource limitations and linguistic challenges, making effective sentiment analysis difficult. This study addresses this gap by exploring the integration of data-balancing techniques with machine learning (ML) methods, specifically investigating the impact of resampling techniques and feature extraction methods, including Term Frequency–Inverse Document Frequency (TF-IDF), Bag of Words (BOW), and N-grams. Through rigorous experimentation, we evaluate the effectiveness of these approaches in enhancing sentiment analysis accuracy for the Moroccan dialect. Our findings demonstrate that strategic resampling, combined with the TF-IDF method, significantly improves classification accuracy and robustness. We also explore the interaction between resampling strategies and feature extraction methods, revealing varying levels of effectiveness across different combinations. Notably, the Support Vector Machine (SVM) classifier, when paired with TF-IDF representation, achieves superior performance, with an accuracy of 90.24% and a precision of 90.34%. These results highlight the importance of tailored resampling techniques, appropriate feature extraction methods, and machine learning optimization in advancing sentiment analysis for under-resourced and dialect-heavy languages like the Moroccan dialect, providing a practical framework for future research and development in NLP for unstructured languages. Full article
Show Figures

Graphical abstract

21 pages, 3119 KiB  
Article
LCA and Emergy Approach to Evaluate the Environmental Performance of Plastic Bags from Fossil and Renewable Sources with the Function of Conditioning MSW
by Matheus Tavares Lacerda, Marcelo Vitor Fiatkoski, Marcell Mariano Corrêa Maceno, Feni Dalano Roosevelt Agostinho, Michele Rigon Spier, Mariana Kleina and Marcos Augusto Mendes Marques
Sustainability 2024, 16(24), 11293; https://doi.org/10.3390/su162411293 - 23 Dec 2024
Viewed by 615
Abstract
This study aimed to compare the environmental performance of plastic bags made of three different polymers, considering two product functions: carrying goods and packing municipal solid waste. The three polymers studied were HDPE, LDPE, and thermoplastic starch (TPS). Life cycle assessment and emergy [...] Read more.
This study aimed to compare the environmental performance of plastic bags made of three different polymers, considering two product functions: carrying goods and packing municipal solid waste. The three polymers studied were HDPE, LDPE, and thermoplastic starch (TPS). Life cycle assessment and emergy accounting were used to evaluate the environmental performance of each scenario in analysis. To develop this research, eight scenarios were created to represent the customs of use and consumption in the Brazilian population. The LCA results showed that, in general, the scenarios with HDPE plastic bags presented the best environmental performances, while those with TPS presented the worst. The processes that contributed most to these results, representing 70% or more of the environmental impact in each impact category, are related to the use of raw materials, electricity, and water for the manufacture of plastic bags and the treatment in landfills. In other words, the fact that TPS has a mass around six times greater than that of HDPE and two times greater than that of LDPE ends up leaving this type of polymer with the worst environmental performance. In the comparative analysis of scenarios for the same polymer, scenarios that involve the use and reuse of plastic bags present the lowest potential environmental impacts. In contrast, those related to the use and disposal in landfills present the highest possible environmental impacts. The results of emergy accounting showed that the HDPE scenarios had the lowest total emergy flow, ranging from 1.77 × 1013 seJ to 2.40 × 1013 seJ. In contrast, the LDPE scenarios had the highest total emergy flow, ranging from 1.15 × 1014 to 1.21 × 1014 seJ. Although LDPE had the highest total emergy flow values, these results are similar to those obtained by the fossil resource scarcity impact category, which focuses on resource consumption analysis. Thus, through a real approach to the use of plastic bags and solid waste management in the Brazilian context, this study brings essential insights to direct public policies related to the consumption of plastic bags. Full article
(This article belongs to the Section Sustainable Products and Services)
Show Figures

Figure 1

20 pages, 3018 KiB  
Article
Global Semantic Localization from Abstract Ellipse-Ellipsoid Model and Object-Level Instance Topology
by Heng Wu, Yanjie Liu, Chao Wang and Yanlong Wei
Remote Sens. 2024, 16(22), 4187; https://doi.org/10.3390/rs16224187 - 10 Nov 2024
Viewed by 778
Abstract
Robust and highly accurate localization using a camera is a challenging task when appearance varies significantly. In indoor environments, changes in illumination and object occlusion can have a significant impact on visual localization. In this paper, we propose a visual localization method based [...] Read more.
Robust and highly accurate localization using a camera is a challenging task when appearance varies significantly. In indoor environments, changes in illumination and object occlusion can have a significant impact on visual localization. In this paper, we propose a visual localization method based on an ellipse-ellipsoid model, combined with object-level instance topology and alignment. First, we develop a CNN-based (Convolutional Neural Network) ellipse prediction network, DEllipse-Net, which integrates depth information with RGB data to estimate the projection of ellipsoids onto images. Second, we model environments using 3D (Three-dimensional) ellipsoids, instance topology, and ellipsoid descriptors. Finally, the detected ellipses are aligned with the ellipsoids in the environment through semantic object association, and 6-DoF (Degree of Freedom) pose estimation is performed using the ellipse-ellipsoid model. In the bounding box noise experiment, DEllipse-Net demonstrates higher robustness compared to other methods, achieving the highest prediction accuracy for 11 out of 23 objects in ellipse prediction. In the localization test with 15 pixels of noise, we achieve ATE (Absolute Translation Error) and ARE (Absolute Rotation Error) of 0.077 m and 2.70 in the fr2_desk sequence. Additionally, DEllipse-Net is lightweight and highly portable, with a model size of only 18.6 MB, and a single model can handle all objects. In the object-level instance topology and alignment experiment, our topology and alignment methods significantly enhance the global localization accuracy of the ellipse-ellipsoid model. In experiments involving lighting changes and occlusions, our method achieves more robust global localization compared to the classical bag-of-words based localization method and other ellipse-ellipsoid localization methods. Full article
Show Figures

Figure 1

21 pages, 1335 KiB  
Article
Does Government Environmental Concern Affect Enterprise Sustainable Development? Evidence from China
by Fan Ren
Sustainability 2024, 16(21), 9527; https://doi.org/10.3390/su16219527 - 1 Nov 2024
Cited by 1 | Viewed by 1262
Abstract
As the executor and agent of China’s environmental policy, local governments’ environmental concern reflects local governments’ determination in environmental governance. To figure out how the strengthening environmental concerns affect enterprises’ long-term activities, this study focuses on pharmaceutical manufacturing enterprises due to the enormous [...] Read more.
As the executor and agent of China’s environmental policy, local governments’ environmental concern reflects local governments’ determination in environmental governance. To figure out how the strengthening environmental concerns affect enterprises’ long-term activities, this study focuses on pharmaceutical manufacturing enterprises due to the enormous and complex composition of emissions. We apply bag of words to summarize relevant environmental words from the annual work reports in local governments to measure environmental concern. The empirical results of the OLS method reveal that the increasing environmental concerns of local governments did decrease the growth rate of chemical oxygen demand (COD) emission authentically. At the same time, it will inhibit the research and experimental development (R&D) activity intensity, but promote production efficiency of pharmaceutical manufacturing enterprises. After that, we discuss the heterogeneity of enterprise ownership, corporate social responsibility and regional regulatory strength of enterprises. Overall, we conclude that environmental concern did reduce COD emission and promote production efficiency, but it also has negative spillover effects. The novel contribution of this paper is that it enriches the trade-off between strengthening environmental compliance costs and long-term production and innovation activities. These results indicate that pharmaceutical manufacturing enterprises prioritize optimizing existing production processes instead of adopting efficient technology when complying with stricter environmental regulation. The reduction of R&D activities may pose risks to the long-term sustainable development of enterprises. Full article
Show Figures

Figure 1

23 pages, 410 KiB  
Article
Towards AI-Generated Essay Classification Using Numerical Text Representation
by Natalia Krawczyk, Barbara Probierz and Jan Kozak
Appl. Sci. 2024, 14(21), 9795; https://doi.org/10.3390/app14219795 - 26 Oct 2024
Viewed by 1047
Abstract
The detection of essays written by AI compared to those authored by students is increasingly becoming a significant issue in educational settings. This research examines various numerical text representation techniques to improve the classification of these essays. Utilizing a diverse dataset, we undertook [...] Read more.
The detection of essays written by AI compared to those authored by students is increasingly becoming a significant issue in educational settings. This research examines various numerical text representation techniques to improve the classification of these essays. Utilizing a diverse dataset, we undertook several preprocessing steps, including data cleaning, tokenization, and lemmatization. Our system analyzes different text representation methods such as Bag of Words, TF-IDF, and fastText embeddings in conjunction with multiple classifiers. Our experiments showed that TF-IDF weights paired with logistic regression reached the highest accuracy of 99.82%. Methods like Bag of Words, TF-IDF, and fastText embeddings achieved accuracies exceeding 96.50% across all tested classifiers. Sentence embeddings, including MiniLM and distilBERT, yielded accuracies from 93.78% to 96.63%, indicating room for further refinement. Conversely, pre-trained fastText embeddings showed reduced performance, with a lowest accuracy of 89.88% in logistic regression. Remarkably, the XGBoost classifier delivered the highest minimum accuracy of 96.24%. Specificity and precision were above 99% for most methods, showcasing high capability in differentiating between student-created and AI-generated texts. This study underscores the vital role of choosing dataset-specific text representations to boost classification accuracy. Full article
Show Figures

Figure 1

16 pages, 2663 KiB  
Article
Bag of Feature-Based Ensemble Subspace KNN Classifier in Muscle Ultrasound Diagnosis of Diabetic Peripheral Neuropathy
by Kadhim K. Al-Barazanchi, Ali H. Al-Timemy and Zahid M. Kadhim
Math. Comput. Appl. 2024, 29(5), 95; https://doi.org/10.3390/mca29050095 - 20 Oct 2024
Viewed by 892
Abstract
Muscle ultrasound quantification is a valuable complementary diagnostic tool for diabetic peripheral neuropathy (DPN), enhancing physicians’ diagnostic capabilities. Quantitative assessment is generally regarded as more reliable and sensitive than visual evaluation, which often necessitates specialized expertise. This work develops a computer-aided diagnostic (CAD) [...] Read more.
Muscle ultrasound quantification is a valuable complementary diagnostic tool for diabetic peripheral neuropathy (DPN), enhancing physicians’ diagnostic capabilities. Quantitative assessment is generally regarded as more reliable and sensitive than visual evaluation, which often necessitates specialized expertise. This work develops a computer-aided diagnostic (CAD) system based on muscle ultrasound that integrates the bag of features (BOF) and an ensemble subspace k-nearest neighbor (KNN) algorithm for DPN detection. The BOF creates a histogram of visual word occurrences to represent the muscle ultrasound images and trains an ensemble classifier through cross-validation, determining optimal parameters to improve classification accuracy for the ensemble diagnosis system. The dataset includes ultrasound images of six muscles from 53 subjects, consisting of 27 control and 26 patient cases. An empirical analysis was conducted for each binary classifier based on muscle type to select the best vocabulary tree properties or K values for BOF. The result indicates that ensemble subspace KNN classification, based on the bag of features, achieved an accuracy of 97.23%. CAD systems can effectively diagnose muscle pathology, thereby addressing limitations and identifying issues in individuals with diabetes. This research underscores muscle ultrasound as a promising diagnostic tool to aid physicians in making accurate diagnoses, streamlining workflow, and uncovering muscle-related complications in DPN patients. Full article
(This article belongs to the Section Engineering)
Show Figures

Figure 1

21 pages, 1242 KiB  
Article
A Bag-of-Words Approach for Information Extraction from Electricity Invoices
by Javier Sánchez and Giovanny A. Cuervo-Londoño
AI 2024, 5(4), 1837-1857; https://doi.org/10.3390/ai5040091 - 8 Oct 2024
Viewed by 888
Abstract
In the context of digitization and automation, extracting relevant information from business documents remains a significant challenge. It is typical to rely on machine-learning techniques to automate the process, reduce manual labor, and minimize errors. This work introduces a new model for extracting [...] Read more.
In the context of digitization and automation, extracting relevant information from business documents remains a significant challenge. It is typical to rely on machine-learning techniques to automate the process, reduce manual labor, and minimize errors. This work introduces a new model for extracting key values from electricity invoices, including customer data, bill breakdown, electricity consumption, or marketer data. We evaluate several machine learning techniques, such as Naive Bayes, Logistic Regression, Random Forests, or Support Vector Machines. Our approach relies on a bag-of-words strategy and custom-designed features tailored for electricity data. We validate our method on the IDSEM dataset, which includes 75,000 electricity invoices with eighty-six fields. The model converts PDF invoices into text and processes each word separately using a context of eleven words. The results of our experiments indicate that Support Vector Machines and Random Forests perform exceptionally well in capturing numerous values with high precision. The study also explores the advantages of our custom features and evaluates the performance of unseen documents. The precision obtained with Support Vector Machines is 91.86% on average, peaking at 98.47% for one document template. These results demonstrate the effectiveness of our method in accurately extracting key values from invoices. Full article
Show Figures

Figure 1

18 pages, 489 KiB  
Article
Maximizing Profitability and Occupancy: An Optimal Pricing Strategy for Airbnb Hosts Using Regression Techniques and Natural Language Processing
by Luca Di Persio and Enis Lalmi
J. Risk Financial Manag. 2024, 17(9), 414; https://doi.org/10.3390/jrfm17090414 - 18 Sep 2024
Viewed by 2224
Abstract
In the competitive landscape of Airbnb hosting, optimizing pricing strategies for properties is a complex challenge that requires revenue maximization with high occupancy rates. This research aimed to introduce a solution that leverages big data and machine learning techniques to help hosts improve [...] Read more.
In the competitive landscape of Airbnb hosting, optimizing pricing strategies for properties is a complex challenge that requires revenue maximization with high occupancy rates. This research aimed to introduce a solution that leverages big data and machine learning techniques to help hosts improve their property’s market performance. Our primary goal was to introduce a solution that can augment property owners’ understanding of their property’s market value within their urban context, thereby optimizing both the utilization and profitability of their listings. We employed a multi-faceted approach with diverse models, including support vector regression, XGBoost, and neural networks, to analyze the influence of factors such as location, host attributes, and guest reviews on a listing’s financial performance. To further refine our predictive models, we integrated natural language processing techniques for in-depth listing review analysis, focusing on term frequency-inverse document frequency (TF-IDF), bag-of-words, and aspect-based sentiment analysis. Integrating such techniques allowed for in-depth listing review analysis, providing nuanced insights into guest preferences and satisfaction. Our findings demonstrated that AirBnB hosts can effectively utilize both state-of-the-art and traditional machine learning algorithms to better understand customer needs and preferences, more accurately assess their listings’ market value, and focus on the importance of dynamic pricing strategies. By adopting this data-driven approach, hosts can achieve a balance between maintaining competitive pricing and ensuring high occupancy rates. This method not only enhances revenue potential but also contributes to improved guest satisfaction and the growing field of data-driven decisions in the sharing economy, specially tailored to the challenges of short-term rentals. Full article
(This article belongs to the Section Mathematics and Finance)
Show Figures

Figure 1

28 pages, 2653 KiB  
Article
How Does Digital Transformation Moderate Green Culture, Job Satisfaction, and Competitive Advantage in Sustainable Hotels?
by Gul Coskun Degirmen, Derya Ozilhan Ozbey, Emine Sardagı, Ilknur Cevik Tekin, Durmus Koc, Pınar Erdogan, Feden Koc and Emel Arık
Sustainability 2024, 16(18), 8072; https://doi.org/10.3390/su16188072 - 15 Sep 2024
Viewed by 2036
Abstract
Target groups within an organization adopt its culture, reflecting it in all internal and external business processes. Adopting a green organizational culture in hotels with sustainability certificates plays an important role in reshaping business processes by developing sustainability awareness among employees. Digital transformation, [...] Read more.
Target groups within an organization adopt its culture, reflecting it in all internal and external business processes. Adopting a green organizational culture in hotels with sustainability certificates plays an important role in reshaping business processes by developing sustainability awareness among employees. Digital transformation, which facilitates corporate culture and business processes, plays a role in employee job satisfaction while also supporting environmental, social, and economic sustainability. This research aims to determine the relationship between green organizational culture, job satisfaction, and competitive advantage variables and to examine the moderating role of digital transformation on these relationships. The data-collecting techniques of choice were surveys and semi-structured interviews. While Amos software (Version 24) was used to test the hypothetical model in the analysis of survey data, a Hayes Process macro was used to determine the moderating effect. The interview forms’ data was analyzed using a bag-of-words model. According to the research results, there is a positive relationship between the participation, consistency, and adaptability sub-dimensions of green organizational culture and job satisfaction, while there is no significant relationship between the mission sub-dimension and job satisfaction. Furthermore, the study reveals the moderating role of digital transformation in the effect of job satisfaction on competitive advantage. Full article
Show Figures

Figure 1

26 pages, 1413 KiB  
Article
Active Learning for Biomedical Article Classification with Bag of Words and FastText Embeddings
by Paweł Cichosz
Appl. Sci. 2024, 14(17), 7945; https://doi.org/10.3390/app14177945 - 6 Sep 2024
Viewed by 1031
Abstract
In several applications of text classification, training document labels are provided by human evaluators, and therefore, gathering sufficient data for model creation is time consuming and costly. The labeling time and effort may be reduced by active learning, in which classification models are [...] Read more.
In several applications of text classification, training document labels are provided by human evaluators, and therefore, gathering sufficient data for model creation is time consuming and costly. The labeling time and effort may be reduced by active learning, in which classification models are created based on relatively small training sets, which are obtained by collecting class labels provided in response to labeling requests or queries. This is an iterative process with a sequence of models being fitted, and each of them is used to select query articles to be added to the training set for the next one. Such a learning scenario may pose different challenges for machine learning algorithms and text representation methods used for text classification than ordinary passive learning, since they have to deal with very small, often imbalanced data, and the computational expense of both model creation and prediction has to remain low. This work examines how classification algorithms and text representation methods that have been found particularly useful by prior work handle these challenges. The random forest and support vector machines algorithms are coupled with the bag of words and FastText word embedding representations and applied to datasets consisting of scientific article abstracts from systematic literature review studies in the biomedical domain. Several strategies are used to select articles for active learning queries, including uncertainty sampling, diversity sampling, and strategies favoring the minority class. Confidence-based and stability-based early stopping criteria are used to generate active learning termination signals. The results confirm that active learning is a useful approach to creating text classification models with limited access to labeled data, making it possible to save at least half of the human effort needed to assign relevant or irrelevant class labels to training articles. Two of the four examined combinations of classification algorithms and text representation methods were the most successful: the SVM algorithm with the FastText representation and the random forest algorithm with the bag of words representation. Uncertainty sampling turned out to be the most useful query selection strategy, and confidence-based stopping was found more universal and easier to configure than stability-based stopping. Full article
(This article belongs to the Special Issue Data and Text Mining: New Approaches, Achievements and Applications)
Show Figures

Figure 1

21 pages, 7746 KiB  
Article
Multi-Robot Collaborative Mapping with Integrated Point-Line Features for Visual SLAM
by Yu Xia, Xiao Wu, Tao Ma, Liucun Zhu, Jingdi Cheng and Junwu Zhu
Sensors 2024, 24(17), 5743; https://doi.org/10.3390/s24175743 - 4 Sep 2024
Viewed by 1395
Abstract
Simultaneous Localization and Mapping (SLAM) enables mobile robots to autonomously perform localization and mapping tasks in unknown environments. Despite significant progress achieved by visual SLAM systems in ideal conditions, relying solely on a single robot and point features for mapping in large-scale indoor [...] Read more.
Simultaneous Localization and Mapping (SLAM) enables mobile robots to autonomously perform localization and mapping tasks in unknown environments. Despite significant progress achieved by visual SLAM systems in ideal conditions, relying solely on a single robot and point features for mapping in large-scale indoor environments with weak-texture structures can affect mapping efficiency and accuracy. Therefore, this paper proposes a multi-robot collaborative mapping method based on point-line fusion to address this issue. This method is designed for indoor environments with weak-texture structures for localization and mapping. The feature-extraction algorithm, which combines point and line features, supplements the existing environment point feature-extraction method by introducing a line feature-extraction step. This integration ensures the accuracy of visual odometry estimation in scenes with pronounced weak-texture structure features. For relatively large indoor scenes, a scene-recognition-based map-fusion method is proposed in this paper to enhance mapping efficiency. This method relies on visual bag of words to determine overlapping areas in the scene, while also proposing a keyframe-extraction method based on photogrammetry to improve the algorithm’s robustness. By combining the Perspective-3-Point (P3P) algorithm and Bundle Adjustment (BA) algorithm, the relative pose-transformation relationships of multi-robots in overlapping scenes are resolved, and map fusion is performed based on these relative pose relationships. We evaluated our algorithm on public datasets and a mobile robot platform. The experimental results demonstrate that the proposed algorithm exhibits higher robustness and mapping accuracy. It shows significant effectiveness in handling mapping in scenarios with weak texture and structure, as well as in small-scale map fusion. Full article
(This article belongs to the Section Navigation and Positioning)
Show Figures

Figure 1

24 pages, 1413 KiB  
Article
Loop Detection Method Based on Neural Radiance Field BoW Model for Visual Inertial Navigation of UAVs
by Xiaoyue Zhang, Yue Cui, Yanchao Ren, Guodong Duan and Huanrui Zhang
Remote Sens. 2024, 16(16), 3038; https://doi.org/10.3390/rs16163038 - 19 Aug 2024
Viewed by 965
Abstract
The loop closure detection (LCD) methods in Unmanned Aerial Vehicle (UAV) Visual Inertial Navigation System (VINS) are often affected by issues such as insufficient image texture information and limited observational perspectives, resulting in constrained UAV positioning accuracy and reduced capability to perform complex [...] Read more.
The loop closure detection (LCD) methods in Unmanned Aerial Vehicle (UAV) Visual Inertial Navigation System (VINS) are often affected by issues such as insufficient image texture information and limited observational perspectives, resulting in constrained UAV positioning accuracy and reduced capability to perform complex tasks. This study proposes a Bag-of-Words (BoW) LCD method based on Neural Radiance Field (NeRF), which estimates camera poses from existing images and achieves rapid scene reconstruction through NeRF. A method is designed to select virtual viewpoints and render images along the flight trajectory using a specific sampling approach to expand the limited observational angles, mitigating the impact of image blur and insufficient texture information at specific viewpoints while enlarging the loop closure candidate frames to improve the accuracy and success rate of LCD. Additionally, a BoW vector construction method that incorporates the importance of similar visual words and an adapted virtual image filtering and comprehensive scoring calculation method are designed to determine loop closures. Applied to VINS-Mono and ORB-SLAM3, and compared with the advanced BoW model LCDs of the two systems, results indicate that the NeRF-based BoW LCD method can detect more than 48% additional accurate loop closures, while the system’s navigation positioning error mean is reduced by over 46%, validating the effectiveness and superiority of the proposed method and demonstrating its significant importance for improving the navigation accuracy of VINS. Full article
Show Figures

Figure 1

17 pages, 9779 KiB  
Article
Optimizing Speech Emotion Recognition with Machine Learning Based Advanced Audio Cue Analysis
by Nuwan Pallewela, Damminda Alahakoon, Achini Adikari, John E. Pierce and Miranda L. Rose
Technologies 2024, 12(7), 111; https://doi.org/10.3390/technologies12070111 - 11 Jul 2024
Cited by 1 | Viewed by 2764
Abstract
In today’s fast-paced and interconnected world, where human–computer interaction is an integral component of daily life, the ability to recognize and understand human emotions has emerged as a crucial facet of technological advancement. However, human emotion, a complex interplay of physiological, psychological, and [...] Read more.
In today’s fast-paced and interconnected world, where human–computer interaction is an integral component of daily life, the ability to recognize and understand human emotions has emerged as a crucial facet of technological advancement. However, human emotion, a complex interplay of physiological, psychological, and social factors, poses a formidable challenge even for other humans to comprehend accurately. With the emergence of voice assistants and other speech-based applications, it has become essential to improve audio-based emotion expression. However, there is a lack of specificity and agreement in current emotion annotation practice, as evidenced by conflicting labels in many human-annotated emotional datasets for the same speech segments. Previous studies have had to filter out these conflicts and, therefore, a large portion of the collected data has been considered unusable. In this study, we aimed to improve the accuracy of computational prediction of uncertain emotion labels by utilizing high-confidence emotion labelled speech segments from the IEMOCAP emotion dataset. We implemented an audio-based emotion recognition model using bag of audio word encoding (BoAW) to obtain a representation of audio aspects of emotion in speech with state-of-the-art recurrent neural network models. Our approach improved the state-of-the-art audio-based emotion recognition with a 61.09% accuracy rate, an improvement of 1.02% over the BiDialogueRNN model and 1.72% over the EmoCaps multi-modal emotion recognition models. In comparison to human annotation, our approach achieved similar results in identifying positive and negative emotions. Furthermore, it has proven effective in accurately recognizing the sentiment of uncertain emotion segments that were previously considered unusable in other studies. Improvements in audio emotion recognition could have implications in voice-based assistants, healthcare, and other industrial applications that benefit from automated communication. Full article
Show Figures

Figure 1

19 pages, 2258 KiB  
Article
Social Network Forensics Analysis Model Based on Network Representation Learning
by Kuo Zhao, Huajian Zhang, Jiaxin Li, Qifu Pan, Li Lai, Yike Nie and Zhongfei Zhang
Entropy 2024, 26(7), 579; https://doi.org/10.3390/e26070579 - 7 Jul 2024
Viewed by 1429
Abstract
The rapid evolution of computer technology and social networks has led to massive data generation through interpersonal communications, necessitating improved methods for information mining and relational analysis in areas such as criminal activity. This paper introduces a Social Network Forensic Analysis model that [...] Read more.
The rapid evolution of computer technology and social networks has led to massive data generation through interpersonal communications, necessitating improved methods for information mining and relational analysis in areas such as criminal activity. This paper introduces a Social Network Forensic Analysis model that employs network representation learning to identify and analyze key figures within criminal networks, including leadership structures. The model incorporates traditional web forensics and community algorithms, utilizing concepts such as centrality and similarity measures and integrating the Deepwalk, Line, and Node2vec algorithms to map criminal networks into vector spaces. This maintains node features and structural information that are crucial for the relational analysis. The model refines node relationships through modified random walk sampling, using BFS and DFS, and employs a Continuous Bag-of-Words with Hierarchical Softmax for node vectorization, optimizing the value distribution via the Huffman tree. Hierarchical clustering and distance measures (cosine and Euclidean) were used to identify the key nodes and establish a hierarchy of influence. The findings demonstrate the effectiveness of the model in accurately vectorizing nodes, enhancing inter-node relationship precision, and optimizing clustering, thereby advancing the tools for combating complex criminal networks. Full article
(This article belongs to the Section Complexity)
Show Figures

Figure 1

Back to TopTop