KLU-APIN: Vol 54, No 21

Volume 54, Issue 21, Nov 2024

Publisher:

Kluwer Academic Publishers
101 Philip Drive Assinippi Park Norwell, MA
United States

ISSN: 0924-669X

research-article

Deep learning models for perception of brightness related illusions

Pages 10259–10283 https://doi.org/10.1007/s10489-024-05658-w

Abstract

Illusions are like holes in our effortless visual mechanism through which we can peep into the internal mechanisms of the brain. Scientists attempted to explain underlying physiological, physical, and cognitive mechanisms of illusions by the ...

research-article

A dual-branch network based on optical flow learning and semantic consistency for macro-expression spotting

Pages 10284–10299 https://doi.org/10.1007/s10489-024-05726-1

Abstract

Macro-expression spotting is an important prior step in many dynamic facial expression analysis applications. It automatically detects the onset and offset image frames of a macro-expression in the video. The state-of-the-art methods of macro-...

research-article

Application of a dense fusion attention network in fault diagnosis of centrifugal fan

Pages 10300–10319 https://doi.org/10.1007/s10489-024-05643-3

Abstract

Although deep learning recognition models have been widely used in the condition monitoring of rotating machinery, it is still a challenge to understand the correspondence between the structure and function of the model and the ...

research-article

TSA-Net: a temporal knowledge graph completion method with temporal-structural adaptation

Pages 10320–10332 https://doi.org/10.1007/s10489-024-05734-1

Abstract

Temporal Knowledge Graph Completion (TKGC) aims to infer missing facts in Temporal Knowledge Graphs (TKGs), where facts are stored along with significant temporal information. However, existing TKGC methods only consider message passing on ...

research-article

Knowledge graph-based recommendation with knowledge noise reduction and data augmentation

Pages 10333–10359 https://doi.org/10.1007/s10489-024-05657-x

Abstract

In the field of recommendation algorithms, Knowledge Graphs are often utilized as supplementary information to enhance recommendation accuracy. However, while applying Knowledge Graphs enriches recommendation information, it also introduces ...

research-article

Chaotic image encryption based on partial face recognition and DNA diffusion

Pages 10360–10373 https://doi.org/10.1007/s10489-024-05613-9

Abstract

This paper proposes an innovative image encryption algorithm that leverages partial face recognition and DNA diffusion, building upon advancements in chaotic image encryption and face recognition technologies. The key is generated using the secure ...

research-article

Differentiating broadcast from viral: a causal inference approach for information diffusion analysis

Pages 10374–10385 https://doi.org/10.1007/s10489-024-05723-4

Abstract

Classifying information diffusion patterns is critical to many information analysis areas, e.g., misleading information detection. However, diffusion pattern classification remains challenging when multiple users are involved. To address this ...

research-article

Automated diagnosis of cervical spine physiological curvature based on deep neural networks with transformer by using nmODE

Pages 10386–10400 https://doi.org/10.1007/s10489-024-05736-z

Abstract

In this paper, we focus on the automated diagnosis of physiological curvature in the cervical spine, with an emphasis on feature point localization. Cervical spine deformity is prevalent, and the Cobb angle is widely recognized as the gold ...

research-article

MAPM: multiscale attention pre-training model for TextVQA

Pages 10401–10413 https://doi.org/10.1007/s10489-024-05727-0

Abstract

The Text Visual Question Answering (TextVQA) task aims to enable models to read and answer questions based on images with text. Existing attention-based methods for TextVQA tasks often face challenges in effectively aligning local features between ...

research-article

The fuzzy inference system based on axiomatic fuzzy sets using overlap functions as aggregation operators and its approximation properties

Pages 10414–10437 https://doi.org/10.1007/s10489-024-05716-3

Abstract

As significant vehicles for applying fuzzy set theories, fuzzy inference systems (FISs) have been widely utilized in artificial intelligence. However, challenges such as computational complexity and subjective design persist in FIS implementation. ...

research-article

Single-source unsupervised domain adaptation for cross-subject MI-EEG classification based on discriminative information

Pages 10438–10454 https://doi.org/10.1007/s10489-024-05662-0

Abstract

Electroencephalography (EEG) provides a wealth of physiological and psychological information. Decoding EEG signals enables machines to recognize brain activity, a crucial aspect in brain-computer interaction and medical rehabilitation. However, ...

research-article

Securing IP in edge AI: neural network watermarking for multimodal models

Pages 10455–10472 https://doi.org/10.1007/s10489-024-05746-x

Abstract

In the realm of edge AI systems where deep learning is paramount, protecting the intellectual property (IP) of multimodal neural network models is crucial. Current watermarking solutions often bypass the intricacies of multimodal models and the ...

research-article

Spatio-temporal data generation based on separated attention for ENSO prediction

Pages 10473–10489 https://doi.org/10.1007/s10489-024-05547-2

Abstract

The El Niño-Southern Oscillation (ENSO) phenomenon is often accompanied by multiple extreme hazards—thus, its accurate prediction is crucial to the prevention of such crises. Recently, machine learning algorithms have exhibited excellent ENSO ...

research-article

Performance metrics for multi-step forecasting measuring win-loss, seasonal variance and forecast stability: an empirical study

Pages 10490–10515 https://doi.org/10.1007/s10489-024-05715-4

Abstract

This paper addresses the evaluation of multi-step point forecasting models. Currently, deep learning models for multi-step forecasting are evaluated on datasets by selecting one error metric that is aggregated across the time series and the ...

research-article

Dirichlet stochastic weights averaging for graph neural networks

Pages 10516–10524 https://doi.org/10.1007/s10489-024-05708-3

Abstract

The popularity of Graph Neural Networks (GNNs) has grown significantly because GNNs handle relational datasets such as social networks and citation networks. However, the usual relational dataset is sparse, and GNNs are easy to overfit to the ...

research-article

Entity clustering-based meta-learning for link prediction in evolutionary fault diagnosis event graphs

Pages 10525–10540 https://doi.org/10.1007/s10489-024-05749-8

Abstract

Fault diagnosis plays an important role in intelligent manufacturing. Knowledge modelling is often used for intelligent fault diagnosis purposes, and link prediction is performed in knowledge graphs to locate and trace system faults. However, due ...

research-article

Boosting sparsely annotated shadow detection

Pages 10541–10560 https://doi.org/10.1007/s10489-024-05740-3

Abstract

Sparsely annotated image segmentation has gained popularity due to its ability to significantly reduce the labeling burden on training data. However, existing methods still struggle to learn complete object structures, especially for complex ...

research-article

Multi-view pre-trained transformer via hierarchical capsule network for answer sentence selection

Pages 10561–10580 https://doi.org/10.1007/s10489-024-05513-y

Abstract

Answer selection requires technology that effectively captures in-depth semantic information between the question and the corresponding answer. Most existing studies focus on using linear or pooling operations to directly classify the output ...

research-article

Swin transformer-based traffic video text tracking

Pages 10581–10595 https://doi.org/10.1007/s10489-024-05710-9

Abstract

Intelligent systems, such as driving assistance systems, can assist drivers by providing basic traffic, road blockage and possible route information to enable safe driving. The goal of scene text tracking in driver assistance systems is to locate ...

research-article

Learning the structure of multivariate regression chain graphs by testing complete separators in prime blocks

Pages 10596–10607 https://doi.org/10.1007/s10489-024-05752-z

Abstract

This paper introduces an algorithm to construct a bidirectional causal graph using an augmented graph. The algorithm decomposes the augmented graph, significantly reducing the size of the variable set required for conditional independence testing. ...

research-article

Improving the transferability of adversarial attacks via self-ensemble

Pages 10608–10626 https://doi.org/10.1007/s10489-024-05728-z

Abstract

Deep neural networks have been used extensively for diverse visual tasks, including object detection, face recognition, and image classification. However, they face several security threats, such as adversarial attacks. To improve the resistance ...

research-article

Radar-camera fusion for 3D object detection with aggregation transformer

Pages 10627–10639 https://doi.org/10.1007/s10489-024-05718-1

Abstract

In recent years, with the continuous development of autonomous driving, monocular 3D object detection has garnered increasing attention as a crucial research topic. However, the precision of 3D object detection is impeded by the limitations of ...

research-article

Deep-SEA: a deep learning based patient specific multi-modality post-cancer survival estimation architecture

Pages 10640–10652 https://doi.org/10.1007/s10489-024-05794-3

Abstract

Cancer survival estimation is essential for post-cancer patient care, cancer management policy building, and the development of tailored treatment plans. Existing survival estimation methods use censored data; therefore, standard machine learning ...

research-article

Unsupervised attribute reduction based on neighborhood dependency

Pages 10653–10670 https://doi.org/10.1007/s10489-024-05604-w

Abstract

Neighborhood rough set theory is an important computational model in granular computing and has been successfully applied in many areas. One of its most prominent applications is in attribute reduction. However, most current attribute reduction ...

research-article

Semi-supervised regression with label-guided adaptive graph optimization

Pages 10671–10694 https://doi.org/10.1007/s10489-024-05766-7

Abstract

For the semi-supervised regression task, both the similarity of paired samples and the limited label information serve as core indicators. Nevertheless, most traditional semi-supervised regression methods cannot make full use of both ...

research-article

A lightweight hierarchical graph convolutional model for knowledge graph representation learning

Pages 10695–10708 https://doi.org/10.1007/s10489-024-05787-2

Abstract

Graph convolutional networks (GCNs) have emerged as powerful tools for handling graph-structured data. Many knowledge graph embedding models leverage GCNs as encoders to learn the relationships between central entities and their neighbors, showing ...

research-article

Crowd behavior detection: leveraging video swin transformer for crowd size and violence level analysis

Pages 10709–10730 https://doi.org/10.1007/s10489-024-05775-6

Abstract

In recent years, crowd behavior detection has posed significant challenges in the realm of public safety and security, even with the advancements in surveillance technologies. The ability to perform real-time surveillance and accurately identify ...

research-article

Generating crisp boundaries using multi-scale features and mixed loss function

Pages 10731–10747 https://doi.org/10.1007/s10489-024-05784-5

Abstract

Recently, boundary or edge detection has made great progress with the development of convolutional neural networks (CNNs), and some algorithms have achieved beyond-human-level performance. However, CNNs tend to generate blurred edge maps, and ...

research-article

ETransCap: efficient transformer for image captioning

Pages 10748–10762 https://doi.org/10.1007/s10489-024-05739-w

Abstract

Image captioning is a challenging task in computer vision that automatically generates a textual description of an image by integrating visual and linguistic information, as the generated captions must accurately describe the image’s content while ...

research-article

Product quality time series prediction with attention-based convolutional recurrent neural network

Pages 10763–10779 https://doi.org/10.1007/s10489-024-05709-2

Abstract

Product quality is the key index for measuring industrial manufacturing processes. Thanks to the ever-expanding scale of time-series data, deep learning can be regarded as an effective approach to predict the future product ...

Applied Intelligence
