Keyword: Computer vision : Search

research-article

Neural Schrödinger bridge for unpaired real-world image deraining

Information Sciences: an International Journal (ISCI), Volume 682, Issue Chttps://doi.org/10.1016/j.ins.2024.121199

Abstract

Given the significant differences between domains, current unpaired learning methods struggle to accurately map the relationship between rainy and clear images. To this end, we introduce a neural Schrödinger bridge (NSB) for unpaired real-world ...

research-article

Feature aggregation network for small object detection

Expert Systems with Applications: An International Journal (EXWA), Volume 255, Issue PBhttps://doi.org/10.1016/j.eswa.2024.124686

Abstract

Due to the miniature scale and limited identifiable features, small objects pose a significant challenge in detection. Improving the accuracy of small object detection is a momentous issue of concern among researchers. Feature pyramid network ...

Highlights

Feature Aggregation Network (FAN), a simple and effective method, is proposed.
Feature-Aware Module narrows the semantic gap that favors detecting small objects.
A dual top-down pathway enhances spatial and semantic information.

research-article

AC-YOLO: Multi-category and high-precision detection model for stored grain pests based on integrated multiple attention mechanisms

Expert Systems with Applications: An International Journal (EXWA), Volume 255, Issue PBhttps://doi.org/10.1016/j.eswa.2024.124659

Abstract

The existing detection models for stored grain pests have low accuracy and poor generalization ability in fine-grained detection tasks involving numerous species, minor inter-class differences, and significant intra-class variations. This study ...

research-article

Multi-label image classification using adaptive graph convolutional networks: From a single domain to multiple domains

Computer Vision and Image Understanding (CVIU), Volume 247, Issue Chttps://doi.org/10.1016/j.cviu.2024.104062

Abstract

This paper proposes an adaptive graph-based approach for multi-label image classification. Graph-based methods have been largely exploited in the field of multi-label classification, given their ability to model label correlations. Specifically, ...

Highlights

A novel graph-based approach for multi-label image classification is proposed.
It learns adaptively a graph describing label dependencies.
It is extended to cross-domain settings with an adversarial domain adaptation schema.
...

article

Explainable artificial intelligence: A survey of needs, techniques, applications, and future direction

Neurocomputing (NEUROC), Volume 599, Issue Chttps://doi.org/10.1016/j.neucom.2024.128111

Abstract

Artificial intelligence models encounter significant challenges due to their black-box nature, particularly in safety-critical domains such as healthcare, finance, and autonomous vehicles. Explainable Artificial Intelligence (XAI) addresses these ...

Article

Clean-Image Backdoor Attacks

Artificial Neural Networks and Machine Learning – ICANN 2024Pages 187–202https://doi.org/10.1007/978-3-031-72359-9_14

Abstract

To gather a significant quantity of annotated training data for high-performance image classification models, numerous companies opt to enlist third-party providers to label their unlabeled data. This practice is widely regarded as secure, even in ...

Article

CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Ground Image Synthesis

Artificial Neural Networks and Machine Learning – ICANN 2024Pages 287–302https://doi.org/10.1007/978-3-031-72338-4_20

Abstract

Satellite-to-ground image synthesis aims at generating a realistic street-view image from its corresponding satellite-view image. Despite the considerable efforts towards improving the geometry structure of the synthetic street-view images, the ...

research-article

Improving visual grounding with multi-modal interaction and auto-regressive vertex generation

Neurocomputing (NEUROC), Volume 598, Issue Chttps://doi.org/10.1016/j.neucom.2024.128227

Abstract

We propose a concise and consistent network focusing on multi-task learning of Referring Expression Comprehension (REC) and Segmentation (RES) within Visual grounding (VG). To simplify the model architecture and achieve parameter sharing, we ...

Highlights

We propose a simple and unified framework for Referring Expression Comprehension and Segmentation, casting visual grounding as a sequence of point generation problem under a common L 1 loss.
Our Multi-Modal Interaction Fusion enhances ...

research-article

Applying deep learning image enhancement methods to improve person re-identification

Neurocomputing (NEUROC), Volume 598, Issue Chttps://doi.org/10.1016/j.neucom.2024.128011

Abstract

Person re-identification has gained significant attention in recent years due to its numerous practical applications in video surveillance. However, while artificial intelligence and deep learning methods have enabled substantial progress in ...

research-article

LDC-PP-YOLOE: a lightweight model for detecting and counting citrus fruit

Pattern Analysis & Applications (PAAS), Volume 27, Issue 4https://doi.org/10.1007/s10044-024-01329-1

Abstract

In the citrus orchard environment, accurate counting of the fruit, and the use of lightweight detection methods are the key presteps to automate citrus picking and yield estimations. Most high-precision fruit detection models based on deep ...

research-article

Instance segmentation of faces and mouth-opening degrees based on improved YOLOv8 method

Multimedia Systems (MUME), Volume 30, Issue 5https://doi.org/10.1007/s00530-024-01472-z

Abstract

Instance segmentation of faces and mouth-opening degrees is an important technology for meal-assisting robotics in food delivery safety. However, due to the diversity in in shape, color, and posture of faces and the mouth with small area contour, ...

research-article

Integrating YOLOv8 and CSPBottleneck based CNN for enhanced license plate character recognition

Journal of Real-Time Image Processing (SPJRTIP), Volume 21, Issue 5https://doi.org/10.1007/s11554-024-01537-2

Abstract

The paper introduces an integrated methodology for license plate character recognition, combining YOLOv8 for segmentation and a CSPBottleneck-based CNN classifier for character recognition. The proposed approach incorporates pre-processing ...

Article

Weak Supervised Asphalt Pavement Segmentation

Computational Collective IntelligencePages 256–268https://doi.org/10.1007/978-3-031-70819-0_20

Abstract

The labeling effort of deep learning-connected projects can introduce unexpected expenditure, especially when we are dealing with a challenging problem representation such as noisy camera images. Instead of a classical semi-supervised concept, ...

article

Revisiting vision-based violence detection in videos: A critical analysis

Neurocomputing (NEUROC), Volume 597, Issue Chttps://doi.org/10.1016/j.neucom.2024.128113

Abstract

An ever-increasing installation of surveillance cameras at different places for ensuring public safety, security and asset protection has triggered the need for intelligent video surveillance to monitor the people and their behavior. Violence ...

article

Deep learning-based 3D reconstruction from multiple images: A survey

Neurocomputing (NEUROC), Volume 597, Issue Chttps://doi.org/10.1016/j.neucom.2024.128018

Abstract

Reconstructing the three-dimensional structure of a scene is a classic and fundamental problem in computer vision, but it has been revolutionized by recent advancements in deep machine learning. In this paper, we survey this rich and growing ...

research-article

Visionary vigilance: Optimized YOLOV8 for fallen person detection with large-scale benchmark dataset

Image and Vision Computing (IAVC), Volume 149, Issue Chttps://doi.org/10.1016/j.imavis.2024.105195

Abstract

Falls pose a significant risk to elderly people, patients with diseases such as neurological disorders, cardiovascular diseases, and disabled children. This highlights the need for real-time intelligent fall detection (FD) systems for quick ...

Highlights

Developed DiverseFALL10500: a comprehensive dataset with 10,500 annotated images.
Optimized YOLOv8s with focus module and CBAM integration for improved performance.
Demonstrated superiority over SOTA techniques through extensive ...

research-article

Vision-based method to identify materials transported by dump trucks

Engineering Applications of Artificial Intelligence (EAAI), Volume 135, Issue Chttps://doi.org/10.1016/j.engappai.2024.108768

Abstract

Construction and demolition waste (C&DW) materials are highly valuable as, in most cases, they can be reused. In other cases, such material can be highly pollutant. Therefore, their traceability becomes very important in the frame of sustainable ...

research-article

Real-time evaluation of object detection models across open world scenarios

Applied Soft Computing (APSC), Volume 163, Issue Chttps://doi.org/10.1016/j.asoc.2024.111921

Abstract

Object detection models have been experiencing significant improvements over the years due to advancements in deep learning techniques, increased availability of large-scale annotated datasets, and computational resources. Different object ...

Graphical Abstract

Display Omitted

Highlights

Qualitative and Quantitative Comparison of object detection models.
Assessment of model performance under real-world challenges in environmental scenes.
Insights into model suitability for environmental monitoring, using detection ...

research-article

DenseViT-XGB: A hybrid approach for dates varieties identification

Neurocomputing (NEUROC), Volume 596, Issue Chttps://doi.org/10.1016/j.neucom.2024.127976

Abstract

The digitization of variety identification is of great importance for the improvement of farming practices in date fruit production. In this study, we have developed a hybrid approach called DenseViT-XGB for date fruit variety identification. ...

research-article

Real-time lightweight drone detection model: Fine-grained Identification of four types of drones based on an improved Yolov7 model

Neurocomputing (NEUROC), Volume 596, Issue Chttps://doi.org/10.1016/j.neucom.2024.127941

Abstract

Recently, the rapid progress of artificial intelligence has enhanced the human-robot relationship through the development of several autonomous robots; such as drones. The overwhelming rise of drones has brought both relevant advantages and ...

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Paper Award

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

Upcoming Conferences