Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
Neural Schrödinger bridge for unpaired real-world image deraining
Information Sciences: an International Journal (ISCI), Volume 682, Issue Chttps://doi.org/10.1016/j.ins.2024.121199AbstractGiven the significant differences between domains, current unpaired learning methods struggle to accurately map the relationship between rainy and clear images. To this end, we introduce a neural Schrödinger bridge (NSB) for unpaired real-world ...
- research-articleOctober 2024
Feature aggregation network for small object detection
Expert Systems with Applications: An International Journal (EXWA), Volume 255, Issue PBhttps://doi.org/10.1016/j.eswa.2024.124686AbstractDue to the miniature scale and limited identifiable features, small objects pose a significant challenge in detection. Improving the accuracy of small object detection is a momentous issue of concern among researchers. Feature pyramid network ...
Highlights- Feature Aggregation Network (FAN), a simple and effective method, is proposed.
- Feature-Aware Module narrows the semantic gap that favors detecting small objects.
- A dual top-down pathway enhances spatial and semantic information.
- research-articleOctober 2024
AC-YOLO: Multi-category and high-precision detection model for stored grain pests based on integrated multiple attention mechanisms
Expert Systems with Applications: An International Journal (EXWA), Volume 255, Issue PBhttps://doi.org/10.1016/j.eswa.2024.124659AbstractThe existing detection models for stored grain pests have low accuracy and poor generalization ability in fine-grained detection tasks involving numerous species, minor inter-class differences, and significant intra-class variations. This study ...
- research-articleOctober 2024
Multi-label image classification using adaptive graph convolutional networks: From a single domain to multiple domains
Computer Vision and Image Understanding (CVIU), Volume 247, Issue Chttps://doi.org/10.1016/j.cviu.2024.104062AbstractThis paper proposes an adaptive graph-based approach for multi-label image classification. Graph-based methods have been largely exploited in the field of multi-label classification, given their ability to model label correlations. Specifically, ...
Highlights- A novel graph-based approach for multi-label image classification is proposed.
- It learns adaptively a graph describing label dependencies.
- It is extended to cross-domain settings with an adversarial domain adaptation schema.
- ...
- articleOctober 2024
Explainable artificial intelligence: A survey of needs, techniques, applications, and future direction
Neurocomputing (NEUROC), Volume 599, Issue Chttps://doi.org/10.1016/j.neucom.2024.128111AbstractArtificial intelligence models encounter significant challenges due to their black-box nature, particularly in safety-critical domains such as healthcare, finance, and autonomous vehicles. Explainable Artificial Intelligence (XAI) addresses these ...
-
- ArticleSeptember 2024
Clean-Image Backdoor Attacks
- Dazhong Rong,
- Guoyao Yu,
- Shuheng Shen,
- Xinyi Fu,
- Peng Qian,
- Jianhai Chen,
- Qinming He,
- Xing Fu,
- Weiqiang Wang
Artificial Neural Networks and Machine Learning – ICANN 2024Pages 187–202https://doi.org/10.1007/978-3-031-72359-9_14AbstractTo gather a significant quantity of annotated training data for high-performance image classification models, numerous companies opt to enlist third-party providers to label their unlabeled data. This practice is widely regarded as secure, even in ...
- ArticleSeptember 2024
CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Ground Image Synthesis
Artificial Neural Networks and Machine Learning – ICANN 2024Pages 287–302https://doi.org/10.1007/978-3-031-72338-4_20AbstractSatellite-to-ground image synthesis aims at generating a realistic street-view image from its corresponding satellite-view image. Despite the considerable efforts towards improving the geometry structure of the synthetic street-view images, the ...
- research-articleOctober 2024
Improving visual grounding with multi-modal interaction and auto-regressive vertex generation
Neurocomputing (NEUROC), Volume 598, Issue Chttps://doi.org/10.1016/j.neucom.2024.128227AbstractWe propose a concise and consistent network focusing on multi-task learning of Referring Expression Comprehension (REC) and Segmentation (RES) within Visual grounding (VG). To simplify the model architecture and achieve parameter sharing, we ...
Highlights- We propose a simple and unified framework for Referring Expression Comprehension and Segmentation, casting visual grounding as a sequence of point generation problem under a common L 1 loss.
- Our Multi-Modal Interaction Fusion enhances ...
- research-articleOctober 2024
Applying deep learning image enhancement methods to improve person re-identification
- Oliverio J. Santana,
- Javier Lorenzo-Navarro,
- David Freire-Obregón,
- Daniel Hernández-Sosa,
- Modesto Castrillón-Santana
Neurocomputing (NEUROC), Volume 598, Issue Chttps://doi.org/10.1016/j.neucom.2024.128011AbstractPerson re-identification has gained significant attention in recent years due to its numerous practical applications in video surveillance. However, while artificial intelligence and deep learning methods have enabled substantial progress in ...
- research-articleSeptember 2024
Instance segmentation of faces and mouth-opening degrees based on improved YOLOv8 method
Multimedia Systems (MUME), Volume 30, Issue 5https://doi.org/10.1007/s00530-024-01472-zAbstractInstance segmentation of faces and mouth-opening degrees is an important technology for meal-assisting robotics in food delivery safety. However, due to the diversity in in shape, color, and posture of faces and the mouth with small area contour, ...
- research-articleSeptember 2024
Integrating YOLOv8 and CSPBottleneck based CNN for enhanced license plate character recognition
Journal of Real-Time Image Processing (SPJRTIP), Volume 21, Issue 5https://doi.org/10.1007/s11554-024-01537-2AbstractThe paper introduces an integrated methodology for license plate character recognition, combining YOLOv8 for segmentation and a CSPBottleneck-based CNN classifier for character recognition. The proposed approach incorporates pre-processing ...
- ArticleSeptember 2024
Weak Supervised Asphalt Pavement Segmentation
Computational Collective IntelligencePages 256–268https://doi.org/10.1007/978-3-031-70819-0_20AbstractThe labeling effort of deep learning-connected projects can introduce unexpected expenditure, especially when we are dealing with a challenging problem representation such as noisy camera images. Instead of a classical semi-supervised concept, ...
- articleOctober 2024
Revisiting vision-based violence detection in videos: A critical analysis
Neurocomputing (NEUROC), Volume 597, Issue Chttps://doi.org/10.1016/j.neucom.2024.128113AbstractAn ever-increasing installation of surveillance cameras at different places for ensuring public safety, security and asset protection has triggered the need for intelligent video surveillance to monitor the people and their behavior. Violence ...
- articleOctober 2024
Deep learning-based 3D reconstruction from multiple images: A survey
- Chuhua Wang,
- Md Alimoor Reza,
- Vibhas Vats,
- Yingnan Ju,
- Nikhil Thakurdesai,
- Yuchen Wang,
- David J. Crandall,
- Soon-heung Jung,
- Jeongil Seo
Neurocomputing (NEUROC), Volume 597, Issue Chttps://doi.org/10.1016/j.neucom.2024.128018AbstractReconstructing the three-dimensional structure of a scene is a classic and fundamental problem in computer vision, but it has been revolutionized by recent advancements in deep machine learning. In this paper, we survey this rich and growing ...
- research-articleOctober 2024
Visionary vigilance: Optimized YOLOV8 for fallen person detection with large-scale benchmark dataset
- Habib Khan,
- Inam Ullah,
- Mohammad Shabaz,
- Muhammad Faizan Omer,
- Muhammad Talha Usman,
- Mohammed Seghir Guellil,
- JaKeoung Koo
Image and Vision Computing (IAVC), Volume 149, Issue Chttps://doi.org/10.1016/j.imavis.2024.105195AbstractFalls pose a significant risk to elderly people, patients with diseases such as neurological disorders, cardiovascular diseases, and disabled children. This highlights the need for real-time intelligent fall detection (FD) systems for quick ...
Highlights- Developed DiverseFALL10500: a comprehensive dataset with 10,500 annotated images.
- Optimized YOLOv8s with focus module and CBAM integration for improved performance.
- Demonstrated superiority over SOTA techniques through extensive ...
- research-articleOctober 2024
Vision-based method to identify materials transported by dump trucks
Engineering Applications of Artificial Intelligence (EAAI), Volume 135, Issue Chttps://doi.org/10.1016/j.engappai.2024.108768AbstractConstruction and demolition waste (C&DW) materials are highly valuable as, in most cases, they can be reused. In other cases, such material can be highly pollutant. Therefore, their traceability becomes very important in the frame of sustainable ...
- research-articleOctober 2024
Real-time evaluation of object detection models across open world scenarios
Applied Soft Computing (APSC), Volume 163, Issue Chttps://doi.org/10.1016/j.asoc.2024.111921AbstractObject detection models have been experiencing significant improvements over the years due to advancements in deep learning techniques, increased availability of large-scale annotated datasets, and computational resources. Different object ...
Graphical AbstractDisplay Omitted
Highlights- Qualitative and Quantitative Comparison of object detection models.
- Assessment of model performance under real-world challenges in environmental scenes.
- Insights into model suitability for environmental monitoring, using detection ...
- research-articleSeptember 2024
DenseViT-XGB: A hybrid approach for dates varieties identification
Neurocomputing (NEUROC), Volume 596, Issue Chttps://doi.org/10.1016/j.neucom.2024.127976AbstractThe digitization of variety identification is of great importance for the improvement of farming practices in date fruit production. In this study, we have developed a hybrid approach called DenseViT-XGB for date fruit variety identification. ...
- research-articleSeptember 2024
Real-time lightweight drone detection model: Fine-grained Identification of four types of drones based on an improved Yolov7 model
Neurocomputing (NEUROC), Volume 596, Issue Chttps://doi.org/10.1016/j.neucom.2024.127941AbstractRecently, the rapid progress of artificial intelligence has enhanced the human-robot relationship through the development of several autonomous robots; such as drones. The overwhelming rise of drones has brought both relevant advantages and ...