Interest point and salient region detections

Showing 1–20 of 33 results

research-article
January 2025
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection
IEEE Transactions on Multimedia (TOM), Volume 27, Pages 72–83, https://doi.org/10.1109/TMM.2024.3521681
Camouflaged object detection (COD) aims to segment camouflaged objects which exhibit very similar patterns with the surrounding environment. Recent research works have shown that enhancing the feature representation via the frequency information can ...
Total Citations: 0
research-article
December 2024
Multi-Perspective Pseudo-Label Generation and Confidence-Weighted Training for Semi-Supervised Semantic Segmentation
IEEE Transactions on Multimedia (TOM), Volume 27, Pages 300–311, https://doi.org/10.1109/TMM.2024.3521801
Self-training has been shown to achieve remarkable gains in semi-supervised semantic segmentation by creating pseudo-labels using unlabeled data. This approach, however, suffers from the quality of the generated pseudo-labels, and generating higher ...
Total Citations: 0
research-article
December 2024
Category-Contrastive Fine-Grained Crowd Counting and Beyond
IEEE Transactions on Multimedia (TOM), Volume 27, Pages 477–488, https://doi.org/10.1109/TMM.2024.3521823
Crowd counting has drawn increasing attention across various fields. However, existing crowd counting tasks primarily focus on estimating the overall population, ignoring the behavioral and semantic information of different social groups within the crowd. ...
Total Citations: 0
research-article
December 2024
Progressive Region-to-Boundary Exploration Network for Camouflaged Object Detection
- Guanghui Yue,
- Shangjie Wu,
- Tianwei Zhou,
- Gang Li,
- Jie Du,
- Yu Luo,
- Qiuping Jiang
IEEE Transactions on Multimedia (TOM), Volume 27, Pages 236–248, https://doi.org/10.1109/TMM.2024.3521761
Camouflaged object detection (COD) aims to segment targeted objects that have similar colors, textures, or shapes to their background environment. Due to the limited ability in distinguishing highly similar patterns, existing COD methods usually produce ...
Total Citations: 0
research-article
December 2024
SDE2D: Semantic-Guided Discriminability Enhancement Feature Detector and Descriptor
IEEE Transactions on Multimedia (TOM), Volume 27, Pages 275–286, https://doi.org/10.1109/TMM.2024.3521748
Local feature detectors and descriptors serve various computer vision tasks, such as image matching, visual localization, and 3D reconstruction. To address the extreme variations of rotation and light in the real world, most detectors and descriptors ...
Total Citations: 0
research-article
January 2024
CenterFormer: A Novel Cluster Center Enhanced Transformer for Unconstrained Dental Plaque Segmentation
- Wenfeng Song,
- Xuan Wang,
- Yuting Guo,
- Shuai Li,
- Bin Xia,
- Aimin Hao
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 10965–10978, https://doi.org/10.1109/TMM.2024.3428349
Dental plaque segmentation is crucial for maintaining oral health. However, accurately segmenting dental plaque in unconstrained environments can be challenging due to its low contrast and high variability in appearance. While existing transformer-based ...
Total Citations: 0
research-article
January 2024
Low-Light Image Enhancement With SAM-Based Structure Priors and Guidance
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 10854–10866, https://doi.org/10.1109/TMM.2024.3414328
Low-light images often suffer from severe detail loss in darker areas and non-uniform illumination distribution across distinct regions. Thus, structure modeling and region-specific illumination manipulation are crucial for high-quality enhanced image ...
Total Citations: 0
research-article
January 2024
Alignment-Free RGBT Salient Object Detection: Semantics-Guided Asymmetric Correlation Network and a Unified Benchmark
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 10692–10707, https://doi.org/10.1109/TMM.2024.3410542
RGB and Thermal (RGBT) Salient Object Detection (SOD) aims to achieve high-quality saliency prediction by exploiting the complementary information of visible and thermal image pairs, which are initially captured in an unaligned manner. However, existing ...
Total Citations: 1
research-article
January 2024
IBFusion: An Infrared and Visible Image Fusion Method Based on Infrared Target Mask and Bimodal Feature Extraction Strategy
- Yang Bai,
- Meijing Gao,
- Shiyu Li,
- Ping Wang,
- Ning Guan,
- Haozheng Yin,
- Yonghao Yan
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 10610–10622, https://doi.org/10.1109/TMM.2024.3410113
The fusion of infrared (IR) and visible (VIS) images aims to capture complementary information from diverse sensors, resulting in a fused image that enhances the overall human perception of the scene. However, existing fusion methods face challenges ...
Total Citations: 0
research-article
January 2024
Graph-Based Spatio-Temporal Semantic Reasoning Model for Anti-Occlusion Infrared Aerial Target Recognition
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 10530–10544, https://doi.org/10.1109/TMM.2024.3408051
Infrared target recognition and anti-interference in complex battlefields is one of the key technologies enabling the precise strike capability of aircraft. Currently, infrared-guided aircraft face complex interference such as natural backgrounds and ...
Total Citations: 0
research-article
January 2024
Frequency-Based Matcher for Long-Tailed Semantic Segmentation
- Shan Li,
- Lu Yang,
- Pu Cao,
- Liulei Li,
- Huadong Ma
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 10395–10405, https://doi.org/10.1109/TMM.2024.3407679
The successful application of semantic segmentation technology in the real world has been among the most exciting achievements in the computer vision community over the past decade. Although the long-tailed phenomenon has been investigated in many fields, ...
Total Citations: 0
research-article
January 2024
Difference-Aware Distillation for Semantic Segmentation
- Jianping Gou,
- Xiabin Zhou,
- Lan Du,
- Yibing Zhan,
- Wu Chen,
- Zhang Yi
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 10069–10080, https://doi.org/10.1109/TMM.2024.3405619
In recent years, various distillation methods for semantic segmentation have been proposed. However, these methods typically train the student model to imitate the intermediate features or logits of the teacher model directly, thereby overlooking the high-...
Total Citations: 0
research-article
January 2024
Pyramid Fusion Transformer for Semantic Segmentation
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 9630–9643, https://doi.org/10.1109/TMM.2024.3396281
The recently proposed MaskFormer [Cheng et al. (2021)] gives a refreshed perspective on the task of semantic segmentation: it shifts from the popular pixel-level classification paradigm to a mask-level classification method. In essence, it generates ...
Total Citations: 0
research-article
January 2024
Dual-Guided Frequency Prototype Network for Few-Shot Semantic Segmentation
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 8874–8888, https://doi.org/10.1109/TMM.2024.3383276
Few-shot semantic segmentation is a challenging task that aims to segment novel classes in the query images given only a few annotated support samples. Most existing prototype-based approaches extract global or local prototypes by global average pooling (...
Total Citations: 0
research-article
January 2024
M2FNet: Mask-Guided Multi-Level Fusion for RGB-T Pedestrian Detection
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 8678–8690, https://doi.org/10.1109/TMM.2024.3381377
RGB-Thermal pedestrian detection has shown many notable advantages in various lighting and weather conditions by combining the information from RGB-T images. Due to distinct imaging principles, RGB-T modalities consist of modality-specific and modality-...
Total Citations: 0
research-article
January 2024
TFRNet: Semantic Segmentation Network with Token Filtration and Refinement Method
- Yingdong Ma,
- Xiaoyu Hu
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 8242–8254, https://doi.org/10.1109/TMM.2024.3378465
Transformer-based semantic segmentation has developed rapidly. Vision transformers (ViTs) rely on a self-attention mechanism that employs all image patches to compute long-range dependencies. ViT considers all tokens equally important for self-attention ...
Total Citations: 1
research-article
January 2024
Cooperative Separation of Modality Shared-Specific Features for Visible-Infrared Person Re-Identification
- Xi Yang,
- Wenjiao Dong,
- Meijie Li,
- Ziyu Wei,
- Nannan Wang,
- Xinbo Gao
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 8172–8183, https://doi.org/10.1109/TMM.2024.3377139
Visible-infrared person re-identification (VI-ReID) is a challenging task because the different imaging principles of visible and infrared images bring about huge modality discrepancy. Existing methods primarily address this issue by generating ...
Total Citations: 4
research-article
January 2024
PointGT: A Method for Point-Cloud Classification and Segmentation Based on Local Geometric Transformation
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 8052–8062, https://doi.org/10.1109/TMM.2024.3374580
Recently, three-dimensional (3D) point-cloud analysis has been extensively utilized in the domain of machine vision, encompassing tasks including shape classification and segmentation. However, the inherent disorder in point clouds poses a challenge in ...
Total Citations: 4
research-article
January 2024
Robust Tracking via Bidirectional Transduction With Mask Information
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 4308–4319, https://doi.org/10.1109/TMM.2023.3321497
In the tracking literature, foreground and background information have been extensively investigated to discriminate a target from its surrounding background. However, both foreground and background possess their own spatial-temporal correlation ...
Total Citations: 1
research-article
January 2024
Synthesize Boundaries: A Boundary-Aware Self-Consistent Framework for Weakly Supervised Salient Object Detection
IEEE Transactions on Multimedia (TOM), Volume 26, Pages 4194–4205, https://doi.org/10.1109/TMM.2023.3321393
Fully supervised salient object detection (SOD) has made considerable progress based on expensive and time-consuming data with pixel-wise annotations. Recently, to relieve the labeling burden while maintaining performance, some scribble-based SOD methods ...
Total Citations: 0
