Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2025
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection
IEEE Transactions on Multimedia (TOM), Volume 27Pages 72–83https://doi.org/10.1109/TMM.2024.3521681Camouflaged object detection (COD) aims to segment camouflaged objects which exhibit very similar patterns with the surrounding environment. Recent research works have shown that enhancing the feature representation via the frequency information can ...
- research-articleDecember 2024
Multi-Perspective Pseudo-Label Generation and Confidence-Weighted Training for Semi-Supervised Semantic Segmentation
IEEE Transactions on Multimedia (TOM), Volume 27Pages 300–311https://doi.org/10.1109/TMM.2024.3521801Self-training has been shown to achieve remarkable gains in semi-supervised semantic segmentation by creating pseudo-labels using unlabeled data. This approach, however, suffers from the quality of the generated pseudo-labels, and generating higher ...
- research-articleDecember 2024
Category-Contrastive Fine-Grained Crowd Counting and Beyond
IEEE Transactions on Multimedia (TOM), Volume 27Pages 477–488https://doi.org/10.1109/TMM.2024.3521823Crowd counting has drawn increasing attention across various fields. However, existing crowd counting tasks primarily focus on estimating the overall population, ignoring the behavioral and semantic information of different social groups within the crowd. ...
- research-articleDecember 2024
Progressive Region-to-Boundary Exploration Network for Camouflaged Object Detection
IEEE Transactions on Multimedia (TOM), Volume 27Pages 236–248https://doi.org/10.1109/TMM.2024.3521761Camouflaged object detection (COD) aims to segment targeted objects that have similar colors, textures, or shapes to their background environment. Due to the limited ability in distinguishing highly similar patterns, existing COD methods usually produce ...
- research-articleDecember 2024
SDE2D: Semantic-Guided Discriminability Enhancement Feature Detector and Descriptor
IEEE Transactions on Multimedia (TOM), Volume 27Pages 275–286https://doi.org/10.1109/TMM.2024.3521748Local feature detectors and descriptors serve various computer vision tasks, such as image matching, visual localization, and 3D reconstruction. To address the extreme variations of rotation and light in the real world, most detectors and descriptors ...
-
- research-articleJanuary 2024
CenterFormer: A Novel Cluster Center Enhanced Transformer for Unconstrained Dental Plaque Segmentation
IEEE Transactions on Multimedia (TOM), Volume 26Pages 10965–10978https://doi.org/10.1109/TMM.2024.3428349Dental plaque segmentation is crucial for maintaining oral health. However, accurately segmenting dental plaque in unconstrained environments can be challenging due to its low contrast and high variability in appearance. While existing transformer-based ...
- research-articleJanuary 2024
Low-Light Image Enhancement With SAM-Based Structure Priors and Guidance
IEEE Transactions on Multimedia (TOM), Volume 26Pages 10854–10866https://doi.org/10.1109/TMM.2024.3414328Low-light images often suffer from severe detail lost in darker areas and non-uniform illumination distribution across distinct regions. Thus, structure modeling and region-specific illumination manipulation are crucial for high-quality enhanced image ...
- research-articleJanuary 2024
Alignment-Free RGBT Salient Object Detection: Semantics-Guided Asymmetric Correlation Network and a Unified Benchmark
IEEE Transactions on Multimedia (TOM), Volume 26Pages 10692–10707https://doi.org/10.1109/TMM.2024.3410542RGB and Thermal (RGBT) Salient Object Detection (SOD) aims to achieve high-quality saliency prediction by exploiting the complementary information of visible and thermal image pairs, which are initially captured in an unaligned manner. However, existing ...
- research-articleJanuary 2024
IBFusion: An Infrared and Visible Image Fusion Method Based on Infrared Target Mask and Bimodal Feature Extraction Strategy
IEEE Transactions on Multimedia (TOM), Volume 26Pages 10610–10622https://doi.org/10.1109/TMM.2024.3410113The fusion of infrared (IR) and visible (VIS) images aims to capture complementary information from diverse sensors, resulting in a fused image that enhances the overall human perception of the scene. However, existing fusion methods face challenges ...
- research-articleJanuary 2024
Graph-Based Spatio-Temporal Semantic Reasoning Model for Anti-Occlusion Infrared Aerial Target Recognition
IEEE Transactions on Multimedia (TOM), Volume 26Pages 10530–10544https://doi.org/10.1109/TMM.2024.3408051Infrared target recognition and anti-interference in complex battlefields is one of the key technologies enabling the precise strike capability of aircraft. Currently, infrared-guided aircraft face complex interference such as natural backgrounds and ...
- research-articleJanuary 2024
Frequency-Based Matcher for Long-Tailed Semantic Segmentation
IEEE Transactions on Multimedia (TOM), Volume 26Pages 10395–10405https://doi.org/10.1109/TMM.2024.3407679The successful application of semantic segmentation technology in the real world has been among the most exciting achievements in the computer vision community over the past decade. Although the long-tailed phenomenon has been investigated in many fields, ...
- research-articleJanuary 2024
Difference-Aware Distillation for Semantic Segmentation
IEEE Transactions on Multimedia (TOM), Volume 26Pages 10069–10080https://doi.org/10.1109/TMM.2024.3405619In recent years, various distillation methods for semantic segmentation have been proposed. However, these methods typically train the student model to imitate the intermediate features or logits of the teacher model directly, thereby overlooking the high-...
- research-articleJanuary 2024
Pyramid Fusion Transformer for Semantic Segmentation
IEEE Transactions on Multimedia (TOM), Volume 26Pages 9630–9643https://doi.org/10.1109/TMM.2024.3396281The recently proposed MaskFormer [Cheng et al. (2021)] gives a refreshed perspective on the task of semantic segmentation: it shifts from the popular pixel-level classification paradigm to a mask-level classification method. In essence, it generates ...
- research-articleJanuary 2024
Dual-Guided Frequency Prototype Network for Few-Shot Semantic Segmentation
IEEE Transactions on Multimedia (TOM), Volume 26Pages 8874–8888https://doi.org/10.1109/TMM.2024.3383276Few-shot semantic segmentation is a challenging task that aims to segment novel classes in the query images given only a few annotated support samples. Most existing prototype-based approaches extract global or local prototypes by global average pooling (...
- research-articleJanuary 2024
M2FNet: Mask-Guided Multi-Level Fusion for RGB-T Pedestrian Detection
IEEE Transactions on Multimedia (TOM), Volume 26Pages 8678–8690https://doi.org/10.1109/TMM.2024.3381377RGB-Thermal pedestrian detection has shown many notable advantages in various lighting and weather conditions by combining the information from RGB-T images. Due to distinct imaging principles, RGB-T modalities consist of modality-specific and modality-...
- research-articleJanuary 2024
TFRNet: Semantic Segmentation Network with Token Filtration and Refinement Method
IEEE Transactions on Multimedia (TOM), Volume 26Pages 8242–8254https://doi.org/10.1109/TMM.2024.3378465Transformer-based semantic segmentation has been developed rapidly. Vision transformer (ViT) rely on self-attention mechanism which employs all image patches to compute long-range dependencies. ViT considers all tokens equally important for self-attention ...
- research-articleJanuary 2024
Cooperative Separation of Modality Shared-Specific Features for Visible-Infrared Person Re-Identification
IEEE Transactions on Multimedia (TOM), Volume 26Pages 8172–8183https://doi.org/10.1109/TMM.2024.3377139Visible-infrared person re-identification (VI-ReID) is a challenging task because the different imaging principles of visible and infrared images bring about huge modality discrepancy. Existing methods primarily address this issue by generating ...
- research-articleJanuary 2024
PointGT: A Method for Point-Cloud Classification and Segmentation Based on Local Geometric Transformation
IEEE Transactions on Multimedia (TOM), Volume 26Pages 8052–8062https://doi.org/10.1109/TMM.2024.3374580Recently, three-dimensional (3D) point-cloud analysis has been extensively utilized in the domain of machine vision, encompassing tasks include shape classification and segmentation. However the inherent disorder in point clouds poses a challenge in ...
- research-articleJanuary 2024
Robust Tracking via Bidirectional Transduction With Mask Information
IEEE Transactions on Multimedia (TOM), Volume 26Pages 4308–4319https://doi.org/10.1109/TMM.2023.3321497In the tracking literature, foreground and background information have been extensively investigated to discriminate a target from its surrounding background. However, both foreground and background possess their own spatial-temporal correlation ...
- research-articleJanuary 2024
Synthesize Boundaries: A Boundary-Aware Self-Consistent Framework for Weakly Supervised Salient Object Detection
IEEE Transactions on Multimedia (TOM), Volume 26Pages 4194–4205https://doi.org/10.1109/TMM.2023.3321393Fully supervised salient object detection (SOD) has made considerable progress based on expensive and time-consuming data with pixel-wise annotations. Recently, to relieve the labeling burden while maintaining performance, some scribble-based SOD methods ...