Yunzhi Zhuge

Followers

Following

Public Views

Interests

Uploads

Papers by Yunzhi Zhuge

Learning Local-Global Representation for Scribble-based RGB-D Salient Object Detection via Transformer

IEEE transactions on circuits and systems for video technology, 2024

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

arXiv (Cornell University), Mar 18, 2024

Download

DME: Unveiling the Bias for Better Generalized Monocular Depth Estimation

Proceedings of the ... AAAI Conference on Artificial Intelligence, Mar 24, 2024

Download

Few-shot Semantic Segmentation by Exploiting Dynamic and Regional Contexts

CTVIS: Consistent Training for Online Video Instance Segmentation

arXiv (Cornell University), Jul 24, 2023

Download

TrackDiffusion: Multi-object Tracking Data Generation via Diffusion Models

arXiv (Cornell University), Nov 30, 2023

Download

Multi-granularity Transformer for Image Super-Resolution

Lecture Notes in Computer Science, 2023

Joint Learning of Saliency Detection and Weakly Supervised Semantic Segmentation

2019 IEEE/CVF International Conference on Computer Vision (ICCV), 2019

Download

Deep Reasoning Network for Few-shot Semantic Segmentation

Proceedings of the 29th ACM International Conference on Multimedia, 2021

Download

Multi-Source Weak Supervision for Saliency Detection

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019

Download

Deep Embedding Features for Salient Object Detection

Proceedings of the AAAI Conference on Artificial Intelligence, 2019

Benefiting from the rapid development of Convolutional Neural Networks (CNNs), some salient objec... more Benefiting from the rapid development of Convolutional Neural Networks (CNNs), some salient object detection methods have achieved remarkable results by utilizing multi-level convolutional features. However, the saliency training datasets is of limited scale due to the high cost of pixel-level labeling, which leads to a limited generalization of the trained model on new scenarios during testing. Besides, some FCN-based methods directly integrate multi-level features, ignoring the fact that the noise in some features are harmful to saliency detection. In this paper, we propose a novel approach that transforms prior information into an embedding space to select attentive features and filter out outliers for salient object detection. Our network firstly generates a coarse prediction map through an encorder-decorder structure. Then a Feature Embedding Network (FEN) is trained to embed each pixel of the coarse map into a metric space, which incorporates much attentive features that highl...

Download