default search action
Zuxuan Wu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j15]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Adaptive Cross-Modal Transferable Adversarial Attacks From Images to Videos. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 3772-3783 (2024) - [j14]Zuxuan Wu, Zejia Weng, Wujian Peng, Xitong Yang, Ang Li, Larry S. Davis, Yu-Gang Jiang:
Building an Open-Vocabulary Video CLIP Model With Better Architectures, Optimization and Data. IEEE Trans. Pattern Anal. Mach. Intell. 46(7): 4747-4762 (2024) - [j13]Zejia Weng, Zuxuan Wu, Hengduo Li, Jingjing Chen, Yu-Gang Jiang:
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition. ACM Trans. Multim. Comput. Commun. Appl. 20(2): 35:1-35:18 (2024) - [c83]Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
SimDA: Simple Diffusion Adapter for Efficient Video Generation. CVPR 2024: 7827-7839 - [c82]Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionEditor: Editing Video Motion via Content-Aware Diffusion. CVPR 2024: 7882-7891 - [c81]Wujian Peng, Sicheng Xie, Zuyao You, Shiyi Lan, Zuxuan Wu:
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding. CVPR 2024: 13279-13288 - [c80]Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
OmniViD: A Generative Framework for Universal Video Understanding. CVPR 2024: 18209-18220 - [c79]Zhenxin Li, Shiyi Lan, José M. Álvarez, Zuxuan Wu:
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection. CVPR 2024: 20113-20123 - [c78]Yang Luo, Zhineng Chen, Peng Zhou, Zuxuan Wu, Xieping Gao, Yu-Gang Jiang:
Learning to Rank Patches for Unbiased Image Redundancy Reduction. CVPR 2024: 22831-22840 - [c77]Lingchen Meng, Shiyi Lan, Hengduo Li, José M. Álvarez, Zuxuan Wu, Yu-Gang Jiang:
SegIC: Unleashing the Emergent Correspondence for In-Context Segmentation. ECCV (38) 2024: 203-220 - [c76]Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu, Hang Xu, Yu-Gang Jiang:
MagDiff: Multi-alignment Diffusion for High-Fidelity Video Generation and Editing. ECCV (18) 2024: 205-221 - [c75]Bingwen Zhu, Fanyi Wang, Tianyi Lu, Peng Liu, Jingwen Su, Jinxiu Liu, Yanhao Zhang, Zuxuan Wu, Guo-Jun Qi, Yu-Gang Jiang:
Zero-shot High-fidelity and Pose-controllable Character Animation. IJCAI 2024: 1788-1797 - [c74]Tianyi Lu, Xing Zhang, Jiaxi Gu, Renjing Pei, Songcen Xu, Xingjun Ma, Hang Xu, Zuxuan Wu:
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models. ACM Multimedia 2024: 6745-6754 - [c73]Yifeng Gao, Yuhua Sun, Xingjun Ma, Zuxuan Wu, Yu-Gang Jiang:
ModelLock: Locking Your Model With a Spell. ACM Multimedia 2024: 11156-11165 - [i111]Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
Secrets of RLHF in Large Language Models Part II: Reward Modeling. CoRR abs/2401.06080 (2024) - [i110]Xiaoran Fan, Tao Ji, Changhao Jiang, Shuo Li, Senjie Jin, Sirui Song, Junke Wang, Boyang Hong, Lu Chen, Guodong Zheng, Ming Zhang, Caishuang Huang, Rui Zheng, Zhiheng Xi, Yuhao Zhou, Shihan Dou, Junjie Ye, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
MouSi: Poly-Visual-Expert Vision-Language Models. CoRR abs/2401.17221 (2024) - [i109]Qijun Feng, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang:
FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model. CoRR abs/2403.10242 (2024) - [i108]Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
OmniVid: A Generative Framework for Universal Video Understanding. CoRR abs/2403.17935 (2024) - [i107]Yang Luo, Zhineng Chen, Peng Zhou, Zuxuan Wu, Xieping Gao, Yu-Gang Jiang:
Learning to Rank Patches for Unbiased Image Redundancy Reduction. CoRR abs/2404.00680 (2024) - [i106]Bingwen Zhu, Fanyi Wang, Tianyi Lu, Peng Liu, Jingwen Su, Jinxiu Liu, Yanhao Zhang, Zuxuan Wu, Yu-Gang Jiang, Guo-Jun Qi:
PoseAnimate: Zero-shot high fidelity pose controllable character animation. CoRR abs/2404.13680 (2024) - [i105]Haoran Chen, Micah Goldblum, Zuxuan Wu, Yu-Gang Jiang:
Adaptive Rentention & Correction for Continual Learning. CoRR abs/2405.14318 (2024) - [i104]Yifeng Gao, Yuhua Sun, Xingjun Ma, Zuxuan Wu, Yu-Gang Jiang:
ModelLock: Locking Your Model With a Spell. CoRR abs/2405.16285 (2024) - [i103]Shuyuan Tu, Qi Dai, Zihao Zhang, Sicheng Xie, Zhi-Qi Cheng, Chong Luo, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion. CoRR abs/2405.20325 (2024) - [i102]Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, Songyang Gao, Lu Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments. CoRR abs/2406.04151 (2024) - [i101]Lingchen Meng, Jianwei Yang, Rui Tian, Xiyang Dai, Zuxuan Wu, Jianfeng Gao, Yu-Gang Jiang:
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs. CoRR abs/2406.04334 (2024) - [i100]Zhen Xing, Qi Dai, Zejia Weng, Zuxuan Wu, Yu-Gang Jiang:
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction. CoRR abs/2406.06465 (2024) - [i99]Zhenxin Li, Kailin Li, Shihao Wang, Shiyi Lan, Zhiding Yu, Yishen Ji, Zhiqi Li, Ziyue Zhu, Jan Kautz, Zuxuan Wu, Yu-Gang Jiang, José M. Álvarez:
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation. CoRR abs/2406.06978 (2024) - [i98]Xing Zhang, Jiaxi Gu, Haoyu Zhao, Shicong Wang, Hang Xu, Renjing Pei, Songcen Xu, Zuxuan Wu, Yu-Gang Jiang:
AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding. CoRR abs/2406.07091 (2024) - [i97]Miaosen Zhang, Yixuan Wei, Zhen Xing, Yifei Ma, Zuxuan Wu, Ji Li, Zheng Zhang, Qi Dai, Chong Luo, Xin Geng, Baining Guo:
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms. CoRR abs/2406.09397 (2024) - [i96]Junke Wang, Yi Jiang, Zehuan Yuan, Bingyue Peng, Zuxuan Wu, Yu-Gang Jiang:
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation. CoRR abs/2406.09399 (2024) - [i95]Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou, Boning Wang, Jiaqi Huang, Zunnan Xu, Xiu Li, Kehong Yuan, Yanyan Zu, Jiayao Ha, Qiong Gao, Licheng Jiao:
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results. CoRR abs/2406.11739 (2024) - [i94]Weijie Zheng, Xingjun Ma, Hanxun Huang, Zuxuan Wu, Yu-Gang Jiang:
Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers. CoRR abs/2408.01705 (2024) - [i93]Zejia Weng, Xitong Yang, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang:
GenRec: Unifying Video Generation and Recognition with Diffusion Models. CoRR abs/2408.15241 (2024) - [i92]Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Tao Mei:
DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation. CoRR abs/2409.07454 (2024) - 2023
- [j12]Tianyi Liu, Zuxuan Wu, Jingjing Chen, Yugang Jiang:
Multimodal Pre-training Method for Vision-language Understanding and Generation. Int. J. Softw. Informatics 13(2): 143-155 (2023) - [j11]Zhipeng Wei, Jingjing Chen, Micah Goldblum, Zuxuan Wu, Tom Goldstein, Yu-Gang Jiang, Larry S. Davis:
Towards Transferable Adversarial Attacks on Image and Video Transformers. IEEE Trans. Image Process. 32: 6346-6358 (2023) - [j10]Rui Wang, Zuxuan Wu, Zejia Weng, Jingjing Chen, Guo-Jun Qi, Yu-Gang Jiang:
Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation. IEEE Trans. Multim. 25: 1665-1673 (2023) - [j9]Junke Wang, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang:
FT-TDR: Frequency-Guided Transformer and Top-Down Refinement Network for Blind Face Inpainting. IEEE Trans. Multim. 25: 2382-2392 (2023) - [j8]Fan Luo, Shaoxiang Chen, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Self-Supervised Learning for Semi-Supervised Temporal Language Grounding. IEEE Trans. Multim. 25: 7747-7757 (2023) - [c72]Bingchen Huang, Zhineng Chen, Peng Zhou, Jiayin Chen, Zuxuan Wu:
Resolving Task Confusion in Dynamic Expansion Architectures for Class Incremental Learning. AAAI 2023: 908-916 - [c71]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang:
Look Before You Match: Instance Understanding Matters in Video Object Segmentation. CVPR 2023: 2268-2278 - [c70]Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava:
Towards Scalable Neural Representation for Diverse Videos. CVPR 2023: 6132-6142 - [c69]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang:
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning. CVPR 2023: 6312-6322 - [c68]Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding. CVPR 2023: 11402-11411 - [c67]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Enhancing the Self-Universality for Transferable Targeted Attacks. CVPR 2023: 12281-12290 - [c66]Hui Zhang, Zuxuan Wu, Zheng Wang, Zhineng Chen, Yu-Gang Jiang:
Prototypical Residual Networks for Anomaly Detection and Localization. CVPR 2023: 16281-16291 - [c65]Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
SVFormer: Semi-supervised Video Transformer for Action Recognition. CVPR 2023: 18816-18826 - [c64]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang:
ResFormer: Scaling ViTs with Multi-Resolution Training. CVPR 2023: 22721-22731 - [c63]Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, José M. Álvarez, Anima Anandkumar:
Vision Transformers are Good Mask Auto-Labelers. CVPR 2023: 23745-23755 - [c62]Shuyuan Tu, Qi Dai, Zuxuan Wu, Zhi-Qi Cheng, Han Hu, Yu-Gang Jiang:
Implicit Temporal Modeling with Learnable Alignment for Video Recognition. ICCV 2023: 19879-19890 - [c61]Yiqiang Lv, Jingjing Chen, Zhipeng Wei, Kai Chen, Zuxuan Wu, Yu-Gang Jiang:
Downstream Task-agnostic Transferable Attacks on Language-Image Pre-training Models. ICME 2023: 2831-2836 - [c60]Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang:
Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization. ICML 2023: 36978-36989 - [c59]Kai Chen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos. ACM Multimedia 2023: 698-708 - [c58]Yilun Zhang, Yuqian Fu, Xingjun Ma, Lizhe Qi, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
On the Importance of Spatial Relations for Few-shot Action Recognition. ACM Multimedia 2023: 2243-2251 - [c57]Haoran Chen, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
Multi-Prompt Alignment for Multi-Source Unsupervised Domain Adaptation. NeurIPS 2023 - [c56]Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection. NeurIPS 2023 - [i91]Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, José M. Álvarez, Anima Anandkumar:
Vision Transformers Are Good Mask Auto-Labelers. CoRR abs/2301.03992 (2023) - [i90]Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang:
Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization. CoRR abs/2302.00624 (2023) - [i89]Haoran Chen, Zuxuan Wu, Xintong Han, Menglin Jia, Yu-Gang Jiang:
PromptFusion: Decoupling Stability and Plasticity for Continual Learning. CoRR abs/2303.07223 (2023) - [i88]Hui Zhang, Zheng Wang, Zuxuan Wu, Yu-Gang Jiang:
DiffusionAD: Denoising Diffusion for Anomaly Detection. CoRR abs/2303.08730 (2023) - [i87]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang:
OmniTracker: Unifying Object Tracking by Tracking-with-Detection. CoRR abs/2303.12079 (2023) - [i86]Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava:
Towards Scalable Neural Representation for Diverse Videos. CoRR abs/2303.14124 (2023) - [i85]Shuyuan Tu, Qi Dai, Zuxuan Wu, Zhi-Qi Cheng, Han Hu, Yu-Gang Jiang:
Implicit Temporal Modeling with Learnable Alignment for Video Recognition. CoRR abs/2304.10465 (2023) - [i84]Junke Wang, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang:
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System. CoRR abs/2304.14407 (2023) - [i83]Wujian Peng, Zejia Weng, Hengduo Li, Zuxuan Wu:
BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning. CoRR abs/2305.12912 (2023) - [i82]Wenfeng Yan, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang:
Prompting Large Language Models to Reformulate Queries for Moment Localization. CoRR abs/2306.03422 (2023) - [i81]Yilun Zhang, Yuqian Fu, Xingjun Ma, Lizhe Qi, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
On the Importance of Spatial Relations for Few-shot Action Recognition. CoRR abs/2308.07119 (2023) - [i80]Zhen Xing, Qi Dai, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
SimDA: Simple Diffusion Adapter for Efficient Video Generation. CoRR abs/2308.09710 (2023) - [i79]Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei Zhang, Yu-Gang Jiang, Hang Xu:
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation. CoRR abs/2309.03549 (2023) - [i78]Zuxuan Wu, Zejia Weng, Wujian Peng, Xitong Yang, Ang Li, Larry S. Davis, Yu-Gang Jiang:
Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data. CoRR abs/2310.05010 (2023) - [i77]Zhen Xing, Qijun Feng, Haoran Chen, Qi Dai, Han Hu, Hang Xu, Zuxuan Wu, Yu-Gang Jiang:
A Survey on Video Diffusion Models. CoRR abs/2310.10647 (2023) - [i76]Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection. CoRR abs/2310.12152 (2023) - [i75]Tianyi Lu, Xing Zhang, Jiaxi Gu, Hang Xu, Renjing Pei, Songcen Xu, Zuxuan Wu:
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models. CoRR abs/2310.16400 (2023) - [i74]Junke Wang, Lingchen Meng, Zejia Weng, Bo He, Zuxuan Wu, Yu-Gang Jiang:
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning. CoRR abs/2311.07574 (2023) - [i73]Lingchen Meng, Shiyi Lan, Hengduo Li, José M. Álvarez, Zuxuan Wu, Yu-Gang Jiang:
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation. CoRR abs/2311.14671 (2023) - [i72]Hui Zhang, Zuxuan Wu, Zhen Xing, Jie Shao, Yu-Gang Jiang:
AdaDiff: Adaptive Step Selection for Fast Diffusion. CoRR abs/2311.14768 (2023) - [i71]Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Zuxuan Wu, Hang Xu, Yu-Gang Jiang:
VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model. CoRR abs/2311.17338 (2023) - [i70]Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang:
MotionEditor: Editing Video Motion via Content-Aware Diffusion. CoRR abs/2311.18830 (2023) - [i69]Zhen Xing, Qi Dai, Zihao Zhang, Hui Zhang, Han Hu, Zuxuan Wu, Yu-Gang Jiang:
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models. CoRR abs/2311.18837 (2023) - [i68]Wujian Peng, Sicheng Xie, Zuyao You, Shiyi Lan, Zuxuan Wu:
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding. CoRR abs/2312.00081 (2023) - [i67]Zhenxin Li, Shiyi Lan, José M. Álvarez, Zuxuan Wu:
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection. CoRR abs/2312.01696 (2023) - 2022
- [j7]Zuxuan Wu, Hengduo Li, Caiming Xiong, Yu-Gang Jiang, Larry S. Davis:
A Dynamic Frame Selection Framework for Fast Video Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 44(4): 1699-1711 (2022) - [j6]Xing Zhang, Zuxuan Wu, Yu-Gang Jiang:
SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition. IEEE Trans. Multim. 24: 313-322 (2022) - [j5]Xue Song, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval. IEEE Trans. Multim. 24: 2914-2923 (2022) - [c55]Kai Chen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Attacking Video Recognition Models with Bullet-Screen Comments. AAAI 2022: 312-320 - [c54]Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis:
Rethinking Pseudo Labels for Semi-supervised Object Detection. AAAI 2022: 1314-1322 - [c53]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Boosting the Transferability of Video Adversarial Examples via Temporal Translation. AAAI 2022: 2659-2667 - [c52]Zhipeng Wei, Jingjing Chen, Micah Goldblum, Zuxuan Wu, Tom Goldstein, Yu-Gang Jiang:
Towards Transferable Adversarial Attacks on Vision Transformers. AAAI 2022: 2668-2676 - [c51]Kezhi Kong, Guohao Li, Mucong Ding, Zuxuan Wu, Chen Zhu, Bernard Ghanem, Gavin Taylor, Tom Goldstein:
Robust Optimization as Data Augmentation for Large-scale Graphs. CVPR 2022: 60-69 - [c50]Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang:
ObjectFormer for Image Manipulation Detection and Localization. CVPR 2022: 2354-2363 - [c49]Lingchen Meng, Hengduo Li, Bor-Chun Chen, Shiyi Lan, Zuxuan Wu, Yu-Gang Jiang, Ser-Nam Lim:
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition. CVPR 2022: 12299-12308 - [c48]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan:
BEVT: BERT Pretraining of Video Transformers. CVPR 2022: 14713-14723 - [c47]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Cross-Modal Transferable Adversarial Attacks from Images to Videos. CVPR 2022: 15044-15053 - [c46]Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, Yu-Gang Jiang:
Efficient Video Transformers with Spatial-Temporal Token Selection. ECCV (35) 2022: 69-86 - [c45]Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang:
Semi-supervised Single-View 3D Reconstruction via Prototype Shape Priors. ECCV (1) 2022: 535-551 - [c44]Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang:
Semi-supervised Vision Transformers. ECCV (30) 2022: 605-620 - [c43]Junke Wang, Zuxuan Wu, Wenhao Ouyang, Xintong Han, Jingjing Chen, Yu-Gang Jiang, Ser-Nam Lim:
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection. ICMR 2022: 615-623 - [c42]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan:
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks. NeurIPS 2022 - [c41]Tianrui Guan, Jun Wang, Shiyi Lan, Rohan Chandra, Zuxuan Wu, Larry Davis, Dinesh Manocha:
M3DETR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers. WACV 2022: 2293-2303 - [i66]Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang:
ObjectFormer for Image Manipulation Detection and Localization. CoRR abs/2203.14681 (2022) - [i65]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu-Gang Jiang:
Deeper Insights into ViTs Robustness towards Common Corruptions. CoRR abs/2204.12143 (2022) - [i64]Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang:
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding. CoRR abs/2206.03484 (2022) - [i63]Rui Wang, Zuxuan Wu, Dongdong Chen, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Luowei Zhou, Lu Yuan, Yu-Gang Jiang:
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling. CoRR abs/2208.12257 (2022) - [i62]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Incorporating Locality of Images to Generate Targeted Transferable Adversarial Examples. CoRR abs/2209.03716 (2022) - [i61]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan:
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks. CoRR abs/2209.07526 (2022) - [i60]Haoran Chen, Zuxuan Wu, Yu-Gang Jiang:
Multi-Prompt Alignment for Multi-source Unsupervised Domain Adaptation. CoRR abs/2209.15210 (2022) - [i59]Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang:
Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors. CoRR abs/2209.15383 (2022) - [i58]Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
SVFormer: Semi-supervised Video Transformer for Action Recognition. CoRR abs/2211.13222 (2022) - [i57]Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang:
ResFormer: Scaling ViTs with Multi-Resolution Training. CoRR abs/2212.00776 (2022) - [i56]Hui Zhang, Zuxuan Wu, Zheng Wang, Zhineng Chen, Yu-Gang Jiang:
Prototypical Residual Networks for Anomaly Detection and Localization. CoRR abs/2212.02031 (2022) - [i55]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang:
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning. CoRR abs/2212.04500 (2022) - [i54]Junke Wang, Zhenxin Li, Chao Zhang, Jingjing Chen, Zuxuan Wu, Larry S. Davis, Yu-Gang Jiang:
Fighting Malicious Media Data: A Survey on Tampering Detection and Deepfake Detection. CoRR abs/2212.05667 (2022) - [i53]Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang:
Look Before You Match: Instance Understanding Matters in Video Object Segmentation. CoRR abs/2212.06826 (2022) - [i52]Bingchen Huang, Zhineng Chen, Peng Zhou, Jiayin Chen, Zuxuan Wu:
Resolving Task Confusion in Dynamic Expansion Architectures for Class Incremental Learning. CoRR abs/2212.14284 (2022) - 2021
- [j4]Zuxuan Wu, Hengduo Li, Yingbin Zheng, Caiming Xiong, Yu-Gang Jiang, Larry S. Davis:
A Coarse-to-Fine Framework for Resource Efficient Video Recognition. Int. J. Comput. Vis. 129(11): 2965-2977 (2021) - [c40]Peng Zhou, Ning Yu, Zuxuan Wu, Larry Davis, Abhinav Shrivastava, Ser-Nam Lim:
Deep Video Inpainting Detection. BMVC 2021: 35 - [c39]Bo He, Xitong Yang, Zuxuan Wu, Hao Chen, Ser-Nam Lim, Abhinav Shrivastava:
GTA: Global Temporal Attention for Video Action Understanding. BMVC 2021: 292 - [c38]Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis:
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition. CVPR 2021: 6155-6164 - [c37]Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge J. Belongie, Ser-Nam Lim:
Intentonomy: A Dataset and Study Towards Human Intent Understanding. CVPR 2021: 12986-12996 - [c36]Bor-Chun Chen, Zuxuan Wu, Larry S. Davis, Ser-Nam Lim:
Efficient Object Embedding for Spliced Image Retrieval. CVPR 2021: 14965-14975 - [c35]Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge J. Belongie, Ser-Nam Lim:
Exploring Visual Engagement Signals for Representation Learning. ICCV 2021: 4186-4197 - [c34]Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry Davis:
VideoLT: Large-scale Long-tailed Video Recognition. ICCV 2021: 7940-7949 - [c33]Zejia Weng, Lingchen Meng, Rui Wang, Zuxuan Wu, Yu-Gang Jiang:
A Multimodal Framework for Video Ads Understanding. ACM Multimedia 2021: 4843-4847 - [c32]Manli Shu, Zuxuan Wu, Micah Goldblum, Tom Goldstein:
Encoding Robustness to Image Style via Adversarial Feature Perturbations. NeurIPS 2021: 28042-28053 - [i51]Peng Zhou, Ning Yu, Zuxuan Wu, Larry S. Davis, Abhinav Shrivastava, Ser-Nam Lim:
Deep Video Inpainting Detection. CoRR abs/2101.11080 (2021) - [i50]Zuxuan Wu, Tom Goldstein, Larry S. Davis, Ser-Nam Lim:
THAT: Two Head Adversarial Training for Improving Robustness at Scale. CoRR abs/2103.13612 (2021) - [i49]Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge J. Belongie, Ser-Nam Lim:
Exploring Visual Engagement Signals for Representation Learning. CoRR abs/2104.07767 (2021) - [i48]Zejia Weng, Zuxuan Wu, Hengduo Li, Yu-Gang Jiang:
HMS: Hierarchical Modality Selection for Efficient Video Recognition. CoRR abs/2104.09760 (2021) - [i47]Junke Wang, Zuxuan Wu, Jingjing Chen, Yu-Gang Jiang:
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection. CoRR abs/2104.09770 (2021) - [i46]Tianrui Guan, Jun Wang, Shiyi Lan, Rohan Chandra, Zuxuan Wu, Larry Davis, Dinesh Manocha:
M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers. CoRR abs/2104.11896 (2021) - [i45]Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry Davis:
VideoLT: Large-scale Long-tailed Video Recognition. CoRR abs/2105.02668 (2021) - [i44]Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis:
Rethinking Pseudo Labels for Semi-Supervised Object Detection. CoRR abs/2106.00168 (2021) - [i43]Rui Wang, Zuxuan Wu, Zejia Weng, Jingjing Chen, Guo-Jun Qi, Yu-Gang Jiang:
Cross-domain Contrastive Learning for Unsupervised Domain Adaptation. CoRR abs/2106.05528 (2021) - [i42]Junke Wang, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang:
FT-TDR: Frequency-guided Transformer and Top-Down Refinement Network for Blind Face Inpainting. CoRR abs/2108.04424 (2021) - [i41]Zejia Weng, Lingchen Meng, Rui Wang, Zuxuan Wu, Yu-Gang Jiang:
A Multimodal Framework for Video Ads Understanding. CoRR abs/2108.12868 (2021) - [i40]Zhipeng Wei, Jingjing Chen, Micah Goldblum, Zuxuan Wu, Tom Goldstein, Yu-Gang Jiang:
Towards Transferable Adversarial Attacks on Vision Transformers. CoRR abs/2109.04176 (2021) - [i39]Fan Luo, Shaoxiang Chen, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Self-supervised Learning for Semi-supervised Temporal Language Grounding. CoRR abs/2109.11475 (2021) - [i38]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Boosting the Transferability of Video Adversarial Examples via Temporal Translation. CoRR abs/2110.09075 (2021) - [i37]Kai Chen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Attacking Video Recognition Models with Bullet-Screen Comments. CoRR abs/2110.15629 (2021) - [i36]Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang:
Semi-Supervised Vision Transformers. CoRR abs/2111.11067 (2021) - [i35]Junke Wang, Xitong Yang, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang:
Efficient Video Transformers with Spatial-Temporal Token Selection. CoRR abs/2111.11591 (2021) - [i34]Lingchen Meng, Hengduo Li, Bor-Chun Chen, Shiyi Lan, Zuxuan Wu, Yu-Gang Jiang, Ser-Nam Lim:
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition. CoRR abs/2111.15668 (2021) - [i33]Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan:
BEVT: BERT Pretraining of Video Transformers. CoRR abs/2112.01529 (2021) - [i32]Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang:
Cross-Modal Transferable Adversarial Attacks from Images to Videos. CoRR abs/2112.05379 (2021) - [i31]Tianyi Liu, Zuxuan Wu, Wenhan Xiong, Jingjing Chen, Yu-Gang Jiang:
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation. CoRR abs/2112.05587 (2021) - [i30]Menglin Jia, Bor-Chun Chen, Zuxuan Wu, Claire Cardie, Serge J. Belongie, Ser-Nam Lim:
Rethinking Nearest Neighbors for Visual Classification. CoRR abs/2112.08459 (2021) - 2020
- [b1]Zuxuan Wu:
Image and video Understanding with constrained Resources. University of Maryland, College Park, MD, USA, 2020 - [c31]Zhe Wu, Zuxuan Wu, Bharat Singh, Larry S. Davis:
Recognizing Instagram Filtered Images with Feature De-Stylization. AAAI 2020: 12418-12425 - [c30]Peng Zhou, Long Mai, Jianming Zhang, Ning Xu, Zuxuan Wu, Larry Davis:
M2KD: Incremental Learning via Multi-model and Multi-level Knowledge Distillation. BMVC 2020 - [c29]Hengduo Li, Zuxuan Wu, Chen Zhu, Caiming Xiong, Richard Socher, Larry S. Davis:
Learning From Noisy Anchors for One-Stage Object Detection. CVPR 2020: 10585-10594 - [c28]Zuxuan Wu, Ser-Nam Lim, Larry S. Davis, Tom Goldstein:
Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors. ECCV (4) 2020: 1-17 - [i29]Manli Shu, Zuxuan Wu, Micah Goldblum, Tom Goldstein:
Prepare for the Worst: Generalizing across Domain Shifts with Adversarial Batch Normalization. CoRR abs/2009.08965 (2020) - [i28]Kezhi Kong, Guohao Li, Mucong Ding, Zuxuan Wu, Chen Zhu, Bernard Ghanem, Gavin Taylor, Tom Goldstein:
FLAG: Adversarial Data Augmentation for Graph Neural Networks. CoRR abs/2010.09891 (2020) - [i27]Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge J. Belongie, Ser-Nam Lim:
Intentonomy: a Dataset and Study towards Human Intent Understanding. CoRR abs/2011.05558 (2020) - [i26]Bo He, Xitong Yang, Zuxuan Wu, Hao Chen, Ser-Nam Lim, Abhinav Shrivastava:
GTA: Global Temporal Attention for Video Action Understanding. CoRR abs/2012.08510 (2020) - [i25]Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis:
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition. CoRR abs/2012.14950 (2020)
2010 – 2019
- 2019
- [j3]Rui-Wei Zhao, Qi Zhang, Zuxuan Wu, Jianguo Li, Yu-Gang Jiang:
Visual Content Recognition by Exploiting Semantic Feature Map with Attention and Multi-task Learning. ACM Trans. Multim. Comput. Commun. Appl. 15(1s): 6:1-6:22 (2019) - [c27]Zuxuan Wu, Caiming Xiong, Chih-Yao Ma, Richard Socher, Larry S. Davis:
AdaFrame: Adaptive Frame Selection for Fast Video Recognition. CVPR 2019: 1278-1287 - [c26]Chih-Yao Ma, Zuxuan Wu, Ghassan AlRegib, Caiming Xiong, Zsolt Kira:
The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation. CVPR 2019: 6732-6740 - [c25]Zuxuan Wu, Xin Wang, Joseph Gonzalez, Tom Goldstein, Larry Davis:
ACE: Adapting to Changing Environments for Semantic Segmentation. ICCV 2019: 2121-2130 - [c24]Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott, Larry Davis:
FiNet: Compatible and Diverse Fashion Image Inpainting. ICCV 2019: 4480-4490 - [c23]Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong:
Self-Monitoring Navigation Agent via Auxiliary Progress Estimation. ICLR (Poster) 2019 - [c22]Zuxuan Wu, Caiming Xiong, Yu-Gang Jiang, Larry S. Davis:
LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition. NeurIPS 2019: 7778-7787 - [c21]Zuxuan Wu, Larry Davis, Leonid Sigal:
Weakly-Supervised Spatial Context Networks. WACV 2019: 1253-1261 - [i24]Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong:
Self-Monitoring Navigation Agent via Auxiliary Progress Estimation. CoRR abs/1901.03035 (2019) - [i23]Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott, Larry S. Davis:
Compatible and Diverse Fashion Image Inpainting. CoRR abs/1902.01096 (2019) - [i22]Chih-Yao Ma, Zuxuan Wu, Ghassan AlRegib, Caiming Xiong, Zsolt Kira:
The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation. CoRR abs/1903.01602 (2019) - [i21]Peng Zhou, Long Mai, Jianming Zhang, Ning Xu, Zuxuan Wu, Larry S. Davis:
M2KD: Multi-model and Multi-level Knowledge Distillation for Incremental Learning. CoRR abs/1904.01769 (2019) - [i20]Hengduo Li, Bharat Singh, Mahyar Najibi, Zuxuan Wu, Larry S. Davis:
An Analysis of Pre-Training on Object Detection. CoRR abs/1904.05871 (2019) - [i19]Zuxuan Wu, Xin Wang, Joseph E. Gonzalez, Tom Goldstein, Larry S. Davis:
ACE: Adapting to Changing Environments for Semantic Segmentation. CoRR abs/1904.06268 (2019) - [i18]Zuxuan Wu, Ser-Nam Lim, Larry Davis, Tom Goldstein:
Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors. CoRR abs/1910.14667 (2019) - [i17]Zuxuan Wu, Caiming Xiong, Yu-Gang Jiang, Larry S. Davis:
LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition. CoRR abs/1912.01601 (2019) - [i16]Hengduo Li, Zuxuan Wu, Chen Zhu, Caiming Xiong, Richard Socher, Larry S. Davis:
Learning from Noisy Anchors for One-stage Object Detection. CoRR abs/1912.05086 (2019) - [i15]Zhe Wu, Zuxuan Wu, Bharat Singh, Larry S. Davis:
Recognizing Instagram Filtered Images with Feature De-stylization. CoRR abs/1912.13000 (2019) - 2018
- [j2]Yu-Gang Jiang, Zuxuan Wu, Jun Wang, Xiangyang Xue, Shih-Fu Chang:
Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 40(2): 352-364 (2018) - [j1]Yu-Gang Jiang, Zuxuan Wu, Jinhui Tang, Zechao Li, Xiangyang Xue, Shih-Fu Chang:
Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification. IEEE Trans. Multim. 20(11): 3137-3147 (2018) - [c20]Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, Larry S. Davis:
VITON: An Image-Based Virtual Try-On Network. CVPR 2018: 7543-7552 - [c19]Zuxuan Wu, Tushar Nagarajan, Abhishek Kumar, Steven Rennie, Larry S. Davis, Kristen Grauman, Rogério Schmidt Feris:
BlockDrop: Dynamic Inference Paths in Residual Networks. CVPR 2018: 8817-8826 - [c18]Zuxuan Wu, Xintong Han, Yen-Liang Lin, Mustafa Gökhan Uzunbas, Tom Goldstein, Ser-Nam Lim, Larry S. Davis:
DCAN: Dual Channel-Wise Alignment Networks for Unsupervised Scene Adaptation. ECCV (5) 2018: 535-552 - [p1]Zuxuan Wu, Ting Yao, Yanwei Fu, Yu-Gang Jiang:
Deep learning for video classification and captioning. Frontiers of Multimedia Research 2018: 3-29 - [i14]Zuxuan Wu, Xintong Han, Yen-Liang Lin, Mustafa Gökhan Uzunbas, Tom Goldstein, Ser-Nam Lim, Larry S. Davis:
DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation. CoRR abs/1804.05827 (2018) - [i13]Zuxuan Wu, Caiming Xiong, Chih-Yao Ma, Richard Socher, Larry S. Davis:
AdaFrame: Adaptive Frame Selection for Fast Video Recognition. CoRR abs/1811.12432 (2018) - 2017
- [c17]Xintong Han, Zuxuan Wu, Phoenix X. Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, Larry S. Davis:
Automatic Spatially-Aware Fashion Concept Discovery. ICCV 2017: 1472-1480 - [c16]Xintong Han, Zuxuan Wu, Yu-Gang Jiang, Larry S. Davis:
Learning Fashion Compatibility with Bidirectional LSTMs. ACM Multimedia 2017: 1078-1086 - [c15]Rui-Wei Zhao, Zuxuan Wu, Jianguo Li, Yu-Gang Jiang:
Learning Semantic Feature Map for Visual Content Recognition. ACM Multimedia 2017: 1291-1299 - [c14]Zuxuan Wu, Yu-Gang Jiang, Larry Davis, Shih-Fu Chang:
LSVC2017: Large-Scale Video Classification Challenge. ACM Multimedia 2017: 1978-1979 - [i12]Zuxuan Wu, Larry S. Davis, Leonid Sigal:
Weakly-Supervised Spatial Context Networks. CoRR abs/1704.02998 (2017) - [i11]Yu-Gang Jiang, Zuxuan Wu, Jinhui Tang, Zechao Li, Xiangyang Xue, Shih-Fu Chang:
Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification. CoRR abs/1706.04508 (2017) - [i10]Shaoxiang Chen, Xi Wang, Yongyi Tang, Xinpeng Chen, Zuxuan Wu, Yu-Gang Jiang:
Aggregating Frame-level Features for Large-Scale Video Classification. CoRR abs/1707.00803 (2017) - [i9]Xintong Han, Zuxuan Wu, Yu-Gang Jiang, Larry S. Davis:
Learning Fashion Compatibility with Bidirectional LSTMs. CoRR abs/1707.05691 (2017) - [i8]Xintong Han, Zuxuan Wu, Phoenix X. Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, Larry S. Davis:
Automatic Spatially-aware Fashion Concept Discovery. CoRR abs/1708.01311 (2017) - [i7]Zuxuan Wu, Tushar Nagarajan, Abhishek Kumar, Steven Rennie, Larry S. Davis, Kristen Grauman, Rogério Schmidt Feris:
BlockDrop: Dynamic Inference Paths in Residual Networks. CoRR abs/1711.08393 (2017) - [i6]Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, Larry S. Davis:
VITON: An Image-based Virtual Try-on Network. CoRR abs/1711.08447 (2017) - 2016
- [c13]Zuxuan Wu, Yanwei Fu, Yu-Gang Jiang, Leonid Sigal:
Harnessing Object and Scene Semantics for Large-Scale Video Understanding. CVPR 2016: 3112-3121 - [c12]Chen Chen, Zuxuan Wu, Yu-Gang Jiang:
Emotion in Context: Deep Semantic Feature Fusion for Video Emotion Recognition. ACM Multimedia 2016: 127-131 - [c11]Yongqing Sun, Zuxuan Wu, Xi Wang, Hiroyuki Arai, Tetsuya Kinebuchi, Yu-Gang Jiang:
Exploiting Objects with LSTMs for Video Categorization. ACM Multimedia 2016: 142-146 - [c10]Zuxuan Wu, Yu-Gang Jiang, Xi Wang, Hao Ye, Xiangyang Xue:
Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification. ACM Multimedia 2016: 791-800 - [i5]Zuxuan Wu, Ting Yao, Yanwei Fu, Yu-Gang Jiang:
Deep Learning for Video Classification and Captioning. CoRR abs/1609.06782 (2016) - 2015
- [c9]Qi Dai, Rui-Wei Zhao, Zuxuan Wu, Xi Wang, Zichen Gu, Wenhai Wu, Yu-Gang Jiang:
Fudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact in Movies with Deep Learning. MediaEval 2015 - [c8]Hao Ye, Zuxuan Wu, Rui-Wei Zhao, Xi Wang, Yu-Gang Jiang, Xiangyang Xue:
Evaluating Two-Stream CNN for Video Classification. ICMR 2015: 435-442 - [c7]Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue:
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification. ACM Multimedia 2015: 461-470 - [c6]Yongqing Sun, Zuxuan Wu, Xi Wang, Kyoko Sudo, Yukinobu Taniguchi, Tetsuya Kinebuchi, Yu-Gang Jiang:
NTT-Fudan Team @ TRECVID 2015: Multimedia Event Detection. TRECVID 2015 - [c5]Zuxuan Wu, Hao Ye, Yu-Gang Jiang, Xiangyang Xue:
Fudan at TRECVID 2015: Adaptive Feature Fusion for Multimedia Event Detection in Videos. TRECVID 2015 - [i4]Yu-Gang Jiang, Zuxuan Wu, Jun Wang, Xiangyang Xue, Shih-Fu Chang:
Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks. CoRR abs/1502.07209 (2015) - [i3]Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue:
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification. CoRR abs/1504.01561 (2015) - [i2]Hao Ye, Zuxuan Wu, Rui-Wei Zhao, Xi Wang, Yu-Gang Jiang, Xiangyang Xue:
Evaluating Two-Stream CNN for Video Classification. CoRR abs/1504.01920 (2015) - [i1]Zuxuan Wu, Yu-Gang Jiang, Xi Wang, Hao Ye, Xiangyang Xue, Jun Wang:
Fusing Multi-Stream Deep Networks for Video Classification. CoRR abs/1509.06086 (2015) - 2014
- [c4]Jian Tu, Zuxuan Wu, Qi Dai, Yu-Gang Jiang, Xiangyang Xue:
Challenge Huawei challenge: Fusing multimodal features with deep neural networks for Mobile Video Annotation. ICME Workshops 2014: 1-6 - [c3]Qi Dai, Zuxuan Wu, Yu-Gang Jiang, Xiangyang Xue, Jinhui Tang:
Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks. MediaEval 2014 - [c2]Zuxuan Wu, Yu-Gang Jiang, Jun Wang, Jian Pu, Xiangyang Xue:
Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification. ACM Multimedia 2014: 167-176 - [c1]Zuxuan Wu, Rui-Wei Zhao:
Fudan Team at TRECVID 2014: Multimedia Event Detection. TRECVID 2014
Coauthor Index
aka: Larry S. Davis
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-07 20:29 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint