default search action
Kai Chen 0026
Person information
- unicode name: 陈恺
- affiliation: Shanghai AI Laboratory, Guangzhou, China
- affiliation: SenseTime Research, Hong Kong
- affiliation (PhD 2019): Chinese University of Hong Kong, MMLab, Hong Kong
Other persons with the same name
- Kai Chen — disambiguation page
- Kai Chen 0001 — University of Science and Technology of China, Department of Electronics Science and Technology, Hefei, China (and 1 more)
- Kai Chen 0002 — University of California at Berkeley, EECS Department, CA, USA
- Kai Chen 0003 — Google, Mountain View, CA, USA (and 1 more)
- Kai Chen 0004 — Xi'an Jiaotong University, School of Electronic and Information Engineering
- Kai Chen 0005 — Hong Kong University of Science and Technology (and 1 more)
- Kai Chen 0006 — Shanghai Jiatong University, Institute of Image Communication and Network Engineering, China
- Kai Chen 0007 — Cisco Systems (and 1 more)
- Kai Chen 0008 — University of Science and Technology of China, Department of Modern Physics
- Kai Chen 0009 — University of Science and Technology of China, Department of Computer Science
- Kai Chen 0010 — Google (and 2 more)
- Kai Chen 0011 — University of Fribourg, Switzerland
- Kai Chen 0012 — Chinese Academy of Sciences, Institute of Information Engineering, SKLOIS, Beijing, China
- Kai Chen 0013 — Qualcomm, Wireless R&D Department, Beijing, China (and 2 more)
- Kai Chen 0014 — China University of Geosciences, School of Geophysics and Information Technology, Beijing, China
- Kai Chen 0015 — University of Southern California, Dept. of Industrial and Systems Engineering
- Kai Chen 0016 — Huazhong University of Science and Technology, School of Computer Science and Technology, Services Computing Technology and System Lab / Cluster and Grid Computing Lab, China
- Kai Chen 0017 — Zhejiang University
- Kai Chen 0018 — University of Electronic Science and Technology of China, School of Automation Engineering, Chengdu, China
- Kai Chen 0019 — Xiamen University, China
- Kai Chen 0020 — National University of Defense Technology, National Laboratory for Parallel and Distributed Processing, Changsha, China
- Kai Chen 0021 — Wuhan University of Technology, Laboratory of Intelligent Manufacture and Control, China
- Kai Chen 0022 — University of Arizona, Department of Electrical and Computer Engineering, Tucson, AZ, USA
- Kai Chen 0023 — Huazhong University of Science and Technology, School of Automation, National Key Laboratory of Science and Technology on Multi-spectral Information Processing, Wuhan, China
- Kai Chen 0024 — Wuhan University, School of Remote Sensing and Information Engineering, Wuhan, China
- Kai Chen 0025 — BUPT, School of Information and Communication Engineering, Beijing, China
- Kai Chen 0027 — Fudan University, School of Computer Science, Shanghai, China
- Kai Chen 0028 — Chinese University of Hong Kong, Department of Computer Science and Engineering, Hong Kong
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j4]Xiangtai Li, Henghui Ding, Haobo Yuan, Wenwei Zhang, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, Chen Change Loy:
Transformer-Based Visual Segmentation: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 10138-10163 (2024) - [j3]Yuan Liu, Songyang Zhang, Jiacheng Chen, Kai Chen, Dahua Lin:
PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling. Trans. Mach. Learn. Res. 2024 (2024) - [c57]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. ACL (Findings) 2024: 6884-6915 - [c56]Xi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura:
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models. ACL (Findings) 2024: 6976-6987 - [c55]Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen:
ANAH: Analytical Annotation of Hallucinations in Large Language Models. ACL (1) 2024: 8135-8158 - [c54]Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao:
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models. ACL (Findings) 2024: 9354-9366 - [c53]Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao:
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step. ACL (1) 2024: 9510-9529 - [c52]Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan, Wenhan Luo, Zikun Liu, Mingde Qiao, Junjun Jiang, Kui Jiang, Yao Xiao, Chuyang Sun, Jinhui Hu, Weijian Ruan, Yubo Dong, Kai Chen, Hyejeong Jo, Jiahao Qin, Bingjie Han, Pinle Qin, Rui Chai, Pengyuan Wang:
MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results. CVPR Workshops 2024: 1153-1161 - [c51]Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang:
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation. CVPR 2024: 1491-1500 - [c50]Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma:
Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text. CVPR 2024: 6243-6253 - [c49]Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy:
Towards Language-Driven Video Inpainting via Multimodal Large Language Models. CVPR 2024: 12501-12511 - [c48]Tai Wang, Xiaohan Mao, Chenming Zhu, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang:
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI. CVPR 2024: 19757-19767 - [c47]Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough for all Segmentation? CVPR 2024: 27948-27959 - [c46]Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He:
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models. CVPR 2024: 28076-28086 - [c45]Xiang Xu, Lingdong Kong, Hui Shuai, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Qingshan Liu:
4D Contrastive Superflows are Dense 3D Representation Learners. ECCV (1) 2024: 58-80 - [c44]Yanan Sun, Yanchen Liu, Yinhao Tang, Wenjie Pei, Kai Chen:
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation. ECCV (11) 2024: 92-109 - [c43]Chenming Zhu, Tai Wang, Wenwei Zhang, Kai Chen, Xihui Liu:
ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities. ECCV (8) 2024: 151-168 - [c42]Junhao Zhuang, Yanhong Zeng, Wenran Liu, Chun Yuan, Kai Chen:
A Task Is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting. ECCV (58) 2024: 195-211 - [c41]Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
MMBench: Is Your Multi-modal Model an All-Around Player? ECCV (6) 2024: 216-233 - [c40]Haobo Yuan, Xiangtai Li, Chong Zhou, Yining Li, Kai Chen, Chen Change Loy:
Open-Vocabulary SAM: Segment and Recognize Twenty-Thousand Classes Interactively. ECCV (43) 2024: 419-437 - [c39]Jingming Zhuo, Songyang Zhang, Xinyu Fang, Haodong Duan, Dahua Lin, Kai Chen:
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs. EMNLP (Findings) 2024: 1950-1976 - [c38]Zhejian Zhou, Jiayu Wang, Dahua Lin, Kai Chen:
Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia. EMNLP (Findings) 2024: 3806-3820 - [c37]Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Alan Huang, Songyang Zhang, Kai Chen, Zhixin Yin, Zongwen Shen, Jidong Ge, Vincent Ng:
LawBench: Benchmarking Legal Knowledge of Large Language Models. EMNLP 2024: 7933-7962 - [c36]Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen, Xipeng Qiu:
Can AI Assistants Know What They Don't Know? ICML 2024 - [c35]Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source ToolKit for Evaluating Large Multi-Modality Models. ACM Multimedia 2024: 11198-11201 - [c34]Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen:
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues. NAACL-HLT (Findings) 2024: 3184-3200 - [c33]Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. NAACL-HLT 2024: 3712-3724 - [i119]Haobo Yuan, Xiangtai Li, Chong Zhou, Yining Li, Kai Chen, Chen Change Loy:
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively. CoRR abs/2401.02955 (2024) - [i118]Huanjun Kong, Songyang Zhang, Kai Chen:
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance. CoRR abs/2401.08772 (2024) - [i117]Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy:
Towards Language-Driven Video Inpainting via Multimodal Large Language Models. CoRR abs/2401.10226 (2024) - [i116]Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang:
RAP-SAM: Towards Real-Time All-Purpose Segment Anything. CoRR abs/2401.10228 (2024) - [i115]Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough For All Segmentation? CoRR abs/2401.10229 (2024) - [i114]Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen, Xipeng Qiu:
Can AI Assistants Know What They Don't Know? CoRR abs/2401.13275 (2024) - [i113]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024) - [i112]Huaiyuan Ying, Shuo Zhang, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, Yichuan Ma, Jiawei Hong, Kuikun Liu, Ziyi Wang, Yudong Wang, Zijian Wu, Shuaibin Li, Fengzhe Zhou, Hongwei Liu, Songyang Zhang, Wenwei Zhang, Hang Yan, Xipeng Qiu, Jiayu Wang, Kai Chen, Dahua Lin:
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning. CoRR abs/2402.06332 (2024) - [i111]Tian Lan, Wenwei Zhang, Chen Xu, Heyan Huang, Dahua Lin, Kai Chen, Xianling Mao:
CriticBench: Evaluating Large Language Models as Critic. CoRR abs/2402.13764 (2024) - [i110]Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, Jinyang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, Ping Yang, Dahua Lin, Chao Peng, Kai Chen:
DevBench: A Comprehensive Benchmark for Software Development. CoRR abs/2403.08604 (2024) - [i109]Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao:
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models. CoRR abs/2403.12881 (2024) - [i108]Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma:
Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text. CoRR abs/2403.16897 (2024) - [i107]Lingdong Kong, Xiang Xu, Jun Cen, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu:
Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding. CoRR abs/2403.17010 (2024) - [i106]Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, Penglong Jiao, Zhenjiang Jin, Zhikai Lei, Jiaxing Li, Jingwen Li, Linyang Li, Shuaibin Li, Wei Li, Yining Li, Hongwei Liu, Jiangning Liu, Jiawei Hong, Kaiwen Liu, Kuikun Liu, Xiaoran Liu, Chengqi Lv, Haijun Lv, Kai Lv, Li Ma, Runyuan Ma, Zerun Ma, Wenchang Ning, Linke Ouyang, Jiantao Qiu, Yuan Qu, Fukai Shang, Yunfan Shao, Demin Song, Zifan Song, Zhihao Sui, Peng Sun, Yu Sun, Huanze Tang, Bin Wang, Guoteng Wang, Jiaqi Wang, Jiayu Wang, Rui Wang, Yudong Wang, Ziyi Wang, Xingjian Wei, Qizhen Weng, Fan Wu, Yingtong Xiong, Xiaomeng Zhao, et al.:
InternLM2 Technical Report. CoRR abs/2403.17297 (2024) - [i105]Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He:
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models. CoRR abs/2404.00906 (2024) - [i104]Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. CoRR abs/2404.06480 (2024) - [i103]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024) - [i102]Jiahao Wang, Wenqi Shao, Mengzhao Chen, Chengyue Wu, Yong Liu, Kaipeng Zhang, Songyang Zhang, Kai Chen, Ping Luo:
Adapting LLaMA Decoder to Vision Transformer. CoRR abs/2404.06773 (2024) - [i101]Lingdong Kong, Xiang Xu, Jiawei Ren, Wenwei Zhang, Liang Pan, Kai Chen, Wei Tsang Ooi, Ziwei Liu:
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving. CoRR abs/2405.05258 (2024) - [i100]Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingwei Wang, Yinpeng Dong, Bo Yang, Shengyin Jiang, Zeliang Ma, Dengyi Ji, Haiwen Li, Xingliang Huang, Yu Tian, Genghua Kou, Fan Jia, Yingfei Liu, Tiancai Wang, Ying Li, Xiaoshuai Hao, Yifan Yang, Hui Zhang, Mengchuan Wei, Yi Zhou, Haimei Zhao, Jing Zhang, Jinke Li, Xiao He, Xiaoqiang Cheng, Bingyang Zhang, Lirong Zhao, Dianlei Ding, Fangsheng Liu, Yixiang Yan, Hongming Wang, Nanfei Ye, Lun Luo, Yubo Tian, Yiwei Zuo, Zhe Cao, Yi Ren, Yunfan Li, Wenjie Liu, Xun Wu, Yifan Mao, Ming Li, Jian Liu, Jiayang Liu, Zihan Qin, Cunxi Chu, Jialei Xu, Wenbo Zhao, Junjun Jiang, Xianming Liu, Ziyan Wang, Chiwei Li, Shilong Li, Chendong Yuan, Songyue Yang, Wentao Liu, Peng Chen, Bin Zhou, Yubo Wang, Chi Zhang, Jianhang Sun, Hai Chen, Xiao Yang, Lizhong Wang, Dongyi Fu, Yongchun Lin, Huitong Yang, Haoang Li, Yadan Luo, Xianjing Cheng, Yong Xu:
The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition. CoRR abs/2405.08816 (2024) - [i99]Kai Hu, Weichen Yu, Tianjun Yao, Xiang Li, Wenhe Liu, Lijun Yu, Yining Li, Kai Chen, Zhiqiang Shen, Matt Fredrikson:
Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization. CoRR abs/2405.09113 (2024) - [i98]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. CoRR abs/2405.12209 (2024) - [i97]Jiahao Sun, Chunmei Qing, Xiang Xu, Lingdong Kong, Youquan Liu, Li Li, Chenming Zhu, Jingwei Zhang, Zeqi Xiao, Runnan Chen, Tai Wang, Wenwei Zhang, Kai Chen:
An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models. CoRR abs/2405.14870 (2024) - [i96]Shaoyuan Xie, Lingdong Kong, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu:
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving. CoRR abs/2405.17426 (2024) - [i95]Zifan Song, Yudong Wang, Wenwei Zhang, Kuikun Liu, Chengqi Lyu, Demin Song, Qipeng Guo, Hang Yan, Dahua Lin, Kai Chen, Cairong Zhao:
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data. CoRR abs/2405.19265 (2024) - [i94]Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen:
ANAH: Analytical Annotation of Hallucinations in Large Language Models. CoRR abs/2405.20315 (2024) - [i93]Huaiyuan Ying, Zijian Wu, Yihan Geng, Jiayu Wang, Dahua Lin, Kai Chen:
Lean Workbook: A large-scale Lean problem set formalized from natural language math problems. CoRR abs/2406.03847 (2024) - [i92]Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan, Wenhan Luo, Zikun Liu, Mingde Qiao, Junjun Jiang, Kui Jiang, Yao Xiao, Chuyang Sun, Jinhui Hu, Weijian Ruan, Yubo Dong, Kai Chen, Hyejeong Jo, Jiahao Qin, Bingjie Han, Pinle Qin, Rui Chai, Pengyuan Wang:
MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results. CoRR abs/2406.07006 (2024) - [i91]Baiang Li, Sizhuo Ma, Yanhong Zeng, Xiaogang Xu, Youqing Fang, Zhao Zhang, Jian Wang, Kai Chen:
Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior. CoRR abs/2406.09389 (2024) - [i90]Xinyu Fang, Kangrui Mao, Haodong Duan, Xiangyu Zhao, Yining Li, Dahua Lin, Kai Chen:
MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding. CoRR abs/2406.14515 (2024) - [i89]Yuxuan Qiao, Haodong Duan, Xinyu Fang, Junming Yang, Lin Chen, Songyang Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs. CoRR abs/2406.14544 (2024) - [i88]Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge:
InternLM-Law: An Open Source Chinese Legal Large Language Model. CoRR abs/2406.14887 (2024) - [i87]Jianzong Wu, Xiangtai Li, Yanhong Zeng, Jiangning Zhang, Qianyu Zhou, Yining Li, Yunhai Tong, Kai Chen:
MotionBooth: Motion-Aware Customized Text-to-Video Generation. CoRR abs/2406.17758 (2024) - [i86]Xiangyu Zhao, Xiangtai Li, Haodong Duan, Haian Huang, Yining Li, Kai Chen, Hua Yang:
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning. CoRR abs/2406.17770 (2024) - [i85]Yanan Sun, Yanchen Liu, Yinhao Tang, Wenjie Pei, Kai Chen:
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation. CoRR abs/2406.18958 (2024) - [i84]Yicheng Chen, Xiangtai Li, Yining Li, Yanhong Zeng, Jianzong Wu, Xiangyu Zhao, Kai Chen:
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language. CoRR abs/2406.20085 (2024) - [i83]Junyao Gao, Yanchen Liu, Yanan Sun, Yinhao Tang, Yanhong Zeng, Kai Chen, Cairong Zhao:
StyleShot: A Snapshot on Any Style. CoRR abs/2407.01414 (2024) - [i82]Chenming Zhu, Tai Wang, Wenwei Zhang, Kai Chen, Xihui Liu:
ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities. CoRR abs/2407.01525 (2024) - [i81]Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024) - [i80]Yuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen:
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models. CoRR abs/2407.04693 (2024) - [i79]Xiang Xu, Lingdong Kong, Hui Shuai, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Qingshan Liu:
4D Contrastive Superflows are Dense 3D Representation Learners. CoRR abs/2407.06190 (2024) - [i78]Zhening Xing, Gereon Fox, Yanhong Zeng, Xingang Pan, Mohamed Elgharib, Christian Theobalt, Kai Chen:
Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models. CoRR abs/2407.08701 (2024) - [i77]Jize Wang, Zerun Ma, Yining Li, Songyang Zhang, Cailian Chen, Kai Chen, Xinyi Le:
GTA: A Benchmark for General Tool Agents. CoRR abs/2407.08713 (2024) - [i76]Songyang Zhang, Chuyu Zhang, Yingfan Hu, Haowen Shen, Kuikun Liu, Zerun Ma, Fengzhe Zhou, Wenwei Zhang, Xuming He, Dahua Lin, Kai Chen:
CIBench: Evaluating Your LLMs with a Code Interpreter Plugin. CoRR abs/2407.10499 (2024) - [i75]Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models. CoRR abs/2407.11691 (2024) - [i74]Mo Li, Songyang Zhang, Yunxin Liu, Kai Chen:
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? CoRR abs/2407.11963 (2024) - [i73]Xi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura:
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models. CoRR abs/2407.15415 (2024) - [i72]Zijian Wu, Jiayu Wang, Dahua Lin, Kai Chen:
LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN prover. CoRR abs/2407.17227 (2024) - [i71]Zhenzhi Wang, Yixuan Li, Yanhong Zeng, Youqing Fang, Yuwei Guo, Wenran Liu, Jing Tan, Kai Chen, Tianfan Xue, Bo Dai, Dahua Lin:
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation. CoRR abs/2407.17438 (2024) - [i70]Zehui Chen, Kuikun Liu, Qiuchen Wang, Jiangning Liu, Wenwei Zhang, Kai Chen, Feng Zhao:
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher. CoRR abs/2407.20183 (2024) - [i69]Zhi Chen, Qiguang Chen, Libo Qin, Qipeng Guo, Haijun Lv, Yicheng Zou, Wanxiang Che, Hang Yan, Kai Chen, Dahua Lin:
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices. CoRR abs/2409.01893 (2024) - [i68]Haoran Que, Feiyu Duan, Liqun He, Yutao Mou, Wangchunshu Zhou, Jiaheng Liu, Wenge Rong, Zekun Moore Wang, Jian Yang, Ge Zhang, Junran Peng, Zhaoxiang Zhang, Songyang Zhang, Kai Chen:
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models. CoRR abs/2409.16191 (2024) - [i67]Zhejian Zhou, Jiayu Wang, Dahua Lin, Kai Chen:
Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia. CoRR abs/2409.17391 (2024) - [i66]Jingming Zhuo, Songyang Zhang, Xinyu Fang, Haodong Duan, Dahua Lin, Kai Chen:
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs. CoRR abs/2410.12405 (2024) - [i65]Tian Lan, Wenwei Zhang, Chengqi Lyu, Shuaibin Li, Chen Xu, Heyan Huang, Dahua Lin, Xian-Ling Mao, Kai Chen:
Training Language Models to Critique With Multi-agent Feedback. CoRR abs/2410.15287 (2024) - [i64]Zijian Wu, Suozhi Huang, Zhejian Zhou, Huaiyuan Ying, Jiayu Wang, Dahua Lin, Kai Chen:
InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems. CoRR abs/2410.15700 (2024) - [i63]Maosong Cao, Alexander Lam, Haodong Duan, Hongwei Liu, Songyang Zhang, Kai Chen:
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution. CoRR abs/2410.16256 (2024) - 2023
- [c32]Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr:
Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation. AAAI 2023: 3222-3230 - [c31]Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Chengqi Lyu, Wenwei Zhang, Ping Luo, Kai Chen:
Dense Distinct Query for End-to-End Object Detection. CVPR 2023: 7329-7338 - [c30]Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin:
RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer. CVPR 2023: 14443-14452 - [c29]Yuan Liu, Songyang Zhang, Jiacheng Chen, Zhaohui Yu, Kai Chen, Dahua Lin:
Improving Pixel-based MIM by Reducing Wasted Modeling Capability. ICCV 2023: 5338-5349 - [c28]Lingdong Kong, Youquan Liu, Xin Li, Runnan Chen, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu:
Robo3D: Towards Robust and Reliable 3D Perception against Corruptions. ICCV 2023: 19937-19949 - [c27]Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen:
TG-VQA: Ternary Game of Video Question Answering. IJCAI 2023: 1044-1052 - [c26]Youquan Liu, Lingdong Kong, Jun Cen, Runnan Chen, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu:
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models. NeurIPS 2023 - [i62]Yuan Liu, Songyang Zhang, Jiacheng Chen, Kai Chen, Dahua Lin:
PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling. CoRR abs/2303.02416 (2023) - [i61]Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr:
Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation. CoRR abs/2303.06345 (2023) - [i60]Tao Jiang, Peng Lu, Li Zhang, Ningsheng Ma, Rui Han, Chengqi Lyu, Yining Li, Kai Chen:
RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose. CoRR abs/2303.07399 (2023) - [i59]Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Chengqi Lyu, Wenwei Zhang, Ping Luo, Kai Chen:
Dense Distinct Query for End-to-End Object Detection. CoRR abs/2303.12776 (2023) - [i58]Lingdong Kong, Youquan Liu, Xin Li, Runnan Chen, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu:
Robo3D: Towards Robust and Reliable 3D Perception against Corruptions. CoRR abs/2303.17597 (2023) - [i57]Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin:
RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer. CoRR abs/2304.05659 (2023) - [i56]Shaoyuan Xie, Lingdong Kong, Wenwei Zhang, Jiawei Ren, Liang Pan, Kai Chen, Ziwei Liu:
RoboBEV: Towards Robust Bird's Eye View Perception under Corruptions. CoRR abs/2304.06719 (2023) - [i55]Xiangtai Li, Henghui Ding, Wenwei Zhang, Haobo Yuan, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, Chen Change Loy:
Transformer-Based Visual Segmentation: A Survey. CoRR abs/2304.09854 (2023) - [i54]Tao Gong, Chengqi Lyu, Shilong Zhang, Yudong Wang, Miao Zheng, Qian Zhao, Kuikun Liu, Wenwei Zhang, Ping Luo, Kai Chen:
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans. CoRR abs/2305.04790 (2023) - [i53]Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen:
TG-VQA: Ternary Game of Video Question Answering. CoRR abs/2305.10049 (2023) - [i52]Youquan Liu, Lingdong Kong, Jun Cen, Runnan Chen, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu:
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models. CoRR abs/2306.09347 (2023) - [i51]Shilong Zhang, Peize Sun, Shoufa Chen, Min Xiao, Wenqi Shao, Wenwei Zhang, Kai Chen, Ping Luo:
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest. CoRR abs/2307.03601 (2023) - [i50]Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
MMBench: Is Your Multi-modal Model an All-around Player? CoRR abs/2307.06281 (2023) - [i49]Yuan Liu, Songyang Zhang, Jiacheng Chen, Zhaohui Yu, Kai Chen, Dahua Lin:
Improving Pixel-based MIM by Reducing Wasted Modeling Capability. CoRR abs/2308.00261 (2023) - [i48]Wangbo Zhao, Kepan Nan, Songyang Zhang, Kai Chen, Dahua Lin, Yang You:
Learning Referring Video Object Segmentation from Weak Annotation. CoRR abs/2308.02162 (2023) - [i47]Chenming Zhu, Wenwei Zhang, Tai Wang, Xihui Liu, Kai Chen:
Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection. CoRR abs/2309.09456 (2023) - [i46]Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang, Haodong Duan, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition. CoRR abs/2309.15112 (2023) - [i45]Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Songyang Zhang, Kai Chen, Zongwen Shen, Jidong Ge:
LawBench: Benchmarking Legal Knowledge of Large Language Models. CoRR abs/2309.16289 (2023) - [i44]Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yining Li, Guangliang Cheng, Yunhai Tong, Kai Chen, Chen Change Loy:
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection. CoRR abs/2310.01393 (2023) - [i43]Qinyuan Cheng, Tianxiang Sun, Wenwei Zhang, Siyin Wang, Xiangyang Liu, Mozhi Zhang, Junliang He, Mianqiu Huang, Zhangyue Yin, Kai Chen, Xipeng Qiu:
Evaluating Hallucinations in Chinese Large Language Models. CoRR abs/2310.03368 (2023) - [i42]Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen:
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues. CoRR abs/2310.13650 (2023) - [i41]Junhao Zhuang, Yanhong Zeng, Wenran Liu, Chun Yuan, Kai Chen:
A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting. CoRR abs/2312.03594 (2023) - [i40]Zeming Chen, Wenwei Zhang, Xinjiang Wang, Kai Chen, Zhi Wang:
Mixed Pseudo Labels for Semi-Supervised Object Detection. CoRR abs/2312.07006 (2023) - [i39]Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang:
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation. CoRR abs/2312.07526 (2023) - [i38]Yiming Zhang, Zhening Xing, Yanhong Zeng, Youqing Fang, Kai Chen:
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models. CoRR abs/2312.13964 (2023) - [i37]Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao:
T-Eval: Evaluating the Tool Utilization Capability Step by Step. CoRR abs/2312.14033 (2023) - [i36]Tai Wang, Xiaohan Mao, Chenming Zhu, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang:
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI. CoRR abs/2312.16170 (2023) - 2022
- [j2]Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin:
CARAFE++: Unified Content-Aware ReAssembly of FEatures. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 4674-4687 (2022) - [c25]Haodong Duan, Yue Zhao, Kai Chen, Dahua Lin, Bo Dai:
Revisiting Skeleton-based Action Recognition. CVPR 2022: 2959-2968 - [c24]Haodong Duan, Nanxuan Zhao, Kai Chen, Dahua Lin:
TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition. CVPR 2022: 2990-3000 - [c23]Shilong Zhang, Zhuoran Yu, Liyang Liu, Xinjiang Wang, Aojun Zhou, Kai Chen:
Group R-CNN for Weakly Semi-supervised Object Detection with Points. CVPR 2022: 9407-9416 - [c22]Jintao Lin, Haodong Duan, Kai Chen, Dahua Lin, Limin Wang:
OCSampler: Compressing Videos to One Clip with Single-step Sampling. CVPR 2022: 13884-13893 - [c21]Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr:
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation. CVPR 2022: 18134-18144 - [c20]Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation. CVPR 2022: 18825-18835 - [c19]Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy:
Dense Siamese Network for Dense Unsupervised Learning. ECCV (30) 2022: 464-480 - [c18]Haodong Duan, Yue Zhao, Kai Chen, Yuanjun Xiong, Dahua Lin:
Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks. ECCV Workshops (4) 2022: 557-575 - [c17]Yue Zhou, Xue Yang, Gefan Zhang, Jiabao Wang, Yanyi Liu, Liping Hou, Xue Jiang, Xingzhao Liu, Junchi Yan, Chengqi Lyu, Wenwei Zhang, Kai Chen:
MMRotate: A Rotated Object Detection Benchmark using PyTorch. ACM Multimedia 2022: 7331-7334 - [c16]Haodong Duan, Jiaqi Wang, Kai Chen, Dahua Lin:
PYSKL: Towards Good Practices for Skeleton Action Recognition. ACM Multimedia 2022: 7351-7354 - [c15]Lin Chen, Zhixiang Wei, Xin Jin, Huaian Chen, Miao Zheng, Kai Chen, Yi Jin:
Deliberated Domain Bridging for Domain Adaptive Semantic Segmentation. NeurIPS 2022 - [i35]Jintao Lin, Haodong Duan, Kai Chen, Dahua Lin, Limin Wang:
OCSampler: Compressing Videos to One Clip with Single-step Sampling. CoRR abs/2201.04388 (2022) - [i34]Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy:
Dense Siamese Network. CoRR abs/2203.11075 (2022) - [i33]Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation. CoRR abs/2204.04656 (2022) - [i32]Yue Zhou, Xue Yang, Gefan Zhang, Jiabao Wang, Yanyi Liu, Liping Hou, Xue Jiang, Xingzhao Liu, Junchi Yan, Chengqi Lyu, Wenwei Zhang, Kai Chen:
MMRotate: A Rotated Object Detection Benchmark using Pytorch. CoRR abs/2204.13317 (2022) - [i31]Haodong Duan, Nanxuan Zhao, Kai Chen, Dahua Lin:
TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition. CoRR abs/2205.02028 (2022) - [i30]Shilong Zhang, Zhuoran Yu, Liyang Liu, Xinjiang Wang, Aojun Zhou, Kai Chen:
Group R-CNN for Weakly Semi-supervised Object Detection with Points. CoRR abs/2205.05920 (2022) - [i29]Haodong Duan, Jiaqi Wang, Kai Chen, Dahua Lin:
PYSKL: Towards Good Practices for Skeleton Action Recognition. CoRR abs/2205.09443 (2022) - [i28]Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Kai Chen:
What Are Expected Queries in End-to-End Object Detection? CoRR abs/2206.01232 (2022) - [i27]Lin Chen, Zhixiang Wei, Xin Jin, Huaian Chen, Miao Zheng, Kai Chen, Yi Jin:
Deliberated Domain Bridging for Domain Adaptive Semantic Segmentation. CoRR abs/2209.07695 (2022) - [i26]Haodong Duan, Yue Zhao, Kai Chen, Yuanjun Xiong, Dahua Lin:
Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks. CoRR abs/2209.09393 (2022) - [i25]Haodong Duan, Jiaqi Wang, Kai Chen, Dahua Lin:
DG-STGCN: Dynamic Spatial-Temporal Modeling for Skeleton-based Action Recognition. CoRR abs/2210.05895 (2022) - [i24]Chengqi Lyu, Wenwei Zhang, Haian Huang, Yue Zhou, Yudong Wang, Yanyi Liu, Shilong Zhang, Kai Chen:
RTMDet: An Empirical Study of Designing Real-Time Object Detectors. CoRR abs/2212.07784 (2022) - 2021
- [j1]Jiangmiao Pang, Kai Chen, Qi Li, Zhihai Xu, Huajun Feng, Jianping Shi, Wanli Ouyang, Dahua Lin:
Towards Balanced Learning for Instance Recognition. Int. J. Comput. Vis. 129(5): 1376-1393 (2021) - [c14]Tao Gong, Kai Chen, Xinjiang Wang, Qi Chu, Feng Zhu, Dahua Lin, Nenghai Yu, Huamin Feng:
Temporal ROI Align for Video Object Recognition. AAAI 2021: 1442-1450 - [c13]Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin:
Seesaw Loss for Long-Tailed Instance Segmentation. CVPR 2021: 9695-9704 - [c12]Rui Xu, Xintao Wang, Kai Chen, Bolei Zhou, Chen Change Loy:
Positional Encoding As Spatial Inductive Bias in GANs. CVPR 2021: 13569-13578 - [c11]Zhanghui Kuang, Hongbin Sun, Zhizhong Li, Xiaoyu Yue, Tsui Hin Lin, Jianyong Chen, Huaqiang Wei, Yiqin Zhu, Tong Gao, Wenwei Zhang, Kai Chen, Wayne Zhang, Dahua Lin:
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding. ACM Multimedia 2021: 3791-3794 - [c10]Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy:
K-Net: Towards Unified Image Segmentation. NeurIPS 2021: 10326-10338 - [c9]Yuhang Cao, Jiaqi Wang, Ying Jin, Tong Wu, Kai Chen, Ziwei Liu, Dahua Lin:
Few-Shot Object Detection via Association and DIscrimination. NeurIPS 2021: 16570-16581 - [i23]Haodong Duan, Yue Zhao, Kai Chen, Dian Shao, Dahua Lin, Bo Dai:
Revisiting Skeleton-based Action Recognition. CoRR abs/2104.13586 (2021) - [i22]Shijie Fang, Yuhang Cao, Xinjiang Wang, Kai Chen, Dahua Lin, Wayne Zhang:
WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection. CoRR abs/2105.11293 (2021) - [i21]Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy:
K-Net: Towards Unified Image Segmentation. CoRR abs/2106.14855 (2021) - [i20]Zhanghui Kuang, Hongbin Sun, Zhizhong Li, Xiaoyu Yue, Tsui Hin Lin, Jianyong Chen, Huaqiang Wei, Yiqin Zhu, Tong Gao, Wenwei Zhang, Kai Chen, Wayne Zhang, Dahua Lin:
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding. CoRR abs/2108.06543 (2021) - [i19]Jiangmiao Pang, Kai Chen, Qi Li, Zhihai Xu, Huajun Feng, Jianping Shi, Wanli Ouyang, Dahua Lin:
Towards Balanced Learning for Instance Recognition. CoRR abs/2108.10175 (2021) - [i18]Tao Gong, Kai Chen, Xinjiang Wang, Qi Chu, Feng Zhu, Dahua Lin, Nenghai Yu, Huamin Feng:
Temporal RoI Align for Video Object Recognition. CoRR abs/2109.03495 (2021) - [i17]Rui Xu, Xiangyu Xu, Kai Chen, Bolei Zhou, Chen Change Loy:
STransGAN: An Empirical Study on Transformer in GANs. CoRR abs/2110.13107 (2021) - [i16]Yuhang Cao, Jiaqi Wang, Ying Jin, Tong Wu, Kai Chen, Ziwei Liu, Dahua Lin:
Few-Shot Object Detection via Association and DIscrimination. CoRR abs/2111.11656 (2021) - [i15]Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr:
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation. CoRR abs/2112.02244 (2021) - 2020
- [c8]Yuhang Cao, Kai Chen, Chen Change Loy, Dahua Lin:
Prime Sample Attention in Object Detection. CVPR 2020: 11580-11588 - [c7]Jiaqi Wang, Wenwei Zhang, Yuhang Cao, Kai Chen, Jiangmiao Pang, Tao Gong, Jianping Shi, Chen Change Loy, Dahua Lin:
Side-Aware Boundary Localization for More Precise Object Detection. ECCV (4) 2020: 403-419 - [i14]Kai Chen, Yuhang Cao, Chen Change Loy, Dahua Lin, Christoph Feichtenhofer:
Feature Pyramid Grids. CoRR abs/2004.03580 (2020) - [i13]Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin:
Seesaw Loss for Long-Tailed Instance Segmentation. CoRR abs/2008.10032 (2020) - [i12]Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin:
CARAFE++: Unified Content-Aware ReAssembly of FEatures. CoRR abs/2012.04733 (2020) - [i11]Rui Xu, Xintao Wang, Kai Chen, Bolei Zhou, Chen Change Loy:
Positional Encoding as Spatial Inductive Bias in GANs. CoRR abs/2012.05217 (2020)
2010 – 2019
- 2019
- [c6]Jiangmiao Pang, Kai Chen, Jianping Shi, Huajun Feng, Wanli Ouyang, Dahua Lin:
Libra R-CNN: Towards Balanced Learning for Object Detection. CVPR 2019: 821-830 - [c5]Jiaqi Wang, Kai Chen, Shuo Yang, Chen Change Loy, Dahua Lin:
Region Proposal by Guided Anchoring. CVPR 2019: 2965-2974 - [c4]Kai Chen, Jiangmiao Pang, Jiaqi Wang, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin:
Hybrid Task Cascade for Instance Segmentation. CVPR 2019: 4974-4983 - [c3]Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin:
CARAFE: Content-Aware ReAssembly of FEatures. ICCV 2019: 3007-3016 - [i10]Jiaqi Wang, Kai Chen, Shuo Yang, Chen Change Loy, Dahua Lin:
Region Proposal by Guided Anchoring. CoRR abs/1901.03278 (2019) - [i9]Kai Chen, Jiangmiao Pang, Jiaqi Wang, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin:
Hybrid Task Cascade for Instance Segmentation. CoRR abs/1901.07518 (2019) - [i8]Jiangmiao Pang, Kai Chen, Jianping Shi, Huajun Feng, Wanli Ouyang, Dahua Lin:
Libra R-CNN: Towards Balanced Learning for Object Detection. CoRR abs/1904.02701 (2019) - [i7]Yuhang Cao, Kai Chen, Chen Change Loy, Dahua Lin:
Prime Sample Attention in Object Detection. CoRR abs/1904.04821 (2019) - [i6]Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin:
CARAFE: Content-Aware ReAssembly of FEatures. CoRR abs/1905.02188 (2019) - [i5]Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin:
MMDetection: Open MMLab Detection Toolbox and Benchmark. CoRR abs/1906.07155 (2019) - [i4]Jiaqi Wang, Wenwei Zhang, Yuhang Cao, Kai Chen, Jiangmiao Pang, Tao Gong, Jianping Shi, Chen Change Loy, Dahua Lin:
Side-Aware Boundary Localization for More Precise Object Detection. CoRR abs/1912.04260 (2019) - 2018
- [c2]Kai Chen, Jiaqi Wang, Shuo Yang, Xingcheng Zhang, Yuanjun Xiong, Chen Change Loy, Dahua Lin:
Optimizing Video Object Detection via a Scale-Time Lattice. CVPR 2018: 7814-7823 - [i3]Kai Chen, Jiaqi Wang, Shuo Yang, Xingcheng Zhang, Yuanjun Xiong, Chen Change Loy, Dahua Lin:
Optimizing Video Object Detection via a Scale-Time Lattice. CoRR abs/1804.05472 (2018) - 2017
- [c1]Kai Chen, Hang Song, Chen Change Loy, Dahua Lin:
Discover and Learn New Objects from Documentaries. CVPR 2017: 1111-1120 - [i2]Kai Chen, Hang Song, Chen Change Loy, Dahua Lin:
Discover and Learn New Objects from Documentaries. CoRR abs/1707.09593 (2017) - [i1]Xiaoxiao Li, Yuankai Qi, Zhe Wang, Kai Chen, Ziwei Liu, Jianping Shi, Ping Luo, Xiaoou Tang, Chen Change Loy:
Video Object Segmentation with Re-identification. CoRR abs/1708.00197 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-23 19:35 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint