default search action
18th ECCV 2024: Milan, Italy - Part LXXIV
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXIV. Lecture Notes in Computer Science 15132, Springer 2025, ISBN 978-3-031-72903-4 - Kangqi Ma, Hao Dong, Yadong Mu:
Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection. 1-18 - Meng Wang, Yuyao Huang, Henghui Ding, Xinlong Wang, Tiejun Huang, Yao Zhao, Yunchao Wei, Shuicheng Yan:
Region-Native Visual Tokenization. 19-36 - Mae Younes, Amine Ouasfi, Adnane Boukhayma:
SparseCraft: Few-Shot Neural Reconstruction Through Stereopsis Guided Geometric Linearization. 37-56 - Fei Wang:
Sketch2Vox: Learning 3D Reconstruction from a Single Monocular Sketch. 57-73 - Minghao Chen, Iro Laina, Andrea Vedaldi:
DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing. 74-92 - Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa:
The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization. 93-109 - Silvio Galesso, Philipp Schröppel, Hssan Driss, Thomas Brox:
Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond. 110-126 - Zijie Jiang, Tianhan Xu, Hiroharu Kato:
Rethinking Directional Parameterization in Neural Implicit Surface Reconstruction. 127-142 - Tianhe Wu, Kede Ma, Jie Liang, Yujiu Yang, Lei Zhang:
A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment. 143-160 - Wulian Yun, Mengshi Qi, Fei Peng, Huadong Ma:
Semi-supervised Teacher-Reference-Student Architecture for Action Quality Assessment. 161-178 - Seungjun Shin, Suji Kim, Dokwan Oh:
Efficient Neural Video Representation with Temporally Coherent Modulation. 179-195 - Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao Li, Honggang Zhang, Di Hu:
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes. 196-213 - Haoran Li, Haolin Shi, Wenli Zhang, Wenjun Wu, Yong Liao, Lin Wang, Lik-Hang Lee, Peng Yuan Zhou:
DreamScene: 3D Gaussian-Based Text-to-3D Scene Generation via Formation Pattern Sampling. 214-230 - Haoliang Meng, Xiaopeng Hong, Chenhao Wang, Miao Shang, Wangmeng Zuo:
Multi-modal Crowd Counting via a Broker Modality. 231-250 - Tianyu Zhang, Guocheng Qian, Jin Xie, Jian Yang:
FastPCI: Motion-Structure Guided Fast Point Cloud Frame Interpolation. 251-267 - Charig Yang, Weidi Xie, Andrew Zisserman:
Made to Order: Discovering Monotonic Temporal Changes via Self-supervised Video Ordering. 268-286 - Runzhao Yao, Shaoyi Du, Wenting Cui, Canhui Tang, Chengwu Yang:
PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration. 287-303 - Guoqiang Zhao, Junjie Huang, Xiaoyun Yan, Zhaojing Wang, Junwei Tang, Yangjun Ou, Xinrong Hu, Tao Peng:
Open-Vocabulary RGB-Thermal Semantic Segmentation. 304-320 - Gabriele Moreno Berton, Lorenz Junglas, Riccardo Zaccone, Thomas Pollok, Barbara Caputo, Carlo Masone:
MeshVPR: Citywide Visual Place Recognition Using 3D Meshes. 321-339 - Yaoting Wang, Peiwen Sun, Yuanchao Li, Honggang Zhang, Di Hu:
Can Textual Semantics Mitigate Sounding Object Segmentation Preference? 340-356 - Raphael Sulzer, Florent Lafarge:
Concise Plane Arrangements for Low-Poly Surface and Volume Modelling. 357-373 - Hairong Jin, Yuefan Shen, Jianwen Lou, Kun Zhou, Youyi Zheng:
KeypointDETR: An End-to-End 3D Keypoint Detector. 374-390 - Sogand Salehi, Mahdi Shafiei, Teresa Yeo, Roman Bachmann, Amir Zamir:
ViPer: Visual Personalization of Generative Models via Individual Preference Learning. 391-406 - Jian Yang, Jiakun Li, Guoming Li, Huai-Yu Wu, Zhen Shen, Zhaoxin Fan:
MLPHand: Real Time Multi-view 3D Hand Reconstruction via MLP Modeling. 407-424 - A. Tuan Nguyen, Kai Sheng Tai, Bor-Chun Chen, Satya Narayan Shukla, Hanchao Yu, Philip Torr, Tai-Peng Tian, Ser-Nam Lim:
uCAP: An Unsupervised Prompting Method for Vision-Language Models. 425-439 - Dilxat Muhtar, Zhenshi Li, Feng Gu, Xueliang Zhang, Pengfeng Xiao:
LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model. 440-457 - Andrei Atanov, Jiawei Fu, Rishubh Singh, Isabella Yu, Andrew Spielberg, Amir Zamir:
How Far Can a 1-Pixel Camera Go? Solving Vision Tasks Using Photoreceptors and Computationally Designed Visual Morphology. 458-476
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.