Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2023
WormTrack: Dataset and Benchmark for Multi-Object Tracking in Worm Crowds
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 5756–5763https://doi.org/10.1145/3581783.3613812Currently, multimedia systems and computer vision algorithms are increasingly playing a crucial role in biological research. However, due to the significant difference between macro and micro scenarios, it is impractical to directly transfer existing ...
- demonstrationOctober 2023
Reference-based Dense Pose Estimation via Partial 3D Point Cloud Matching
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 9411–9413https://doi.org/10.1145/3581783.3612679Interacting with real-world objects is one of the fundamental tasks in multimedia. Despite its importance, existing object pose estimation targets only rigid objects. This demonstration proposes a novel application for non-rigid object pose estimation. ...
- demonstrationOctober 2023
H2V4Sports: Real-Time Horizontal-to-Vertical Video Converter for Sports Lives via Fast Object Detection and Tracking
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 9376–9378https://doi.org/10.1145/3581783.3612669We present H2V4Sports, a real-time horizontal-to-vertical video converter specifically designed for sports live broadcasts. With the increasing demand of smartphone users who prefer to watch sports events on their vertical screens anywhere, anytime, our ...
- research-articleOctober 2023
Quality-Aware RGBT Tracking via Supervised Reliability Learning and Weighted Residual Guidance
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 3129–3137https://doi.org/10.1145/3581783.3612341RGB and thermal infrared (TIR) data have different visual properties, which make their fusion essential for effective object tracking in diverse environments and scenes. Existing RGBT tracking methods commonly use attention mechanisms to generate ...
- research-articleOctober 2023
Progressive Domain-style Translation for Nighttime Tracking
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 7324–7334https://doi.org/10.1145/3581783.3612305Nighttime tracking is challenging due to the lack of sufficient training data and scene diversity. Unsupervised domain adaptation is a solution by transferring knowledge from day (source domain) to night (target domain). It typically involves adversarial ...
- research-articleOctober 2023
Unambiguous Object Tracking by Exploiting Target Cues
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 1997–2005https://doi.org/10.1145/3581783.3612240Siamese tracking exploits the template and the search region features to adaptively locate arbitrary objects in the tracking. A noteworthy issue is that both foreground and background mix in the template, and thus a tracker needs to learn what the target ...
- research-articleOctober 2023
ColSLAM: A Versatile Collaborative SLAM System for Mobile Phones Using Point-Line Features and Map Caching
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 9032–9041https://doi.org/10.1145/3581783.3611995Over the past years, augmented reality (AR) based on mobile phones has gained great attention. When multiple phones are used in AR applications, collaborative simultaneous localization and mapping (SLAM) is considered one of the enabling technologies, ...
- research-articleOctober 2023
Follow-me: Deceiving Trackers with Fabricated Paths
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 8808–8818https://doi.org/10.1145/3581783.3611935Convolutional Neural Networks (CNNs) are vulnerable to adversarial attacks in which visually imperceptible perturbations can deceive CNN-based models. While current research on adversarial attacks in single object tracking exists, it overlooks a critical ...
- research-articleOctober 2023
FOLT: Fast Multiple Object Tracking from UAV-captured Videos Based on Optical Flow
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 3375–3383https://doi.org/10.1145/3581783.3611868Multiple object tracking (MOT) has been successfully investigated in computer vision. However, MOT for the videos captured by unmanned aerial vehicles (UAV) is still challenging due to small object size, blurred object appearance, and very large and/or ...
- research-articleOctober 2023
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 5552–5561https://doi.org/10.1145/3581783.3611803Current mainstream vision-language (VL) tracking framework consists of three parts,i.e., a visual feature extractor, a language feature extractor, and a fusion model. To pursue better performance, a natural modus operandi for VL tracking is employing ...
- research-articleOctober 2023
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 2734–2743https://doi.org/10.1145/3581783.3611728Multiple object tracking (MOT) tends to become more challenging when severe occlusions occur. In this paper, we analyze the limitations of traditional Convolutional Neural Network-based methods and Transformer-based methods in handling occlusions and ...
- research-articleOctober 2023
A Simple Baseline for Open-World Tracking via Self-training
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 2765–2774https://doi.org/10.1145/3581783.3611695Open-World Tracking (OWT) presents a challenging yet emerging problem, aiming to track every object of any category. Different from traditional Multi-Object Tracking (MOT), OWT needs to additionally track targets beyond predefined categories in the ...