Dear MMSys 2024 Participants,
On behalf of the organizers, we are very pleased to welcome you to the 15th ACM Multimedia Systems Conference, taking place for the first time in Italy, in the city of Bari.
MMSys is a premier conference dedicated to the exciting and multidisciplinary field of multimedia, with a specific focus on its systems and applications. The conference provides a platform for researchers from both academia and industry to share their latest findings in the multimedia systems research area. Many international researchers, practitioners, engineers, and students from academia, industry, standardization bodies, and government agencies join the MMSys conference each year.
Reliability Groups with Standby Flying Light Specks
A Flying Light Speck, FLS, is a miniature-sized drone configured with light sources to illuminate different colors and textures. A swarm of FLSs illuminates complex 3D multimedia shapes in a fixed volume, a 3D display. An FLS is a mechanical device. Its ...
BOLA360: Near-optimal View and Bitrate Adaptation for 360-degree Video Streaming
Recent advances in omnidirectional cameras and AR/VR headsets have spurred the adoption of 360° videos, which are widely believed to be the future of online video streaming. 360° videos allow users to wear a head-mounted display (HMD) and experience the ...
VP9 bitstream-based Tiled Multipoint Control Unit: Scaling simultaneous RGBD user streams in an immersive 3D communication system
Video conference applications that allow group communication through video and audio over distance have become commonplace and mainstream. To scale the number of participants in video conferencing systems, the usual practice is to deploy centralized ...
AGiLE: Enhancing Adaptive GOP in Live Video Streaming
As live streaming video continues to gain popularity, encoding efficiency remains a critical challenge. Current commercial systems limit the Group of Picture (GOP) length to optimize for spontaneous viewer access, but this often compromises encoding ...
OASIS: Collaborative Neural-Enhanced Mobile Video Streaming
Neural-enhanced video streaming (e.g., super-resolution) is an ongoing revolution which can provide extremely high-quality video streaming services breaking the restriction of bandwidth. However, such enhancements require intense computation power that ...
FlexMark: Adaptive Watermarking Method for Images
Most current watermarking methods offer low and fixed capacity, which means they can only embed small-size watermarks into images. Additionally, they are typically robust to only a small subset of the known image transformations (aka distortions) that ...
FovOptix: Human Vision-Compatible Video Encoding and Adaptive Streaming in VR Cloud Gaming
VR cloud gaming enables users to play high-end VR games on lightweight devices by offloading rendering tasks to cloud servers. Despite video compression, high-definition video streaming requires substantial data transfer rates. Foveated rendering (FR) ...
DIGITWISE: Digital Twin-based Modeling of Adaptive Video Streaming Engagement
As the popularity of video streaming entertainment continues to grow, understanding how users engage with the content and react to its changes becomes a critical success factor for every stakeholder. User engagement, i.e., the percentage of video the ...
Just-in-Time Transcoding of 360° Video Streams
Adaptive streaming of 360° tiled video requires encoding tiles to multiple qualities to support client decisions in the light of fluctuating bandwidth and dynamic view-ports. Some static approaches allocate fixed encoding resources independently of the ...
Automatic Preparation of Sensory Effects: Managing Synchronization in Mulsemedia Applications
Traditional audiovisual multimedia applications can be synchronized with sensory effects to further enhance the users' quality of experience. These applications are named mulsemedia applications and stimulate other human senses, such as touch, smell, and ...
Low-Latency Live Video Streaming over a Low-Earth-Orbit Satellite Network with DASH
In light of Starlink's recent rapid growth in constructing a global low-Earth-orbit satellite constellation and offering high-speed, low-latency Internet services, the implications of utilizing Starlink for low-latency live video streaming, particularly ...
Scalable MDC-Based Volumetric Video Delivery for Real-Time One-to-Many WebRTC Conferencing
The production and consumption of video content have become a staple in the current day and age. With the rise of virtual reality (VR), users are now looking for immersive, interactive experiences which combine the classic video applications, such as ...
Accelerated Event-Based Feature Detection and Compression for Surveillance Video Systems
The strong temporal consistency of surveillance video enables compelling compression performance with traditional methods, but downstream vision applications operate on decoded image frames with a high data rate. Since it is not straightforward for ...
QV4: QoE-based Viewpoint-Aware V-PCC-encoded Volumetric Video Streaming
Volumetric videos allow six degrees of freedom (6DoF) movement for viewers, enabling numerous applications in domains such as entertainment, healthcare, and education. MPEG's Video-based Point Cloud Compression (V-PCC) is a recent standard for ...
Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces
Neural Radiance Fields (NeRF) have quickly become the primary approach for 3D reconstruction and novel view synthesis in recent years due to their remarkable performance. Despite the huge interest in NeRF methods, a practical use case of NeRFs has ...
Vesper: Learning to Manage Uncertainty in Video Streaming
Video codecs are crucial in video streaming systems. However, the quantization operation in existing codecs introduces irreversible jitters. Moreover, the common practice of fitting a single codec to diverse video content lacks the flexibility to adapt ...
QoE Metrics for Interactivity in Video Conferencing Applications: Definition and Evaluation Methodology
Video conferencing applications (VCAs) have become an indispensable tool for business, educational, and personal communications. There is, therefore, considerable interest in understanding and measuring the Quality of Experience (QoE) delivered by VCAs ...
A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation
Performance degradation caused by corrupted multimedia samples is a critical challenge for machine learning models. Previously, three groups of approaches have been proposed to tackle this issue: i) enhancer and denoiser modules to improve the quality of ...
Inter-Frame Parallelization in an Open Optimized VVC Encoder
- Valeri George
- Jens Brandenburg
- Gabriel Hege
- Tobias Hinz
- Adam Wieckowski
- Benjamin Bross
- Thomas Schierl
- Detlev Marpe
The Versatile Video Coding (VVC) standard promises high compression efficiency for diverse content types. Based on VVenC, an open and optimized VVC software video encoder, this work presents an inter-frame parallelization (IFP) method designed to exploit ...
User Study-based Models of Game Player Quality of Experience with Frame Display Time Variation
Computer games are often rendered with inconsistent frame timing (frame jitter), particularly in cloud-based game streaming where frames traverse network bottlenecks before being rendered. While previous studies have helped understand the Quality of ...
How do Users Experience Asynchrony between Visual and Haptic Information?
In this paper, we investigate the effects of asynchrony between the visual and haptic feedback in virtual reality (VR) on user experience, specifically focusing on understanding users' awareness of this asynchrony and its effect on their level of ...
CVF: Cross-Video Filtration on the Edge
Many edge applications rely on expensive Deep-Neural-Network (DNN) inference-based video analytics. Typically, a single instance of an inference service analyzes multiple realtime camera streams concurrently. In many cases, only a fraction of these ...
Ceasefire Hierarchical Weapon Dataset
- Thierry Malon
- Sylvie Chambon
- Alain Crouzil
- Loubna Lechelek
- Grégory Jalabert
- Christian Brocard
- Nadia Bernardeau
- Laurence Abadie
- Bruno Sera
- Thierry Hartmann
- Marjorie Le Bras
Given the huge level of firearms trafficking in Europe, governments, and in particular interior ministries, are actively engaged in developing artificial intelligence tools to identify firearms more effectively. Indeed, confronted with the huge number of ...
TACDEC: Dataset of Tackle Events in Soccer Game Videos
- Evan Jåsund Kassab
- Håkon Maric Solberg
- Sushant Gautam
- Saeed Shafiee Sabet
- Thomas Torjusen
- Michael Riegler
- Pål Halvorsen
- Cise Midoglu
This paper introduces TACDEC, a dataset of tackle events in soccer game videos. Recognizing the gap in existing open datasets that predominantly focus on official soccer events such as goals and cards, TACDEC targets a comprehensive analysis of tackles --...
Nagare Media Engine: Task Error Recovery in MPEG NBMP Workflows Through Event Sourcing
Multimedia workflows have become complex distributed systems that are deployed in a multi-cloud and multi-edge fashion. Such systems are prone to errors in either hardware or software. Modern multimedia workflows therefore need to design an appropriate ...
GREEM: An Open-Source Energy Measurement Tool for Video Processing
Addressing climate change requires a global decrease in greenhouse gas (GHG) emissions. In today's digital landscape, video streaming significantly influences internet traffic, driven by the widespread use of mobile devices and the rising popularity of ...
An Open Software Suite for Event-Based Video
While traditional video representations are organized around discrete image frames, event-based video is a new paradigm that forgoes image frames altogether. Rather, pixel samples are temporally asynchronous and independent of one another. Until now, ...
LENS: A LEO Satellite Network Measurement Dataset
Low-Earth-Orbit (LEO) satellite constellations are narrowing the performance gap between satellite networks and the terrestrial Internet. Low-latency satellite Internet offered by Starlink enables functionalities that are otherwise unachievable with the ...
EVCA: Enhanced Video Complexity Analyzer
The optimization of video compression and streaming workflows critically relies on understanding the video complexity, including both spatial and temporal features. These features play a vital role in guiding rate control, predicting video encoding ...
Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming
Traditional per-title encoding schemes aim to optimize encoding resolutions to deliver the highest perceptual quality for each representation. XPSNR is observed to correlate better with the subjective quality of VVC-coded bitstreams. Towards this ...