Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- abstractOctober 2022
PIES-ME '22: 1st Workshop on Photorealistic Image and Environment Synthesis for Multimedia Experiments
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 7420–7422https://doi.org/10.1145/3503161.3554770Photorealistic media aim to faithfully represent the world, creating an experience that is perceptually indistinguishable from a real world experience. In the past few years, this area has grown significantly, with new multimedia areas emerging, such as ...
- abstractOctober 2022
SUMAC '22: 4th ACM International workshop on Structuring and Understanding of Multimedia heritAge Contents
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 7426–7427https://doi.org/10.1145/3503161.3554768SUMAC 2022 is the fourth edition of the workshop on Structuring and Understanding of Multimedia heritAge Contents. It is held in Lisboa, Portugal on October 10th, 2022 and is co-located with the 30th ACM International Conference on Multimedia. Its ...
- abstractOctober 2022
QoEVMA'22: 2nd Workshop on Quality of Experience (QoE) in Visual Multimedia Applications
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 7423–7425https://doi.org/10.1145/3503161.3554767Nowadays, people spend dramatically more time on watching videos through different devices. The advanced hardware technology and network allow for the increasing demands of users viewing experience. Thus, enhancing the Quality of Experience of end-users ...
- abstractOctober 2022
MMSports'22: 5th International ACM Workshop on Multimedia Content Analysis in Sports
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 7386–7388https://doi.org/10.1145/3503161.3551791The fifth ACM International Workshop on Multimedia Content Analysis in Sports (ACM MMSports'22) is part of the ACM International Conference on Multimedia 2022 (ACM Multimedia 2022). After two years of pure virtual MMSports workshops due to COVID-19, ...
- short-paperOctober 2022
Interaction with Immersive Cultural Heritage Environments: Using XR Technologies to Represent Multiple Perspectives on Serralves Museum
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 6920–6924https://doi.org/10.1145/3503161.3548756Museums have increasingly been using digital approaches to explore new ways to provide new experiences with Cultural Heritage (CH). The need for these solutions exploded with the COVID-19 pandemic forcing museums and cultural organizations to move ...
-
- short-paperOctober 2022
OpenHardwareVC: An Open Source Library for 8K UHD Video Coding Hardware Implementation
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 7339–7342https://doi.org/10.1145/3503161.3548543The hardware-accelerated real-time compression of 8K Ultra-High-Definition (UHD) video is an exemplary application that empowered by the latest video coding standard. However, the coding tools added to the recently released third-generation audio video ...
Reproducibility Companion Paper: Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 7364–7367https://doi.org/10.1145/3503161.3548526In this paper we reproduce experimental results presented in our earlier work titled "Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies" that was presented in the course of the 29th ACM International Conference on ...
- research-articleOctober 2022
High-Quality 3D Face Reconstruction with Affine Convolutional Networks
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 2495–2503https://doi.org/10.1145/3503161.3548421Recent works based on convolutional encoder-decoder architecture and 3DMM parameterization have shown great potential for canonical view reconstruction from a single input image. Conventional CNN architectures benefit from exploiting the spatial ...
- research-articleOctober 2022
Robust Multimodal Depth Estimation using Transformer based Generative Adversarial Networks
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 3559–3568https://doi.org/10.1145/3503161.3548418Accurately measuring the absolute depth of every pixel captured by an imaging sensor is of critical importance in real-time applications such as autonomous navigation, augmented reality and robotics. In order to predict dense depth, a general approach ...
- research-articleOctober 2022
DisCo: Disentangled Implicit Content and Rhythm Learning for Diverse Co-Speech Gestures Synthesis
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 3764–3773https://doi.org/10.1145/3503161.3548400Current co-speech gestures synthesis methods struggle with generating diverse motions and typically collapse to single or few frequent motion sequences, which are trained on original data distribution with customized models and strategies. We tackle ...
- research-articleOctober 2022
Box-FaceS: A Bidirectional Method for Box-Guided Face Component Editing
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 6061–6071https://doi.org/10.1145/3503161.3548392While the quality of face manipulation has been improved tremendously, the ability to control face components, e.g., eyebrows, is still limited. Although existing methods have realized component editing with user-provided geometry guidance, such as ...
- research-articleOctober 2022
Error Concealment of Dynamic 3D Point Cloud Streaming
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 3134–3142https://doi.org/10.1145/3503161.3548384Recently standardized MPEG Video-based Point Cloud Compression (V-PCC) codec has shown promise in achieving a good rate-distortion ratio of dynamic 3D point cloud compression. Current error concealment methods of V-PCC, however, lead to significantly ...
- research-articleOctober 2022
Enhancing Image Rescaling using Dual Latent Variables in Invertible Neural Network
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 5602–5610https://doi.org/10.1145/3503161.3548376Normalizing flow models have been used successfully for generative image super-resolution (SR) by approximating complex distribution of natural images to simple tractable distribution in latent space through Invertible Neural Networks (INN). These ...
- research-articleOctober 2022
CrossHuman: Learning Cross-guidance from Multi-frame Images for Human Reconstruction
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 2483–2494https://doi.org/10.1145/3503161.3548351We propose CrossHuman, a novel method that learns cross-guidance from parametric human model and multi-frame RGB images to achieve high-quality 3D human reconstruction. To recover geometry details and texture even in invisible regions, we design a ...
- research-articleOctober 2022
Atrous Pyramid Transformer with Spectral Convolution for Image Inpainting
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 4674–4683https://doi.org/10.1145/3503161.3548348Owing to the ability of extracting features of images on long-range dependencies naturally, transformer is possible to reconstruct the damaged areas of images with the information from the uncorrupted regions globally. In this paper, we propose a two-...
- research-articleOctober 2022
RCRN: Real-world Character Image Restoration Network via Skeleton Extraction
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 1177–1185https://doi.org/10.1145/3503161.3548344Constructing high-quality character image datasets is challenging because real-world images are often affected by image degradation. There are limitations when applying current image restoration methods to such real-world character images, since (i) the ...
- research-articleOctober 2022
AGTGAN: Unpaired Image Translation for Photographic Ancient Character Generation
- Hongxiang Huang,
- Daihui Yang,
- Gang Dai,
- Zhen Han,
- Yuyi Wang,
- Kin-Man Lam,
- Fan Yang,
- Shuangping Huang,
- Yongge Liu,
- Mengchao He
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 5456–5467https://doi.org/10.1145/3503161.3548338The study of ancient writings has great value for archaeology and philology. Essential forms of material are photographic characters, but manual photographic character recognition is extremely time-consuming and expertise-dependent. Automatic ...
- research-articleOctober 2022
Learning-Based Video Coding with Joint Deep Compression and Enhancement
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 3045–3054https://doi.org/10.1145/3503161.3548314End-to-end learning-based video coding has attracted substantial attentions by compressing video signals as stacked visual features. This paper proposes an end-to-end deep video codec with jointly optimized compression and enhancement modules (JCEVC). ...
- research-articleOctober 2022
Unsupervised Textured Terrain Generation via Differentiable Rendering
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 2654–2662https://doi.org/10.1145/3503161.3548297Constructing large-scale realistic terrains using modern modeling tools is an extremely challenging task even for professional users, undermining the effectiveness of video games, virtual reality, and other applications. In this paper, we present a step ...
- research-articleOctober 2022
Uncertainty-Aware Semi-Supervised Learning of 3D Face Rigging from Single Image
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 170–179https://doi.org/10.1145/3503161.3548285We present a method to rig 3D faces via Action Units (AUs), viewpoint and light direction, from single input image. Existing 3D methods for face synthesis and animation rely heavily on 3D morphable model (3DMM), which was built on 3D data and cannot ...