TOMM: Vol 18, No 1

Volume 18, Issue 1, January 2022

Editor:

Alberto Del Bimbo
University of Firenze, Italy

Publisher:

Association for Computing Machinery
New York, NY, United States

ISSN: 1551-6857

EISSN: 1551-6865

research-article

Sparse LIDAR Measurement Fusion with Joint Updating Cost for Fast Stereo Matching

Article No.: 1, Pages 1–18, https://doi.org/10.1145/3471870

The complementary virtues of active and passive depth sensors inspire the LIDAR-Stereo fusion for enhancing the accuracy of stereo matching. However, most of the fusion based stereo matching algorithms have exploited dense LIDAR priors with single fusion ...

research-article

Online Learning for Adaptive Video Streaming in Mobile Networks

Article No.: 2, Pages 1–22, https://doi.org/10.1145/3460819

In this paper, we propose a novel algorithm for video bitrate adaptation in HTTP Adaptive Streaming (HAS), based on online learning. The proposed algorithm, named Learn2Adapt (L2A), is shown to provide a robust bitrate adaptation strategy which, unlike ...

research-article

Modeling the User Experience of Watching 360° Videos with Head-Mounted Displays

Article No.: 3, Pages 1–23, https://doi.org/10.1145/3463825

Conducting user studies to quantify the Quality of Experience (QoE) of watching the increasingly more popular 360° videos in Head-Mounted Displays (HMDs) is time-consuming, tedious, and expensive. Deriving QoE models, however, is very challenging because ...

research-article

TTV Regularized LRTA Technique for the Estimation of Haze Model Parameters in Video Dehazing

Article No.: 4, Pages 1–22, https://doi.org/10.1145/3465454

Nowadays, intelligent transport systems have a major role in providing a safe and secure traffic society for passengers, pedestrians, and vehicles. However, some bad weather conditions such as haze or fog may affect the visual clarity of video footage ...

research-article

MMSUM Digital Twins: A Multi-view Multi-modality Summarization Framework for Sporting Events

Article No.: 5, Pages 1–25, https://doi.org/10.1145/3462777

Sporting events generate a massive amount of traffic on social media with live moment-to-moment accounts as any given situation unfolds. The generated data are intensified by fans' feelings, reactions, and subjective opinions towards what happens during ...

research-article

Multi-feature Fusion VoteNet for 3D Object Detection

Article No.: 6, Pages 1–17, https://doi.org/10.1145/3462219

In this article, we propose a Multi-feature Fusion VoteNet (MFFVoteNet) framework for improving the 3D object detection performance in cluttered and heavily occluded scenes. Our method takes the point cloud and the synchronized RGB image as inputs to ...

research-article

A Novel Multi-Modal Network-Based Dynamic Scene Understanding

Article No.: 7, Pages 1–19, https://doi.org/10.1145/3462218

In recent years, dynamic scene understanding has gained attention from researchers because of its widespread applications. The main important factor in successfully understanding the dynamic scenes lies in jointly representing the appearance and motion ...

research-article

Facial-expression-aware Emotional Color Transfer Based on Convolutional Neural Network

Article No.: 8, Pages 1–19, https://doi.org/10.1145/3464382

Emotional color transfer aims to change the evoked emotion of a source image to that of a target image by adjusting color distribution. Most existing emotional color transfer methods only consider the low-level visual features of an image and ignore ...

research-article

The Impact of Artificial Intelligence on the Creativity of Videos

Article No.: 9, Pages 1–27, https://doi.org/10.1145/3462634

This study explored the impact Artificial Intelligence (AI) has on the evaluation of creative elements in artistic videos. The aim was to verify to what extent the use of an AI algorithm (Style Transfer) contributes to changes in the perceived creativity ...

research-article

Learning Hierarchical Video Graph Networks for One-Stop Video Delivery

Article No.: 10, Pages 1–23, https://doi.org/10.1145/3466886

The explosive growth of video data has brought great challenges to video retrieval, which aims to find out related videos from a video collection. Most users are usually not interested in all the content of retrieved videos but have a more fine-grained ...

research-article

Mask-Guided Deformation Adaptive Network for Human Parsing

Article No.: 11, Pages 1–20, https://doi.org/10.1145/3467889

Due to the challenges of densely compacted body parts, nonrigid clothing items, and severe overlap in crowd scenes, human parsing needs to focus more on multilevel feature representations compared to general scene parsing tasks. Based on this observation, ...

research-article

Mimicking Individual Media Quality Perception with Neural Network based Artificial Observers

Article No.: 12, Pages 1–25, https://doi.org/10.1145/3464393

The media quality assessment research community has traditionally been focusing on developing objective algorithms to predict the result of a typical subjective experiment in terms of Mean Opinion Score (MOS) value. However, the MOS, being a single value, ...

research-article

Diversely-Supervised Visual Product Search

Article No.: 13, Pages 1–22, https://doi.org/10.1145/3461646

This article strives for a diversely supervised visual product search, where queries specify a diverse set of labels to search for. Where previous works have focused on representing attribute, instance, or category labels individually, we consider them ...

research-article

CAPTAIN: Comprehensive Composition Assistance for Photo Taking

Article No.: 14, Pages 1–24, https://doi.org/10.1145/3462762

Many people are interested in taking astonishing photos and sharing them with others. Emerging high-tech hardware and software facilitate the ubiquitousness and functionality of digital photography. Because composition matters in photography, researchers ...

survey

Defining Scents: A Systematic Literature Review of Olfactory-based Computing Systems

Article No.: 15, Pages 1–22, https://doi.org/10.1145/3470975

The human sense of smell is a primal ability that has the potential to reveal unexplored relationships between user behaviors and technology. Humans use millions of olfactory receptor cells to observe the environment around them. Olfaction studies are ...

research-article

Hyperspectral Image Reconstruction Using Multi-scale Fusion Learning

Article No.: 16, Pages 1–21, https://doi.org/10.1145/3477396

Hyperspectral imaging is a promising imaging modality that simultaneously captures several images for the same scene on narrow spectral bands, and it has made considerable progress in different fields, such as agriculture, astronomy, and surveillance. ...

research-article

Open Access

An Empirical Method for Causal Inference of Constructs for QoE in Haptic–Audiovisual Communications

Shuji Tasaka

Article No.: 17, Pages 1–24, https://doi.org/10.1145/3473986

This article proposes an empirical method for inferring causal directions in multidimensional Quality of Experience (QoE) in multimedia communications, noting that causation in QoE is perceptual. As an example for modeling framework, we pick up a Bayesian ...

research-article

RD-IOD: Two-Level Residual-Distillation-Based Triple-Network for Incremental Object Detection

Article No.: 18, Pages 1–23, https://doi.org/10.1145/3472393

As a basic component in multimedia applications, object detectors are generally trained on a fixed set of classes that are pre-defined. However, new object classes often emerge after the models are trained in practice. Modern object detectors based on ...

research-article

Optimizing Immersive Video Coding Configurations Using Deep Learning: A Case Study on TMIV

Article No.: 19, Pages 1–25, https://doi.org/10.1145/3471191

Immersive video streaming technologies improve Virtual Reality (VR) user experience by providing users more intuitive ways to move in simulated worlds, e.g., with 6 Degree-of-Freedom (6DoF) interaction mode. A naive method to achieve 6DoF is deploying ...

research-article

Robust Unsupervised Gaze Calibration Using Conversation and Manipulation Attention Priors

Article No.: 20, Pages 1–27, https://doi.org/10.1145/3472622

Gaze estimation is a difficult task, even for humans. However, as humans, we are good at understanding a situation and exploiting it to guess the expected visual focus of attention of people, and we usually use this information to retrieve people’s gaze. ...

research-article

LogoDet-3K: A Large-scale Image Dataset for Logo Detection

Article No.: 21, Pages 1–19, https://doi.org/10.1145/3466780

Logo detection has been gaining considerable attention because of its wide range of applications in the multimedia field, such as copyright infringement detection, brand visibility monitoring, and product brand management on social media. In this article, ...

research-article

Authentication of LINE Chat History Files by Information Hiding

Article No.: 22, Pages 1–23, https://doi.org/10.1145/3474225

With the prevalence of smartphones, message exchanges via mobile chatting programs like LINE have become popular. The messages in the form of chat records in a LINE chat history, after being downloaded for legal uses, might be tampered with illicitly. A ...

research-article

Privacy-preserving Motion Detection for HEVC-compressed Surveillance Video

Article No.: 23, Pages 1–27, https://doi.org/10.1145/3472669

In the cloud era, a large amount of data is uploaded to and processed by public clouds. The risk of privacy leakage has become a major concern for cloud users. Cloud-based video surveillance requires motion detection, which may reveal the privacy of ...

ACM Transactions on Multimedia Computing, Communications, and Applications
