Taking a “Deep” Look at Multimedia Streaming
Streaming multimedia content has become an integral part of our lives, influencing the way we consume daily news, communicate with friends, family, and office colleagues, and entertain ourselves. The quality of multimedia content has been improving by leaps and bounds ...
Reversible Modal Conversion Model for Thermal Infrared Tracking
Learning a powerful CNN representation of the target is a key issue for thermal infrared (TIR) tracking. The lack of massive TIR training data is one of the obstacles to training the network end-to-end from scratch. Compared to the time-...
An Improved Interaction Estimation and Optimization Method for Surveillance Video Synopsis
Video synopsis is an efficient technique for condensing long-duration videos into short videos. The interactions between moving objects in the original video need to be preserved during video condensation. However, identifying objects with strong spatio-...
Bandwidth-Aware High-Efficiency Video Coding Design Scheme on a Multiprocessor System on Chip
H.265/high-efficiency video coding (HEVC) provides a multitude of video data compression to minimize data storage and data transmission while preserving video coding quality and ameliorating coding bit rates. However, HEVC encoder chips are frequently ...
A Cross-Domain Multimodal Supervised Latent Topic Model for Item Tagging and Cold-Start Recommendation
Cross-domain data analysis is playing an increasingly important role in media convergence and can be adopted for many applications. Most existing methods consider the domain discrimination as the multimodal representation difference or the imbalanced item ...
Edge Distraction-aware Salient Object Detection
Integrating low-level edge features has been proven to be effective in preserving clear boundaries of salient objects. However, the locality of edge features makes it difficult to capture globally salient edges, leading to distraction in the final ...
A New Fog-Based Transmission Scheduler on the Internet of Multimedia Things Using a Fuzzy-Based Quantum Genetic Algorithm
The Internet of Multimedia Things (IoMT) has recently experienced a considerable surge in multimedia-based services. Due to the fast proliferation and transfer of massive data, the IoMT has service quality challenges. This article proposes a novel fog-...
Content-Aware Latent Semantic Direction Fusion for Multi-Attribute Editing
For facial attribute editing, significant progress has been made in discovering semantic directions in the latent space of StyleGAN, and the manipulation is performed by mapping an input image to a latent code and then moving along a direction associated ...
PP8K: A New Dataset for 8K UHD Video Compression and Processing
In the new era of ultra-high definition (UHD) videos, 8K is becoming more popular in diversified applications to boost the human visual experience and the performances of related vision tasks. However, researchers still suffer from the lack of 8K video ...
Reviving Standard-Dynamic-Range Videos for High-Dynamic-Range Devices: A Learning Paradigm With Hybrid Attention Mechanisms
With the prevalence of high-dynamic-range (HDR) display devices, the demand to convert existing standard-dynamic-range television (SDRTV) video content to its corresponding HDR television (HDRTV) counterpart is growing exponentially. Herein, we propose a ...
Optimizing Multidimensional Perceptual Quality in Online Interactive Multimedia
Network latencies and losses in online interactive multimedia applications may lead to a degraded perception of quality, such as lower interactivity or sluggish responses. We can measure these degradations in perceptual quality by the just-noticeable ...
Perceptual Authentication Hashing for Digital Images With Contrastive Unsupervised Learning
In recent years, many perceptual image hashing schemes for content authentication have been proposed based on classical methods and deep learning. However, most existing schemes target specific and limited content-preserving manipulations and cannot ...