Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- abstractOctober 2017
MUSA2: First ACM Workshop on Multimodal Understanding of Social, Affective and Subjective Attributes
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1974–1975https://doi.org/10.1145/3123266.3132057Multimedia scientists have largely focused their research on the recognition of tangible properties of data, such as objects and scenes. Recently, the field has started evolving towards the modeling of more complex properties. For example, the ...
- abstractOctober 2017
MultiEdTech 2017: 1st International Workshop on Multimedia-based Educational and Knowledge Technologies for Personalized and Social Online Training
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1980–1982https://doi.org/10.1145/3123266.3132056Educational and Knowledge Technologies (EdTech), especially in connection to multimedia content and the vision of mobile and personalized learning, is a hot topic in both academia and the business start-ups ecosystem. The driver and enabler of this is ...
- tutorialOctober 2017
Medical Multimedia Information Systems (MMIS)
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1957–1958https://doi.org/10.1145/3123266.3130142In hospitals all around the world, medical multimedia information systems have gained high importance over the last few years. One of the reasons is that an increasing number of interventions are performed in a minimally invasive way. These endoscopic ...
- research-articleOctober 2017
Popularity Meter: An Influence- and Aesthetics-aware Social Media Popularity Predictor
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1918–1923https://doi.org/10.1145/3123266.3127903Social media websites have become an important channel for content sharing and communication between users on social networks. The shared images on the websites, even the ones from the same user, tend to receive a quite diverse distribution of views. ...
- research-articleOctober 2017
Combining Multiple Features for Image Popularity Prediction in Social Media
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1901–1905https://doi.org/10.1145/3123266.3127900Popularity prediction, aiming at predicting target items' total interactions with users, is a very significant type of problem and has attracted a lot of attention in recent years. It can benefit a lot of real applications, such as cold-start ...
-
- research-articleOctober 2017
Exploring the use of Time-Dependent Cross-Network Information for Personalized Recommendations
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1780–1788https://doi.org/10.1145/3123266.3123447The overwhelming volume and complexity of information in online applications make recommendation essential for users to find information of interest. However, two major limitations that coexist in real world applications (1) incomplete user profiles, ...
- research-articleOctober 2017
Cross-Domain Image Retrieval with Attention Modeling
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1654–1662https://doi.org/10.1145/3123266.3123429With the proliferation of e-commerce websites and the ubiquitousness of smart phones, cross-domain image retrieval using images taken by smart phones as queries to search products on e-commerce websites is emerging as a popular application. One challenge ...
- research-articleOctober 2017
Cross-modal Recipe Retrieval with Rich Food Attributes
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1771–1779https://doi.org/10.1145/3123266.3123428Food is rich of visible (e.g., colour, shape) and procedural (e.g., cutting, cooking) attributes. Proper leveraging of these attributes, particularly the interplay among ingredients, cutting and cooking methods, for health-related applications has not ...
- research-articleOctober 2017
Video Question Answering via Gradually Refined Attention over Appearance and Motion
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1645–1653https://doi.org/10.1145/3123266.3123427Recently image question answering (ImageQA) has gained lots of attention in the research community. However, as its natural extension, video question answering (VideoQA) is less explored. Although both tasks look similar, VideoQA is more challenging ...
- research-articleOctober 2017
Beyond Human-level License Plate Super-resolution with Progressive Vehicle Search and Domain Priori GAN
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1618–1626https://doi.org/10.1145/3123266.3123422In this paper, we address the challenging problem of vehicle license plate image super-resolution. Different from existing image super-resolution approaches only resorted to one single image, we propose to leverage complementary information from multiple ...
- research-articleOctober 2017
Statistical Inference of Gaussian-Laplace Distribution for Person Verification
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1609–1617https://doi.org/10.1145/3123266.3123421Metric learning is an important issue in the person verification problem, which is to identify whether a pair of face or human body images is about the same person. Due to low running cost, the non-iterative statistical inference methods for metric ...
- research-articleOctober 2017
Deep Supervised Quantization by Self-Organizing Map
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1707–1715https://doi.org/10.1145/3123266.3123415Approximate Nearest Neighbour (ANN) search is an important research topic in multimedia and computer vision fields. In this paper, we propose a new deep supervised quantization method by Self-Organizing Map (SOM) to address this problem. Our method ...
- research-articleOctober 2017
Multi-Modal Localization and Enhancement of Multiple Sound Sources from a Micro Aerial Vehicle
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1591–1599https://doi.org/10.1145/3123266.3123412The ego-noise generated by the motors and propellers of a micro aerial vehicle (MAV) masks the environmental sounds and considerably degrades the quality of the on-board sound recording. Sound enhancement approaches generally require knowledge of the ...
- research-articleOctober 2017
Incremental Accelerated Kernel Discriminant Analysis
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1575–1583https://doi.org/10.1145/3123266.3123401In this paper a novel incremental dimensionality reduction (DR) technique called incremental accelerated kernel discriminant analysis (IAKDA) is proposed. Consisting of the eigenvalue decomposition of a relatively small-size matrix and the recursive ...
- research-articleOctober 2017
Cross-media Retrieval by Learning Rich Semantic Embeddings of Multimedia
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1698–1706https://doi.org/10.1145/3123266.3123369Cross-media retrieval aims at seeking the semantic association between different media types. Most existing methods paid much attention on learning mapping functions or finding the optimal spaces, but neglected how people accurately cognize images and ...
- research-articleOctober 2017
Deep Asymmetric Pairwise Hashing
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1522–1530https://doi.org/10.1145/3123266.3123345Recently, deep neural networks based hashing methods have greatly improved the multimedia retrieval performance by simultaneously learning feature representations and binary hash functions. Inspired by the latest advance in the asymmetric hashing scheme,...
- research-articleOctober 2017
Semi-Relaxation Supervised Hashing for Cross-Modal Retrieval
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1762–1770https://doi.org/10.1145/3123266.3123320Recently, some cross-modal hashing methods have been devised for cross-modal search task. Essentially, given a similarity matrix, most of these methods tackle a discrete optimization problem by separating it into two stages, i.e., first relaxing the ...
- research-articleOctober 2017
Exploring Outliers in Crowdsourced Ranking for QoE
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1540–1548https://doi.org/10.1145/3123266.3123267Outlier detection is a crucial part of robust evaluation for crowdsourceable assessment of Quality of Experience (QoE) and has attracted much attention in recent years. In this paper, we propose some simple and fast algorithms for outlier detection and ...
- research-articleOctober 2017
Learning Visual Emotion Distributions via Multi-Modal Features Fusion
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 369–377https://doi.org/10.1145/3123266.3130858Current image emotion recognition works mainly classified the images into one dominant emotion category, or regressed the images with average dimension values by assuming that the emotions perceived among different viewers highly accord with each other. ...
- demonstrationOctober 2017
Sketch-based Image Retrieval using Generative Adversarial Networks
MM '17: Proceedings of the 25th ACM international conference on MultimediaPages 1267–1268https://doi.org/10.1145/3123266.3127939For sketch-based image retrieval (SBIR), we propose a generative adversarial network trained on a large number of sketches and their corresponding real images. To imitate human search process, we attempt to match candidate images with theimaginary image ...