DOI: 10.1145/3204949.3208141

MMTF-14K: a multifaceted movie trailer feature dataset for recommendation and retrieval

Published: 12 June 2018


In this paper we propose a new dataset, the multifaceted MMTF-14K. It is designed primarily for the evaluation of video-based recommender systems, but it also supports other multimedia tasks such as popularity prediction, genre classification, and auto-tagging (also known as tag prediction). The data covers 13,623 Hollywood-type movie trailers rated by 138,492 users, for a total of almost 12.5 million ratings. To address a broader community, metadata, audio, and visual descriptors are pre-computed and provided, along with several baseline benchmarking results for uni-modal and multi-modal recommender systems. The result is a rich collection of data for benchmarking that supports future development in this field.
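The counts quoted in the abstract give a rough sense of how sparse the user-trailer rating matrix is. The following is a minimal sketch of that arithmetic only. The rating total is the abstract's approximate figure ("almost 12.5 million"), and the function and constant names are hypothetical, not part of the dataset's tooling.

```python
# Counts quoted in the abstract. The rating total is approximate
# ("almost 12.5 million"), so every derived figure is an estimate.
NUM_TRAILERS = 13_623
NUM_USERS = 138_492
APPROX_NUM_RATINGS = 12_500_000


def rating_matrix_stats(num_users: int, num_items: int, num_ratings: int) -> dict:
    """Return basic size and sparsity figures for a user-item rating matrix."""
    cells = num_users * num_items      # every possible (user, item) pair
    density = num_ratings / cells      # fraction of pairs that carry a rating
    return {
        "cells": cells,
        "density": density,
        "sparsity": 1.0 - density,
        "ratings_per_user": num_ratings / num_users,
        "ratings_per_item": num_ratings / num_items,
    }


stats = rating_matrix_stats(NUM_USERS, NUM_TRAILERS, APPROX_NUM_RATINGS)

if __name__ == "__main__":
    for name, value in stats.items():
        print(f"{name}: {value:,.4f}")
```

Under these assumptions, fewer than 1% of the possible user-trailer pairs carry a rating, which is typical of recommendation benchmarks and is one reason content descriptors are useful alongside the ratings.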





    Published In

    MMSys '18: Proceedings of the 9th ACM Multimedia Systems Conference
    June 2018
    604 pages
    General Chair: Pablo Cesar
    Program Chairs: Michael Zink, Niall Murray
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].




    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. content description
    2. social media
    3. video recommendation
    4. video trailer benchmarking dataset


    • Research-article


    MMSys '18: 9th ACM Multimedia Systems Conference
    June 12 - 15, 2018
    Amsterdam, Netherlands

    Acceptance Rates

    Overall Acceptance Rate 176 of 530 submissions, 33%

