skip to main content
research-article

Identifying content unaware features influencing popularity of videos on YouTube: : A study based on seven regions

Published: 15 November 2022 Publication History

Highlights

AI techniques to identify the satellite features that make YouTube video trending.
AI-based methodology that assesses the impact of various content-agnostic factors.
Dataset is created from YouTube for seven regions.
Feature selection techniques executed on to extract important attributes.
Helpful for content creators and YouTubers to make their video trending.

Abstract

Predicting the popularity of User Generated Content (UGC) is a subject of interest to the Internet service providers, content makers, social media researchers, and online advertisers. However, it is also a challenging task due to multiple factors that influence social networks' content popularity. This work utilizes the Artificial Intelligence (AI) techniques to identify the features that contribute towards a video to enter into the trending category on YouTube. It examines the data generated by a video and its potential to get trending. For this, the present work utilizes AI for feature prediction. An AI-based methodology is presented that assesses the impact of various content-agnostic factors regarding video popularity in seven different regions, including Canada, France, Germany, India, Pakistan, United Arab Emirates, and the United States of America. A dataset is extracted from YouTube for these regions, and feature selection techniques are executed on the datasets to extract important attributes. A class label is assigned to each video, and the dataset is profiled having one of the two class labels, i.e., trending or non-trending. The top three features for each video (region wise) are obtained. It is observed that the trending behavior is dissimilar in different regions. Finally, three classifiers, namely, artificial neural networks, k-Nearest Neighbor, and support vector machine, are trained to predict if a video can get into the trending category on YouTube. The proposed solution is compared with two closely related state-of-the-art methods. This work is useful for content creators and YouTubers to make their video trending and more appealing.

References

[1]
A. Abisheva, V.R.K. Garimella, D. Garcia, I. Weber, Who watches (and shares) what on youtube? and when? using twitter to understand youtube viewership, in: In Proceedings of the 7th ACM international conference on Web search and data mining, 2014, pp. 593–602.
[2]
N. Aggrawal, A. Arora, P. Kumaraguru, Multiple metric aware YouTube tutorial videos virality analysis, International Journal of Social Network Mining 2 (4) (2017) 362–387.
[3]
M. Asad, Z. Halim, M. Waqas, S. Tu, An In-ad contents-based viewability prediction framework using Artificial Intelligence for Web Ads, Artificial Intelligence Review 54 (7) (2021) 5095–5125.
[4]
H. Ashraf, B. Shah, A.M. Soomro, Z. Halim, S.K. Shah, Ambient-noise free generation of clean underwater ship engine audios from hydrophones using generative adversarial networks, Computers and Electrical Engineering 100 (2022).
[5]
S. Baluja R. Seth D. Sivakumar Y. Jing J. Yagnik S. Kumar … M. Aly Video suggestion and discovery for youtube: taking random walks through the view graph 2008 ACM 895 904.
[6]
J. Berger, K.L. Milkman, What makes online content viral?, Journal of marketing research 49 (2) (2012) 192–205.
[7]
D.M. Beskow, S. Kumar, K.M. Carley, The evolution of political memes: Detecting and characterizing internet memes with multi-modal deep learning, Information Processing & Management 57 (2) (2020).
[8]
Y. Borghol, S. Ardon, N. Carlsson, D. Eager, A. Mahanti, The untold story of the clones: Content-agnostic factors that impact YouTube video popularity, in: In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, 2012, pp. 1186–1194.
[9]
Burges J (2008) All Your chocolate Rain Are Belong to Us. Video Vortex Reader. Responses to YouTube. Geert Lovink and Sabine Niederer,(eds.), Amsterdam: Institute of Network Cultures, 101:109.
[10]
M. Cha H. Kwak P. Rodriguez Y.Y. Ahn S. Moon I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system 2007 ACM 1 14.
[11]
W.L. Chang, L.M. Chen, A. Verkholantsev, Revisiting online video popularity: A sentimental analysis, Cybernetics and Systems 50 (6) (2019) 563–577.
[12]
Chatzopoulou G, Sheng C, Faloutsos M (2010) March. A first step towards understanding popularity in YouTube. In 2010 INFOCOM IEEE Conference on Computer Communications Workshops (pp. 1-6). IEEE.
[13]
Y.L. Chen, C.L. Chang, Early prediction of the future popularity of uploaded videos, Expert Systems with Applications 133 (2019) 59–74.
[14]
N. Chintalapati, V.S.K. Daruri, Examining the use of YouTube as a Learning Resource in higher education: Scale development and validation of TAM model, Telematics and Informatics 34 (6) (2017) 853–860.
[15]
R. Crane, D. Sornette, Robust dynamic classes revealed by measuring the response function of a social system, Proceedings of the National Academy of Sciences 105 (41) (2008) 15649–15653.
[16]
T. Daugherty, M.S. Eastin, L. Bright, Exploring consumer motivations for creating user-generated content, Journal of Interactive Advertising 8 (2) (2008) 16–25.
[17]
V.K. Dave, V. Vakharia, S. Singh, Ball Bearing Fault Diagnosis Using Mutual Information and Walsh-Hadamard Transform, in: Reliability, Safety and Hazard Assessment for Risk-Based Technologies, Springer, Singapore, 2020, pp. 607–616.
[18]
Ding Y, Du Y, Hu Y, Liu Z, Wang L, Ross K, Ghose A (2011) November. Broadcast yourself: understanding YouTube uploaders. In Proceedings of the 2011 ACM SIGCOMM conference on Internet measurement conference (pp. 361-370). ACM.
[19]
Dubovi I, Tabak I (2021). Interactions between emotional and cognitive engagement with science on YouTube. Public Understanding of Science, 0963662521990848.
[20]
F. Figueiredo, J.M. Almeida, M.A. Gonçalves, F. Benevenuto, Trendlearner: Early prediction of popularity trends of user generated content, Information Sciences 349 (2016) 172–187.
[21]
F. Figueiredo F. Benevenuto J.M. Almeida The tube over time: characterizing popularity growth of youtube videos 2011 ACM 745 754.
[22]
T. Gillespie, The politics of 'platforms', New Media & Society 12 (3) (2010) 347–364.
[23]
L. Guo, P. Wu, S. Lou, J. Gao, Y. Liu, A multi-feature extraction technique based on principal component analysis for non-linear dynamic process monitoring, Journal of Process Control 85 (2020) 159–172.
[24]
Z. Halim, O. Ali, M.G. Khan, On the efficient representation of datasets as graphs to mine maximal frequent itemsets, IEEE Transactions on Knowledge and Data Engineering 33 (4) (2019) 1674–1691.
[25]
Z. Halim, M. Atif, A. Rashid, C.A. Edwin, Profiling players using real-world datasets: Clustering the data and correlating the results with the big-five personality traits, IEEE Transactions on Affective Computing 10 (4) (2019) 568–584.
[26]
Z. Halim, M. Rehan, On identification of driving-induced stress using electroencephalogram signals: A framework based on wearable safety-critical scheme and machine learning, Information Fusion 53 (2020) 66–79.
[27]
L. Hilderbrand, YouTube: Where cultural memory and copyright converge, FILM QUART 61 (1) (2007) 48–57.
[28]
Hussain MN, Tokdemir S, Al-khateeb S, Bandeli KK, Agarwal N (2018) Understanding digital ethnography: socio-computational analysis of trending YouTube videos. In International Conference on Social Computing, Behavioral-Cultural Modeling & Prediction and Behavior Representation in Modeling and Simulation (SBP-BRiMS 2018).
[29]
Jenkins H, Deuze M, (2008) Convergence culture.
[30]
Y. Jiang, W. Li, M.S. Hossain, M. Chen, A. Alelaiwi, M. Al-Hammadi, A snapshot research and implementation of multimodal information fusion for data-driven emotion recognition, Information Fusion 53 (2020) 209–221.
[31]
G.F. Khan, S. Vong, Virality over YouTube: An empirical analysis, Internet research 24 (5) (2014) 1066–2243.
[32]
Q. Kong, M.A. Rizoiu, S. Wu, L. Xie, Will This Video Go Viral: Explaining and Predicting the Popularity of Youtube Videos, in: In Companion Proceedings of the The Web Conference 2018, 2018, pp. 175–178.
[33]
P.J. Lee, Y.H. Hu, K.T. Lu, Assessing the helpfulness of online hotel reviews: A classification-based approach, Telematics and Informatics 35 (2) (2018) 436–445.
[34]
J. Maldonado, M.C. Riff, B. Neveu, A review of recent approaches on wrapper feature selection for intrusion detection, Expert Systems with Applications 116822 (2022).
[35]
I. Mazid, Virality of social change messages on facebook: A study of advocacy and relationship building strategies of LGBTQ advocacy organizations, International Journal of Strategic Communication 14 (2) (2020) 105–121.
[36]
A. Meseguer-Martinez, A. Ros-Galvez, A. Rosa-Garcia, Linking YouTube and university rankings: Research performance as predictor of online video impact, Telematics and Informatics 43 (2019).
[37]
L. Nie, M. Liu, X. Song, Multimodal learning toward micro-video understanding, Synthesis Lectures on Image, Video, and Multimedia Processing 9 (4) (2019) 1–186.
[38]
L. Nie, X. Wang, J. Zhang, X. He, H. Zhang, R. Hong, Q. Tian, Enhancing micro-video understanding by harnessing external sounds, in: In Proceedings of the 25th ACM international conference on Multimedia, 2017, pp. 1192–1200.
[39]
M. Nilashi, O. Ibrahim, H. Ahmadi, L. Shahmoradi, A knowledge-based system for breast cancer classification using fuzzy logic method, Telematics and Informatics 34 (4) (2017) 133–144.
[40]
M. Park, M. Naaman, J. Berger, A data-driven study of view duration on YouTube, in: 10th International AAAI Conference on Weblogs and Social Media (ICWSM), 2016, pp. 1–4.
[41]
H. Pinto, J.M. Almeida, M.A. Gonçalves, Using early view patterns to predict the popularity of youtube videos, in: In Proceedings of the sixth ACM international conference on Web search and data mining, 2013, pp. 365–374.
[42]
S. Rani, M. Kaur, M. Kumar, V. Ravi, U. Ghosh, J.R. Mohanty, Detection of shilling attack in recommender system for YouTube video statistics using machine learning techniques, Soft Computing (2021) 1–13.
[43]
D. Röchert, G. Neubaum, B. Ross, S. Stieglitz, Caught in a networked collusion? Homogeneity in conspiracy-related discussion networks on YouTube, Information Systems 103 (2022).
[44]
R. Roopchund, V. Ramesh, V. Jaunky, Use of Social Media for Improving Student Engagement at Université des Mascareignes (UDM). In Information Systems Design and Intelligent Applications, Springer, Singapore, 2019, pp. 11–20.
[45]
S.D. Roy, T. Mei, W. Zeng, S. Li, Towards cross-domain learning for social video popularity prediction, IEEE Transactions on Multimedia 15 (6) (2013) 1255–1267.
[46]
L.T. Rui, Z.A. Afif, R.R. Saedudin, A. Mustapha, N. Razali, A regression approach for prediction of Youtube views, Bulletin of Electrical Engineering and Informatics 8 (4) (2019) 1502–1506.
[47]
N. Sangwan, V. Bhatnagar, Video popularity prediction based on fuzzy inference system, Journal of Statistics and Management Systems (2020) 1–13.
[48]
F. Shen, C. Xia, M. Skoric, Examining the roles of social media and alternative media in social movement participation: A study of Hong Kong's Umbrella Movement, Telematics and Informatics (2020).
[49]
D. Southgate, N. Westoby, G. Page, Creative determinants of viral video viewing, International Journal of Advertising 29 (3) (2010) 349–368.
[50]
O. Stromann, A. Nascetti, O. Yousif, Y. Ban, Dimensionality reduction and feature selection for object-based land cover classification based on sentinel-1 and sentinel-2 time series using google earth engine, Remote Sensing 12 (1) (2020) 76.
[51]
T. Trzciński, P. Rokita, Predicting popularity of online videos using support vector regression, IEEE Transactions on Multimedia 19 (11) (2017) 2561–2570.
[52]
D. Vallet, S. Berkovsky, S. Ardon, A. Mahanti, M.A. Kafaar, Characterizing and predicting viral-and-popular video content, in: In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, 2015, pp. 1591–1600.
[53]
J. Van Dijck, Users like you? Theorizing agency in user-generated content, Media, Culture & Society 31 (1) (2009) 41–58.
[54]
K. Wallsten, Yes we can: How online viewership, blog discussion, campaign statements, and mainstream media coverage produced a viral video phenomenon, Journal of Information Technology & Politics 7 (2–3) (2010) 163–181.
[55]
A. Wan, L. Moscowitz, L. Wu, Online social viewing: Cross-cultural adoption and uses of bullet-screen videos, Journal of International and Intercultural Communication (2019) 1–19.
[56]
West T (2011) Going viral: Factors that lead videos to become internet phenomena. Elon Journal of Undergraduate Research, 2(1):pp.76-84.
[57]
H. Xie, Y. He, D. Xu, J.Y. Kuo, H. Lei, B. Lei, Joint segmentation and classification task via adversarial network: Application to HEp-2 cell images, Applied Soft Computing 114 (2022).
[58]
X. Zhao, Y. Zhang, X. Du, DFpin: Deep learning–based protein-binding site prediction with feature-based non-redundancy from RNA level, Computers in biology and medicine 142 (2022).

Cited By

View all

Index Terms

  1. Identifying content unaware features influencing popularity of videos on YouTube: A study based on seven regions
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image Expert Systems with Applications: An International Journal
        Expert Systems with Applications: An International Journal  Volume 206, Issue C
        Nov 2022
        1603 pages

        Publisher

        Pergamon Press, Inc.

        United States

        Publication History

        Published: 15 November 2022

        Author Tags

        1. Video classification
        2. YouTube
        3. Artificial intelligence
        4. Content popularity
        5. YouTube trend prediction

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 26 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        View options

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media