
LiveSense: Contextual Advertising in Live Streaming Videos

Published: 15 October 2019 · DOI: 10.1145/3343031.3350888

Abstract

Live streaming has become a new form of entertainment that attracts hundreds of millions of users worldwide. The huge amount of multimedia data on live streaming platforms creates tremendous opportunities for online advertising. However, existing state-of-the-art video advertising strategies (e.g., pre-roll and contextual mid-roll advertising) rely on analyzing the whole video and are therefore not applicable to live streaming videos. This paper describes a novel monetization framework for live streaming videos, named LiveSense, which displays a contextually relevant ad at a suitable timestamp in a non-intrusive way. Specifically, given a live streaming video, we first employ a deep neural network to determine, from the historical streaming data, whether the current moment is appropriate for displaying an ad. Then, we detect a set of candidate ad insertion areas by incorporating image saliency, a background map, and location priorities, so that the ad is displayed over a non-important area. We introduce three types of relevance metrics, namely textual relevance, global visual relevance, and local visual relevance, to select the contextually relevant ad. To minimize user intrusiveness, we initially display the ad in a non-important area; if the user is interested in the ad, we then show it in an overlaid window with a translucent background. Empirical evaluation on a real-world dataset demonstrates that our proposed framework is able to effectively display ads in live streaming videos while maintaining users' online experience.
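The placement step sketched in the abstract (fusing image saliency, a background map, and location priorities to pick a non-important area for the ad) can be illustrated with a small example. The Python sketch below is an assumption-laden illustration rather than the authors' implementation: the three maps, the linear weighting, and the brute-force window search are placeholders for whatever LiveSense actually uses.

```python
import numpy as np

def insertion_score(saliency, background, location_prior, weights=(0.4, 0.3, 0.3)):
    """Fuse the three maps into a per-pixel placement score.

    Low saliency, high background likelihood, and a favourable location
    prior all raise the score. The maps, weights, and linear fusion are
    illustrative assumptions, not the paper's exact formulation.
    """
    def norm(m):
        m = m.astype(np.float64)
        rng = m.max() - m.min()
        return (m - m.min()) / rng if rng > 0 else np.zeros_like(m)

    w_s, w_b, w_l = weights
    return (w_s * (1.0 - norm(saliency))      # prefer non-salient pixels
            + w_b * norm(background)          # prefer background regions
            + w_l * norm(location_prior))     # prefer pre-defined positions

def best_window(score, ad_h, ad_w):
    """Brute-force search for the ad-sized window with the highest mean score."""
    H, W = score.shape
    best, best_pos = -1.0, (0, 0)
    for y in range(H - ad_h + 1):
        for x in range(W - ad_w + 1):
            s = score[y:y + ad_h, x:x + ad_w].mean()
            if s > best:
                best, best_pos = s, (y, x)
    return best_pos

# Toy frame: a salient blob in the centre, background elsewhere,
# and a location prior favouring the bottom-right corner.
H, W = 90, 160
yy, xx = np.mgrid[0:H, 0:W]
saliency = np.exp(-(((yy - H / 2) ** 2) / 400.0 + ((xx - W / 2) ** 2) / 1200.0))
background = 1.0 - saliency
location_prior = ((yy > 0.7 * H) & (xx > 0.7 * W)).astype(np.float64)

score = insertion_score(saliency, background, location_prior)
print(best_window(score, ad_h=20, ad_w=40))  # top-left corner of the chosen area
```

In a real system the saliency and background maps would come from saliency detection and video segmentation models rather than the synthetic maps used here, and the candidate search would be restricted to the detected insertion areas; the sketch only shows how the fusion and selection could fit together.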


      Published In

      MM '19: Proceedings of the 27th ACM International Conference on Multimedia
      October 2019
      2794 pages
      ISBN:9781450368896
      DOI:10.1145/3343031

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Author Tags

      1. interactive display
      2. live streaming
      3. online advertising
      4. user engagement

      Qualifiers

      • Research-article

      Conference

      MM '19

      Acceptance Rates

MM '19 paper acceptance rate: 252 of 936 submissions (27%)
Overall acceptance rate: 2,145 of 8,556 submissions (25%)

