skip to main content
10.1145/3391614.3399391acmconferencesArticle/Chapter ViewAbstractPublication PagesimxConference Proceedingsconference-collections
Work in Progress

A Web Service for Video Summarization

Published: 17 June 2020 Publication History

Abstract

This paper presents a Web service that supports the automatic generation of video summaries for user-submitted videos. The developed Web application decomposes the video into segments, evaluates the fitness of each segment to be included in the video summary and selects appropriate segments until a pre-defined time budget is filled. The integrated deep-learning-based video analysis and summarization technologies exhibit state-of-the-art performance and, by exploiting the processing capabilities of modern GPUs, offer faster than real-time processing. Configurations for generating video summaries that fulfill the specifications for posting on the most common video sharing platforms and social networks are available in the user interface of this application, enabling the one-click generation of distribution-channel-specific summaries.

References

[1]
Evlampios Apostolidis, Eleni Adamantidou, Alexandros I Metsai, Vasileios Mezaris, and Ioannis Patras. 2020. Unsupervised Video Summarization via Attention-Driven Adversarial Learning. In International Conference on Multimedia Modeling. Springer, 492–504.
[2]
Evlampios Apostolidis, Alexandros I Metsai, Eleni Adamantidou, Vasileios Mezaris, and Ioannis Patras. 2019. A stepwise, label-based approach for improving the adversarial training in unsupervised video summarization. In Proceedings of the 1st International Workshop on AI for Smart TV Content Production, Access and Delivery. 17–25.
[3]
Evlampios Apostolidis and Vasileios Mezaris. 2014. Fast shot segmentation combining global and local visual descriptors. In 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 6583–6587.
[4]
Lorenzo Baraldi, Costantino Grana, and Rita Cucchiara. 2015. A deep siamese network for scene detection in broadcast videos. In Proceedings of the 23rd ACM international conference on Multimedia. 1199–1202.
[5]
Wen-Sheng Chu, Yale Song, and Alejandro Jaimes. 2015. Video co-summarization: Video summarization by visual co-occurrence. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3584–3592.
[6]
Giorgos Dimopoulos, Pere Barlet-Ros, and Josep Sanjuas-Cuxart. 2013. Analysis of YouTube user experience from passive measurements. In Proceedings of the 9th International Conference on Network and Service Management (CNSM 2013). IEEE, 260–267.
[7]
Naveed Ejaz, Irfan Mehmood, and Sung Wook Baik. 2014. Feature aggregation based visual attention model for video summarization. Computers & Electrical Engineering 40, 3 (2014), 993 – 1005. https://doi.org/10.1016/j.compeleceng.2013.10.005 Special Issue on Image and Video Processing.
[8]
Naveed Ejaz, Tayyab Bin Tariq, and Sung Wook Baik. 2012. Adaptive key frame extraction for video summarization using an aggregation mechanism. Journal of Visual Communication and Image Representation 23, 7(2012), 1031 – 1040.
[9]
Mohamed Elfeki and Ali Borji. 2019. Video summarization via actionness ranking. In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 754–763.
[10]
[10] Facebook video requirements (accessed: 2020-03-20). https://www.facebook.com/business/m/one-sheeters/video-requirements
[11]
Jiri Fajtl, Hajar Sadeghi Sokeh, Vasileios Argyriou, Dorothy Monekosso, and Paolo Remagnino. 2018. Summarizing videos with attention. In Asian Conference on Computer Vision. Springer, 39–54.
[12]
Jiri Fajtl, Hajar Sadeghi Sokeh, Vasileios Argyriou, Dorothy Monekosso, and Paolo Remagnino. 2019. Summarizing Videos with Attention. In Computer Vision – ACCV 2018 Workshops, Gustavo Carneiro and Shaodi You (Eds.). Springer International Publishing, Cham, 39–54.
[13]
Tsu-Jui Fu, Shao-Heng Tai, and Hwann-Tzong Chen. 2019. Attentive and Adversarial Learning for Video Summarization. In IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI, USA, January 7-11, 2019. 1579–1587. https://doi.org/10.1109/WACV.2019.00173
[14]
Marco Furini, Filippo Geraci, Manuela Montangero, and Marco Pellegrini. 2010. STIMO: STIll and MOving Video Storyboard for the Web Scenario. Multimedia Tools Appl. 46, 1 (Jan. 2010), 47–69.
[15]
Boqing Gong, Wei-Lun Chao, Kristen Grauman, and Fei Sha. 2014. Diverse Sequential Subset Selection for Supervised Video Summarization. In Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2 (Montreal, Canada) (NIPS’14). MIT Press, Cambridge, MA, USA, 2069–2077. http://dl.acm.org/citation.cfm?id=2969033.2969058
[16]
Michael Gygli. 2017. Ridiculously fast shot boundary detection with fully convolutional neural networks. arXiv preprint arXiv:1705.08214(2017).
[17]
Michael Gygli, Helmut Grabner, Hayko Riemenschneider, and Luc Van Gool. 2014. Creating summaries from user videos. In European conference on computer vision. Springer, 505–520.
[18]
[18] How Long Should Your Videos Be? Ideal Lengths for Facebook, Instagram, Twitter, and YouTube (accessed: 2020-03-20). https://blog.hubspot.com/marketing/how-long-should-videos-be-on-instagram-twitter-facebook-youtube
[19]
[19] How many seconds of video can I record on Instagram? (accessed: 2020-03-20). https://www.facebook.com/help/instagram/270963803047681
[20]
Zhong Ji, Kailin Xiong, Yanwei Pang, and Xuelong Li. 2019. Video summarization with attention-based encoder-decoder networks. IEEE Transactions on Circuits and Systems for Video Technology (2019).
[21]
Jie-Ling Lai and Yang Yi. 2012. Key frame extraction based on visual attention model. Journal of Visual Communication and Image Representation 23, 1(2012), 114 – 125. https://doi.org/10.1016/j.jvcir.2011.08.005
[22]
Christian Moldovan, Florian Wamser, and Tobias Hoßfeld. 2019. User Behavior and Engagement of a Mobile Video Streaming User from Crowdsourced Measurements. In 2019 Eleventh International Conference on Quality of Multimedia Experience (QoMEX). IEEE, 1–3.
[23]
[23] Online summarize tool (free summarizing) (accessed: 2020-03-20). https://www.tools4noobs.com/summarize/
[24]
[24] Online Text Summary Generator (accessed: 2020-03-20). http://autosummarizer.com/
[25]
Mayu Otani, Yuta Nakashima, Esa Rahtu, and Janne Heikkila. 2019. Rethinking the evaluation of video summaries. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7596–7604.
[26]
Rameswar Panda, Abir Das, Ziyan Wu, Jan Ernst, and Amit K Roy-Chowdhury. 2017. Weakly supervised summarization of web videos. In Proceedings of the IEEE International Conference on Computer Vision. 3657–3666.
[27]
Danila Potapov, Matthijs Douze, Zaid Harchaoui, and Cordelia Schmid. 2014. Category-Specific Video Summarization. In Computer Vision – ECCV 2014, David Fleet, Tomas Pajdla, Bernt Schiele, and Tinne Tuytelaars (Eds.). Springer International Publishing, Cham, 540–555.
[28]
[28] Resoomer | Summarizer to make an automatic text summary online (accessed: 2020-03-20). https://resoomer.com/en/
[29]
Sartaj Sahni. 1975. Approximate algorithms for the 0/1 knapsack problem. Journal of the ACM (JACM) 22, 1 (1975), 115–124.
[30]
Yair Shemer, Daniel Rotman, and Nahum Shimkin. 2019. ILS-SUMM: Iterated Local Search for Unsupervised Video Summarization. arXiv preprint arXiv:1912.03650(2019).
[31]
Yale Song, Jordi Vallmitjana, Amanda Stent, and Alejandro Jaimes. 2015. Tvsum: Summarizing web videos using titles. In Proceedings of the IEEE conference on computer vision and pattern recognition. 5179–5187.
[32]
[32] Text Compactor: Free Online Automatic Text Summarization Tool (accessed: 2020-03-20). https://www.textcompactor.com/
[33]
[33] Text Summarizer - Text Summarization (accessed: 2020-03-20). http://textsummarization.net/text-summarizer
[34]
[34] The Ideal Length For Every Online Content (accessed: 2020-03-20). https://seopressor.com/blog/the-ideal-length-for-every-online-content/
[35]
[35] The Ultimate Guide to TikTok Videos (accessed: 2020-03-20). https://clipchamp.com/en/blog/2019/ultimate-guide-to-tiktok/
[36]
[36] Twitter media upload best practices (accessed: 2020-03-20). https://developer.twitter.com/en/docs/media/upload-media/uploading-media/media-best-practices
[37]
[37] Video Length: 4 Tips That Will Help You Boost Engagement (accessed: 2020-03-20). https://meetmaestro.com/insights/how
[38]
Li Yuan, Francis Eng Hock Tay, Ping Li, Li Zhou, and Jiashi Feng. 2019. Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization. In 2019 AAAI Conference on Artificial Intelligence (AAAI).
[39]
HongJiang Zhang, Jianhua Wu, Di Zhong, and Stephen W. Smoliar. 1997. An integrated system for content-based video retrieval and browsing. Pattern Recognition 30(1997), 643–658.
[40]
Kaiyang Zhou and Yu Qiao. 2018. Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward. In 2018 AAAI Conference on Artificial Intelligence (AAAI).
[41]
Kaiyang Zhou, Yu Qiao, and Tao Xiang. 2018. Deep reinforcement learning for unsupervised video summarization with diversity-representativeness reward. In Thirty-Second AAAI Conference on Artificial Intelligence.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
IMX '20: Proceedings of the 2020 ACM International Conference on Interactive Media Experiences
June 2020
211 pages
ISBN:9781450379762
DOI:10.1145/3391614
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 June 2020

Check for updates

Author Tags

  1. Deep learning
  2. Generative adversarial networks
  3. Social networks
  4. Video sharing platforms
  5. Video summarization
  6. Web service

Qualifiers

  • Work in progress
  • Research
  • Refereed limited

Funding Sources

Conference

IMX '20

Acceptance Rates

Overall Acceptance Rate 69 of 245 submissions, 28%

Upcoming Conference

IMX '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media