skip to main content
research-article

CrowdStory: Fine-Grained Event Storyline Generation by Fusion of Multi-Modal Crowdsourced Data

Published: 11 September 2017 Publication History

Abstract

Event summarization based on crowdsourced microblog data is a promising research area, and several researchers have recently focused on this field. However, these previous works fail to characterize the fine-grained evolution of an event and the rich correlations among posts. The semantic associations among the multi-modal data in posts are also not investigated as a means to enhance the summarization performance. To address these issues, this study presents CrowdStory, which aims to characterize an event as a fine-grained, evolutionary, and correlation-rich storyline. A crowd-powered event model and a generic event storyline generation framework are first proposed, based on which a multi-clue--based approach to fine-grained event summarization is presented. The implicit human intelligence (HI) extracted from visual contents and community interactions is then used to identify inter-clue associations. Finally, a cross-media mining approach to selective visual story presentation is proposed. The experiment results indicate that, compared with the state-of-the-art methods, CrowdStory enables fine-grained event summarization (e.g., dynamic evolution) and correctly identifies up to 60% strong correlations (e.g., causality) of clues. The cross-media approach shows diversity and relevancy in visual data selection.

References

[1]
J. Bian, Y. Yang, H. Zhang, and T. Chua, “Multimedia Summarization for Social Events in Microblog Stream,” IEEE Transactions on multimedia, vol. 17, no. 2, pp. 216--228, 2015.
[2]
D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent dirichlet allocation,” Journal of machine Learning research, vol. 3, no.1, pp. 993-1022, 2003.
[3]
F. Buckley and M. Lewinter, “A friendly introduction to graph theory,” Prentice Hall, 2003.
[4]
D. Chakrabarti and K. Punera, “Event Summarization using Tweets,” in Proc. of ICWSM’11, AAAI, 2011, pp. 66--73.
[5]
Y. Chen, et al., “Event detection using customer care calls,” in Proc. of INFOCOM’13, 2013, pp. 1690-1698.
[6]
D. Corney, C. Martin, and A. Göker, “Two sides to every story: Subjective event summarization of sports events using Twitter,” in Proc. of the SoMuS ICMR 2014 Workshop, ACM, 2014.
[7]
A. Cui, M. Zhang, Y. Liu, et al., “Discover breaking events with popular hashtags in twitter,” in Proc. of CIKM’12, ACM, 2012, pp. 1794-1798.
[8]
N. Dehghani, M. Asadpour, “Graph-based Method for Summarized Storyline Generation in Twitter,” arXiv preprint arXiv:1504.07361, 2015.
[9]
J.L. Fleiss, B. Levin, C.P. Myunghee, “The measurement of interrater agreement. Statistical methods for rates and proportions,” no. 2, pp. 212-236, 1981.
[10]
B. Guo, H. Chen, Z. Yu, X. Xie, S. Huangfu, D. Zhang, “FlierMeet: A Mobile Crowdsensing System for Cross-Space Public Information Reposting, Tagging, and Sharing,” IEEE Transactions on Mobile Computing, vol. 14, no. 10, 2015, pp. 2020-2033.
[11]
M. Halkidi, Y. Batistakis, and M. Vazirgiannis, “On clustering validation techniques,” Journal of intelligent information systems, vol. 17, no. 2-3, pp. 107-145, 2001.
[12]
T. Hua, X. Zhang, W. Wang, et al., “Automatical Storyline Generation with Help from Twitter,” in Proc. of CIKM’16, ACM, 2016, pp. 2383-2388.
[13]
L. Huang and L. Huang, “Optimized Event Storyline Generation based on Mixture-Event-Aspect Model,” in Proc. of EMNLP’13, ACL, 2013, pp. 726--735.
[14]
L. Hubert and P. Arabie, “Comparing partitions,” Journal of classification, vol. 2, no. 1, pp. 193-218, 1985.
[15]
J. Kim, A. Monroy-Hernandez, “Storia: Summarizing social media content based on narrative theory using crowdsourcing,” in Proc. of CSCW’16. ACM, 2016, pp. 1018-1027.
[16]
J. Kleinberg, D. Easley, Networks, Crowds, and Markets, Cambridge University Press, 2010.
[17]
P. Lee, et al., “CAST: A Context-Aware Story-Teller for Streaming Social Content,” in Proc. of CIKM’14, ACM, 2014, pp. 789-798.
[18]
C. Lin, et al., “Generating event storylines from microblogs,” in Proc. of CIKM’15, ACM, 2012, pp. 175-184.
[19]
C. Y. Lin, “Rouge: A package for automatic evaluation of summaries,” in Proc. of ACL’04 workshop, 2004, pp. 1-8.
[20]
H. Lin and J. Bilmes, “Multi-document summarization via budgeted maximization of submodular functions,” in Proc. of ACL’10, 2010, pp. 912-920.
[21]
D. G. Lowe, “Object recognition from local scale-invariant features,” in Proc. of ICCV’99, 1999, pp. 1150-1157.
[22]
M. Mathioudakis, N. Koudas, “TwitterMonitor: trend detection over the Twitter stream,” in Proc. Of SIGMOD’10, ACM, 2010, pp. 1155-1158.
[23]
A. J. McMinn, Y. Moshfeghi, and J. M. Jose, “Building a large-scale corpus for evaluating event detection on twitter,” in Proc. of CIKM’13, ACM, 2013, pp. 409-418
[24]
P. Meladianos, et al., “Degeneracy-based real-time sub-event detection in twitter stream,” in Proc. of ICWSM’15, 2015, pp. 248-257.
[25]
J. Nichols, J. Mahmud, and C. Drews, “Summarizing sporting events using twitter,” in Proc. of IUI’12, ACM, 2012, pp. 189-198.
[26]
J. Pearl, “Causality,” Cambridge university press, 2009.
[27]
K. Rudra, et al., “Extracting Situational Information from Microblogs during Disaster Events: a Classification-Summarization Approach,” in Proc. of CIKM’15, ACM, 2015, pp. 583-592.
[28]
T. Sakaki, M. Okazaki, and Y. Matsuo, “Earthquake shakes Twitter users: real-time event detection by social sensors”, in Proc. of WWW’10, ACM, 2010, pp. 851-860.
[29]
M. Schinas, S. Papadopoulos, Y. Kompatsiaris, et al., “Visual event summarization on social media using topic modelling and graph-based ranking algorithms,” in Proc. of ICMR’15, ACM, 2015, pp. 203-210.
[30]
M. Schinas, S. Papadopoulos, Y. Kompatsiaris, et al., “StreamGrid: Summarization of Large Scale Events using Topic Modelling and Temporal Analysis,” SoMuS@ ICMR, 2014.
[31]
X. Shang, H. Zhang, T S. Chua, “Deep learning generic features for cross-media retrieval,” International Conference on Multimedia Modeling. Springer International Publishing, 2016, pp. 264-275.
[32]
C. Shen, F. Liu, F. Weng, et al., “A Participant-based Approach for Event Summarization Using Twitter Streams,” HLT-NAACL, 2013, pp. 1152-1162.
[33]
J. Song J, Y. Yang, Y. Yang, et al., “Inter-media hashing for large-scale retrieval from heterogeneous data sources,” in Proc. of SIGMOD’13. ACM, 2013, pp. 785-796.
[34]
S. Tang, F. Wu, S. Li, et al., “Sketch the Storyline with CHARCOAL: A Non-Parametric Approach,” IJCAI, 2015, pp. 3841-3848.
[35]
D. Wang, T. Li, and M. Ogihara, “Generating Pictorial Storylines Via Minimum-Weight Connected Dominating Set Approximation in Multi-View Graphs”, in Proc. of AAAI’12, 2012.
[36]
W. Wang, et al., “Effective deep learning-based multi-modal retrieval,” The VLDB Journal, vol. 25, no. 1, pp. 79-101, 2016.
[37]
Y. Wei, Y. Zhao, Z. Zhu, et al., “Modality-dependent cross-media retrieval,” ACM Transactions on Intelligent Systems and Technology (TIST), vol. 7, no. 4, pp. 57, 2016.
[38]
A. Weiler, M H. Scholl, F. Wanner, et al., “Event identification for local areas using social media streaming data,” in Proc. of the ACM SIGMOD Workshop on Databases and Social Networks. ACM, 2013, pp. 1-6.
[39]
A. Witayangkurn, T. Horanont, Y. Sekimoto, et al., “Anomalous event detection on large-scale gps data from mobile phones using hidden markov model and cloud platform,” in Proc. of UbiComp’13. ACM, 2013, pp. 1219-1228.
[40]
F. Wu, X. Lu, Z. Zhang, et al., “Cross-media semantic representation via bi-directional learning to rank,” in Proc. of MM’13, ACM, 2013, pp. 877-886.
[41]
D. Wu, Q. Liu, Y. Li, et al., “Adaptive Lookup of Open WiFi Using Crowdsensing,” IEEE/ACM Transactions on Networking, vol. 24, no. 6, pp. 3634-3647, 2016.
[42]
J. Xu, T C. Lu, “Seeing the big picture from microblogs: Harnessing social signals for visual event summarization,” in Proc. of IUI’15. ACM, 2015, pp. 62-66.
[43]
Z. Yu, F. Wu, Y. Yang, et al., “Discriminative coupled dictionary hashing for fast cross-media retrieval,” in Proc. of SIGIR’14, ACM, 2014, pp. 395-404.
[44]
D. Zhang and W.J. Li, “Large-Scale Supervised Multimodal Hashing with Semantic Correlation Maximization,” in Proc. of AAAI’14, 2014.
[45]
Q. Zhang, B. Goncalves, “Topical differences between Chinese language Twitter and Sina Weibo,” in Proc. of WWW’16, 2016.
[46]
J. Zhou, G. Ding, Y. Guo, “Latent semantic sparse hashing for cross-modal similarity search,” in Proc. of SIGIR’14, 2014, pp. 415-424.
[47]
W. Zhou, et al., “Generating textual storyline to improve situation awareness in disaster management,” in Proc. IRI’14, 2014, pp. 585-592.
[48]
L. J. Griffin, “Narrative, event-structure analysis, and causal interpretation in historical sociology,” American journal of Sociology, vol. 98, no. 5, pp. 1094-1133, 1993.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies  Volume 1, Issue 3
September 2017
2023 pages
EISSN:2474-9567
DOI:10.1145/3139486
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 September 2017
Accepted: 01 July 2017
Revised: 01 May 2017
Received: 01 February 2017
Published in IMWUT Volume 1, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Correlation
  2. Event Sensing
  3. Fine-grained
  4. Mobile Crowdsourcing
  5. Storyline

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)27
  • Downloads (Last 6 weeks)1
Reflects downloads up to 09 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media