skip to main content
10.1145/1321440.1321554acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Finding and linking incidents in news

Published: 06 November 2007 Publication History

Abstract

News reports are being produced and disseminated in overwhelming volume, making it difficult to keep up with the newest information. Most previous research in automatic news organization treated news topics as a flat list, ignoring the intrinsic connection among individual reports. We argue that more contextual information within and across the topics will benefit users in their news understanding process.
A news organization infrastructure, incident threading, is proposed in this article. All text snippets describing the occurrence of a real-world happening are combined into a news incident, and a network is composed of incidents that are interconnected by links in certain types. A limited vocabulary of connection types is defined and corresponding rules are established based upon the human experience of news understanding.
The incident threading system is implemented with two different algorithms. One starts from clustering of text passages and then creates links with pre-built rules. The other method defines a global score function over the whole collection and solves the optimization problem with simulated annealing. The former achieves higher accuracy in the identification of incidents and the latter generates better links, which is preferred since the links are more important for the formation of the incident network.

References

[1]
J. Allan, editor. Topic Detection and Tracking: event-based information organization. Kluwer Academic Publishers, 2002.
[2]
M. Connell, A. Feng, G. Kumaran, H. Raghavan, C. Shah, and J. Allan. UMass at TDT 2004. In Proceedings of TDT 2004, 2004. www.nist.gov/speech/tests/tdt/tdt2004/papers/UMass-TDT2004-paper.pdf.
[3]
F. Kilander. A brief comparison of news filtering software. Unpublished paper, 1995.
[4]
K. R. McKeown, R. Barzilay, D. Evans, V. Hatzivassiloglou, J. L. Klavans, C. Sable, B. Schiffman, and S. Sigelman. Tracking and summarizing news on a daily basis with Columbia's Newsblaster. In Proceedings of the Human Language Technology Conference, 2002.
[5]
R. Nallapati, A. Feng, F. Peng, and J. Allan. Event threading within news topics. In Proceedings of ACM Thirteenth Conference on Information and Knowledge Management, pages 446--453, 2004.
[6]
D. E. O'Leary. The Internet, intranets, and the AI renaissance. Computer, 30(1):71--78, 1997.
[7]
D. Radev, J. Otterbacher, A. Winkel, and S. Blair-Goldensohn. NewsInEssence: Summarizing online news topics. Communications of the ACM, 48(10):95--98, 2005.
[8]
R. C. Schank and R. P. Abelson. Scripts, Plans, Goals, and Understanding: an Inquiry into Human Knowledge Structure. Lawrence Erlbaum Associates, 1977.
[9]
R. E. Schapire and Y. Singer. BoosTexter: A boosting-based system for text categorization. Machine Learning, 39(2/3):135--168, 2000.
[10]
I. Soboroff. Overview of the TREC 2004 novelty track. In The Thirteenth Text Retrieval Conference. NIST, November 2004. http://trec.nist.gov/pubs/trec13/papers/NOVELTY.OVERVIEW.pdf.
[11]
D. Trieschnigg and W. Kraaij. Scalable hierarchical topic detection: exploring a sample based approach. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 655--656, 2005.
[12]
T. A. van Dijk. News as Discourse. Lawrence Erlbaum Associates, 1988.
[13]
Y. Yang, J. Carbonell, R. Brown, T. Pierce, B. T. Archibald, and X. Liu. Learning approaches for detection and tracking news events. IEEE Intelligent Systems Special Issue on Applications of Intelligent Information Retrieval, 14(4):32--43, 1999.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '07: Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
November 2007
1048 pages
ISBN:9781595938039
DOI:10.1145/1321440
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 November 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. automatic news organization
  2. global optimization
  3. incident threading
  4. simulated annealing
  5. threading rules

Qualifiers

  • Research-article

Conference

CIKM07

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media