skip to main content
10.1145/1645953.1646175acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
poster

Effective and efficient structured retrieval

Published: 02 November 2009 Publication History

Abstract

Search engines that support structured documents typically support structure created by the author (e.g., title, section), and may also support structure added by an annotation process (e.g., part of speech, named entity, semantic role). Exploiting such structure can be difficult. Query structure may fail to match structure in a relevant document for a variety of reasons, thus structured queries, although containing more information than keyword queries, are often less effective than unstructured queries. This paper studies retrieval of sentences with annotations for a question answering task. Three problems of structured retrieval are identified and solutions proposed. Structural mismatch is addressed by query structure expansion of predicted relevant structures. Lack of presence of all key aspects of a question is solved by Boolean filtering of result sentences. The score variations of the annotator generated fields with all the different lengths are accounted for by using field specific smoothing. Experiments show that each solution incrementally improves structured retrieval, and a combination of Boolean filtering, structural expansion, and keyword queries outperforms keyword and simple structured retrieval baselines.

References

[1]
Matthew W. Bilotti, Paul Ogilvie, Jamie Callan and Eric Nyberg. Structured Retrieval for Question Answering. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. 2007.
[2]
Le Zhao and Jamie Callan. A Generative Retrieval Model for Structured Documents. In Proceedings of CIKM 2008.
[3]
Norbert Gövert and Gabriella Kazai. Overview of the INitiative for the Evaluation of XML retrieval (INEX) 2002, 2002.
[4]
Andrew Trotman and Mounia Lalmas. Why Structural Hints in Queries do not Help XML-Retrieval. In Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 711--712. 2006.
[5]
Hui Fang, Tao Tao and Chengxiang Zhai. A formal study of information retrieval heuristics. In Proceedings of the 27th international ACM SIGIR conference on Research and development in information retrieval. 2004.
[6]
TREC Legal track homepage (retrieved on Jan 2. 2009): http://trec-legal.umiacs.umd.edu/
[7]
John Prager, Jennifer Chu-Carroll, Eric Brown, Krzysztof Czuba. Question Answering By Predictive Annotation. In Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval. 2000.
[8]
INDRI - Language modeling meets inference networks. http://www.lemurproject.org/indri/. Retrieved Jan 2, 2009.
[9]
Mengqiu Wang, Noah Smith and Teruko Mitamura. What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA. Proceedings of EMNLP '07.
[10]
Nico Schlaefer, Jeongwoo Ko, Justin Betteridge, Guido Sautter, Manas Pathak and Eric Nyberg. Semantic Extensions of the Ephyra QA System for TREC 2007. In Proceedings of the Sixteenth Text REtrieval Conference, TREC 2007.
[11]
Wilfrid Lancaster. Information Retrieval Systems: Characteristics, Testing and Evaluation. Wiley, New York, 1968.
[12]
Stephen Harter. Online information retrieval: concepts, principles, and techniques Academic Press, San Diego, California, 1986.
[13]
Marti Hearst. Improving full-text precision on short queries using simple constraints. In Proceedings the Fifth Annual Symposium on Document Analysis and Information Retrieval (SDAIR 1996), 1996.
[14]
Mandar Mitra, Amit Singhal and Chris Buckley, Improving automatic query expansion, Proceedings the 21st annual international ACM SIGIR conference on Research and development in information retrieval, p. 206--214, 1998.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
November 2009
2162 pages
ISBN:9781605585123
DOI:10.1145/1645953
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. boolean filtering
  2. indri query language
  3. question answering
  4. structural mismatch
  5. structured query formulation

Qualifiers

  • Poster

Conference

CIKM '09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 520 of 2,712 submissions, 19%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Nov 2024

Other Metrics

Citations

Cited By

View all

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media