skip to main content
10.1145/564376.564430acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

Using part-of-speech patterns to reduce query ambiguity

Published: 11 August 2002 Publication History

Abstract

Query ambiguity is a generally recognized problem, particularly in Web environments where queries are commonly only one or two words in length. In this study, we explore one technique that finds commonly occurring patterns of parts of speech near a one-word query and allows them to be transformed into clarification questions. We use a technique derived from statistical language modeling to show that the clarification queries will reduce ambiguity much of the time, and often quite substantially.

References

[1]
A.Leuski. Evaluating document clustering for interactive information retrieval. In Proceedings of the ACM CIKM 2001 Tenth International Conference on Information and Knowledge Management, pages 33--40, 2001.
[2]
A.Leuski. Interactive Information Organization: Techniques and Evaluation. PhD thesis, University of Massachusetts, Amherst, 2001.
[3]
Peter G. Anick and Suresh Tipirneni. The paraphrase search assistant: Terminological feedback for iterative information seeking. In Proceedings of the ACM Conference on Research in Information Retrieval (SIGIR), pages 153--159, 1999.
[4]
P.D. Bruza and S. Dennis. Query-reformulation on the internet: empirical data and the hyperindex search engine. In Proceedings of the RIAO Conference: Intelligent Text and Image Handling, pages 488--499, Montreal, Canada, 1997.
[5]
Peter Bruza, Robert McArthur, and Simon Dennis. Interactive internet search: Keyword, directory and query reformulation mechanisms compared. In Proceedings of the ACM Conference on Research in Information Retrieval (SIGIR), pages 280--287, 2000.
[6]
James P. Callan, W. Bruce Croft, and Stephen M. Harding. The INQUERY retrieval system. In Proceedings of DEXA-92, 3rd International Conference on Database and Expert Systems Applications, pages 78--83, 1992.
[7]
K.W. Church, W.Gale, P.Hanks, and D.Hindle. Using statistics in lexical analysis. In Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon, pages 115--164, 1991.
[8]
W.B. Croft, S. Cronen Townsend, and V.Lavrenko. Relevance feedback and personalization: A language modeling perspective. In Proceedings of the DELOS-NSF Worshop on Personalization and Recommender Systems in Digital Libraries, pages 49--54, 2001.
[9]
Steve Cronen-Townsend and W. Bruce Croft. Quantifying query ambiguity. In Proceedings of the Conference on Human Language Technology (HLT 2002), pages 94--98, 2002.
[10]
Steve Cronen-Townsend, Yun Zhou, and W. Bruce Croft. Predicting query performance. In Proceedings of the ACM Conference on Research in Information Retrieval (SIGIR), 2002.
[11]
D.Lawrie, W.B. Croft, and A.Rosenberg. Finding topic words for hierarchical summarization. In Proceedings of the ACM Conference on Research in Information Retrieval (SIGIR), pages 349--357, 2001.
[12]
G.Grefenstette. SQLET: Short query linguistic expansion techniques: Palliating one or two-word queries by providing intermediate structure to WWW pages. In Proceedings of RIAO '97. (1997), 1997.
[13]
Marti Hearst and Jan Pedersen. Reexamining the cluster hypothesis: Scatter/gather on retrieval results. In Proceedings of the ACM Conference on Research in Information Retrieval (SIGIR), pages 76--84, 1996.
[14]
J.Allan, A.Leuski, R. Swan, and D.Byrd. Evaluating combinations of ranked lists and visualizations of inter-document similarity. In Information Processing and Management (IPM), vol. 37, pages 435--458, 2000.
[15]
K.Franzen and J.Karlgren. Verbosity and interface design. Technical Report T2000:04, Swedish Institute of Computer Science (SICS), 2000. Also available at http://citeseer.nj.nec.com/313985.html.
[16]
R. Krovetz. Word Sense Disambiguation for Large Text Databases. PhD thesis, University of Massachusetts, Amherst, 1995.
[17]
Dawn Lawrie and W.B. Croft. Discovering and comparing topic hierarchies. In Proceedings of RIAO 2000 Conference, pages 314--330, 2000.
[18]
Anton Leuski and James Allan. Lighthouse: Showing the way to relevant information. In Proceedings of the IEEE Symposium on Information Visualization 2000 (InfoVis 2000), pages 125--130, 2000.
[19]
M.Sanderson and D.Lawrie. Building, testing, and applying concept hierarchies. In W. Bruce Croft, editor, Advances in Information Retrieval: Recent Research from the CIIR, W. Bruce Croft, ed., Kluwer Academic Publishers, chapter 9, pages 235--266. Kluwer Academic Press, 2000.
[20]
Mark Sanderson. Word sense disambiguation and information retrieval. In Proceedings of the ACM Conference on Research in Information Retrieval (SIGIR), pages 142--151, 1994.
[21]
Mark Sanderson and W. Bruce Croft. Deriving concept hierarchies from text. In Proceedings of the ACM Conference on Research in Information Retrieval (SIGIR), pages 206--213, 1999.
[22]
Richard Seltzer. Taking "snapshots" with livetopics: Watch out for mirages, 1997. http://www.samizdat.com/cobb/oct97.html.
[23]
Russell C. Swan and James Allan. Aspect windows, 3-d visualizations, and indirect comparisons of information retrieval systems. In Proceedings of the ACM Conference on Research in Information Retrieval (SIGIR), pages 173--181, 1998.
[24]
J. Xu, J. Broglio, and W.B. Croft. The design and implementation of a part of speech tagger for english. Technical Report IR-52, University of Massachusetts, Amherst, Center for Intelligent Information Retrieval, 1994.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
August 2002
478 pages
ISBN:1581135610
DOI:10.1145/564376
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 August 2002

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. clarity
  2. part of speech
  3. query ambiguity

Qualifiers

  • Article

Conference

SIGIR02
Sponsor:

Acceptance Rates

SIGIR '02 Paper Acceptance Rate 44 of 219 submissions, 20%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media