DOI: 10.1145/3077136.3080786
research-article

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models

Published: 07 August 2017

Abstract

This paper provides a unified account of two schools of thinking in information retrieval modelling: generative retrieval, which focuses on predicting relevant documents given a query, and discriminative retrieval, which focuses on predicting relevance given a query-document pair. We propose a game-theoretic minimax formulation to iteratively optimise both models. On one hand, the discriminative model, aiming to mine signals from labelled and unlabelled data, provides guidance to train the generative model towards fitting the underlying relevance distribution over documents given the query. On the other hand, the generative model, acting as an attacker to the current discriminative model, generates difficult examples for the discriminative model in an adversarial way by minimising its discrimination objective. With the competition between these two models, we show that the unified framework takes advantage of both schools of thinking: (i) the generative model learns to fit the relevance distribution over documents via the signals from the discriminative model, and (ii) the discriminative model is able to exploit the unlabelled data selected by the generative model to achieve a better estimation for document ranking. Our experimental results demonstrate significant performance gains of as much as 23.96% on Precision@5 and 15.50% on MAP over strong baselines in a variety of applications including web search, item recommendation, and question answering.
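
The alternating optimisation described in the abstract can be illustrated with a minimal sketch for a single query. This is not the authors' implementation: the function name `train_toy_game`, the tabular per-document scores, the learning rate, and the use of a REINFORCE-style policy gradient to update the generator over discrete documents are all assumptions made for illustration. The discriminator ascends log D(d) on a labelled relevant document and log(1 - D(d)) on a document sampled from the generator; the generator is then rewarded for sampling documents the discriminator scores highly.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def softmax(scores):
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def train_toy_game(relevant, n_docs, steps=200, lr=0.1, seed=0):
    """Hypothetical toy minimax game for one query (illustrative only)."""
    rng = random.Random(seed)
    docs = list(range(n_docs))
    gen_scores = [0.0] * n_docs  # generator logits: p(d|q) = softmax(gen_scores)
    dis_scores = [0.0] * n_docs  # discriminator logits: D(d|q) = sigmoid(score)

    for _ in range(steps):
        probs = softmax(gen_scores)

        # Discriminator step: gradient ascent on
        # log D(d_true) + log(1 - D(d_fake)) with respect to its logits.
        d_true = rng.choice(relevant)
        d_fake = rng.choices(docs, weights=probs)[0]
        grad_true = 1.0 - sigmoid(dis_scores[d_true])  # d/ds log sigmoid(s)
        grad_fake = -sigmoid(dis_scores[d_fake])       # d/ds log(1 - sigmoid(s))
        dis_scores[d_true] += lr * grad_true
        dis_scores[d_fake] += lr * grad_fake

        # Generator step (assumed REINFORCE-style update): minimising
        # log(1 - D(d)) equals maximising log(1 + exp(score)), used as reward.
        d_gen = rng.choices(docs, weights=probs)[0]
        reward = math.log1p(math.exp(dis_scores[d_gen]))
        for d in docs:
            # Gradient of log softmax probability of d_gen w.r.t. logit d.
            grad_log_p = (1.0 if d == d_gen else 0.0) - probs[d]
            gen_scores[d] += lr * reward * grad_log_p

    return softmax(gen_scores), [sigmoid(s) for s in dis_scores]
```

Calling `train_toy_game(relevant=[2], n_docs=5)` returns the generator's distribution over the five documents and the discriminator's relevance estimate for each. In this toy setup, documents outside `relevant` only ever receive negative discriminator updates, so their estimates never rise above 0.5.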



    Published In

    SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
    August 2017
    1476 pages
    ISBN:9781450350228
    DOI:10.1145/3077136

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Badges

    • Honorable Mention

    Author Tags

    1. adversarial training
    2. information retrieval
    3. information retrieval models
    4. question answering
    5. recommender systems
    6. web search

    Qualifiers

    • Research-article

    Funding Sources

    • National Natural Science Foundation of China
    • Shanghai Sailing Program

    Conference

    SIGIR '17

    Acceptance Rates

    SIGIR '17 Paper Acceptance Rate 78 of 362 submissions, 22%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%
