research-article

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models

Authors:

Dell ZhangAuthors Info & Claims

SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 515 - 524

https://doi.org/10.1145/3077136.3080786

Published: 07 August 2017 Publication History

Abstract

This paper provides a unified account of two schools of thinking in information retrieval modelling: the generative retrieval focusing on predicting relevant documents given a query, and the discriminative retrieval focusing on predicting relevancy given a query-document pair. We propose a game theoretical minimax game to iteratively optimise both models. On one hand, the discriminative model, aiming to mine signals from labelled and unlabelled data, provides guidance to train the generative model towards fitting the underlying relevance distribution over documents given the query. On the other hand, the generative model, acting as an attacker to the current discriminative model, generates difficult examples for the discriminative model in an adversarial way by minimising its discrimination objective. With the competition between these two models, we show that the unified framework takes advantage of both schools of thinking: (i) the generative model learns to fit the relevance distribution over documents via the signals from the discriminative model, and (ii) the discriminative model is able to exploit the unlabelled data selected by the generative model to achieve a better estimation for document ranking. Our experimental results have demonstrated significant performance gains as much as 23.96% on Precision@5 and 15.50% on MAP over strong baselines in a variety of applications including web search, item recommendation, and question answering.

References

[1]

Ricardo Baeza-Yates, Berthier Ribeiro-Neto, and others. 1999. Modern information retrieval.

[2]

Oren Barkan and Noam Koenigstein 2016. Item2vec: neural item embedding for collaborative filtering MLSP Workshop.

[3]

Chris Burges, Tal Shaked, Erin Renshaw, Ari Lazier, Matt Deeds, Nicole Hamilton, and Greg Hullender 2005. Learning to Rank Using Gradient Descent. In ICML.

Digital Library

[4]

Christopher J. C. Burges. 2010. From RankNet to LambdaRank to LambdaMART: An Overview. Learning (2010).

[5]

Christopher J. C. Burges, Robert Ragno, and Quoc Viet Le. 2006. Learning to Rank with Nonsmooth Cost Functions. In NIPS.

[6]

Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li 2007. Learning to rank: from pairwise approach to listwise approach ICML.

[7]

Wei Chen, Tie-Yan Liu, Yanyan Lan, Zhi-Ming Ma, and Hang Li 2009. Ranking Measures and Loss Functions in Learning to Rank NIPS. 315--323.

[8]

Paul Covington, Jay Adams, and Emre Sargin. 2016. Deep neural networks for youtube recommendations. RecSys.

Digital Library

[9]

Cícero Nogueira dos Santos, Ming Tan, Bing Xiang, and Bowen Zhou 2016. Attentive Pooling Networks. CoRR (2016).

[10]

Minwei Feng, Bing Xiang, Michael R Glass, Lidan Wang, and Bowen Zhou 2015. Applying deep learning to answer selection: A study and an open task ASRU Workshop.

[11]

Yoav Freund, Raj Iyer, Robert E Schapire, and Yoram Singer. 2003. An Efficient Boosting Algorithm for Combining Preferences. JMLR (2003).

[12]

Ian Goodfellow. 2016. NIPS 2016 Tutorial: Generative Adversarial Networks. arXiv preprint arXiv:1701.00160 (2016).

[13]

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In NIPS.

[14]

Ian J. Goodfellow. 2014. On Distinguishability Criteria for Estimating Generative Models. arXiv:1412.6515 (2014).

[15]

Michael Gutmann and Aapo Hyvärinen 2010. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models AISTATS.

[16]

Ralf Herbrich, Thore Graepel, and Klaus Obermayer. 2000. Large Margin Rank Boundaries for Ordinal Regression. Advances in Large Margin Classifiers.

[17]

Thomas Hofmann. 1999. Probabilistic Latent Semantic Indexing. In SIGIR.

Digital Library

[18]

Ferenc Huszár. 2015. How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary? arXiv preprint arXiv:1511.05101 (2015).

[19]

Thorsten Joachims. 2002. Optimizing search engines using clickthrough data. KDD.

Digital Library

[20]

Yoon Kim 2014. Convolutional Neural Networks for Sentence Classification. arXiv preprint arXiv:1408.5882 (2014).

[21]

Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer (2009).

Digital Library

[22]

John Lafferty and Chengxiang Zhai 2002. Probabilistic Relevance Models Based on Document and Query Generation Language Modeling and Information Retrieval.

[23]

Ping Li, Christopher J. C. Burges, Qiang Wu, J. C. Platt, D. Koller, Y. Singer, and S. Roweis 2007. McRank: Learning to Rank Using Multiple Classification and Gradient Boosting. NIPS.

[24]

Tie-Yan Liu. 2009. Learning to rank for information retrieval. Foundations and Trends in Information Retrieval (2009).

[25]

Tie-Yan Liu, Jun Xu, Tao Qin, Wenying Xiong, and Hang Li 2007. LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval Proceedings of SIGIR 2007 Workshop on Learning to Rank for Information Retrieval.

[26]

Jiyun Luo, Sicong Zhang, and Hui Yang 2014. Win-win search: dual-agent stochastic game in session search SIGIR.

[27]

H. Brendan McMahan, Gary Holt, David Sculley, Michael Young, Dietmar Ebner, Julian Grady, Lan Nie, Todd Phillips, Eugene Davydov, Daniel Golovin, and others 2013. Ad click prediction: a view from the trenches. In KDD.

Digital Library

[28]

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean 2013. Distributed representations of words and phrases and their compositionality NIPS.

[29]

Mehdi Mirza and Simon Osindero 2014. Conditional Generative Adversarial Nets. arXiv preprint arXiv:1411.1784 (2014).

[30]

In Jae Myung. 2003. Tutorial on maximum likelihood estimation. Journal of mathematical Psychology (2003).

[31]

Ramesh Nallapati. 2004. Discriminative Models for Information Retrieval. SIGIR.

Digital Library

[32]

Jay M. Ponte and W. Bruce Croft 1998. A language modeling approach to information retrieval SIGIR.

[33]

Steffen Rendle. 2010. Factorization machines. In ICDM.

Digital Library

[34]

Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme 2009. BPR: Bayesian personalized ranking from implicit feedback UAI.

[35]

Stephen E. Robertson and K. Sparck Jones 1976. Relevance weighting of search terms. Journal of the American Society for Information science (1976).

[36]

Tim Salimans, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, and Xi Chen. 2016. Improved Techniques for Training GANs. In NIPS.

[37]

Aliaksei Severyn and Alessandro Moschitti 2015. Learning to rank short text pairs with convolutional deep neural networks SIGIR.

[38]

Richard S. Sutton, David A. McAllester, Satinder P. Singh, Yishay Mansour, and others 1999. Policy Gradient Methods for Reinforcement Learning with Function Approximation NIPS.

[39]

Tao Tao and ChengXiang Zhai 2006. Regularized estimation of mixture models for robust pseudo-relevance feedback SIGIR. ACM, 162--169.

[40]

Di Wang and Eric Nyberg 2015. A Long Short-Term Memory Model for Answer Sentence Selection in Question Answering ACL.

[41]

Jun Wang, Arjen P. De Vries, and Marcel J. T. Reinders. 2006. Unifying user-based and item-based collaborative filtering approaches by similarity fusion SIGIR.

[42]

Ronald J. Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning (1992).

[43]

Yingce Xia, Di He, Tao Qin, Liwei Wang, Nenghai Yu, Tie-Yan Liu, and Wei-Ying Ma 2016. Dual Learning for Machine Translation. In NIPS.

[44]

Lantao Yu, Weinan Zhang, Jun Wang, and Yong Yu. 2017. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient AAAI.

[45]

Shipeng Yu, Deng Cai, Ji-Rong Wen, and Wei-Ying Ma. 2003. Improving pseudo-relevance feedback in web information retrieval using web page segmentation. In WWW. ACM, 11--18.

Digital Library

[46]

Fajie Yuan, Guibing Guo, Joemon M. Jose, Long Chen, Haitao Yu, and Weinan Zhang. 2016. Lambdafm: learning optimal ranking with factorization machines using lambda surrogates CIKM.

[47]

ChengXiang Zhai. 2016. Towards a game-theoretic framework for text data retrieval. IEEE Data Eng. Bull. (2016).

[48]

Chengxiang Zhai and John Lafferty 2004. A study of smoothing methods for language models applied to information retrieval. TOIS (2004).

[49]

ChengXiang Zhai and John D. Lafferty 2001. Model-based Feedback in the Language Modeling Approach to Information Retrieval CIKM.

Digital Library

[50]

Peng Zhang, Qian Yu, Yuexian Hou, Dawei Song, Jingfei Li, and Bin Hu. 2017. A Distribution Separation Method Using Irrelevance Feedback Data for Information Retrieval. ACM TIST (2017).

[51]

Weinan Zhang, Tianqi Chen, Jun Wang, and Yong Yu. 2013. Optimizing top-n collaborative filtering via dynamic negative item sampling SIGIR.

Cited By

Liu Y(2025)A generative and discriminative model for diversity-promoting recommendationInformation Systems10.1016/j.is.2024.102488128(102488)Online publication date: Feb-2025
https://doi.org/10.1016/j.is.2024.102488
Niu YXing XJia ZLiu RXin M(2025)Implicit local–global feature extraction for diffusion sequence recommendationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109471139(109471)Online publication date: Jan-2025
https://doi.org/10.1016/j.engappai.2024.109471
Zhang AYu YXu GGao RZhang LGao SYin H(2025)Hyperbolic Adversarial Learning for Personalized Item RecommendationDatabase Systems for Advanced Applications10.1007/978-981-97-5555-4_20(303-312)Online publication date: 12-Jan-2025
https://doi.org/10.1007/978-981-97-5555-4_20
Show More Cited By

Index Terms

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking

Recommendations

Learning to find answers to questions on the Web

We introduce a method for learning to find documents on the Web that contain answers to a given natural language question. In our approach, questions are transformed into new queries aimed at maximizing the probability of retrieving answers from ...
Improving Convergence in IRGAN with PPO
CoDS COMAD 2020: Proceedings of the 7th ACM IKDD CoDS and 25th COMAD

Information retrieval modeling aims to optimise generative and discriminative retrieval strategies, where, generative retrieval focuses on predicting query-specific relevant documents and discriminative retrieval tries to predict relevancy given a query-...
An intelligent platform for information retrieval
Proceedings of the 2005 joint Chinese-German conference on Cognitive systems

Information Retrieval (IR) has played a very important role in our modern life. However, the results of search engines are not satisfactory for human intelligent activities. The platform proposed in this paper tried to solve the problems from three ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval

August 2017

1476 pages

ISBN:9781450350228

DOI:10.1145/3077136

General Chairs:
Noriko Kando
National Institute of Informatics
,
Tetsuya Sakai
Waseda University
,
Hideo Joho
University of Tsukuba
,
Program Chairs:
Hang Li
Huawei Noah's Ark Lab
,
Arjen P. de Vries
Radboud University
,
Ryen W. White
Microsoft Cortana

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 August 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Badges

Honorable Mention

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China
Shanghai Sailing Program

Conference

SIGIR '17

Sponsor:

SIGIR

SIGIR '17: The 40th International ACM SIGIR conference on research and development in Information Retrieval

August 7 - 11, 2017

Tokyo, Shinjuku, Japan

Acceptance Rates

SIGIR '17 Paper Acceptance Rate 78 of 362 submissions, 22%;

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

428
Total Citations
View Citations
3,398
Total Downloads

Downloads (Last 12 months)190
Downloads (Last 6 weeks)28

Reflects downloads up to 13 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Liu Y(2025)A generative and discriminative model for diversity-promoting recommendationInformation Systems10.1016/j.is.2024.102488128(102488)Online publication date: Feb-2025
https://doi.org/10.1016/j.is.2024.102488
Niu YXing XJia ZLiu RXin M(2025)Implicit local–global feature extraction for diffusion sequence recommendationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109471139(109471)Online publication date: Jan-2025
https://doi.org/10.1016/j.engappai.2024.109471
Zhang AYu YXu GGao RZhang LGao SYin H(2025)Hyperbolic Adversarial Learning for Personalized Item RecommendationDatabase Systems for Advanced Applications10.1007/978-981-97-5555-4_20(303-312)Online publication date: 12-Jan-2025
https://doi.org/10.1007/978-981-97-5555-4_20
Peng BBo XGuo JZhang Y(2025)A Diffusion Model with User Preference Guidance for RecommendationDatabase Systems for Advanced Applications10.1007/978-981-97-5555-4_15(228-244)Online publication date: 12-Jan-2025
https://doi.org/10.1007/978-981-97-5555-4_15
Selwon KSzymański J(2024)Enhancing Personalized Travel Recommendations: Integrating User Behavior and Content AnalysisProceedings of the 32nd International Conference on Information Systems Development10.62036/ISD.2024.49Online publication date: 2024
https://doi.org/10.62036/ISD.2024.49
Yang ZHuang YChen YWu XFeng JDeng C(2024)CTGGAN: Controllable Text Generation with Generative Adversarial NetworkApplied Sciences10.3390/app1407310614:7(3106)Online publication date: 8-Apr-2024
https://doi.org/10.3390/app14073106
Ibrahim MYusof UEisa TNasser M(2024)Bioinspired Algorithms for Multiple Sequence Alignment: A Systematic Review and RoadmapApplied Sciences10.3390/app1406243314:6(2433)Online publication date: 13-Mar-2024
https://doi.org/10.3390/app14062433
Ma HWang CZhu HYang SZhang XZhang XWooldridge MDy JNatarajan S(2024)Enhancing cognitive diagnosis using un-interacted exercisesProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i8.28735(8877-8885)Online publication date: 20-Feb-2024
https://dl.acm.org/doi/10.1609/aaai.v38i8.28735
Zhang KQian XSong W(2024)GAN-based Fairness-Aware Recommendation for Enhancing the Fairness of DataGAN-based Fairness-Aware RecommendationFairness-Aware RecommendationProceedings of the 2024 Guangdong-Hong Kong-Macao Greater Bay Area International Conference on Digital Economy and Artificial Intelligence10.1145/3675417.3675468(315-321)Online publication date: 19-Jan-2024
https://dl.acm.org/doi/10.1145/3675417.3675468
Meng SZhou JChen XLiu YLu FHuang X(2024)Structure-Information-Based Reasoning over the Knowledge Graph: A Survey of Methods and ApplicationsACM Transactions on Knowledge Discovery from Data10.1145/367114818:8(1-42)Online publication date: 16-Aug-2024
https://dl.acm.org/doi/10.1145/3671148
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents