skip to main content
10.1145/3209978.3210061acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Chat More: Deepening and Widening the Chatting Topic via A Deep Model

Published: 27 June 2018 Publication History

Abstract

The past decade has witnessed the boom of human-machine interactions, particularly via dialog systems. In this paper, we study the task of response generation in open-domain multi-turn dialog systems. Many research efforts have been dedicated to building intelligent dialog systems, yet few shed light on deepening or widening the chatting topics in a conversational session, which would attract users to talk more. To this end, this paper presents a novel deep scheme consisting of three channels, namely global, wide, and deep ones. The global channel encodes the complete historical information within the given context, the wide one employs an attention-based recurrent neural network model to predict the keywords that may not appear in the historical context, and the deep one trains a Multi-layer Perceptron model to select some keywords for an in-depth discussion. Thereafter, our scheme integrates the outputs of these three channels to generate desired responses. To justify our model, we conducted extensive experiments to compare our model with several state-of-the-art baselines on two datasets: one is constructed by ourselves and the other is a public benchmark dataset. Experimental results demonstrate that our model yields promising performance by widening or deepening the topics of interest.

References

[1]
James F. Allen, Bradford W. Miller, Eric K. Ringger, and Teresa Sikorski . 1996. A Robust System for Natural Spoken Dialogue. In Proceedings of Annual Meeting of the Association for Computational Linguistics. ACL, 62--70.
[2]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio . 2014. Neural Machine Translation by Jointly Learning to Align and Translate. arXiv preprint arXiv:1409.0473 (2014).
[3]
Jimmy Lei Ba. Diederik P. Kingma . 2015. Adam: A Method for Stochastic Optimization. arXiv preprint arXiv:1412.6980 (2015).
[4]
Warren R. Greiff . 1998. A Theory of Term Weighting Based on Exploratory Data Analysis Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 11--19.
[5]
Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan . 2016 a. A Diversity-Promoting Objective Function for Neural Conversation Models Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technologies. ACL, 110--119.
[6]
Jiwei Li, Will Monroe, Alan Ritter, Dan Jurafsky, Michel Galley, and Jianfeng Gao . 2016 b. Deep Reinforcement Learning for Dialogue Generation Proceedings of the Conference on Empirical Methods in Natural Language Processing. ACL, 1192--1202.
[7]
Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, and Shuzi Niu . 2017. DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset Proceedings of the International Joint Conference on Natural Language Processing. ACL, 986--995.
[8]
Chia-Wei Liu, Ryan Lowe, Iulian Serban, Michael Noseworthy, Laurent Charlin, and Joelle Pineau . 2016. How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation Proceedings of the Conference on Empirical Methods in Natural Language Processing. ACL, 2122--2132.
[9]
Meng Liu, Liqiang Nie, Meng Wang, and Baoquan Chen . 2017. Towards Micro-video Understanding by Joint Sequential-Sparse Modeling Proceedings of the 2017 ACM on Multimedia Conference. ACM, 970--978.
[10]
Ryan Lowe, Nissan Pow, Iulian Serban, and Joelle Pineau . 2015. The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems. In Proceedings of the Annual Meeting of the Special Interest Group on Discourse and Dialogue. SIGDIAL, 285--294.
[11]
Hongyuan Mei, Mohit Bansal, and Matthew R. Walter . 2017. Coherent Dialogue with Attention-Based Language Models Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press, 3252--3258.
[12]
L. Nie, M. Wang, Y. Gao, Z. J. Zha, and T. S. Chua . 2013. Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information. IEEE Transactions on Multimedia Vol. 15, 2 (2013), 426--441.
[13]
L. Nie, M. Wang, L. Zhang, S. Yan, B. Zhang, and T. S. Chua . 2015. Disease Inference from Health-Related Questions via Sparse Deep Learning. IEEE Transactions on Knowledge and Data Engineering, Vol. 27, 8 (2015), 2107--2119.
[14]
Liqiang Nie, Yi-Liang Zhao, Xiangyu Wang, Jialie Shen, and Tat-Seng Chua . 2014. Learning to Recommend Descriptive Tags for Questions in Social Forums. ACM Trans. Inf. Syst. Vol. 32, 1 (2014), 5:1--5:23.
[15]
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu . 2002. BLEU: a Method for Automatic Evaluation of Machine Translation Proceedings of Annual Meeting of the Association for Computational Linguistics. ACL, 311--318.
[16]
Volha Petukhova, Martin Gropp, Dietrich Klakow, Anna Schmidt, Gregor Eigner, Mario Topf, Stefan Srb, Petr Motlicek, Blaise Potard, and John Dines . 2014. The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues Proceedings of International Conference on Language Resources and Evaluation. ELRA, 252--258.
[17]
Alan Ritter, Colin Cherry, and William B. Dolan . 2011. Data-driven Response Generation in Social Media. Proceedings of the Conference on Empirical Methods in Natural Language Processing. ACL, 583--593.
[18]
Thomas Roelleke . 2003. A Frequency-based and a Poisson-based Definition of the Probability of Being Informative Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval. ACM, 227--234.
[19]
Lina Maria Rojas-Barahona, Milica Gasic, Nikola Mrksic, Pei-Hao Su, Stefan Ultes, Tsung-Hsien Wen, Steve J. Young, and David Vandyke . 2017. A Network-based End-to-End Trainable Task-oriented Dialogue System Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics. ACL, 438--449.
[20]
Iulian Serban, Tim Klinger, Gerald Tesauro, Kartik Talamadupula, Bowen Zhou, Yoshua Bengio, and Aaron Courville . 2017 a. Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press, 3288--3294.
[21]
Iulian Vlad Serban, Alessandro Sordoni, Yoshua Bengio, Aaron C. Courville, and Joelle Pineau . 2016. Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence. AAAI Press, 3776--3784.
[22]
Iulian Vlad Serban, Alessandro Sordoni, Ryan Lowe, Laurent Charlin, Joelle Pineau, Aaron C. Courville, and Yoshua Bengio . 2017 b. A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press, 3295--3301.
[23]
Lifeng Shang, Zhengdong Lu, and Hang Li . 2015. Neural Responding Machine for Short-Text Conversation Proceedings of the Annual Meeting of the Association for Computational Linguistics on Natural Language Processing. ACL, 1577--1586.
[24]
Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Margaret Mitchell, Jian-Yun Nie, Jianfeng Gao, and Bill Dolan . 2015. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technologies. ACL, 196--205.
[25]
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le . 2014. Sequence to Sequence Learning with Neural Networks Proceedings of the Neural Information Processing Systems Conference on Neural Information Processing Systems. MIT Press, 3104--3112.
[26]
Hao Wang, Zhengdong Lu, Hang Li, and Enhong Chen . 2013. A Dataset for Research on Short-Text Conversations Proceedings of the Conference on Empirical Methods in Natural Language Processing. ACL, 935--945.
[27]
Mingxuan Wang, Zhengdong Lu, Hang Li, and Qun Liu . 2015. Syntax-Based Deep Matching of Short Texts. In Proceedings of the International Joint Conference on Artificial Intelligence. AAAI Press, 1354--1361.
[28]
Jason D. Williams, Antoine Raux, Deepak Ramachandran, and Alan W. Blac . 2013. The dialog state tracking challenge. In Proceedings of the SIGDIAL Conference on Discourse and Dialogue. SIGDIAL, 404--413.
[29]
Jason D. Williams and Geoffrey Zweig . 2016. End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning. arXiv preprint arXiv:1606.01269 (2016).
[30]
Ho Chung Wu, Robert Wing Pong Luk, Kam Fai Wong, and Kui Lam Kwok . 2008. Interpreting TF-IDF Term Weights As Making Relevance Decisions. ACM Transactions on Information System Vol. 26, 3 (2008), 13:1--13:37.
[31]
Yu Wu, Wei Wu, Chen Xing, Ming Zhou, and Zhoujun Li . 2017. Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots. In Proceedings of the Annual Meeting of the Association for Computational Linguistics. ACL, 496--505.
[32]
Chen Xing, Wei Wu, Yu Wu, Jie Liu, Yalou Huang, Ming Zhou, and Wei-Ying Ma . 2017. Topic Aware Neural Response Generation. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press, 3351--3357.
[33]
Rui Yan, Yiping Song, and Hua Wu . 2016. Learning to Respond with Deep Neural Networks for Retrieval-Based Human-Computer Conversation System. In Proceedings of the International ACM SIGIR conference on Research and Development in Information Retrieval. ACM, 55--64.
[34]
Rui Yan, Dongyan Zhao, and Weinan E. . 2017. Joint Learning of Response Ranking and Next Utterance Suggestion in Human-Computer Conversation System. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 685--694.
[35]
Kaisheng Yao, Geoffrey Zweig, and Baolin Peng . 2015. Attention with Intention for a Neural Network Conversation Model. arXiv preprint arXiv:1510.08565 (2015).

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
June 2018
1509 pages
ISBN:9781450356572
DOI:10.1145/3209978
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. deepening and widening topics
  2. multi-turn dialog dataset
  3. multi-turn dialog systems
  4. response generation

Qualifiers

  • Research-article

Funding Sources

  • National Natural Science Foundation of China
  • the National Basic Research Program of China (973 Program)
  • the Project of Thousand Youth Talents 2016

Conference

SIGIR '18
Sponsor:

Acceptance Rates

SIGIR '18 Paper Acceptance Rate 86 of 409 submissions, 21%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)24
  • Downloads (Last 6 weeks)1
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media