skip to main content
10.1145/3152494.3152505acmotherconferencesArticle/Chapter ViewAbstractPublication PagescodsConference Proceedingsconference-collections
research-article

A neural attention based approach for clickstream mining

Published: 11 January 2018 Publication History

Abstract

E-commerce has seen tremendous growth over the past few years, so much so that only those companies which analyze browsing behaviour of users, can hope to survive the stiff competition in market. Analyzing customer behaviour helps in modeling and recognizing purchase intent which is vital to e-commerce for providing improved personalization and better ranking of search results. In this work, we make use of user clickstreams to model browsing behaviour of users. But clickstreams are known to be noisy and hence generating features from clickstreams and using them in one go for building a predictive model may not always capture the purchase/intent characteristics. There are multiple aspects within clickstreams which are to be considered such as the sequence (path) and temporal behaviour. Hence we model clickstreams as having multiple views, each view, concentrating on an aspect or a component of clickstream. In this work, we develop a Multi-View learning (MVL) framework that predicts whether users would make a purchase or not by analyzing their clickstreams. Recent advances in deep learning allow us to build neural networks that are able to extract complex latent features from the data with minimal human intervention. Separate models known as experts are trained on each view. The experts are then combined using an Expert-Attention (EA) network, where the attention part of the network tries to learn when to attend to which view of the data. Multiple variants have been proposed based on how EA network is trained. Yet another challenge is the extreme class imbalance present in the data since only a small fraction of clickstreams correspond to buyers. We propose a well informed undersampling strategy using autoencoders. This simple undersampling technique ensured that the model trained was not biased to non-buyers and resulted in much improved f-scores. Experimental results show that using EA networks, there is an improvement of 13% over single view methods. Moreover, it was also noticed that MVL using EA network performs better than conventional MVL methods such as Multiple Kernel Learning.

References

[1]
David Ben-Shimon, Alexander Tsikinovsky, Michael Friedmann, Bracha Shapira, Lior Rokach, and Johannes Hoerle. 2015. Recsys challenge 2015 and the yoochoose dataset. In Proceedings of the 9th ACM Conference on Recommender Systems. ACM, 357--358.
[2]
Donald J Berndt and James Clifford. 1994. Using Dynamic Time Warping to Find Patterns in Time Series. In KDD workshop, Vol. 10. Seattle, WA, 359--370.
[3]
Veronika Bogina, Tsvi Kuflik, and Osnat Mokryn. 2016. Learning Item Temporal Dynamics for Predicting Buying Sessions. In Proceedings of the 21st International Conference on Intelligent User Interfaces. ACM, 251--255.
[4]
Randolph E Bucklin and Catarina Sismeiro. 2009. Click here for Internet insight: Advances in clickstream data analysis in marketing. Journal of Interactive Marketing 23, 1 (2009), 35--48.
[5]
Chih-Chung Chang and Chih-Jen Lin. 2011. LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2 (2011), 27.
[6]
Yoon Ho Cho, Jae Kyeong Kim, and Soung Hie Kim. 2002. A personalized recommender system based on web usage mining and decision tree induction. Expert systems with Applications 23, 3 (2002), 329--342.
[7]
Chester Curme, Tobias Preis, H Eugene Stanley, and Helen Susannah Moat. 2014. Quantifying the semantics of search behavior before stock market moves. Proceedings of the National Academy of Sciences 111, 32 (2014), 11600--11605.
[8]
Krzysztof Dembczynski, Wojciech Kotlowski, and Dawid Weiss. 2008. Predicting ads click-through rate with decision rules. In Workshop on targeting and ranking in online advertising, Vol. 2008.
[9]
Philippe Fournier-Viger, Antonio Gomariz, Manuel Campos, and Rincy Thomas. 2014. Fast vertical mining of sequential patterns using co-occurrence information. In Advances in Knowledge Discovery and Data Mining. Springer, 40--52.
[10]
Geoffrey E Hinton and Ruslan R Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. Science 313, 5786 (2006), 504--507.
[11]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735--1780.
[12]
Rajan Lukose, Jiye Li, Jing Zhou, and Satyanarayana Raju Penmetsa. 2008. Learning user purchase intent from user-centric data. In PAKDD. Springer, 673--680.
[13]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).
[14]
Wendy W Moe. 2003. Buying, searching, or browsing: Differentiating between online shoppers using in-store navigational clickstream. Journal of consumer psychology 13, 1 (2003), 29--39.
[15]
Wendy W Moe and Peter S Fader. 2004. Dynamic conversion behavior at e-commerce sites. Management Science 50, 3 (2004), 326--335.
[16]
Alan L Montgomery, Shibo Li, Kannan Srinivasan, and John C Liechty. 2004. Modeling online browsing and path analysis using clickstream data. Marketing Science 23, 4 (2004), 579--595.
[17]
Jooyoung Park and Irwin W Sandberg. 1991. Universal approximation using radial-basis-function networks. Neural computation 3, 2 (1991), 246--257.
[18]
Barak A Pearlmutter. 1989. Learning state space trajectories in recurrent neural networks. Neural Computation 1, 2 (1989), 263--269.
[19]
Manoj Kumar Priya, Siddhartha Ghosh. 2015. Visualizing Website Clickstream Data. (2015). http://www.ijraset.com/fileserve.php?FID=2593
[20]
Eric Sven Ristad and Peter N Yianilos. 1998. Learning string-edit distance. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 5 (1998), 522--532.
[21]
Peter Romov and Evgeny Sokolov. 2015. RecSys Challenge 2015: ensemble learning with categorical features. In Proceedings of the 2015 International ACM Recommender Systems Challenge. ACM, 1.
[22]
Dirk Van den Poel and Wouter Buckinx. 2005. Predicting online-purchasing behaviour. European Journal of Operational Research 166, 2 (2005), 557--575.
[23]
Zhenzhou Wu, Bao Hong Tan, Rubing Duan, Yong Liu, and Rick Siow Mong Goh. 2015. Neural Modeling of Buying Behaviour for E-Commerce from Clicking Patterns. In Proceedings of the 2015 International ACM Recommender Systems Challenge. ACM, 12.
[24]
Peng Yan, Xiaocong Zhou, and Yitao Duan. 2015. E-Commerce Item Recommendation Based on Field-aware Factorization Machine. In Proceedings of the 2015 International ACM Recommender Systems Challenge. ACM, 2.
[25]
Mingyue Zhang, Guoqing Chen, and Qiang Wei. 2016. Discovering ConsumersâĂŹ Purchase Intentions Based on Mobile Search Behaviors. In Flexible Query Answering Systems 2015. Springer, 15--28.

Cited By

View all
  • (2019)Simplifying E-Commerce Analytics by Discovering Hidden Knowledge in Big Data ClickstreamsPutting Social Media and Networking Data in Practice for Education, Planning, Prediction and Recommendation10.1007/978-3-030-33698-1_4(51-74)Online publication date: 28-Dec-2019

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data
January 2018
379 pages
ISBN:9781450363419
DOI:10.1145/3152494
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 January 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. LSTMs
  2. attention
  3. behaviour modeling
  4. class imbalance
  5. clickstreams
  6. multi-view learning

Qualifiers

  • Research-article

Conference

CoDS-COMAD '18

Acceptance Rates

CODS-COMAD '18 Paper Acceptance Rate 50 of 150 submissions, 33%;
Overall Acceptance Rate 197 of 680 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Simplifying E-Commerce Analytics by Discovering Hidden Knowledge in Big Data ClickstreamsPutting Social Media and Networking Data in Practice for Education, Planning, Prediction and Recommendation10.1007/978-3-030-33698-1_4(51-74)Online publication date: 28-Dec-2019

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media