research-article

Scalable distributed inference of dynamic user interests for behavioral targeting

Authors:

Vanja Josifovski,

Alexander J. SmolaAuthors Info & Claims

KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 114 - 122

https://doi.org/10.1145/2020408.2020433

Published: 21 August 2011 Publication History

Abstract

Historical user activity is key for building user profiles to predict the user behavior and affinities in many web applications such as targeting of online advertising, content personalization and social recommendations. User profiles are temporal, and changes in a user's activity patterns are particularly useful for improved prediction and recommendation. For instance, an increased interest in car-related web pages may well suggest that the user might be shopping for a new vehicle.In this paper we present a comprehensive statistical framework for user profiling based on topic models which is able to capture such effects in a fully \emph{unsupervised} fashion. Our method models topical interests of a user dynamically where both the user association with the topics and the topics themselves are allowed to vary over time, thus ensuring that the profiles remain current.

We describe a streaming, distributed inference algorithm which is able to handle tens of millions of users. Our results show that our model contributes towards improved behavioral targeting of display advertising relative to baseline models that do not incorporate topical and/or temporal dependencies. As a side-effect our model yields human-understandable results which can be used in an intuitive fashion by advertisers.

References

[1]

D. Agarwal and S. Merugu. Predictive discrete latent factor models for large scale dyadic data. KDD, 2007.

Digital Library

[2]

A. Ahmed and E. P. Xing. Dynamic non-parametric mixture models and the recurrent chinese restaurant process In SDM, pages 219--230. SIAM, 2008.

[3]

A. Ahmed and E. P. Xing. Timeline: A dynamic hierarchical dirichlet process model for recovering birth / death and evolution of topics in text stream. In UAI, 2010.

[4]

A. Asuncion, P. Smyth, and M. Welling. Asynchronous distributed learning of topic models. In NIPS, pages 81--88. MIT Press, 2008.

[5]

D. Blackwell and J. MacQueen. Ferguson distributions via polya urn schemes. The Annals of Statistics, 1973.

[6]

D. Blei, A. Ng, and M. Jordan. Latent Dirichlet allocation. JMLR, 3:993--1022, 2003.

Digital Library

[7]

D. M. Blei and J. D. Lafferty. Dynamic topic models. In ICML, volume 148, pages 113--120. ACM, 2006.

Digital Library

[8]

D. M. Blei and J. D. McAuliffe. Supervised topic models. In NIPS. MIT Press, 2007.

[9]

K. R. Canini, L. Shi, and T. L. Griffiths. Online inference of topics with latent dirichlet allocation. In AISTATS, 2009.

[10]

Y. Chen, D. Pavlov, and J. F. Canny. Large-scale behavioral targeting. In KDD, pages 209--218, 2009.

Digital Library

[11]

S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman. Indexing by latent semantic analysis. Am. Soc. for Information Science, 41, 1990.

[12]

S. Gauch, M. Speretta, A. Chandramouli, and A. Micarelli. User profiles for personalized information access. In LNCS 4321, Springer, 2007.

Digital Library

[13]

R. Ghosh and M. Dekhil. Discovering user profiles.% In WWW, pages 1233--1234, 2009.

Digital Library

[14]

T.L. Griffiths and M. Steyvers. Finding scientific topics. PNAS, 101:5228--5235, 2004.

[15]

A. Hassan, R. Jones, and K. L. Klinkner. Beyond DCG: User behavior as a predictor of a successful search. In WSDM 2010, pages 221--230, 2010.

Digital Library

[16]

T. Hofmann. Unsupervised learning by probabilistic latent semantic analysis. Machine Learning, 2001.

Digital Library

[17]

T. Iwata, T. Yamada, Y. Sakurai, and N. Ueda. Online multiscale dynamic topic models. In KDD, 2010.

Digital Library

[18]

H. R. Kim and P. K. Chan. Learning implicit user interest hierarchy for context in personalization. In IUI, 2003.

Digital Library

[19]

R. Kumar and A. Tomkins. A characterization of online search behavior. In WWW, 561--570, 2010.

Digital Library

[20]

L. Li, Z. Yang, B. Wang, and M. Kitsuregawa. Dynamic adaptation strategies for long-term and short-term user profile to personalize search. In ADWM, 2007.

Digital Library

[21]

L. Li, W. Chu, J. Langford, and R. Schapire. A contextual bandit approach to personalized news article recommendation. In WWW, 661--670, 2010.

Digital Library

[22]

J. Mellor-Crummey and M. L. Scott. Algorithms for scalable synchronization on shared-memory multiprocessors. ACM TOCS, 9(1):21--65, February 1991.

Digital Library

[23]

F. J. Provost, B. Dalessandro, R. Hook, X. Zhang, and A. Murray. Audience selection for on-line brand advertising: privacy-friendly social network targeting. In KDD, pages 707--716, 2009.

Digital Library

[24]

A.J. Smola and S. Narayanamurthy. An architecture for parallel topic models. In VLDB, 2010.

Digital Library

[25]

K. Sugiyama, K. Hatano, and M. Yoshikawa. Adaptive web search based on user profile constructed without any effort from users. In WWW, pages 675--684, 2004.

Digital Library

[26]

Y. Teh, M. Jordan, M. Beal, and D. Blei. Hierarchical dirichlet processes. JASA, 2006.

[27]

L. Yao, D. Mimno, and A. McCallum. Efficient methods for topic model inference on streaming document collections. In KDD'09, 2009.

Digital Library

[28]

Y. Wang, H. Bai, M. Stanton, W. Chen, andE. Chang. PLDA: Parallel latent dirichlet allocationfor large-scale applications. In Proc. of 5thInternational Conference on Algorithmic Aspects inInformation and Management, 2009.

Digital Library

[29]

D. Newman, A. Asuncion, P. Smyth and M. Welling. Distributed Algorithms for Topic Models. In Journal of Machine Learning Research, 2009.

Digital Library

[30]

H. Wallach, D. Mimno and A. McCallum. Rethinking LDA: Why Priors Matter. In Advances in Neural Information Processing Systems 22, 2009.

Cited By

Lu SYang SYao Y(2024)Within-Category Satiation and Cross-Category Spillover in Multiproduct AdvertisingJournal of Marketing10.1177/00222429241274727Online publication date: 20-Nov-2024
https://doi.org/10.1177/00222429241274727
Zarrinkalam FNoughabi HNoorian ZFani HBagheri E(2024)Predicting users’ future interests on social networks: A reference frameworkInformation Processing & Management10.1016/j.ipm.2024.10376561:5(103765)Online publication date: Sep-2024
https://doi.org/10.1016/j.ipm.2024.103765
Zhu CDu PZhu XZhang WYu YCao YZhang ARangwala H(2022)User-tag Profile Modeling in Recommendation System via Contrast Weighted Tag MaskingProceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3534678.3539102(4630-4638)Online publication date: 14-Aug-2022
https://dl.acm.org/doi/10.1145/3534678.3539102
Show More Cited By

Index Terms

Scalable distributed inference of dynamic user interests for behavioral targeting
1. Computing methodologies
  1. Machine learning
2. Mathematics of computing
  1. Probability and statistics

Recommendations

Cross-representation mediation of user models

Personalization is considered a powerful methodology for improving the effectiveness of information search and decision making. It has led to the dissemination of systems capable of suggesting relevant and personalized information (or items) to the users,...
Inferring user interests in microblogging social networks: a survey

With the growing popularity of microblogging services such as Twitter in recent years, an increasing number of users are using these services in their daily lives. The huge volume of information generated by users raises new opportunities in various ...
Using Navigation to Improve Recommendations in Real-Time
RecSys '16: Proceedings of the 10th ACM Conference on Recommender Systems

Implicit feedback is a key source of information for many recommendation and personalization approaches. However, using it typically requires multiple episodes of interaction and roundtrips to a recommendation engine. This adds latency and neglects the ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining

August 2011

1446 pages

ISBN:9781450308137

DOI:10.1145/2020408

General Chair:
Chid Apte
IBM Research
,
Program Chairs:
Joydeep Ghosh
UT Austin
,
Padhraic Smyth
UC Irvine

Copyright © 2011 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 August 2011

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

KDD '11

Sponsor:

KDD '11: The 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 21 - 24, 2011

California, San Diego, USA

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Sponsor:
sigkdd
sigkdd

The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 3 - 7, 2025

Toronto , ON , Canada

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

121
Total Citations
View Citations
2,334
Total Downloads

Downloads (Last 12 months)19
Downloads (Last 6 weeks)1

Reflects downloads up to 24 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Lu SYang SYao Y(2024)Within-Category Satiation and Cross-Category Spillover in Multiproduct AdvertisingJournal of Marketing10.1177/00222429241274727Online publication date: 20-Nov-2024
https://doi.org/10.1177/00222429241274727
Zarrinkalam FNoughabi HNoorian ZFani HBagheri E(2024)Predicting users’ future interests on social networks: A reference frameworkInformation Processing & Management10.1016/j.ipm.2024.10376561:5(103765)Online publication date: Sep-2024
https://doi.org/10.1016/j.ipm.2024.103765
Zhu CDu PZhu XZhang WYu YCao YZhang ARangwala H(2022)User-tag Profile Modeling in Recommendation System via Contrast Weighted Tag MaskingProceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3534678.3539102(4630-4638)Online publication date: 14-Aug-2022
https://dl.acm.org/doi/10.1145/3534678.3539102
Ding SGao XDong YTong YFu X(2021)Estimating Multiple Socioeconomic Attributes via Home Location—A Case Study in ChinaJournal of Social Computing10.23919/JSC.2021.00032:1(71-88)Online publication date: Mar-2021
https://doi.org/10.23919/JSC.2021.0003
Choi HMela CBalseiro SLeary A(2020)Online Display Advertising MarketsInformation Systems Research10.1287/isre.2019.090231:2(556-575)Online publication date: 1-Jun-2020
https://dl.acm.org/doi/10.1287/isre.2019.0902
Diao MZhang ZSu SGao SCao Hd'Aquin MDietze SHauff CCurry ECudre Mauroux P(2020)UPON: User Profile Transferring across NetworksProceedings of the 29th ACM International Conference on Information & Knowledge Management10.1145/3340531.3411964(265-274)Online publication date: 19-Oct-2020
https://dl.acm.org/doi/10.1145/3340531.3411964
Hosseini SKhodadadi AAlizadeh KArabzadeh AFarajtabar MZha HRabiee H(2020)Recurrent Poisson Factorization for Temporal RecommendationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2018.287979632:1(121-134)Online publication date: 1-Jan-2020
https://doi.org/10.1109/TKDE.2018.2879796
Si HWu HZhou LWan JXiong NZhang J(2020)An Industrial Analysis Technology About Occupational Adaptability and Association Rules in Social NetworksIEEE Transactions on Industrial Informatics10.1109/TII.2019.292657416:3(1698-1707)Online publication date: Mar-2020
https://doi.org/10.1109/TII.2019.2926574
Bonomo MCiaccio GDe Salve ARombo S(2019)Customer recommendation based on profile matching and customized campaigns in on-line social networksProceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining10.1145/3341161.3345621(1155-1159)Online publication date: 27-Aug-2019
https://dl.acm.org/doi/10.1145/3341161.3345621
Li QLiu LXu MWu BXiao Y(2019)GDTM: A Gaussian Dynamic Topic Model for Forwarding Prediction Under Complex MechanismsIEEE Transactions on Computational Social Systems10.1109/TCSS.2019.29002996:2(338-349)Online publication date: Apr-2019
https://doi.org/10.1109/TCSS.2019.2900299
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents