skip to main content
10.1145/3292500.3330728acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Large-scale User Visits Understanding and Forecasting with Deep Spatial-Temporal Tensor Factorization Framework

Published: 25 July 2019 Publication History

Abstract

Understanding and forecasting user visits is of great importance for a variety of tasks, e.g., online advertising, which is one of the most profitable business models for Internet services. Publishers sell advertising spaces in advance with user visit volume and attributes guarantees. There are usually tens of thousands of attribute combinations in an online advertising system. The key problem is how to accurately forecast the number of user visits for each attribute combination. Many traditional work characterizing temporal trends of every single time series are quite inefficient for large-scale time series. Recently, a number of models based on deep learning or matrix factorization have been proposed for high-dimensional time series forecasting. However, most of them neglect correlations among attribute combinations, or are tailored for specific applications, resulting in poor adaptability for different business scenarios.Besides, sophisticated deep learning models usually cause high time and space complexity. There is still a lack of an efficient highly scalable and adaptable solution for accurate high-dimensional time series forecasting. To address this issue, in this work, we conduct a thorough analysis on large-scale user visits data and propose a novel deep spatial-temporal tensor factorization framework, which provides a general design for high-dimensional time series forecasting. We deployed the proposed framework in Tencent online guaranteed delivery advertising system, and extensively evaluated the effectiveness and efficiency of the framework in two different large-scale application scenarios. The results show that our framework outperforms existing methods in prediction accuracy. Meanwhile, it significantly reduces the parameter number and is resistant to incomplete data with up to 20% missing values.

References

[1]
Mohammad Taha Bahadori, Qi Rose Yu, and Yan Liu. 2014. Fast multivariate spatio-temporal analysis via low rank tensor learning. In Advances in neural information processing systems. 3491--3499.
[2]
Preeti Bhargava, Thomas Phan, Jiayu Zhou, and Juhan Lee. {n.d.}. Who, what, when, and where: Multi-dimensional collaborative recommendations using tensor factorization on sparse user-generated data. In Proceedings of WWW .
[3]
George EP Box, Gwilym M Jenkins, Gregory C Reinsel, and Greta M Ljung. 2015. Time series analysis: forecasting and control .John Wiley & Sons.
[4]
Peter J Brockwell and Richard A Davis. 2016. Introduction to time series and forecasting .Springer.
[5]
Yongjie Cai, Hanghang Tong, Wei Fan, Ping Ji, and Qing He. 2015. Facets: Fast comprehensive mining of coevolving high-order time series. In Proceedings of SIGKDD. ACM, 79--88.
[6]
Sakyasingha Dasgupta and Takayuki Osogami. 2017. Nonlinear Dynamic Boltzmann Machines for Time-Series Prediction. In AAAI . 1833--1839.
[7]
Aaron Van Den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew W Senior, and Koray Kavukcuoglu. 2016. WaveNet: A Generative Model for Raw Audio. arXiv: Sound (2016), 125.
[8]
Dua Dheeru and Efi Karra Taniskidou. 2017. UCI Machine Learning Repository. http://archive.ics.uci.edu/ml
[9]
Hongchang Gao, Deguang Kong, Miao Lu, Xiao Bai, and Jian Yang. 2018. Attention Convolutional Neural Network for Advertiser-level Click-through Rate Forecasting. In Proceedings of WWW. ACM, 1855--1864.
[10]
George Evelyn Hutchinson. 1978. An introduction to population ecology. (1978).
[11]
Rob J Hyndman. 2014. TBATS with regressors . https://robjhyndman.com/hyndsight/tbats-with-regressors/ Retrieved September 2017 from
[12]
Qiang Liu, Shu Wu, Liang Wang, and Tieniu Tan. 2016. Predicting the Next Location: A Recurrent Model with Spatial and Temporal Contexts. In AAAI . 194--200.
[13]
Yasuko Matsubara, Yasushi Sakurai, and Christos Faloutsos. 2016. Non-linear mining of competing local activities. In Proceedings of WWW. ACM, 737--747.
[14]
Yao Qin, Dongjin Song, Haifeng Cheng, Wei Cheng, Guofei Jiang, and Garrison Cottrell. 2017. A dual-stage attention-based recurrent neural network for time series prediction. arXiv:1704.02971 (2017). https://arxiv.org/abs/1704.02971
[15]
Doyen Sahoo, Steven CH Hoi, and Bin Li. 2014. Online multiple kernel regression. In Proceedings of SIGKDD. ACM, 293--302.
[16]
T. N Sainath, O Vinyals, A Senior, and H Sak. 2015. Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks. In IEEE International Conference on Acoustics, Speech and Signal Processing . 4580--4584.
[17]
Tsubasa Takahashi, Bryan Hooi, and Christos Faloutsos. 2017. Autocyclone: automatic mining of cyclic online activities with robust tensor factorization. In Proceedings of WWW. ACM, 213--221.
[18]
Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. 2017. Graph attention networks. arXiv:1710.10903, Vol. 1, 2 (2017). https://arxiv.org/abs/1710.10903
[19]
Ruoxi Wang, Bin Fu, Gang Fu, and Mingliang Wang. 2017. Deep & cross network for ad click predictions. In Proceedings of the ADKDD'17. ACM, 12.
[20]
SHI Xingjian, Zhourong Chen, Hao Wang, Dit-Yan Yeung, Wai-Kin Wong, and Wang-chun Woo. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In Advances in neural information processing systems. 802--810.
[21]
Hsiang-Fu Yu, Nikhil Rao, and Inderjit S Dhillon. 2016. Temporal regularized matrix factorization for high-dimensional time series prediction. In Advances in neural information processing systems. 847--855.
[22]
Junbo Zhang, Yu Zheng, and Dekang Qi. 2017b. Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction. In AAAI . 1655--1661.
[23]
Junbo Zhang, Yu Zheng, Dekang Qi, Ruiyuan Li, and Xiuwen Yi. 2016. DNN-based prediction model for spatio-temporal data. In Proceedings of the 24th ACM SIGSPATIAL . ACM, 92.
[24]
Liheng Zhang, Charu Aggarwal, and Guo-Jun Qi. 2017a. Stock Price Prediction via Discovering Multi-Frequency Trading Patterns. In Proceedings of SIGKDD. ACM, 2141--2149.

Cited By

View all
  • (2023)CLOCK: Online Temporal Hierarchical Framework for Multi-scale Multi-granularity Forecasting of User ImpressionProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614810(2544-2553)Online publication date: 21-Oct-2023
  • (2023)End-to-End Inventory Prediction and Contract Allocation for Guaranteed Delivery AdvertisingProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599332(1677-1686)Online publication date: 6-Aug-2023
  • (2022)Multi-Task Multi-Attention Graph Neural Network for Mobile Crowd Sensing Data Reconstruction and Prediction2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)10.1109/CCIS57298.2022.10016351(654-661)Online publication date: 26-Nov-2022
  • Show More Cited By

Index Terms

  1. Large-scale User Visits Understanding and Forecasting with Deep Spatial-Temporal Tensor Factorization Framework

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
      July 2019
      3305 pages
      ISBN:9781450362016
      DOI:10.1145/3292500
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 25 July 2019

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. guaranteed delivery advertising
      2. high-dimensional time series forecasting
      3. user visits forecasting

      Qualifiers

      • Research-article

      Funding Sources

      • National Natural Science Foundation of China
      • National Key R&D Program of China
      • Fundamental Research Funds for the Central Universities

      Conference

      KDD '19
      Sponsor:

      Acceptance Rates

      KDD '19 Paper Acceptance Rate 110 of 1,200 submissions, 9%;
      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Upcoming Conference

      KDD '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)67
      • Downloads (Last 6 weeks)3
      Reflects downloads up to 26 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)CLOCK: Online Temporal Hierarchical Framework for Multi-scale Multi-granularity Forecasting of User ImpressionProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614810(2544-2553)Online publication date: 21-Oct-2023
      • (2023)End-to-End Inventory Prediction and Contract Allocation for Guaranteed Delivery AdvertisingProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599332(1677-1686)Online publication date: 6-Aug-2023
      • (2022)Multi-Task Multi-Attention Graph Neural Network for Mobile Crowd Sensing Data Reconstruction and Prediction2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)10.1109/CCIS57298.2022.10016351(654-661)Online publication date: 26-Nov-2022
      • (2022)Ultra-short-term wind speed and wind power forecast via selective Hankelization and low-rank tensor learning-based predictorInternational Journal of Electrical Power & Energy Systems10.1016/j.ijepes.2022.107994140(107994)Online publication date: Sep-2022
      • (2021)Tri-Partition Alphabet-Based State Prediction for Multivariate Time-SeriesApplied Sciences10.3390/app11231129411:23(11294)Online publication date: 29-Nov-2021
      • (2021)Why Not Match: On Explanations of Event Pattern QueriesProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452818(1705-1717)Online publication date: 9-Jun-2021
      • (2021)RPM: Local Differential Privacy based Inventory Forecasting2021 7th International Conference on Big Data Computing and Communications (BigCom)10.1109/BigCom53800.2021.00015(106-113)Online publication date: Aug-2021
      • (2021)Efficient Ad-level Impression Forecasting based on Monotonicity and Sampling2021 7th International Conference on Big Data Computing and Communications (BigCom)10.1109/BigCom53800.2021.00012(180-187)Online publication date: Aug-2021

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media