skip to main content
research-article
Free access

Single and multiple change-point detection with differential privacy

Published: 01 January 2021 Publication History

Abstract

The change-point detection problem seeks to identify distributional changes at an unknown change-point k* in a stream of data. This problem appears in many important practical settings involving personal data, including biosurveillance, fault detection, finance, signal detection, and security systems. The field of differential privacy offers data analysis tools that provide powerful worst-case privacy guarantees. We study the statistical problem of change-point detection through the lens of differential privacy. We give private algorithms for both online and offline change-point detection, analyze these algorithms theoretically, and provide empirical validation of our results.

References

[1]
J. Bai and P. Perron. Computation and analysis of multiple structural change models. Journal of Applied Econometrics, 18(1):1-22, 2003.
[2]
R. Bhaskar, S. Laxman, A. Smith, and A. Thakurta. Discovering frequent patterns in sensitive data. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 503-512, 2010.
[3]
C. L. Canonne, G. Kamath, A. McMillan, A. Smith, and J. Ullman. The structure of optimal private tests for simple hypotheses. In Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, STOC '19, pages 310-321, 2019.
[4]
E. Carlstein. Nonparametric change-point estimation. The Annals of Statistics, 16(1): 188-197, 1988.
[5]
H. P. Chan. Optimal sequential detection in multi-stream data. The Annals of Statistics, 45 (6):2736-2763, 2017.
[6]
J. Chen and A. K. Gupta. Parametric statistical change point analysis: with applications to genetics, medicine, and finance. Springer Science & Business Media, 2011.
[7]
M. F. D'Angelo, R. M. Palhares, R. H. Takahashi, and R. H. Loschi. Fuzzy/bayesian change point detection approach to incipient fault detection. IET control theory & applications, 5 (4):539-551, 2011.
[8]
C. Dwork and A. Roth. The algorithmic foundations of differential privacy. Foundations and Trends in Theoretical Computer Science, 9(3-4):211-407, 2014.
[9]
C. Dwork, F. McSherry, K. Nissim, and A. Smith. Calibrating noise to sensitivity in private data analysis. In Proceedings of the 3rd Conference on Theory of Cryptography, TCC '06, pages 265-284, 2006.
[10]
C. Dwork, M. Naor, T. Pitassi, and G. N. Rothblum. Differential privacy under continual observation. In Proceedings of the 42nd ACM Symposium on Theory of Computing, STOC '10, pages 715-724, 2010.
[11]
D. V. Hinkley. Inference about the change-point in a sequence of random variables. Biometrika, 57(1):1-17, 1970.
[12]
M. Kulldorff. Prospective time periodic geographical disease surveillance using a scan statistic. Journal of the Royal Statistical Society, Series A, 164(1):61-72, 2001.
[13]
L. Lai, Y. Fan, and H. V. Poor. Quickest detection in cognitive radio: A sequential change detection framework. In IEEE GLOBECOM 2008-2008 IEEE Global Telecommunications Conference, pages 1-5. IEEE, 2008.
[14]
T. L. Lai. Sequential changepoint detection in quality control and dynamical systems. Journal of the Royal Statistical Society, Series B, 57(4):613-658, 1995.
[15]
T. L. Lai. Sequential analysis: some classical problems and new challenges. Statistica Sinica, 11(2):303-408, 2001.
[16]
G. Lorden. Procedures for reacting to a change in distribution. The Annals of Mathematical Statistics, 42(6):1897-1908, 1971.
[17]
R. Lund and J. Reeves. Detection of undocumented changepoints: A revision of the two-phase regression model. Journal of Climate, 15(17):2547-2554, 2002.
[18]
F. McSherry. Privacy integrated queries: an extensible platform for privacy-preserving data analysis. In Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, SIGMOD '09, pages 19-30, 2009.
[19]
Y. Mei. Sequential change-point detection when unknown parameters are present in the pre-change distribution. The Annals of Statistics, 34(1):92-122, 2006.
[20]
Y. Mei. Is average run length to false alarm always an informative criterion? Sequential Analysis, 27(4):354-419, 2008.
[21]
Y. Mei. Efficient scalable schemes for monitoring a large number of data streams. Biometrika, 97(2):419-433, 2010.
[22]
G. V. Moustakides. Optimal stopping times for detecting changes in distributions. The Annals of Statistics, 14(4):1379-1387, 1986.
[23]
E. S. Page. Continuous inspection schemes. Biometrika, 41(1/2):100-115, 1954.
[24]
M. Pollak. Optimal detection of a change in distribution. The Annals of Statistics, 13(1): 206-227, 1985.
[25]
M. Pollak. Average run lengths of an optimal method of detecting a change in distribution. The Annals of Statistics, 15(2):749-779, 1987.
[26]
S. W. Roberts. A comparison of some control chart procedures. Technometrics, 8(3):411-430, 1966.
[27]
W. A. Shewhart. Economic Control of Quality of Manufactured Product. D. Van Norstrand Company, Inc., 1931.
[28]
A. N. Shiryaev. On optimum methods in quickest detection problems. Theory of Probability & Its Applications, 8(1):22-46, 1963.
[29]
G. Shmueli and H. Burkom. Statistical challenges facing early outbreak detection in biosurveillance. Technometrics, 52(1):39-51, 2010.
[30]
A. G. Tartakovsky, B. L. Rozovskii, R. B. Blažek, and H. Kim. Detection of intrusions in information systems by sequential change-point methods. Statistical methodology, 3(3): 252-293, 2006.
[31]
A. G. Tartakovsky, A. S. Polunchenko, and G. Sokolov. Efficient computer network anomaly detection by changepoint detection methods. IEEE Journal of Selected Topics in Signal Processing, 7(1):4-11, 2012.
[32]
A. W. Van Der Vaart and J. A. Wellner. Weak convergence. Springer, 1996.
[33]
N. Zhang and D. O. Siegmund. Model selection for high-dimensional, multisequence change-point problems. Statistica Sinica, 22(4):1507-1538, 2012.

Cited By

View all

Index Terms

  1. Single and multiple change-point detection with differential privacy
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image The Journal of Machine Learning Research
        The Journal of Machine Learning Research  Volume 22, Issue 1
        January 2021
        13310 pages
        ISSN:1532-4435
        EISSN:1533-7928
        Issue’s Table of Contents
        CC-BY 4.0

        Publisher

        JMLR.org

        Publication History

        Published: 01 January 2021
        Accepted: 01 January 2021
        Revised: 01 October 2020
        Received: 01 September 2019
        Published in JMLR Volume 22, Issue 1

        Author Tags

        1. differential privacy
        2. change-point detection
        3. learning theory
        4. online learning
        5. adaptive data analysis

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 104
          Total Downloads
        • Downloads (Last 12 months)69
        • Downloads (Last 6 weeks)15
        Reflects downloads up to 25 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Login options

        Full Access

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media