Approximation of stationary processes by hidden Markov models

Finesso, Lorenzo; Grassi, Angela; Spreij, Peter

doi:10.1007/s00498-010-0050-7

Approximation of stationary processes by hidden Markov models

Original Article
Open access
Published: 11 July 2010

Volume 22, pages 1–22, (2010)
Cite this article

Download PDF

You have full access to this open access article

Mathematics of Control, Signals, and Systems Aims and scope Submit manuscript

Approximation of stationary processes by hidden Markov models

Download PDF

Lorenzo Finesso¹,
Angela Grassi¹ &
Peter Spreij²

915 Accesses
13 Citations
Explore all metrics

Abstract

Stochastic realization is still an open problem for the class of hidden Markov models (HMM): given the law Q of an HMM find a finite parametric description of it. Fifty years after the introduction of HMMs, no computationally effective realization algorithm has been proposed. In this paper we direct our attention to an approximate version of the stochastic realization problem for HMMs. We aim at the realization of an HMM of assigned complexity (number of states of the underlying Markov chain) which best approximates, in Kullback Leibler divergence rate, a given stationary law Q. In the special case of Q being the law of an HMM this corresponds to solving the approximate realization problem for HMMs. In general there is no closed form expression of the Kullback Leibler divergence rate, therefore we replace it, as approximation criterion, with the informational divergence between the Hankel matrices of the processes. This not only has the advantage of being easy to compute, while providing a good approximation of the divergence rate, but also makes the problem amenable to the use of nonnegative matrix factorization (NMF) techniques. We propose a three step algorithm, based on the NMF, which realizes an optimal HMM. The viability of the algorithm as a practical tool is tested on a few examples of HMM order reduction.

Article PDF

Learning Hidden Markov Models Using Probabilistic Matrix Factorization

The continuous-time hidden Markov model based on discretization. Properties of estimators and applications

Article Open access 23 June 2023

The Generalized Entropy Ergodic Theorem with Two Types of Convergence for M-th-Order Nonhomogeneous Hidden Markov Models

Article 17 December 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Anderson BDO (1999) The realization problem for hidden Markov models. Math Control Signals Syst 12: 80–120
Article MATH Google Scholar
Baum LE, Petrie T (1966) Statistical inference for probabilistic functions of finite Markov chains. Ann Math Stat 37: 1554–1563
Article MATH MathSciNet Google Scholar
Blackwell D (1957) The entropy of functions of finite-state Markov chains Trans. of the first Prague conference on information theory, statistical decision functions, Random Processes, pp 13–20
Carlyle JW (1969) Stochastic finite-state system theory. In: Zadeh L, Polak L (eds) Systems theory, Chapter 10. McGraw-Hill, New York
Google Scholar
Csiszár I (1975) I-divergence geometry of probability distributions and minimization problems. Ann Probab 3: 146–158
Article MATH Google Scholar
Csiszár I, Tusnády G (1984) Information geometry and alternating minimization procedures. Stat Decis supplement issue 1: 205–237
Google Scholar
Finesso L (1990) Consistent estimation of the order for Markov and hidden Markov chains, PhD Thesis Report 91-1, Institute of Systems Research, University of Maryland College Park
Finesso L, Spreij PJC (2002) Approximate realization of finite hidden Markov chains. In: Proceedings of the 2002 IEEE information theory workshop, Bangalore, India, pp 90–93
Finesso L, Spreij PJC (2006) Nonnegative matrix factorization and I-divergence alternating minimization. Linear Algebra Appl 416: 270–287
Article MATH MathSciNet Google Scholar
Gray RM (1990) Entropy and information theory. Springer, New York
MATH Google Scholar
Han G, Marcus B (2006) Analyticity of entropy rate of hidden Markov chains. IEEE Trans Inf Theory 52(12): 5251–5266
Article MathSciNet Google Scholar
Heller A (1965) On stochastic processes derived from Markov chains. Ann Math Stat 36: 1286–1291
Article MATH MathSciNet Google Scholar
Juang BH, Rabiner LR (1985) A probabilistic distance measure for hidden Markov models. AT&T Tech J 64(20): 391–408
MathSciNet Google Scholar
Karan M, Anderson BDO, Williamson RC (1993) A note on the calculation of a probabilistic distance between hidden Markov models. In: Proc. ISPACS 93, 2nd international workshop on intelligent signal Proc. Comm. Systems, Sendai, 93–98
Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401: 788–791
Article Google Scholar
LeGland F, Mevel L (2000) Exponential forgetting and geometric ergodicity in HMMs. Math Control Signals Syst 13(1): 63–93
Article MathSciNet Google Scholar
Leroux BG (1992) Maximum-likelihood estimation for hidden Markov models. Stoch Process Appl 40: 127–143
Article MATH MathSciNet Google Scholar
Mevel L, Finesso L (2004) Asymptotical statistics of misspecified hidden Markov models. IEEE Trans Autom Control 49(7): 1123–1132
Article MathSciNet Google Scholar
Norris JR (1998) Markov chains. Cambridge University Press, Cambridge
MATH Google Scholar
Picci G (1978) On the internal structure of finite state stochastic processes. In: Mohler RR, Ruberti A (eds) Recent developments in variable structure systems, economics and biology, Lecture notes in Economics and Mathematical Systems, vol 162, Springer, Berlin, pp 288–304
Picci G, van Schuppen JH (1984) On the weak finite stochastic realization problem. In: Korezlioglu H, Mazziotto G, Szpirglas J (eds) Filtering and control of random processes, Lecture Notes in Control and Information Sciences, vol 61, Springer, New York, pp 237–242
Rabiner LR, Juang BH (1986) An introduction to hidden Markov models. IEEE ASSP Mag 3(1): 4–16
Article Google Scholar
Vanluyten B, Willems JC, De Moor B (2006) Matrix factorization and stochastic state representations. In: Proceedings of the 45th IEEE conference on decision and control, San Diego, pp 4188–4193
Vidyasagar M (2005) The realization problem for hidden Markov models: the complete realization problem. In: Proceedings of the 44th conference on decision and control, Seville, pp 6632–6637
Vidyasagar M (2007) Stochastic modelling over a finite alphabet: approximation using the Kullback-Leibler divergence rate. In: Proceedings of the European control conference 2007, Kos, Paper ThA06.1
Wu CFJ (1983) On the convergence properties of the EM algorithm. Ann Stat 11: 95–103
Article MATH Google Scholar
Software code in R for the numerical implementation of the three step algorithm, http://www.isib.cnr.it/~grassi/HMMs

Download references

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Author information

Authors and Affiliations

ISIB-CNR, Corso Stati Uniti 4, 35127, Padova, Italy
Lorenzo Finesso & Angela Grassi
Korteweg-de Vries Institute for Mathematics, Universiteit van Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
Peter Spreij

Authors

Lorenzo Finesso
View author publications
You can also search for this author in PubMed Google Scholar
Angela Grassi
View author publications
You can also search for this author in PubMed Google Scholar
Peter Spreij
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peter Spreij.

Additional information

A. Grassi was supported by a grant of Regione Veneto (Azione Biotech 3—DGR 2017/03-07-07) to CNR-ISIB.

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Finesso, L., Grassi, A. & Spreij, P. Approximation of stationary processes by hidden Markov models. Math. Control Signals Syst. 22, 1–22 (2010). https://doi.org/10.1007/s00498-010-0050-7

Download citation

Received: 24 June 2006
Accepted: 26 June 2010
Published: 11 July 2010
Issue Date: September 2010
DOI: https://doi.org/10.1007/s00498-010-0050-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Approximation of stationary processes by hidden Markov models

Abstract

Article PDF

Similar content being viewed by others

Learning Hidden Markov Models Using Probabilistic Matrix Factorization

The continuous-time hidden Markov model based on discretization. Properties of estimators and applications

The Generalized Entropy Ergodic Theorem with Two Types of Convergence for M-th-Order Nonhomogeneous Hidden Markov Models

References

Open Access

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Approximation of stationary processes by hidden Markov models

Abstract

Article PDF

Similar content being viewed by others

Learning Hidden Markov Models Using Probabilistic Matrix Factorization

The continuous-time hidden Markov model based on discretization. Properties of estimators and applications

The Generalized Entropy Ergodic Theorem with Two Types of Convergence for M-th-Order Nonhomogeneous Hidden Markov Models

References

Open Access

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation