Skip to main content

Showing 1–50 of 54 results for author: Lindsten, F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.09845  [pdf, ps, other

    stat.ML cs.LG

    Towards understanding epoch-wise double descent in two-layer linear neural networks

    Authors: Amanda Olmin, Fredrik Lindsten

    Abstract: Epoch-wise double descent is the phenomenon where generalisation performance improves beyond the point of overfitting, resulting in a generalisation curve exhibiting two descents under the course of learning. Understanding the mechanisms driving this behaviour is crucial not only for understanding the generalisation behaviour of machine learning models in general, but also for employing convention… ▽ More

    Submitted 12 September, 2024; v1 submitted 13 July, 2024; originally announced July 2024.

  2. arXiv:2406.04759  [pdf, other

    cs.LG stat.ML

    Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks

    Authors: Joel Oskarsson, Tomas Landelius, Marc Peter Deisenroth, Fredrik Lindsten

    Abstract: In recent years, machine learning has established itself as a powerful tool for high-resolution weather forecasting. While most current machine learning models focus on deterministic forecasts, accurately capturing the uncertainty in the chaotic weather system calls for probabilistic modeling. We propose a probabilistic weather forecasting model called Graph-EFM, combining a flexible latent-variab… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 67 pages, 29 figures. Code is available at https://github.com/mllam/neural-lam/tree/prob_model_global (global forecasting) and https://github.com/mllam/neural-lam/tree/prob_model_lam (limited area modeling)

  3. arXiv:2402.16688  [pdf, ps, other

    stat.ML cs.LG

    On the connection between Noise-Contrastive Estimation and Contrastive Divergence

    Authors: Amanda Olmin, Jakob Lindqvist, Lennart Svensson, Fredrik Lindsten

    Abstract: Noise-contrastive estimation (NCE) is a popular method for estimating unnormalised probabilistic models, such as energy-based models, which are effective for modelling complex data distributions. Unlike classical maximum likelihood (ML) estimation that relies on importance sampling (resulting in ML-IS) or MCMC (resulting in contrastive divergence, CD), NCE uses a proxy criterion to avoid the need… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to AISTATS 2024

  4. arXiv:2310.15817  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Discriminator Guidance for Autoregressive Diffusion Models

    Authors: Filip Ekström Kelvinius, Fredrik Lindsten

    Abstract: We introduce discriminator guidance in the setting of Autoregressive Diffusion Models. The use of a discriminator to guide a diffusion process has previously been used for continuous diffusion models, and in this work we derive ways of using a discriminator together with a pretrained generative model in the discrete case. First, we show that using an optimal discriminator will correct the pretrain… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  5. arXiv:2309.17370  [pdf, other

    cs.LG stat.ML

    Graph-based Neural Weather Prediction for Limited Area Modeling

    Authors: Joel Oskarsson, Tomas Landelius, Fredrik Lindsten

    Abstract: The rise of accurate machine learning methods for weather forecasting is creating radical new possibilities for modeling the atmosphere. In the time of climate change, having access to high-resolution forecasts from models like these is also becoming increasingly vital. While most existing Neural Weather Prediction (NeurWP) methods focus on global forecasting, an important question is how these te… ▽ More

    Submitted 14 November, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: 38 pages, 27 figures. Accepted to the Tackling Climate Change with Machine Learning workshop at NeurIPS 2023. Code available at: https://github.com/joeloskarsson/neural-lam

  6. arXiv:2302.08415  [pdf, other

    stat.ML cs.LG cs.SI

    Temporal Graph Neural Networks for Irregular Data

    Authors: Joel Oskarsson, Per Sidén, Fredrik Lindsten

    Abstract: This paper proposes a temporal graph neural network model for forecasting of graph-structured irregularly observed time series. Our TGNN4I model is designed to handle both irregular time steps and partial observations of the graph. This is achieved by introducing a time-continuous latent state in each node, following a linear Ordinary Differential Equation (ODE) defined by the output of a Gated Re… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 17 pages, 4 figures. Accepted to AISTATS 2023. Code available at https://github.com/joeloskarsson/tgnn4i

  7. arXiv:2210.14684  [pdf, other

    stat.CO stat.AP stat.ME

    Nonlinear System Identification: Learning while respecting physical models using a sequential Monte Carlo method

    Authors: Anna Wigren, Johan Wågberg, Fredrik Lindsten, Adrian Wills, Thomas B. Schön

    Abstract: Identification of nonlinear systems is a challenging problem. Physical knowledge of the system can be used in the identification process to significantly improve the predictive performance by restricting the space of possible mappings from the input to the output. Typically, the physical models contain unknown parameters that must be learned from data. Classical methods often restrict the possible… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 52 pages, 13 figures

    Journal ref: IEEE Control Systems Magazine, Volume 42, Issue 1, pages 75 - 102, February 2022

  8. arXiv:2210.13355  [pdf, other

    stat.ML cs.LG

    Calibration tests beyond classification

    Authors: David Widmann, Fredrik Lindsten, Dave Zachariah

    Abstract: Most supervised machine learning tasks are subject to irreducible prediction errors. Probabilistic predictive models address this limitation by providing probability distributions that represent a belief over plausible targets, rather than point estimates. Such models can be a valuable tool in decision-making under uncertainty, provided that the model output is meaningful and interpretable. Calibr… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 37 pages, 12 figures. Fixes some comments about the kernel choice in the original paper: https://github.com/devmotion/Calibration_ICLR2021/pull/6

    Journal ref: International Conference on Learning Representations (2021)

  9. arXiv:2210.07992  [pdf, other

    stat.ML cs.LG

    A Variational Perspective on Generative Flow Networks

    Authors: Heiko Zimmermann, Fredrik Lindsten, Jan-Willem van de Meent, Christian A. Naesseth

    Abstract: Generative flow networks (GFNs) are a class of models for sequential sampling of composite objects, which approximate a target distribution that is defined in terms of an energy function or a reward. GFNs are typically trained using a flow matching or trajectory balance objective, which matches forward and backward transition models over trajectories. In this work, we define variational objectives… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  10. arXiv:2210.07379  [pdf, other

    stat.ME stat.AP stat.CO stat.ML

    Marginalized particle Gibbs for multiple state-space models coupled through shared parameters

    Authors: Anna Wigren, Fredrik Lindsten

    Abstract: We consider Bayesian inference from multiple time series described by a common state-space model (SSM) structure, but where different subsets of parameters are shared between different submodels. An important example is disease-dynamics, where parameters can be either disease or location specific. Parameter inference in these models can be improved by systematically aggregating information from th… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 26 pages, 10 figures (+ Supplementary material of 17 pages, 13 figures) Submitted to Journal of computational and graphical statistics

  11. arXiv:2206.05032  [pdf, other

    stat.ML cs.LG cs.SI stat.CO

    Scalable Deep Gaussian Markov Random Fields for General Graphs

    Authors: Joel Oskarsson, Per Sidén, Fredrik Lindsten

    Abstract: Machine learning methods on graphs have proven useful in many applications due to their ability to handle generally structured data. The framework of Gaussian Markov Random Fields (GMRFs) provides a principled way to define Gaussian models on graphs by utilizing their sparsity structure. We propose a flexible GMRF model for general graphs built on the multi-layer structure of Deep GMRFs, originall… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 22 pages, 10 figures. Accepted at ICML 2022. Code available at https://github.com/joeloskarsson/graph-dgmrf

  12. Active Learning with Weak Supervision for Gaussian Processes

    Authors: Amanda Olmin, Jakob Lindqvist, Lennart Svensson, Fredrik Lindsten

    Abstract: Annotating data for supervised learning can be costly. When the annotation budget is limited, active learning can be used to select and annotate those observations that are likely to give the most gain in model performance. We propose an active learning algorithm that, in addition to selecting which observation to annotate, selects the precision of the annotation that is acquired. Assuming that an… ▽ More

    Submitted 16 August, 2024; v1 submitted 18 April, 2022; originally announced April 2022.

    Comments: This version of the contribution has been accepted for publication, after peer review but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/978-981-99-1642-9_17. Use of this Accepted Version is subject to the publisher's Accepted Manuscript terms of use

    Journal ref: In: ICONIP. Communications in Computer and Information Science, vol 1792. Springer, Singapore (2023)

  13. arXiv:2110.03321  [pdf, other

    stat.ML cs.LG

    Robustness and Reliability When Training With Noisy Labels

    Authors: Amanda Olmin, Fredrik Lindsten

    Abstract: Labelling of data for supervised learning can be costly and time-consuming and the risk of incorporating label noise in large data sets is imminent. When training a flexible discriminative model using a strictly proper loss, such noise will inevitably shift the solution towards the conditional distribution over noisy labels. Nevertheless, while deep neural networks have proven capable of fitting r… ▽ More

    Submitted 12 May, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: Accepted at AISTATS 2022

  14. arXiv:2003.10374  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Markovian Score Climbing: Variational Inference with KL(p||q)

    Authors: Christian A. Naesseth, Fredrik Lindsten, David Blei

    Abstract: Modern variational inference (VI) uses stochastic gradients to avoid intractable expectations, enabling large-scale probabilistic inference in complex models. VI posits a family of approximating distributions q and then finds the member of that family that is closest to the exact posterior p. Traditionally, VI algorithms minimize the "exclusive Kullback-Leibler (KL)" KL(q || p), often for computat… ▽ More

    Submitted 22 February, 2021; v1 submitted 23 March, 2020; originally announced March 2020.

  15. A general framework for ensemble distribution distillation

    Authors: Jakob Lindqvist, Amanda Olmin, Fredrik Lindsten, Lennart Svensson

    Abstract: Ensembles of neural networks have been shown to give better performance than single networks, both in terms of predictions and uncertainty estimation. Additionally, ensembles allow the uncertainty to be decomposed into aleatoric (data) and epistemic (model) components, giving a more complete picture of the predictive uncertainty. Ensemble distillation is the process of compressing an ensemble into… ▽ More

    Submitted 8 January, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

    Journal ref: 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing (MLSP), Espoo, Finland, 2020, pp. 1-6

  16. arXiv:2002.07467  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Deep Gaussian Markov Random Fields

    Authors: Per Sidén, Fredrik Lindsten

    Abstract: Gaussian Markov random fields (GMRFs) are probabilistic graphical models widely used in spatial statistics and related fields to model dependencies over spatial structures. We establish a formal connection between GMRFs and convolutional neural networks (CNNs). Common GMRFs are special cases of a generative model where the inverse mapping from data to latent variables is given by a 1-layer linear… ▽ More

    Submitted 10 August, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

  17. arXiv:1910.14145  [pdf, ps, other

    stat.CO stat.ML

    Parameter elimination in particle Gibbs sampling

    Authors: Anna Wigren, Riccardo Sven Risuleo, Lawrence Murray, Fredrik Lindsten

    Abstract: Bayesian inference in state-space models is challenging due to high-dimensional state trajectories. A viable approach is particle Markov chain Monte Carlo, combining MCMC and sequential Monte Carlo to form "exact approximations" to otherwise intractable MCMC methods. The performance of the approximation is limited to that of the exact method. We focus on particle Gibbs and particle Gibbs with ance… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Journal ref: Advances in Neural Information Processing Systems 32 (NeurIPS 2019)

  18. arXiv:1910.11385  [pdf, other

    stat.ML cs.LG

    Calibration tests in multi-class classification: A unifying framework

    Authors: David Widmann, Fredrik Lindsten, Dave Zachariah

    Abstract: In safety-critical applications a probabilistic model is usually required to be calibrated, i.e., to capture the uncertainty of its predictions accurately. In multi-class classification, calibration of the most confident predictions only is often not sufficient. We propose and study calibration measures for multi-class classification that generalize existing measures such as the expected calibrati… ▽ More

    Submitted 16 March, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: Corrected version that 1) fixes the ECE evaluation with bins of uniform size (does not affect our conclusions and discussions) and 2) contains additional experimental results in the supplementary material

    Journal ref: Advances in Neural Information Processing Systems 32 (NeurIPS 2019)

  19. arXiv:1910.09527  [pdf, ps, other

    stat.CO stat.ML

    Particle filter with rejection control and unbiased estimator of the marginal likelihood

    Authors: Jan Kudlicka, Lawrence M. Murray, Thomas B. Schön, Fredrik Lindsten

    Abstract: We consider the combined use of resampling and partial rejection control in sequential Monte Carlo methods, also known as particle filters. While the variance reducing properties of rejection control are known, there has not been (to the best of our knowledge) any work on unbiased estimation of the marginal likelihood (also known as the model evidence or the normalizing constant) in this type of p… ▽ More

    Submitted 4 March, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

  20. arXiv:1903.04797  [pdf, other

    stat.ML cs.LG stat.CO

    Elements of Sequential Monte Carlo

    Authors: Christian A. Naesseth, Fredrik Lindsten, Thomas B. Schön

    Abstract: A core problem in statistics and probabilistic machine learning is to compute probability distributions and expectations. This is the fundamental problem of Bayesian statistics and machine learning, which frames all inference as expectations with respect to the posterior distribution. The key challenge is to approximate these intractable expectations. In this tutorial, we review sequential Monte C… ▽ More

    Submitted 4 March, 2022; v1 submitted 12 March, 2019; originally announced March 2019.

    Comments: Foundations and Trends in Machine Learning

  21. arXiv:1902.06977  [pdf

    cs.LG stat.ML

    Evaluating model calibration in classification

    Authors: Juozas Vaicenavicius, David Widmann, Carl Andersson, Fredrik Lindsten, Jacob Roll, Thomas B. Schön

    Abstract: Probabilistic classifiers output a probability distribution on target classes rather than just a class prediction. Besides providing a clear separation of prediction and decision making, the main advantage of probabilistic models is their ability to represent uncertainty about predictions. In safety-critical applications, it is pivotal for a model to possess an adequate sense of uncertainty, which… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

  22. arXiv:1902.01182  [pdf, other

    stat.ML cs.AI cs.LG

    Constructing the Matrix Multilayer Perceptron and its Application to the VAE

    Authors: Jalil Taghia, Maria Bånkestad, Fredrik Lindsten, Thomas B. Schön

    Abstract: Like most learning algorithms, the multilayer perceptrons (MLP) is designed to learn a vector of parameters from data. However, in certain scenarios we are interested in learning structured parameters (predictions) in the form of symmetric positive definite matrices. Here, we introduce a variant of the MLP, referred to as the matrix MLP, that is specialized at learning symmetric positive definite… ▽ More

    Submitted 4 February, 2019; originally announced February 2019.

  23. arXiv:1901.02374  [pdf, other

    stat.ML cs.LG

    Graphical model inference: Sequential Monte Carlo meets deterministic approximations

    Authors: Fredrik Lindsten, Jouni Helske, Matti Vihola

    Abstract: Approximate inference in probabilistic graphical models (PGMs) can be grouped into deterministic methods and Monte-Carlo-based methods. The former can often provide accurate and rapid inferences, but are typically associated with biases that are hard to quantify. The latter enjoy asymptotic consistency, but can suffer from high computational costs. In this paper we present a way of bridging the ga… ▽ More

    Submitted 8 January, 2019; originally announced January 2019.

    Journal ref: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, Canada

  24. arXiv:1806.09548  [pdf, other

    stat.CO cs.CE eess.SP stat.ML

    Learning dynamical systems with particle stochastic approximation EM

    Authors: Andreas Lindholm, Fredrik Lindsten

    Abstract: We present the particle stochastic approximation EM (PSAEM) algorithm for learning of dynamical systems. The method builds on the EM algorithm, an iterative procedure for maximum likelihood inference in latent variable models. By combining stochastic approximation EM and particle Gibbs with ancestor sampling (PGAS), PSAEM obtains superior computational performance and convergence properties compar… ▽ More

    Submitted 10 December, 2019; v1 submitted 25 June, 2018; originally announced June 2018.

  25. Improving the particle filter in high dimensions using conjugate artificial process noise

    Authors: Anna Wigren, Lawrence Murray, Fredrik Lindsten

    Abstract: The particle filter is one of the most successful methods for state inference and identification of general non-linear and non-Gaussian models. However, standard particle filters suffer from degeneracy of the particle weights, in particular for high-dimensional problems. We propose a method for improving the performance of the particle filter for certain challenging state space models, with implic… ▽ More

    Submitted 22 November, 2018; v1 submitted 22 January, 2018; originally announced January 2018.

  26. arXiv:1711.10765  [pdf, other

    stat.CO eess.SY

    Learning nonlinear state-space models using smooth particle-filter-based likelihood approximations

    Authors: Andreas Svensson, Fredrik Lindsten, Thomas B. Schön

    Abstract: When classical particle filtering algorithms are used for maximum likelihood parameter estimation in nonlinear state-space models, a key challenge is that estimates of the likelihood function and its derivatives are inherently noisy. The key idea in this paper is to run a particle filter based on a current parameter estimate, but then use the output from this particle filter to re-evaluate the lik… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

  27. arXiv:1708.05239  [pdf, other

    stat.ME stat.CO stat.ML

    Pseudo-extended Markov chain Monte Carlo

    Authors: Christopher Nemeth, Fredrik Lindsten, Maurizio Filippone, James Hensman

    Abstract: Sampling from posterior distributions using Markov chain Monte Carlo (MCMC) methods can require an exhaustive number of iterations, particularly when the posterior is multi-modal as the MCMC sampler can become trapped in a local mode for a large number of iterations. In this paper, we introduce the pseudo-extended MCMC method as a simple approach for improving the mixing of the MCMC sampler for mu… ▽ More

    Submitted 29 October, 2019; v1 submitted 17 August, 2017; originally announced August 2017.

    Comments: Advances in Neural Information Processing Systems 2019

  28. arXiv:1703.02419  [pdf, ps, other

    stat.CO cs.LG eess.SY

    Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo

    Authors: Thomas B. Schön, Andreas Svensson, Lawrence Murray, Fredrik Lindsten

    Abstract: Probabilistic modeling provides the capability to represent and manipulate uncertainty in data, models, predictions and decisions. We are concerned with the problem of learning probabilistic models of dynamical systems from measured data. Specifically, we consider learning of probabilistic nonlinear state-space models. There is no closed-form solution available for this problem, implying that we a… ▽ More

    Submitted 15 December, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: Thomas B. Schön, Andreas Svensson, Lawrence Murray and Fredrik Lindsten, 2018. Probabilistic learning of nonlinear dynamical systems using sequential Monte Carlo. In Mechanical Systems and Signal Processing, Volume 104, pp. 866-883

  29. Learning of state-space models with highly informative observations: a tempered Sequential Monte Carlo solution

    Authors: Andreas Svensson, Thomas B. Schön, Fredrik Lindsten

    Abstract: Probabilistic (or Bayesian) modeling and learning offers interesting possibilities for systematic representation of uncertainty using probability theory. However, probabilistic learning often leads to computationally challenging problems. Some problems of this type that were previously intractable can now be solved on standard personal computers thanks to recent advances in Monte Carlo methods. In… ▽ More

    Submitted 13 December, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

    Journal ref: Mechanical Systems and Signal Processing, Volume 104 (May 2018), Pages 915-928

  30. arXiv:1701.02002  [pdf, other

    stat.ME stat.CO

    Smoothing with Couplings of Conditional Particle Filters

    Authors: Pierre E. Jacob, Fredrik Lindsten, Thomas B. Schön

    Abstract: In state space models, smoothing refers to the task of estimating a latent stochastic process given noisy measurements related to the process. We propose an unbiased estimator of smoothing expectations. The lack-of-bias property has methodological benefits: independent estimators can be generated in parallel, and confidence intervals can be constructed from the central limit theorem to quantify th… ▽ More

    Submitted 5 September, 2018; v1 submitted 8 January, 2017; originally announced January 2017.

    Comments: This document is a self-contained and direct description of the smoothing method introduced in Coupling of Particle Filters (arXiv:1606.01156). Code is available at github.com/pierrejacob/CoupledCPF. Compared to the previous version, a bug was fixed in the code, and the numerical results were updated

  31. arXiv:1612.09162  [pdf, other

    stat.CO stat.ML

    High-dimensional Filtering using Nested Sequential Monte Carlo

    Authors: Christian A. Naesseth, Fredrik Lindsten, Thomas B. Schön

    Abstract: Sequential Monte Carlo (SMC) methods comprise one of the most successful approaches to approximate Bayesian filtering. However, SMC without good proposal distributions struggle in high dimensions. We propose nested sequential Monte Carlo (NSMC), a methodology that generalises the SMC framework by requiring only approximate, properly weighted, samples from the SMC proposal distribution, while still… ▽ More

    Submitted 29 December, 2016; originally announced December 2016.

  32. arXiv:1607.02516  [pdf, other

    stat.ME stat.ML

    Pseudo-Marginal Hamiltonian Monte Carlo

    Authors: Johan Alenlöv, Arnaud Doucet, Fredrik Lindsten

    Abstract: Bayesian inference in the presence of an intractable likelihood function is computationally challenging. When following a Markov chain Monte Carlo (MCMC) approach to approximate the posterior distribution in this context, one typically either uses MCMC schemes which target the joint posterior of the parameters and some auxiliary latent variables, or pseudo-marginal Metropolis--Hastings (MH) scheme… ▽ More

    Submitted 2 October, 2019; v1 submitted 8 July, 2016; originally announced July 2016.

  33. arXiv:1606.01156  [pdf, other

    stat.ME stat.CO

    Coupling of Particle Filters

    Authors: Pierre E. Jacob, Fredrik Lindsten, Thomas B. Schön

    Abstract: Particle filters provide Monte Carlo approximations of intractable quantities such as point-wise evaluations of the likelihood in state space models. In many scenarios, the interest lies in the comparison of these quantities as some parameter or input varies. To facilitate such comparisons, we introduce and study methods to couple two particle filters in such a way that the correlation between the… ▽ More

    Submitted 16 July, 2016; v1 submitted 3 June, 2016; originally announced June 2016.

    Comments: Technical report, 24 pages for the main document + 18 pages of appendices

  34. arXiv:1602.05128  [pdf, other

    stat.CO stat.ML

    Interacting Particle Markov Chain Monte Carlo

    Authors: Tom Rainforth, Christian A. Naesseth, Fredrik Lindsten, Brooks Paige, Jan-Willem van de Meent, Arnaud Doucet, Frank Wood

    Abstract: We introduce interacting particle Markov chain Monte Carlo (iPMCMC), a PMCMC method based on an interacting pool of standard and conditional sequential Monte Carlo samplers. Like related methods, iPMCMC is a Markov chain Monte Carlo sampler on an extended space. We present empirical results that show significant improvements in mixing rates relative to both non-interacting PMCMC samplers, and a si… ▽ More

    Submitted 12 April, 2017; v1 submitted 16 February, 2016; originally announced February 2016.

    Journal ref: JMLR W&CP 48 : 2616-2625, 2016

  35. arXiv:1511.05483  [pdf, other

    stat.CO stat.ML

    Accelerating pseudo-marginal Metropolis-Hastings by correlating auxiliary variables

    Authors: Johan Dahlin, Fredrik Lindsten, Joel Kronander, Thomas B. Schön

    Abstract: Pseudo-marginal Metropolis-Hastings (pmMH) is a powerful method for Bayesian inference in models where the posterior distribution is analytical intractable or computationally costly to evaluate directly. It operates by introducing additional auxiliary variables into the model and form an extended target distribution, which then can be evaluated point-wise. In many cases, the standard Metropolis-Ha… ▽ More

    Submitted 17 November, 2015; originally announced November 2015.

    Comments: 23 pages, 5 figures

  36. Rao-Blackwellized particle smoothers for conditionally linear Gaussian models

    Authors: Fredrik Lindsten, Pete Bunch, Simo Särkkä, Thomas B. Schön, Simon J. Godsill

    Abstract: Sequential Monte Carlo (SMC) methods, such as the particle filter, are by now one of the standard computational techniques for addressing the filtering problem in general state-space models. However, many applications require post-processing of data offline. In such scenarios the smoothing problem--in which all the available data is used to compute state estimates--is of central interest. We consi… ▽ More

    Submitted 23 May, 2015; originally announced May 2015.

  37. arXiv:1505.06356  [pdf, other

    stat.CO

    Particle ancestor sampling for near-degenerate or intractable state transition models

    Authors: Fredrik Lindsten, Pete Bunch, Sumeetpal S. Singh, Thomas B. Schön

    Abstract: We consider Bayesian inference in sequential latent variable models in general, and in nonlinear state space models in particular (i.e., state smoothing). We work with sequential Monte Carlo (SMC) algorithms, which provide a powerful inference framework for addressing this problem. However, for certain challenging and common model classes the state-of-the-art algorithms still struggle. The work is… ▽ More

    Submitted 23 May, 2015; originally announced May 2015.

  38. arXiv:1503.06058  [pdf, other

    stat.CO math.OC stat.ML

    Sequential Monte Carlo Methods for System Identification

    Authors: Thomas B. Schön, Fredrik Lindsten, Johan Dahlin, Johan Wågberg, Christian A. Naesseth, Andreas Svensson, Liang Dai

    Abstract: One of the key challenges in identifying nonlinear and possibly non-Gaussian state space models (SSMs) is the intractability of estimating the system state. Sequential Monte Carlo (SMC) methods, such as the particle filter (introduced more than two decades ago), provide numerical solutions to the nonlinear state estimation problems arising in SSMs. When combined with additional identification tech… ▽ More

    Submitted 10 March, 2016; v1 submitted 20 March, 2015; originally announced March 2015.

    Comments: In proceedings of the 17th IFAC Symposium on System Identification (SYSID). Added cover page

  39. arXiv:1502.03656  [pdf, other

    stat.CO q-fin.CP stat.ML

    Quasi-Newton particle Metropolis-Hastings

    Authors: Johan Dahlin, Fredrik Lindsten, Thomas B. Schön

    Abstract: Particle Metropolis-Hastings enables Bayesian parameter inference in general nonlinear state space models (SSMs). However, in many implementations a random walk proposal is used and this can result in poor mixing if not tuned correctly using tedious pilot runs. Therefore, we consider a new proposal inspired by quasi-Newton algorithms that may achieve similar (or better) mixing with less tuning. An… ▽ More

    Submitted 2 September, 2015; v1 submitted 12 February, 2015; originally announced February 2015.

    Comments: 23 pages, 5 figures. Accepted for the 17th IFAC Symposium on System Identification (SYSID), Beijing, China, October 2015

  40. arXiv:1502.02536  [pdf, other

    stat.CO stat.ME stat.ML

    Nested Sequential Monte Carlo Methods

    Authors: Christian A. Naesseth, Fredrik Lindsten, Thomas B. Schön

    Abstract: We propose nested sequential Monte Carlo (NSMC), a methodology to sample from sequences of probability distributions, even where the random variables are high-dimensional. NSMC generalises the SMC framework by requiring only approximate, properly weighted, samples from the SMC proposal distribution, while still resulting in a correct SMC algorithm. Furthermore, NSMC can in itself be used to produc… ▽ More

    Submitted 11 September, 2015; v1 submitted 9 February, 2015; originally announced February 2015.

    Comments: Extended version of paper published in Proceedings of the 32nd International Conference on Machine Learning (ICML), Lille, France, 2015

  41. arXiv:1501.02056  [pdf, other

    stat.ML cs.LG

    Sequential Kernel Herding: Frank-Wolfe Optimization for Particle Filtering

    Authors: Simon Lacoste-Julien, Fredrik Lindsten, Francis Bach

    Abstract: Recently, the Frank-Wolfe optimization algorithm was suggested as a procedure to obtain adaptive quadrature rules for integrals of functions in a reproducing kernel Hilbert space (RKHS) with a potentially faster rate of convergence than Monte Carlo integration (and "kernel herding" was shown to be a special case of this procedure). In this paper, we propose to replace the random sampling step in a… ▽ More

    Submitted 10 February, 2015; v1 submitted 9 January, 2015; originally announced January 2015.

    Comments: in 18th International Conference on Artificial Intelligence and Statistics (AISTATS), May 2015, San Diego, United States. 38, JMLR Workshop and Conference Proceedings

  42. arXiv:1409.7287  [pdf, other

    stat.CO math.OC stat.ML

    Identification of jump Markov linear models using particle filters

    Authors: Andreas Svensson, Thomas B. Schön, Fredrik Lindsten

    Abstract: Jump Markov linear models consists of a finite number of linear state space models and a discrete variable encoding the jumps (or switches) between the different linear models. Identifying jump Markov linear models makes for a challenging problem lacking an analytical solution. We derive a new expectation maximization (EM) type algorithm that produce maximum likelihood estimates of the model param… ▽ More

    Submitted 25 September, 2014; originally announced September 2014.

    Comments: Accepted to 53rd IEEE International Conference on Decision and Control (CDC), 2014 (Los Angeles, CA, USA)

    Journal ref: Proc. of IEEE 53rd Conference on Decision and Control (CDC), pp.6504,6509, 15-17 Dec. 2014 (Los Angeles, CA, USA)

  43. Divide-and-Conquer with Sequential Monte Carlo

    Authors: Fredrik Lindsten, Adam M. Johansen, Christian A. Naesseth, Bonnie Kirkpatrick, Thomas B. Schön, John Aston, Alexandre Bouchard-Côté

    Abstract: We propose a novel class of Sequential Monte Carlo (SMC) algorithms, appropriate for inference in probabilistic graphical models. This class of algorithms adopts a divide-and-conquer approach based upon an auxiliary tree-structured decomposition of the model of interest, turning the overall inferential task into a collection of recursively solved sub-problems. The proposed method is applicable to… ▽ More

    Submitted 30 June, 2015; v1 submitted 19 June, 2014; originally announced June 2014.

    Journal ref: Journal of Computational and Graphical Statistics, 26(2):445-458, 2017

  44. arXiv:1405.0102  [pdf, other

    cs.IT stat.CO

    Capacity estimation of two-dimensional channels using Sequential Monte Carlo

    Authors: Christian A. Naesseth, Fredrik Lindsten, Thomas B. Schön

    Abstract: We derive a new Sequential-Monte-Carlo-based algorithm to estimate the capacity of two-dimensional channel models. The focus is on computing the noiseless capacity of the 2-D one-infinity run-length limited constrained channel, but the underlying idea is generally applicable. The proposed algorithm is profiled against a state-of-the-art method, yielding more than an order of magnitude improvement… ▽ More

    Submitted 11 August, 2014; v1 submitted 1 May, 2014; originally announced May 2014.

  45. arXiv:1402.0330  [pdf, other

    stat.ME stat.ML

    Sequential Monte Carlo for Graphical Models

    Authors: Christian A. Naesseth, Fredrik Lindsten, Thomas B. Schön

    Abstract: We propose a new framework for how to use sequential Monte Carlo (SMC) algorithms for inference in probabilistic graphical models (PGM). Via a sequential decomposition of the PGM we find a sequence of auxiliary distributions defined on a monotonically increasing sequence of probability spaces. By targeting these auxiliary distributions using SMC we are able to approximate the full joint distributi… ▽ More

    Submitted 6 October, 2014; v1 submitted 3 February, 2014; originally announced February 2014.

  46. arXiv:1401.0604  [pdf, other

    stat.CO stat.ML

    Particle Gibbs with Ancestor Sampling

    Authors: Fredrik Lindsten, Michael I. Jordan, Thomas B. Schön

    Abstract: Particle Markov chain Monte Carlo (PMCMC) is a systematic way of combining the two main tools used for Monte Carlo statistical inference: sequential Monte Carlo (SMC) and Markov chain Monte Carlo (MCMC). We present a novel PMCMC algorithm that we refer to as particle Gibbs with ancestor sampling (PGAS). PGAS provides the data analyst with an off-the-shelf class of Markov kernels that can be used t… ▽ More

    Submitted 3 January, 2014; originally announced January 2014.

    Journal ref: Journal of Machine Learning Research, 15 (2014) 2145-2184

  47. arXiv:1312.4852  [pdf, other

    stat.ML eess.SY

    Identification of Gaussian Process State-Space Models with Particle Stochastic Approximation EM

    Authors: Roger Frigola, Fredrik Lindsten, Thomas B. Schön, Carl E. Rasmussen

    Abstract: Gaussian process state-space models (GP-SSMs) are a very flexible family of models of nonlinear dynamical systems. They comprise a Bayesian nonparametric representation of the dynamics of the system and additional (hyper-)parameters governing the properties of this nonparametric representation. The Bayesian formalism enables systematic reasoning about the uncertainty in the system dynamics. We pre… ▽ More

    Submitted 17 December, 2013; originally announced December 2013.

  48. arXiv:1312.0781  [pdf, other

    stat.CO

    Recursive maximum likelihood identification of jump Markov nonlinear systems

    Authors: Emre Özkan, Fredrik Lindsten, Carsten Fritsche, Fredrik Gustafsson

    Abstract: In this contribution, we present an online method for joint state and parameter estimation in jump Markov non-linear systems (JMNLS). State inference is enabled via the use of particle filters which makes the method applicable to a wide range of non-linear models. To exploit the inherent structure of JMNLS, we design a Rao-Blackwellized particle filter (RBPF) where the discrete mode is marginalize… ▽ More

    Submitted 3 December, 2013; originally announced December 2013.

    Comments: Submitted to the IEEE Transactions on Signal Processing on October 14, 2013

  49. Particle filter-based Gaussian process optimisation for parameter inference

    Authors: Johan Dahlin, Fredrik Lindsten

    Abstract: We propose a novel method for maximum likelihood-based parameter inference in nonlinear and/or non-Gaussian state space models. The method is an iterative procedure with three steps. At each iteration a particle filter is used to estimate the value of the log-likelihood function at the current parameter iterate. Using these log-likelihood estimates, a surrogate objective function is created by uti… ▽ More

    Submitted 31 March, 2014; v1 submitted 4 November, 2013; originally announced November 2013.

    Comments: Accepted for publication in proceedings of the 19th World Congress of the International Federation of Automatic Control (IFAC), Cape Town, South Africa, August 2014. 6 pages, 4 figures

  50. Particle Metropolis-Hastings using gradient and Hessian information

    Authors: Johan Dahlin, Fredrik Lindsten, Thomas B. Schön

    Abstract: Particle Metropolis-Hastings (PMH) allows for Bayesian parameter inference in nonlinear state space models by combining Markov chain Monte Carlo (MCMC) and particle filtering. The latter is used to estimate the intractable likelihood. In its original formulation, PMH makes use of a marginal MCMC proposal for the parameters, typically a Gaussian random walk. However, this can lead to a poor explora… ▽ More

    Submitted 18 September, 2014; v1 submitted 4 November, 2013; originally announced November 2013.

    Comments: 27 pages, 5 figures, 2 tables. The final publication is available at Springer via: http://dx.doi.org/10.1007/s11222-014-9510-0

    Journal ref: Statistics and Computing, Volume 25, Issue 1, pp 81-92, 2015