On particle methods for parameter estimation in state-space models

N Kantas, A Doucet, SS Singh, J Maciejowski… - 2015 - projecteuclid.org
Nonlinear non-Gaussian state-space models are ubiquitous in statistics, econometrics,
information engineering and signal processing. Particle methods, also known as Sequential …

Practical and asymptotically exact conditional sampling in diffusion models

L Wu, B Trippe, C Naesseth, D Blei… - Advances in Neural …, 2023 - proceedings.neurips.cc
Diffusion models have been successful on a range of conditional generation tasks including
molecular design and text-to-image generation. However, these achievements have …

Automated learning with a probabilistic programming language: Birch

LM Murray, TB Schön - Annual Reviews in Control, 2018 - Elsevier
This work offers a broad perspective on probabilistic modeling and inference in light of
recent advances in probabilistic programming, in which models are formally expressed in …

Probabilistic inference in language models via twisted sequential monte carlo

S Zhao, R Brekelmans, A Makhzani… - arxiv preprint arxiv …, 2024 - arxiv.org
Numerous capability and safety techniques of Large Language Models (LLMs), including
RLHF, automated red-teaming, prompt engineering, and infilling, can be cast as sampling …

The iterated auxiliary particle filter

P Guarniero, AM Johansen, A Lee - Journal of the American …, 2017 - Taylor & Francis
We present an offline, iterated particle filter to facilitate statistical inference in general state
space hidden Markov models. Given a model and a sequence of observations, the …

Sixo: Smoothing inference with twisted objectives

D Lawson, A Raventós, A Warrington… - Advances in Neural …, 2022 - proceedings.neurips.cc
Abstract Sequential Monte Carlo (SMC) is an inference algorithm for state space models that
approximates the posterior by sampling from a sequence of target distributions. The target …

Nonlinear system identification: learning while respecting physical models using a sequential Monte Carlo method

A Wigren, J Wågberg, F Lindsten… - IEEE Control …, 2022 - ieeexplore.ieee.org
The identification of nonlinear systems is a challenging problem. Physical knowledge of a
system can be used in the identification process to significantly improve the predictive …

On the role of interaction in sequential Monte Carlo algorithms

N Whiteley, A Lee, K Heine - 2016 - projecteuclid.org
We introduce a general form of sequential Monte Carlo algorithm defined in terms of a
parameterized resampling mechanism. We find that a suitably generalized notion of the …

Reinforcement learning: An overview

K Murphy - arxiv preprint arxiv:2412.05265, 2024 - arxiv.org
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …

Conditioning diffusion models by explicit forward-backward bridging

A Corenflos, Z Zhao, S Särkkä, J Sjölund… - arxiv preprint arxiv …, 2024 - arxiv.org
Given an unconditional diffusion model $\pi (x, y) $, using it to perform conditional simulation
$\pi (x\mid y) $ is still largely an open question and is typically achieved by learning …