TY - JOUR
A1 - Thapa, Samudrajit
A1 - Park, Seongyu
A1 - Kim, Yeongjin
A1 - Jeon, Jae-Hyung
A1 - Metzler, Ralf
A1 - Lomholt, Michael A.
T1 - Bayesian inference of scaled versus fractional Brownian motion
JF - Journal of physics : A, mathematical and theoretical
N2 - We present a Bayesian inference scheme for scaled Brownian motion, and investigate its performance on synthetic data for parameter estimation and model selection in a combined inference with fractional Brownian motion. We include the possibility of measurement noise in both models. We find that for trajectories of a few hundred time points the procedure is able to resolve the true model and parameters well. When the prior of the synthetic data generation process is also used for the inference, the approach is optimal in the sense of decision theory. We include a comparison with inference using a prior different from the data-generating one.
KW - Bayesian inference
KW - scaled Brownian motion
KW - single particle tracking
Y1 - 2022
U6 - https://doi.org/10.1088/1751-8121/ac60e7
SN - 1751-8113
SN - 1751-8121
VL - 55
IS - 19
PB - IOP Publ. Ltd.
CY - Bristol
ER -
TY - THES
A1 - Thapa, Samudrajit
T1 - Deciphering anomalous diffusion in complex systems using Bayesian inference and large deviation theory
N2 - The development of methods such as super-resolution microscopy (Nobel Prize in Chemistry, 2014) and multi-scale computer modelling (Nobel Prize in Chemistry, 2013) has provided scientists with powerful tools to study microscopic systems. Sub-micron particles or even fluorescently labelled single molecules can now be tracked for long times in a variety of systems, such as living cells, biological membranes and colloidal solutions, at spatial and temporal resolutions previously inaccessible. Parallel to such single-particle tracking experiments, super-computing techniques enable simulations of large atomistic or coarse-grained systems, such as biologically relevant membranes or proteins, from picoseconds to seconds, generating large volumes of data. These developments have led to an unprecedented rise in the number of reported cases of anomalous diffusion, wherein the characteristic features of Brownian motion—namely linear growth of the mean squared displacement with time and the Gaussian form of the probability density function (PDF) of finding a particle at a given position at some fixed time—are routinely violated. This presents a major challenge in identifying the underlying stochastic process and in estimating the corresponding parameters needed to completely describe the observed behaviour. Finding the correct physical mechanism which leads to the observed dynamics is of paramount importance, for example, to understand the first-arrival time of transcription factors which govern gene regulation, or the survival probability of a pathogen in a biological cell after drug administration. Statistical physics provides useful methods that can be applied to extract such vital information. This cumulative dissertation, based on five publications, focuses on the development, implementation and application of such tools, with special emphasis on Bayesian inference and large deviation theory. Together with the implementation of Bayesian model comparison and parameter estimation methods for models of diffusion, complementary tools based on different observables and large deviation theory are developed to classify stochastic processes and gather pivotal information.
Bayesian analysis of the data of micron-sized particles tracked in mucin hydrogels at different pH conditions unveiled several interesting features, and we gained insights into, for example, how, in going from basic to acidic pH, the hydrogel becomes more heterogeneous and phase separation can set in, leading to the observed non-ergodicity (non-equivalence of time and ensemble averages) and non-Gaussian PDF. With an analysis based on large deviation theory we could detect, for instance, non-Gaussianity in the seemingly Brownian diffusion of beads in aqueous solution, anisotropic motion of the beads in mucin at neutral pH conditions, and short-time correlations in climate data. Thus, through the application of the developed methods to biological and meteorological datasets, crucial information is garnered about the underlying stochastic processes and significant insights are obtained into the physical nature of these systems.
N2 - The development of methods such as super-resolution microscopy (Nobel Prize in Chemistry, 2014) and multi-scale computer modelling (Nobel Prize in Chemistry, 2013) has provided scientists with powerful tools for studying microscopic systems. Sub-micrometre particles and even single fluorescently labelled molecules can now be tracked over long observation times in a variety of systems, such as living cells, biological membranes and colloidal suspensions, at previously unattained spatial and temporal resolution. Alongside such single-particle tracking experiments, supercomputers enable the simulation of large atomistic or coarse-grained systems, such as biologically relevant membranes or proteins, from a few picoseconds up to several seconds, producing large volumes of data. These developments have led to an unprecedented rise in the number of reported cases of anomalous diffusion, in which the characteristic features of Brownian diffusion, namely the linear growth of the mean squared displacement with time and the Gaussian form of the probability density function of finding a particle at a given position and time, are violated. This poses a major challenge for identifying the underlying stochastic process and estimating the associated process parameters, which is necessary for a complete description of the observed behaviour. Finding the correct physical mechanism that leads to the observed behaviour is of paramount importance, e.g. in understanding the first-arrival time of transcription factors, which govern gene regulation, or the survival probability of a pathogen in a biological cell after drug administration. Statistical physics provides useful methods that can be applied to obtain such vital information. The focus of this cumulative dissertation, based on five publications, lies on the development, implementation and application of such methods, with particular emphasis on Bayesian inference and large deviation theory. Together with the implementation of Bayesian model comparison and parameter estimation methods for diffusion models, complementary methods based on different observables and large deviation theory are developed in order to classify stochastic processes and obtain important information. The Bayesian analysis of the motion data of micrometre-sized particles tracked in mucin hydrogels at different pH values revealed several interesting features, and we gained insights into, for example, how the transition from basic to acidic pH increases the heterogeneity of the hydrogel and phase separation can set in, leading to the observed non-ergodicity (non-equivalence of time and ensemble averages) and non-Gaussian probability density. With an analysis based on large deviation theory we could detect, for example, non-Gaussian behaviour in the seemingly Brownian diffusion of particles in aqueous solution, anisotropic motion of particles in mucin at neutral pH, and short-time correlations in climate data. Consequently, the application of the developed methods to biological and meteorological data yields crucial information about the underlying stochastic processes and significant insights for understanding the properties of these systems.
KW - anomalous diffusion
KW - Bayesian inference
KW - large deviation theory
KW - statistical physics
Y1 - 2020
ER -
TY - JOUR
A1 - Seelig, Stefan A.
A1 - Rabe, Maximilian Michael
A1 - Malem-Shinitski, Noa
A1 - Risse, Sarah
A1 - Reich, Sebastian
A1 - Engbert, Ralf
T1 - Bayesian parameter estimation for the SWIFT model of eye-movement control during reading
JF - Journal of mathematical psychology
N2 - Process-oriented theories of cognition must be evaluated against time-ordered observations. Here we present a representative example for data assimilation of the SWIFT model, a dynamical model of the control of fixation positions and fixation durations during natural reading of single sentences. First, we develop and test an approximate likelihood function of the model, which is a combination of a spatial, pseudo-marginal likelihood and a temporal likelihood obtained by probability density approximation. Second, we implement a Bayesian approach to parameter inference using an adaptive Markov chain Monte Carlo procedure.
Our results indicate that model parameters can be estimated reliably for individual subjects. We conclude that approximate Bayesian inference represents a considerable step forward for computational models of eye-movement control, where modeling of individual data on the basis of process-based dynamic models has not been possible so far.
KW - dynamical models
KW - reading
KW - eye movements
KW - saccades
KW - likelihood function
KW - Bayesian inference
KW - MCMC
KW - interindividual differences
Y1 - 2020
U6 - https://doi.org/10.1016/j.jmp.2019.102313
SN - 0022-2496
SN - 1096-0880
VL - 95
PB - Elsevier
CY - San Diego
ER -
TY - JOUR
A1 - Schütt, Heiko Herbert
A1 - Harmeling, Stefan
A1 - Macke, Jakob H.
A1 - Wichmann, Felix A.
T1 - Painfree and accurate Bayesian estimation of psychometric functions for (potentially) overdispersed data
JF - Vision research : an international journal for functional aspects of vision
N2 - The psychometric function describes how an experimental variable, such as stimulus strength, influences the behaviour of an observer. Estimation of psychometric functions from experimental data plays a central role in fields such as psychophysics, experimental psychology and the behavioural neurosciences. Experimental data may exhibit substantial overdispersion, which may result from non-stationarity in the behaviour of observers. Here we extend the standard binomial model, which is typically used for psychometric function estimation, to a beta-binomial model. We show that the use of the beta-binomial model makes it possible to determine accurate credible intervals even in data which exhibit substantial overdispersion. This goes beyond classical goodness-of-fit measures for overdispersion, which can detect overdispersion but provide no method for correct inference on overdispersed data. We use Bayesian inference methods for estimating the posterior distribution of the parameters of the psychometric function. Unlike previous Bayesian psychometric inference methods, our software implementation, psignifit 4, performs numerical integration of the posterior within automatically determined bounds. This avoids the use of Markov chain Monte Carlo (MCMC) methods, which typically require expert knowledge. Extensive numerical tests show the validity of the approach, and we discuss implications of overdispersion for experimental design. A comprehensive MATLAB toolbox implementing the method is freely available; a Python implementation providing the basic capabilities is also available.
KW - Psychometric function
KW - Bayesian inference
KW - Beta-binomial model
KW - Overdispersion
KW - Non-stationarity
KW - Confidence intervals
KW - Credible intervals
KW - Psychophysical methods
Y1 - 2016
U6 - https://doi.org/10.1016/j.visres.2016.02.002
SN - 0042-6989
SN - 1878-5646
VL - 122
SP - 105
EP - 123
PB - Elsevier
CY - Oxford
ER -
TY - JOUR
A1 - Schad, Daniel
A1 - Vasishth, Shravan
T1 - The posterior probability of a null hypothesis given a statistically significant result
JF - The quantitative methods for psychology
N2 - When researchers carry out a null hypothesis significance test, it is tempting to assume that a statistically significant result lowers Prob(H0), the probability of the null hypothesis being true. Technically, such a statement is meaningless for various reasons: e.g., the null hypothesis does not have a probability associated with it. However, it is possible to relax certain assumptions to compute the posterior probability Prob(H0) under repeated sampling. We show in a step-by-step guide that the intuitively appealing belief that Prob(H0) is low when significant results have been obtained under repeated sampling is in general incorrect and depends greatly on (a) the prior probability of the null being true, (b) the type-I error rate, (c) the type-II error rate, and (d) whether the result has been replicated.
Through step-by-step simulations using open-source code in the R system for statistical computing, we show that uncertainty about the null hypothesis being true often remains high despite a significant result. To help the reader develop intuitions about this common misconception, we provide a Shiny app (https://danielschad.shinyapps.io/probnull/). We expect that this tutorial will help researchers better understand and judge results from null hypothesis significance tests.
KW - Null hypothesis significance testing
KW - Bayesian inference
KW - statistical power
Y1 - 2022
U6 - https://doi.org/10.20982/tqmp.18.2.p011
SN - 1913-4126
SN - 2292-1354
VL - 18
IS - 2
SP - 130
EP - 141
PB - University of Montreal, Department of Psychology
CY - Montreal
ER -
TY - JOUR
A1 - Savoy, Heather
A1 - Heße, Falk
T1 - Dimension reduction for integrating data series in Bayesian inversion of geostatistical models
JF - Stochastic environmental research and risk assessment
N2 - This study explores methods with which multidimensional data, e.g. time series, can be effectively incorporated into a Bayesian framework for inferring geostatistical parameters. Such series are difficult to use directly in the likelihood estimation procedure due to their high dimensionality; thus, a dimension reduction approach is taken to utilize these measurements in the inference. Two synthetic scenarios from hydrology are explored in which pumping drawdown and concentration breakthrough curves are used to infer the global mean of a log-normally distributed hydraulic conductivity field. Both cases pursue the use of a parametric model to represent the shape of the observed time series with physically interpretable parameters (e.g. the time and magnitude of a concentration peak), which is compared to subsets of the observations with similar dimensionality. The results from both scenarios highlight the effectiveness of the shape-matching models in reducing dimensionality from over 100 dimensions down to fewer than five. The models outperform the alternative subset method, especially when the observations are noisy. This approach to incorporating time series observations in the Bayesian framework for inferring geostatistical parameters allows high-dimensional observations to be faithfully represented in lower-dimensional space for the non-parametric likelihood estimation procedure, which increases the applicability of the framework to more observation types. Although the scenarios are both from hydrogeology, the methodology is general in that no assumptions are made about the subject domain. Any application that requires the inference of geostatistical parameters using series in either time or space can use the approach described in this paper.
KW - Geostatistics
KW - Stochastic hydrogeology
KW - Dimension reduction
KW - Bayesian inference
Y1 - 2019
U6 - https://doi.org/10.1007/s00477-019-01697-9
SN - 1436-3240
SN - 1436-3259
VL - 33
IS - 7
SP - 1327
EP - 1344
PB - Springer
CY - New York
ER -
TY - JOUR
A1 - Rosenbaum, Benjamin
A1 - Raatz, Michael
A1 - Weithoff, Guntram
A1 - Fussmann, Gregor F.
A1 - Gaedke, Ursula
T1 - Estimating parameters from multiple time series of population dynamics using Bayesian inference
JF - Frontiers in ecology and evolution
N2 - Empirical time series of interacting entities, e.g., species abundances, are highly useful to study ecological mechanisms. Mathematical models are valuable tools to further elucidate those mechanisms and underlying processes. However, obtaining an agreement between model predictions and experimental observations remains a demanding task. As models always abstract from reality, one parameter often summarizes several properties. Parameter measurements are performed in additional experiments independent of the ones delivering the time series.
Transferring these parameter values to different settings may result in incorrect parametrizations. On top of that, the properties of organisms and thus the respective parameter values may vary considerably. These issues limit the use of a priori model parametrizations. In this study, we present a method suited for a direct estimation of model parameters and their variability from experimental time series data. We combine numerical simulations of a continuous-time dynamical population model with Bayesian inference, using a hierarchical framework that allows for variability of individual parameters. The method is applied to a comprehensive set of time series from a laboratory predator-prey system that features both steady states and cyclic population dynamics. Our model predictions are able to reproduce both the steady states and the cyclic dynamics of the data. In addition to the direct estimates of the parameter values, the Bayesian approach also provides their uncertainties. We found that fitting cyclic population dynamics, which contain more information on the process rates than steady states, yields more precise parameter estimates. We detected significant variability among parameters of different time series and identified the variation in the maximum growth rate of the prey as a source of the transition from steady states to cyclic dynamics. By lending more flexibility to the model, our approach facilitates parametrizations and shows more readily which patterns in time series can also be explained by simple models. Applying Bayesian inference and dynamical population models in conjunction may help to quantify the profound variability in organismal properties in nature.
KW - Bayesian inference
KW - chemostat experiments
KW - ordinary differential equation
KW - parameter estimation
KW - population dynamics
KW - predator prey
KW - time series analysis
KW - trait variability
Y1 - 2019
U6 - https://doi.org/10.3389/fevo.2018.00234
SN - 2296-701X
VL - 6
PB - Frontiers Research Foundation
CY - Lausanne
ER -
TY - JOUR
A1 - Reich, Sebastian
T1 - A nonparametric ensemble transform method for Bayesian inference
JF - SIAM journal on scientific computing
N2 - Many applications, such as intermittent data assimilation, lead to a recursive application of Bayesian inference within a Monte Carlo context. Popular data assimilation algorithms include sequential Monte Carlo methods and ensemble Kalman filters (EnKFs). These methods differ in the way Bayesian inference is implemented. Sequential Monte Carlo methods rely on importance sampling combined with a resampling step, while EnKFs utilize a linear transformation of Monte Carlo samples based on the classic Kalman filter. While EnKFs have proven to be quite robust even for small ensemble sizes, they are not consistent, since their derivation relies on a linear regression ansatz. In this paper, we propose another transform method, which does not rely on any a priori assumptions about the underlying prior and posterior distributions. The new method is based on solving an optimal transportation problem for discrete random variables.
KW - Bayesian inference
KW - Monte Carlo method
KW - sequential data assimilation
KW - linear programming
KW - resampling
Y1 - 2013
U6 - https://doi.org/10.1137/130907367
SN - 1064-8275
VL - 35
IS - 4
SP - A2013
EP - A2024
PB - Society for Industrial and Applied Mathematics
CY - Philadelphia
ER -
TY - JOUR
A1 - Rabe, Maximilian Michael
A1 - Chandra, Johan
A1 - Krügel, André
A1 - Seelig, Stefan A.
A1 - Vasishth, Shravan
A1 - Engbert, Ralf
T1 - A Bayesian approach to dynamical modeling of eye-movement control in reading of normal, mirrored, and scrambled texts
JF - Psychological Review
N2 - In eye-movement control during reading, advanced process-oriented models have been developed to reproduce behavioral data. So far, model complexity and large numbers of model parameters have prevented rigorous statistical inference and modeling of interindividual differences. Here we propose a Bayesian approach to both problems for one representative computational model of sentence reading (SWIFT; Engbert et al., Psychological Review, 112, 2005, pp. 777-813). We used experimental data from 36 subjects who read the text in a normal layout and in one of four manipulated text layouts (e.g., mirrored or scrambled letters). The SWIFT model was fitted to subjects and experimental conditions individually to investigate between-subject variability. Based on posterior distributions of model parameters, fixation probabilities and durations are reliably recovered from simulated data and reproduced for withheld empirical data, at both the experimental condition and subject levels. A subsequent statistical analysis of model parameters across reading conditions generates model-driven explanations for observable effects between conditions.
KW - reading eye movements
KW - dynamical models
KW - Bayesian inference
KW - oculomotor control
KW - individual differences
Y1 - 2021
U6 - https://doi.org/10.1037/rev0000268
SN - 0033-295X
SN - 1939-1471
VL - 128
IS - 5
SP - 803
EP - 823
PB - American Psychological Association
CY - Washington
ER -
TY - JOUR
A1 - Molkenthin, Christian
A1 - Donner, Christian
A1 - Reich, Sebastian
A1 - Zöller, Gert
A1 - Hainzl, Sebastian
A1 - Holschneider, Matthias
A1 - Opper, Manfred
T1 - GP-ETAS: semiparametric Bayesian inference for the spatio-temporal epidemic type aftershock sequence model
JF - Statistics and Computing
N2 - The spatio-temporal epidemic type aftershock sequence (ETAS) model is widely used to describe the self-exciting nature of earthquake occurrences. While traditional inference methods provide only point estimates of the model parameters, we aim at a fully Bayesian treatment of model inference, which naturally allows the incorporation of prior knowledge and uncertainty quantification of the resulting estimates. Therefore, we introduce a highly flexible, non-parametric representation of the spatially varying ETAS background intensity through a Gaussian process (GP) prior. Combined with classical triggering functions, this results in a new model formulation, namely the GP-ETAS model. We enable tractable and efficient Gibbs sampling by deriving an augmented form of the GP-ETAS inference problem. This novel sampling approach allows us to assess the posterior model variables conditioned on observed earthquake catalogues, i.e., the spatial background intensity and the parameters of the triggering function. Empirical results on two synthetic data sets indicate that GP-ETAS outperforms standard models and thus demonstrate its predictive power for observed earthquake catalogues, including uncertainty quantification for the estimated parameters. Finally, a case study for the L'Aquila region, Italy, with the devastating event of 6 April 2009, is presented.
KW - Self-exciting point process
KW - Hawkes process
KW - Spatio-temporal ETAS model
KW - Bayesian inference
KW - Sampling
KW - Earthquake modeling
KW - Gaussian process
KW - Data augmentation
Y1 - 2022
U6 - https://doi.org/10.1007/s11222-022-10085-3
SN - 0960-3174
SN - 1573-1375
VL - 32
IS - 2
PB - Springer
CY - Dordrecht
ER -