Show simple item record

FieldValueLanguage
dc.contributor.authorQuiroz, Matias
dc.contributor.authorVillani, Mattias
dc.contributor.authorKohn, Robert
dc.contributor.authorTran, Minh-Ngoc
dc.date.accessioned2017-01-19
dc.date.available2017-01-19
dc.date.issued2016-01-01
dc.identifier.urihttp://hdl.handle.net/2123/16205
dc.description.abstractWe propose Subsampling MCMC, a Markov Chain Monte Carlo (MCMC) framework where the likelihood function for n observations is estimated from a random subset of m observations. We introduce a general and highly efficient unbiased estimator of the log-likelihood based on control variates obtained from clustering the data. The cost of computing the log-likelihood estimator is much smaller than that of the full log-likelihood used by standard MCMC. The likelihood estimate is bias-corrected and used in two correlated pseudo-marginal algorithms to sample from a perturbed posterior, for which we derive the asymptotic error with respect to n and m, respectively. A practical estimator of the error is proposed and we show that the error is negligible even for a very small m in our applications. We demonstrate that Subsampling MCMC is substantially more efficient than standard MCMC in terms of sampling efficiency for a given computational budget, and that it outperforms other subsampling methods for MCMC proposed in the literature.en
dc.relation.ispartofseriesBAWP-2016-07en
dc.rightsOtheren
dc.subjectBayesian inferenceen
dc.subjectCorrelated pseudo-marginalen
dc.subjectEstimated likelihooden
dc.subjectBlock pseudo-marginalen
dc.subjectBig Dataen
dc.subjectSurvey samplingen
dc.titleSpeeding up MCMC by Efficient Data Subsamplingen
dc.typeWorking paperen
dc.type.pubtypePre-printen
usyd.facultyThe University of Sydney Business School, Discipline of Business Analyticsen


Show simple item record

Associated file/s

Associated collections

Show simple item record

There are no previous versions of the item available.