Time-efficient Bayesian Inference for a (Skewed) Von Mises Distribution on the Torus in a Deep Probabilistic Programming Language _ IEEE Conference Publication _ IEEE Xplore.pdf
Automatic inference; Bayesian inference; Efficient implementation; Inference algorithm; Learning frameworks; Optimisations; Probabilistic programming language; Statistic modeling; Time-efficient; Von Mises distribution; Control and Systems Engineering; Software; Computer Science Applications
Abstract :
[en] Probabilistic programming languages (PPLs) are at the interface between statistics and the theory of programming languages. PPLs formulate statistical models as stochastic programs that enable automatic inference algorithms and optimization. Pyro [1] and its sibling NumPyro [2] are PPLs built on top of the deep learning frameworks PyTorch [3] and Jax [4], respectively. Both PPLs provide simple, highly similar interfaces for inference using efficient implementations of Hamiltonian Monte Carlo (HMC), the No-U-Turn Sampler (NUTS), and Stochastic Variational Inference (SVI). They automatically generate variational distributions from a model, automatically enumerate discrete variables, and support formulating deep probabilistic models such as variational autoencoders and deep Markov models. The Sine von Mises distribution and its skewed variant are toroidal distributions relevant to protein bioinformatics. They provide a natural way to model the dihedral angles of protein structures, which is important in protein structure prediction, simulation and analysis. We present efficient implementations of the Sine von Mises distribution and its skewing in Pyro and NumPyro, and devise a simulation method that increases efficiency with several orders of magnitude when using parallel hardware (i.e., modern CPUs, GPUs, and TPUs). We demonstrate the use of the skewed Sine von Mises distribution by modeling dihedral angles of proteins using a Bayesian mixture model inferred using NUTS, exploiting NumPyro's facilities for automatic enumeration [5].
Disciplines :
Mathematics
Author, co-author :
Ronning, Ola; University of Copenhagen, Department of Computer Science, Department of Biology, Denmark
LEY, Christophe ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Mathematics (DMATH) ; Computer Science and Statistics, Ghent University, Department of Applied Mathematics, Belgium
Mardia, Kanti V.; School of Mathematics, University of Leeds, United Kingdom
Hamelryck, Thomas; University of Copenhagen, Department of Computer Science, Department of Biology, Denmark
External co-authors :
yes
Language :
English
Title :
Time-efficient Bayesian Inference for a (Skewed) von Mises Distribution on the Torus in a Deep Probabilistic Programming Language
Publication date :
25 September 2021
Event name :
2021 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)
Event place :
Karlsruhe, Deu
Event date :
23-09-2021 => 25-09-2021
By request :
Yes
Journal title :
IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems
Publisher :
Institute of Electrical and Electronics Engineers Inc.
We would like to thank the Pyro PPL developers Fritz Obermeyer and Du Phat for valuable feedback and discussion about our implementations. We acknowledge support from the Independent Research Fund Denmark (DFF) under the grant ”Deep Probabilistic Programming for Protein Structure Prediction”. K. V. Mardia acknowledges the Leverhulme Trust for the Emeritus Fellowship. Christophe Ley’s research is supported by the FWO Krediet aan Navorsers grant with reference number 1510391N.
Eli Bingham, Jonathan p Chen, Martin Jankowiak, Fritz Obermeyer, Neeraj Pradhan, Theofanis Karaletsos, Rohit Singh, Paul Szerlip, Paul Horsfall, and Noah d Goodman. Pyro: Deep universal probabilistic programming. The Journal of Machine Learning Research, 20(1):973-978, 2019.
Du Phan, Neeraj Pradhan, and Martin Jankowiak. Composable effects for flexible and accelerated probabilistic programming in numpyro. arXiv preprint arXiv:1912.11554, 2019.
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Köpf, Edward Yang, Zach DeVito, Martin Raison, Alykhan Tejani, Sasank Chil-amkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 32:8026-8037, 2019.
James Bradbury, Roy Frostig, Peter Hawkins, Matthew James Johnson, Chris Leary, Dougal Maclaurin, George Necula, Adam Paszke, Jake VanderPlas, Skye Wanderman-Milne, and Qiao Zhang. jax: Composable transformations of Python+NumPy programs. http://github.com/google/jax, 2018.
Fritz Obermeyer, Eli Bingham, Martin Jankowiak, Du Phan, and Jonathan P. Chen. Functional tensors for probabilistic programming. arXiv preprint arXiv:1910.10775, 2020.
John Moult, Jan t Pedersen, Richard Judson, and Krzysztof Fidelis. a large-scale experiment to assess protein structure prediction methods. Proteins: Structure, Function, and Bioinformatics, 23(3):ii-iv, 1995.
Mohammed AlQuraishi. Machine learning in protein structure prediction. Current Opinion in Chemical Biology, 65:1-8, 2021.
Andrew W Senior, Richard Evans, John Jumper, James Kirkpatrick, Laurent Sifre, Tim Green, Chongli Qin, Augustin Z?ídek, Alexander WR Nelson, Alex Bridgland, et al. Improved protein structure prediction using potentials from deep learning. Nature, 577(7792):706-710, 2020.
Joan-Emma Shea, Robert B Best, and Jeetain Mittal. Physics-based computational and theoretical approaches to intrinsically disordered proteins. Current Opinion in Structural Biology, 67:219-225, 2021.
Thomas Hamelryck, Kanti V Mardia, and Jesper Ferkinghoff-Borg. Bayesian Methods in Structural Bioinformatics. Springer, 2012.
Kanti V Mardia. Statistics of directional data. Journal of the Royal Statistical Society: Series B (Methodological), 37(3):349-371, 1975.
Harshinder Singh, Vladimir Hnizdo, and Eugene Demchuk. Probabilistic model for two dependent circular variables. Biometrika, 89(3):719-723, 2002.
Jose Ameijeiras-Alonso and Christophe Ley. Sine-skewed toroidal distributions and their application in protein bioinformatics. Biostatistics, 2021 (to appear).
Diederik P. Kingma and Max Welling. An introduction to varia-tional autoencoders. Foundations and Trends® in Machine Learning, 12(4):307-392, 2019.
Matthew D Hoffman and Andrew Gelman. The no-u-turn sampler: Adaptively setting path lengths in hamiltonian monte carlo. Journal of Machine Learning Research, 15(1):1593-1623, 2014.
John T Kent, Asaad M Ganeiber, and Kanti V Mardia. A new unified approach for the simulation of a wide class of directional distributions. Journal of Computational and Graphical Statistics, 27(2):291-301, 2018.
Louis-Paul Rivest. A distribution for dependent unit vectors. Communications in Statistics-Theory and Methods, 17(2):461-483, 1988.
Kanti V Mardia, Charles C Taylor, and Ganesh K Subramaniam. Protein bioinformatics and mixtures of bivariate von mises distributions for angular data. Biometrics, 63(2):505-512, 2007.
Christopher Bystroff, Vesteinn Thorsson, and David Baker. Hmmstr: a hidden markov model for local sequence-structure correlations in proteins. Journal of molecular biology, 301(1):173-190, 2000.
Christian B. Thygesen, Ahmad S. Al-Sibahi, Christian S. Steenmanns, Lys S. Moreta, Anders B. Sørensen, and Thomas Hamelryck. Efficient generative modelling of protein structure fragments using a deep markov model. In International Conference on Machine Learning. PMLR, 2021 (to appear).
Carol A Rohl, Charlie EM Strauss, Kira MS Misura, and David Baker. Protein structure prediction using rosetta. Methods in Enzymology, 383:66-93, 2004.
Gopalasamudram N Ramachandran, Chandrasekharan Ramakrishnan, and Viswanathan Sasisekharan. Stereochemistry of polypeptide chain configurations. Journal of Molecular Biology, 7:95-99, 1963.
Dorothee Liebschner, Pavel V Afonine, Matthew L Baker, Gábor Bunkóczi, Vincent B Chen, Tristan I Croll, Bradley Hintze, L-W Hung, Swati Jain, Airlie J McCoy, et al. Macromolecular structure determination using X-rays, neutrons and electrons: Recent developments in phenix. Acta Crystallographica Section D: Structural Biology, 75(10):861-877, 2019.
Kanti V Mardia. Statistical approaches to three key challenges in protein structural bioinformatics. Journal of the Royal Statistical Society: SERIES C: Applied Statistics, pages 487-514, 2013.
Jayaram Sethuraman. A constructive definition of dirichlet priors. Statistica sinica, pages 639-650, 1994.
Diederik P Kingma, Tim Salimans, and Max Welling. Variational dropout and the local reparameterization trick. arXiv preprint arXiv:1506.02557, 2015.
Jiacheng Xu and Greg Durrett. Spherical latent spaces for stable variational autoencoders. arXiv preprint arXiv:1808.10805, 2018.
Andrew Gelman, Donald B Rubin, et al. Inference from iterative simulation using multiple sequences. Statistical science, 7(4):457-472, 1992.