Predictive Coding Approximates Backprop Along Arbitrary Computation Graphs

Neural Comput. 2022 May 19;34(6):1329-1368. doi: 10.1162/neco_a_01497.

Abstract

Backpropagation of error (backprop) is a powerful algorithm for training machine learning architectures through end-to-end differentiation. Recently, it has been shown that backprop in multilayer perceptrons (MLPs) can be approximated using predictive coding, a biologically plausible process theory of cortical computation that relies solely on local and Hebbian updates. The power of backprop, however, lies not in its instantiation in MLPs but in the concept of automatic differentiation, which allows for the optimization of any differentiable program expressed as a computation graph. Here, we demonstrate that predictive coding converges asymptotically (and in practice, rapidly) to exact backprop gradients on arbitrary computation graphs using only local learning rules. We apply this result to develop a straightforward strategy to translate core machine learning architectures into their predictive coding equivalents. We construct predictive coding convolutional neural networks, recurrent neural networks, and the more complex long short-term memory (LSTM), which includes a non-layer-like branching internal graph structure and multiplicative interactions. Our models perform equivalently to backprop on challenging machine learning benchmarks while using only local and (mostly) Hebbian plasticity. Our method raises the potential that standard machine learning algorithms could in principle be directly implemented in neural circuitry and may also contribute to the development of completely distributed neuromorphic architectures.
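The following is a minimal NumPy sketch (not taken from the paper) illustrating the abstract's central claim on a small MLP: when the output nodes are clamped to the target and the hidden value nodes are relaxed with purely local updates, the equilibrium prediction errors match the backprop gradients. The architecture, step size, and iteration count are illustrative assumptions, and the sketch uses the setting in which predictions and Jacobians are held at their feedforward values during inference.

```python
import numpy as np

rng = np.random.default_rng(0)
f, df = np.tanh, lambda z: 1.0 - np.tanh(z) ** 2

sizes = [4, 8, 8, 2]                              # assumed toy architecture
W = [rng.standard_normal((sizes[l + 1], sizes[l])) * 0.3
     for l in range(len(sizes) - 1)]
L = len(W)                                        # number of weight layers

x_in = rng.standard_normal(sizes[0])
target = rng.standard_normal(sizes[-1])

# Feedforward pass: value nodes start at their feedforward predictions.
v = [x_in]
for l in range(L):
    v.append(W[l] @ f(v[l]))
mu = [vi.copy() for vi in v]                      # predictions held fixed during inference

# Backprop gradients of the squared-error loss, computed for comparison.
grad = {L: v[L] - target}
for l in range(L - 1, 0, -1):
    grad[l] = df(v[l]) * (W[l].T @ grad[l + 1])

# Predictive coding inference: clamp the output to the target and relax the
# hidden value nodes using only locally available prediction errors.
v[L] = target.copy()
for _ in range(500):                              # assumed iteration count
    e = [v[l] - mu[l] for l in range(L + 1)]
    for l in range(1, L):
        v[l] += 0.1 * (-e[l] + df(mu[l]) * (W[l].T @ e[l + 1]))

# At equilibrium the prediction errors equal the negative backprop gradients.
e = [v[l] - mu[l] for l in range(L + 1)]
for l in range(1, L + 1):
    print(f"layer {l}: max |e_l + grad_l| = {np.max(np.abs(e[l] + grad[l])):.2e}")

# The corresponding weight update is local and Hebbian: postsynaptic
# prediction error times presynaptic activity.
dW = [np.outer(e[l + 1], f(v[l])) for l in range(L)]
```

Running the sketch prints per-layer discrepancies near numerical zero, consistent with the abstract's statement that the inference dynamics converge rapidly to exact backprop gradients while every update depends only on locally connected nodes.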

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Machine Learning
  • Neural Networks, Computer*
  • Neurons*