arXiv:2006.06687

On the asymptotics of wide networks with polynomial activations

Published on Jun 11, 2020

Abstract

We consider an existing conjecture addressing the asymptotic behavior of neural networks in the large width limit. The results that follow from this conjecture include tight bounds on the behavior of wide networks during stochastic gradient descent, and a derivation of their finite-width dynamics. We prove the conjecture for deep networks with polynomial activation functions, greatly extending the validity of these results. Finally, we point out a difference in the asymptotic behavior of networks with analytic (and non-linear) activation functions and those with piecewise-linear activations such as ReLU.

AI-generated summary

The conjecture on neural network behavior in the large width limit is proven for deep networks with polynomial activations, and differences in asymptotic behavior between analytic and piecewise-linear activations are highlighted.
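
To make the abstract's claims concrete, here is a minimal numerical sketch (not code from the paper) of the kind of large-width behavior it refers to: for a one-hidden-layer network with the polynomial activation phi(u) = u^2 in the standard NTK parameterization, the empirical neural tangent kernel is expected to drift only at order 1/width over a fixed number of gradient-descent steps, one of the SGD-time bounds that follows from the conjecture. The architecture, widths, learning rate, and quadratic activation below are illustrative assumptions, not taken from the paper.

import numpy as np

rng = np.random.default_rng(0)

def empirical_ntk(W, v, X):
    # Empirical NTK of f(x) = v . phi(W x) / sqrt(n) with phi(u) = u^2:
    # K[a, b] = grad_theta f(x_a) . grad_theta f(x_b).
    n = W.shape[0]
    grads = []
    for x in X:
        h = W @ x                                               # pre-activations, (n,)
        df_dv = h ** 2 / np.sqrt(n)                             # gradient wrt v, (n,)
        df_dW = (2 * v * h / np.sqrt(n))[:, None] * x[None, :]  # gradient wrt W, (n, d)
        grads.append(np.concatenate([df_dW.ravel(), df_dv]))
    G = np.stack(grads)
    return G @ G.T

d = 4
X = rng.normal(size=(3, d))
X /= np.linalg.norm(X, axis=1, keepdims=True)   # three unit-norm inputs
y = rng.normal(size=3)                          # arbitrary regression targets
lr, steps = 0.02, 50

for n in (64, 256, 1024, 4096):
    W = rng.normal(size=(n, d))                 # O(1) weights; 1/sqrt(n) output scaling
    v = rng.normal(size=n)
    K0 = empirical_ntk(W, v, X)
    for _ in range(steps):                      # full-batch gradient descent on squared loss
        h = W @ X.T                             # (n, 3)
        f = v @ h ** 2 / np.sqrt(n)             # network outputs, (3,)
        r = f - y                               # residuals
        gv = h ** 2 @ r / np.sqrt(n)
        gW = (2 * v[:, None] * h * r[None, :]) @ X / np.sqrt(n)
        v -= lr * gv
        W -= lr * gW
    K1 = empirical_ntk(W, v, X)
    print(f"width {n:5d}: relative NTK change "
          f"{np.linalg.norm(K1 - K0) / np.linalg.norm(K0):.4f}")

If the expected asymptotics hold, the printed relative change should shrink roughly fourfold each time the width quadruples, consistent with O(1/n) kernel drift for analytic activations; the abstract's point is that piecewise-linear activations such as ReLU can scale differently.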

