TY - JOUR
T1 - Selection dynamics for deep neural networks
AU - Liu, Hailiang
AU - Markowich, Peter A.
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: We are grateful to Michael Herty (RWTH) for his interest, which motivated us to investigate this problem and eventually led to this paper. Liu was partially supported by The National Science Foundation under Grant DMS1812666 and by NSF Grant RNMS (Ki-Net)1107291.
PY - 2020/9/21
Y1 - 2020/9/21
N2 - This paper presents a partial differential equation framework for deep residual neural networks and for the associated learning problem. This is done by carrying out the continuum limits of neural networks with respect to width and depth. We study the wellposedness, the large time solution behavior, and the characterization of the steady states of the forward problem. Several useful time-uniform estimates and stability/instability conditions are presented. We state and prove optimality conditions for the inverse deep learning problem, using standard variational calculus, the Hamilton-Jacobi-Bellmann equation and the Pontryagin maximum principle. This serves to establish a mathematical foundation for investigating the algorithmic and theoretical connections between neural networks, PDE theory, variational analysis, optimal control, and deep learning.
AB - This paper presents a partial differential equation framework for deep residual neural networks and for the associated learning problem. This is done by carrying out the continuum limits of neural networks with respect to width and depth. We study the wellposedness, the large time solution behavior, and the characterization of the steady states of the forward problem. Several useful time-uniform estimates and stability/instability conditions are presented. We state and prove optimality conditions for the inverse deep learning problem, using standard variational calculus, the Hamilton-Jacobi-Bellmann equation and the Pontryagin maximum principle. This serves to establish a mathematical foundation for investigating the algorithmic and theoretical connections between neural networks, PDE theory, variational analysis, optimal control, and deep learning.
UR - http://hdl.handle.net/10754/660838
UR - https://linkinghub.elsevier.com/retrieve/pii/S002203962030485X
UR - http://www.scopus.com/inward/record.url?scp=85091076010&partnerID=8YFLogxK
U2 - 10.1016/j.jde.2020.08.041
DO - 10.1016/j.jde.2020.08.041
M3 - Article
SN - 1090-2732
VL - 269
SP - 11540
EP - 11574
JO - Journal of Differential Equations
JF - Journal of Differential Equations
IS - 12
ER -