TY - JOUR
T1 - Smart Gradient - An Adaptive Technique for Improving Gradient Estimation
AU - Fattah, Esmail Abdul
AU - Niekerk, Janet Van
AU - Rue, Haavard
N1 - KAUST Repository Item: Exported on 2021-12-16
PY - 2021
Y1 - 2021
N2 - Computing the gradient of a function provides fundamental information about its behavior. This information is essential for several applications and algorithms across various fields. Common applications that require gradients are optimization techniques such as stochastic gradient descent, Newton's method and trust region methods. However, these methods usually require a numerical computation of the gradient at every iteration, which is prone to numerical errors. We propose a simple limited-memory technique for improving the accuracy of a numerically computed gradient in this gradient-based optimization framework by exploiting (1) a coordinate transformation of the gradient and (2) the history of previously taken descent directions. The method is verified empirically by extensive experimentation on both test functions and real data applications. The proposed method is implemented in the R package smartGrad and in C++.
AB - Computing the gradient of a function provides fundamental information about its behavior. This information is essential for several applications and algorithms across various fields. Common applications that require gradients are optimization techniques such as stochastic gradient descent, Newton's method and trust region methods. However, these methods usually require a numerical computation of the gradient at every iteration, which is prone to numerical errors. We propose a simple limited-memory technique for improving the accuracy of a numerically computed gradient in this gradient-based optimization framework by exploiting (1) a coordinate transformation of the gradient and (2) the history of previously taken descent directions. The method is verified empirically by extensive experimentation on both test functions and real data applications. The proposed method is implemented in the R package smartGrad and in C++.
UR - http://hdl.handle.net/10754/669609
UR - https://arxiv.org/pdf/2106.07313.pdf
M3 - Article
JO - Accepted in Foundations of Data Science
JF - Accepted in Foundations of Data Science
ER -