Learning to learn by gradient descent by gradient descent

Published: October 30, 2019

Can we move from hand-designed optimization algorithms to learned ones?

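No free lunch: no single optimizer is best on every problem, so a learned optimizer should specialize in a specified class of problems.
Rethinking generalization: here it means performing well on new problems from that class.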

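Optimizer and optimizee: the optimizer (parameters phi) is what we learn; the optimizee (parameters theta) is the model it optimizes.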

L(phi) = E_f[f(theta(f, phi))]
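- A good phi should maximize theta’s performance on the target function f, i.e. minimize the expected loss L(phi).
- Sample functions f to estimate the expectation and update phi by gradient descent.
Refer to the code for details.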
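
The two bullets above can be sketched in a few lines of plain Python. This is a toy illustration under loud assumptions, not the paper's setup: the optimizer is reduced to a single learned step size phi rather than a recurrent network, the functions f are hypothetical random 1-D quadratics, and all names (`sample_f`, `unrolled_loss_and_grad`, `meta_train`, `expected_loss`) are made up for this note. The derivative of the final loss with respect to phi is obtained by carrying d(theta)/d(phi) forward through the unrolled updates.

```python
import random


def sample_f(rng):
    """Sample a hypothetical target f(theta) = 0.5 * a * (theta - c)^2."""
    a = rng.uniform(0.5, 1.5)   # curvature
    c = rng.uniform(-1.0, 1.0)  # location of the minimum
    return a, c


def unrolled_loss_and_grad(phi, a, c, theta0=0.0, steps=5):
    """Run the optimizee for `steps` updates theta <- theta - phi * f'(theta).

    Returns f(theta_T) and d f(theta_T) / d phi, where the derivative is
    propagated forward through every unrolled update.
    """
    theta, dtheta = theta0, 0.0          # dtheta tracks d(theta)/d(phi)
    for _ in range(steps):
        grad = a * (theta - c)           # f'(theta)
        dgrad = a * dtheta               # d f'(theta) / d phi
        theta, dtheta = theta - phi * grad, dtheta - grad - phi * dgrad
    loss = 0.5 * a * (theta - c) ** 2
    dloss = a * (theta - c) * dtheta     # chain rule through theta_T
    return loss, dloss


def meta_train(phi=0.1, meta_lr=0.05, iters=500, batch=8, seed=0):
    """Update phi by gradient descent on a sampled estimate of L(phi)."""
    rng = random.Random(seed)
    for _ in range(iters):
        meta_grad = 0.0
        for _ in range(batch):
            a, c = sample_f(rng)         # sample f
            _, dloss = unrolled_loss_and_grad(phi, a, c)
            meta_grad += dloss / batch
        phi -= meta_lr * meta_grad       # gradient step on the optimizer
    return phi


def expected_loss(phi, n=1000, seed=1):
    """Monte Carlo estimate of L(phi) = E_f[f(theta(f, phi))]."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n):
        a, c = sample_f(rng)
        total += unrolled_loss_and_grad(phi, a, c)[0]
    return total / n
```

Calling `meta_train()` and comparing `expected_loss` at the returned phi against `expected_loss(0.1)` shows whether the learned step size beats the initial one on this problem class. A learned update rule with more parameters would follow the same loop, with phi as a vector and an autodiff library in place of the hand-written derivative.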
