Do you want BuboFlash to help you learning these things? Or do you want to add or correct something? Click here to log in or create user.



Trajectory Analysis: Implicit Regularization Towards Low Rank

We are interested in understanding what end-to-end matrix W W emerges when we run GD on an LNN to minimize a general convex loss L ( W ) L(W) , and in particular the matrix completion loss given above. Note that L ( W ) L(W) is convex, but the objective obtained by over-parameterizing with an LNN is not. We analyze the trajectories of W W , and specifically the dynamics of its singular value decomposition. Denote the singular values by { σ r } r \{ \sigma_r \}_r , and the corresponding left and right singular vectors by { u r } r \{ \mathbf{u}_r \}_r and { v r } r \{ \mathbf{v}_r \}_r respectively.

If you want to change selection, open document below and click on "Move attachment"


Summary

statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

Details



Discussion

Do you want to join discussion? Click here to log in or create user.