Adagrad Preconditioning. First, we develop a unified convergence analysis of SGD with adap