Goto

Collaborating Authors


eda9523faa5e7191aee1c2eaff669716-Supplemental-Conference.pdf

Neural Information Processing Systems

Though promising results have been reported on some RL application domains, policies learned with such representations usually fail to generalize well in a complex environment because minimizing a reconstruction loss may potentially introduce local (visual) features with task-irrelevant information.


eda9523faa5e7191aee1c2eaff669716-Paper-Conference.pdf

Neural Information Processing Systems

Though promising results have been reported on some RL application domains, policies learned with such representations usually fail to generalize well in a complex environment because minimizing a reconstruction loss may potentially introduce local (visual) features with task-irrelevant information.


Direct Runge-Kutta Discretization Achieves Acceleration

Jingzhao Zhang, Aryan Mokhtari, Suvrit Sra, Ali Jadbabaie

Neural Information Processing Systems

We study gradient-based optimization methods obtained by directly discretizing a second-order ordinary differential equation (ODE) related to the continuous limit of Nesterov's accelerated gradient method.



7eacb532570ff6858afd2723755ff790-AuthorFeedback.pdf

Neural Information Processing Systems

We also calculate the optimal solution to verify the approximation ratio. See Table 1 in our submission for definitions. It is indeed an interesting problem to generalize our techniques to other problems, e.g., classification problems and40 non-linearregressionproblems.




7e9e346dc5fd268b49bf418523af8679-AuthorFeedback.pdf

Neural Information Processing Systems

Commentsonpresentation: Thank you forthehelpful suggestions. We focus on Mercer kernel with formkθ(zt, z) = qθ(z|t z) = h|t h. Even so, we obtain SOTA results for recurrent models on all document classification tasks, with the19 exceptionofAGNews,forwhichwe'recompetitive.