Goto

Collaborating Authors

 Government







Variational Policy Gradient Method for Reinforcement Learning with General Utilities

Neural Information Processing Systems

In recent years, reinforcement learning (RL) systems with general goals beyond a cumulative sum of rewards have gained traction, such as in constrained problems, exploration, and acting upon prior experiences. In this paper, we consider policy optimization in Markov Decision Problems, where the objective is a general concave utility function of the state-action occupancy measure, which subsumes several of the aforementioned examples as special cases.


Washington Post CEO steps down amid onslaught of backlash following mass layoffs

FOX News

Washington Post CEO and publisher Will Lewis has stepped down amid growing backlash over his handling of the paper's mass layoffs as chief financial officer Jeff Dโ€™Onofrio is been tapped to take over.


Washington Post CEO steps downs amid onslaught of backlash following mass layoffs

FOX News

Washington Post CEO and publisher Will Lewis has stepped down amid growing backlash over his handling of the paper's mass layoffs as chief financial officer Jeff Dโ€™Onofrio is been tapped to take over.


Kraken: InherentlyParallelTransformersFor EfficientMulti-DeviceInference

Neural Information Processing Systems

Large Transformer networks are increasingly used in settings where low inference latency is necessary to enable new applications and improve the end-user experience.