A Class of Parallel Doubly Stochastic Algorithms for Large-Scale Learning