SGD: The Role of Implicit Regularization, Batch-size and Multiple-epochs