ConstrainedOptimizationtoTrainNeuralNetworks onCriticaland Under-RepresentedClasses

Neural Information Processing Systems 

Asaconsequence, removing theerror P would reduce theloss more than removing the error N. Moreover, it is clear that this difference in error weighing increases withthelevelofimbalance between theclasses.