A two-head loss function for deep Average-K classification