The power of absolute discounting: all-dimensional distribution estimation
Falahatgar, Moein, Ohannessian, Mesrob I., Orlitsky, Alon, Pichapati, Venkatadheeraj
–Neural Information Processing Systems
Categorical models are a natural fit for many problems. When learning the distribution of categories from samples, high-dimensionality may dilute the data. Minimax optimality is too pessimistic to remedy this issue. A serendipitously discovered estimator, absolute discounting, corrects empirical frequencies by subtracting a constant from observed categories, which it then redistributes among the unobserved. It outperforms classical estimators empirically, and has been used extensively in natural language modeling.
Neural Information Processing Systems
Feb-14-2020, 19:12:58 GMT
- Technology: