multi-field categorical data
- Europe > Switzerland (0.04)
- Asia > Thailand (0.04)
- Oceania > Australia (0.04)
- (2 more...)
Field-wise Learning for Multi-field Categorical Data
We propose a new method for learning with multi-field categorical data. Multi-field categorical data are usually collected over many heterogeneous groups. These groups can reflect in the categories under a field. The existing methods try to learn a universal model that fits all data, which is challenging and inevitably results in learning a complex model. In contrast, we propose a field-wise learning method leveraging the natural structure of data to learn simple yet efficient one-to-one field-focused models with appropriate constraints.
- Europe > Switzerland (0.04)
- Asia > Thailand (0.04)
- Oceania > Australia (0.04)
- (2 more...)
Review for NeurIPS paper: Field-wise Learning for Multi-field Categorical Data
Summary and Contributions: The authors present an approach for modelling categorical variables. Each categorical column in a table is termed'field' by the authors. The main idea appears to be based on splitting the regularisation term for each'field'. The authors present a thorough derivation of their method. A linear and a nonlinear model are developed.
Field-wise Learning for Multi-field Categorical Data
We propose a new method for learning with multi-field categorical data. Multi-field categorical data are usually collected over many heterogeneous groups. These groups can reflect in the categories under a field. The existing methods try to learn a universal model that fits all data, which is challenging and inevitably results in learning a complex model. In contrast, we propose a field-wise learning method leveraging the natural structure of data to learn simple yet efficient one-to-one field-focused models with appropriate constraints.
Field-weighted Factorization Machines for Click-Through Rate Prediction in Display Advertising
Pan, Junwei, Xu, Jian, Ruiz, Alfonso Lobos, Zhao, Wenliang, Pan, Shengjun, Sun, Yu, Lu, Quan
Click-through rate (CTR) prediction is a critical task in online display advertising. The data involved in CTR prediction are typically multi-field categorical data, i.e., every feature is categorical and belongs to one and only one field. One of the interesting characteristics of such data is that features from one field often interact differently with features from different other fields. Recently, Field-aware Factorization Machines (FFMs) have been among the best performing models for CTR prediction by explicitly modeling such difference. However, the number of parameters in FFMs is in the order of feature number times field number, which is unacceptable in the real-world production systems. In this paper, we propose Field-weighted Factorization Machines (FwFMs) to model the different feature interactions between different fields in a much more memory-efficient way. Our experimental evaluations show that FwFMs can achieve competitive prediction performance with only as few as 4% parameters of FFMs. When using the same number of parameters, FwFMs can bring 0.92% and 0.47% AUC lift over FFMs on two real CTR prediction data sets.
- Europe > France > Auvergne-Rhône-Alpes > Lyon > Lyon (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- Marketing (1.00)
- Information Technology > Services (1.00)