ACE: A Cardinality Estimator for Set-Valued Queries
Sheng, Yufan, Cao, Xin, Zhao, Kaiqi, Fang, Yixiang, Qi, Jianzhong, Zhang, Wenjie, Jensen, Christian S.
–arXiv.org Artificial Intelligence
Cardinality estimation is a fundamental functionality in database systems. Most existing cardinality estimators focus on handling predicates over numeric or categorical data. They have largely omitted an important data type, set-valued data, which frequently occur in contemporary applications such as information retrieval and recommender systems. The few existing estimators for such data either favor high-frequency elements or rely on a partial independence assumption, which limits their practical applicability. We propose ACE, an Attention-based Cardinality Estimator for estimating the cardinality of queries over set-valued data. We first design a distillation-based data encoder to condense the dataset into a compact matrix. We then design an attention-based query analyzer to capture correlations among query elements. To handle variable-sized queries, a pooling module is introduced, followed by a regression model (MLP) to generate final cardinality estimates. We evaluate ACE on three datasets with varying query element distributions, demonstrating that ACE outperforms the state-of-the-art competitors in terms of both accuracy and efficiency.
arXiv.org Artificial Intelligence
Mar-19-2025
- Country:
- Asia > China (0.28)
- Europe (0.46)
- North America > United States (0.28)
- Oceania > Australia (0.28)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Information Technology (0.46)
- Technology: