CAMAL: Optimizing LSM-trees via Active Learning
Yu, Weiping, Luo, Siqiang, Yu, Zihao, Cong, Gao
–arXiv.org Artificial Intelligence
We use machine learning to optimize LSM-tree structure, aiming to reduce the cost of processing various read/write operations. We introduce a new approach Camal, which boasts the following features: (1) ML-Aided: Camal is the first attempt to apply active learning to tune LSM-tree based key-value stores. The learning process is coupled with traditional cost models to improve the training process; (2) Decoupled Active Learning: backed by rigorous analysis, Camal adopts active learning paradigm based on a decoupled tuning of each parameter, which further accelerates the learning process; (3) Easy Extrapolation: Camal adopts an effective mechanism to incrementally update the model with the growth of the data size; (4) Dynamic Mode: Camal is able to tune LSM-tree online under dynamically changing workloads; (5) Significant System Improvement: By integrating Camal into a full system RocksDB, the system performance improves by 28% on average and up to 8x compared to a state-of-the-art RocksDB design.
arXiv.org Artificial Intelligence
Sep-23-2024
- Country:
- Africa
- Mali (0.04)
- South Sudan > Equatoria
- Central Equatoria > Juba (0.04)
- Asia
- Middle East > Jordan (0.04)
- Singapore (0.05)
- Europe > Austria
- Vienna (0.14)
- North America > United States
- Massachusetts > Hampshire County > Amherst (0.04)
- Africa
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology (0.67)
- Technology: