A Communication-Efficient Parallel Algorithm for Decision Tree

Mar-12-2024, 07:47:10 GMT–Neural Information Processing Systems

Decision tree (and its extensions such as Gradient Boosting Decision Trees and Random Forest) is a widely used machine learning algorithm, due to its practical effectiveness and model interpretability. With the emergence of big data, there is an increasing need to parallelize the training process of decision tree. However, most existing attempts along this line suffer from high communication costs. In this paper, we propose a new algorithm, called Parallel Voting Decision Tree (PV-Tree), to tackle this challenge. After partitioning the training data onto a number of (e.g., M) machines, this algorithm performs both local voting and global voting in each iteration.

algorithm, pv-tree, training data, (17 more...)

Neural Information Processing Systems

Mar-12-2024, 07:47:10 GMT

Conferences PDF

Add feedback

Country:
- Asia (0.04)
- Oceania > Australia
  - New South Wales (0.04)
- North America > Canada
  - Ontario > Kingston (0.04)
- Europe > Spain
  - Catalonia > Barcelona Province > Barcelona (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)