ActiveDP: Bridging Active Learning and Data Programming
–arXiv.org Artificial Intelligence
Modern machine learning models require large labelled datasets to achieve good performance, but manually labelling large datasets is expensive and time-consuming. The data programming paradigm enables users to label large datasets efficiently but produces noisy labels, which deteriorates the downstream model's performance. The active learning paradigm, on the other hand, can acquire accurate labels but only for a small fraction of instances. In this paper, we propose ActiveDP, an interactive framework bridging active learning and data programming together to generate labels with both high accuracy and coverage, combining the strengths of both paradigms. Experiments show that ActiveDP outperforms previous weak supervision and active learning approaches and consistently performs well under different labelling budgets.
arXiv.org Artificial Intelligence
Feb-8-2024
- Country:
- Asia
- China > Jiangsu Province
- Nanjing (0.04)
- Middle East > Palestine
- Gaza Strip > Rafah Governorate > Rafah (0.04)
- China > Jiangsu Province
- Europe > Slovenia
- Drava > Municipality of Benedikt > Benedikt (0.04)
- North America
- Canada > Ontario
- Toronto (0.47)
- United States > New York
- New York County > New York City (0.04)
- Canada > Ontario
- Asia
- Genre:
- Research Report (0.82)
- Technology: