Data Collection and Labeling Techniques for Machine Learning

Jun-19-2024–arXiv.org Artificial Intelligence

This remarkable advancement can be attributed to two key factors: the exponential rise in computational power and the ever-increasing availability of vast datasets [1-3]. However, the very foundation upon which this progress rests-data collection and labeling-presents significant challenges that can hinder the efficacy and ethical implementation of ML models[4-8]. This review paper delves into the intricate world of data collection and labeling for machine learning, drawing upon insights from both the data management and machine learning communities. The transformative potential of machine learning is evident across a multitude of domains. From revolutionizing healthcare with disease diagnosis and personalized medicine[9] to powering selfdriving cars[10] and optimizing logistics in supply chains[11], ML algorithms are rapidly reshaping our world. At the heart of these advancements lies the ability of ML models to learn from data, identify patterns, and make predictions based on the information they have been exposed to. The quality and quantity of data used to train these models are paramount to their success. High-quality, diverse, and well-labeled data are essential for building robust and generalizable ML models that can perform effectively in real-world scenarios [12, 13].

dataset, learning, proceedings, (15 more...)

arXiv.org Artificial Intelligence

Jun-19-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > New York County > New York City (0.05)
- Asia > Taiwan
  - Taiwan Province > Taipei (0.04)

Genre:
- Overview (1.00)
- Research Report (0.82)

Industry:
- Health & Medicine (1.00)
- Information Technology > Security & Privacy (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Information Retrieval (0.93)
  - Representation & Reasoning > Information Fusion (0.69)
  - Machine Learning > Neural Networks (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found