Evaluating Blocking Biases in Entity Matching

Moslemi, Mohammad Hossein, Balamurugan, Harini, Milani, Mostafa

Sep-24-2024–arXiv.org Artificial Intelligence

Entity Matching (EM) is crucial for identifying equivalent data entities across different sources, a task that becomes increasingly challenging with the growth and heterogeneity of data. Blocking techniques, which reduce the computational complexity of EM, play a vital role in making this process scalable. Despite advancements in blocking methods, the issue of fairness; where blocking may inadvertently favor certain demographic groups; has been largely overlooked. This study extends traditional blocking metrics to incorporate fairness, providing a framework for assessing bias in blocking techniques. Through experimental analysis, we evaluate the effectiveness and fairness of various blocking methods, offering insights into their potential biases. Our findings highlight the importance of considering fairness in EM, particularly in the blocking phase, to ensure equitable outcomes in data integration tasks.

data mining, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Sep-24-2024

arXiv.org PDF

Add feedback

Country:
- North America > Canada > Ontario > Middlesex County > London (0.14)

Genre:
- Research Report > New Finding (0.48)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning
      - Neural Networks > Deep Learning (0.69)
      - Performance Analysis > Accuracy (0.46)
      - Statistical Learning (0.93)
    - Natural Language (0.94)
    - Representation & Reasoning (1.00)
  - Data Science > Data Mining (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found