Goto

Collaborating Authors

 Puri, Colin A.


Accelerating the Discovery of Data Quality Rules: A Case Study

AAAI Conferences

Poor quality data is a growing and costly problem that affects many enterprises across all aspects of their business ranging from operational efficiency to revenue protection. In this paper, we present an application -- Data Quality Rules Accelerator (DQRA) -- that accelerates Data Quality (DQ) efforts (e.g. data profiling and cleansing) by automatically discovering DQ rules for detecting inconsistencies in data. We then present two evaluations. The first evaluation compares DQRA to existing solutions; and shows that DQRA either outperformed or achieved performance comparable with these solutions on metrics such as precision, recall, and runtime. The second evaluation is a case study where DQRA was piloted at a large utilities company to improve data quality as part of a legacy migration effort. DQRA was able to discover rules that detected data inconsistencies directly impacting revenue and operational efficiency. Moreover, DQRA was able to significantly reduce the amount of effort required to develop these rules compared to the state of the practice. Finally, we describe ongoing efforts to deploy DQRA.


A Tool for Measuring the Reality of Technology Trends of Interest

AAAI Conferences

In this paper, we present a prototype application — the Technology Trend Tracker — to measure the reality of technology trends of interest using information on the Web to inform decisions such as when to develop training, when to invest in expertise, and more. This prototype performs this task by integrating several artificial intelligence technologies in an innovative way. These technologies include rich semantic representations, a natural language understanding module, and a flexible semantic matcher. We use our system to augment Accenture's annual technology vision survey and show how our system performs well on measuring the reality of technology trends from this survey. We also show why our system performs well through an ablation study.