Causal Analysis in Theory and Practice » Data versus Science: Contesting the Soul of Data-Science


Summary

The post below is written for the upcoming Spanish translation of The Book of Why, which was announced today. It expresses my firm belief that the current data-fitting direction taken by "Data Science" is temporary (read my lips!), that the future of "Data Science" lies in causal data interpretation, and that we should prepare ourselves for the backlash swing.

Data versus Science: Contesting the Soul of Data-Science

Much has been said about how ill-prepared our health-care system was to cope with catastrophic outbreaks like COVID-19. Yet viewed from the corner of my expertise, the ill-preparedness can also be seen as a failure of information technology to keep track of and interpret the outpouring of data that has arrived from multiple and conflicting sources, corrupted by noise and omission, some by sloppy collection and some by deliberate misreporting. AI could and should have equipped society with intelligent data-fusion technology to interpret such conflicting pieces of information and reason its way out of the confusion.

Speaking from the perspective of causal inference research, I have been part of a team that has developed a complete theoretical underpinning for such "data-fusion" problems, a development that is briefly described in Chapter 10 of The Book of Why.