In this tutorial, you will learn how to automatically detect COVID-19 in a hand-created X-ray image dataset using Keras, TensorFlow, and Deep Learning. Like most people in the world right now, I'm genuinely concerned about COVID-19. I find myself constantly analyzing my personal health and wondering if/when I will contract it. At first, I didn't think much of it -- I have pollen allergies and due to the warm weather on the eastern coast of the United States, spring has come early this year. My allergies were likely just acting up. But my symptoms didn't improve throughout the day. I'm actually sitting here, writing the this tutorial, with a thermometer in my mouth; and glancing down I see that it reads 99.4 Fahrenheit.
Yeah, so the entire project is a coordinated effort by the White House Office of Science and Technology Policy. I think some time in early March a group at Georgetown, the Center for Security in Emerging Technology (CSET) reached out to us at Allen AI to help coordinate the release of this dataset, along with a couple of different organizations. You mentioned MSR (Microsoft Research), Chan Zuckerberg, Kaggle was also involved, and the National Library of Medicine, which is part of the NIH. So all these groups - we're going to come together to essentially create this dataset to help create text mining and information retrieval tools that could assist medical experts in understanding more of what was going on with the epidemic. For Allen AI, the way that we got involved is we had recently created a new pipeline to revamp our open research corpus.
There seems to be a tendency to hastily use imperfect and questionable data to train an AI solution for COVID-19, a dangerous trend that not only does not help any patient or physician but also damages the reputation of the AI community. Dealing with a pandemic -- as significant as it is -- does not suspend basic scientific principles. Data has to be curated by medical experts, full and rigorous validations have to be performed, and results have to be reviewed by peers before we deploy any solution or even proposal into the world, particularly when society is dealing with many uncertainties. It is safe to say we are all deeply concerned about the COVID-19 pandemic. This coronavirus has drastically changed our reality: We're experiencing stress, restrictions, quarantines; we're witnessing heroic sacrifices of caregivers including staff, nurses, and physicians; we're losing loved ones; and we're facing economic hardships and massive uncertainties about what is in store in the coming months.
Since the corona erupted into our world, research institutes and governments have released many databases publicly to allow research groups (and independent individuals) to analyze the data around the corona's spread. These databases are scattered under numerous initiatives and sources. The purpose of this blog is to organize all the major open databases and data initiatives around the world. Feel free to add it in the comments or through this form. In response to the COVID-19 pandemic, the White House and a coalition of leading research groups have prepared the COVID-19 Open Research Dataset (CORD-19).