Distributed Record Linkage in Healthcare Data with Apache Spark