Goto

Collaborating Authors

 tdm researcher


Text and Data Mining of In-Copyright Works

Communications of the ACM

Text and data mining (TDM) uses statistical analysis tools to extract new knowledge from large quantities of text or data for purposes by finding patterns, discovering relationships, and analyzing semantics. It is used in a wide variety of fields from biomedical research to digital humanities. Japan has enacted laws to allow TDM research copying. These lawsuits grew out of the Google Book Search Project (GBS). GBS is a corpus of millions of digital books to improve its search technologies that Google developed after making a deal with the University of Michigan in 2004 to scan all eight million books in its library's collections. In return, Michigan got back from Google digital copies of the books it scanned.