Lessons from Libor: How to Apply Machine Learning for Document Digitization