Microsoft's Project Alexandria parses documents using unsupervised learning