Constructing Reference Sets from Unstructured, Ungrammatical Text