Mining Word Boundaries in Speech as Naturally Annotated Word Segmentation Data