Distributed representation of multi-sense words: A loss-driven approach