Learning Representations for Hierarchies with Minimal Support

Neural Information Processing Systems 

For very large digraphs, however, this means many (most) entries may be unobserved during training.