Data-driven Methods of Extracting Text Structure and Information Transfer