Chronotome: Real-Time Topic Modeling for Streaming Embedding Spaces

Lim, Matte, Yeh, Catherine, Wattenberg, Martin, Viégas, Fernanda, Michalatos, Panagiotis

arXiv.org Artificial Intelligence 

Harvard University Figure 1: T o visualize how topics evolve in real time, we create a rotatable embedding space where time is encoded along the Z-axis. We provide three preset views to help users explore topic clusters from different perspectives: (A) Front View (overall clusters), (B) Iso View (clusters over time), and (C) Side View (clusters over time). Here, each point represents an image from a dataset of Picasso's paintings, batched into 5-year intervals. Many real-world datasets - from an artist's body of work to a person's social media history - exhibit meaningful semantic changes over time that are difficult to capture with existing dimensionality reduction methods. To address this gap, we introduce a visualization technique that combines force-based projection and streaming clustering methods to build a spatial-temporal map of embeddings. We demonstrate the utility of our approach through use cases on text and image data, showing how it offers a new lens for understanding the aesthetics and semantics of temporal datasets.