Retrieving Time-Series Differences Using Natural Language Queries

Dohi, Kota, Nishida, Tomoya, Purohit, Harsh, Endo, Takashi, Kawaguchi, Yohei

arXiv.org Artificial Intelligence 

--Effectively searching time-series data is essential for system analysis; however, traditional methods often require domain expertise to define search criteria. Recent advancements have enabled natural language-based search, but these methods struggle to handle differences between time-series data. T o address this limitation, we propose a natural language query-based approach for retrieving pairs of time-series data based on differences specified in the query. Specifically, we define six key characteristics of differences, construct a corresponding dataset, and develop a contrastive learning-based model to align differences between time-series data with query texts. Experimental results demonstrate that our model achieves an overall mAP score of 0.994 in retrieving time-series pairs. The state of any system can be represented as time-series data consisting of one or multiple channels.