Pattern Recognition
- Africa > Cameroon > Far North Region > Maroua (0.04)
- Asia > Japan (0.04)
- Asia > China > Guangdong Province > Guangzhou (0.04)
EV-Eye: Rethinking High-frequency Eye Tracking through the Lenses of Event Cameras
In this paper, we present EV-Eye, a first-of-its-kind large-scale multimodal eye tracking dataset aimed at inspiring research on high-frequency eye/gaze tracking. EV -Eye utilizes the emerging bio-inspired event camera to capture independent pixel-level intensity changes induced by eye movements, achieving sub-microsecond latency.
- North America > United States (0.15)
- Asia > China (0.04)
- Europe > Netherlands > South Holland > Delft (0.04)
- North America > United States (0.14)
- Europe > Greece (0.04)
- Asia > Middle East > Jordan (0.04)
- Information Technology (0.46)
- Banking & Finance (0.45)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
- (2 more...)
- North America > United States > Virginia (0.04)
- Europe > France (0.04)
- Oceania > Australia (0.04)
- (3 more...)
- Health & Medicine > Therapeutic Area > Neurology (0.46)
- Education > Curriculum > Subject-Specific Education (0.41)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- Asia > China > Beijing > Beijing (0.04)
- North America > United States (0.04)
- North America > United States > North Carolina (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Asia > Middle East > Iran (0.04)
- (6 more...)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.83)
- North America > United States > New York (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
- Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.42)
- Asia > South Korea > Daejeon > Daejeon (0.05)
- North America > Canada > Quebec > Montreal (0.04)
- Information Technology > Data Science > Data Mining (0.94)
- Information Technology > Artificial Intelligence > Natural Language (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.69)
- (4 more...)
SoftMatcha 2: A Fast and Soft Pattern Matcher for Trillion-Scale Corpora
Yoneda, Masataka, Matsushita, Yusuke, Kamoda, Go, Suenaga, Kohei, Akiba, Takuya, Waga, Masaki, Yokoi, Sho
We present an ultra-fast and flexible search algorithm that enables search over trillion-scale natural language corpora in under 0.3 seconds while handling semantic variations (substitution, insertion, and deletion). Our approach employs string matching based on suffix arrays that scales well with corpus size. To mitigate the combinatorial explosion induced by the semantic relaxation of queries, our method is built on two key algorithmic ideas: fast exact lookup enabled by a disk-aware design, and dynamic corpus-aware pruning. We theoretically show that the proposed method suppresses exponential growth in the search space with respect to query length by leveraging statistical properties of natural language. In experiments on FineWeb-Edu (Lozhkov et al., 2024) (1.4T tokens), we show that our method achieves significantly lower search latency than existing methods: infini-gram (Liu et al., 2024), infini-gram mini (Xu et al., 2025), and SoftMatcha (Deguchi et al., 2025). As a practical application, we demonstrate that our method identifies benchmark contamination in training corpora, unidentified by existing approaches. We also provide an online demo of fast, soft search across corpora in seven languages.
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Leisure & Entertainment > Sports > Olympic Games (0.95)
- Health & Medicine > Therapeutic Area > Immunology (0.92)