AITopics | locator

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion

Ma, George, Koul, Anurag, Chen, Qi, Wu, Yawen, Kuhar, Sachit, Yu, Yu, Sengupta, Aritra, Kumar, Varun, Ramanathan, Murali Krishna

arXiv.org Artificial Intelligence Oct-22-2025

Large Language Models (LLMs) excel at code-related tasks but often struggle in realistic software repositories, where project-specific APIs and cross-file dependencies are crucial. Retrieval-augmented methods mitigate this by injecting repository context at inference time. However, the tight inference-time latency budget forces a trade-off: either retrieval quality suffers, or the added latency degrades the user experience. We address this limitation with SpecAgent, an agent that improves both latency and code-generation quality by proactively exploring repository files during indexing and constructing speculative context that anticipates future edits in each file. This indexing-time asynchrony allows thorough context computation, masking latency, and the speculative nature of the context improves code-generation quality. Additionally, we identify the problem of future context leakage in existing benchmarks, which can inflate reported performance. To address this, we construct a synthetic, leakage-free benchmark that enables a more realistic evaluation of our agent against baselines. Experiments show that SpecAgent consistently achieves absolute gains of 9-11% (48-58% relative) compared to the best-performing baselines, while significantly reducing inference latency.

large language model, machine learning, target function, (18 more...)

arXiv.org Artificial Intelligence

2510.17925

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Generalist Scanner Meets Specialist Locator: A Synergistic Coarse-to-Fine Framework for Robust GUI Grounding

Li, Zhecheng, Song, Guoxian, Wang, Yiwei, Xiong, Zhen, Yuan, Junsong, Cai, Yujun

arXiv.org Artificial Intelligence Sep-30-2025

Grounding natural language queries in graphical user interfaces (GUIs) presents a challenging task that requires models to comprehend diverse UI elements across various applications and systems, while also accurately predicting the spatial coordinates for the intended operation. To tackle this problem, we propose GMS: Generalist Scanner Meets Specialist Locator, a synergistic coarse-to-fine framework that effectively improves GUI grounding performance. GMS leverages the complementary strengths of general vision-language models (VLMs) and small, task-specific GUI grounding models by assigning them distinct roles within the framework. Specifically, the general VLM acts as a 'Scanner' to identify potential regions of interest, while the fine-tuned grounding model serves as a 'Locator' that outputs precise coordinates within these regions. This design is inspired by how humans perform GUI grounding, where the eyes scan the interface and the brain focuses on interpretation and localization. Our whole framework consists of five stages and incorporates hierarchical search with cross-modal communication to achieve promising prediction results. Experimental results on the ScreenSpot-Pro dataset show that while the 'Scanner' and 'Locator' models achieve only $2.0\%$ and $3.7\%$ accuracy respectively when used independently, their integration within GMS framework yields an overall accuracy of $35.7\%$, representing a $10 \times$ improvement. Additionally, GMS significantly outperforms other strong baselines under various settings, demonstrating its robustness and potential for general-purpose GUI grounding.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2509.24133

Country: North America > United States > California (0.46)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Motion-Aware Optical Camera Communication with Event Cameras

Su, Hang, Gao, Ling, Liu, Tao, Kneip, Laurent

arXiv.org Artificial Intelligence Dec-1-2024

As the ubiquity of smart mobile devices continues to rise, Optical Camera Communication systems have gained more attention as a solution for efficient and private data streaming. This system utilizes optical cameras to receive data from digital screens via visible light. Despite their promise, most of them are hindered by dynamic factors such as screen refreshing and rapid camera motion. CMOS cameras, often serving as the receivers, suffer from limited frame rates and motion-induced image blur, which degrade overall performance. To address these challenges, this paper unveils a novel system that utilizes event cameras. We introduce a dynamic visual marker and design event-based tracking algorithms to achieve fast localization and data streaming. Remarkably, the event camera's unique capabilities mitigate issues related to screen refresh rates and camera motion, enabling a high throughput of up to 114 Kbps in static conditions, and a 1 cm localization accuracy with 1% bit error rate under various camera motions.

artificial intelligence, event camera, machine learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/LRA.2024.3517292

2412.00816

Country:

Europe > Spain > Galicia > Madrid (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report > Promising Solution (0.34)

Industry:

Media > Photography (0.89)
Media > Television (0.75)
Media > Film (0.75)
Information Technology > Security & Privacy (0.66)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Data Science (0.94)
Information Technology > Communications (0.88)
(2 more...)

EEGUnity: Open-Source Tool in Facilitating Unified EEG Datasets Towards Large-Scale EEG Model

Qin, Chengxuan, Yang, Rui, You, Wenlong, Chen, Zhige, Zhu, Longsheng, Huang, Mengjie, Wang, Zidong

arXiv.org Artificial Intelligence Sep-24-2024

The increasing number of dispersed EEG dataset publications and the advancement of large-scale Electroencephalogram (EEG) models have increased the demand for practical tools to manage diverse EEG datasets. However, the inherent complexity of EEG data, characterized by variability in content data, metadata, and data formats, poses challenges for integrating multiple datasets and conducting large-scale EEG model research. To tackle the challenges, this paper introduces EEGUnity, an open-source tool that incorporates modules of 'EEG Parser', 'Correction', 'Batch Processing', and 'Large Language Model Boost'. Leveraging the functionality of such modules, EEGUnity facilitates the efficient management of multiple EEG datasets, such as intelligent data structure inference, data cleaning, and data unification. In addition, the capabilities of EEGUnity ensure high data quality and consistency, providing a reliable foundation for large-scale EEG data research. EEGUnity is evaluated across 25 EEG datasets from different sources, offering several typical batch processing workflows. The results demonstrate the high performance and flexibility of EEGUnity in parsing and data processing. The project code is publicly available at github.com/Baizhige/EEGUnity.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2410.07196

Country:

Europe > United Kingdom (0.14)
Asia > China > Shaanxi Province > Xi'an (0.05)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.34)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (0.68)

Technology:

Information Technology > Software (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Data Science > Data Quality > Data Cleaning (0.36)

Investigating Consistency in Query-Based Meeting Summarization: A Comparative Study of Different Embedding Methods

Jia-Chen, Chen, Senabre, Guillem, Caron, Allane

arXiv.org Artificial Intelligence Feb-10-2024

As more advanced data analysis techniques emerge, people expect them to be applied to more complex tasks and to solve problems in daily life. Text summarization is one of the best-known applications in the field of Natural Language Processing (NLP). It aims to automatically generate a summary containing the important information in a given context, which matters when dealing with piles of documents. Summarization techniques help capture key points quickly and make work more convenient. One applicable setting is meeting summarization, especially for important meetings that tend to be long, complicated, multi-topic, and multi-person. When people want to review specific content from such a meeting, finding the related spans in the transcript is hard and time-consuming. However, most previous work focuses on summarizing newsletters, scientific articles, etc., which have a clear document structure and an official format. For documents with complex structure, such as transcripts, we think those approaches are not well suited to meeting summarization. Besides, the consistency of summaries is another commonly discussed issue in the NLP field. To address the challenges of meeting summarization, we are inspired by "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization" proposed by Microsoft, and we propose our Locater model, designed to extract relevant spans given a transcript and a query, which are then summarized by a Summarizer model. Furthermore, we perform a comparative study, applying different word embedding techniques to improve summary consistency.

accessed, arxiv, summarization, (13 more...)

arXiv.org Artificial Intelligence

2402.06907

Country:

Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(3 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Benchmarking Automated Clinical Language Simplification: Dataset, Algorithm, and Evaluation

Luo, Junyu, Zheng, Zifei, Ye, Hanzhong, Ye, Muchao, Wang, Yaqing, You, Quanzeng, Xiao, Cao, Ma, Fenglong

arXiv.org Artificial Intelligence Sep-21-2023

Patients with low health literacy usually have difficulty understanding medical jargon and the complex structure of professional medical language. Although some studies are proposed to automatically translate expert language into layperson-understandable language, only a few of them focus on both accuracy and readability aspects simultaneously in the clinical domain. Thus, simplification of the clinical language is still a challenging task, but unfortunately, it is not yet fully addressed in previous work. To benchmark this task, we construct a new dataset named MedLane to support the development and evaluation of automated clinical language simplification approaches. Besides, we propose a new model called DECLARE that follows the human annotation procedure and achieves state-of-the-art performance compared with eight strong baselines. To fairly evaluate the performance, we also propose three specific evaluation metrics. Experimental results demonstrate the utility of the annotated MedLane dataset and the effectiveness of the proposed model DECLARE.

abbreviation, history, simplification, (17 more...)

arXiv.org Artificial Intelligence

2012.0242

Country:

North America > United States > Pennsylvania (0.04)
Europe > Switzerland > Basel-City > Basel (0.04)
Asia > China > Liaoning Province > Dalian (0.04)

Genre: Research Report > New Finding (0.34)

Industry:

Information Technology > Security & Privacy (0.67)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.47)
Health & Medicine > Health Care Technology > Medical Record (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Communications (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Nexus sine qua non: Essentially Connected Networks for Traffic Forecasting

Nie, Tong, Qin, Guoyang, Sun, Lijun, Wang, Yunpeng, Sun, Jian

arXiv.org Artificial Intelligence Aug-13-2023

Spatiotemporal graph neural networks (STGNNs) have emerged as a leading approach for learning representations and forecasting on traffic datasets with underlying topological and correlational structures. However, current STGNNs use intricate techniques with high complexities to capture these structures, making them difficult to understand and scale. The existence of simple yet efficient architectures remains an open question. Upon closer examination, we find what lies at the core of STGNN's representations are certain forms of spatiotemporal contextualization. In light of this, we design Nexus sine qua non (NexuSQN), an essentially connected network built on an efficient message-passing backbone. NexuSQN simply uses learnable "where" and "when" locators for the aforementioned contextualization and omits any intricate components such as RNNs, Transformers, and diffusion convolutions. Results show that NexuSQN outperforms intricately designed benchmarks in terms of size, computational efficiency, and accuracy. This suggests a promising future for developing simple yet efficient neural predictors.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2307.01482

Country:

North America > Canada > Quebec > Montreal (0.14)
North America > United States > California > San Francisco County > San Francisco (0.04)
Asia > China > Shanghai > Shanghai (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry: Transportation > Ground > Road (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Modeling & Simulation (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Deep Learning for Reference-Free Geolocation for Poplar Trees

John, Cai W., Queen, Owen, Muchero, Wellington, Emrich, Scott J.

arXiv.org Artificial Intelligence Jan-30-2023

A core task in precision agriculture is the identification of climatic and ecological conditions that are advantageous for a given crop. The most succinct approach is geolocation, which is concerned with locating the native region of a given sample based on its genetic makeup. Here, we investigate genomic geolocation of Populus trichocarpa, or poplar, which has been identified by the US Department of Energy as a fast-rotation biofuel crop to be harvested nationwide. In particular, we approach geolocation from a reference-free perspective, circumventing the need for compute-intensive processes such as variant calling and alignment. Our model, MashNet, predicts latitude and longitude for poplar trees from randomly-sampled, unaligned sequence fragments. We show that our model performs comparably to Locator, a state-of-the-art method based on aligned whole-genome sequence data. MashNet achieves an error of 34.0 km^2 compared to Locator's 22.1 km^2. MashNet allows growers to quickly and efficiently identify natural varieties that will be most productive in their growth environment based on genotype. This paper explores geolocation for precision agriculture while providing a framework and data source for further development by the machine learning community.

artificial intelligence, bioinformatics, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2301.13387

Country:

North America > United States > Tennessee > Knox County > Knoxville (0.15)
North America > United States > California (0.14)
North America > United States > Tennessee > Anderson County > Oak Ridge (0.04)
(2 more...)

Genre: Research Report > Promising Solution (0.34)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Energy (1.00)
Food & Agriculture > Agriculture (0.97)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Biomedical Informatics > Translational Bioinformatics (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

MAGIC: Microlensing Analysis Guided by Intelligent Computation

Zhao, Haimeng, Zhu, Wei

arXiv.org Artificial Intelligence Oct-14-2022

When a distant star (called the source) gets sufficiently aligned with a massive foreground object (called the lens), the gravitational field of the lens focuses the light out of the distant star, thus making the distant star appear brighter (Einstein 1936; Paczynski 1986). For a typical source star inside the Milky Way, one can observe the time evolution of their brightness (i.e., light curves) and infer the existence and properties of companion objects to the lens by monitoring the deviations in the light curve from the single lens scenario (e.g., Mao & Paczynski 1991; Gould & Loeb 1992). This so-called gravitational microlensing technique has been frequently used to detect exoplanets and stellar binaries and are complementary to other techniques (see reviews by Gaudi 2012 and Zhu & Dong 2021). For a microlensing event with multiple lenses, the interpretation of the light curve can be challenging. First of all, the computation of the multiple-lens microlensing light curve can be time-consuming due to the finite-source effect (e.g., Dong et al. 2006; Bozza 2010). This is especially true when the microlens system consists of three or more objects (e.g., Gaudi et al. 2008; Kuang et al. 2021). Additionally, the likelihood landscape of the high-dimensional parameter space can be so pathological that traditional sampling-based methods may have a hard time searching for the correct solution (or solutions). This remains to be true even when the brute force search on a fine grid that is defined by a subset of model parameters is conducted. As a result, the current analysis of multiple-lens microlensing events is still case-by-case, with each event requiring hundreds of (or …

artificial intelligence, light curve, machine learning, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.3847/1538-3881/ac9230

2206.08199

Country:

Asia > China > Beijing > Beijing (0.04)
Africa > South Africa (0.04)
North America > United States (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Sensing and Signal Processing > Image Processing (0.67)

Benchmarking Learnt Radio Localisation under Distribution Shift

Arnold, Maximilian, Alloulah, Mohammed

arXiv.org Artificial Intelligence Oct-4-2022

Deploying radio frequency (RF) localisation systems invariably entails non-trivial effort, particularly for the latest learning-based breeds. There has been little prior work on characterising and comparing how learnt localiser networks can be deployed in the field under real-world RF distribution shifts. In this paper, we present RadioBench: a suite of 8 learnt localiser nets from the state-of-the-art to study and benchmark their real-world deployability, utilising five novel industry-grade datasets. We train 10k models to analyse the inner workings of these learnt localiser nets and uncover their differing behaviours across three performance axes: (i) learning, (ii) proneness to distribution shift, and (iii) localisation. We use insights gained from this analysis to recommend best practices for the deployability of learning-based RF localisation under practical constraints. Decades of radio frequency (RF) localisation research have given us a variety of classic methods (Patwari et al., 2005; Gezici et al., 2005). Newer machine learning incarnations can enhance location estimation considerably (Zanjani et al., 2022; Karmanov et al., 2021), albeit at the expense of proneness to distributional shift in wireless signals. For example, models trained on signals from a warehouse environment may not work well in another different environment (Arnold et al., 2018). If learnt localiser networks are to be productised and deployed, it is imperative that we robustify them.

arena 1, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2210.0193

Country:

North America > United States > Missouri > Jefferson County > Arnold (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
