Continuous Parametric Optical Flow - Supplementary Material

Luo, Jianqin

Neural Information Processing Systems

To provide more information about our training data and evaluation benchmark, we describe our synthetic samples with respect to the length and density of point trajectories.


METHOD: Modular Efficient Transformer for Health Outcome Discovery

Qian, Linglong, Ibrahim, Zina

arXiv.org Artificial Intelligence

Recent advances in transformer architectures have revolutionised natural language processing, but their application to healthcare domains presents unique challenges. Patient timelines are characterised by irregular sampling, variable temporal dependencies, and complex contextual relationships that differ substantially from traditional language tasks. This paper introduces METHOD (Modular Efficient Transformer for Health Outcome Discovery), a novel transformer architecture specifically designed to address the challenges of clinical sequence modelling in electronic health records. METHOD integrates three key innovations: (1) a patient-aware attention mechanism that prevents information leakage whilst enabling efficient batch processing; (2) an adaptive sliding window attention scheme that captures multi-scale temporal dependencies; and (3) a U-Net inspired architecture with dynamic skip connections for effective long sequence processing. Evaluations on the MIMIC-IV database demonstrate that METHOD consistently outperforms the state-of-the-art ETHOS model, particularly in predicting high-severity cases that require urgent clinical intervention. METHOD exhibits stable performance across varying inference lengths, a crucial feature for clinical deployment where patient histories vary significantly in length. Analysis of learned embeddings reveals that METHOD better preserves clinical hierarchies and relationships between medical concepts. These results suggest that METHOD represents a significant advancement in transformer architectures optimised for healthcare applications, providing more accurate and clinically relevant predictions whilst maintaining computational efficiency.
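
As an illustration of the first innovation, the sketch below shows one plausible way to implement a patient-aware attention mask: timelines from several patients are packed into one sequence, and attention is restricted to causal, within-patient token pairs so that batched attention cannot leak information across patients. This is a minimal reconstruction under our own assumptions; the function names and the exact masking scheme are illustrative, not the authors' code.

    import torch

    def patient_aware_mask(patient_ids: torch.Tensor) -> torch.Tensor:
        # patient_ids: (seq_len,) integer id of the patient owning each token.
        # Returns a (seq_len, seq_len) boolean mask, True where attention is allowed.
        same_patient = patient_ids.unsqueeze(0) == patient_ids.unsqueeze(1)
        causal = torch.tril(torch.ones_like(same_patient))
        return same_patient & causal  # causal attention within each patient only

    # Example: three tokens from patient 0 followed by two from patient 1.
    ids = torch.tensor([0, 0, 0, 1, 1])
    mask = patient_aware_mask(ids)
    scores = torch.randn(5, 5)
    scores = scores.masked_fill(~mask, float("-inf"))
    attn = torch.softmax(scores, dim=-1)  # rows mix only same-patient tokens

Masking at the score level keeps the whole packed batch in a single attention call, which is presumably how the paper reconciles leakage prevention with efficient batch processing.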


Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective

Zhong, Meizhi, Zhang, Chen, Lei, Yikun, Liu, Xikai, Gao, Yan, Hu, Yao, Chen, Kehai, Zhang, Min

arXiv.org Artificial Intelligence

Enabling LLMs to handle lengthy contexts is currently a research hotspot. Most LLMs are built upon rotary position embedding (RoPE), a popular position encoding method, so a prominent path is to extrapolate RoPE, trained on comparably short texts, to far longer texts. Substantial effort has been dedicated to improving this extrapolation by extending the formulation of RoPE; however, few of these works have attempted to explain their inner workings comprehensively. In this paper, we offer a straightforward yet in-depth understanding of RoPE extensions from an attention perspective, evaluated on two benchmarking tasks. A broad array of experiments reveals several valuable findings: 1) maintaining attention patterns close to those at the pretrained length improves extrapolation; 2) large attention uncertainty leads to retrieval errors; and 3) using longer continual pretraining lengths for RoPE extensions reduces attention uncertainty and significantly enhances extrapolation.
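
For readers unfamiliar with RoPE, the sketch below gives a minimal implementation of the rotation it applies to query/key channels, together with position interpolation, one common RoPE extension that rescales positions so sequences longer than the pretrained length reuse rotation angles seen during training. The function names and the interpolation scale are illustrative assumptions, not code from the paper.

    import torch

    def rope_angles(positions: torch.Tensor, dim: int, base: float = 10000.0,
                    scale: float = 1.0) -> torch.Tensor:
        # Angles theta_i = (pos / scale) * base^(-2i/dim); scale > 1 implements
        # position interpolation, squeezing long positions into the trained range.
        inv_freq = base ** (-torch.arange(0, dim, 2).float() / dim)
        return (positions.float() / scale).unsqueeze(-1) * inv_freq

    def apply_rope(x: torch.Tensor, positions: torch.Tensor,
                   scale: float = 1.0) -> torch.Tensor:
        # Rotate consecutive channel pairs of x (seq_len, dim) by the angles above.
        angles = rope_angles(positions, x.shape[-1], scale=scale)
        cos, sin = angles.cos(), angles.sin()
        x1, x2 = x[..., 0::2], x[..., 1::2]
        out = torch.empty_like(x)
        out[..., 0::2] = x1 * cos - x2 * sin
        out[..., 1::2] = x1 * sin + x2 * cos
        return out

    # Running at 4x the pretrained length with scale=4.0 keeps every rotation
    # angle within the range the model saw during pretraining.
    q = torch.randn(8192, 64)
    q_rot = apply_rope(q, torch.arange(8192), scale=4.0)

Extensions of this kind change the angle schedule rather than the attention mechanism itself, which is why the paper can study them uniformly through the attention patterns they induce.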