AITopics | embed

embed

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

081b08068e4733ae3e7ad019fe8d172f-Supplemental-Conference.pdf

Neural Information Processing Systems, Apr-24-2026, 12:11:01 GMT

artificial intelligence, convolutional modulation, embed, (12 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.94)

Learning to Execute Programs with Instruction Pointer Attention Graph Neural Networks

Neural Information Processing Systems, Feb-8-2026, 15:45:09 GMT

Graph neural networks (GNNs) have emerged as a powerful tool for learning software engineering tasks including code completion, bug finding, and program repair. The IPA-GNN can be seen either as a continuous relaxation of the RNN model or as a GNN variant more tailored to execution.

artificial intelligence, branch decision, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

A Appendix

Neural Information Processing Systems, Feb-7-2026, 13:54:41 GMT

To build our latency prediction model, we test three types of hardware devices: NVIDIA V100, NVIDIA GTX 2080, and NVIDIA GTX 1080. Their respective properties are presented in Table 6. It shows that the server GPU V100 is the most powerful hardware device, with the most processing engines (#PE). We map the operations to hardware. These split tiles are assigned to multiple PEs.

artificial intelligence, embed, machine learning, (12 more...)

Neural Information Processing Systems

Technology:

Information Technology > Graphics (0.88)
Information Technology > Hardware (0.74)
Information Technology > Sensing and Signal Processing > Image Processing (0.46)
(2 more...)

Segment, Embed, and Align: A Universal Recipe for Aligning Subtitles to Signing

Jiang, Zifan, Jang, Youngjoon, Momeni, Liliane, Varol, Gül, Ebling, Sarah, Zisserman, Andrew

arXiv.org Artificial Intelligence, Dec-10-2025

The goal of this work is to develop a universal approach for aligning subtitles (i.e., spoken language text with corresponding timestamps) to continuous sign language videos. Prior approaches typically rely on end-to-end training tied to a specific language or dataset, which limits their generality. In contrast, our method Segment, Embed, and Align (SEA) provides a single framework that works across multiple languages and domains. SEA leverages two pretrained models: the first to segment a video frame sequence into individual signs and the second to embed the video clip of each sign into a shared latent space with text. Alignment is subsequently performed with a lightweight dynamic programming procedure that runs efficiently on CPUs within a minute, even for hour-long episodes. SEA is flexible and can adapt to a wide range of scenarios, utilizing resources from small lexicons to large continuous corpora. Experiments on four sign language datasets demonstrate state-of-the-art alignment performance, highlighting the potential of SEA to generate high-quality parallel data for advancing sign language processing. SEA's code and models are openly available.
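The alignment step mentioned in this abstract can be illustrated with a generic monotonic dynamic program. This is a minimal sketch, not SEA's actual procedure. It assumes a precomputed similarity matrix `sim[i][j]` between subtitle `i` and sign segment `j` (a hypothetical input, for example cosine similarities in the shared latent space), that signs are assigned to subtitles in temporal order, and that every subtitle covers at least one sign. The name `monotonic_align` is illustrative.

```python
import math


def monotonic_align(sim):
    """Assign each sign segment to one subtitle, in order, maximizing total similarity.

    sim[i][j] is the (assumed) similarity between subtitle i and sign segment j.
    Returns a list where entry j is the subtitle index assigned to sign j.
    """
    n_sub, n_sign = len(sim), len(sim[0])
    if n_sign < n_sub:
        raise ValueError("need at least one sign segment per subtitle")

    neg_inf = -math.inf
    # score[i][j]: best total similarity for signs 0..j with sign j assigned to subtitle i
    score = [[neg_inf] * n_sign for _ in range(n_sub)]
    back = [[0] * n_sign for _ in range(n_sub)]
    score[0][0] = sim[0][0]

    for j in range(1, n_sign):
        for i in range(min(j, n_sub - 1) + 1):
            stay = score[i][j - 1]  # previous sign belongs to the same subtitle
            advance = score[i - 1][j - 1] if i > 0 else neg_inf  # move to next subtitle
            best = max(stay, advance)
            if best == neg_inf:
                continue
            score[i][j] = sim[i][j] + best
            back[i][j] = i if stay >= advance else i - 1

    # Trace back from the last sign, which must belong to the last subtitle.
    assignment = [0] * n_sign
    i = n_sub - 1
    for j in range(n_sign - 1, -1, -1):
        assignment[j] = i
        i = back[i][j]
    return assignment


sim = [
    [0.9, 0.8, 0.1, 0.2],  # subtitle 0 vs. signs 0..3
    [0.1, 0.2, 0.7, 0.9],  # subtitle 1 vs. signs 0..3
]
print(monotonic_align(sim))  # [0, 0, 1, 1]
```

The table has one cell per (subtitle, sign) pair and each cell is filled in constant time, so the cost grows with the product of the two counts. That is consistent with the abstract's claim that alignment runs quickly on a CPU, although the paper's actual procedure may differ.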

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2512.08094

Country: Europe (0.46)

Genre: Research Report (0.64)

Industry: Education (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.67)

A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation

Lawton, Neal Gregory, Samuel, Alfy, Kumar, Anoop, Liu, Daben

arXiv.org Artificial Intelligence, Oct-21-2025

Published: 20 Aug 2025. Retrieval-augmented generation (RAG) is a popular framework for question answering that is powered by two large language models (LLMs): an embedding model that retrieves context documents from a database that are relevant to a given question, and a generator model that uses the retrieved context to generate an answer to the question. Both the embedding and generator models can be fine-tuned to increase performance of a RAG pipeline on a new task, but multiple fine-tuning strategies exist with different costs and benefits. In this paper, we evaluate and compare several RAG fine-tuning strategies, including independent, joint, and two-phase fine-tuning. In our experiments, we observe that all of these strategies achieve about equal improvement in EM and F1 generation quality metrics, although they have significantly different computational costs. We conclude the optimal fine-tuning strategy to use depends on whether the training dataset includes context labels and whether a grid search over the learning rates for the embedding and generator models is required.
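The EM and F1 metrics named in this abstract are commonly computed as exact match and token-overlap F1 between a generated answer and a reference answer. The sketch below assumes lowercase whitespace tokenization only; published evaluation scripts usually also strip punctuation and articles, and the paper's exact normalization is not stated here. The function names are illustrative.

```python
from collections import Counter


def normalize(text):
    """Lowercase and split on whitespace (a simplifying assumption)."""
    return text.lower().split()


def exact_match(prediction, reference):
    """1.0 if the normalized token sequences are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction, reference):
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction)
    ref_tokens = normalize(reference)
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


print(exact_match("Paris", "paris"))      # 1.0
print(token_f1("the cat sat", "the cat"))
```

In the second call, precision is 2/3 and recall is 1, so the F1 score is 0.8 up to floating-point rounding.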

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2510.016

Genre: Research Report (0.71)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images

Manuel Watter

Neural Information Processing Systems, Oct-2-2025, 11:48:35 GMT

We introduce Embed to Control (E2C), a method for model learning and control of non-linear dynamical systems from raw pixel images. E2C consists of a deep generative model, belonging to the family of variational autoencoders, that learns to generate image trajectories from a latent space in which the dynamics is constrained to be locally linear. Our model is derived directly from an optimal control formulation in latent space, supports long-term prediction of image sequences and exhibits strong performance on a variety of complex control problems.
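As a rough illustration of what "locally linear" dynamics in latent space means, the transition can be written in the form below. The symbols are notation assumed for this sketch rather than quoted from the abstract: $z_t$ is the latent state, $u_t$ the control input, and $A$, $B$, $o$ a state-dependent transition matrix, control matrix, and offset.

```latex
z_{t+1} \approx A(z_t)\, z_t + B(z_t)\, u_t + o(z_t)
```

Because the model is linear in $z_t$ and $u_t$ around each operating point, standard optimal control methods for linear systems can be applied locally in the latent space.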

artificial intelligence, latent space, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Baden-Württemberg > Freiburg (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.68)

T-SYNTH: A Knowledge-Based Dataset of Synthetic Breast Images

Wiedeman, Christopher, Sarmakeeva, Anastasiia, Sizikova, Elena, Filienko, Daniil, Lago, Miguel, Delfino, Jana G., Badano, Aldo

arXiv.org Artificial Intelligence, Sep-19-2025

Responsible for approximately two million new cases and over six hundred thousand deaths in 2022 alone (Sung et al., 2021), breast cancer remains a prominent global health concern, and is expected to account for nearly one-third of all newly diagnosed cancers among women in the United States (DeSantis et al., 2016). According to the most recent report from the International Agency for Research on Cancer (Bray et al., 2024), it is one of the most widespread cancers diagnosed worldwide, both in the number of cases and associated deaths. Consequently, medical imaging techniques are indispensable for screening, diagnosis, and further research into the disease. Historically, the most common imaging technique for breast cancer screening is digital mammography (DM), in which a 2D x-ray projection of a compressed breast is taken. Digital breast tomosynthesis (DBT), a pseudo-3D imaging technique, has been increasingly adopted, demonstrating improved screening performance (Asbeutah et al., 2019; Sprague et al., 2023).

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2507.04038

Country: North America > United States (1.00)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (0.92)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Symbols List of symbols used in the paper with their brief description

Neural Information Processing Systems, Aug-17-2025, 03:56:14 GMT

This two-stage stochastic program (2SP) has a set of continuous first-stage decisions which yield an immediate revenue. In the second stage, after a set of random variables are realized, a set of binary decisions can be made to receive further profit. In this work, we specifically consider the instance described in Example 7.3 of [Schultz et al., 1998].
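A problem with this structure can be written in the generic form below. This is a sketch of the general pattern only, not the specific instance from Example 7.3; all symbols ($x$, $y$, $\xi$, $c$, $q$, $W$, $T$, $h$, $X$, $m$) are assumed notation introduced for illustration.

```latex
\max_{x \in X} \; c^\top x + \mathbb{E}_{\xi}\!\left[ Q(x, \xi) \right],
\qquad
Q(x, \xi) = \max_{y \in \{0,1\}^m} \left\{ q^\top y \;:\; W y \le h(\xi) - T x \right\}
```

Here $x$ collects the continuous first-stage decisions with immediate revenue $c^\top x$, $\xi$ is the random outcome revealed before the second stage, and $y$ collects the binary second-stage decisions with profit $q^\top y$.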

artificial intelligence, configuration, machine learning, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Appendix A Latency Driven Slimming Algorithm

Neural Information Processing Systems, Aug-14-2025, 21:39:24 GMT

We provide the details of the proposed latency-driven fast slimming in Alg. 1. Formulations of the … Our major conclusions and speed analysis can be found in Sec. 3 and Figure 1. Compared to non-overlap large-kernel patch embedding (V5 in Tab. … MHSA with the global receptive field is an essential contribution to model performance. By comparing V1 and V2 in Tab. 3, we can observe that the GN … We explore ReLU and HardSwish (V3 and V4 in Tab. 3) in addition to GeLU … We draw a conclusion that the activation function can be selected on a case-by-case basis depending on the specific hardware and compiler. In this work, we use GeLU to provide better performance than ReLU while executing faster.
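For reference, the three activation functions compared in this excerpt have standard closed forms. The sketch below uses only the Python standard library; the function names are illustrative, and the GELU shown is the exact erf-based form (deep learning frameworks often also offer a tanh approximation). It says nothing about relative speed, which the excerpt notes depends on hardware and compiler.

```python
import math


def relu(x):
    """ReLU: max(0, x)."""
    return max(0.0, x)


def hardswish(x):
    """HardSwish: x * ReLU6(x + 3) / 6, a piecewise approximation of Swish."""
    return x * min(max(x + 3.0, 0.0), 6.0) / 6.0


def gelu(x):
    """GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


for value in (-3.0, -1.0, 0.0, 1.0, 3.0):
    print(value, relu(value), hardswish(value), gelu(value))
```

ReLU and HardSwish need only comparisons and multiplications, while exact GELU evaluates an error function.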

efficientformer, efficientformer-l1, embed, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.35)
