AITopics

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

91edff07232fb1b55a505a9e9f6c0ff3-Paper-Conference.pdf

Neural Information Processing Systems, Mar-27-2025, 12:39:15 GMT

large language model, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.28)
North America > United States > California (0.27)

Genre:

Research Report > New Finding (1.00)
Overview (0.68)


Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(5 more...)


Appendix: Pairwise Causality Guided Transformers for Event Sequences

Neural Information Processing Systems, Mar-27-2025, 12:39:06 GMT

The 5 real-world applications cover various domains.

artificial intelligence, dataset, machine learning, (12 more...)

Neural Information Processing Systems

Genre: Research Report (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)


Pairwise Causality Guided Transformers for Event Sequences

Neural Information Processing Systems, Mar-27-2025, 12:39:03 GMT

Although pairwise causal relations have been extensively studied in observational longitudinal analyses across many disciplines, incorporating knowledge of causal pairs into deep learning models for temporal event sequences remains largely unexplored. In this paper, we propose a novel approach for enhancing the performance of transformer-based models in multivariate event sequences by injecting pairwise qualitative causal knowledge such as 'event Z amplifies future occurrences of event Y'. We establish a new framework for causal inference in temporal event sequences using a transformer architecture, provide a theoretical justification for our approach, and show how to obtain unbiased estimates of the proposed measure. Experimental results demonstrate that our approach outperforms several state-of-the-art models in terms of prediction accuracy by effectively leveraging knowledge about causal pairs. We also consider a unique application where we extract knowledge around sequences of societal events by generating them from a large language model, and demonstrate how a causal knowledge graph can help with event prediction in such sequences. Overall, our framework offers a practical means of improving the performance of transformer-based models in multivariate event sequences by explicitly exploiting pairwise causal information.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Genre:

Research Report > Promising Solution (0.68)
Overview > Innovation (0.48)

Industry:

Law Enforcement & Public Safety (0.68)
Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)


91a5742235f70ae846436d9780e9f1d4-Supplemental-Conference.pdf

Neural Information Processing Systems, Mar-27-2025, 12:38:56 GMT

artificial intelligence, machine learning, metric, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)


TaskMet: Task-Driven Metric Learning for Model Learning

Neural Information Processing Systems, Mar-27-2025, 12:38:52 GMT

Deep learning models are often used with some downstream task. Models solely trained to achieve accurate predictions may struggle to perform well on the desired downstream tasks. We propose using the task loss to learn a metric which parameterizes a loss to train the model. This approach does not alter the optimal prediction model itself, but rather changes the model learning to emphasize the information important for the downstream task. This enables us to achieve the best of both worlds: a prediction model trained in the original prediction space while also being valuable for the desired downstream task. We validate our approach through experiments conducted in two main settings: 1) decision-focused model learning scenarios involving portfolio optimization and budget allocation, and 2) reinforcement learning in noisy environments with distracting states. The source code to reproduce our experiments is available here.
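One way to picture a metric that parameterizes a prediction loss is a Mahalanobis-style quadratic form. The sketch below is only an illustration under that assumption: the function name `mahalanobis_loss` and the example matrices are hypothetical and are not taken from the authors' code, and the metric here is fixed by hand rather than learned from a task loss.

```python
def mahalanobis_loss(pred, target, metric):
    """Return (pred - target)^T M (pred - target) for a square matrix M.

    `metric` is a list of rows. With a positive definite M the loss is
    zero only when pred equals target, so the ideal prediction is the
    same as under plain squared error; only the weighting of errors changes.
    """
    diff = [p - t for p, t in zip(pred, target)]
    # Matrix-vector product M @ diff.
    m_diff = [sum(m_ij * d_j for m_ij, d_j in zip(row, diff)) for row in metric]
    # Dot product diff . (M @ diff).
    return sum(d_i * md_i for d_i, md_i in zip(diff, m_diff))


# Identity metric: ordinary squared error.
identity = [[1.0, 0.0], [0.0, 1.0]]
# Hypothetical metric that treats the first output as more important.
task_weighted = [[9.0, 0.0], [0.0, 0.25]]

pred, target = [1.0, 2.0], [0.0, 0.0]
plain = mahalanobis_loss(pred, target, identity)
weighted = mahalanobis_loss(pred, target, task_weighted)
print(plain, weighted)
```

With the identity metric every output dimension counts equally; the hand-picked weighted metric penalizes errors in the first dimension more heavily, which is the kind of emphasis a task-driven metric would be expected to learn.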

artificial intelligence, deep learning, machine learning, (14 more...)

Neural Information Processing Systems

Industry: Energy > Oil & Gas > Upstream (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)


Quantifying the Gain in Weak-to-Strong Generalization

Neural Information Processing Systems, Mar-27-2025, 12:38:42 GMT

Recent advances in large language models have shown capabilities that are extraordinary and near-superhuman. These models operate with such complexity that reliably evaluating and aligning them proves challenging for humans. This leads to the natural question: can guidance from weak models (like humans) adequately direct the capabilities of strong models?

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

North America (0.14)
Europe > France (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry: Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)


A Computation and Implementation Details We propose several optimizations in the P

Neural Information Processing Systems, Mar-27-2025, 12:38:34 GMT

Explaining Website. For the website dataset, we explain product and user preferences in Figure 15. We generally found that, for the period considered, cosmetic products and the "Jersey Basic category" drove clicks.

artificial intelligence, preferential value function, ref -shap, (14 more...)

Neural Information Processing Systems

Industry: Health & Medicine (0.54)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.34)


Explaining Preferences with Shapley Values Robert Hu

Neural Information Processing Systems, Mar-27-2025, 12:38:31 GMT

Work mainly done while the authors were with the Department of Statistics, University of Oxford. 36th Conference on Neural Information Processing Systems (NeurIPS 2022).

artificial intelligence, machine learning, value function, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.67)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.24)

Genre: Research Report (0.68)

Industry: Leisure & Entertainment > Sports > Tennis (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.93)


Toward Dynamic Non-Line-of-Sight Imaging with Mamba Enforced Temporal Consistency

Neural Information Processing Systems, Mar-27-2025, 12:38:20 GMT

Dynamic reconstruction in confocal non-line-of-sight imaging is challenging because dense raster scanning limits the practical frame rate. A few pioneering works reconstruct high-resolution volumes from under-scanned transient measurements but overlook temporal consistency among transient frames. To fully exploit multi-frame information, we propose the first spatial-temporal Mamba (ST-Mamba) based method tailored for dynamic reconstruction of transient videos. Our method capitalizes on neighbouring transient frames to aggregate the target 3D hidden volume. Specifically, the interleaved features extracted from the input transient frames are fed to the proposed ST-Mamba blocks, which leverage the time-resolving causality in transient measurement.

artificial intelligence, machine learning, reconstruction, (20 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)


LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition (Supplementary Material)

Neural Information Processing Systems, Mar-27-2025, 12:38:16 GMT

In Figure 1, we compare our LMC framework with the baseline Softmax, and present qualitative results on the TinyImageNet dataset. Note that for the baseline Softmax, we do not simulate any virtual open-set classes. As shown, via simulating additional virtual open-set classes that share the spurious-discriminative features, our framework can prevent the closed-set score S of the open-set testing image from being easily overestimated by approaching the image to both a certain closed-set class and certain virtual open-set classes. This demonstrates the effectiveness of our framework in reducing the reliance on spurious-discriminative features. In our experiments, following [1, 11], we use the following two metrics: AUROC and OSCR [3].
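For readers unfamiliar with AUROC, the following is a minimal sketch of how it can be computed from scores and binary labels. The labelling convention (1 for closed-set images, 0 for open-set images), the function name `auroc`, and the sample scores are assumptions made for illustration; they are not the paper's evaluation code, and OSCR is not covered here.

```python
def auroc(scores, labels):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive (label 1)
    receives a higher score than a randomly chosen negative (label 0),
    counting ties as one half.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical closed-set scores: label 1 = closed-set image, 0 = open-set image.
scores = [0.9, 0.8, 0.4, 0.3]
labels = [1, 0, 1, 0]
print(auroc(scores, labels))  # 3 of the 4 positive/negative pairs are ordered correctly -> 0.75
```

An overestimated closed-set score on an open-set image shows up here as a negative that outranks a positive, which lowers the value.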

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Genre: Research Report (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.52)
