Apple's Big OS Rebrand, OnePlus Embraces AI, and Samsung's Next Folds: Your Gear News of the Week
Bloomberg reports that this year at WWDC, Apple plans to announce a broad overhaul of all of its operating systems, including renaming them to be more consistent. Starting this year, Apple will reportedly denote each OS version by year rather than by version number. Confusingly, the naming will use the upcoming year rather than the current one (just like cars), so the versions unveiled at this year's WWDC will not be iOS 25, but rather iOS 26, watchOS 26, and so on, in place of iOS 19 and watchOS 12. The move is reportedly part of a larger push toward a cohesive user experience across platforms. Here's more you may have missed this week:
Mixture of In-Context Experts Enhance LLMs' Long Context Awareness
Hongzhan Lin, Ang Lv, Yuhan Chen, Chen Zhu
Many studies have revealed that large language models (LLMs) exhibit uneven awareness of different contextual positions. Their limited context awareness can lead to overlooking critical information and subsequent task failures. While several approaches have been proposed to enhance LLMs' context awareness, achieving both effectiveness and efficiency remains challenging. In this paper, for LLMs utilizing RoPE as the position embedding, we introduce a novel method called "Mixture of In-Context Experts" (MoICE) to address this challenge. MoICE comprises two key components: a router integrated into each attention head within the LLM, and a lightweight router-only training optimization strategy. (1) MoICE views each RoPE angle as an 'in-context' expert, demonstrated to be capable of directing the attention of a head to specific contextual positions. Consequently, each attention head flexibly processes tokens using multiple RoPE angles dynamically selected by the router to attend to the needed positions.
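The core idea of mixing RoPE-angle "experts" per attention head can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (which trains the routers and uses dynamic top-k selection); the routing signal, function names, and base values below are illustrative assumptions.

```python
import numpy as np

def rope(x, pos, base):
    """Apply rotary position embedding to vectors x (seq, dim) using a given base angle."""
    d = x.shape[-1]
    inv_freq = base ** (-np.arange(0, d, 2) / d)   # (d/2,) rotation frequencies
    ang = np.outer(pos, inv_freq)                  # (seq, d/2) rotation angles
    cos, sin = np.cos(ang), np.sin(ang)
    x1, x2 = x[:, 0::2], x[:, 1::2]
    out = np.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin
    out[:, 1::2] = x1 * sin + x2 * cos
    return out

def moice_head(q, k, v, router_w, bases):
    """One attention head mixing several RoPE-angle 'experts' via a softmax router.

    q, k, v: (seq, dim) head inputs; router_w: (dim, n_experts) router weights;
    bases: one RoPE base per expert. Each base steers attention toward different
    contextual positions; the router weights combine the experts' outputs.
    """
    seq, d = q.shape
    pos = np.arange(seq)
    # Illustrative routing signal: router scores from the mean query state.
    logits = q.mean(axis=0) @ router_w
    gate = np.exp(logits - logits.max())
    gate /= gate.sum()
    out = np.zeros_like(v)
    for w, base in zip(gate, bases):
        qr, kr = rope(q, pos, base), rope(k, pos, base)
        att = qr @ kr.T / np.sqrt(d)
        att = np.exp(att - att.max(axis=-1, keepdims=True))
        att /= att.sum(axis=-1, keepdims=True)
        out += w * (att @ v)
    return out
```

Because only the router parameters would need gradients, training stays lightweight relative to fine-tuning the whole model.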
We are glad that the reviewers appreciate the technical novelty of our approach and its theoretical guarantees. We respond in more detail below, and took all comments into account in our revised version.
We sincerely thank the reviewers for their time, feedback, and thoughtful suggestions. We would like to first clarify the claims and evaluation of our work: in the context of HC, we focus on Dasgupta's cost (DC).

Approximation Ratio (R3). R3's main concerns are two clarifications about our approximation. The first asks whether the approximation result (Thm 4.1) holds only for the optimal embedding.

HC Baselines (R2). We thank R2 for the suggestions to improve our experiments. We compare against K-Means, a top-down method which is the direct analog of HKM in a similarity-based context [33].
Credal Deep Ensembles for Uncertainty Quantification
Kaizheng Wang
This paper introduces an innovative approach to classification called Credal Deep Ensembles (CreDEs), namely, ensembles of novel Credal-Set Neural Networks (CreNets). CreNets are trained to predict a lower and an upper probability bound for each class, which, in turn, determine a convex set of probabilities (credal set) on the class set. The training employs a loss inspired by distributionally robust optimization which simulates the potential divergence of the test distribution from the training distribution, in such a way that the width of the predicted probability interval reflects the 'epistemic' uncertainty about the future data distribution. Ensembles can be constructed by training multiple CreNets, each associated with a different random seed, and averaging the output intervals. Extensive experiments are conducted on various out-of-distribution (OOD) detection benchmarks (CIFAR10/100 vs SVHN/Tiny-ImageNet, CIFAR10 vs CIFAR10-C, ImageNet vs ImageNet-O) and using different network architectures (ResNet50, VGG16, and ViT Base). Compared to Deep Ensemble baselines, CreDEs demonstrate higher test accuracy, lower expected calibration error, and significantly improved epistemic uncertainty estimation.
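The ensemble step described above, averaging per-class probability intervals across members and reading epistemic uncertainty off the interval width, can be sketched as follows. This assumes each CreNet already outputs valid bounds (the paper's loss and output parameterization handle that); the function name and the width-as-uncertainty score are illustrative.

```python
import numpy as np

def credal_ensemble(lowers, uppers):
    """Average per-class probability intervals from several CreNets.

    lowers, uppers: arrays of shape (n_members, n_classes) holding the
    lower/upper probability bounds predicted by each ensemble member.
    Returns the averaged interval and its mean width, used here as an
    epistemic-uncertainty score (wider interval = more uncertainty).
    """
    lo = np.mean(lowers, axis=0)
    hi = np.mean(uppers, axis=0)
    width = hi - lo
    return lo, hi, float(width.mean())
```

For OOD detection, inputs with a larger mean interval width would be flagged as more epistemically uncertain.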
Supplementary Materials for S-PIFu: Integrating Parametric Human Models with PIFu for Single-view Clothed Human Reconstruction
In Figure 1, we show S-PIFu's results when given images of test subjects who wear large clothing. Images of these test subjects have pixels that belong to the human subject but not to the SMPL-X body, and yet S-PIFu is able to reconstruct the human subjects accurately. Pixels that belong to the human subject but not to the SMPL-X body act as a natural regularizer that prevents S-PIFu from being overly reliant on estimated SMPL-X meshes to reconstruct clothed human meshes. This happens because these pixels only have valid values for the RGB channels and not for the channels of our 2D feature maps (i.e., C, B, and N; recall that C refers to coordinate information, B refers to blendweights-based labels, and N refers to body part orientation information). In Figure 1, we also observe what happens if we feed a noisy SMPL-X mesh (i.e., a SMPL-X mesh with inaccurate pose parameters) to our S-PIFu (note that S-PIFu has not been trained with any noisy SMPL-X meshes). It is not uncommon for an estimated SMPL-X mesh to have an inaccurate pose, as observed by PaMIR [9], ARCH++ [3], and ICON [8].
High-Quality Self-Supervised Deep Image Denoising
Samuli Laine, Tero Karras, Jaakko Lehtinen, Timo Aila
We describe a novel method for training high-quality image denoising models based on unorganized collections of corrupted images. The training does not need access to clean reference images, or explicit pairs of corrupted images, and can thus be applied in situations where such data is unacceptably expensive or impossible to acquire. We build on a recent technique that removes the need for reference data by employing networks with a "blind spot" in the receptive field, and significantly improve two key aspects: image quality and training efficiency. Our result quality is on par with state-of-the-art neural network denoisers in the case of i.i.d. Gaussian noise.
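The "blind spot" idea is that each output pixel is predicted without ever seeing its own input value, so the network cannot learn the identity mapping and must denoise from context. Laine et al. realize the blind spot efficiently via four rotated half-plane receptive fields; the simpler construction sketched below, a convolution whose center weight is masked to zero, conveys the same property and is an illustrative stand-in, not the paper's architecture.

```python
import numpy as np

def blind_spot_conv(img, kernel):
    """2D correlation whose kernel center is forced to zero, so each
    output pixel never depends on its own input value (the 'blind spot')."""
    k = kernel.astype(float).copy()
    c = k.shape[0] // 2
    k[c, c] = 0.0                      # the blind spot: mask the center tap
    h, w = img.shape
    r = c
    padded = np.pad(img, r, mode="reflect")
    out = np.empty_like(img, dtype=float)
    for i in range(h):
        for j in range(w):
            out[i, j] = np.sum(padded[i:i + 2 * r + 1, j:j + 2 * r + 1] * k)
    return out
```

With zero-mean noise, the masked prediction of a pixel from its neighbors is what makes self-supervised training on corrupted images alone possible.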
We would like to thank the reviewers for their comments and remarks, and for their suggestions on clarifying the paper. Reviewers #1 and #4 inquired about the quality of our method with smaller training sets. Our method always performs roughly on par with the baseline supervised training, even with very small training sets.

Training images:
Method          all     10 000   1000   500   300   200   100 (10 runs)
Baseline, N2C   31.60   31.59    …      …     …     …     …

Reviewer #1 remarked that our experiments are performed on synthetic data only. As the non-learned CBM3D method is also designed for natural images, we feel that our comparisons are fair.
Supplementary Material for "Adaptive Experimental Design with Temporal Interference: A Maximum Likelihood Approach"
Throughout this section, we refer to the two Markov chains depicted in Figure 1. The transition probabilities are as depicted in the figure. We assume each chain only earns a reward in state x = 1. Thus the treatment effect is (q(2) − q(1))/s. First, suppose that for ℓ = 1, 2 we wanted to estimate only (ℓ) by running chain ℓ alone. Then note that in every S steps, only one observation is received of the reward in state 1.

Figure 1: The two Markov chains described in Appendix A. Chain 1 is red, and chain 2 is blue.
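The estimation strategy described above, running each chain separately and averaging the reward collected in the single rewarded state, can be sketched generically. The transition matrices below are made-up three-state stand-ins, not the chains of Figure 1, and the function name is illustrative.

```python
import numpy as np

def stationary_reward(P, reward_state=0, n_steps=20_000, seed=0):
    """Estimate the long-run average reward of a Markov chain that earns
    reward 1 only in `reward_state`, by simulating n_steps transitions."""
    rng = np.random.default_rng(seed)
    n = P.shape[0]
    x, total = 0, 0
    for _ in range(n_steps):
        total += (x == reward_state)       # reward collected only in one state
        x = rng.choice(n, p=P[x])          # sample the next state
    return total / n_steps

# Hypothetical 3-state chains standing in for chains 1 and 2 of Figure 1
# (state 0 plays the role of the rewarded state).
P1 = np.array([[0.1, 0.9, 0.0], [0.0, 0.1, 0.9], [0.9, 0.0, 0.1]])
P2 = np.array([[0.5, 0.5, 0.0], [0.0, 0.2, 0.8], [0.8, 0.0, 0.2]])
effect = stationary_reward(P2) - stationary_reward(P1)
```

Note how slowly information accrues: the chain must cycle back to the rewarded state before producing another reward observation, which is the point made above about one observation per S steps.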
Compact Proofs of Model Performance via Mechanistic Interpretability
We propose using mechanistic interpretability (techniques for reverse-engineering model weights into human-interpretable algorithms) to derive and compactly prove formal guarantees on model performance. We prototype this approach by formally proving accuracy lower bounds for a small transformer trained on Max-of-K, validating proof transferability across 151 random seeds and four values of K. We create 102 different computer-assisted proof strategies and assess their length and the tightness of the bound they give on each of our models. Using quantitative metrics, we find that shorter proofs seem to require and provide more mechanistic understanding. Moreover, we find that more faithful mechanistic understanding leads to tighter performance bounds. We confirm these connections by qualitatively examining a subset of our proofs. Finally, we identify compounding structureless errors as a key challenge for using mechanistic interpretability to generate compact proofs of model performance.
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making
Yubin Kim, Chanwoo Park
Foundation models are becoming valuable tools in medicine. Yet despite their promise, the best way to leverage Large Language Models (LLMs) in complex medical tasks remains an open question. We introduce a novel multi-agent framework, named Medical Decision-making Agents (MDAgents), that helps to address this gap by automatically assigning a collaboration structure to a team of LLMs. The assigned solo or group collaboration structure is tailored to the medical task at hand, a simple emulation of the way real-world medical decision-making processes adapt to tasks of different complexity. We evaluate our framework and baseline methods using state-of-the-art LLMs across a suite of real-world medical knowledge and medical diagnosis benchmarks, including a comparison of LLMs' medical complexity classification against that of human physicians.
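The complexity-adaptive routing described above, solo answering for simple queries and group deliberation for complex ones, can be sketched in a few lines. Everything here is an illustrative stand-in: MDAgents classifies complexity with an LLM and uses structured discussion protocols, whereas this sketch uses a crude keyword heuristic and majority voting.

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Agent:
    name: str
    answer: Callable[[str], str]   # in practice, a wrapper around an LLM call

def classify_complexity(query: str) -> str:
    """Hypothetical stand-in for MDAgents' LLM-based complexity check:
    a crude keyword/length heuristic instead of a model judgment."""
    hard_markers = ("differential diagnosis", "comorbid", "contraindication")
    if any(m in query.lower() for m in hard_markers) or len(query) > 200:
        return "high"
    return "low"

def route(query: str, solo: Agent, team: List[Agent]) -> str:
    """Low-complexity queries go to a single agent; high-complexity
    queries are answered by every team member and then aggregated."""
    if classify_complexity(query) == "low":
        return solo.answer(query)
    opinions = [a.answer(query) for a in team]
    # Trivial aggregation for the sketch: majority vote over the answers.
    return max(set(opinions), key=opinions.count)
```

The design point is that coordination cost is only paid when the task warrants it; simple queries skip the group discussion entirely.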