Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
Pan Lu, Michel Galley

Neural Information Processing Systems

Large language models (LLMs) have achieved remarkable progress in solving various natural language processing tasks due to emergent reasoning abilities. However, LLMs have inherent limitations as they are incapable of accessing up-to-date information (stored on the Web or in task-specific knowledge bases), using external tools, and performing precise mathematical and logical reasoning.
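The plug-and-play idea can be pictured as an LLM planner that selects a sequence of external modules (web search, calculator, and so on) and chains their outputs. The sketch below is a minimal illustration under hypothetical module names and a placeholder `plan_with_llm` call; it is not Chameleon's actual interface.

```python
# Minimal sketch of plug-and-play tool composition (hypothetical module names,
# not Chameleon's actual API). A planner proposes a tool plan; each tool
# consumes the running context and appends its result.

def web_search(ctx):        # assumed stand-in for an up-to-date knowledge tool
    ctx["evidence"] = f"search results for: {ctx['question']}"
    return ctx

def calculator(ctx):        # assumed stand-in for precise arithmetic
    ctx["answer"] = eval(ctx.get("expression", "0"))  # toy arithmetic only
    return ctx

TOOLS = {"web_search": web_search, "calculator": calculator}

def plan_with_llm(question):
    """Placeholder for an LLM-generated tool plan (here: a fixed plan)."""
    return ["web_search", "calculator"]

def run(question, expression="2 * 21"):
    ctx = {"question": question, "expression": expression}
    for name in plan_with_llm(question):   # execute the composed program
        ctx = TOOLS[name](ctx)
    return ctx["answer"]

print(run("What is twice 21?"))  # -> 42
```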



One Fits All: Power General Time Series Analysis by Pretrained LM
Tian Zhou, Xue Wang

Neural Information Processing Systems

Although we have witnessed great success of pre-trained models in natural language processing (NLP) and computer vision (CV), limited progress has been made for general time series analysis. Unlike NLP and CV, where a unified model can be used to perform different tasks, specially designed approaches still dominate each time series analysis task, such as classification, anomaly detection, forecasting, and few-shot learning. The main challenge that blocks the development of pre-trained models for time series analysis is the lack of a large amount of data for training. In this work, we address this challenge by leveraging language or CV models, pre-trained from billions of tokens, for time series analysis. Specifically, we refrain from altering the self-attention and feedforward layers of the residual blocks in the pre-trained language or image model. This model, known as the Frozen Pretrained Transformer (FPT), is evaluated through fine-tuning on all major types of tasks involving time series. Our results demonstrate that models pre-trained on natural language or images can lead to comparable or state-of-the-art performance in all main time series analysis tasks, as illustrated in Figure 1. We also find, both theoretically and empirically, that the self-attention module behaves similarly to principal component analysis (PCA), an observation that helps explain how the transformer bridges the domain gap and is a crucial step towards understanding the universality of a pre-trained transformer.
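The key recipe, freezing the self-attention and feedforward blocks of a pretrained language model and fine-tuning only the remaining lightweight parts plus a task head, can be sketched as follows. This assumes a HuggingFace GPT-2 backbone and a simple linear forecasting head; it illustrates the freezing scheme rather than the authors' released code.

```python
# Sketch: freeze attention + FFN of a pretrained GPT-2, tune the rest
# (embeddings, layer norms) plus a small forecasting head. Assumes the
# HuggingFace `transformers` package; hyper-parameters are placeholders.
import torch
import torch.nn as nn
from transformers import GPT2Model

class FrozenPretrainedTransformer(nn.Module):
    def __init__(self, input_dim=1, horizon=24, d_model=768):
        super().__init__()
        self.backbone = GPT2Model.from_pretrained("gpt2")
        for name, p in self.backbone.named_parameters():
            # keep self-attention and feedforward weights frozen
            if "attn" in name or "mlp" in name:
                p.requires_grad = False
        self.in_proj = nn.Linear(input_dim, d_model)   # time series -> token space
        self.head = nn.Linear(d_model, horizon)        # forecasting head

    def forward(self, x):                 # x: (batch, seq_len, input_dim)
        h = self.in_proj(x)
        h = self.backbone(inputs_embeds=h).last_hidden_state
        return self.head(h[:, -1])        # predict the next `horizon` steps

model = FrozenPretrainedTransformer()
y = model(torch.randn(4, 96, 1))          # -> (4, 24)
```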



Structured Federated Learning through Clustered Additive Modeling

Neural Information Processing Systems

Heterogeneous federated learning without assuming any structure is challenging due to the conflicts among non-identical data distributions of clients. In practice, clients often comprise near-homogeneous clusters, so training a server-side model per cluster mitigates the conflicts. However, FL with client clustering often suffers from "clustering collapse": one cluster's model excels on an increasing share of clients, and the scheme reduces to single-model FL. Moreover, cluster-wise models hinder knowledge sharing between clusters, and each model depends on fewer clients. Furthermore, the static clustering assumption on data may not hold for dynamically changing models, which are sensitive to cluster imbalance/initialization or outliers. To address these challenges, we propose "Clustered Additive Modeling (CAM)", which applies a globally shared model Θ together with cluster-wise models in an additive manner.
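The additive structure can be read as: a client in cluster k predicts with the sum of the shared global model and its cluster's model. Below is a minimal sketch under assumed toy models; CAM's actual parameterization, clustering update, and server aggregation are not reproduced here.

```python
# Sketch of the clustered-additive prediction: y = global(x) + cluster_k(x).
# Toy linear models for illustration; not the paper's implementation.
import torch
import torch.nn as nn

class CAMPredictor(nn.Module):
    def __init__(self, dim=16, num_clusters=3, out_dim=1):
        super().__init__()
        self.global_model = nn.Linear(dim, out_dim)            # shared model Θ
        self.cluster_models = nn.ModuleList(
            [nn.Linear(dim, out_dim) for _ in range(num_clusters)]  # per-cluster models
        )

    def forward(self, x, cluster_id):
        # additive combination: shared knowledge + cluster-specific correction
        return self.global_model(x) + self.cluster_models[cluster_id](x)

model = CAMPredictor()
y = model(torch.randn(8, 16), cluster_id=1)   # predictions for a cluster-1 client
```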


Supplementary Materials for the Paper "L2T-DLN: Learning to Teach with Dynamic Loss Network"

Neural Information Processing Systems

In this supplementary material, we provide the proofs of the convergence analysis in Section 1, the 1-vs-1 transformation employed in the classification and semantic segmentation tasks in Section 2, the coordinate-wise processing and the preprocessing method of the LSTM teacher in Section 3, the loss functions of YOLO-v3 in Section 4, more image classification experiments in Section 5, and the semantic segmentation inference results in Section 6. A differentiable function e(·) is L-smooth with gradient Lipschitz constant C (uniformly Lipschitz continuous) if ‖∇e(x) − ∇e(y)‖ ≤ C‖x − y‖ for all x, y. If ‖∇e(x)‖ ≤ ϵ, then x is an ϵ-first-order stationary point (SS1). For a differentiable function e(·), if x is an SS1 and there exists ϵ > 0 such that e(x) ≤ e(y) for any y in the ϵ-neighborhood of x, then x is a local minimum. A saddle point x is an SS1 that is not a local minimum.
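As a concrete instance of these definitions (our illustration, not taken from the supplementary material), a simple quadratic already exhibits all three notions:

```latex
% Worked example of the definitions above (illustrative, not from the paper).
% For e(x) = (1/2)||x||^2 we have \nabla e(x) = x, so the gradient is
% 1-Lipschitz, i.e. e is L-smooth with constant C = 1. Any x with
% ||x|| <= \epsilon is an \epsilon-first-order stationary point (SS1),
% and x = 0 is an SS1 that is also a local (indeed global) minimum,
% hence not a saddle point.
\[
  e(x) = \tfrac{1}{2}\lVert x\rVert^{2}, \qquad
  \nabla e(x) = x, \qquad
  \lVert\nabla e(x) - \nabla e(y)\rVert = \lVert x - y\rVert \le 1\cdot\lVert x - y\rVert .
\]
```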



UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild

Neural Information Processing Systems

Achieving machine autonomy and human control often represent divergent objectives in the design of interactive AI systems. Visual generative foundation models such as Stable Diffusion show promise in navigating these goals, especially when prompted with arbitrary languages. However, they often fall short in generating images with spatial, structural, or geometric controls. The integration of such controls, which can accommodate various visual conditions in a single unified model, remains an unaddressed challenge. In response, we introduce UniControl, a new generative foundation model that consolidates a wide array of controllable condition-to-image (C2I) tasks within a singular framework, while still allowing for arbitrary language prompts. UniControl enables pixel-level-precise image generation, where visual conditions primarily influence the generated structures and language prompts guide the style and context. To equip UniControl with the capacity to handle diverse visual conditions, we augment pretrained text-to-image diffusion models and introduce a task-aware HyperNet to modulate the diffusion models, enabling the adaptation to different C2I tasks simultaneously. Trained on nine unique C2I tasks, UniControl demonstrates impressive zero-shot generation abilities with unseen visual conditions. Experimental results show that UniControl often surpasses the performance of single-task-controlled methods of comparable model sizes.
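The task-aware HyperNet can be pictured as a small network that maps a task embedding (e.g., edge map, depth, segmentation) to modulation parameters that rescale the condition features fed into the diffusion backbone. The sketch below is a generic FiLM-style modulation under assumed shapes and task names, not UniControl's released architecture.

```python
# Sketch: a task-aware hypernetwork producing per-channel scale/shift that
# modulates condition features before they enter a diffusion backbone.
# Shapes and task list are illustrative assumptions, not UniControl's code.
import torch
import torch.nn as nn

TASKS = ["canny", "depth", "segmentation"]          # assumed C2I task names

class TaskHyperNet(nn.Module):
    def __init__(self, num_tasks=len(TASKS), task_dim=64, cond_channels=320):
        super().__init__()
        self.task_embed = nn.Embedding(num_tasks, task_dim)
        self.to_scale_shift = nn.Linear(task_dim, 2 * cond_channels)

    def forward(self, cond_feat, task_id):
        # cond_feat: (batch, C, H, W) features extracted from the visual condition
        scale, shift = self.to_scale_shift(self.task_embed(task_id)).chunk(2, dim=-1)
        scale = scale[:, :, None, None]              # broadcast over spatial dims
        shift = shift[:, :, None, None]
        return cond_feat * (1 + scale) + shift       # task-modulated condition

hyper = TaskHyperNet()
task_id = torch.tensor([TASKS.index("depth")])
modulated = hyper(torch.randn(1, 320, 64, 64), task_id)   # -> (1, 320, 64, 64)
```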



Appendix A: Illustrative Examples

Neural Information Processing Systems

This appendix works through the infinitesimal generator v of the 2-dimensional rotation group SO(2). By the prolongation formula, Eq. (4), the first prolongation in t is given by ϕᵗ. As a final illustrative example of the symmetry criterion, we follow Olver's example below. This choice was based on previous research using PINNs and DeepONets for solving Burgers' equation. The output of the embedding vectors from both networks is 100-dimensional. For both the Heat equation and Burgers' equation experiments, we perform hyper-parameter tuning. We also note that for Burgers' equation, we found that cosine similarity for the loss L performed better; the results reported in Section 4 use cosine similarity. We will make the data and the code available on GitHub. The corresponding mean-squared errors are reported in Table 2.
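For reference, the textbook instance of this computation (Olver's rotation example, stated here from the standard reference rather than reproduced from the appendix, whose exact variables may differ) reads:

```latex
% Olver's classical SO(2) example of the prolongation formula.
% Generator of rotations in the (x, u)-plane:
%   v = -u \partial_x + x \partial_u,  i.e.  \xi = -u,  \phi = x.
% First prolongation via \phi^{x} = D_x(\phi - \xi u_x) + \xi u_{xx}:
\[
  \phi^{x} = D_x\!\left(x + u\,u_x\right) - u\,u_{xx}
           = 1 + u_x^{2} + u\,u_{xx} - u\,u_{xx}
           = 1 + u_x^{2},
\]
\[
  \operatorname{pr}^{(1)} v
  = -u\,\partial_x + x\,\partial_u + \left(1 + u_x^{2}\right)\partial_{u_x}.
\]
```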