88c3c482430a62d35e03926a22e4b67e-Supplemental-Conference.pdf

Neural Information Processing Systems

… CoLA and discuss modifications to improve lower-precision performance. In Appendix D we expand on the details of the experiments in the main text. We now present the linear algebra identities that we use to exploit structure in CoLA. Finally, for sums we have the Woodbury identity and its variants. Besides the compositional operators, we also have rules for several special operators.
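As a concrete instance of the sum rule, the Woodbury identity lets the inverse of a diagonal-plus-low-rank sum be computed through a much smaller solve. Below is a minimal NumPy sketch of that identity, not CoLA's actual API; the matrix sizes and variable names are illustrative assumptions.

```python
import numpy as np

# Woodbury identity: (A + U C V)^{-1} = A^{-1} - A^{-1} U (C^{-1} + V A^{-1} U)^{-1} V A^{-1}
# Illustrative sketch with a diagonal A plus a low-rank term U C V; not CoLA's API.
rng = np.random.default_rng(0)
n, k = 500, 5
d = rng.uniform(1.0, 2.0, size=n)        # diagonal of A
U = rng.normal(size=(n, k))
C = np.eye(k)
V = U.T
b = rng.normal(size=n)

# Direct dense solve, O(n^3): used only to check the identity.
x_direct = np.linalg.solve(np.diag(d) + U @ C @ V, b)

# Woodbury solve: only a k x k system is factorized; A^{-1} is elementwise.
Ainv_b = b / d                            # A^{-1} b
Ainv_U = U / d[:, None]                   # A^{-1} U
small = np.linalg.inv(C) + V @ Ainv_U     # (C^{-1} + V A^{-1} U), k x k
x_woodbury = Ainv_b - Ainv_U @ np.linalg.solve(small, V @ Ainv_b)

assert np.allclose(x_direct, x_woodbury)
```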





Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models (Pan Lu, Michel Galley)

Neural Information Processing Systems

Large language models (LLMs) have achieved remarkable progress in solving various natural language processing tasks due to emergent reasoning abilities. However, LLMs have inherent limitations as they are incapable of accessing up-to-date information (stored on the Web or in task-specific knowledge bases), using external tools, and performing precise mathematical and logical reasoning.



One Fits All: Power General Time Series Analysis by Pretrained LM (Tian Zhou, Xue Wang)

Neural Information Processing Systems

Although we have witnessed great success of pre-trained models in natural language processing (NLP) and computer vision (CV), limited progress has been made for general time series analysis. Unlike NLP and CV, where a unified model can be used to perform different tasks, specially designed approaches still dominate each time series analysis task, such as classification, anomaly detection, forecasting, and few-shot learning. The main challenge that blocks the development of pre-trained models for time series analysis is the lack of a large amount of data for training. In this work, we address this challenge by leveraging language or CV models, pre-trained from billions of tokens, for time series analysis. Specifically, we refrain from altering the self-attention and feedforward layers of the residual blocks in the pre-trained language or image model. This model, known as the Frozen Pretrained Transformer (FPT), is evaluated through fine-tuning on all major types of tasks involving time series. Our results demonstrate that models pre-trained on natural language or images can lead to comparable or state-of-the-art performance in all main time series analysis tasks, as illustrated in Figure 1. We also found both theoretically and empirically that the self-attention module behaves similarly to principal component analysis (PCA), an observation that helps explain how the transformer bridges the domain gap and is a crucial step toward understanding the universality of a pre-trained transformer.
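A minimal sketch of the freezing scheme described above, using a HuggingFace GPT-2 backbone: self-attention and feed-forward weights stay frozen, while the remaining parameters (and any task-specific head one adds) stay trainable. Which auxiliary parameters to unfreeze and the head shown here are illustrative assumptions, not the paper's exact recipe.

```python
import torch
from transformers import GPT2Model

# Load a pre-trained language-model backbone (assumed here: GPT-2 small).
backbone = GPT2Model.from_pretrained("gpt2")

# Freeze the self-attention and feed-forward (MLP) weights of every residual
# block; leave layer norms and embeddings trainable, as one plausible reading
# of the "frozen pretrained transformer" setup described above.
for name, param in backbone.named_parameters():
    param.requires_grad = not (".attn." in name or ".mlp." in name)

trainable = sum(p.numel() for p in backbone.parameters() if p.requires_grad)
total = sum(p.numel() for p in backbone.parameters())
print(f"trainable parameters: {trainable}/{total}")

# A hypothetical task-specific head mapping hidden states to forecasts:
head = torch.nn.Linear(backbone.config.n_embd, 1)
```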



Structured Federated Learning through Clustered Additive Modeling

Neural Information Processing Systems

Heterogeneous federated learning without assuming any structure is challenging due to the conflicts among the non-identical data distributions of clients. In practice, clients often comprise near-homogeneous clusters, so training a server-side model per cluster mitigates the conflicts. However, FL with client clustering often suffers from "clustering collapse", i.e., one cluster's model excels on an increasing number of clients, and the method reduces to single-model FL. Moreover, cluster-wise models hinder knowledge sharing between clusters, and each model depends on fewer clients. Furthermore, the static clustering assumption on data may not hold for dynamically changing models, which are sensitive to cluster imbalance/initialization or outliers. To address these challenges, we propose "Clustered Additive Modeling (CAM)", which applies a globally shared model Θ
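The additive decomposition sketched above (the abstract is truncated here) combines a globally shared model with a cluster-specific one. Below is a minimal PyTorch sketch of that idea; the module names, sizes, and number of clusters are illustrative assumptions, not the paper's code.

```python
import torch
import torch.nn as nn

# Hypothetical sketch of clustered additive modeling: a client's prediction is
# the sum of a globally shared model and its cluster's model.
class ClusteredAdditiveModel(nn.Module):
    def __init__(self, in_dim: int, out_dim: int, num_clusters: int):
        super().__init__()
        self.global_model = nn.Linear(in_dim, out_dim)   # shared by all clients
        self.cluster_models = nn.ModuleList(
            [nn.Linear(in_dim, out_dim) for _ in range(num_clusters)]
        )

    def forward(self, x: torch.Tensor, cluster_id: int) -> torch.Tensor:
        # Additive combination: shared component + cluster-specific component.
        return self.global_model(x) + self.cluster_models[cluster_id](x)

model = ClusteredAdditiveModel(in_dim=16, out_dim=1, num_clusters=3)
y = model(torch.randn(8, 16), cluster_id=1)
```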


Supplementary Materials for the Paper "L2T-DLN: Learning to Teach with Dynamic Loss Network"

Neural Information Processing Systems

BIT Special Zone, Beijing Institute of Technology, Beijing, China, 100081

In this supplementary material, we provide the proofs of the convergence analysis in Section 1, the 1-vs-1 transformation employed in the classification and semantic segmentation tasks in Section 2, the coordinate-wise and preprocessing methods of the LSTM teacher in Section 3, the loss functions of YOLO-v3 in Section 4, more experiments on image classification in Section 5, and the inference results for semantic segmentation in Section 6. A differentiable function e(·) is L-smooth with gradient Lipschitz constant C (uniformly Lipschitz continuous) if ‖∇e(x) − ∇e(y)‖ ≤ C‖x − y‖ for all x, y. If ‖∇e(x)‖ ≤ ϵ, then x is an ϵ-first-order stationary point (SS1). For a differentiable function e(·), if x is an SS1 and there exists ϵ > 0 such that e(x) ≤ e(y) for any y in the ϵ-neighborhood of x, then x is a local minimum. A saddle point x is an SS1 that is not a local minimum.
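As a small numerical illustration of these definitions, the sketch below checks whether a point is an ϵ-first-order stationary point of a toy function and estimates a gradient Lipschitz constant by sampling; the objective, threshold, and sample count are assumptions for illustration only.

```python
import numpy as np

# Toy objective e(x) = 0.5 * x^T A x with gradient A x; its gradient Lipschitz
# constant is the largest eigenvalue of A. All numbers are illustrative.
A = np.array([[3.0, 1.0], [1.0, 2.0]])
grad = lambda x: A @ x

def is_eps_first_order_stationary(x, eps):
    # x is an eps-first-order stationary point if ||grad e(x)|| <= eps.
    return np.linalg.norm(grad(x)) <= eps

# Empirical lower bound on the gradient Lipschitz constant C in
# ||grad e(x) - grad e(y)|| <= C ||x - y|| for all x, y.
rng = np.random.default_rng(0)
ratios = []
for _ in range(1000):
    x, y = rng.normal(size=2), rng.normal(size=2)
    ratios.append(np.linalg.norm(grad(x) - grad(y)) / np.linalg.norm(x - y))

print("estimated C >=", max(ratios))                          # approaches lambda_max(A)
print(is_eps_first_order_stationary(np.zeros(2), eps=1e-6))   # True: x = 0 is stationary
```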