Why Normalizing Flows Fail to Detect Out-of-Distribution Data

Neural Information Processing Systems

Detecting out-of-distribution (OOD) data is crucial for robust machine learning systems. Normalizing flows are flexible deep generative models that often, surprisingly, fail to distinguish between in- and out-of-distribution data: a flow trained on pictures of clothing assigns higher likelihood to handwritten digits. We investigate why normalizing flows perform poorly for OOD detection. Focusing on flows based on coupling layers, we demonstrate that flows learn local pixel correlations and generic image-to-latent-space transformations that are not specific to the target image dataset. We show that by modifying the architecture of flow coupling layers we can bias the flow towards learning the semantic structure of the target data, improving OOD detection.
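Since the abstract centers on coupling layers, a minimal sketch of a RealNVP-style affine coupling layer may help. The PyTorch code below is our own illustration (the module names, conditioner size, and tanh-bounded scale are assumptions), not the architecture analyzed or proposed in the paper.

```python
import torch
import torch.nn as nn

class AffineCoupling(nn.Module):
    """Minimal affine coupling layer (RealNVP-style), for illustration only.

    The input is split in half; one half conditions an affine transform
    (scale and shift) applied to the other half. The conditioner below is
    a hypothetical two-layer MLP, not the paper's architecture.
    """

    def __init__(self, dim, hidden=256):
        super().__init__()
        self.half = dim // 2
        # st_net predicts log-scale and shift for the transformed half.
        self.st_net = nn.Sequential(
            nn.Linear(self.half, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 2 * (dim - self.half)),
        )

    def forward(self, x):                      # x: (batch, dim) flattened input
        x1, x2 = x[:, :self.half], x[:, self.half:]
        log_s, t = self.st_net(x1).chunk(2, dim=1)
        log_s = torch.tanh(log_s)              # keep scales bounded for stability
        y2 = x2 * torch.exp(log_s) + t         # invertible affine transform
        log_det = log_s.sum(dim=1)             # contribution to the log-likelihood
        return torch.cat([x1, y2], dim=1), log_det
```

In this design the conditioner (st_net here) is free to exploit local pixel correlations in x1 when predicting the transform of x2, which is exactly the dataset-agnostic behavior the paper identifies; the proposed architectural modification biases this conditioner toward the semantic structure of the target data.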


Learning threshold neurons via the "edge of stability"

Neural Information Processing Systems

Large step sizes are necessary to learn the "threshold neuron" of a ReLU network (2) for a simple binary classification task (1). We choose d = 200, n = 300, λ = 3, and run gradient descent with the logistic loss.
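The network (2) and task (1) are defined in the paper; the sketch below only illustrates the experimental recipe stated in the caption, i.e. full-batch gradient descent with the logistic loss at a large step size. The data model, network shape, and step size are our own placeholder assumptions (λ = 3 enters the paper's setup and is not modeled here).

```python
import numpy as np

rng = np.random.default_rng(0)
d, n = 200, 300                   # dimensions from the caption
step_size = 2.0                   # a "large" step size; illustrative value only

# Placeholder binary classification data; the paper's task (1) may differ.
X = rng.standard_normal((n, d))
y = rng.choice([-1.0, 1.0], size=n)

# Placeholder two-layer ReLU network; the paper's network (2) may differ.
m = 50
W = rng.standard_normal((m, d)) / np.sqrt(d)      # trained first-layer weights
a = rng.choice([-1.0, 1.0], size=m) / np.sqrt(m)  # fixed second-layer signs

def forward(X):
    return np.maximum(X @ W.T, 0.0) @ a           # f(x) = sum_j a_j ReLU(w_j . x)

for step in range(500):
    margins = y * forward(X)
    sigma = 1.0 / (1.0 + np.exp(margins))         # -d/dm log(1 + exp(-m))
    act = (X @ W.T > 0).astype(float)             # ReLU derivative, shape (n, m)
    grad_W = -(((sigma * y)[:, None] * act).T @ X) * a[:, None] / n
    W -= step_size * grad_W                       # full-batch gradient descent
```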



Evolutionary Neural Architecture Search for Transformer in Knowledge Tracing

Neural Information Processing Systems

Transformers have achieved excellent performance on the knowledge tracing (KT) task, but they are criticized for relying on manually selected input features for fusion and for the weakness of purely global context modelling in capturing students' forgetting behavior when the related records are distant in time from the current record. To address these issues, this paper first adds convolution operations to the Transformer to strengthen the local context modelling used for students' forgetting behavior, and then proposes an evolutionary neural architecture search approach that automates input feature selection and automatically determines where to apply which operation so as to balance local and global context modelling. In the search-space design, the original global path containing the attention module in the Transformer is replaced with the sum of a global path and a local path that can contain different convolutions, and the selection of input features is also included. To find the best architecture, we employ an effective evolutionary algorithm to explore the search space and propose a search-space reduction strategy to accelerate its convergence. Experimental results on the two largest and most challenging education datasets demonstrate the effectiveness of the architecture found by the proposed approach.
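To make the search-space modification concrete, here is a minimal PyTorch sketch of the modified block, in which the attention-only global path is replaced by the sum of a global attention path and a local convolutional path. The module names, kernel size, head count, and residual/normalization wiring are our illustrative assumptions; in the paper, the evolutionary search decides which local operation appears where rather than fixing these choices.

```python
import torch
import torch.nn as nn

class GlobalLocalBlock(nn.Module):
    """Illustrative sketch: sum of a global (attention) path and a local
    (convolution) path, replacing the attention-only path. Placeholder
    hyperparameters; the evolutionary search selects the actual operations."""

    def __init__(self, d_model=128, n_heads=4, kernel_size=3):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.conv = nn.Conv1d(d_model, d_model, kernel_size,
                              padding=kernel_size // 2)
        self.norm = nn.LayerNorm(d_model)

    def forward(self, x):                      # x: (batch, seq_len, d_model)
        global_out, _ = self.attn(x, x, x)     # global context path
        local_out = self.conv(x.transpose(1, 2)).transpose(1, 2)  # local path
        return self.norm(x + global_out + local_out)  # sum of the two paths
```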



Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems (Zhichao Huang)

Neural Information Processing Systems

We focus on the stochastic setting, where we can only access an unbiased stochastic gradient estimate of f at each iteration. This formulation covers many machine learning applications as special cases, such as robust optimization and adversarial training.
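For reference, the problem class named in the title can be written, in our (standard) notation, as

    \min_{x \in \mathbb{R}^{d_1}} \max_{y \in \mathbb{R}^{d_2}} f(x, y) = \mathbb{E}_{\xi}\big[ F(x, y; \xi) \big],

where f(., y) is nonconvex in x, f(x, .) is strongly concave in y, and each iteration observes only the unbiased stochastic gradient \nabla F(x, y; \xi).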


Reply to Reviewers 2, 3 and 4. Novelty of the analysis: Besides keeping the estimator of f(x_k, y...

Neural Information Processing Systems

We are happy to cite this paper and compare it with SREDA. A more reasonable approach is to solve the problem by accessing the (stochastic) gradient of f, as SREDA does. We will follow the reviewer's suggestion and include this comparison in the main text if the paper is accepted.


Alien, Amidar, Assault, Asterix, Asteroids, Atlantis (Atari game names; residue of a per-game results table)

Neural Information Processing Systems

For all authors...
(a) Do the main claims made in the abstract and introduction accurately reflect the paper's contributions and scope?

If you ran experiments...
(a) Did you include the code, data, and instructions needed to reproduce the main experimental results (either in the supplemental material or as a URL)? [Yes] Code provided as supplemental.

If you used crowdsourcing or conducted research with human subjects...
(a) Did you include the full text of instructions given to participants and screenshots, if applicable? [N/A]
(b) Did you describe any potential participant risks, with links to Institutional Review Board (IRB) approvals, if applicable? [N/A]
(c) Did you include the estimated hourly wage paid to participants and the total amount spent on participant compensation?

A.1 Implementation, Hyperparameters and Evaluation Details
The implementation of our main agent, Tandem DQN, is based on the Double-DQN [van Hasselt et al., 2016] agent provided in the DQN Zoo open-source agent collection [Quan and Ostrovski, 2020].

Figure 12: Tandem DQN: Active vs. passive performance on four selected Classic Control domains.
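The appendix excerpt describes Tandem DQN only as a Double-DQN-based agent, while Figure 12 refers to an active/passive comparison. As a hedged illustration of what such a tandem loop could look like, here is a short Python sketch; env, replay, select_action, and dqn_update are hypothetical placeholders, and the actual coupling of the two learners in the paper may differ.

```python
# Illustrative sketch of a tandem training loop (our assumption of the setup):
# an active agent both acts and learns, while a passive agent learns from the
# exact same stream of transitions without ever controlling behavior.

def tandem_training(env, active, passive, replay, num_steps):
    obs = env.reset()
    for step in range(num_steps):
        # Only the active agent's policy generates experience.
        action = select_action(active, obs)
        next_obs, reward, done, _ = env.step(action)
        replay.add(obs, action, reward, next_obs, done)
        obs = env.reset() if done else next_obs

        # Both agents apply the same (Double-DQN-style) update rule to
        # identical batches; only the data-generation roles differ.
        batch = replay.sample()
        dqn_update(active, batch)
        dqn_update(passive, batch)
```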