A Experimental Setup

Neural Information Processing Systems

ASIN (id): ASIN stands for Amazon Standard Identification Number. Title: The Title attribute represents the name or title given to a product, book, or creative work. Size: Size indicates the dimensions or physical size of the product. Model: The Model attribute refers to a specific model or version of a product. Material: The Material attribute provides information about the product's primary material, such as metal or plastic. Color Text: Color Text describes the color or color variation of the product.


The Curse of Unrolling: Rate of Differentiating Through Optimization

Neural Information Processing Systems

Computing the Jacobian of the solution of an optimization problem is a central problem in machine learning, with applications in hyperparameter optimization, meta-learning, optimization as a layer, and dataset distillation, to name a few. Unrolled differentiation is a popular heuristic that approximates the solution using an iterative solver and differentiates through the computational path. This work provides a non-asymptotic convergence-rate analysis of this approach on quadratic objectives for gradient descent and the Chebyshev method. We show that to ensure convergence of the Jacobian, we can either 1) choose a large learning rate leading to fast asymptotic convergence but accept that the algorithm may have an arbitrarily long burn-in phase, or 2) choose a smaller learning rate leading to immediate but slower convergence. We refer to this phenomenon as the curse of unrolling. Finally, we discuss open problems relative to this approach, such as deriving a practical update rule for the optimal unrolling strategy and making novel connections with the field of Sobolev orthogonal polynomials.
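The Jacobian recursion that unrolling induces can be illustrated on a one-dimensional quadratic. The following is a minimal sketch in my own notation, not the paper's code: gradient descent on f(x) = 0.5·a·x² − b·x has minimizer x* = b/a, so the true Jacobian dx*/db is 1/a, and differentiating each GD update by hand gives a linear recursion for J_k = dx_k/db.

```python
# Illustrative sketch (not the paper's code): unrolled differentiation of
# gradient descent on the 1-D quadratic f(x) = 0.5 * a * x**2 - b * x.
# The minimizer is x* = b / a, so the true Jacobian dx*/db equals 1 / a.

def unrolled_gd(a, b, eta, steps):
    """Run GD on f and propagate J_k = d x_k / d b through every update."""
    x, J = 0.0, 0.0  # x_0 does not depend on b, so J_0 = 0
    xs, Js = [x], [J]
    for _ in range(steps):
        # GD update: x_{k+1} = x_k - eta * grad f(x_k) = x_k - eta * (a * x_k - b)
        x = x - eta * (a * x - b)
        # Differentiating the update w.r.t. b yields the Jacobian recursion:
        # J_{k+1} = (1 - eta * a) * J_k + eta
        J = (1.0 - eta * a) * J + eta
        xs.append(x)
        Js.append(J)
    return xs, Js

a, b, eta = 2.0, 3.0, 0.4          # eta < 2 / a, so the iterates converge
xs, Js = unrolled_gd(a, b, eta, 50)
print(xs[-1])  # approaches x*    = b / a = 1.5
print(Js[-1])  # approaches dx*/db = 1 / a = 0.5
```

With a larger step size the factor (1 − ηa) grows in magnitude and the Jacobian iterates can transiently move away from 1/a before settling, which mirrors the burn-in phase the abstract describes.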


FairDeFace: Evaluating the Fairness and Adversarial Robustness of Face Obfuscation Methods

Khorzooghi, Seyyed Mohammad Sadegh Moosavi, Thota, Poojitha, Singhal, Mohit, Asudeh, Abolfazl, Das, Gautam, Nilizadeh, Shirin

arXiv.org Artificial Intelligence

The lack of a common platform and benchmark datasets for evaluating face obfuscation methods has been a challenge, with every method being tested using arbitrary experiments, datasets, and metrics. While prior work has demonstrated that face recognition systems exhibit bias against some demographic groups, there is a substantial gap in our understanding of the fairness of face obfuscation methods. Fair face obfuscation methods can ensure equitable protection across diverse demographic groups, especially since they can be used to preserve the privacy of vulnerable populations. To address these gaps, this paper introduces a comprehensive framework, named FairDeFace, designed to assess the adversarial robustness and fairness of face obfuscation methods. The framework comprises a set of modules encompassing benchmark datasets, face detection and recognition algorithms, adversarial models, utility detection models, and fairness metrics. FairDeFace serves as a versatile platform into which any face obfuscation method can be integrated, allowing for rigorous testing and comparison with other state-of-the-art methods. In its current implementation, FairDeFace incorporates six attacks and several privacy, utility, and fairness metrics. Using FairDeFace, and by conducting more than 500 experiments, we evaluated and compared the adversarial robustness of seven face obfuscation methods. This extensive analysis led to many interesting findings, both in terms of the degree of robustness of existing methods and their biases against some gender or racial groups. FairDeFace also visualizes the image regions that obfuscation and verification attacks focus on, showing not only which areas are changed most during obfuscation for some demographics, but also, through a comparison of the obfuscation and verification focus areas, why they failed.


Training a Generally Curious Agent

Tajwar, Fahim, Jiang, Yiding, Thankaraj, Abitha, Rahman, Sumaita Sadia, Kolter, J Zico, Schneider, Jeff, Salakhutdinov, Ruslan

arXiv.org Artificial Intelligence

Efficient exploration is essential for intelligent systems interacting with their environment, but existing language models often fall short in scenarios that require strategic information gathering. In this paper, we present PAPRIKA, a fine-tuning approach that enables language models to develop general decision-making capabilities that are not confined to particular environments. By training on synthetic interaction data from different tasks that require diverse strategies, PAPRIKA teaches models to explore and adapt their behavior on a new task based on environment feedback in context, without further gradient updates. Experimental results show that models fine-tuned with PAPRIKA can effectively transfer their learned decision-making capabilities to entirely unseen tasks without additional training. Unlike traditional training, our approach's primary bottleneck lies in sampling useful interaction data rather than in model updates. To improve sample efficiency, we propose a curriculum learning strategy that prioritizes sampling trajectories from tasks with high learning potential. These results suggest a promising path towards AI systems that can autonomously solve novel sequential decision-making problems requiring interaction with the external world.
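The paper's exact curriculum criterion is not reproduced here, but the general idea of prioritized task sampling can be sketched as a softmax over per-task learning-potential scores. All task names and score values below are hypothetical, chosen only to make the behavior visible.

```python
import math
import random

def softmax(scores, temperature=1.0):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp((s - m) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def sample_task(task_scores, rng):
    """Sample one task name with probability proportional to softmax(score)."""
    names = list(task_scores)
    probs = softmax([task_scores[n] for n in names])
    r, acc = rng.random(), 0.0
    for name, p in zip(names, probs):
        acc += p
        if r <= acc:
            return name
    return names[-1]

# Hypothetical learning-potential scores: tasks the model has mastered or
# cannot yet learn from score low; tasks at the learning frontier score high.
scores = {"easy_solved": -2.0, "frontier": 2.0, "too_hard": 0.0}
rng = random.Random(0)
counts = {k: 0 for k in scores}
for _ in range(10000):
    counts[sample_task(scores, rng)] += 1
# counts now heavily favors "frontier" over the other two tasks
```

A sampler like this concentrates the (expensive) trajectory collection on tasks where the model still has something to learn, which is the sample-efficiency motivation the abstract gives.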


Challenging the Performance-Interpretability Trade-off: An Evaluation of Interpretable Machine Learning Models

Kruschel, Sven, Hambauer, Nico, Weinzierl, Sven, Zilker, Sandra, Kraus, Mathias, Zschech, Patrick

arXiv.org Artificial Intelligence

Machine learning is permeating every conceivable domain to promote data-driven decision support. The focus is often on advanced black-box models due to their assumed performance advantages, whereas interpretable models are often associated with inferior predictive qualities. More recently, however, a new generation of generalized additive models (GAMs) has been proposed that offer promising properties for capturing complex, non-linear patterns while remaining fully interpretable. To uncover the merits and limitations of these models, this study examines the predictive performance of seven different GAMs in comparison to seven commonly used machine learning models based on a collection of twenty tabular benchmark datasets. To ensure a fair and robust model comparison, an extensive hyperparameter search combined with cross-validation was performed, resulting in 68,500 model runs. In addition, this study qualitatively examines the visual output of the models to assess their level of interpretability. Based on these results, the paper dispels the misconception that only black-box models can achieve high accuracy by demonstrating that there is no strict trade-off between predictive performance and model interpretability for tabular data. Furthermore, the paper discusses the importance of GAMs as powerful interpretable models for the field of information systems and derives implications for future work from a socio-technical perspective.
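As a rough illustration of the additive structure that makes GAMs interpretable, here is a minimal backfitting sketch with crude binned-mean smoothers standing in for the spline-based smoothers real GAM libraries use. All names and the toy dataset are hypothetical; the point is only that the fitted model decomposes into one inspectable function per feature.

```python
import random

def bin_smoother(x, r, n_bins=10):
    """Crude 1-D smoother: average the residuals r within equal-width bins of x."""
    lo, hi = min(x), max(x)
    width = (hi - lo) / n_bins or 1.0
    sums, counts = [0.0] * n_bins, [0] * n_bins
    for xi, ri in zip(x, r):
        j = min(int((xi - lo) / width), n_bins - 1)
        sums[j] += ri
        counts[j] += 1
    means = [s / c if c else 0.0 for s, c in zip(sums, counts)]
    def f(xi):
        j = min(int((xi - lo) / width), n_bins - 1)
        return means[j]
    return f

def fit_gam(X, y, n_iters=20):
    """Backfitting: y ~ intercept + f_1(x_1) + ... + f_d(x_d)."""
    n, d = len(y), len(X[0])
    intercept = sum(y) / n
    fs = [lambda _: 0.0] * d
    for _ in range(n_iters):
        for j in range(d):
            # Partial residual: remove everything except feature j's effect.
            partial = [y[i] - intercept
                       - sum(fs[k](X[i][k]) for k in range(d) if k != j)
                       for i in range(n)]
            fs[j] = bin_smoother([row[j] for row in X], partial)
    def predict(row):
        return intercept + sum(fs[j](row[j]) for j in range(d))
    return predict

# Toy additive data: a quadratic bump in x1 plus a linear trend in x2.
rng = random.Random(0)
X = [[rng.uniform(0, 1), rng.uniform(0, 1)] for _ in range(500)]
y = [4 * (x1 - 0.5) ** 2 + 2 * x2 + rng.gauss(0, 0.05) for x1, x2 in X]
predict = fit_gam(X, y)
mse = sum((predict(row) - yi) ** 2 for row, yi in zip(X, y)) / len(y)
```

Because each f_j is a plain function of one feature, its shape can be plotted and read directly, which is exactly the visual-output interpretability the study assesses.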


Exploratory Analysis and Augmentation of NSL-KDD Data Using Deep Generative Adversarial Networks to Improve the Performance of the Extreme Gradient Boosting Algorithm in Classifying Cyber Attack Types

Santoso, K. P., Madany, F. A., Suryotrisongko, H.

arXiv.org Artificial Intelligence

This study proposes the implementation of Deep Generative Adversarial Networks (GANs) for augmenting the NSL-KDD dataset. The primary objective is to enhance the efficacy of eXtreme Gradient Boosting (XGBoost) in the classification of cyber attacks on the NSL-KDD dataset. The proposed method achieved an accuracy of 99.53% using the XGBoost model without GAN-based data augmentation, and 99.78% with it.


Gradient Boosting Hyperparameter Tuning: Classifier Example

#artificialintelligence

Many machine learning algorithms end up producing only a weak model. You try other algorithms and still get weak results. What if there were a method to combine all of those weak models into a strong one? Boosting is exactly that: an ensemble method that aggregates many weak models into a single, stronger model. After reading this post you will understand how boosting achieves this and know how to tune the Gradient Boosting hyperparameters.
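To make the weak-to-strong idea concrete, here is a from-scratch sketch (not the scikit-learn API) of gradient boosting for regression, using one-split decision stumps as the weak learners. The function names and toy dataset are illustrative only: each round fits a stump to the residual errors of the current ensemble and adds a shrunken copy of it.

```python
# A from-scratch illustration of boosting: each decision stump alone is a
# weak model, but summing many of them (each fit to the remaining residuals,
# scaled by a learning rate) yields a strong model.

def fit_stump(x, residuals):
    """Find the single threshold split on x that best fits the residuals."""
    best = None
    for t in sorted(set(x)):
        left = [r for xi, r in zip(x, residuals) if xi <= t]
        right = [r for xi, r in zip(x, residuals) if xi > t]
        if not left or not right:
            continue
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        sse = (sum((r - lm) ** 2 for r in left)
               + sum((r - rm) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, t, lm, rm)
    _, t, lm, rm = best
    return lambda xi: lm if xi <= t else rm

def gradient_boost(x, y, n_rounds=50, learning_rate=0.3):
    """Sequentially fit stumps to residuals; return the ensemble predictor."""
    pred = [0.0] * len(y)
    stumps = []
    for _ in range(n_rounds):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        stump = fit_stump(x, residuals)
        stumps.append(stump)
        pred = [pi + learning_rate * stump(xi) for pi, xi in zip(pred, x)]
    return lambda xi: sum(learning_rate * s(xi) for s in stumps)

x = [0, 1, 2, 3, 4, 5, 6, 7]
y = [0, 0, 1, 1, 3, 3, 4, 4]          # a step-shaped target
model = gradient_boost(x, y)
mse = sum((model(xi) - yi) ** 2 for xi, yi in zip(x, y)) / len(y)
```

The two knobs in `gradient_boost` correspond directly to the hyperparameters you tune in real libraries: the number of rounds (`n_estimators`) and the shrinkage (`learning_rate`), which trade off against each other.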


Survival regression with accelerated failure time model in XGBoost

Barnwal, Avinash, Cho, Hyunsu, Hocking, Toby Dylan

arXiv.org Machine Learning

Survival regression is used to estimate the relation between time-to-event and feature variables, and is important in application domains such as medicine, marketing, risk management and sales management. Nonlinear tree based machine learning algorithms as implemented in libraries such as XGBoost, scikit-learn, LightGBM, and CatBoost are often more accurate in practice than linear models. However, existing implementations of tree-based models have offered limited support for survival regression. In this work, we propose and implement loss functions for learning accelerated failure time (AFT) models in XGBoost, to increase the support for survival modeling for different kinds of label censoring. The AFT model assumes effects that directly accelerate or decelerate the survival time for different kinds of censored data sets. We demonstrate with real and simulated experiments the effectiveness of AFT in XGBoost with respect to a number of baselines, in two respects: generalization performance and training speed. Furthermore, we take advantage of the support for NVIDIA GPUs in XGBoost to achieve substantial speedups over multi-core CPUs. To our knowledge, our work is the first implementation of AFT that utilizes the processing power of NVIDIA GPUs.
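The likelihood terms an AFT loss optimizes can be sketched in plain Python. This is an illustrative sketch only, assuming a normal error distribution with unit scale; the function names (`aft_nll`, `norm_pdf`, `norm_cdf`) are my own and the actual XGBoost implementation differs in its details. The AFT model posits ln(T) = pred + σ·Z with Z ~ N(0, 1), and censoring determines which likelihood term applies.

```python
import math

def norm_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2 * math.pi)

def norm_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2)))

def aft_nll(pred, lower, upper, sigma=1.0):
    """Negative log-likelihood of one AFT observation with normal errors.

    lower/upper are the censoring bounds on the observed time:
      uncensored:      lower == upper == t   (density term)
      right-censored:  upper == inf          (survival term)
      interval:        0 < lower < upper < inf
    """
    if lower == upper:
        # Uncensored: density of t under ln(T) = pred + sigma * Z.
        z = (math.log(lower) - pred) / sigma
        return -math.log(norm_pdf(z) / (sigma * lower))
    # Censored: probability mass between the two bounds.
    z_lo = (math.log(lower) - pred) / sigma if lower > 0 else -math.inf
    z_hi = (math.log(upper) - pred) / sigma if upper < math.inf else math.inf
    cdf_hi = 1.0 if z_hi == math.inf else norm_cdf(z_hi)
    cdf_lo = 0.0 if z_lo == -math.inf else norm_cdf(z_lo)
    return -math.log(cdf_hi - cdf_lo)

# Uncensored at t = 5: the loss is smallest when pred is near ln(5).
# Right-censored at t = 5: raising pred (a longer predicted survival
# time) lowers the loss, since the event is known to occur after t = 5.
```

Treating every censoring pattern as an interval in this way is what lets one loss function cover uncensored, right-, left-, and interval-censored labels uniformly.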