
Hierarchical Optimal Transport for Document Representation

Neural Information Processing Systems

The ability to measure similarity between documents enables intelligent summarization and analysis of large corpora. Past distances between documents suffer from either an inability to incorporate semantic similarities between words or from scalability issues. As an alternative, we introduce hierarchical optimal transport as a meta-distance between documents, where documents are modeled as distributions over topics, which themselves are modeled as distributions over words. We then solve an optimal transport problem on the smaller topic space to compute a similarity score. We give conditions on the topics under which this construction defines a distance, and we relate it to the word mover's distance. We evaluate our technique for k-NN classification and show better interpretability and scalability with comparable performance to current methods at a fraction of the cost.
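As an illustration of the construction, the distance can be sketched with the POT optimal-transport library: topic-to-topic costs are themselves optimal-transport distances between the topics' word distributions over word embeddings, and the document distance is an optimal-transport problem over topic proportions. The names topic_word_dists, word_embeddings, and the normalized inputs below are illustrative assumptions, not the authors' released code.

    # Minimal sketch of hierarchical optimal transport between two documents.
    # Requires: pip install pot numpy
    import numpy as np
    import ot  # Python Optimal Transport (POT)

    def topic_cost_matrix(topic_word_dists, word_embeddings):
        """Cost between topics = OT distance between their word distributions."""
        word_cost = ot.dist(word_embeddings, word_embeddings)  # pairwise squared Euclidean
        k = topic_word_dists.shape[0]
        C = np.zeros((k, k))
        for i in range(k):
            for j in range(i + 1, k):
                # word mover's-style distance between topic i and topic j
                C[i, j] = C[j, i] = ot.emd2(topic_word_dists[i], topic_word_dists[j], word_cost)
        return C

    def hott_distance(doc1_topics, doc2_topics, topic_cost):
        """Document distance = optimal transport over topic proportions."""
        return ot.emd2(doc1_topics, doc2_topics, topic_cost)

    # Usage: topic_word_dists is (k_topics, vocab) with rows summing to 1,
    # word_embeddings is (vocab, dim), and doc*_topics are length-k topic
    # proportions from the topic model (also summing to 1).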


Discriminator optimal transport

Neural Information Processing Systems

We show that discriminator optimal transport improves the Inception Score and FID of unconditional GANs trained on CIFAR-10 and STL-10, as well as of a public pre-trained conditional GAN trained on ImageNet.
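The idea can be sketched as discriminator-guided refinement of a generated sample: move the sample toward a higher discriminator score while penalizing the distance to its starting point. The objective, step size, and the discriminator interface below are illustrative assumptions rather than the paper's exact procedure.

    # Hedged sketch of discriminator-guided sample refinement in the spirit of
    # discriminator optimal transport. `discriminator` is any trained critic
    # mapping images to real-valued scores (assumed interface).
    import torch

    def refine_samples(x_gen, discriminator, steps=10, eps=0.01, keep_close=1.0):
        x0 = x_gen.detach()
        x = x0.clone().requires_grad_(True)
        for _ in range(steps):
            # Trade off staying near the original sample against a higher critic score.
            proximity = (x - x0).flatten(1).norm(dim=1).sum()
            objective = keep_close * proximity - discriminator(x).sum()
            grad, = torch.autograd.grad(objective, x)
            x = (x - eps * grad).detach().requires_grad_(True)  # descend the objective
        return x.detach()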


Reviewer # 1 > It is better to add a section: comparison with related works, to highlight the main contributions

Neural Information Processing Systems

We wish to express our appreciation to the reviewers for their insightful comments on our paper. All responses are reflected in the camera-ready version. Thank you for the proposal. We are sorry that our writing is hard to follow. Thank you for the important comment.


Learning-Augmented Algorithms with Explicit Predictors

Neural Information Processing Systems

Recent advances in algorithmic design show how to utilize predictions obtained by machine learning models from past and present data. These approaches have demonstrated an enhancement in performance when the predictions are accurate, while also ensuring robustness by providing worst-case guarantees when predictions fail. In this paper we focus on online problems; prior research in this context has focused on a paradigm where the algorithms are oblivious to the predictors' design, treating them as a black box. In contrast, in this work we unpack the predictor and integrate the learning problem it gives rise to within the algorithmic challenge. In particular, we allow the predictor to learn as it receives larger parts of the input, with the ultimate goal of designing online learning algorithms specifically tailored for the algorithmic task at hand. Adopting this perspective, we focus on a number of fundamental problems, including caching and scheduling, which have been well studied in the black-box setting. For each of the problems, we introduce new algorithms that take advantage of explicit and carefully designed learning rules.
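As a toy illustration of the setting (not one of the paper's algorithms), a caching policy can consult an explicit predictor of each item's next request time and evict the item predicted to be requested furthest in the future, mimicking Belady's offline rule; predict_next_use is an assumed predictor that may keep learning as more of the request sequence arrives.

    # Minimal sketch of prediction-augmented caching with an explicit predictor.
    def run_cache(requests, capacity, predict_next_use):
        cache, misses = set(), 0
        for t, item in enumerate(requests):
            if item in cache:
                continue
            misses += 1
            if len(cache) >= capacity:
                # Evict the item whose next use is predicted to be furthest away.
                victim = max(cache, key=lambda held: predict_next_use(held, t))
                cache.remove(victim)
            cache.add(item)
        return misses

With a perfect predictor this recovers the offline-optimal eviction rule; with a poor one it degrades, which is exactly the accuracy/robustness trade-off described above.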


Extracting Training Data from Molecular Pre-trained Models

Neural Information Processing Systems

Graph Neural Networks (GNNs) have significantly advanced the field of drug discovery, enhancing the speed and efficiency of molecular identification. However, training these GNNs demands vast amounts of molecular data, which has spurred the emergence of collaborative model-sharing initiatives. These initiatives facilitate the sharing of molecular pre-trained models among organizations without exposing proprietary training data. Despite the benefits, these molecular pre-trained models may still pose privacy risks. For example, malicious adversaries could perform data extraction attacks to recover private training data, thereby threatening commercial secrets and collaborative trust.


Benchmarking the Attribution Quality of Vision Models
Robin Hesse, Simone Schaub-Meyer, Stefan Roth (Department of Computer Science, Technical University of Darmstadt)

Neural Information Processing Systems

Attribution maps are one of the most established tools to explain the functioning of computer vision models. They assign importance scores to input features, indicating how relevant each feature is for the prediction of a deep neural network. While much research has gone into proposing new attribution methods, their proper evaluation remains a difficult challenge. In this work, we propose a novel evaluation protocol that overcomes two fundamental limitations of the widely used incremental-deletion protocol, i.e., the out-of-domain issue and lacking inter-model comparisons. This allows us to evaluate 23 attribution methods and how different design choices of popular vision backbones affect their attribution quality. We find that intrinsically explainable models outperform standard models and that raw attribution values exhibit a higher attribution quality than what is known from previous work. Further, we show consistent changes in the attribution quality when varying the network design, indicating that some standard design choices promote attribution quality.
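For reference, the incremental-deletion protocol the abstract contrasts with can be sketched as follows: pixels are removed in order of decreasing attributed importance while the model's score for the target class is tracked. The fill value, step schedule, and channel-first layout below are assumptions, and replacing pixels with such a constant is precisely what creates the out-of-domain issue the paper addresses.

    # Hedged sketch of the standard incremental-deletion evaluation of an
    # attribution map (not the protocol proposed in the paper).
    # `model` maps a (1, C, H, W) batch to class scores; `attribution` is (H, W).
    import numpy as np

    def deletion_curve(model, image, attribution, target_class, steps=20, fill_value=0.0):
        h, w = attribution.shape
        order = np.argsort(attribution.ravel())[::-1]     # most important pixels first
        per_step = len(order) // steps
        x, scores = image.copy(), []
        for s in range(steps + 1):
            scores.append(float(model(x[None])[0, target_class]))
            idx = order[s * per_step:(s + 1) * per_step]
            rows, cols = np.unravel_index(idx, (h, w))
            x[..., rows, cols] = fill_value               # "delete" the next chunk of pixels
        return np.array(scores)  # a faster score drop indicates a more faithful attribution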


Audio-Driven Co-Speech Gesture Video Generation (Supplemental Document)

Neural Information Processing Systems

In this supplemental document, we introduce the following contents: 1) the proof of Theorem 1 (the unique Cholesky decomposition theorem); ...; 13) the licenses of existing assets involved in this paper. In the main paper, to ease the constraint in the quantization process, we use the unique Cholesky decomposition theorem [13] to transform the covariance matrix C into the factorial covariance L via Theorem 1. Because C has positive leading principal minors (i.e., it is positive definite), the diagonal entries of L are non-zero. The output of the GPT [10] model at the t-th time step is the probability of choosing each codebook entry, where the entry with the largest probability serves as the predicted motion code of the next time step.
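The theorem being invoked is the standard uniqueness result: a symmetric positive-definite covariance matrix C factors uniquely as C = L Lᵀ with L lower triangular and positive (hence non-zero) diagonal entries. A minimal numerical check with NumPy:

    # Numerical illustration of the unique Cholesky decomposition C = L @ L.T.
    import numpy as np

    rng = np.random.default_rng(0)
    A = rng.standard_normal((4, 4))
    C = A @ A.T + 4.0 * np.eye(4)       # a symmetric positive-definite covariance

    L = np.linalg.cholesky(C)           # lower-triangular factor
    assert np.allclose(L @ L.T, C)      # reconstructs the covariance
    assert np.all(np.diag(L) > 0)       # diagonal entries are positive, hence non-zero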


Learning from Bad Data via Generation

Neural Information Processing Systems

Bad training data hinder the learning model from understanding the underlying data-generating scheme, which in turn increases the difficulty of achieving satisfactory performance on unseen test data. We suppose the real data distribution lies in a distribution set supported by the empirical distribution of the bad data. A worst-case formulation can be developed over this distribution set and then interpreted as a generation task in an adversarial manner. The connections and differences between GANs and our framework are thoroughly discussed. We further theoretically show the influence of this generation task on learning from bad data and reveal its connection with a data-dependent regularization. Given different distance measures of distributions (e.g., Wasserstein distance or JS divergence), we can derive different objective functions for the problem. Experimental results on different kinds of bad training data demonstrate the necessity and effectiveness of the proposed method.
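The worst-case formulation over the distribution set is typically written in the following distributionally robust form (a sketch of the standard objective; the paper's Eq. (1), referenced in the rebuttal below, may differ in details), where \hat{P} is the empirical distribution of the bad training data, D is a distance between distributions such as the Wasserstein distance or the JS divergence, and \ell is the loss:

    \min_{\theta} \; \sup_{Q \,:\, D(Q, \hat{P}) \le \epsilon} \; \mathbb{E}_{x \sim Q}\big[\ell(\theta; x)\big]

In practice the radius constraint is often replaced by a Lagrangian penalty with multiplier λ, which matches the rebuttal's remark below that λ is tuned as a hyperparameter because ε is unknown.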


... presentation and fix all minor issues in the final version. ... distributions within the ball of an appropriate radius ε (see Eq. (1)), which could also include the unknown real distribution P.

Neural Information Processing Systems

We thank the reviewers for the constructive comments. First, generators in existing methods tend to fit the empirical distribution; given a bad training set, their generated data could be even worse. Second, these generators often produce "easy" samples ... Since ε is unknown, it is common to take λ as a hyperparameter to be tuned in experiments (e.g., ...). Moreover, the generator could conduct "data augmentation" for the ... We may thus obtain a slightly better result, e.g., ...


L_DMI: A Novel Information-theoretic Loss Function for Training Deep Nets Robust to Label Noise

Neural Information Processing Systems

Accurately annotating large-scale datasets is notoriously expensive in both time and money. Although acquiring low-quality annotated datasets can be much cheaper, using such datasets without particular treatment often badly damages the performance of trained models. Various methods have been proposed for learning with noisy labels. However, most methods only handle limited kinds of noise patterns, require auxiliary information or steps (e.g., knowing or estimating the noise transition matrix), or lack theoretical justification.