A Implementation of PS-CD Algorithm
In this section, we provide two different ways to prove Theorem 2. The first is more straightforward and directly differentiates through the relevant term; to avoid the issue this creates, we introduce a variational representation (Lemma 1), which follows from Jensen's inequality. The divergence introduced in Equation (9) in Section 2.3 is a direct consequence of Lemma 2, and can also be verified by checking the PS-CD objective directly. Lemma 3 treats the case −1 < γ < 0. We first make an assumption similar to the one used in [4, 47] (Assumption 1), which is typically easy to enforce in practice. We then analyze the convergence of the PS-CD algorithm presented in Algorithm 1, and Theorem 5 characterizes the convergence of Algorithm 2; its Monte Carlo estimation incurs additional approximation error.
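The Jensen step behind the variational representation can be sketched in generic form, with $w$ standing in for the (unrecoverable) density ratio of the original derivation: for any distribution $q$ and positive function $w$,

```latex
\log \mathbb{E}_{x \sim q}\left[ w(x) \right] \;\geq\; \mathbb{E}_{x \sim q}\left[ \log w(x) \right],
```

which follows from the concavity of $\log$ and replaces an intractable log-of-expectation with an expectation of logs that can be estimated by Monte Carlo sampling, at the cost of the additional approximation error noted above.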
Semi-supervised Multi-label Learning with Balanced Binary Angular Margin Loss
Semi-supervised multi-label learning (SSMLL) refers to inducing classifiers from a small number of multi-labeled samples together with many unlabeled samples. The prevalent solution forms pseudo-labels for the unlabeled samples and induces classifiers from both labeled and pseudo-labeled samples in a self-training manner. Unfortunately, with the commonly used binary type of loss and negative sampling, we have empirically found that learning with labeled and pseudo-labeled samples can result in a variance bias between the feature distributions of positive and negative samples for each label. To alleviate this problem, we aim to balance this variance bias from the perspective of the feature angle distribution for each label. Specifically, we extend the traditional binary angular margin loss to a balanced extension with feature angle distribution transformations under the Gaussian assumption, where the distributions are iteratively updated during classifier training. We also suggest an efficient prototype-based negative sampling method to maintain high-quality negative samples for each label.
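A standard binary angular margin loss, the starting point the abstract extends, can be sketched as follows. The margin and scale values, the prototype formulation, and the logistic link here are illustrative assumptions, not the paper's balanced variant:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def binary_angular_margin_loss(feature, prototype, is_positive,
                               margin=0.3, scale=16.0):
    """Illustrative binary angular margin loss for one label.

    The angle between the normalized feature and the label prototype is
    penalized with an additive angular margin for positive samples,
    pulling positives into a tighter cone around the prototype.
    """
    f = feature / np.linalg.norm(feature)
    p = prototype / np.linalg.norm(prototype)
    theta = np.arccos(np.clip(f @ p, -1.0, 1.0))
    if is_positive:
        logit = scale * np.cos(theta + margin)  # margin makes positives harder
        return -np.log(sigmoid(logit))
    logit = scale * np.cos(theta)
    return -np.log(1.0 - sigmoid(logit))

# A feature aligned with the prototype should incur a much smaller
# positive-class loss than an orthogonal one.
proto = np.array([1.0, 0.0])
aligned = binary_angular_margin_loss(np.array([2.0, 0.0]), proto, True)
orthogonal = binary_angular_margin_loss(np.array([0.0, 1.0]), proto, True)
```

The paper's balanced extension would additionally transform the per-label angle distributions under a Gaussian assumption; that step is omitted here.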
ML$^2$Tuner: Efficient Code Tuning via Multi-Level Machine Learning Models
Cha, JooHyoung, Lee, Munyoung, Kwon, Jinse, Lee, Jubin, Lee, Jemin, Kwon, Yongin
The increasing complexity of deep learning models necessitates specialized hardware and software optimizations, particularly for deep learning accelerators. Existing autotuning methods often suffer from prolonged tuning times due to profiling invalid configurations, which can cause runtime errors. We introduce ML$^2$Tuner, a multi-level machine learning tuning technique that enhances autotuning efficiency by incorporating a validity prediction model to filter out invalid configurations and an advanced performance prediction model utilizing hidden features from the compilation process. Experimental results on an extended VTA accelerator demonstrate that ML$^2$Tuner achieves equivalent performance improvements using only 12.3% of the samples required by a similar approach in TVM, and reduces invalid profiling attempts by an average of 60.8%, highlighting its potential to enhance autotuning performance by filtering out invalid configurations.
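The two-level idea of filtering invalid configurations before expensive profiling can be sketched as below. The validity oracle, the tile-size configuration space, the threshold predictor, and the profiling stub are all placeholder assumptions, not ML$^2$Tuner's actual models:

```python
import numpy as np

def is_valid(cfg):
    """Ground-truth validity oracle (stand-in for the compiler/accelerator):
    a config is invalid when its two tile sizes exceed a resource budget."""
    return cfg[0] * cfg[1] <= 64

def profile(cfg):
    """Stand-in for expensive hardware profiling (higher is better)."""
    return -abs(cfg[0] - 8) - abs(cfg[1] - 4)

# Level 1: fit a validity predictor from previously labeled configs;
# here, a threshold on the tile-size product learned from the labels.
history = [(x, y) for x in range(1, 17) for y in range(1, 17)]
budget = max(x * y for (x, y) in history if is_valid((x, y)))
predict_valid = lambda cfg: cfg[0] * cfg[1] <= budget

# Level 2: profile only the configurations predicted to be valid,
# avoiding wasted runs that would fail at execution time.
rng = np.random.default_rng(0)
candidates = [tuple(rng.integers(1, 17, size=2)) for _ in range(50)]
kept = [c for c in candidates if predict_valid(c)]
best = max(kept, key=profile)
```

In the paper's setting the predictor is a learned model over compiler features rather than a hand-set threshold, but the control flow is the same: predict validity first, profile second.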
Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
Jang, Kyoungseok, Zhang, Chicheng, Jun, Kwang-Sung
We study low-rank matrix trace regression and the related problem of low-rank matrix bandits. Assuming access to the distribution of the covariates, we propose a novel low-rank matrix estimation method called LowPopArt and provide its recovery guarantee that depends on a novel quantity denoted by B(Q) that characterizes the hardness of the problem, where Q is the covariance matrix of the measurement distribution. We show that our method can provide tighter recovery guarantees than classical nuclear norm penalized least squares (Koltchinskii et al., 2011) in several problems. To perform efficient estimation with a limited number of measurements from an arbitrarily given measurement set A, we also propose a novel experimental design criterion that minimizes B(Q) with computational efficiency. We leverage our novel estimator and design of experiments to derive two low-rank linear bandit algorithms for general arm sets that enjoy improved regret upper bounds. This improves over previous works on low-rank bandits, which make somewhat restrictive assumptions that the arm set is the unit ball or that an efficient exploration distribution is given. To our knowledge, our experimental design criterion is the first one tailored to low-rank matrix estimation beyond the naive reduction to linear regression, which can be of independent interest.
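The nuclear norm penalized least squares baseline the abstract compares against can be illustrated, in its simplest denoising form, by singular value soft-thresholding. This is a generic sketch of that classical estimator, not LowPopArt itself; the dimensions, rank, and regularization level are arbitrary choices:

```python
import numpy as np

def svd_soft_threshold(Y, lam):
    """Proximal operator of the nuclear norm: shrink the singular values
    of Y by lam. This solves argmin_X 0.5*||X - Y||_F^2 + lam*||X||_*,
    the workhorse step of nuclear-norm penalized low-rank estimation."""
    U, s, Vt = np.linalg.svd(Y, full_matrices=False)
    return (U * np.maximum(s - lam, 0.0)) @ Vt

rng = np.random.default_rng(0)
# Rank-2 ground truth observed under additive noise.
A = rng.standard_normal((20, 2)) @ rng.standard_normal((2, 20))
Y = A + 0.1 * rng.standard_normal((20, 20))
X_hat = svd_soft_threshold(Y, lam=1.0)
```

The thresholding zeroes out the small noise-level singular values while retaining the dominant signal directions, producing a low-rank estimate of the ground truth.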
Leveraging a Probabilistic PCA Model to Understand the Multivariate Statistical Network Monitoring Framework for Network Security Anomaly Detection
Pérez-Bueno, Fernando, García, Luz, Maciá-Fernández, Gabriel, Molina, Rafael
Network anomaly detection is a highly relevant research area, especially due to its multiple applications in the field of network security. The boost of new models based on variational autoencoders and generative adversarial networks has motivated a reevaluation of traditional techniques for anomaly detection. It is, however, essential to be able to understand these new models from the perspective of the experience attained from years of evaluating network security data for anomaly detection. In this paper, we revisit anomaly detection techniques based on PCA from a probabilistic generative model point of view, and contribute a mathematical model that relates them. Specifically, we start with the probabilistic PCA model and explain its connection to the Multivariate Statistical Network Monitoring (MSNM) framework. MSNM was recently successfully proposed as a means of incorporating industrial process anomaly detection experience into the field of networking. We have evaluated the mathematical model using two different datasets: the first is a synthetic dataset created to better understand the proposed analysis, and the second, UGR'16, is a real-traffic dataset specifically designed for network security anomaly detection. We have drawn conclusions that we consider useful when applying generative models to network security detection.
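PCA-based monitoring of the kind MSNM builds on flags anomalies via the squared prediction error (Q-statistic) of a sample against the fitted principal subspace. A minimal sketch, with synthetic data and dimensions chosen purely for illustration:

```python
import numpy as np

def pca_q_statistic(X_train, X_test, n_components=2):
    """Q-statistic (squared prediction error) of each test sample
    against a PCA subspace fit on training data; large values flag
    anomalies that leave the subspace of normal behavior."""
    mu = X_train.mean(axis=0)
    # Principal directions from the SVD of the centered training data.
    _, _, Vt = np.linalg.svd(X_train - mu, full_matrices=False)
    P = Vt[:n_components].T                      # loadings, shape (d, k)
    residual = (X_test - mu) @ (np.eye(X_train.shape[1]) - P @ P.T)
    return np.sum(residual**2, axis=1)

rng = np.random.default_rng(0)
# Normal traffic lives near a 2-D subspace of a 5-D feature space.
W = rng.standard_normal((2, 5))
normal = rng.standard_normal((200, 2)) @ W + 0.05 * rng.standard_normal((200, 5))
anomaly = 5.0 * rng.standard_normal((1, 5))      # generic off-subspace point
q = pca_q_statistic(normal, np.vstack([normal[:5], anomaly]))
```

MSNM additionally monitors within-subspace deviations with Hotelling's T² statistic; only the residual statistic is shown here.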