AITopics | Yeats, Eric

Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models

Zhang, Jingyang, Sun, Jingwei, Yeats, Eric, Ouyang, Yang, Kuo, Martin, Zhang, Jianyi, Yang, Hao Frank, Li, Hai

arXiv.org Artificial Intelligence, May-23-2024

Despite improved performance, existing methods (including the state-of-the-art Min-K%) are mostly built on simple heuristics and lack solid, reasonable foundations. In this work, we propose a novel and theoretically motivated methodology for pre-training data detection, named Min-K%++. Specifically, we present a key insight: through maximum likelihood training, training samples tend to be local maxima of the modeled distribution along each input dimension, which in turn allows us to translate the problem into the identification of local maxima. We then design a method that works under the discrete distribution modeled by LLMs, whose core idea is to determine whether the input forms a mode or has relatively high probability under the conditional categorical distribution. Empirically, the proposed method achieves new state-of-the-art (SOTA) performance across multiple settings. On the WikiMIA benchmark, Min-K%++ outperforms the runner-up by 6.2% to 10.5% in detection AUROC averaged over five models. On the more challenging MIMIR benchmark, it consistently improves upon reference-free methods while performing on par with the reference-based method, which requires an extra reference model.
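The abstract's core idea (checking whether each observed token is a mode, or at least relatively probable, under the model's conditional categorical distribution) can be illustrated with a small sketch. This listing does not give the paper's scoring formula, so the normalization below (token log-probability, centered and scaled by the mean and standard deviation of log-probability under the same conditional distribution) is an assumption, and the function names `token_scores` and `min_k_score` are hypothetical.

```python
import math


def token_scores(logits_per_step, token_ids):
    """Normalized log-probability of each observed token (illustrative only).

    logits_per_step[t] is a list of next-token logits over the vocabulary at
    position t; token_ids[t] is the token that actually appeared there.
    """
    scores = []
    for logits, tok in zip(logits_per_step, token_ids):
        # Numerically stable log-softmax over the vocabulary.
        m = max(logits)
        log_z = m + math.log(sum(math.exp(v - m) for v in logits))
        log_probs = [v - log_z for v in logits]
        probs = [math.exp(lp) for lp in log_probs]
        # Mean and standard deviation of log p under the conditional distribution.
        mu = sum(p * lp for p, lp in zip(probs, log_probs))
        var = sum(p * (lp - mu) ** 2 for p, lp in zip(probs, log_probs))
        sigma = math.sqrt(var)
        # Assumed normalization: positive when the token is more probable
        # than a typical draw from this distribution, negative otherwise.
        scores.append((log_probs[tok] - mu) / (sigma + 1e-12))
    return scores


def min_k_score(scores, k=0.2):
    """Average of the lowest k fraction of token scores (at least one token)."""
    n = max(1, int(len(scores) * k))
    lowest = sorted(scores)[:n]
    return sum(lowest) / n
```

Under these assumptions, a higher sequence score suggests the text sits closer to a mode of the modeled distribution; any decision threshold would have to be chosen empirically.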

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2404.02936

Country: Asia > Middle East > UAE (0.14)

Genre: Research Report (0.64)

Industry:

Information Technology > Security & Privacy (0.46)
Law (0.46)

Do Counterfactual Examples Complicate Adversarial Training?

Yeats, Eric, Darwin, Cameron, Ortega, Eduardo, Liu, Frank, Li, Hai

arXiv.org Artificial Intelligence, Apr-17-2024

We leverage diffusion models to study the robustness-performance tradeoff of robust classifiers. Our approach introduces a simple, pretrained diffusion method to generate low-norm counterfactual examples (CEs): semantically altered data that result in a different true class membership. We report that the confidence and accuracy of robust models on their clean training data are associated with the proximity of the data to their CEs. Moreover, robust models perform very poorly when evaluated on the CEs directly, as they become increasingly invariant to the low-norm, semantic changes brought by CEs. The results indicate a significant overlap between non-robust and semantic features, countering the common assumption that non-robust features are not interpretable.

artificial intelligence, classifier, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2404.10588

Country: North America > United States (0.47)

Genre: Research Report (1.00)

Industry: Government > Regional Government (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Adversarial Estimation of Topological Dimension with Harmonic Score Maps

Yeats, Eric, Darwin, Cameron, Liu, Frank, Li, Hai

arXiv.org Artificial Intelligence, Dec-11-2023

Quantification of the number of variables needed to locally explain complex data is often the first step to better understanding it. Existing techniques from intrinsic dimension estimation leverage statistical models to glean this information from samples within a neighborhood. However, existing methods often rely on well-picked hyperparameters and ample data as manifold dimension and curvature increase. Leveraging insight into the fixed point of the score matching objective as the score map is regularized by its Dirichlet energy, we show that it is possible to retrieve the topological dimension of the manifold learned by the score map. We then introduce a novel method to measure the learned manifold's topological dimension (i.e., local intrinsic dimension) using adversarial attacks, thereby generating useful interpretations of the learned manifold.

artificial intelligence, dimension, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2312.06869

Country:

North America > United States (0.68)
Asia > Middle East > Israel (0.14)

Genre: Research Report (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (0.68)
Government > Military (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.68)

Disentangling Learning Representations with Density Estimation

Yeats, Eric, Liu, Frank, Li, Hai

arXiv.org Artificial Intelligence, Feb-8-2023

Disentangled learning representations have promising utility in many applications, but they currently suffer from serious reliability issues. We present Gaussian Channel Autoencoder (GCAE), a method which achieves reliable disentanglement via flexible density estimation of the latent space. GCAE avoids the curse of dimensionality of density estimation by disentangling subsets of its latent space with the Dual Total Correlation (DTC) metric, thereby representing its high-dimensional latent joint distribution as a collection of many low-dimensional conditional distributions. In our experiments, GCAE achieves highly competitive and reliable disentanglement scores compared with state-of-the-art baselines.
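For readers unfamiliar with the metric named in this abstract, the dual total correlation of a latent vector is commonly defined as follows. The notation (z for the latent vector, d for its dimension, H for entropy) is introduced here for illustration and is not taken from the listing; how GCAE estimates or optimizes this quantity is not described on this page.

```latex
\mathrm{DTC}(z) \;=\; H(z) \;-\; \sum_{i=1}^{d} H\!\left(z_i \mid z_{\setminus i}\right),
\qquad z = (z_1, \dots, z_d)
```

The quantity is non-negative and equals zero when the latent elements are mutually independent, since then H(z) is the sum of the marginal entropies and each conditional entropy reduces to the corresponding marginal entropy.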

artificial intelligence, machine learning, representation, (18 more...)

arXiv.org Artificial Intelligence

2302.04362

Country: North America > United States (1.00)

Genre: Research Report > New Finding (0.34)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.91)

NashAE: Disentangling Representations through Adversarial Covariance Minimization

Yeats, Eric, Liu, Frank, Womble, David, Li, Hai

arXiv.org Artificial Intelligence, Sep-21-2022

We present a self-supervised method to disentangle factors of variation in high-dimensional data that does not rely on prior knowledge of the underlying variation profile (e.g., no assumptions on the number or distribution of the individual latent variables to be extracted). In this method, which we call NashAE, high-dimensional feature disentanglement is accomplished in the low-dimensional latent space of a standard autoencoder (AE) by promoting the discrepancy between each encoding element and information of the element recovered from all other encoding elements. Disentanglement is promoted efficiently by framing this as a minmax game between the AE and an ensemble of regression networks, each of which provides an estimate of an element conditioned on an observation of all other elements. We quantitatively compare our approach with leading disentanglement methods using existing disentanglement metrics. Furthermore, we show that NashAE has increased reliability and increased capacity to capture salient data characteristics in the learned latent representation.
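The two opposing objectives in the game described above can be sketched as a pair of batch-level quantities. This is a minimal illustration, not the paper's loss: it assumes the regression ensemble minimizes mean squared error while the autoencoder minimizes the covariance between each latent element and its prediction, and the function names `predictor_loss` and `adversarial_penalty` are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)


def covariance(xs, ys):
    """Population covariance of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


def predictor_loss(latents, predictions):
    """Mean squared error that the regression ensemble would minimize.

    latents[b][i] is element i of the encoding for sample b;
    predictions[b][i] is the estimate of that element made from all
    other elements of the same encoding.
    """
    n, d = len(latents), len(latents[0])
    total = sum(
        (latents[b][i] - predictions[b][i]) ** 2
        for b in range(n)
        for i in range(d)
    )
    return total / (n * d)


def adversarial_penalty(latents, predictions):
    """Average covariance between each element and its prediction.

    Assumed objective for the autoencoder: driving this toward zero makes
    each element hard to recover from the others.
    """
    d = len(latents[0])
    total = 0.0
    for i in range(d):
        column = [row[i] for row in latents]
        estimate = [row[i] for row in predictions]
        total += covariance(column, estimate)
    return total / d
```

In this sketch, fully redundant latents (where the predictors reproduce each element exactly) give a penalty equal to the average element variance, while latents that the predictors cannot track give a penalty near zero.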

artificial intelligence, machine learning, representation, (17 more...)

arXiv.org Artificial Intelligence

2209.10677

Country: North America > United States (1.00)

Genre: Research Report (0.50)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
