AITopics | Perrault-Joncas, Dominique

Collaborating Authors

Perrault-Joncas, Dominique

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

C-3DPO: Constrained Controlled Classification for Direct Preference Optimization

Asadi, Kavosh, Han, Julien, Xu, Xingzi, Perrault-Joncas, Dominique, Sabach, Shoham, Bouyarmane, Karim, Ghavamzadeh, Mohammad

arXiv.org Artificial IntelligenceFeb-21-2025

Direct preference optimization (DPO)-style algorithms have emerged as a promising approach for solving the alignment problem in AI. We present a novel perspective that formulates these algorithms as implicit classification algorithms. This classification framework enables us to recover many variants of DPO-style algorithms by choosing appropriate classification labels and loss functions. We then leverage this classification framework to demonstrate that the underlying problem solved in these algorithms is under-specified, making them susceptible to probability collapse of the winner-loser responses. We address this by proposing a set of constraints designed to control the movement of probability mass between the winner and loser in the reference and target policies. Our resulting algorithm, which we call Constrained Controlled Classification DPO (\texttt{C-3DPO}), has a meaningful RLHF interpretation. By hedging against probability collapse, \texttt{C-3DPO} provides practical improvements over vanilla \texttt{DPO} when aligning several large language models using standard preference datasets.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2502.17507

Country: North America > United States (0.14)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

LLMForecaster: Improving Seasonal Event Forecasts with Unstructured Textual Data

Zhang, Hanyu, Arvin, Chuck, Efimov, Dmitry, Mahoney, Michael W., Perrault-Joncas, Dominique, Ramasubramanian, Shankar, Wilson, Andrew Gordon, Wolff, Malcolm

arXiv.org Artificial IntelligenceDec-3-2024

Modern time-series forecasting models often fail to make full use of rich unstructured information about the time series themselves. This lack of proper conditioning can lead to obvious model failures; for example, models may be unaware of the details of a particular product, and hence fail to anticipate seasonal surges in customer demand in the lead up to major exogenous events like holidays for clearly relevant products. To address this shortcoming, this paper introduces a novel forecast post-processor -- which we call LLMForecaster -- that fine-tunes large language models (LLMs) to incorporate unstructured semantic and contextual information and historical data to improve the forecasts from an existing demand forecasting pipeline. In an industry-scale retail application, we demonstrate that our technique yields statistically significantly forecast improvements across several sets of products subject to holiday-driven demand surges.

forecasting, large language model, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2412.02525

Country: Asia > China (0.14)

Genre: Research Report > Experimental Study (0.70)

Industry: Energy > Power Industry (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Assessment of Treatment Effect Estimators for Heavy-Tailed Data

Tripuraneni, Nilesh, Madeka, Dhruv, Foster, Dean, Perrault-Joncas, Dominique, Jordan, Michael I.

arXiv.org Machine LearningDec-19-2021

A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance. In this paper, we provide a novel cross-validation-like methodology to address this challenge. The key insight of our procedure is that the noisy (but unbiased) difference-of-means estimate can be used as a ground truth "label" on a portion of the RCT, to test the performance of an estimator trained on the other portion. We combine this insight with an aggregation scheme, which borrows statistical strength across a large collection of RCTs, to present an end-to-end methodology for judging an estimator's ability to recover the underlying treatment effect. We evaluate our methodology across 709 RCTs implemented in the Amazon supply chain. In the corpus of AB tests at Amazon, we highlight the unique difficulties associated with recovering the treatment effect due to the heavy-tailed nature of the response variables. In this heavy-tailed setting, our methodology suggests that procedures that aggressively downweight or truncate large values, while introducing bias, lower the variance enough to ensure that the treatment effect is more accurately estimated.

artificial intelligence, estimator, machine learning, (15 more...)

arXiv.org Machine Learning

2112.07602

Country: North America > United States > California (0.14)

Genre: Research Report > Experimental Study (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Improved graph Laplacian via geometric self-consistency

Perrault-Joncas, Dominique, Meila, Marina

arXiv.org Machine LearningMay-31-2014

We address the problem of setting the kernel bandwidth used by Manifold Learning algorithms to construct the graph Laplacian. Exploiting the connection between manifold geometry, represented by the Riemannian metric, and the Laplace-Beltrami operator, we set the bandwidth by optimizing the Laplacian's ability to preserve the geometry of the data. Experiments show that this principled approach is effective and robust.

artificial intelligence, dimension, laplacian, (18 more...)

arXiv.org Machine Learning

1406.0118

Country: North America > United States (0.68)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback

Estimating Vector Fields on Manifolds and the Embedding of Directed Graphs

Perrault-Joncas, Dominique, Meila, Marina

arXiv.org Machine LearningMay-30-2014

This paper considers the problem of embedding directed graphs in Euclidean space while retaining directional information. We model a directed graph as a finite set of observations from a diffusion on a manifold endowed with a vector field. This is the first generative model of its kind for directed graphs. We introduce a graph embedding algorithm that estimates all three features of this model: the low-dimensional embedding of the manifold, the data density and the vector field. In the process, we also obtain new theoretical results on the limits of "Laplacian type" matrices derived from directed graphs. The application of our method to both artificially constructed and real data highlights its strengths.

artificial intelligence, graph, machine learning, (17 more...)

arXiv.org Machine Learning

1406.0013

Country: North America > United States > California (0.14)

Genre: Research Report (0.50)

Industry: Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

Building a Job Lanscape from Directional Transition Data

Perrault-Joncas, Dominique (University of Washington) | Meila, Marina (University of Washington) | Scott, Marc (New York University)

AAAI ConferencesNov-5-2010

The analysis of career paths suffers from a lack of exploratory tools and dynamic models, due in part to the inherent high dimensionality of the problem. Paths may be understood as directed traversals through a graph whose nodes consist of "job types," which we define as industry and occupation pairs. We want to develop tools to understand and detect high-level features of both the labor market and the workers moving through it — career dynamics. To do this, we map the discrete space of jobs into a d-dimensional continuous space; proximity between jobs will mean that they are "close" to each other in a non-negligible subset of career paths. This embedding allows one to visualize the job landscape. Moreover, we can map individual or groups of career paths to this space, extract features of their collective structure, and construct statistical tests comparing groups by means of this mapping.

artificial intelligence, machine learning, transition, (19 more...)

AAAI Conferences

2010 AAAI Fall Symposium Series

Country: North America > United States > California (0.14)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback