Goto

Collaborating Authors

 ppd



Eliciting Categorical Data for Optimal Aggregation

Chien-Ju Ho, Rafael Frongillo, Yiling Chen

Neural Information Processing Systems

Models for collecting and aggregating categorical data on crowdsourcing platforms typically fall into two broad categories: those assuming agents honest and consistent but with heterogeneous error rates, and those assuming agents strategic and seek to maximize their expected reward. The former often leads to tractable aggregation of elicited data, while the latter usually focuses on optimal elicitation and does not consider aggregation. In this paper, we develop a Bayesian model, wherein agents have differing quality of information, but also respond to incentives. Our model generalizes both categories and enables the joint exploration of optimal elicitation and aggregation. This model enables our exploration, both analytically and experimentally, of optimal aggregation of categorical data and optimal multiple-choice interface design.


Your eyes can only handle so much HDTV

Popular Science

More pixels doesn't always mean a better screen. Breakthroughs, discoveries, and DIY tips sent every weekday. Every year, tech and television companies boast their products' latest and greatest, highest-resolution displays. The 4K display--a screen with a horizontal display of approximately 4,000 pixels-- first became widely available around 2014. Barely a decade later, you can purchase a TV with double the resolution .


Detecting and explaining postpartum depression in real-time with generative artificial intelligence

García-Méndez, Silvia, de Arriba-Pérez, Francisco

arXiv.org Artificial Intelligence

Among the many challenges mothers undergo after childbirth, postpartum depression ( ppd) is a severe condition that significantly impacts their mental and physical well-being. Consequently, the rapid detection of ppd and their associated risk factors is critical for in-time assessment and intervention through specialized prevention procedures. Accordingly, this work addresses the need to help practitioners make decisions with the latest technological advancements to enable real-time screening and treatment recommendations. Mainly, our work contributes to an intelligent ppd screening system that combines Natural Language Processing, Machine Learning ( ml), and Large Language Models ( llm s) towards an affordable, real-time, and non-invasive free speech analysis. Moreover, it addresses the black box problem since the predictions are described to the end users thanks to the combination of llm s with interpretable ml models ( i.e., tree-based algorithms) using feature importance and natural language. The results obtained are 90 % on ppd detection for all evaluation metrics, outperforming the competing solutions in the literature. Ultimately, our solution contributes to the rapid detection of ppd and their associated risk factors, critical for in-time and proper assessment and intervention. Introduction Depression is a global public health concern that affects more than 150 million people, being more prevalent in women (Labaka et al., 2018; Moreira et al., 2019). Among the many challenges mothers undergo after childbirth, postpartum depression ( ppd) is a severe condition that usually requires medical intervention (Falana & Carrington, 2019). Mainly, ppd is a common non-psychotic mental disorder during the first year after childbirth that can lead to severe complications in the women's health (Abadiga, 2019). Current data indicates that between 10 % to 15 % of mothers worldwide are affected with ppd yearly (Fatima et al., 2019; Liu et al., 2023). Moreover, only 20% of the target population is diagnosed or even treated promptly (Mazumder & Baruah, 2021).


Topological Social Choice: Designing a Noise-Robust Polar Distance for Persistence Diagrams

Andrikopoulos, Athanasios, Sampanis, Nikolaos

arXiv.org Artificial Intelligence

Topological Data Analysis (TDA) has emerged as a powerful framework for extracting robust and interpretable features from noisy high-dimensional data. In the context of Social Choice Theory, where preference profiles and collective decisions are geometrically rich yet sensitive to perturbations, TDA remains largely unexplored. This work introduces a novel conceptual bridge between these domains by proposing a new metric framework for persistence diagrams tailored to noisy preference data.We define a polar coordinate-based distance that captures both the magnitude and orientation of topological features in a smooth and differentiable manner. Our metric addresses key limitations of classical distances, such as bottleneck and Wasserstein, including instability under perturbation, lack of continuity, and incompatibility with gradient-based learning. The resulting formulation offers improved behavior in both theoretical and applied settings.To the best of our knowledge, this is the first study to systematically apply persistent homology to social choice systems, providing a mathematically grounded method for comparing topological summaries of voting structures and preference dynamics. We demonstrate the superiority of our approach through extensive experiments, including robustness tests and supervised learning tasks, and we propose a modular pipeline for building predictive models from online preference data. This work contributes a conceptually novel and computationally effective tool to the emerging interface of topology and decision theory, opening new directions in interpretable machine learning for political and economic systems.


Evasion Attacks Against Bayesian Predictive Models

Arce, Pablo G., Naveiro, Roi, Insua, David Ríos

arXiv.org Machine Learning

There is an increasing interest in analyzing the behavior of machine learning systems against adversarial attacks. However, most of the research in adversarial machine learning has focused on studying weaknesses against evasion or poisoning attacks to predictive models in classical setups, with the susceptibility of Bayesian predictive models to attacks remaining underexplored. This paper introduces a general methodology for designing optimal evasion attacks against such models. We investigate two adversarial objectives: perturbing specific point predictions and altering the entire posterior predictive distribution. For both scenarios, we propose novel gradient-based attacks and study their implementation and properties in various computational setups.


Uncertainty Quantification for Prior-Data Fitted Networks using Martingale Posteriors

Nagler, Thomas, Rügamer, David

arXiv.org Machine Learning

Prior-data fitted networks (PFNs) have emerged as promising foundation models for prediction from tabular data sets, achieving state-of-the-art performance on small to moderate data sizes without tuning. While PFNs are motivated by Bayesian ideas, they do not provide any uncertainty quantification for predictive means, quantiles, or similar quantities. We propose a principled and efficient sampling procedure to construct Bayesian posteriors for such estimates based on Martingale posteriors, and prove its convergence. Several simulated and real-world data examples showcase the uncertainty quantification of our method in inference applications.


Temperature Optimization for Bayesian Deep Learning

Ng, Kenyon, van der Heide, Chris, Hodgkinson, Liam, Wei, Susan

arXiv.org Machine Learning

The Cold Posterior Effect (CPE) is a phenomenon in Bayesian Deep Learning (BDL), where tempering the posterior to a cold temperature often improves the predictive performance of the posterior predictive distribution (PPD). Although the term `CPE' suggests colder temperatures are inherently better, the BDL community increasingly recognizes that this is not always the case. Despite this, there remains no systematic method for finding the optimal temperature beyond grid search. In this work, we propose a data-driven approach to select the temperature that maximizes test log-predictive density, treating the temperature as a model parameter and estimating it directly from the data. We empirically demonstrate that our method performs comparably to grid search, at a fraction of the cost, across both regression and classification tasks. Finally, we highlight the differing perspectives on CPE between the BDL and Generalized Bayes communities: while the former primarily focuses on predictive performance of the PPD, the latter emphasizes calibrated uncertainty and robustness to model misspecification; these distinct objectives lead to different temperature preferences.


Proximal Policy Distillation

Spigler, Giacomo

arXiv.org Artificial Intelligence

We introduce Proximal Policy Distillation (PPD), a novel policy distillation method that integrates student-driven distillation and Proximal Policy Optimization (PPO) to increase sample efficiency and to leverage the additional rewards that the student policy collects during distillation. To assess the efficacy of our method, we compare PPD with two common alternatives, student-distill and teacher-distill, over a wide range of reinforcement learning environments that include discrete actions and continuous control (ATARI, Mujoco, and Procgen). For each environment and method, we perform distillation to a set of target student neural networks that are smaller, identical (self-distillation), or larger than the teacher network. Our findings indicate that PPD improves sample efficiency and produces better student policies compared to typical policy distillation approaches. Moreover, PPD demonstrates greater robustness than alternative methods when distilling policies from imperfect demonstrations. The code for the paper is released as part of a new Python library built on top of stable-baselines3 to facilitate policy distillation: 'sb3-distill'.


Generative vs. Discriminative modeling under the lens of uncertainty quantification

Argouarc'h, Elouan, Desbouvries, François, Barat, Eric, Kawasaki, Eiji

arXiv.org Machine Learning

Learning a parametric model from a given dataset indeed enables to capture intrinsic dependencies between random variables via a parametric conditional probability distribution and in turn predict the value of a label variable given observed variables. In this paper, we undertake a comparative analysis of generative and discriminative approaches which differ in their construction and the structure of the underlying inference problem. Our objective is to compare the ability of both approaches to leverage information from various sources in an epistemic uncertainty aware inference via the posterior predictive distribution. We assess the role of a prior distribution, explicit in the generative case and implicit in the discriminative case, leading to a discussion about discriminative models suffering from imbalanced dataset. We next examine the double role played by the observed variables in the generative case, and discuss the compatibility of both approaches with semi-supervised learning. We also provide with practical insights and we examine how the modeling choice impacts the sampling from the posterior predictive distribution. With regard to this, we propose a general sampling scheme enabling supervised learning for both approaches, as well as semi-supervised learning when compatible with the considered modeling approach. Throughout this paper, we illustrate our arguments and conclusions using the example of affine regression, and validate our comparative analysis through classification simulations using neural network based models.