MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction

Neural Information Processing Systems

Unlike prior approaches constrained to fixed instrument classes, MGE-LDM learns a joint distribution over full mixtures, submixtures, and individual stems within a single compact latent diffusion model. At inference, MGE-LDM enables (1) complete mixture generation, (2) partial generation (i.e., source imputation), and (3) text-conditioned extraction of arbitrary sources. By formulating both separation and imputation as conditional inpainting tasks in the latent space, our approach supports flexible, class-agnostic manipulation of arbitrary instrument sources. Notably, MGE-LDM can be trained jointly across heterogeneous multi-track datasets (e.g., Slakh2100, MUSDB18, MoisesDB) without relying on predefined instrument categories. Audio samples are available at our project page.


T-norm Selection for Object Detection in Autonomous Driving with Logical Constraints

Neural Information Processing Systems

Integrating logical constraints into object detection models for autonomous driving (AD) is a promising way to enhance their compliance with rules and thus increase the safety of the system. In this setting, t-norms have been used to compute the constrained loss, i.e., to turn violations of the logical constraints into loss terms. While prior works have statically selected a few t-norms, we conduct an extensive experimental study to identify the most effective choices, as suboptimal t-norms can lead to undesired model behavior. For this, we present MOD-ECL, a neurosymbolic framework that implements a wide range of t-norms and can use them in an adaptive manner, with an algorithm that selects well-performing t-norms during training and a scheduler that regulates the impact of the constrained loss. We evaluate its effectiveness on the ROAD-R and ROAD-Waymo-R datasets for object detection in AD with attached common-sense constraints. Our results show that careful selection of parameters is crucial for good behavior of the constrained loss and that our framework allows us to obtain not only lower constraint violation but in some cases also an increase in detection performance. Furthermore, our methods allow fine control over the tradeoff between accuracy and violation.
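To make the idea of a t-norm-based constrained loss concrete, here is a minimal sketch under stated assumptions. It is not the MOD-ECL implementation: the function names, the clause encoding, and the example probabilities are all hypothetical. It assumes a constraint is written as a disjunction of literals, so that its degree of violation is the t-norm conjunction of the negated literals.

```python
from functools import reduce

# Three standard t-norms (fuzzy generalizations of logical AND).
def t_product(a, b):
    return a * b

def t_godel(a, b):
    return min(a, b)

def t_lukasiewicz(a, b):
    return max(0.0, a + b - 1.0)

T_NORMS = {
    "product": t_product,
    "godel": t_godel,
    "lukasiewicz": t_lukasiewicz,
}

def clause_violation(literal_probs, t_norm):
    """Degree to which a disjunction of literals is violated.

    literal_probs holds the predicted truth degree of each literal in the
    clause. The clause is violated when every literal is false, so the
    violation is the t-norm conjunction of the negated literals.
    """
    negated = [1.0 - p for p in literal_probs]
    return reduce(t_norm, negated)

# Hypothetical constraint: "not red_light OR not green_light".
# The detector predicts red_light = 0.9 and green_light = 0.8, so the
# two literals have truth degrees 0.1 and 0.2.
literals = [0.1, 0.2]
violations = {name: clause_violation(literals, t) for name, t in T_NORMS.items()}
```

For the same predictions, the three t-norms give different violation values (roughly 0.72, 0.8, and 0.7 here), which illustrates why the choice of t-norm changes the loss signal the detector receives.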


Put CASH on Bandits: A Max K-Armed Problem for Automated Machine Learning

Neural Information Processing Systems

The Combined Algorithm Selection and Hyperparameter optimization (CASH) is a challenging resource allocation problem in the field of AutoML. We propose MaxUCB, a max k-armed bandit method to trade off exploring different model classes and conducting hyperparameter optimization. MaxUCB is specifically designed for the light-tailed and bounded reward distributions arising in this setting and, thus, provides an efficient alternative compared to classic max k-armed bandit methods assuming heavy-tailed reward distributions. We theoretically and empirically evaluate our method on four standard AutoML benchmarks demonstrating superior performance over prior approaches.
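The max k-armed setting can be illustrated with a small sketch. This is a generic illustration, not the MaxUCB index from the paper: the exploration bonus `alpha * log(t) / n`, the parameter `alpha`, and the toy arms are assumptions chosen for simplicity. Each arm stands for one model class, and pulling it stands for running one more hyperparameter configuration and observing its validation score. Unlike a classic bandit, the index is built on the best reward seen per arm rather than the mean.

```python
import math
import random

def max_k_armed_bandit(arms, budget, alpha=0.5):
    """Allocate `budget` pulls across `arms`, tracking each arm's maximum reward.

    arms   : list of zero-argument callables returning a reward in [0, 1]
    alpha  : weight of the (hypothetical) exploration bonus
    Returns the pull counts and the best reward observed per arm.
    """
    pulls = [0] * len(arms)
    best = [-math.inf] * len(arms)
    for t in range(1, budget + 1):
        if t <= len(arms):
            # Initialization: pull every arm once.
            i = t - 1
        else:
            # Optimistic index: best observed reward plus an exploration bonus.
            i = max(
                range(len(arms)),
                key=lambda k: best[k] + alpha * math.log(t) / pulls[k],
            )
        reward = arms[i]()
        pulls[i] += 1
        best[i] = max(best[i], reward)
    return pulls, best

# Toy arms with bounded rewards: the second model class can reach higher scores.
rng = random.Random(0)
toy_arms = [lambda: rng.uniform(0.0, 0.6), lambda: rng.uniform(0.0, 0.9)]
pulls, best = max_k_armed_bandit(toy_arms, budget=50)
```

The quantity of interest at the end is `max(best)`, the single best configuration found, which is what distinguishes this objective from cumulative-reward bandits.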


How the Peter Thiel-Linked Dialog Club Secretly Ranks Its Members

WIRED

Leaked files show the invite-only network grades members by their money and fame, shaping who's in, who's out, and who pays. Dialog, the private network cofounded by Peter Thiel, grades its event attendees on a hidden scale, ranking them by wealth and fame, tracking their relationships, and using algorithms to help decide who they should meet, who they should sit with, and who no longer belongs, WIRED has learned. The records are part of a trove of internal data received by WIRED from a confidential source, containing the personal information of nearly 200 prominent people scheduled to attend the group's annual retreat this summer. The data includes home addresses, private phone numbers and email accounts, dates of birth, photos, and emergency contacts, as well as food allergies and the political leanings volunteered by some members. The records are distinct from a list of people affiliated with Dialog that was left exposed on the organization's website and has been circulating online since earlier this week--a looser directory that appears to include nonmembers, such as Maryland governor Wes Moore, a former event speaker, and other outside guests who passed through Dialog's orbit, in some cases years ago.


Bridging Brains and Concepts: Interpretable Visual Decoding from fMRI with Semantic Bottlenecks

Neural Information Processing Systems

Decoding of visual stimuli from noninvasive neuroimaging techniques such as functional magnetic resonance imaging (fMRI) has advanced rapidly in recent years; yet, most high-performing brain decoding models rely on complicated, non-interpretable latent spaces. In this study we present an interpretable brain decoding framework that inserts a semantic bottleneck into BrainDiffuser, a well-established, simple and linear decoding pipeline. We first produce a 214-dimensional binary interpretable space L for images, in which each dimension answers a specific question about the image (e.g., "Is there a person?",


Combining Discrete Adversarial Training for LLMs

Neural Information Processing Systems

Despite recent efforts in Large Language Model (LLM) safety and alignment, current adversarial attacks on frontier LLMs can still consistently force harmful generations. Although adversarial training has been widely studied and shown to significantly improve the robustness of traditional machine learning models, its strengths and weaknesses in the context of LLMs are less understood. Specifically, while existing discrete adversarial attacks are effective at producing harmful content, training LLMs with concrete adversarial prompts is often computationally expensive, leading to reliance on continuous relaxations. At the same time, despite their effectiveness and generalization capabilities, training with continuous perturbations does not always capture the full spectrum of vulnerabilities exploited by discrete attacks. In this work, we aim to bridge this gap by introducing MIXAT, a novel method that combines stronger discrete and faster continuous attacks during training. We rigorously evaluate MIXAT across a wide spectrum of state-of-the-art attacks, proposing the At Least One Attack Success Rate (ALO-ASR) metric to capture the worst-case vulnerability of models. We show MIXAT achieves substantially better robustness (ALO-ASR < 20%) compared to prior defenses (ALO-ASR > 50%), while maintaining a runtime comparable to methods based on continuous relaxations. We further analyze MIXAT in realistic deployment settings, exploring how chat templates, quantization, low-rank adapters, and temperature affect both adversarial training and evaluation, revealing additional blind spots in current methodologies. Our results demonstrate that MIXAT's discrete-continuous defense offers a principled and superior robustness-accuracy tradeoff with minimal computational overhead, highlighting its promise for building safer LLMs.
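The ALO-ASR metric can be sketched in a few lines. This is one plausible reading of "at least one attack success rate", not the authors' evaluation code: it assumes a prompt counts as broken if any attack in the suite succeeds on it, and the attack names and outcomes below are made up for illustration.

```python
def attack_success_rate(successes):
    """Fraction of prompts on which a single attack succeeded."""
    return sum(successes) / len(successes)

def alo_asr(results):
    """At-least-one attack success rate over a suite of attacks.

    results maps an attack name to a list of booleans, one per prompt,
    with prompts in the same order for every attack. A prompt is counted
    as broken if at least one attack succeeded on it.
    """
    per_prompt = zip(*results.values())
    broken = [any(outcomes) for outcomes in per_prompt]
    return sum(broken) / len(broken)

# Hypothetical outcomes for three attacks on the same four prompts.
results = {
    "attack_a": [True, False, False, False],
    "attack_b": [False, True, False, False],
    "attack_c": [False, False, False, False],
}
individual = {name: attack_success_rate(s) for name, s in results.items()}
worst_case = alo_asr(results)
```

In this toy example no single attack exceeds a 25% success rate, yet half of the prompts are broken by some attack, which is the gap a worst-case metric is meant to expose.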


Dems raked in millions from employees at firms newly identified as 'Chinese military companies'

FOX News



Is Putin Finally Feeling Pressure?

The New Yorker

The Russian President is facing growing domestic discontent after a series of successful attacks by the Ukrainian Army, including a major attack on Moscow. The war in Ukraine, which not long ago seemed to be turning in favor of Vladimir Putin's invading Russian Army, appears to have undergone another reversal. Thanks in part to their drone campaign, the Ukrainians have, according to some analysts, "turned the tide," putting pressure on Putin to potentially accept a ceasefire in the coming months. At the same time, there have been bubbles of discontent forming within Russia, over the cost of the war and government crackdowns on internet access. To understand what might be happening in Russia, and how the Putin regime might respond, I recently e-mailed several rounds of questions to Tatiana Stanovaya, a senior fellow at the Carnegie Russia Eurasia Center, and the founder of the political analysis organization R.Politik. Our conversation, edited for length and clarity, is below.


Stochastic Gradients under Nuisances

Neural Information Processing Systems

Stochastic gradient optimization is the dominant learning paradigm for a variety of scenarios, from classical supervised learning to modern self-supervised learning. We consider stochastic gradient algorithms for learning problems whose objectives rely on unknown nuisance parameters, and establish non-asymptotic convergence guarantees. Our results show that, while the presence of a nuisance can alter the optimum and upset the optimization trajectory, the classical stochastic gradient algorithm may still converge under appropriate conditions, such as Neyman orthogonality. Moreover, even when Neyman orthogonality is not satisfied, we show that an algorithm variant with approximately orthogonalized updates (with an approximately orthogonalized gradient oracle) may achieve similar convergence rates. Examples from orthogonal statistical learning/double machine learning and causal inference are discussed.


Distributive Fairness in Large Language Models: Evaluating Alignment with Human Values

Neural Information Processing Systems

The growing interest in employing large language models (LLMs) for decision-making in social and economic contexts has raised questions about their potential to function as agents in these domains. A significant number of societal problems involve the distribution of resources, where fairness, along with economic efficiency, plays a critical role in the desirability of outcomes. In this paper, we examine whether LLM responses adhere to fundamental fairness concepts such as equitability, envy-freeness, and Rawlsian maximin, and investigate their alignment with human preferences. We evaluate the performance of several LLMs, providing a comparative benchmark of their ability to reflect these measures. Our results demonstrate a lack of alignment between current LLM responses and human distributional preferences. Moreover, LLMs are unable to utilize money as a transferable resource to mitigate inequality. Nonetheless, we demonstrate a stark contrast when (some) LLMs are tasked with selecting from a predefined menu of options rather than generating one. In addition, we analyze the robustness of LLM responses to variations in semantic factors (e.g., intentions or personas) or non-semantic prompting changes (e.g., templates or orderings). Finally, we highlight potential strategies aimed at enhancing the alignment of LLM behavior with well-established fairness concepts.
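The three fairness concepts named above have simple textbook definitions for indivisible goods with additive valuations, sketched below. This is an illustrative sketch only, not the paper's benchmark: the function names, the additive-valuation assumption, and the example numbers are hypothetical, and money transfers are not modeled.

```python
from itertools import product

def bundle_value(values, bundle):
    """Additive value of a set of goods for one agent."""
    return sum(values[g] for g in bundle)

def is_envy_free(valuations, allocation):
    """No agent values another agent's bundle more than their own."""
    return all(
        bundle_value(v, allocation[i]) >= bundle_value(v, other)
        for i, v in enumerate(valuations)
        for other in allocation
    )

def is_equitable(valuations, allocation):
    """Every agent derives the same value from their own bundle."""
    own = [bundle_value(v, allocation[i]) for i, v in enumerate(valuations)]
    return len(set(own)) == 1

def maximin_allocation(valuations):
    """Brute-force Rawlsian maximin: maximize the worst-off agent's value."""
    n_agents = len(valuations)
    n_goods = len(valuations[0])
    best_alloc, best_min = None, float("-inf")
    for owners in product(range(n_agents), repeat=n_goods):
        alloc = [set() for _ in range(n_agents)]
        for good, owner in enumerate(owners):
            alloc[owner].add(good)
        worst = min(bundle_value(v, alloc[i]) for i, v in enumerate(valuations))
        if worst > best_min:
            best_alloc, best_min = alloc, worst
    return best_alloc, best_min

# Two agents, two goods; valuations[i][g] is agent i's value for good g.
valuations = [[6, 4], [3, 7]]
allocation = [{0}, {1}]
envy_free = is_envy_free(valuations, allocation)
equitable = is_equitable(valuations, allocation)
rawlsian, worst_off_value = maximin_allocation(valuations)
```

Here the allocation is envy-free but not equitable (the agents receive values 6 and 7), and it is also the maximin allocation, which shows that the three criteria can agree or disagree on the same outcome. The brute-force search is exponential in the number of goods and is only meant for tiny examples.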