Goto

Collaborating Authors

 filtering


De-AnonymizingTextby FingerprintingLanguageGeneration

Neural Information Processing Systems

Components of machine learning systems are not (yet) perceived as security hotspots. Secure coding practices, such as ensuring that no execution paths depend on confidential inputs, have not yet been adopted by ML developers. We initiate the study of code security of ML systems by investigating how nucleus sampling--a popular approach forgeneratingtext,used forapplications such as auto-completion--unwittingly leakstextstypedbyusers.


Assumed Density Filtering and Smoothing with Neural Network Surrogate Models

arXiv.org Artificial Intelligence

The Kalman filter and Rauch-Tung-Striebel (RTS) smoother are optimal for state estimation in linear dynamic systems. With nonlinear systems, the challenge consists in how to propagate uncertainty through the state transitions and output function. For the case of a neural network model, we enable accurate uncertainty propagation using a recent state-of-the-art analytic formula for computing the mean and covariance of a deep neural network with Gaussian input. We argue that cross entropy is a more appropriate performance metric than RMSE for evaluating the accuracy of filters and smoothers. We demonstrate the superiority of our method for state estimation on a stochastic Lorenz system and a Wiener system, and find that our method enables more optimal linear quadratic regulation when the state estimate is used for feedback.



An Outcome-Based Educational Recommender System

arXiv.org Artificial Intelligence

Abstract--Most educational recommender systems are tuned and judged on click-or rating-based relevance, leaving their true pedagogical impact unclear . We introduce OBER--an Outcome-Based Educational Recommender that embeds learning outcomes and assessment items directly into the data schema, so any algorithm can be evaluated on the mastery it fosters. OBER uses a minimalist entity-relation model, a log-driven mastery formula, and a plug-in architecture. Integrated into an e-learning system in non-formal domain, it was evaluated trough a two-week A/B/C test with over 5 700 learners across three methods: fixed expert trajectory, collaborative filtering (CF), and knowledge-based (KB) filtering. CF maximized retention, but the fixed path achieved the highest mastery. Because OBER derives business, relevance, and learning metrics from the same logs, it lets practitioners weigh relevance and engagement against outcome mastery with no extra testing overhead. The framework is method-agnostic and readily extensible to future adaptive or context-aware recommenders. Index T erms--recommendation systems, e-learning, evaluation, assessment, intended learning outcomes, constructive alingment, empirical software engineering.


Limited Reference, Reliable Generation: A Two-Component Framework for Tabular Data Generation in Low-Data Regimes

arXiv.org Artificial Intelligence

Synthetic tabular data generation is increasingly essential in data management, supporting downstream applications when real-world and high-quality tabular data is insufficient. Existing tabular generation approaches, such as generative adversarial networks (GANs), diffusion models, and fine-tuned Large Language Models (LLMs), typically require sufficient reference data, limiting their effectiveness in domain-specific databases with scarce records. While prompt-based LLMs offer flexibility without parameter tuning, they often fail to capture dataset-specific feature-label dependencies and generate redundant data, leading to degradation in downstream task performance. To overcome these issues, we propose ReFine, a framework that (i) derives symbolic "if-then" rules from interpretable models and embeds them into prompts to explicitly guide generation toward domain-specific feature distribution, and (ii) applies a dual-granularity filtering strategy that suppresses over-sampling patterns and selectively refines rare but informative samples to reduce distributional imbalance. Extensive experiments on various regression and classification benchmarks demonstrate that ReFine consistently outperforms state-of-the-art methods, achieving up to 0.44 absolute improvement in R-squared for regression and 10.0 percent relative improvement in F1 score for classification tasks.


Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset Peter Henderson

Neural Information Processing Systems

Emerging ethical approaches have attempted to filter pretraining material, but such approaches have been ad hoc and failed to take context into account. We offer an approach to filtering grounded in law, which has directly addressed the tradeoffs in filtering material.


Behavior-Specific Filtering for Enhanced Pig Behavior Classification in Precision Livestock Farming

arXiv.org Artificial Intelligence

Precision Livestock Farming (PLF) has emerged as a critical field for monitoring and improving animal health and behavior[1]. Accurate and continuous tracking of livestock behavior is essential for identifying early signs of health issues an d enabling timely intervention. Traditional methods for monitoring pig behavior, such as manual observation, are labor - intensive, limited in scalability, and prone to inaccuracies [2]. Recent advancements in PLF have introduced automated systems that lev erage biosensors to track behavior in real time. These sensors, often attached to animals, collect data that is both costeffective and reliable, making them indispensable for modern livestock management [3,4].


A Computational Approach to Modeling Conversational Systems: Analyzing Large-Scale Quasi-Patterned Dialogue Flows

arXiv.org Artificial Intelligence

--The analysis of conversational dynamics has gained increasing importance with the rise of large language model-based systems, which interact with users across diverse contexts. In this work, we propose a novel computational framework for constructing conversational graphs that capture the flow and structure of loosely organized dialogues, referred to as quasi-patterned conversations. We introduce the Filter & Reconnect method, a novel graph simplification technique that minimizes noise while preserving semantic coherence and structural integrity of conversational graphs. Through comparative analysis, we demonstrate that the use of large language models combined with our graph simplification technique has resulted in semantic metric S increasing by a factor of 2.06 compared to previous approaches while simultaneously enforcing a tree-like structure with 0 ฮด -hyperbolicity, ensuring optimal clarity in conversation modeling. This work provides a computational method for analyzing large-scale dialogue datasets, with practical applications related to monitoring automated systems such as chatbots, dialogue management tools, and user behavior analytics.


Real-Time ESFP: Estimating, Smoothing, Filtering, and Pose-Mapping

arXiv.org Artificial Intelligence

A. SMPL: A Skinned Multi-Person Linear Model The SMPL model ( Skinned Multi-Person Linear model) is a widely adopted statistical representation of the human body that combines a low-dimensional parameter space with linear blend skinning (LBS) to generate realistic, fully differentiable 3-D meshes. It underpins many state-of-the-art pipelines for monocular pose estimation, motion capture, and animation because it offers three essential properties: a compact pose-shape space learned from thousands of laser scans, an articulated skeletal structure compatible with traditional skinning, and analytic gradients with respect to both pose and shape parameters [6].


OpenThoughts: Data Recipes for Reasoning Models

arXiv.org Artificial Intelligence

Reasoning models have made rapid progress on many benchmarks involving math, code, and science. Yet, there are still many open questions about the best training recipes for reasoning since state-of-the-art models often rely on proprietary datasets with little to no public information available. To address this, the goal of the OpenThoughts project is to create open-source datasets for training reasoning models. After initial explorations, our OpenThoughts2-1M dataset led to OpenThinker2-32B, the first model trained on public reasoning data to match DeepSeek-R1-Distill-32B on standard reasoning benchmarks such as AIME and LiveCodeBench. We then improve our dataset further by systematically investigating each step of our data generation pipeline with 1,000+ controlled experiments, which led to OpenThoughts3. Scaling the pipeline to 1.2M examples and using QwQ-32B as teacher yields our OpenThoughts3-7B model, which achieves state-of-the-art results: 53% on AIME 2025, 51% on LiveCodeBench 06/24-01/25, and 54% on GPQA Diamond - improvements of 15.3, 17.2, and 20.5 percentage points compared to the DeepSeek-R1-Distill-Qwen-7B. All of our datasets and models are available on https://openthoughts.ai.