AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.67)

Neural Information Processing SystemsFeb-11-2026, 06:36:39 GMT

De-AnonymizingTextby FingerprintingLanguageGeneration

Components of machine learning systems are not (yet) perceived as security hotspots. Secure coding practices, such as ensuring that no execution paths depend on confidential inputs, have not yet been adopted by ML developers. We initiate the study of code security of ML systems by investigating how nucleus sampling--a popular approach forgeneratingtext,used forapplications such as auto-completion--unwittingly leakstextstypedbyusers.

artificial intelligence, logit, machine learning, (19 more...)

Country:

Asia > India (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Information Technology (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Kuang, Simon, Lin, Xinfan

Assumed Density Filtering and Smoothing with Neural Network Surrogate Models

arXiv.org Artificial IntelligenceNov-13-2025

The Kalman filter and Rauch-Tung-Striebel (RTS) smoother are optimal for state estimation in linear dynamic systems. With nonlinear systems, the challenge consists in how to propagate uncertainty through the state transitions and output function. For the case of a neural network model, we enable accurate uncertainty propagation using a recent state-of-the-art analytic formula for computing the mean and covariance of a deep neural network with Gaussian input. We argue that cross entropy is a more appropriate performance metric than RMSE for evaluating the accuracy of filters and smoothers. We demonstrate the superiority of our method for state estimation on a stochastic Lorenz system and a Wiener system, and find that our method enables more optimal linear quadratic regulation when the state estimate is used for feedback.

artificial intelligence, machine learning, state estimation problem, (16 more...)

2511.09016

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
North America > United States > California > Yolo County > Davis (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry: Health & Medicine (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Neural Information Processing SystemsOct-2-2025, 00:14:19 GMT

A Tractable Approximation to Optimal Point Process Filtering: Application to Neural Encoding

Yuval Harel, Ron Meir, Manfred Opper

Neural Information Processing Systems http://nips.cc/

posterior variance, spike, variance, (15 more...)

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)

Askarbekuly, Nursultan, Fayzrakhmanov, Timur, Babarogić, Sladjan, Luković, Ivan

An Outcome-Based Educational Recommender System

arXiv.org Artificial IntelligenceSep-24-2025

Abstract--Most educational recommender systems are tuned and judged on click-or rating-based relevance, leaving their true pedagogical impact unclear . We introduce OBER--an Outcome-Based Educational Recommender that embeds learning outcomes and assessment items directly into the data schema, so any algorithm can be evaluated on the mastery it fosters. OBER uses a minimalist entity-relation model, a log-driven mastery formula, and a plug-in architecture. Integrated into an e-learning system in non-formal domain, it was evaluated trough a two-week A/B/C test with over 5 700 learners across three methods: fixed expert trajectory, collaborative filtering (CF), and knowledge-based (KB) filtering. CF maximized retention, but the fixed path achieved the highest mastery. Because OBER derives business, relevance, and learning metrics from the same logs, it lets practitioners weigh relevance and engagement against outcome mastery with no extra testing overhead. The framework is method-agnostic and readily extensible to future adaptive or context-aware recommenders. Index T erms--recommendation systems, e-learning, evaluation, assessment, intended learning outcomes, constructive alingment, empirical software engineering.

artificial intelligence, machine learning, recommender system, (17 more...)

2509.18186

Country: Europe > Serbia (0.15)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (0.66)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.57)
Education > Educational Setting > Online (0.57)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Artificial IntelligenceSep-15-2025

Limited Reference, Reliable Generation: A Two-Component Framework for Tabular Data Generation in Low-Data Regimes

Jiang, Mingxuan, Wang, Yongxin, Dai, Ziyue, Liu, Yicun, Nie, Hongyi, Liu, Sen, Chai, Hongfeng

Synthetic tabular data generation is increasingly essential in data management, supporting downstream applications when real-world and high-quality tabular data is insufficient. Existing tabular generation approaches, such as generative adversarial networks (GANs), diffusion models, and fine-tuned Large Language Models (LLMs), typically require sufficient reference data, limiting their effectiveness in domain-specific databases with scarce records. While prompt-based LLMs offer flexibility without parameter tuning, they often fail to capture dataset-specific feature-label dependencies and generate redundant data, leading to degradation in downstream task performance. To overcome these issues, we propose ReFine, a framework that (i) derives symbolic "if-then" rules from interpretable models and embeds them into prompts to explicitly guide generation toward domain-specific feature distribution, and (ii) applies a dual-granularity filtering strategy that suppresses over-sampling patterns and selectively refines rare but informative samples to reduce distributional imbalance. Extensive experiments on various regression and classification benchmarks demonstrate that ReFine consistently outperforms state-of-the-art methods, achieving up to 0.44 absolute improvement in R-squared for regression and 10.0 percent relative improvement in F1 score for classification tasks.

large language model, machine learning, natural language, (19 more...)

2509.0996

Country: Asia > China (0.28)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Neural Information Processing SystemsAug-18-2025, 08:08:52 GMT

Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset Peter Henderson

Emerging ethical approaches have attempted to filter pretraining material, but such approaches have been ad hoc and failed to take context into account. We offer an approach to filtering grounded in law, which has directly addressed the tradeoffs in filtering material.

data mining, machine learning, natural language, (16 more...)

Country:

Europe > Germany (0.28)
North America > Canada > British Columbia (0.04)
North America > United States > Virginia (0.04)
(15 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Law > Statutes (1.00)
Law > Litigation (1.00)
Law > Criminal Law (1.00)
(8 more...)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(4 more...)

arXiv.org Artificial IntelligenceJul-29-2025

Behavior-Specific Filtering for Enhanced Pig Behavior Classification in Precision Livestock Farming

Zhang, Zhen, Ha, Dong Sam, Morota, Gota, Shin, Sook

Precision Livestock Farming (PLF) has emerged as a critical field for monitoring and improving animal health and behavior[1]. Accurate and continuous tracking of livestock behavior is essential for identifying early signs of health issues an d enabling timely intervention. Traditional methods for monitoring pig behavior, such as manual observation, are labor - intensive, limited in scalability, and prone to inaccuracies [2]. Recent advancements in PLF have introduced automated systems that lev erage biosensors to track behavior in real time. These sensors, often attached to animals, collect data that is both costeffective and reliable, making them indispensable for modern livestock management [3,4].

accuracy, data mining, machine learning, (14 more...)

2507.21021

Country:

North America > United States > Virginia (0.16)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (1.00)
Food & Agriculture > Agriculture (0.86)
Information Technology (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

Ammar, Mohamed Achref Ben, Bennani, Mohamed Taha

A Computational Approach to Modeling Conversational Systems: Analyzing Large-Scale Quasi-Patterned Dialogue Flows

arXiv.org Artificial IntelligenceJul-21-2025

--The analysis of conversational dynamics has gained increasing importance with the rise of large language model-based systems, which interact with users across diverse contexts. In this work, we propose a novel computational framework for constructing conversational graphs that capture the flow and structure of loosely organized dialogues, referred to as quasi-patterned conversations. We introduce the Filter & Reconnect method, a novel graph simplification technique that minimizes noise while preserving semantic coherence and structural integrity of conversational graphs. Through comparative analysis, we demonstrate that the use of large language models combined with our graph simplification technique has resulted in semantic metric S increasing by a factor of 2.06 compared to previous approaches while simultaneously enforcing a tree-like structure with 0 δ -hyperbolicity, ensuring optimal clarity in conversation modeling. This work provides a computational method for analyzing large-scale dialogue datasets, with practical applications related to monitoring automated systems such as chatbots, dialogue management tools, and user behavior analytics.

large language model, machine learning, natural language, (20 more...)

doi: 10.1109/EUROCON64445.2025.11073224

2507.13544

Country:

Asia (0.28)
Africa > Middle East > Tunisia (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceJun-27-2025

Real-Time ESFP: Estimating, Smoothing, Filtering, and Pose-Mapping

Cui, Qifei, Zhou, Yuang, Deng, Ruichen

A. SMPL: A Skinned Multi-Person Linear Model The SMPL model ( Skinned Multi-Person Linear model) is a widely adopted statistical representation of the human body that combines a low-dimensional parameter space with linear blend skinning (LBS) to generate realistic, fully differentiable 3-D meshes. It underpins many state-of-the-art pipelines for monocular pose estimation, motion capture, and animation because it offers three essential properties: a compact pose-shape space learned from thousands of laser scans, an articulated skeletal structure compatible with traditional skinning, and analytic gradients with respect to both pose and shape parameters [6].

artificial intelligence, estimation, machine learning, (17 more...)

2506.21234

Country: North America > United States (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Video Understanding (0.56)