AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)

Neural Information Processing SystemsFeb-10-2026, 11:14:35 GMT

SupplementaryMaterialforLearningoutsidethe Black-Box: Thepursuitofinterpretablemodels

This lemma is a trivial consequence of the definition of Meijer G-functions. The only nontrivial step in the abovereasoning is going from the second to the third line. To speed up the process, the experiments are done by using a restriction ofGH excluding the inverse trigonometric functions as well as some Bessel functions. Also note that, as suggested by LIME,X8, X9 also have an important weight in this polynomial. Wefinishonalast remark on the benefits offered by our projection pursuit approach. Wesee that both the symbolic model and its local approximation take a very concise form when we consider the new variables zk, k =1,...,K.

artificial intelligence, black-box, thepursuitofinterpretablemodel, (18 more...)

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Transportation > Air (0.42)

Technology: Information Technology > Artificial Intelligence (0.49)

Neural Information Processing SystemsFeb-10-2026, 03:22:43 GMT

7c080cab957edab671ac49ae11e51337-Supplemental-Conference.pdf

different da realization, standard deviation correspond, transformation, (10 more...)

Technology: Information Technology > Artificial Intelligence (0.94)

Toba, Hayate, Yano, Atsushi, Azumi, Takuya

Generalized Inequality-based Approach for Probabilistic WCET Estimation

arXiv.org Machine LearningNov-18-2025

Estimating the probabilistic Worst-Case Execution Time (pWCET) is essential for ensuring the timing correctness of real-time applications, such as in robot IoT systems and autonomous driving systems. While methods based on Extreme Value Theory (EVT) can provide tight bounds, they suffer from model uncertainty due to the need to decide where the upper tail of the distribution begins. Conversely, inequality-based approaches avoid this issue but can yield pessimistic results for heavy-tailed distributions. This paper proposes a method to reduce such pessimism by incorporating saturating functions (arctangent and hyperbolic tangent) into Chebyshev's inequality, which mitigates the influence of large outliers while preserving mathematical soundness. Evaluations on synthetic and real-world data from the Autoware autonomous driving stack demonstrate that the proposed method achieves safe and tighter bounds for such distributions.

artificial intelligence, inequality, probability, (16 more...)

arXiv.org Machine Learning

2511.11682

Country: Asia > Japan (0.14)

Genre: Research Report (0.82)

Industry:

Information Technology (0.75)
Transportation > Ground > Road (0.55)
Automobiles & Trucks (0.55)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.55)

Morales, Giorgio, Sheppard, John W.

Decomposable Neuro Symbolic Regression

arXiv.org Artificial IntelligenceNov-7-2025

Symbolic regression (SR) models complex systems by discovering mathematical expressions that capture underlying relationships in observed data. However, most SR methods prioritize minimizing prediction error over identifying the governing equations, often producing overly complex or inaccurate expressions. To address this, we present a decomposable SR method that generates interpretable multivariate expressions leveraging transformer models, genetic algorithms (GAs), and genetic programming (GP). In particular, our explainable SR method distills a trained ``opaque'' regression model into mathematical expressions that serve as explanations of its computed function. Our method employs a Multi-Set Transformer to generate multiple univariate symbolic skeletons that characterize how each variable influences the opaque model's response. We then evaluate the generated skeletons' performance using a GA-based approach to select a subset of high-quality candidates before incrementally merging them via a GP-based cascade procedure that preserves their original skeleton structure. The final multivariate skeletons undergo coefficient optimization via a GA. We evaluated our method on problems with controlled and varying degrees of noise, demonstrating lower or comparable interpolation and extrapolation errors compared to two GP-based methods, three neural SR methods, and a hybrid approach. Unlike them, our approach consistently learned expressions that matched the original mathematical structure.

evolutionary algorithm, machine learning, tanh, (18 more...)

2511.04124

Country: North America > United States (0.45)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

Chen, Chi-Sheng, Chen, Samuel Yen-Chi

Q-DPTS: Quantum Differentially Private Time Series Forecasting via Variational Quantum Circuits

arXiv.org Artificial IntelligenceSep-24-2025

Time series forecasting is vital in domains where data sensitivity is paramount, such as finance and energy systems. While Differential Privacy (DP) provides theoretical guarantees to protect individual data contributions, its integration especially via DP-SGD often impairs model performance due to injected noise. In this paper, we propose Q-DPTS, a hybrid quantum-classical framework for Quantum Differentially Private Time Series Forecasting. Q-DPTS combines Variational Quantum Circuits (VQCs) with per-sample gradient clipping and Gaussian noise injection, ensuring rigorous $(ε, δ)$-differential privacy. The expressiveness of quantum models enables improved robustness against the utility loss induced by DP mechanisms. We evaluate Q-DPTS on the ETT (Electricity Transformer Temperature) dataset, a standard benchmark for long-term time series forecasting. Our approach is compared against both classical and quantum baselines, including LSTM, QASA, QRWKV, and QLSTM. Results demonstrate that Q-DPTS consistently achieves lower prediction error under the same privacy budget, indicating a favorable privacy-utility trade-off. This work presents one of the first explorations into quantum-enhanced differentially private forecasting, offering promising directions for secure and accurate time series modeling in privacy-critical scenarios.

artificial intelligence, data mining, machine learning, (18 more...)

2508.05036

Country: North America > United States (0.14)

Genre: Research Report > New Finding (0.48)

Industry:

Energy (0.48)
Information Technology > Security & Privacy (0.47)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Ta, Hoang-Thang, Thai, Duy-Quy, Tran-Thi, Phuong-Linh

Combinations of Fast Activation and Trigonometric Functions in Kolmogorov-Arnold Networks

arXiv.org Artificial IntelligenceAug-19-2025

For years, many neural networks have been developed based on the Kolmogorov-Arnold Representation Theorem (KART), which was created to address Hilbert's 13th problem. Recently, relying on KART, Kolmogorov-Arnold Networks (KANs) have attracted attention from the research community, stimulating the use of polynomial functions such as B-splines and RBFs. However, these functions are not fully supported by GPU devices and are still considered less popular. In this paper, we propose the use of fast computational functions, such as ReLU and trigonometric functions (e.g., ReLU, sin, cos, arctan), as basis components in Kolmogorov-Arnold Networks (KANs). By integrating these function combinations into the network structure, we aim to enhance computational efficiency. Experimental results show that these combinations maintain competitive performance while offering potential improvements in training time and generalization.

artificial intelligence, arxiv preprint arxiv, machine learning, (14 more...)

2508.11876

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsAug-16-2025, 07:26:59 GMT

Appendix: A Data-Augmentation Is Worth A Thousand Samples

We thus propose to derive them, following the same recipe one will be able to obtain the analytical form of the first two moments for any desired transformation.

artificial intelligence, standard deviation correspond, transformation, (11 more...)

Technology: Information Technology > Artificial Intelligence (0.94)

arXiv.org Artificial IntelligenceMay-29-2025

Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models

An, Sohyun, Wang, Ruochen, Zhou, Tianyi, Hsieh, Cho-Jui

While recent success of large reasoning models (LRMs) significantly advanced LLMs' reasoning capability by optimizing the final answer accuracy using reinforcement learning, they may also drastically increase the output length due to overthinking, characterized by unnecessarily complex reasoning paths that waste computation and potentially degrade the performance. We hypothesize that such inefficiencies stem from LRMs' limited capability to dynamically select the proper modular reasoning strategies, termed thinking patterns at the right position. To investigate this hypothesis, we propose a dynamic optimization framework that segments model-generated reasoning paths into distinct thinking patterns, systematically identifying and promoting beneficial patterns that improve the answer while removing detrimental ones. Empirical analysis confirms that our optimized thinking paths yield more concise yet sufficiently informative trajectories, enhancing reasoning efficiency by reducing attention FLOPs by up to 47% while maintaining accuracy for originally correct responses. Moreover, a non-trivial portion of originally incorrect responses are transformed into correct ones, achieving a 15.6% accuracy improvement with reduced length. Motivated by the improvement brought by the optimized thinking paths, we apply a preference optimization technique supported by a pairwise dataset contrasting suboptimal and optimal reasoning paths. Experimental evaluations across multiple mathematical reasoning benchmarks reveal that our method notably reduces computational overhead while simultaneously improving reasoning accuracy, achieving up to a 12% accuracy improvement and reducing token usage from approximately 5,000 to 3,000 tokens.

arxiv preprint arxiv, large language model, machine learning, (16 more...)

2505.21765

Country: North America > United States > California (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.87)

Li, Sixu, Kumar, Deepak Prakash, Darbha, Swaroop, Zhou, Yang

Time-optimal Convexified Reeds-Shepp Paths on a Sphere

arXiv.org Artificial IntelligenceApr-1-2025

This article addresses time-optimal path planning for a vehicle capable of moving both forward and backward on a unit sphere with a unit maximum speed, and constrained by a maximum absolute turning rate $U_{max}$. The proposed formulation can be utilized for optimal attitude control of underactuated satellites, optimal motion planning for spherical rolling robots, and optimal path planning for mobile robots on spherical surfaces or uneven terrains. By utilizing Pontryagin's Maximum Principle and analyzing phase portraits, it is shown that for $U_{max}\geq1$, the optimal path connecting a given initial configuration to a desired terminal configuration falls within a sufficient list of 23 path types, each comprising at most 6 segments. These segments belong to the set $\{C,G,T\}$, where $C$ represents a tight turn with radius $r=\frac{1}{\sqrt{1+U_{max}^2}}$, $G$ represents a great circular arc, and $T$ represents a turn-in-place motion. Closed-form expressions for the angles of each path in the sufficient list are derived. The source code for solving the time-optimal path problem and visualization is publicly available at https://github.com/sixuli97/Optimal-Spherical-Convexified-Reeds-Shepp-Paths.

artificial intelligence, optimal path, planning & scheduling, (16 more...)

2504.00966

Country:

North America > United States > Texas > Brazos County > College Station (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Italy > Calabria > Crotone Province (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.68)