AITopics

Crucinio, Francesca R., Pathiraja, Sahani

Sequential Monte Carlo approximations of Wasserstein--Fisher--Rao gradient flows

arXiv.org Machine Learning, Jun-9-2025

We consider the problem of sampling from a probability distribution $π$. It is well known that this can be written as an optimisation problem over the space of probability distributions in which we aim to minimise the Kullback--Leibler divergence from $π$. We consider several partial differential equations (PDEs) whose solution is a minimiser of the Kullback--Leibler divergence from $π$ and connect them to well-known Monte Carlo algorithms. We focus in particular on PDEs obtained by considering the Wasserstein--Fisher--Rao geometry over the space of probabilities and show that these lead to a natural implementation using importance sampling and sequential Monte Carlo. We propose a novel algorithm to approximate the Wasserstein--Fisher--Rao flow of the Kullback--Leibler divergence which empirically outperforms the current state-of-the-art. We study tempered versions of these PDEs obtained by replacing the target distribution with a geometric mixture of initial and target distribution and show that these do not lead to a convergence speed-up.

approximation, artificial intelligence, machine learning, (17 more...)

2506.05905

Country:

Europe > Italy > Piedmont > Turin Province > Turin (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > New York (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.45)

The Guardian, Jun-8-2025, 15:00:37 GMT

US attacks on science and research a 'great gift' to China on artificial intelligence, former OpenAI board member says

The US administration's targeting of academic research and international students is a "great gift" to China in the race to compete on artificial intelligence, former OpenAI board member Helen Toner has said. The director of strategy at Georgetown's Center for Security and Emerging Technology (CSET) joined the board of OpenAI in 2021 after a career studying AI and the relationship between the United States and China. Toner, a 33-year-old University of Melbourne graduate, was on the board for two years until a falling out with founder Sam Altman in 2023. Altman was fired by the board over claims that he was not "consistently candid" in his communications and the board did not have confidence in Altman's ability to lead. The chaotic months that followed saw Altman fired and then re-hired with three members of the board, including Toner, ousted instead.

large language model, machine learning, natural language, (16 more...)

Country:

North America > United States (1.00)
Asia > China (0.88)
Oceania > Australia (0.06)

Industry: Government > Regional Government > North America Government > United States Government (0.85)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.92)

BBC News, Jun-7-2025, 04:48:05 GMT

Biggest drone strike hits Ukraine's second city

Jaroslav Lukiv, BBC News

At least two people have been killed and another 17 injured in the biggest Russian drone attack on Ukraine's second-largest city of Kharkiv, the mayor says. Ihor Terekhov says that overnight Russia launched 48 drones, as well as two missiles and four gliding bombs. "We have a lot of damage," he says, adding that three high-rise residential buildings were hit. Footage has emerged showing several storeys of one such building on fire. Six people were killed and 80 injured across Ukraine the previous night, when Russia attacked the country with more than 400 drones and nearly 40 missiles.

artificial intelligence, biggest drone strike, ukraine, (8 more...)

Country:

Asia > Russia (0.76)
Europe > Russia (0.50)
Europe > Ukraine > Kharkiv Oblast > Kharkiv (0.49)
(17 more...)

Industry:

Government > Military (1.00)
Government > Regional Government > Europe Government (0.51)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)

Hartisch, Michael, Chew, Leroy

An Expansion-Based Approach for Quantified Integer Programming

Quantified Integer Programming (QIP) bridges multiple domains by extending Quantified Boolean Formulas (QBF) to incorporate general integer variables and linear constraints while also generalizing Integer Programming through variable quantification. As a special case of Quantified Constraint Satisfaction Problems (QCSP), QIP provides a versatile framework for addressing complex decision-making scenarios. Additionally, the inclusion of a linear objective function enables QIP to effectively model multistage robust discrete linear optimization problems, making it a powerful tool for tackling uncertainty in optimization. While two primary solution paradigms exist for QBF -- search-based and expansion-based approaches -- only search-based methods have been explored for QIP and QCSP. We introduce an expansion-based approach for QIP using Counterexample-Guided Abstraction Refinement (CEGAR), adapting techniques from QBF. We extend this methodology to tackle multistage robust discrete optimization problems with linear constraints and further embed it in an optimization framework, enhancing its applicability. Our experimental results highlight the advantages of this approach, demonstrating superior performance over existing search-based solvers for QIP in specific instances. Furthermore, the ability to model problems using linear constraints enables notable performance gains over state-of-the-art expansion-based solvers for QBF.

artificial intelligence, constraint, optimization problem, (16 more...)

2506.04452

Country:

Europe > Germany > Bavaria > Middle Franconia > Nuremberg (0.40)
Europe > Netherlands > South Holland > Leiden (0.04)
South America > Brazil > Rio Grande do Norte > Natal (0.04)
(9 more...)

Genre: Research Report (0.81)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)

Gurung, Dev, Pokhrel, Shiva Raj

Communication Efficient Adaptive Model-Driven Quantum Federated Learning

Training with huge datasets and a large number of participating devices leads to bottlenecks in federated learning (FL). Furthermore, the challenges of heterogeneity between multiple FL clients affect the overall performance of the system. In a quantum federated learning (QFL) context, we address these three main challenges: i) training bottlenecks from massive datasets, ii) the involvement of a substantial number of devices, and iii) non-IID data distributions. We introduce a model-driven quantum federated learning algorithm (mdQFL) to tackle these challenges. Our proposed approach is efficient and adaptable to various factors, including different numbers of devices. To the best of our knowledge, it is the first to explore training and update personalization, as well as test generalization within a QFL setting, which can be applied to other FL scenarios. We evaluated the efficiency of the proposed mdQFL framework through extensive experiments under diverse non-IID data heterogeneity conditions using various datasets within the Qiskit environment. Our results demonstrate a nearly 50% decrease in total communication costs while maintaining or, in some cases, exceeding the accuracy of the final model and consistently improving local model training compared to the standard QFL baseline. Moreover, our experimental evaluation thoroughly explores the QFL and mdQFL algorithms, along with several influencing factors. In addition, we present a theoretical analysis to clarify the complexities of the proposed algorithm. Federated Learning (FL) has emerged as a pivotal technique to address the challenges of privacy and security in distributed machine learning [1], [2].

artificial intelligence, generalization, machine learning, (18 more...)

2506.04548

Country:

Oceania > Australia (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > Netherlands (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Daryakenari, Farshad Haghgoo, Farizeh, Tara

A Novel Transformer-Based Method for Full Lower-Limb Joint Angles and Moments Prediction in Gait Using sEMG and IMU data

This study presents a transformer-based deep learning framework for the long-horizon prediction of full lower-limb joint angles and joint moments using surface electromyography (sEMG) and inertial measurement unit (IMU) signals. Two separate Transformer Neural Networks (TNNs) were designed: one for kinematic prediction and one for kinetic prediction. The model was developed with real-time application in mind, using only wearable sensors suitable for outside-laboratory use. Two prediction horizons were considered to evaluate short- and long-term performance. The network achieved high accuracy in both tasks, with Spearman correlation coefficients exceeding ρ = 0.96 and R Notably, the model consistently outperformed a recent benchmark method in joint angle prediction, reducing RMSE by an order of magnitude. The results confirmed the complementary role of sEMG and IMU signals in capturing both kinematic and kinetic information. This work demonstrates the potential of transformer-based models for real-time, full-limb biomechanical prediction in wearable and robotic applications, with future directions including input minimization and modality-specific weighting strategies to enhance model efficiency and accuracy. A crucial requirement in developing real-world systems, especially those that involve repetitive tasks, is optimization. Without an optimized system, we risk excessive energy consumption, increased physical or computational effort, and ultimately higher operational costs, all of which are undesirable. However, achieving such optimization requires a foundational step: analyzing the system's dynamics throughout task execution.

machine learning, natural language, prediction, (17 more...)

2506.04577

Country:

Europe > Switzerland (0.04)
Asia > Middle East > Iran > Tehran Province > Tehran (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
Europe > Austria > Vienna (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Stable Vision Concept Transformers for Medical Diagnosis

Hu, Lijie, Lai, Songning, Hua, Yuan, Yang, Shu, Zhang, Jingfeng, Wang, Di

Transparency is a paramount concern in the medical field, prompting researchers to delve into the realm of explainable AI (XAI). Among these XAI methods, Concept Bottleneck Models (CBMs) aim to restrict the model's latent space to human-understandable high-level concepts by generating a conceptual layer for extracting conceptual features, which has drawn much attention recently. However, existing methods rely solely on concept features to determine the model's predictions, overlooking the intrinsic feature embeddings within medical images. To address this utility gap between the original models and concept-based models, we propose Vision Concept Transformer (VCT). Furthermore, despite their benefits, CBMs have been found to negatively impact model performance and fail to provide stable explanations when faced with input perturbations, which limits their application in the medical field. To address this faithfulness issue, this paper further proposes the Stable Vision Concept Transformer (SVCT) based on VCT, which leverages the vision transformer (ViT) as its backbone and incorporates a conceptual layer. SVCT employs conceptual features to enhance decision-making capabilities by fusing them with image features and ensures model faithfulness through the integration of Denoised Diffusion Smoothing. Comprehensive experiments on four medical datasets demonstrate that our VCT and SVCT maintain accuracy while remaining interpretable compared to baselines. Furthermore, even when subjected to perturbations, our SVCT model consistently provides faithful explanations, thus meeting the needs of the medical field.

artificial intelligence, machine learning, natural language, (18 more...)

2506.05286

Country: Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)

Genre: Research Report > Promising Solution (0.48)

Industry:

Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(4 more...)

Kandala, Ratna, Mondal, Prakash

Towards a Unified System of Representation for Continuity and Discontinuity in Natural Language

Syntactic discontinuity is a grammatical phenomenon in which a constituent is split into more than one part because of the insertion of an element which is not part of the constituent. This is observed in many languages across the world such as Turkish, Russian, Japanese, Warlpiri, Navajo, Hopi, Dyirbal, Yidiny etc. Different formalisms/frameworks in current linguistic theory approach the problem of discontinuous structures in different ways. Each framework/formalism has widely been viewed as an independent and non-converging system of analysis. In this paper, we propose a unified system of representation for both continuity and discontinuity in structures of natural languages by taking into account three formalisms, in particular, Phrase Structure Grammar (PSG) for its widely used notion of constituency, Dependency Grammar (DG) for its head-dependent relations, and Categorial Grammar (CG) for its focus on functor-argument relations. We attempt to show that discontinuous expressions as well as continuous structures can be analysed through a unified mathematical derivation incorporating the representations of linguistic structure in these three grammar formalisms.

artificial intelligence, natural language, relation, (18 more...)

2506.05235

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(13 more...)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Pielok, Tobias, Bischl, Bernd, Rügamer, David

Semi-Implicit Variational Inference via Kernelized Path Gradient Descent

Semi-implicit variational inference (SIVI) is a powerful framework for approximating complex posterior distributions, but training with the Kullback-Leibler (KL) divergence can be challenging due to high variance and bias in high-dimensional settings. While current state-of-the-art semi-implicit variational inference methods, particularly Kernel Semi-Implicit Variational Inference (KSIVI), have been shown to work in high dimensions, training remains moderately expensive. In this work, we propose a kernelized KL divergence estimator that stabilizes training through nonparametric smoothing. To further reduce the bias, we introduce an importance sampling correction. We provide a theoretical connection to the amortized version of the Stein variational gradient descent, which estimates the score gradient via Stein's identity, showing that both methods minimize the same objective, but our semi-implicit approach achieves lower gradient variance. In addition, our method's bias in function space is benign, leading to more stable and efficient optimization. Empirical results demonstrate that our method outperforms or matches state-of-the-art SIVI methods in both performance and training efficiency.

artificial intelligence, machine learning, variational inference, (17 more...)

2506.05088

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States (0.04)
(2 more...)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.71)