rdt
Hazard-Responsive Digital Twin for Climate-Driven Urban Resilience and Equity
Complex events such as wildfires, floods, and heatwaves are no longer isolated phenomena but interlinked hazards that propagate through interconnected infrastructure networks. When one system fails, others that depend on it often cascade toward collapse, producing widespread disruption and social inequity. Recent crises, including the 2023 Vermont flooding, the 2024 Texas winter freeze, and the 2025 Southern California wildfires, illustrate how climate-amplified events can simultaneously strain energy, water, communication, and transportation systems. Traditional risk assessments, which often treat hazards as discrete, static events, are insufficient to capture the evolving and compounding nature of modern disasters. Digital Twin (DT) technology offers a promising avenue for improving situational awareness and decision-making under such conditions. Originally introduced for aerospace engineering and later adopted across industrial sectors, DTs create real-time virtual counterparts of physical systems using sensor data, predictive modeling, and feedback control (Grieves & Vickers, 2018; Tao et al., 2019). Within the built environment, DTs have been applied to asset monitoring, predictive maintenance, and urban system management (Errandonea et al., 2020; Fogli, 2019; Fuller et al., 2020). However, most conventional DTs rely on stable connectivity, complete datasets, and deterministic control assumptions that do not hold during crises characterized by cascading failures and data disruption. To address these challenges, the concept of the Risk-Informed Digital Twin (RDT) integrates probabilistic modeling, uncertainty quantification, and decision support within the DT architecture (Pignatta & Alibrandi, 2022; Zio & Miqueles, 2024).
- North America > United States > California (0.54)
- North America > United States > Vermont (0.24)
- North America > United States > Texas (0.24)
- Government > Regional Government > North America Government > United States Government (1.00)
- Energy > Power Industry (1.00)
- Information Technology (0.88)
- Transportation (0.86)
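The cascading-failure mechanism described above can be sketched with a toy Monte Carlo model. Everything below (the dependency graph, the probabilities, the function names) is an illustrative assumption rather than the RDT architecture from the cited works; it only shows how interdependence raises a system's outage probability well above its direct-hazard rate.

```python
import random

# Toy cascading-failure model: each infrastructure system fails either
# directly (hazard strike) or because a system it depends on has failed.
# Graph and probabilities are purely illustrative.
DEPENDS_ON = {
    "power": [],
    "water": ["power"],           # pumps need electricity
    "comms": ["power"],
    "transport": ["power", "comms"],
}
P_DIRECT = {"power": 0.10, "water": 0.02, "comms": 0.03, "transport": 0.01}
P_CASCADE = 0.8  # chance a failed dependency takes a dependent system down

def simulate_once(rng):
    failed = {s for s in DEPENDS_ON if rng.random() < P_DIRECT[s]}
    changed = True
    while changed:               # propagate failures until a fixed point
        changed = False
        for system, deps in DEPENDS_ON.items():
            if system not in failed and any(
                d in failed and rng.random() < P_CASCADE for d in deps
            ):
                failed.add(system)
                changed = True
    return failed

def outage_probability(system, trials=20000, seed=0):
    rng = random.Random(seed)
    return sum(system in simulate_once(rng) for _ in range(trials)) / trials
```

Even in this crude sketch, transport's simulated outage probability ends up an order of magnitude above its 1% direct-hazard rate, entirely through cascades from power and communications.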
Phase retrieval with rank $d$ measurements -- \emph{descending} algorithms phase transitions
The companion paper [118] developed a powerful \emph{Random duality theory} (RDT) based analytical program to statistically characterize the performance of \emph{descending} phase retrieval algorithms (dPR) (these include all variants of gradient descent, among them the widely popular Wirtinger flows). We here generalize the program and show how it can be utilized to handle rank $d$ positive definite phase retrieval (PR) measurements (with special cases $d=1$ and $d=2$ serving as emulations of the real and complex phase retrievals, respectively). In particular, we observe that the minimal sample complexity ratio (number of measurements scaled by the dimension of the unknown signal) which ensures dPR's success exhibits a phase transition (PT) phenomenon. For both plain and lifted RDT we determine the phase transition locations. To complement the theoretical results, we implement a log barrier gradient descent variant and observe that, even in small dimensional scenarios (with problem sizes on the order of 100), the simulated phase transitions are in excellent agreement with the theoretical predictions.
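As a toy companion to the descending-algorithm setup, here is a minimal real-valued ($d=1$) gradient descent on the squared-measurement loss, in the spirit of Wirtinger flow. The step size, warm start, and problem sizes are illustrative choices, not the paper's; well above the transition (here a sample complexity ratio of $m/n = 10$), descent should succeed up to the inherent global sign ambiguity.

```python
import numpy as np

# Plain gradient descent on f(x) = (1/4m) * sum_i ((a_i^T x)^2 - y_i)^2,
# the real-valued analogue of Wirtinger flow. Illustrative sketch only.
def descend_pr(A, y, x0, step=1e-3, iters=2000):
    m = len(y)
    x = x0.copy()
    for _ in range(iters):
        Ax = A @ x
        r = Ax ** 2 - y                    # residuals of squared measurements
        x -= step * (A.T @ (r * Ax)) / m   # gradient of f
    return x

rng = np.random.default_rng(0)
n, m = 20, 200                             # oversampling ratio m/n = 10
x_true = rng.standard_normal(n)
A = rng.standard_normal((m, n))
y = (A @ x_true) ** 2                      # phaseless (sign-less) measurements
x0 = x_true + 0.1 * rng.standard_normal(n)  # warm start near the signal
x_hat = descend_pr(A, y, x0)
# reconstruction is only defined up to a global sign
err = min(np.linalg.norm(x_hat - x_true), np.linalg.norm(x_hat + x_true))
```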
Phase transition of \emph{descending} phase retrieval algorithms
We study theoretical limits of \emph{descending} phase retrieval algorithms. Utilizing \emph{Random duality theory} (RDT), we develop a generic program that allows statistical characterization of various algorithmic performance metrics. Through these we identify the concepts of the \emph{parametric manifold} and its \emph{funneling points} as key mathematical objects that govern the underlying algorithms' behavior. An isomorphism between single funneling point manifolds and global convergence of descending algorithms is established. The structure and shape of the parametric manifold, as well as its dependence on the sample complexity, are studied through both plain and lifted RDT. Emergence of a phase transition is observed: as sample complexity increases, the parametric manifold transitions from a multi to a single funneling point structure. This in turn corresponds to a transition from scenarios where descending algorithms generically fail to scenarios where they succeed in solving phase retrieval. We also develop and implement a practical algorithmic variant that, in a hybrid alternating fashion, combines a barrier and a plain gradient descent. Even though the theoretical results are obtained for infinite dimensional scenarios (and consequently non-jittery parametric manifolds), we observe strong agreement between theoretical and simulated phase transition predictions for fairly small dimensions on the order of a few hundred.
Researchers question reliability of Abbott's rapid malaria tests
The World Health Organization (WHO) has sent an internal memo about potential problems with a major company's malaria tests after scientists reported issues with test sensitivity and warned it could delay patients' access to critical treatment. Abbott's Bioline rapid diagnostic tests (RDTs) for malaria are used by health workers around the world, particularly in remote areas where lab techniques such as microscopy and DNA detection aren't available. Investigations at several institutions in Southeast Asia suggest at least some of these RDTs fail to detect infections or show faint test lines for some positive cases. Daniel Ngamije Madandi, director of WHO's Global Malaria Programme (GMP), issued the memo to WHO's six regional offices on 30 April. It lists 11 "affected" lots from two Abbott RDTs (Pf/Pv and Pf/Pan) that were associated with "faint lines and false negative results" in reports from "multiple research groups." The memo follows a public notice by WHO in March that warned of reports of faint lines in malaria RDTs without mentioning particular brands or products.
- Asia > Southeast Asia (0.25)
- Europe > France > Grand Est > Bas-Rhin > Strasbourg (0.05)
- Asia > Myanmar (0.05)
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Liu, Songming, Wu, Lingxuan, Li, Bangguo, Tan, Hengkai, Chen, Huayu, Wang, Zhengyi, Xu, Ke, Su, Hang, Zhu, Jun
Bimanual manipulation is essential in robotics, yet developing foundation models for it is extremely challenging due to the inherent complexity of coordinating two robot arms (leading to multi-modal action distributions) and the scarcity of training data. In this paper, we present the Robotics Diffusion Transformer (RDT), a pioneering diffusion foundation model for bimanual manipulation. RDT builds on diffusion models to effectively represent multi-modality, with innovative designs of a scalable Transformer to deal with the heterogeneity of multi-modal inputs and to capture the nonlinearity and high frequency of robotic data. To address data scarcity, we further introduce a Physically Interpretable Unified Action Space, which can unify the action representations of various robots while preserving the physical meanings of the original actions, facilitating learning transferable physical knowledge. With these designs, we managed to pre-train RDT on the largest collection of multi-robot datasets to date and scaled it up to 1.2B parameters, the largest diffusion-based foundation model for robotic manipulation. We finally fine-tuned RDT on a self-created multi-task bimanual dataset with over 6K episodes to refine its manipulation capabilities. Experiments on real robots demonstrate that RDT significantly outperforms existing methods. It exhibits zero-shot generalization to unseen objects and scenes, understands and follows language instructions, learns new skills with just 1-5 demonstrations, and effectively handles complex, dexterous tasks. We refer to the project page for the code and videos. Bimanual manipulation is essential for robots to accomplish real-world tasks (Edsinger & Kemp, 2007). For practical applications, a useful manipulation policy should be able to generalize to unseen scenarios, such as unseen objects and scenes.
Following the success in natural language processing (Achiam et al., 2023; Touvron et al., 2023) and computer vision (Radford et al., 2021; Kirillov et al., 2023), one promising direction to enable generalizable behaviors is to develop a foundation model through imitation learning on large-scale datasets. However, it is highly non-trivial to develop a bimanual manipulation foundation model. One main reason is that the accessible data for a specific dual-arm robot is severely scarce (Sharma et al., 2018; Collaboration et al., 2023) due to high hardware costs, undermining the data-intensive requirements of training foundation models. Inspired by recent attempts in unimanual manipulation (Brohan et al., 2023; Kim et al., 2024), we seek to first pre-train on extensive multi-robot datasets and then fine-tune on a small dataset collected on the target dual-arm robot. This can expand the data size by up to three orders of magnitude, offering the potential to learn transferable physics knowledge from datasets of other robots. Nevertheless, there are two key technical challenges.
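The diffusion machinery RDT builds on can be sketched in a few lines. Below is a generic DDPM-style sampler for an action vector, with the noise predictor passed in as a closure; in RDT the predictor is a large conditioned Transformer, and the schedule here is a toy choice, not the model's actual configuration.

```python
import numpy as np

# Toy DDPM-style action sampler. eps_model(x, t) predicts the noise in x at
# diffusion step t; a real policy would condition it on images and language.
def make_schedule(T=50, beta_max=0.2):
    betas = np.linspace(1e-4, beta_max, T)
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)
    return betas, alphas, alpha_bars

def ddpm_sample(eps_model, dim, T=50, seed=0):
    betas, alphas, alpha_bars = make_schedule(T)
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(dim)            # start from pure Gaussian noise
    for t in range(T - 1, -1, -1):
        z = rng.standard_normal(dim) if t > 0 else 0.0
        eps = eps_model(x, t)
        # standard DDPM posterior-mean update, then noise injection
        x = (x - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps) / np.sqrt(alphas[t])
        x = x + np.sqrt(betas[t]) * z
    return x
```

With the exact noise predictor for a point-mass action distribution, the sampler recovers that action, which is a quick sanity check that the update equations are wired correctly.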
Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling
Xu, Jiawei, Yang, Rui, Luo, Feng, Fang, Meng, Wang, Baoxiang, Han, Lei
Learning policies from offline datasets through offline reinforcement learning (RL) holds promise for scaling data-driven decision-making and avoiding unsafe and costly online interactions. However, real-world data collected from sensors or humans often contains noise and errors, posing a significant challenge for existing offline RL methods. Our study indicates that traditional offline RL methods based on temporal difference learning tend to underperform Decision Transformer (DT) under data corruption, especially when the amount of data is limited. This suggests the potential of sequence modeling for tackling data corruption in offline RL. To further unleash the potential of sequence modeling methods, we propose Robust Decision Transformer (RDT) by incorporating several robust techniques. Specifically, we introduce Gaussian weighted learning and iterative data correction to reduce the effect of corrupted data. Additionally, we leverage embedding dropout to enhance the model's resistance to erroneous inputs. Extensive experiments on MuJoCo, Kitchen, and Adroit tasks demonstrate RDT's superior performance under diverse data corruption compared to previous methods. Moreover, RDT exhibits remarkable robustness in a challenging setting that combines training-time data corruption with testing-time observation perturbations. These results highlight the potential of robust sequence modeling for learning from noisy or corrupted offline datasets, thereby promoting the reliable application of offline RL in real-world tasks.
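The Gaussian-weighted learning idea can be sketched as a reweighted loss: samples with large residuals (likely corrupted) receive exponentially small weight. The exact form and scale used in the paper may differ; this is an illustrative version.

```python
import numpy as np

# Gaussian-weighted MSE in the spirit of RDT's robust loss: per-sample
# weights w_i = exp(-r_i^2 / (2 * sigma^2)) shrink the influence of targets
# with large residuals r_i. sigma is a tunable scale (illustrative choice).
def gaussian_weighted_mse(pred, target, sigma=1.0):
    r = pred - target
    w = np.exp(-(r ** 2) / (2.0 * sigma ** 2))
    return np.mean(w * r ** 2), w
```

A single grossly corrupted target (residual 10 with sigma = 1) gets weight on the order of e^-50, so it contributes essentially nothing to the loss, while near-clean samples keep weights close to 1.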
Harmonizing Program Induction with Rate-Distortion Theory
Zhou, Hanqi, Nagy, David G., Wu, Charley M.
Many aspects of human learning have been proposed as a process of constructing mental programs: from acquiring symbolic number representations to intuitive theories about the world. In parallel, there is a long tradition of modeling human cognition as information processing through Rate-Distortion Theory (RDT). Yet it is still poorly understood how to apply RDT when mental representations take the form of programs. In this work, we adapt RDT by proposing a three-way trade-off among rate (description length), distortion (error), and computational costs (search budget). We use simulations on a melody task to study the implications of this trade-off, and show that constructing a shared program library across tasks provides global benefits. However, this comes at the cost of sensitivity to curricula, which is also characteristic of human learners. Finally, we use methods from partial information decomposition to generate training curricula that induce more effective libraries and better generalization.
- Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- Research Report (0.82)
- Instructional Material > Course Syllabus & Notes (0.34)
- Information Technology > Artificial Intelligence > Natural Language (0.95)
- Information Technology > Artificial Intelligence > Cognitive Science > Simulation of Human Behavior (0.69)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)
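The three-way trade-off can be illustrated with a toy selector: candidate programs are searched in order of description length until the budget runs out, and the best rate-plus-weighted-distortion candidate found is returned. The candidate representation and objective form are assumptions for illustration, not the paper's actual formalism.

```python
# Toy rate / distortion / computation trade-off. Candidates are
# (name, rate, distortion) tuples; real mental programs would be
# executable structures searched by a program-induction procedure.
def select_program(candidates, beta=1.0, budget=None):
    # search cheapest-to-describe programs first (a common induction bias)
    by_rate = sorted(candidates, key=lambda c: c[1])
    # computational cost: only `budget` candidates can be examined
    searched = by_rate[:budget] if budget is not None else by_rate
    # rate-distortion objective: description length + beta * error
    return min(searched, key=lambda c: c[1] + beta * c[2])
```

With an unlimited budget the selector picks the best rate-distortion compromise; shrinking the budget forces it to settle for a cheap-to-describe but lossier program, which is exactly the third axis of the trade-off.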
Capacity of the Hebbian-Hopfield network associative memory
In \cite{Hop82}, Hopfield introduced a \emph{Hebbian} learning rule based neural network model and suggested how it can efficiently operate as an associative memory. Studying random binary patterns, he also uncovered that, if a small fraction of errors is tolerated in the retrieval of stored patterns, the capacity of the network (the maximal number of memorized patterns, $m$) scales linearly with each pattern's size, $n$. Moreover, he famously predicted $\alpha_c=\lim_{n\rightarrow\infty}\frac{m}{n}\approx 0.14$. We study this very same scenario with two famous basins of attraction for the patterns: \textbf{\emph{(i)}} the AGS one from \cite{AmiGutSom85}; and \textbf{\emph{(ii)}} the NLT one from \cite{Newman88,Louk94,Louk94a,Louk97,Tal98}. Relying on the \emph{fully lifted random duality theory} (fl RDT) from \cite{Stojnicflrdt23}, we obtain the following explicit capacity characterizations on the first level of lifting: \begin{equation} \alpha_c^{(AGS,1)} = \left ( \max_{\delta\in \left ( 0,\frac{1}{2}\right ) }\frac{1-2\delta}{\sqrt{2} \mbox{erfinv} \left ( 1-2\delta\right )} - \frac{2}{\sqrt{2\pi}} e^{-\left ( \mbox{erfinv}\left ( 1-2\delta \right )\right )^2}\right )^2 \approx \mathbf{0.137906} \end{equation} \begin{equation} \alpha_c^{(NLT,1)} = \frac{\mbox{erf}(x)^2}{2x^2}-1+\mbox{erf}(x)^2 \approx \mathbf{0.129490}, \quad 1-\mbox{erf}(x)^2- \frac{2\mbox{erf}(x)e^{-x^2}}{\sqrt{\pi}x}+\frac{2e^{-2x^2}}{\pi}=0. \end{equation} Substantial numerical work gives on the second level of lifting $\alpha_c^{(AGS,2)} \approx \mathbf{0.138186}$ and $\alpha_c^{(NLT,2)} \approx \mathbf{0.12979}$, effectively uncovering a remarkably fast lifting convergence. Moreover, the obtained AGS characterizations exactly match the replica symmetry based ones of \cite{AmiGutSom85} and the corresponding symmetry breaking ones of \cite{SteKuh94}.
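The Hebbian storage rule and a one-step retrieval check behind these capacity questions are easy to reproduce. The sketch below stores random ±1 patterns via the outer-product rule and measures the fraction of flipped bits after one synchronous update of each stored pattern; at small n this only loosely tracks the asymptotic alpha_c ~ 0.14, but the qualitative jump in errors as the load grows is already visible.

```python
import numpy as np

# Hebbian-Hopfield associative memory: store m random binary patterns of
# size n with the outer-product rule and test one-step pattern stability.
def hebbian_store(patterns):
    n = patterns.shape[1]
    W = (patterns.T @ patterns) / n
    np.fill_diagonal(W, 0.0)          # no self-coupling
    return W

def retrieval_error(W, patterns):
    # fraction of bits flipped after one synchronous update of each pattern
    recalled = np.sign(patterns @ W)  # W is symmetric, so rows give W @ x
    recalled[recalled == 0] = 1
    return np.mean(recalled != patterns)

rng = np.random.default_rng(1)
n = 400
errs = {}
for alpha in (0.05, 0.30):            # loads below and above alpha_c ~ 0.14
    m = int(alpha * n)
    X = rng.choice([-1.0, 1.0], size=(m, n))
    errs[alpha] = retrieval_error(hebbian_store(X), X)
```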
Fixed width treelike neural networks capacity analysis -- generic activations
We consider the capacity of \emph{treelike committee machines} (TCM) neural networks. Relying on Random Duality Theory (RDT), \cite{Stojnictcmspnncaprdt23} recently introduced a generic framework for their capacity analysis. An upgrade based on the so-called \emph{partially lifted} RDT (pl RDT) was then presented in \cite{Stojnictcmspnncapliftedrdt23}. Both lines of work focused on networks with the most typical, \emph{sign}, activations. Here, on the other hand, we focus on networks with other, more general, types of activations and show that the frameworks of \cite{Stojnictcmspnncaprdt23,Stojnictcmspnncapliftedrdt23} are sufficiently powerful to enable handling of such scenarios as well. In addition to the standard \emph{linear} activations, we uncover that particularly convenient results can be obtained for two very commonly used activations, namely, the \emph{quadratic} and \emph{rectified linear unit (ReLU)} ones. In more concrete terms, for each of these activations we obtain both the RDT and pl RDT based memory capacity upper bound characterizations for \emph{any} given (even) number of hidden layer neurons, $d$. In the process, we also uncover the following two rather remarkable facts: 1) contrary to the common wisdom, both sets of results show that the bounding capacity decreases for large $d$ (the width of the hidden layer) while converging to a constant value; and 2) the maximum bounding capacity is achieved for networks with precisely \textbf{\emph{two}} hidden layer neurons! Moreover, the large $d$ converging values are observed to be in excellent agreement with the statistical physics replica theory based predictions.
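The TCM architecture itself is simple to write down: the input is split into d disjoint blocks, each hidden neuron sees only its own block, and the output is the sign of the summed hidden activations. The threshold parameter below is an illustrative addition (useful for strictly positive activations such as the quadratic one, where the unthresholded sum is always nonnegative); the papers' precise output convention may differ.

```python
import numpy as np

# Treelike committee machine forward pass with a pluggable activation g
# (sign, linear, quadratic, ReLU, ...). W has shape (d, block): d hidden
# neurons, each wired to its own disjoint block of the input. The optional
# threshold theta is an illustrative choice, not taken from the papers.
def tcm_output(W, x, g, theta=0.0):
    d, block = W.shape
    h = g(np.sum(W * x.reshape(d, block), axis=1))  # one block per neuron
    return 1.0 if np.sum(h) - theta >= 0 else -1.0

relu = lambda z: np.maximum(z, 0.0)
quad = lambda z: z ** 2
```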
Exact capacity of the \emph{wide} hidden layer treelike neural networks with generic activations
Recent progress in studying \emph{treelike committee machines} (TCM) neural networks (NN) in \cite{Stojnictcmspnncaprdt23,Stojnictcmspnncapliftedrdt23,Stojnictcmspnncapdiffactrdt23} showed that the Random Duality Theory (RDT) and its \emph{partially lifted} (pl RDT) variant are powerful tools for very precise network capacity analysis. Here, we consider \emph{wide} hidden layer networks and uncover that certain numerical difficulties faced in \cite{Stojnictcmspnncapdiffactrdt23} miraculously disappear. In particular, we employ the recently developed \emph{fully lifted} (fl) RDT to characterize the \emph{wide} ($d\rightarrow \infty$) TCM nets capacity. We obtain explicit, closed form, capacity characterizations for a very generic class of hidden layer activations. While the utilized approach significantly lowers the amount of needed numerical evaluations, the ultimate fl RDT usefulness and success still require a solid portion of residual numerical work. To get concrete capacity values, we take four very famous activation examples: \emph{\textbf{ReLU}}, \textbf{\emph{quadratic}}, \textbf{\emph{erf}}, and \textbf{\emph{tanh}}. After successfully conducting all the residual numerical work for all of them, we uncover that the whole lifting mechanism exhibits a remarkably rapid convergence, with relative improvements no larger than $\sim 0.1\%$ already on the third level of lifting. As a convenient bonus, we also uncover that the capacity characterizations obtained on the first and second levels of lifting precisely match those obtained through the statistical physics replica theory methods in \cite{ZavPeh21} for the generic and in \cite{BalMalZech19} for the ReLU activations.