
Collaborating Authors

Mansour




Reviews: Regret Minimization for Reinforcement Learning with Vectorial Feedback and Complex Objectives

Neural Information Processing Systems

Two out of three reviewers appreciated the contributions of this paper, with one expert reviewer praising almost every aspect of the paper. On the negative side, one reviewer took issue with the proposed setting, highlighting that the utility of the proposed objective function is somewhat dubious in the general context of multi-objective decision making. I agree with this reviewer in that having "multi-objective" in the title of the paper may set the wrong expectations for some readers, and I suggest that the authors consider changing the title of the paper for its final version to avoid such misunderstandings. Furthermore, the final version should discuss the relationship between this paper and the very recent work of Rosenberg and Mansour (2019) that studies essentially the same problem in episodic MDPs. Other than these concerns, the paper is worthy of being published without major changes.


Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization

Tiapkin, Daniil, Chzhen, Evgenii, Stoltz, Gilles

arXiv.org Artificial Intelligence

In this paper, we consider the problem of learning in adversarial Markov decision processes (MDPs) with an oblivious adversary in a full-information setting. The agent interacts with an environment during $T$ episodes, each of which consists of $H$ stages, and each episode is evaluated with respect to a reward function that is revealed only at the end of the episode. We propose an algorithm, called APO-MVP, that achieves a regret bound of order $\tilde{\mathcal{O}}(\mathrm{poly}(H)\sqrt{SAT})$, where $S$ and $A$ are the sizes of the state and action spaces, respectively. This result improves upon the best-known regret bound by a factor of $\sqrt{S}$, bridging the gap between adversarial and stochastic MDPs and matching the minimax lower bound $\Omega(\sqrt{H^3SAT})$ as far as the dependencies on $S$, $A$, and $T$ are concerned. The proposed algorithm and analysis completely avoid the typical tool of occupancy measures; instead, policy optimization is performed using only dynamic programming and a black-box online linear optimization strategy run over estimated advantage functions, making the algorithm easy to implement. The analysis leverages two recent techniques: policy optimization based on online linear optimization strategies (Jonckheere et al., 2023) and a refined martingale analysis of the impact of estimating transition kernels on value functions (Zhang et al., 2023).
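The core step behind occupancy-measure-free policy optimization of this kind can be illustrated with a simple exponential-weights update on estimated advantages. This is only a sketch of the general scheme, not the paper's exact APO-MVP procedure; all names and shapes below are assumptions.

```python
import numpy as np

def exp_weights_policy_update(policy, advantages, lr):
    """One exponential-weights (mirror-descent-style) policy update.

    policy:     array (H, S, A) of action probabilities per stage/state
    advantages: array (H, S, A) of estimated advantage values
    lr:         learning rate of the online linear optimization strategy
    """
    logits = np.log(policy) + lr * advantages
    logits -= logits.max(axis=-1, keepdims=True)   # numerical stability
    new_policy = np.exp(logits)
    new_policy /= new_policy.sum(axis=-1, keepdims=True)
    return new_policy
```

Running this update per episode shifts probability mass toward actions with higher estimated advantage while keeping each state's distribution normalized.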


Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation

Levy, Orin, Cohen, Alon, Cassel, Asaf, Mansour, Yishay

arXiv.org Artificial Intelligence

We present the OMG-CMDP! algorithm for regret minimization in adversarial Contextual MDPs. The algorithm operates under the minimal assumptions of realizable function class and access to online least squares and log loss regression oracles. Our algorithm is efficient (assuming efficient online regression oracles), simple and robust to approximation errors. It enjoys an $\widetilde{O}(H^{2.5} \sqrt{ T|S||A| ( \mathcal{R}(\mathcal{O}) + H \log(\delta^{-1}) )})$ regret guarantee, with $T$ being the number of episodes, $S$ the state space, $A$ the action space, $H$ the horizon and $\mathcal{R}(\mathcal{O}) = \mathcal{R}(\mathcal{O}_{\mathrm{sq}}^\mathcal{F}) + \mathcal{R}(\mathcal{O}_{\mathrm{log}}^\mathcal{P})$ is the sum of the regression oracles' regret, used to approximate the context-dependent rewards and dynamics, respectively. To the best of our knowledge, our algorithm is the first efficient rate optimal regret minimization algorithm for adversarial CMDPs that operates under the minimal standard assumption of online function approximation.
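An online least-squares oracle of the kind assumed above can be instantiated, for illustration, by online ridge regression; this is just one possible oracle satisfying the predict/update interface, and the class and parameter names below are not from the paper.

```python
import numpy as np

class OnlineRidge:
    """Illustrative online least-squares regression oracle (ridge)."""

    def __init__(self, dim, reg=1.0):
        self.A = reg * np.eye(dim)   # regularized Gram matrix
        self.b = np.zeros(dim)       # accumulated feature-target products

    def predict(self, x):
        # Ridge solution w = A^{-1} b evaluated at feature vector x
        return x @ np.linalg.solve(self.A, self.b)

    def update(self, x, y):
        # Incorporate one observed (feature, target) pair
        self.A += np.outer(x, x)
        self.b += y * x
```

The algorithm itself only needs black-box access to such predict/update calls, which is what makes the "efficient regression oracles imply efficient algorithm" claim meaningful.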


TomoSAM: a 3D Slicer extension using SAM for tomography segmentation

Semeraro, Federico, Quintart, Alexandre, Izquierdo, Sergio Fraile, Ferguson, Joseph C.

arXiv.org Artificial Intelligence

Many deep learning techniques have been proposed to perform the segmentation task, but they usually require a large amount of labeled data to train, as well as considerable computational resources. Our work is motivated by the objective of modeling Thermal Protection Systems (TPS), in order to estimate their material properties and predict their response to the harsh environmental conditions experienced during atmospheric entry. The Porous Microstructure Analysis (PuMA) software [1] was developed to provide a robust and efficient framework for the digital characterization of 3D microstructures.


Refined Analysis of FPL for Adversarial Markov Decision Processes

Wang, Yuanhao, Dong, Kefan

arXiv.org Machine Learning

We consider the adversarial Markov Decision Process (MDP) problem, where the rewards for the MDP can be adversarially chosen, and the transition function can be either known or unknown. In both settings, Follow-the-Perturbed-Leader (FPL) based algorithms have been proposed in previous literature. However, the established regret bounds for FPL-based algorithms are worse than those of algorithms based on mirror descent. We improve the analysis of FPL-based algorithms in both settings, matching the current best regret bounds using faster and simpler algorithms.
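For reference, the basic FPL rule in the experts setting looks like the following; this is a generic textbook sketch, not the refined variant analyzed in the paper.

```python
import numpy as np

def fpl_choose(cum_rewards, eta, rng):
    """Follow-the-Perturbed-Leader: perturb cumulative rewards with
    i.i.d. exponential noise of scale 1/eta, then follow the expert
    that is best under the perturbed scores."""
    noise = rng.exponential(scale=1.0 / eta, size=len(cum_rewards))
    return int(np.argmax(cum_rewards + noise))
```

The perturbation makes the choice stable from round to round, which is what drives FPL's regret guarantees; larger `eta` means less noise and a closer approximation of plain follow-the-leader.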


Whistleblower: Google Partners with China on 'AI Manhattan Project' (Breitbart)

#artificialintelligence

Mansour asked Vorhies about Google's business engagements in China. "Google has gotten in trouble in the past for doing business with the Communist government of China," Mansour said. "And I wanted to ask you were those efforts ongoing when you were with the company? Can you give us any insight into that? Because it was quite troubling."


Unsupervised Domain Adaptation Based on Source-guided Discrepancy

Kuroki, Seiichi, Charoenphakdee, Nontawat, Bao, Han, Honda, Junya, Sato, Issei, Sugiyama, Masashi

arXiv.org Machine Learning

Unsupervised domain adaptation is the problem setting where data generating distributions in the source and target domains are different, and labels in the target domain are unavailable. One important question in unsupervised domain adaptation is how to measure the difference between the source and target domains. A previously proposed discrepancy that does not use the source domain labels requires high computational cost to estimate and may lead to a loose generalization error bound in the target domain. To mitigate these problems, we propose a novel discrepancy called source-guided discrepancy ($S$-disc), which exploits labels in the source domain. As a consequence, $S$-disc can be computed efficiently with a finite-sample convergence guarantee. In addition, we show that $S$-disc can provide a tighter generalization error bound than the one based on an existing discrepancy. Finally, we report experimental results that demonstrate the advantages of $S$-disc over the existing discrepancies.
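A plug-in estimate of a source-guided gap can be sketched as follows. This is a generic illustration of measuring domain loss gaps against a source-trained reference hypothesis, not the paper's exact definition of $S$-disc; all names here are assumptions.

```python
import numpy as np

def empirical_gap(hypotheses, h_source, X_src, X_tgt, loss):
    """Largest gap, over candidate hypotheses h, between the target-
    and source-domain losses of h measured against a source-trained
    reference hypothesis h_source (illustrative estimator only)."""
    gaps = []
    for h in hypotheses:
        src = loss(h(X_src), h_source(X_src)).mean()
        tgt = loss(h(X_tgt), h_source(X_tgt)).mean()
        gaps.append(tgt - src)
    return max(gaps)
```

When the two domains coincide, every gap vanishes, so such an estimate is zero; it grows as the domains disagree on how the candidate hypotheses deviate from the source reference.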


India is using facial-recognition to reunite missing children with their families

#artificialintelligence

In 2005, Danish brothers David and Christopher Mikkelsen met a young Afghan refugee called Mansour. Four months after fleeing Kabul and the Taliban with his parents and five siblings, Mansour became separated from them, ending up in Denmark with no idea what had happened to his family or where they were.