AITopics | Agents

However, distributed algorithms for learning relatedness among tasks are not resilient in the presence of Byzantine agents. In this paper, we present an approach for Byzantine resilient distributed multi-task learning. We propose an efficient online weight assignment rule by measuring the accumulated loss using an agent's data and its neighbors' models. A small accumulated loss indicates a large similarity between the two tasks.
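The weight assignment idea in the excerpt above can be illustrated with a minimal sketch. It is not the paper's actual rule, which the excerpt does not state. It assumes linear models, a squared loss, an exponentially forgotten running loss, and weights proportional to the inverse of the accumulated loss. All function names and the `forgetting` parameter are hypothetical.

```python
def squared_loss(model, x, y):
    """Loss of a neighbor's linear model on one of the agent's own samples."""
    pred = sum(w * xi for w, xi in zip(model, x))
    return (y - pred) ** 2


def update_accumulated_loss(acc, losses, forgetting=0.9):
    """Running loss per neighbor; the forgetting factor is an assumption."""
    return {k: forgetting * acc.get(k, 0.0) + (1.0 - forgetting) * v
            for k, v in losses.items()}


def assign_weights(acc, eps=1e-12):
    """Assumed rule: weight is proportional to 1 / accumulated loss."""
    inv = {k: 1.0 / (v + eps) for k, v in acc.items()}
    total = sum(inv.values())
    return {k: v / total for k, v in inv.items()}


# One online step: score each neighbor's model on the agent's own sample.
neighbor_models = {"n1": [1.0, 2.0], "n2": [5.0, -4.0]}  # illustrative values
x, y = [1.0, 1.0], 3.0
step_losses = {k: squared_loss(m, x, y) for k, m in neighbor_models.items()}
acc = update_accumulated_loss({}, step_losses)
weights = assign_weights(acc)
```

Under these assumptions, a neighbor whose model fits the agent's data poorly, as an arbitrary Byzantine model typically would, accumulates a large loss and receives a small weight.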

agent, neighbor, normal agent, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > United States > Virginia (0.04)
North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
(14 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

d37eb50d868361ea729bb4147eb3c1d8-AuthorFeedback.pdf

Neural Information Processing Systems | Aug-16-2025, 14:44:00 GMT

We thank all the reviewers for their valuable comments and appreciation of the ideas and results presented in the paper. We summarize the main questions from the reviewers and address them separately below. To Reviewer #1, Q1: Network connectivity is presumably known ... it seems all the graphs considered are com- [...] We note that the network connectivity is not assumed to be known. To Reviewer #3, Q1: Scope of the paper/Missing related work. [...]" and "FedNAS" are about [...] We can add an explanation to clarify the MTL scope of the paper.

agent, byz, experiment, (15 more...)

Technology:

Information Technology > Communications > Networks (0.57)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.31)

CAESAR: An Embodied Simulator for Generating Multimodal Referring Expression Datasets

Neural Information Processing Systems | Aug-16-2025, 14:12:19 GMT

However, existing synthetic data generation tools that provide referring expressions generally neglect nonverbal gestures.

machine learning, natural language, object-oriented architecture, (19 more...)

Country:

North America > United States > Virginia > Albemarle County > Charlottesville (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Learning to Play No-Press Diplomacy with Best Response Policy Iteration

Thomas Anthony

Neural Information Processing Systems | Aug-16-2025, 14:00:58 GMT

We consider Diplomacy, a 7-player board game designed to accentuate dilemmas resulting from many-agent interactions.

agent, checkpoint, diplomacy, (12 more...)

Country:

Europe > France (0.04)
Europe > Germany (0.04)
Europe > Italy (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

82ad13ec01f9fe44c01cb91814fd7b8c-Paper-Conference.pdf

Neural Information Processing Systems | Aug-16-2025, 13:18:49 GMT

machine learning, natural language, reinforcement learning, (17 more...)

Country:

Europe > Middle East > Malta (0.04)
Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.04)

Industry: Information Technology > Services (0.47)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
(4 more...)

A Properties of the discrepancy of synergy patterns as a legitimate pseudometric

Neural Information Processing Systems | Aug-16-2025, 12:53:46 GMT

To avoid unnecessary confusion, we denote the joint distribution of the L.H.S. as [...]. We show that the infimum of the R.H.S. is reached when [...]. Then we can update the joint distribution for the L.H.S. with [...]. [...] The start steps for employing SPD to obtain pseudo-reward: 5000. α: the factor of the regularized term in Eq. (6) [...] 0 B [...] We prove the triangle inequality by contradiction, similar to iii). [...] Each agent has to select an action from its discrete action space to move around. [...] A Recurrent Neural Network (RNN) is used in the policy to alleviate the partial observability. [...] W of edge {i, j} depicts agents' relative relations. Synergy Pattern Function ζ: a general function which could depict agents' relative relations.

artificial intelligence, machine learning, synergy pattern, (17 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.51)

SPD: Synergy Pattern Diversifying Oriented Unsupervised Multi-agent Reinforcement Learning

Neural Information Processing Systems | Aug-16-2025, 12:53:43 GMT

As for the single agent, unsupervised learning has been incorporated into RL to acquire diverse skills for the agent without extrinsic reward from the environment, and this scenario is known as unsupervised reinforcement learning (URL).

agent, discrepancy, synergy pattern, (13 more...)

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
Europe > France (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

Axioms for Learning from Pairwise Comparisons

Neural Information Processing Systems | Aug-16-2025, 12:53:28 GMT

In ML, preferences and rankings are commonly learned by fitting a probabilistic model to noisy preference data. The behavior of this learning process from the viewpoint of economic theory has previously been studied for the case where the data consists of rankings. In practice, it is more common to have only pairwise comparison data, and the formal properties of the associated learning problem are more challenging to analyze. We show that a large class of random utility models (including the Thurstone-Mosteller model), when estimated using the MLE, satisfy a Pareto efficiency condition. These models also satisfy a strong monotonicity property, which implies that the learning process is responsive to input data. On the other hand, we show that these models fail certain other consistency conditions from social choice theory, and in particular do not always follow the majority opinion. Our results inform existing and future applications of random utility models for societal decision making.
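To make "estimated using the MLE" concrete, here is a minimal sketch of fitting a Thurstone-Mosteller style model to pairwise comparison counts by gradient ascent on the log-likelihood. It is not the paper's code. It assumes P(i beats j) = Φ(u_i − u_j) with unit noise scale, a mean-zero constraint for identifiability, and a fixed step size. The function names and the `wins` dictionary format are hypothetical.

```python
import math


def norm_cdf(z):
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def norm_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def fit_thurstone(n_items, wins, lr=0.5, iters=500):
    """Gradient ascent on the log-likelihood of pairwise outcomes.

    wins[(i, j)] is the number of times item i beat item j.
    Returns one utility per item, centered to sum to zero.
    """
    u = [0.0] * n_items
    total = sum(wins.values())
    for _ in range(iters):
        grad = [0.0] * n_items
        for (i, j), count in wins.items():
            d = u[i] - u[j]
            # d/dd of log Phi(d) is phi(d) / Phi(d); clamp to avoid 0-division.
            g = count * norm_pdf(d) / max(norm_cdf(d), 1e-12)
            grad[i] += g
            grad[j] -= g
        u = [ui + lr * gi / total for ui, gi in zip(u, grad)]
        mean = sum(u) / n_items
        u = [ui - mean for ui in u]  # utilities are identified only up to a shift
    return u


# Item 0 beats item 1 in 3 of 4 comparisons.
utilities = fit_thurstone(2, {(0, 1): 3, (1, 0): 1})
```

In this two-item example the fitted difference u_0 − u_1 approaches Φ⁻¹(0.75) ≈ 0.674, so the item that wins more often gets the higher utility.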

dataset, pairwise comparison, random utility model, (13 more...)

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Canada (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.68)
