AITopics

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.70)

Neural Information Processing Systems, Feb-10-2026, 21:48:06 GMT

e7023ba77a45f7e84c5ee8a28dd63585-Paper.pdf

dialog, q-bot, task performance, (15 more...)

Country:

North America > United States > Oregon (0.04)
North America > United States > California (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Neural Information Processing Systems, Feb-10-2026, 21:47:54 GMT

e7023ba77a45f7e84c5ee8a28dd63585-AuthorFeedback.pdf

dialog, interpretability, q-bot, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Neural Information Processing Systems, Aug-17-2025, 01:41:42 GMT

9 Supplement Overview

This document contains supplementary material for "Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data". The main paper excludes some details, which we provide here. Section 12 reports the ablations we use to evaluate the effects of different aspects of the proposed Q-bot. This section describes our architecture in more detail. There is a minor notation difference between this section and the main paper. Note that for the planner there is an additional residual connection at line 16 which augments the hidden state.

artificial intelligence, machine learning, natural language, (18 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.70)

Neural Information Processing Systems, Aug-17-2025, 01:41:37 GMT

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data

Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee

Such agents could leverage a larger variety of existing data to generalize to new tasks, minimizing expensive data collection and annotation. In this work, we study a setting we call "Dialog

artificial intelligence, machine learning, natural language, (18 more...)

Country:

North America > United States > Oregon (0.04)
North America > United States > California (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.65)

Neural Information Processing Systems, Aug-17-2025, 01:41:25 GMT

e7023ba77a45f7e84c5ee8a28dd63585-AuthorFeedback.pdf

artificial intelligence, interpretability, machine learning, (10 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

arXiv.org Artificial Intelligence, Jul-24-2020

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data

Cogswell, Michael, Lu, Jiasen, Jain, Rishabh, Lee, Stefan, Parikh, Devi, Batra, Dhruv

Can we develop visually grounded dialog agents that can efficiently adapt to new tasks without forgetting how to talk to people? Such agents could leverage a larger variety of existing data to generalize to new tasks, minimizing expensive data collection and annotation. In this work, we study a setting we call "Dialog without Dialog", which requires agents to develop visually grounded dialog models that can adapt to new tasks without language level supervision. By factorizing intention and language, our model minimizes linguistic drift after fine-tuning for new tasks. We present qualitative results, automated metrics, and human studies that all show our model can adapt to new tasks and maintain language quality. Baselines either fail to perform well at new tasks or experience language drift, becoming unintelligible to humans. Code has been made available at https://github.com/mcogswell/dialog_without_dialog

artificial intelligence, machine learning, natural language, (19 more...)

2007.12750

Country:

North America > United States > Oregon (0.04)
North America > United States > California (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.85)

Liang, Paul Pu, Chen, Jeffrey, Salakhutdinov, Ruslan, Morency, Louis-Philippe, Kottur, Satwik

On Emergent Communication in Competitive Multi-Agent Teams

arXiv.org Artificial Intelligence, Mar-3-2020

Several recent works have found the emergence of grounded compositional language in the communication protocols developed by mostly cooperative multi-agent systems when learned end-to-end to maximize performance on a downstream task. However, human populations learn to solve complex tasks involving communicative behaviors not only in fully cooperative settings but also in scenarios where competition acts as an additional external pressure for improvement. In this work, we investigate whether competition for performance from an external, similar agent team could act as a social influence that encourages multi-agent populations to develop better communication protocols for improved performance, compositionality, and convergence speed. We start from Task & Talk, a previously proposed referential game between two cooperative agents, as our testbed and extend it into Task, Talk & Compete, a game involving two competitive teams, each consisting of two of the aforementioned cooperative agents. Using this new setting, we provide an empirical study demonstrating the impact of competitive influence on multi-agent teams. Our results show that an external competitive influence leads to improved accuracy and generalization, as well as faster emergence of communicative languages that are more informative and compositional.

a-bot, agent, (17 more...)

2003.01848

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(8 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Government (0.46)
Education (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (1.00)

Murahari, Vishvak, Chattopadhyay, Prithvijit, Batra, Dhruv, Parikh, Devi, Das, Abhishek

Improving Generative Visual Dialog by Answering Diverse Questions

arXiv.org Artificial Intelligence, Sep-23-2019

Prior work on training generative Visual Dialog models with reinforcement learning (Das et al.) has explored a Qbot-Abot image-guessing game and shown that this 'self-talk' approach can lead to improved performance at the downstream dialog-conditioned image-guessing task. However, this improvement saturates and starts degrading after a few rounds of interaction, and does not lead to a better Visual Dialog model. We find that this is due in part to repeated interactions between Qbot and Abot during self-talk, which are not informative with respect to the image. To improve this, we devise a simple auxiliary objective that incentivizes Qbot to ask diverse questions, thus reducing repetitions and in turn enabling Abot to explore a larger state space during RL, i.e., be exposed to more visual concepts to talk about and varied questions to answer. We evaluate our approach via a host of automatic metrics and human studies, and demonstrate that it leads to better dialog, i.e., dialog that is more diverse (i.e., less repetitive), consistent (i.e., has fewer conflicting exchanges), fluent (i.e., more human-like), and detailed, while still being as image-relevant as prior work and ablations.

a-bot, dialog, q-bot, (17 more...)

1909.10470

Country: North America > United States (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)

arXiv.org Artificial Intelligence, Apr-19-2019

Emergence of Compositional Language with Deep Generational Transmission

Cogswell, Michael, Lu, Jiasen, Lee, Stefan, Parikh, Devi, Batra, Dhruv

Consider a collaborative task that requires communication. Two agents are placed in an environment and must create a language from scratch in order to coordinate. Recent work has been interested in what kinds of languages emerge when deep reinforcement learning agents are put in such a situation, and in particular in the factors that cause language to be compositional, i.e., meaning is expressed by combining words which themselves have meaning. Evolutionary linguists have also studied the emergence of compositional language for decades, and they find that in addition to structural priors like those already studied in deep learning, the dynamics of transmitting language from generation to generation contribute significantly to the emergence of compositionality. In this paper, we introduce these cultural evolutionary dynamics into language emergence by periodically replacing agents in a population to create a knowledge gap, implicitly inducing cultural transmission of language. We show that this implicit cultural transmission encourages the resulting languages to exhibit better compositional generalization and suggest how elements of cultural dynamics can be further integrated into populations of deep agents.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

1904.09067

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)