AITopics | Country

Collaborating Authors

Country

Semi-Supervised Class Discovery

Nixon, Jeremy, Liu, Jeremiah, Berthelot, David

arXiv.org Machine LearningFeb-21-2020

One promising approach to dealing with datapoints that are outside of the initial training distribution (OOD) is to create new classes that capture similarities in the datapoints previously rejected as uncategorizable. Systems that generate labels can be deployed against an arbitrary amount of data, discovering classification schemes that through training create a higher quality representation of data. We introduce the Dataset Reconstruction Accuracy, a new and important measure of the effectiveness of a model's ability to create labels. We introduce benchmarks against this Dataset Reconstruction metric. We apply a new heuristic, class learnability, for deciding whether a class is worthy of addition to the training dataset. We show that our class discovery system can be successfully applied to vision and language, and we demonstrate the value of semi-supervised learning in automatically discovering novel classes.

accuracy, dataset, learning, (12 more...)

arXiv.org Machine Learning

2002.0348

Country:

North America > United States > California > Santa Clara County > Mountain View (0.04)
Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.94)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.68)

Add feedback

Soft Threshold Weight Reparameterization for Learnable Sparsity

Kusupati, Aditya, Ramanujan, Vivek, Somani, Raghav, Wortsman, Mitchell, Jain, Prateek, Kakade, Sham, Farhadi, Ali

arXiv.org Machine LearningFeb-21-2020

Sparsity in Deep Neural Networks (DNNs) is studied extensively with the focus of maximizing prediction accuracy given an overall parameter budget. Existing methods rely on uniform or heuristic non-uniform sparsity budgets which have sub-optimal layer-wise parameter allocation resulting in a) lower prediction accuracy or b) higher inference cost (FLOPs). This work proposes Soft Threshold Reparameterization (STR), a novel use of the soft-threshold operator on DNN weights. STR smoothly induces sparsity while learning pruning thresholds thereby obtaining a non-uniform sparsity budget. Our method achieves state-of-the-art accuracy for unstructured sparsity in CNNs (ResNet50 and MobileNetV1 on ImageNet-1K), and, additionally, learns non-uniform budgets that empirically reduce the FLOPs by up to 50%. Notably, STR boosts the accuracy over existing results by up to 10% in the ultra sparse (99%) regime and can also be used to induce low-rank (structured sparsity) in RNNs. In short, STR is a simple mechanism which learns effective sparsity budgets that contrast with popular heuristics.

budget, sparsity, str, (12 more...)

arXiv.org Machine Learning

2002.03231

Country:

North America > United States (0.14)
Asia > India (0.04)

Genre: Research Report (0.63)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration

Colas, Cédric, Karch, Tristan, Lair, Nicolas, Dussoux, Jean-Michel, Moulin-Frier, Clément, Dominey, Peter Ford, Oudeyer, Pierre-Yves

arXiv.org Artificial IntelligenceFeb-21-2020

Autonomous reinforcement learning agents must be intrinsically motivated to explore their environment, discover potential goals, represent them and learn how to achieve them. As children do the same, they benefit from exposure to language, using it to formulate goals and imagine new ones as they learn their meaning. In our proposed learning architecture (IMAGINE), the agent freely explores its environment and turns natural language descriptions of interesting interactions from a social partner into potential goals. IMAGINE learns to represent goals by jointly learning a language model and a goal-conditioned reward function. Just like humans, our agent uses language compositionality to generate new goals by composing known ones. Leveraging modular model architectures based on Deep Sets and gated-attention mechanisms, IMAGINE autonomously builds a repertoire of behaviors and shows good zero-shot generalization properties for various types of generalization. When imagining its own goals, the agent leverages zero-shot generalization of the reward function to further train on imagined goals and refine its behavior. We present experiments in a simulated domain where the agent interacts with procedurally generated scenes containing objects of various types and colors, discovers goals, imagines others and learns to achieve them.

architecture, generalization, reward function, (12 more...)

arXiv.org Artificial Intelligence

2002.09253

Country: North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre:

Research Report (0.64)
Instructional Material > Course Syllabus & Notes (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

Online Learning in Contextual Bandits using Gated Linear Networks

Sezener, Eren, Hutter, Marcus, Budden, David, Wang, Jianan, Veness, Joel

arXiv.org Artificial IntelligenceFeb-21-2020

We introduce a new and completely online contextual bandit algorithm called Gated Linear Contextual Bandits (GLCB). This algorithm is based on Gated Linear Networks (GLNs), a recently introduced deep learning architecture with properties well-suited to the online setting. Leveraging data-dependent gating properties of the GLN we are able to estimate prediction uncertainty with effectively zero algorithmic overhead. We empirically evaluate GLCB compared to 9 state-of-the-art algorithms that leverage deep neural networks, on a standard benchmark suite of discrete and continuous contextual bandit problems. GLCB obtains median first-place despite being the only online method, and we further support these results with a theoretical study of its convergence properties.

contextual bandit, gln, neuron, (15 more...)

arXiv.org Artificial Intelligence

2002.11611

Country:

North America > United States > New York > New York County > New York City (0.14)
Asia > Afghanistan > Parwan Province > Charikar (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry: Education > Educational Setting > Online (0.51)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Notes on neighborhood semantics for logics of unknown truths and false beliefs

Fan, Jie

arXiv.org Artificial IntelligenceFeb-21-2020

This paper studies logics of unknown truths and false beliefs under neighborhood semantics. Intuitively, if p is true but you do not know that p, then you have an unknown truth that p; if p is false but you believe thatp, then you have a false belief thatp, or you are wrong aboutp. The notion of unknown truths is important in philosophy and formal epistemology. For instance, it is related to Verificationism, or'verification thesis' [31]. Verificationism says that all truths can be known. However, from the thesis, the unknown truth of p, formalized p Kp, gives us a consequence that all truths are actually known. In other words, the notion gives rise to a well-known counterexample to Verificationism. This is the so-called Fitch's'paradox of knowability' [13]. 1 To take another example: it gives rise to an important type of Moore sentences, which is essential to Moore's paradox, which says that one cannot claim the paradoxical sentence "p but I do not know it" [23, 18]. It is known that such a Moore sentence is unsuccessful and self-refuting (see, e.g.

false belief, logic, neighborhood frame, (14 more...)

arXiv.org Artificial Intelligence

2002.09622

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York > Tompkins County > Ithaca (0.04)
North America > United States > Indiana > Marion County > Indianapolis (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.69)

Add feedback

Emergent Communication with World Models

Cowen-Rivers, Alexander I., Naradowsky, Jason

arXiv.org Artificial IntelligenceFeb-21-2020

We introduce Language World Models, a class of language-conditional generative model which interpret natural language messages by predicting latent codes of future observations. This provides a visual grounding of the message, similar to an enhanced observation of the world, which may include objects outside of the listening agent's field-of-view. We incorporate this "observation" into a persistent memory state, and allow the listening agent's policy to condition on it, akin to the relationship between memory and controller in a World Model. We show this improves effective communication and task success in 2D gridworld speaker-listener navigation tasks. In addition, we develop two losses framed specifically for our model-based formulation to promote positive signalling and positive listening. Finally, because messages are interpreted in a generative model, we can visualize the model beliefs to gain insight into how the communication channel is utilized.

agent, arxiv preprint arxiv, communication, (11 more...)

arXiv.org Artificial Intelligence

2002.09604

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.70)

Add feedback

Training Question Answering Models From Synthetic Data

Puri, Raul, Spring, Ryan, Patwary, Mostofa, Shoeybi, Mohammad, Catanzaro, Bryan

arXiv.org Artificial IntelligenceFeb-21-2020

Question and answer generation is a data augmentation method that aims to improve question answering (QA) models given the limited amount of human labeled data. However, a considerable gap remains between synthetic and human-generated question-answer pairs. This work aims to narrow this gap by taking advantage of large language models and explores several factors such as model size, quality of pretrained models, scale of data synthesized, and algorithmic choices. On the SQuAD1.1 question answering task, we achieve higher accuracy using solely synthetic questions and answers than when using the SQuAD1.1 training set questions alone. Removing access to real Wikipedia data, we synthesize questions and answers from a synthetic corpus generated by an 8.3 billion parameter GPT-2 model. With no access to human supervision and only access to other models, we are able to train state of the art question answering networks on entirely model-generated data that achieve 88.4 Exact Match (EM) and 93.9 F1 score on the SQuAD1.1 dev set. We further apply our methodology to SQuAD2.0 and show a 2.8 absolute gain on EM score compared to prior work using synthetic data.

filtration, question generation, squad1, (14 more...)

arXiv.org Artificial Intelligence

2002.09599

Country:

North America > United States > Texas > Culberson County > Van Horn (0.14)
Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.05)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.05)
(12 more...)

Genre: Research Report (0.82)

Industry:

Media > Music (0.93)
Health & Medicine (0.68)
Leisure & Entertainment > Sports > Soccer (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A characterization of proportionally representative committees

Aziz, Haris, Lee, Barton E.

arXiv.org Artificial IntelligenceFeb-21-2020

When voters elicit ranked preferences over candidates, one particular axiom for proportional representation is Proportionality of Solid Coalitions (PSC). This axiom was advocated by Dummett [4] and has been referred to as the most important requirement for proportional representation [15, 16, 18, 19]. PSC is the subject of many theoretical and empirical studies. Theoretical studies have focused on designing voting rules that satisfy PSC; these include single transferable vote (STV) [15], Quota Borda System (QBS) [4], Schulz-STV [14], and the Expanding Approvals Rule (EAR) [2].

proportional representation, solid coalition, voter, (14 more...)

arXiv.org Artificial Intelligence

2002.09598

Country:

Oceania > Australia (0.05)
Europe > Ireland (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(4 more...)

Genre: Research Report (0.40)

Industry: Government > Voting & Elections (0.70)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

The Pragmatic Turn in Explainable Artificial Intelligence (XAI)

Páez, Andrés

arXiv.org Artificial IntelligenceFeb-21-2020

In this paper I argue that the search for explainable models and interpretable decisions in AI must be reformulated in terms of the broader project of offering a pragmatic and naturalistic account of understanding in AI. Intuitively, the purpose of providing an explanation of a model or a decision is to make it understandable to its stakeholders. But without a previous grasp of what it means to say that an agent understands a model or a decision, the explanatory strategies will lack a well-defined goal. Aside from providing a clearer objective for XAI, focusing on understanding also allows us to relax the factivity condition on explanation, which is impossible to fulfill in many machine learning models, and to focus instead on the pragmatic conditions that determine the best fit between a model and the methods and devices deployed to understand it. After an examination of the different types of understanding discussed in the philosophical and psychological literature, I conclude that interpretative or approximation models not only provide the best way to achieve the objectual understanding of a machine learning model, but are also a necessary condition to achieve post-hoc interpretability. This conclusion is partly based on the shortcomings of the purely functionalist approach to post-hoc interpretability that seems to be predominant in most recent literature.

explanation, interpretative model, knowledge, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s11023-019-09502-w

2002.09595

Country:

North America > United States > New York (0.05)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
Europe > Netherlands > South Holland > Dordrecht (0.04)
(5 more...)

Genre: Research Report (0.50)

Industry:

Media (0.67)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.82)

Add feedback

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

Edwards, Ashley D., Sahni, Himanshu, Liu, Rosanne, Hung, Jane, Jain, Ankit, Wang, Rui, Ecoffet, Adrien, Miconi, Thomas, Isbell, Charles, Yosinski, Jason

arXiv.org Artificial IntelligenceFeb-21-2020

In this paper, we introduce a novel form of value function, $Q(s, s')$, that expresses the utility of transitioning from a state $s$ to a neighboring state $s'$ and then acting optimally thereafter. In order to derive an optimal policy, we develop a forward dynamics model that learns to make next-state predictions that maximize this value. This formulation decouples actions from values while still learning off-policy. We highlight the benefits of this approach in terms of value function transfer, learning within redundant action spaces, and learning off-policy from state observations generated by sub-optimal or completely random policies. Code and videos are available at \url{sites.google.com/view/qss-paper}.

dynamic model, experiment, qss, (10 more...)

arXiv.org Artificial Intelligence

2002.09505

Country: Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback