AITopics

1910.11369

Country:

Asia > Middle East > Jordan (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
(2 more...)

arXiv.org Machine LearningOct-23-2019

TCT: A Cross-supervised Learning Method for Multimodal Sequence Representation

Li, Wubo, Zou, Wei, Li, Xiangang

Multimodalities provide promising performance than unimodality in most tasks. However, learning the semantic of the representations from multimodalities efficiently is extremely challenging. To tackle this, we propose the Transformer based Cross-modal Translator (TCT) to learn unimodal sequence representations by translating from other related multimodal sequences on a supervised learning method. Combined TCT with Multimodal Transformer Network (MTN), we evaluate MTN-TCT on the video-grounded dialogue which uses multimodality. The proposed method reports new state-of-the-art performance on video-grounded dialogue which indicates representations learned by TCT are more semantics compared to directly use unimodality.

mtn-tct, sequence, translator, (13 more...)

1911.05186

Country:

Europe > Italy (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

#artificialintelligenceOct-22-2019, 13:29:11 GMT

Papers With Code : Billion-scale semi-supervised learning for image classification

This paper presents a study of semi-supervised learning with large convolutional networks. We propose a pipeline, based on a teacher/student paradigm, that leverages a large collection of unlabelled images (up to 1 billion)... Our main goal is to improve the performance for a given target architecture, like ResNet-50 or ResNext. We provide an extensive analysis of the success factors of our approach, which leads us to formulate some recommendations to produce high-accuracy models for image classification with semi-supervised learning. As a result, our approach brings important gains to standard architectures for image, video and fine-grained classification. For instance, by leveraging one billion unlabelled images, our learned vanilla ResNet-50 achieves 81.2% top-1 accuracy on the ImageNet benchmark.

billion-scale semi-supervised learning, image classification, semi-supervised learning, (2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.96)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.68)

arXiv.org Machine LearningOct-21-2019

Fragment Graphical Variational AutoEncoding for Screening Molecules with Small Data

Armitage, John, Spalek, Leszek J., Nguyen, Malgorzata, Nikolka, Mark, Jacobs, Ian, Marañón, Lorena, Nasrallah, Iyad, Schweicher, Guillaume, Dimov, Ivan, Simatos, Dimitrios, McCulloch, Ian, Nelson, Christian B., Conduit, Gareth, Sirringhaus, Henning

In the majority of molecular optimization tasks, predictive machine learning (ML) models are limited due to the unavailability and cost of generating big experimental datasets on the specific task. To circumvent this limitation, ML models are trained on big theoretical datasets or experimental indicators of molecular suitability that are either publicly available or inexpensive to acquire. These approaches produce a set of candidate molecules which have to be ranked using limited experimental data or expert knowledge. Under the assumption that structure is related to functionality, here we use a molecular fragment-based graphical autoencoder to generate unique structural fingerprints to efficiently search through the candidate set. We demonstrate that fragment-based graphical autoencoding reduces the error in predicting physical characteristics such as the solubility and partition coefficient in the small data regime compared to other extended circular fingerprints and string based approaches. We further demonstrate that this approach is capable of providing insight into real world molecular optimization problems, such as searching for stabilization additives in organic semiconductors by accurately predicting 92% of test molecules given 69 training examples. This task is a model example of black box molecular optimization as there is minimal theoretical and experimental knowledge to accurately predict the suitability of the additives.

fragment, molecular structure, molecule, (11 more...)

1910.13325

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Saudi Arabia (0.04)

Genre: Research Report (0.82)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.47)

#artificialintelligenceOct-19-2019, 09:35:49 GMT

Billion-scale semi-supervised learning for state-of-the-art image and video classification

Accurate image and video classification is important for a wide range of computer vision applications, from identifying harmful content, to making products more accessible to the visually impaired, to helping people more easily buy and sell things on products like Marketplace. Facebook AI is developing alternative ways to train our AI systems so that we can do more with less labeled training data overall, and also deliver accurate results even when large, high-quality labeled data sets are simply not available. Today, we are sharing details on a versatile new model training technique that delivers state-of-the-art accuracy for image and video classification systems. This approach, which we call semi-weak supervision, is a new way to combine the merits of two different training methods: semi-supervised learning and weakly supervised learning. It opens the door the door to creating more accurate, efficient production classification models by using a teacher-student model training paradigm and billion-scale weakly supervised data sets.

accuracy, classification model, percent top-1 accuracy, (12 more...)

Industry: Education (0.38)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Video Understanding (0.83)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.76)

#artificialintelligenceOct-19-2019, 00:54:40 GMT

Lecture01 Supervised Learning Setup of [Cornell CS4780 Machine Learning for Decision Making SP17]

cs4780 machine learning, lecture01 supervised learning setup, supervised learning setup, (2 more...)

Technology:

Information Technology > Communications > Social Media (0.76)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.51)

arXiv.org Artificial IntelligenceOct-17-2019

Executing Instructions in Situated Collaborative Interactions

Suhr, Alane, Yan, Claudia, Schluger, Jacob, Yu, Stanley, Khader, Hadi, Mouallem, Marwa, Zhang, Iris, Artzi, Yoav

We study a collaborative scenario where a user not only instructs a system to complete tasks, but also acts alongside it. This allows the user to adapt to the system abilities by changing their language or deciding to simply accomplish some tasks themselves, and requires the system to effectively recover from errors as the user strategically assigns it new goals. We build a game environment to study this scenario, and learn to map user instructions to system actions. We introduce a learning approach focused on recovery from cascading errors between instructions, and modeling methods to explicitly reason about instructions with multiple goals. We evaluate with a new evaluation protocol using recorded interactions and online games with human users, and observe how users adapt to the system abilities.

follower, instruction, interaction, (15 more...)

arXiv.org Artificial Intelligence

1910.03655

Country: North America > United States > New York (0.04)

Genre: Research Report (0.81)

Industry: Leisure & Entertainment > Games > Computer Games (0.86)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

arXiv.org Machine LearningOct-15-2019

Learning Classifiers on Positive and Unlabeled Data with Policy Gradient

Li, Tianyu, Wang, Chien-Chih, Ma, Yukun, Ortal, Patricia, Zhao, Qifang, Stenger, Bjorn, Hirate, Yu

--Existing algorithms aiming to learn a binary classifier from positive (P) and unlabeled (U) data require estimating the class prior or label noise ahead of building a classification model. However, the estimation and classifier learning are normally conducted in a pipeline instead of being jointly optimized. In this paper, we propose to alternatively train the two steps using reinforcement learning. Our proposal adopts a policy network to adaptively make assumptions on the labels of unlabeled data, while a classifier is built upon the output of the policy network and provides rewards to learn a better policy. The dynamic and interactive training between the policy maker and the classifier can exploit the unlabeled data in a more effective manner and yield a significant improvement in terms of classification performance. Furthermore, we present two different approaches to represent the actions taken by the policy. The first approach considers continuous actions as soft labels, while the other uses discrete actions as hard assignment of labels for unlabeled examples. We validate the effectiveness of the proposed method on two public benchmark datasets as well as one e-commerce dataset. The results show that the proposed method is able to consistently outperform state-of-the-art methods in various settings. PU learning refers to the problem of learning from a dataset where only a subset of examples are positively labeled and the rest are not annotated at all. It is a critical task due to its prevalence in various real-world applications [1], [2], [3]. In many common situations only positive data are available, for instance, an e-commerce website may only record users who have clicked on advertisements or purchased items. Meanwhile, it is not possible to simply assume that unlabeled instances are negative.

classifier, dataset, policy network, (15 more...)

1910.06535

Country:

Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.14)
North America > United States > Nevada (0.04)
North America > United States > Florida > Brevard County > Melbourne (0.04)
(9 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Information Technology > Services > e-Commerce Services (0.74)
Education > Educational Setting > Online (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.69)

#artificialintelligenceOct-10-2019, 00:39:12 GMT

Only 1 in 5 enterprises have DMARC records set up with an enforcement policy

Security company Vailmail released the Summer 2019 Email Fraud Landscape report on Tuesday highlighting recent efforts by enterprises to protect email accounts from cyberthreats. The report mostly focuses on the adoption rate of Domain-based Message Authentication, Reporting and Conformance (DMARC), a system that allows email domain owners to protect their domain from unauthorized use or "spoofing." Vailmail's researchers found that most enterprises were taking a positive step forward and saw a huge spike in DMARC adoption worldwide. Yet despite widespread adoption, the study found more than 90% of enterprise domains remain vulnerable to email impersonation attacks. By using DMARC and other similar authentication systems, domain owners can publish text files in the Domain Name System (DNS) laying out specific policies for how mail receivers should deal with unauthenticated email that appears to come from their domains.

dmarc record, enforcement policy, vailmail, (6 more...)

Genre: Research Report (0.41)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.41)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.41)

McCreery, Clara, Katariya, Namit, Kannan, Anitha, Chablani, Manish, Amatriain, Xavier

Domain-Relevant Embeddings for Medical Question Similarity

arXiv.org Machine LearningOct-9-2019

The rate at which medical questions are asked online significantly exceeds the capacity of qualified people to answer them, leaving many questions unanswered or inadequately answered. Many of these questions are not unique, and reliable identification of similar questions would enable more efficient and effective question answering schema. While many research efforts have focused on the problem of general question similarity, these approaches do not generalize well to the medical domain, where medical expertise is often required to determine semantic similarity. In this paper, we show how a semi-supervised approach of pre-training a neural network on medical question-answer pairs is a particularly useful intermediate task for the ultimate goal of determining medical question similarity. While other pre-training tasks yield an accuracy below 78.7% on this task, our model achieves an accuracy of 82.6% with the same number of training examples, an accuracy of 80.0% with a much smaller training set, and an accuracy of 84.5% when the full corpus of medical question-answer data is used.

dataset, question pair, similarity, (17 more...)

1910.04192

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > Italy > Tuscany > Florence (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.66)