AITopics | Bouchard, Guillaume

Collaborating Authors

Bouchard, Guillaume

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Detecting Harmful Content On Online Platforms: What Platforms Need Vs. Where Research Efforts Go

Arora, Arnav, Nakov, Preslav, Hardalov, Momchil, Sarwar, Sheikh Muhammad, Nayak, Vibha, Dinkov, Yoan, Zlatkova, Dimitrina, Dent, Kyle, Bhatawdekar, Ameya, Bouchard, Guillaume, Augenstein, Isabelle

arXiv.org Artificial IntelligenceJun-6-2023

The proliferation of harmful content on online platforms is a major societal problem, which comes in many different forms including hate speech, offensive language, bullying and harassment, misinformation, spam, violence, graphic content, sexual abuse, self harm, and many other. Online platforms seek to moderate such content to limit societal harm, to comply with legislation, and to create a more inclusive environment for their users. Researchers have developed different methods for automatically detecting harmful content, often focusing on specific sub-problems or on narrow communities, as what is considered harmful often depends on the platform and on the context. We argue that there is currently a dichotomy between what types of harmful content online platforms seek to curb, and what research efforts there are to automatically detect such content. We thus survey existing methods as well as content moderation policies by online platforms in this light and we suggest directions for future work.

computational linguistic, data mining, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2103.00153

Country:

Europe (1.00)
North America > United States > Minnesota (0.28)
North America > United States > Massachusetts (0.28)
(3 more...)

Genre: Research Report (1.00)

Industry:

Media > News (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Services (1.00)
(4 more...)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science (1.00)
Information Technology > Communications > Social Media (1.00)
(2 more...)

Add feedback

On Inductive Abilities of Latent Factor Models for Relational Learning

Trouillon, Théo, Gaussier, Eric, Dance, Christopher R., Bouchard, Guillaume

Journal of Artificial Intelligence ResearchJan-15-2019

Latent factor models are increasingly popular for modeling multi-relational knowledge graphs. By their vectorial nature, it is not only hard to interpret why this class of models works so well, but also to understand where they fail and how they might be improved. We conduct an experimental survey of state-of-the-art models, not towards a purely comparative end, but as a means to get insight about their inductive abilities. To assess the strengths and weaknesses of each model, we create simple tasks that exhibit first, atomic properties of binary relations, and then, common inter-relational inference through synthetic genealogies. Based on these experimental results, we propose new research directions to improve on existing models.

knowledge management, machine learning, relation, (19 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.11305

AI Access Foundation

11305

Journal of Artificial Intelligence Research

Country:

Europe > France (0.14)
North America > United States (0.14)
Europe > United Kingdom (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.94)
Information Technology > Knowledge Management > Knowledge Engineering (0.94)

Add feedback

Interpretation of Natural Language Rules in Conversational Machine Reading

Saeidi, Marzieh, Bartolo, Max, Lewis, Patrick, Singh, Sameer, Rocktäschel, Tim, Sheldon, Mike, Bouchard, Guillaume, Riedel, Sebastian

arXiv.org Machine LearningAug-28-2018

Most work in machine reading focuses on question answering problems where the answer is directly expressed in the text to read. However, many real-world question answering problems require the reading of text not because it contains the literal answer, but because it contains a recipe to derive an answer together with the reader's background knowledge. One example is the task of interpreting regulations to answer "Can I...?" or "Do I have to...?" questions such as "I am working in Canada. Do I have to carry on paying UK National Insurance?" after reading a UK government website about this topic. This task requires both the interpretation of rules and the application of background knowledge. It is further complicated due to the fact that, in practice, most questions are underspecified, and a human assistant will regularly have to ask clarification questions such as "How long have you been working abroad?" when the answer cannot be directly derived from the question and text. In this paper, we formalise this task and develop a crowd-sourcing strategy to collect 32k task instances based on real-world rules and crowd-generated questions and scenarios. We analyse the challenges of this task and assess its difficulty by evaluating the performance of rule-based and machine-learning baselines. We observe promising results when no background knowledge is necessary, and substantial room for improvement whenever background knowledge is needed.

crowdsourcing, deep learning, followup question, (22 more...)

arXiv.org Machine Learning

1809.01494

Country: North America > United States (1.00)

Genre: Research Report > New Finding (0.93)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.67)
(2 more...)

Add feedback

Knowledge Graph Completion via Complex Tensor Factorization

Trouillon, Théo, Dance, Christopher R., Welbl, Johannes, Riedel, Sebastian, Gaussier, Éric, Bouchard, Guillaume

arXiv.org Artificial IntelligenceNov-26-2017

In statistical relational learning, knowledge graph completion deals with automatically understanding the structure of large knowledge graphs---labeled directed graphs---and predicting missing relationships---labeled edges. State-of-the-art embedding models propose different trade-offs between modeling expressiveness, and time and space complexity. We reconcile both expressiveness and complexity through the use of complex-valued embeddings and explore the link between such complex-valued embeddings and unitary diagonalization. We corroborate our approach theoretically and show that all real square matrices---thus all possible relation/adjacency matrices---are the real part of some unitarily diagonalizable matrix. This results opens the door to a lot of other applications of square matrices factorization. Our approach based on complex embeddings is arguably simple, as it only involves a Hermitian dot product, the complex counterpart of the standard dot product between real vectors, whereas other methods resort to more and more complicated composition functions to increase their expressiveness. The proposed complex embeddings are scalable to large data sets as it remains linear in both space and time, while consistently outperforming alternative approaches on standard link prediction benchmarks.

deep learning, neural network, relation, (16 more...)

arXiv.org Artificial Intelligence

1702.06879

Country: Europe > United Kingdom (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

On Inductive Abilities of Latent Factor Models for Relational Learning

Trouillon, Théo, Gaussier, Éric, Dance, Christopher R., Bouchard, Guillaume

arXiv.org Machine LearningSep-17-2017

logic programming, neural network, relation, (20 more...)

arXiv.org Machine Learning

1709.05666

Country:

Europe > France (0.14)
Europe > United Kingdom (0.14)
North America > United States (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.94)
Information Technology > Knowledge Management > Knowledge Engineering (0.93)

Add feedback

Complex Embeddings for Simple Link Prediction

Trouillon, Théo, Welbl, Johannes, Riedel, Sebastian, Gaussier, Éric, Bouchard, Guillaume

arXiv.org Machine LearningJun-20-2016

In statistical relational learning, the link prediction problem is key to automatically understand the structure of large knowledge bases. As in previous studies, we propose to solve this problem through latent factorization. However, here we make use of complex valued embeddings. The composition of complex embeddings can handle a large variety of binary relations, among them symmetric and antisymmetric relations. Compared to state-of-the-art models such as Neural Tensor Network and Holographic Embeddings, our approach based on complex embeddings is arguably simpler, as it only uses the Hermitian dot product, the complex counterpart of the standard dot product between real vectors. Our approach is scalable to large datasets as it remains linear in both space and time, while consistently outperforming alternative approaches on standard link prediction benchmarks.

artificial intelligence, data mining, relation, (18 more...)

arXiv.org Machine Learning

1606.06357

Country:

Europe > Greece (0.14)
Europe > France (0.14)
Europe > United Kingdom (0.14)

Genre: Research Report (0.70)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

A Factorization Machine Framework for Testing Bigram Embeddings in Knowledgebase Completion

Welbl, Johannes, Bouchard, Guillaume, Riedel, Sebastian

arXiv.org Machine LearningApr-20-2016

Embedding-based Knowledge Base Completion models have so far mostly combined distributed representations of individual entities or relations to compute truth scores of missing links. Facts can however also be represented using pairwise embeddings, i.e. embeddings for pairs of entities and relations. In this paper we explore such bigram embeddings with a flexible Factorization Machine model and several ablations from it. We investigate the relevance of various bigram types on the fb15k237 dataset and find relative improvements compared to a compositional model.

artificial intelligence, natural language, relation, (16 more...)

arXiv.org Machine Learning

1604.05878

Country:

North America > United States (0.15)
North America > Canada (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback

Online Learning to Sample

Bouchard, Guillaume, Trouillon, Théo, Perez, Julien, Gaidon, Adrien

arXiv.org Machine LearningMar-15-2016

Stochastic Gradient Descent (SGD) is one of the most widely used techniques for online optimization in machine learning. In this work, we accelerate SGD by adaptively learning how to sample the most useful training examples at each time step. First, we show that SGD can be used to learn the best possible sampling distribution of an importance sampling estimator. Second, we show that the sampling distribution of an SGD algorithm can be estimated online by incrementally minimizing the variance of the gradient. The resulting algorithm - called Adaptive Weighted SGD (AW-SGD) - maintains a set of parameters to optimize, as well as a set of parameters to sample learning examples. We show that AWSGD yields faster convergence in three different applications: (i) image classification with deep features, where the sampling of images depends on their labels, (ii) matrix factorization, where rows and columns are not sampled uniformly, and (iii) reinforcement learning, where the optimized and exploration policies are estimated at the same time, where our approach corresponds to an off-policy gradient algorithm.

algorithm, computer based training, educational technology, (22 more...)

arXiv.org Machine Learning

1506.09016

Country: Europe (0.14)

Genre: Research Report (0.64)

Industry: Education > Educational Setting > Online (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.56)

Add feedback

Approximate Inference with the Variational Holder Bound

Bouchard, Guillaume, Lakshminarayanan, Balaji

arXiv.org Machine LearningJun-19-2015

We introduce the Variational Holder (VH) bound as an alternative to Variational Bayes (VB) for approximate Bayesian inference. Unlike VB which typically involves maximization of a non-convex lower bound with respect to the variational parameters, the VH bound involves minimization of a convex upper bound to the intractable integral with respect to the variational parameters. Minimization of the VH bound is a convex optimization problem; hence the VH method can be applied using off-the-shelf convex optimization algorithms and the approximation error of the VH bound can also be analyzed using tools from convex optimization literature. We present experiments on the task of integrating a truncated multivariate Gaussian distribution and compare our method to VB, EP and a state-of-the-art numerical integration method for this problem.

bayesian inference, inequality, optimization problem, (17 more...)

arXiv.org Machine Learning

1506.061

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.49)

Add feedback

On Approximate Reasoning Capabilities of Low-Rank Vector Spaces

Bouchard, Guillaume (Xerox Research Centre Europe) | Singh, Sameer (University of Washington) | Trouillon, Théo (Xerox Research Centre Europe)

AAAI ConferencesMar-16-2015

In relational databases, relations between objects, represented by binary matrices or tensors, may be arbitrarily complex. In practice however, there are recurring relational patterns such as transitive, permutation and sequential relationships, that seem to have a regular structure not captured by the classical notion of matrix rank or tensor rank. In this paper, we show that factorizing the relational tensor using a logistic or hinge loss instead of the more standard squared loss is more appropriate because it can accurately model many common relations with a fixed-size embedding that depends sub-linearly on the number of entities in the knowledge base. We illustrate this fact empirically by being able to efficiently predict missing links in several synthetic and real-world experiments. Further, we provide theoretical justification for logistic loss by studying its connection to a complexity measure from the field of information complexity called the sign rank. Sign rank is a more appropriate complexity measure as it has a low value for transitive, permutation, or sequential relationships, while being large for uniformly sampled binary matrices/tensors with a high probability.

approximate reasoning capability, low-rank vector space

AAAI Conferences

2015 AAAI Spring Symposium Series

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.40)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.40)

Add feedback