AITopics | Edmonton

Aleatoric and Epistemic Uncertainty in Machine Learning: A Tutorial Introduction

arXiv.org Machine Learning | Oct-21-2019

The notion of uncertainty is of major importance in machine learning and constitutes a key element of machine learning methodology. In line with the statistical tradition, uncertainty has long been perceived as almost synonymous with standard probability and probabilistic predictions. Yet, due to the steadily increasing relevance of machine learning for practical applications and related issues such as safety requirements, new problems and challenges have recently been identified by machine learning scholars, and these problems may call for new methodological developments. In particular, this includes the importance of distinguishing between (at least) two different types of uncertainty, often referred to as aleatoric and epistemic. In this paper, we provide an introduction to the topic of uncertainty in machine learning as well as an overview of existing attempts at handling uncertainty in general and formalizing this distinction in particular.

1 Introduction

Machine learning is essentially concerned with extracting models from data and using these models to make predictions.

epistemic uncertainty, prediction, probability, (17 more...)

arXiv.org Machine Learning

1910.09457

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(13 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Estimator Vectors: OOV Word Embeddings based on Subword and Context Clue Estimates

Patel, Raj, Domeniconi, Carlotta

arXiv.org Machine Learning | Oct-18-2019

Semantic representations of words have been successfully extracted from unlabeled corpora using neural network models like word2vec. These representations are generally high quality and are computationally inexpensive to train, making them popular. However, these approaches generally fail to approximate out-of-vocabulary (OOV) words, a task humans can do quite easily using word roots and context clues. This paper proposes a neural network model that learns high-quality word representations, subword representations, and context clue representations jointly. Learning all three types of representations together enhances the learning of each, leading to enriched word vectors, along with strong estimates for OOV words, via the combination of the corresponding context clue and subword embeddings. Our model, called Estimator Vectors (EV), learns strong word embeddings and is competitive with state-of-the-art methods for OOV estimation.

1 Introduction

Semantic representations of words are useful for many natural language processing (NLP) tasks. While there exist many ways to learn them, models like word2vec [11] and GloVe [15] have been shown to be very efficient at producing high-quality word embeddings. These embeddings not only capture similarity between words, but also capture some algebraic relationships between words. These models, though, also have some downsides. One major drawback is that they can only learn embeddings for words in the vocabulary, which is determined by the corpus they were trained on.

context clue, representation, vector, (15 more...)

arXiv.org Machine Learning

1910.10491

Country:

North America > United States > Virginia > Fairfax County > Fairfax (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

An MDL-Based Classifier for Transactional Datasets with Application in Malware Detection

Asadi, Behzad, Varadharajan, Vijay

arXiv.org Machine Learning | Oct-8-2019

We design a classifier for transactional datasets with application in malware detection. We build the classifier based on the minimum description length (MDL) principle. This involves selecting, for each class, a model that best compresses the training dataset under the MDL criterion. To select a model for a dataset, we first use clustering followed by closed frequent pattern mining to extract a subset of closed frequent patterns (CFPs). We show that this method acts as a pattern summarization method that avoids pattern explosion; this is done by giving priority to longer CFPs, without requiring the extraction of all CFPs. We then use the MDL criterion to further summarize the extracted patterns and construct a code table of patterns. This code table is taken as the selected model for the compression of the dataset. We evaluate our classifier on the problem of static malware detection in portable executable (PE) files. We consider the API calls of PE files as their distinguishing features. The presence or absence of API calls forms a transactional dataset. Using our proposed method, we construct two code tables, one for the benign training dataset and one for the malware training dataset. Our dataset consists of 19696 benign and 19696 malware samples, each a binary sequence of size 22761. We compare our classifier with deep neural networks, which provide state-of-the-art performance. The comparison shows that our classifier performs very close to deep neural networks. We also discuss the interpretability of our classifier. This provides the motivation to use this type of classifier where some degree of explanation is required as to why a sample is classified under one class rather than the other.
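The classification idea in this abstract (one code table per class, with a sample assigned to the class whose table compresses it best) can be illustrated with a small sketch. This is not the authors' implementation: the greedy longest-first cover, the add-one smoothing, the `unseen_penalty` value, and the toy API-call sets are all assumptions made for illustration, and the patterns are supplied by hand rather than mined with clustering and CFP extraction.

```python
import math
from collections import Counter


def cover(transaction, patterns):
    """Greedily cover a transaction with non-overlapping patterns, longest first."""
    remaining = set(transaction)
    used = []
    for pat in sorted(patterns, key=lambda p: (-len(p), sorted(p))):
        if pat and pat <= remaining:
            used.append(pat)
            remaining -= pat
    return used, remaining


def build_code_table(transactions, patterns):
    """Map each pattern to a Shannon code length derived from its usage in the cover.

    Singleton patterns are added for every item so that each training
    transaction can always be covered completely.
    """
    items = {item for t in transactions for item in t}
    table = set(patterns) | {frozenset([item]) for item in items}
    usage = Counter()
    for t in transactions:
        used, _ = cover(t, table)
        usage.update(used)
    total = sum(usage.values()) + len(table)  # add-one smoothing (assumption)
    return {p: -math.log2((usage[p] + 1) / total) for p in table}


def encoded_length(transaction, code_table, unseen_penalty=16.0):
    """Bits needed to encode a transaction; unseen items cost a flat penalty (assumption)."""
    used, leftover = cover(transaction, code_table.keys())
    return sum(code_table[p] for p in used) + unseen_penalty * len(leftover)


def classify(transaction, tables):
    """Return the label whose code table gives the shortest encoding."""
    return min(tables, key=lambda label: encoded_length(transaction, tables[label]))


# Hypothetical toy data: each transaction is the set of API calls present in a file.
benign = [{"CreateFile", "ReadFile"}, {"CreateFile", "ReadFile", "CloseHandle"}]
malware = [
    {"VirtualAlloc", "WriteProcessMemory"},
    {"VirtualAlloc", "WriteProcessMemory", "CreateRemoteThread"},
]
tables = {
    "benign": build_code_table(benign, [frozenset({"CreateFile", "ReadFile"})]),
    "malware": build_code_table(
        malware, [frozenset({"VirtualAlloc", "WriteProcessMemory"})]
    ),
}
print(classify({"VirtualAlloc", "WriteProcessMemory"}, tables))
```

Because the decision reduces to comparing which patterns were used to cover a sample under each table, the patterns themselves serve as the explanation for a prediction, which is the interpretability argument the abstract makes.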

algorithm, code table, dataset, (15 more...)

arXiv.org Machine Learning

1910.03751

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Oceania > Australia (0.04)
North America > United States > Missouri > Jackson County > Kansas City (0.04)
(4 more...)

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory > Minimum Complexity Machines (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Fine-Grained Analysis of Propaganda in News Articles

Martino, Giovanni Da San, Yu, Seunghak, Barrón-Cedeño, Alberto, Petrov, Rostislav, Nakov, Preslav

arXiv.org Artificial Intelligence | Oct-6-2019

Propaganda aims at influencing people's mindset with the purpose of advancing a specific agenda. Previous work has addressed propaganda detection at the document level, typically labelling all articles from a propagandistic news outlet as propaganda. Such noisy gold labels inevitably affect the quality of any learning system trained on them. A further issue with most existing systems is the lack of explainability. To overcome these limitations, we propose a novel task: performing fine-grained analysis of texts by detecting all fragments that contain propaganda techniques as well as their type. In particular, we create a corpus of news articles manually annotated at the fragment level with eighteen propaganda techniques and we propose a suitable evaluation measure. We further design a novel multi-granularity neural network, and we show that it outperforms several strong BERT-based baselines.

annotator, corpus, propaganda technique, (16 more...)

arXiv.org Artificial Intelligence

1910.02517

Country:

Asia > Russia (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
(23 more...)

Genre: Research Report (0.40)

Industry:

Media > News (1.00)
Government (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.94)

Annotated Guidelines and Building Reference Corpus for Myanmar-English Word Alignment

Han, Nway Nway, Thida, Aye

arXiv.org Artificial Intelligence | Sep-25-2019

A reference corpus for word alignment is an important resource for developing and evaluating word alignment methods. For the Myanmar-English language pair, there is no reference corpus for evaluating word alignment tasks. Therefore, we created guidelines for Myanmar-English word alignment annotation between the two languages over contrastive learning and built a Myanmar-English reference corpus consisting of verified alignments from the Myanmar portion of the Asian Language Treebank (ALT). This reference corpus contains the confidence labels sure (S) and possible (P) for word alignments, which are used for evaluating word alignment tasks. We discuss the most linking ambiguities to define consistent and systematic instructions for aligning words manually. We evaluated annotator agreement using our reference corpus in terms of alignment error rate (AER) in word alignment tasks and discuss word relationships in terms of BLEU scores. A bilingual corpus aligned at the level of sentences or words is a precious resource for developing machine translation systems. Word alignment is a fundamental step in extracting translation information from a bilingual corpus and determines which words and phrases are translations of each other in the original and translated sentences. In most translation systems, translational correspondences are rather complex, particularly for a language pair such as Myanmar and English, which have different word orders.
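For readers unfamiliar with the metric, alignment error rate over sure (S) and possible (P) links is commonly computed as AER = 1 - (|A ∩ S| + |A ∩ P|) / (|A| + |S|), where A is the set of predicted links and S is treated as a subset of P. The sketch below assumes that common definition; the function name and the toy link sets are hypothetical, and the paper's exact evaluation setup is not described in this excerpt.

```python
def alignment_error_rate(predicted, sure, possible):
    """Compute AER = 1 - (|A & S| + |A & P|) / (|A| + |S|).

    Each argument is an iterable of (source_index, target_index) links.
    Sure links are also counted as possible links, following the usual convention.
    """
    a = set(predicted)
    s = set(sure)
    p = set(possible) | s
    if not a and not s:
        raise ValueError("AER is undefined when both A and S are empty")
    return 1.0 - (len(a & s) + len(a & p)) / (len(a) + len(s))


# Hypothetical toy example: three predicted links, two sure links, one possible link.
predicted = {(0, 0), (1, 1), (2, 3)}
sure = {(0, 0), (1, 1)}
possible = {(2, 2)}
print(alignment_error_rate(predicted, sure, possible))
```

Lower values are better: a prediction that reproduces exactly the sure links scores 0, and predicted links that fall outside P increase the error.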

alignment, myanmar, reference corpus, (14 more...)

arXiv.org Artificial Intelligence

1909.11288

Country:

Asia > Myanmar > Mandalay Region > Mandalay (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
(7 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

Leveraging Implicit Expert Knowledge for Non-Circular Machine Learning in Sepsis Prediction

Schamoni, Shigehiko, Lindner, Holger A., Schneider-Lindner, Verena, Thiel, Manfred, Riezler, Stefan

arXiv.org Machine Learning | Sep-20-2019

Sepsis is the leading cause of death in non-coronary intensive care units. Moreover, a delay of antibiotic treatment of patients with severe sepsis by only a few hours is associated with increased mortality. This insight makes accurate models for early prediction of sepsis a key task in machine learning for healthcare. Previous approaches have achieved high AUROC by learning from electronic health records where sepsis labels were defined automatically following established clinical criteria. We argue that the practice of incorporating the clinical criteria that are used to automatically define ground truth sepsis labels as features of severity scoring models is inherently circular and compromises the validity of the proposed approaches. We propose to create an independent ground truth for sepsis research by exploiting implicit knowledge of clinical practitioners via an electronic questionnaire which records attending physicians' daily judgements of patients' sepsis status. We show that despite its small size, our dataset allows us to achieve state-of-the-art AUROC scores. An inspection of learned weights for standardized features of the linear model lets us infer potentially surprising feature contributions and allows us to interpret seemingly counterintuitive findings.

sepsis, sepsis patient, septic shock, (14 more...)

arXiv.org Machine Learning

doi: 10.1016/j.artmed.2019.101725

1909.09557

Country:

Europe > Germany (0.04)
South America > Uruguay > Artigas > Artigas (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(6 more...)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

What is this Article about? Extreme Summarization with Topic-aware Convolutional Neural Networks

Narayan, Shashi, Cohen, Shay B., Lapata, Mirella

Journal of Artificial Intelligence Research | Sep-19-2019

We introduce "extreme summarization," a new single-document summarization task which aims at creating a short, one-sentence news summary answering the question "What is the article about?". We argue that extreme summarization, by nature, is not amenable to extractive strategies and requires an abstractive modeling approach. In the hope of driving research on this task further: (a) we collect a real-world, large scale dataset by harvesting online articles from the British Broadcasting Corporation (BBC); and (b) propose a novel abstractive model which is conditioned on the article's topics and based entirely on convolutional neural networks. We demonstrate experimentally that this architecture captures long-range dependencies in a document and recognizes pertinent content, outperforming an oracle extractive system and state-of-the-art abstractive approaches when evaluated automatically and by humans on the extreme summarization dataset.

dataset, proceedings, summarization, (11 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.11315

AI Access Foundation

11315

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
Asia > Afghanistan (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
(43 more...)

Genre:

Research Report > New Finding (1.00)
Overview (0.93)

Industry:

Media > News (1.00)
Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
(5 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Artificial Intelligence Masters The Game of Poker – What Does That Mean For Humans?

#artificialintelligence | Sep-14-2019, 00:12:30 GMT

While AI had some success at beating humans at other games such as chess and Go (games that follow predefined rules and aren't random), winning at poker proved to be more challenging because it requires strategy, intuition, and reasoning based on hidden information. Despite the challenges, artificial intelligence can now play--and win--poker. Artificial intelligence systems including DeepStack and Libratus paved the way for Pluribus, the AI that beat five other players in six-player Texas Hold'em, the most popular version of poker. This feat goes beyond games. This achievement means that artificial intelligence can now expand to help solve some of the world's most challenging issues.

artificial intelligence master, deepstack, texas hold, (8 more...)

#artificialintelligence

Country:

North America > United States > Texas (0.37)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.06)

Industry: Leisure & Entertainment > Games > Poker (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (0.54)
Information Technology > Artificial Intelligence > Machine Learning (0.36)

Hierarchical Pointer Net Parsing

Liu, Linlin, Lin, Xiang, Joty, Shafiq, Han, Simeng, Bing, Lidong

arXiv.org Artificial Intelligence | Aug-30-2019

Transition-based top-down parsing with pointer networks has achieved state-of-the-art results in multiple parsing tasks, while having a linear time complexity. However, the decoder of these parsers has a sequential structure, which does not yield the most appropriate inductive bias for deriving tree structures. In this paper, we propose hierarchical pointer network parsers, and apply them to dependency and sentence-level discourse parsing tasks. Our results on standard benchmark datasets demonstrate the effectiveness of our approach, outperforming existing methods and setting a new state-of-the-art.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/D19-1093

1908.11571

Country:

Asia > Singapore (0.05)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > Maryland > Baltimore (0.04)
(8 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Future of distracted driving technology makes Edmonton pitch | CBC News

#artificialintelligence | Aug-25-2019, 15:16:28 GMT

An Australian-based technology firm that uses artificial intelligence to catch distracted drivers made a pitch to an Edmonton conference on Friday. Acusensus presented its automatic camera enforcement technology at the International Conference on Urban Traffic Safety. Founded in early 2018, the company made international headlines with a pilot program in Australia earlier this year. The Acusensus camera system is mounted on the side or above the road, like photo radar. But unlike photo radar, the system takes high-resolution pictures of every passing car.

artificial intelligence, data mining, jannink, (7 more...)

#artificialintelligence

Country: North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.40)

Industry:

Information Technology > Security & Privacy (1.00)
Transportation > Ground > Road (0.62)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence (0.92)
Information Technology > Data Science > Data Mining (0.42)
Information Technology > Communications > Web (0.42)
