AITopics

1910.07629

Country:

North America > Canada > British Columbia > Vancouver (0.05)
North America > United States > Texas > Dallas County > Dallas (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
(8 more...)

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Data Science > Data Mining (0.93)
(4 more...)

arXiv.org Machine LearningOct-16-2019

Migration through Machine Learning Lens -- Predicting Sexual and Reproductive Health Vulnerability of Young Migrants

Nigam, Amber, Jaiswal, Pragati, Girkar, Uma, Arora, Teertha, Celi, Leo A.

In this paper, we have discussed initial findings and results of our experiment to predict sexual and reproductive health vulnerabilities of migrants in a data-constrained environment. Notwithstanding the limited research and data about migrants and migration cities, we propose a solution that simultaneously focuses on data gathering from migrants, augmenting awareness of the migrants to reduce mishaps, and setting up a mechanism to present insights to the key stakeholders in migration to act upon. We have designed a webapp for the stakeholders involved in migration: migrants, who would participate in data gathering process and can also use the app for getting to know safety and awareness tips based on analysis of the data received; public health workers, who would have an access to the database of migrants on the app; policy makers, who would have a greater understanding of the ground reality, and of the patterns of migration through machine-learned analysis. Finally, we have experimented with different machine learning models on an artificially curated dataset. We have shown, through experiments, how machine learning can assist in predicting the migrants at risk and can also help in identifying the critical factors that make migration dangerous for migrants. The results for identifying vulnerable migrants through machine learning algorithms are statistically significant at an alpha of 0.05.

dataset, migrant, vulnerability, (12 more...)

1910.0239

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.15)
Africa > Kenya > Nairobi City County > Nairobi (0.05)
North America > United States > Massachusetts > Suffolk County > Boston (0.05)
(3 more...)

Genre: Research Report (0.50)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Health & Medicine (1.00)
Government > Regional Government (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.31)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.31)

Zhao, Han, Coston, Amanda, Adel, Tameem, Gordon, Geoffrey J.

Conditional Learning of Fair Representations

arXiv.org Artificial IntelligenceOct-16-2019

We propose a novel algorithm for learning fair representations that can simultaneously mitigate two notions of disparity among different demographic subgroups. Two key components underpinning the design of our algorithm are balanced error rate and conditional alignment of representations. In settings that have historically had discrimination, we are interested in defining fairness with respect to a protected group, the group which has historically been disadvantaged. Among many recent attempts to achieve algorithmic fairness (Dwork et al., 2012; Hardt et al., 2016; Zemel et al., 2013; Zafar et al., 2015), learning fair representations has attracted increasing attention However, it has long been empirically observed (Calders et al., 2009) and recently been proved (Zhao Part of this work was done when Han Zhao was visiting the V ector Institute, Toronto. In this work, we provide an affirmative answer to the above question by proposing an algorithm to align the conditional distributions (on the target variable) of representations across different demographic subgroups.

equalized odds, parity, representation, (17 more...)

arXiv.org Artificial Intelligence

1910.07162

Country:

North America > Canada > Ontario > Toronto (0.24)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (1.00)

Industry:

Law (0.46)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

#artificialintelligenceOct-15-2019, 11:09:25 GMT

How I scored in the top 1% of Kaggle's Titanic Machine Learning Challenge

You don't need to reinvent the wheel, you need to know how to use the wheel to make your car better. The Titanic challenge hosted by Kaggle is a competition in which the goal is to predict the survival or the death of a given passenger based on a set of variables describing him such as his age, his sex, or his passenger class on the boat. I have been playing with the Titanic dataset for a while. As I'm writing this post, I am ranked 113th out of 11002 participants. You must be wondering how did I manage to achieve this.

dataset, engineering, titanic machine learning challenge, (13 more...)

Industry:

Transportation > Passenger (0.50)
Education (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.31)

#artificialintelligenceOct-15-2019, 11:09:25 GMT

How I scored in the top 1% of Kaggle's Titanic Machine Learning Challenge

dataset, engineering, titanic machine learning challenge, (13 more...)

Industry:

Transportation > Passenger (0.50)
Education (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.31)

#artificialintelligenceOct-14-2019, 21:15:18 GMT

Using AI, Genes and Game Theory on Antimicrobial Resistance

Antimicrobial resistance (AMR) is the ability of microorganisms like bacteria, viruses, fungi and certain parasites to resist drugs such as antibiotics, antifungals, and antivirals from destroying it. AMR is a worldwide public health threat that is projected to rise. Globally, by 2050, over 10 million deaths per year will be due to antimicrobial resistance according to projections from a report by Wellcome Trust and the UK government. For antibiotic resistance alone, each year over two million people in the U.S. are affected, and 23,000 die, according to figures from the U.S. Centers for Disease Control and Prevention (CDC). Researchers at Washington State University have combined game theory with artificial intelligence (AI) to create a tool that can identify genes that are antibiotic-resistant in bacteria, and published their study in Scientific Reports on October 9, 2019.

antimicrobial resistance, resistance, sequence, (10 more...)

Country:

North America > United States > Washington (0.28)
North America > United States > Virginia (0.05)
North America > United States > California > San Diego County > San Diego (0.05)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Public Health (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Government > Regional Government > North America Government > United States Government (0.56)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.31)

#artificialintelligenceOct-14-2019, 17:42:52 GMT

Proper Balancing for Cross Validation

Here we plot the precision results of balancing, with under-sampling, only the train set of each CV fold before fitting the model on it and making predictions on the CV fold's test set: Here we plot the precision results of balancing, with over-sampling, only the train set of each CV fold before fitting the model on it and making predictions on the CV fold's test set: It is clear, that balancing so far did not help in getting good test results. However, this is out of scope for this article (:-)) and the goal of this article is achieved: To make the model produce, on each CV fold's test set, evaluation metric scores similar to those that it would produce on an unknown one, for the case that the train data are balanced.

cv fold, precision result, proper balancing, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Cross Validation (0.40)

#artificialintelligenceOct-14-2019, 07:25:31 GMT

RPA: Strengthen and Simplify Your Cyber Security Operations

Robotic process automation (RPA) uses machine learning (ML) and artificial intelligence (AI) to create a virtual workforce, able to handle repeatable tasks that require a human worker to perform. By using an RPA, companies can perform repetitive tasks faster, longer and with a reduced error rate allowing the workforce to focus on essential duties and responsibilities. In other words, companies have employees working like robots, performing jobs without thinking, why not have robots behaving like people for these tasks. Cybersecurity personnel and cybercriminals are in a constant state of war, automation and specifically RPA can help protect against malicious cyber intruders. Identification and prevention of zero-day attacks (an attack on an exploit the same day of its discovery) and elimination of any system weaknesses is the end goal of internal security teams.

cyber security operation, security team, strengthen and simplify, (10 more...)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.62)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.39)

Lim, Jen Ning, Yamada, Makoto, Jitkrittum, Wittawat, Terada, Yoshikazu, Matsui, Shigeyuki, Shimodaira, Hidetoshi

More Powerful Selective Kernel Tests for Feature Selection

arXiv.org Machine LearningOct-14-2019

Refining one's hypotheses in the light of data is a commonplace scientific practice, however, this approach introduces selection bias and can lead to specious statistical analysis. One approach of addressing this phenomena is via conditioning on the selection procedure, i.e., how we have used the data to generate our hypotheses, and prevents information to be used again after selection. Many selective inference (a.k.a. post-selection inference) algorithms typically take this approach but will "over-condition" for sake of tractability. While this practice obtains well calibrated $p$-values, it can incur a major loss in power. In our work, we extend two recent proposals for selecting features using the Maximum Mean Discrepancy and Hilbert Schmidt Independence Criterion to condition on the minimal conditioning event. We show how recent advances in multiscale bootstrap makes conditioning on the minimal selection event possible and demonstrate our proposal over a range of synthetic and real world experiments. Our results show that our proposed test is indeed more powerful in most scenarios.

estimator, hsic inc, multiscale bootstrap, (15 more...)

1910.06134

Country:

North America > United States (0.28)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)
Europe > Spain > Canary Islands (0.04)
(2 more...)

Genre: Research Report > New Finding (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Stutz, David, Hein, Matthias, Schiele, Bernt

Confidence-Calibrated Adversarial Training: Towards Robust Models Generalizing Beyond the Attack Used During Training

arXiv.org Machine LearningOct-14-2019

Adversarial training is the standard to train models robust against adversarial examples. However, especially for complex datasets, adversarial training incurs a significant loss in accuracy and is known to generalize poorly to stronger attacks, e.g., larger perturbations or other threat models. In this paper, we introduce confidence-calibrated adversarial training (CCAT) where the key idea is to enforce that the confidence on adversarial examples decays with their distance to the attacked examples. We show that CCAT preserves better the accuracy of normal training while robustness against adversarial examples is achieved via confidence thresholding. Most importantly, in strong contrast to adversarial training, the robustness of CCAT generalizes to larger perturbations and other threat models, not encountered during training. We also discuss our extensive work to design strong adaptive attacks against CCAT and standard adversarial training which is of independent interest. We present experimental results on MNIST, SVHN and Cifar10.

adversarial example, adversarial training, cifar10, (15 more...)

1910.06259

Country:

Europe > Germany > Saarland (0.04)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)

Genre: Research Report (0.51)

Industry: Information Technology > Security & Privacy (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)