AITopics

Sparse convex clustering is to cluster observations and conduct variable selection simultaneously in the framework of convex clustering. Although the weighted $L_1$ norm as the regularization term is usually employed in the sparse convex clustering, this increases the dependence on the data and reduces the estimation accuracy if the sample size is not sufficient. To tackle these problems, this paper proposes a Bayesian sparse convex clustering via the idea of Bayesian lasso and global-local shrinkage priors. We introduce Gibbs sampling algorithms for our method using scale mixtures of normals. The effectiveness of the proposed methods is shown in simulation studies and a real data analysis.

convex, exp null 1 2, sparse convex, (14 more...)

1911.08703

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Oceania > New Zealand (0.04)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Dasagi, Vibhavari, Lee, Robert, Bruce, Jake, Leitner, Jürgen

Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

Deep reinforcement learning has been shown to solve challenging tasks where large amounts of training experience is available, usually obtained online while learning the task. Robotics is a significant potential application domain for many of these algorithms, but generating robot experience in the real world is expensive, especially when each task requires a lengthy online training procedure. Off-policy algorithms can in principle learn arbitrary tasks from a diverse enough fixed dataset. In this work, we evaluate popular exploration methods by generating robotics datasets for the purpose of learning to solve tasks completely offline without any further interaction in the real world. We present results on three popular continuous control tasks in simulation, as well as continuous control of a high-dimensional real robot arm. Code documenting all algorithms, experiments, and hyper-parameters is available at https://github.com/qutrobotlearning/batchlearning.

algorithm, exploration, learning, (15 more...)

1911.08666

Country:

Oceania > Australia > Queensland > Brisbane (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Industry: Education > Educational Setting > Online (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Pang, Guansong, Shen, Chunhua, Hengel, Anton van den

Deep Anomaly Detection with Deviation Networks

Although deep learning has been applied to successfully address many data mining problems, relatively limited work has been done on deep learning for anomaly detection. Existing deep anomaly detection methods, which focus on learning new feature representations to enable downstream anomaly detection methods, perform indirect optimization of anomaly scores, leading to data-inefficient learning and suboptimal anomaly scoring. Also, they are typically designed as unsupervised learning due to the lack of large-scale labeled anomaly data. As a result, they are difficult to leverage prior knowledge (e.g., a few labeled anomalies) when such information is available as in many real-world anomaly detection applications. This paper introduces a novel anomaly detection framework and its instantiation to address these problems. Instead of representation learning, our method fulfills an end-to-end learning of anomaly scores by a neural deviation learning, in which we leverage a few (e.g., multiple to dozens) labeled anomalies and a prior probability to enforce statistically significant deviations of the anomaly scores of anomalies from that of normal data objects in the upper tail. Extensive results show that our method can be trained substantially more data-efficiently and achieves significantly better anomaly scoring than state-of-the-art competing methods.

anomaly, anomaly score, devnet, (15 more...)

1911.08623

Country:

Oceania > Australia > South Australia > Adelaide (0.04)
North America > United States > Alaska > Anchorage Municipality > Anchorage (0.04)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
(4 more...)

Genre: Research Report > New Finding (0.88)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Learning internal representations

Baxter, Jonathan

Probably the most important problem in machine learning is the preliminary biasing of a learner's hypothesis space so that it is small enough to ensure good generalisation from reasonable training sets, yet large enough that it contains a good solution to the problem being learnt. In this paper a mechanism for {\em automatically} learning or biasing the learner's hypothesis space is introduced. It works by first learning an appropriate {\em internal representation} for a learning environment and then using that representation to bias the learner's hypothesis space for the learning of future tasks drawn from the same environment. An internal representation must be learnt by sampling from {\em many similar tasks}, not just a single task as occurs in ordinary machine learning. It is proved that the number of examples $m$ {\em per task} required to ensure good generalisation from a representation learner obeys $m = O(a+b/n)$ where $n$ is the number of tasks being learnt and $a$ and $b$ are constants. If the tasks are learnt independently ({\em i.e.} without a common representation) then $m=O(a+b)$. It is argued that for learning environments such as speech and character recognition $b\gg a$ and hence representation learning in these environments can potentially yield a drastic reduction in the number of examples required per task. It is also proved that if $n = O(b)$ (with $m=O(a+b/n)$) then the representation learnt will be good for learning novel tasks from the same environment, and that the number of examples required to generalise well on a novel task will be reduced to $O(a)$ (as opposed to $O(a+b)$ if no representation is used). It is shown that gradient descent can be used to train neural network representations and experiment results are reported providing strong qualitative support for the theoretical results.

generalisation, learner, representation, (16 more...)

doi: 10.1145/225298.225336

1911.05781

Country:

Oceania > Australia > South Australia (0.14)
North America > United States > New York (0.04)

Genre: Research Report (0.50)

Industry: Education (0.86)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

#artificialintelligenceNov-18-2019, 16:38:35 GMT

Artificial Intelligence in Education System Market 2019: Popular Trends, Growth, Rising Demand & Progressive Technologies To Watch Out For Near Future - Sound On Sound Fest

The statistical study, the report outlines the Global Artificial Intelligence in Education System Industry including production, cost/profit, supply-demand, and import-export. The total market is further bifurcated into a company, by country, and by various segmentation for the competitive landscape study.

artificial intelligence, global artificial intelligence, intelligence, (11 more...)

Country:

North America > Central America (0.15)
North America > United States > New York > Richmond County > New York City (0.06)
North America > United States > New York > Queens County > New York City (0.06)
(27 more...)

Industry: Education (0.95)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.30)
Information Technology > Artificial Intelligence > Natural Language (0.30)

#artificialintelligenceNov-18-2019, 16:37:26 GMT

Global Military Artificial Intelligence (AI) and Cybernetics Market: Focus on Platform, Technology, Application and Services - Analysis and Forecast, 2019-2024

Key Questions Answered in this Report: • What are the trends in the global military artificial intelligence and cybernetics across different regions? Global Military Artificial Intelligence Market Forecast, 2019-2024 The Global Military Artificial Intelligence Market report projects the market to grow at a significant CAGR of 18.66% on the basis of value during the forecast period from 2019 to 2024. North America dominated the global military artificial intelligence market with a share of 48.23% in 2019. North America, including the major countries such as the U.S., is the most prominent region for the military artificial intelligence market. In North America, the U.S. acquired a major market share in 2019 due to the major deployment of counter measures in defense sector in the country.

artificial intelligence and cybernetic market, global military artificial intelligence market, military artificial intelligence market, (6 more...)

Country:

South America (0.05)
Oceania > Australia (0.05)
North America > United States > New York (0.05)
(12 more...)

Genre: Press Release (0.87)

Industry:

Aerospace & Defense (1.00)
Banking & Finance > Trading (0.92)
Government > Military (0.75)

Technology: Information Technology > Artificial Intelligence (1.00)

#artificialintelligenceNov-18-2019, 14:55:28 GMT

What happens when a bot writes your blog posts

What did you choose to do as a writer, then? I was very naive when it comes to writing a series. I had no idea what was going to happen. I wanted it to be a lighthearted, realistic tale and I also wanted it to have a sense of drama, emotion, and suspense. I had no idea what I should do with the main characters in the first place, but I knew I had to make it a lighthearted, realistic story that would have the main characters struggling to find their happiness and love.

bot write, jell-o, memoir, (7 more...)

Country: Oceania > Australia > New South Wales > Sydney (0.05)

Genre: Personal (0.36)

Technology:

Information Technology > Communications > Social Media (0.86)
Information Technology > Artificial Intelligence (0.79)

#artificialintelligenceNov-18-2019, 11:26:48 GMT

Is it right to use AI to identify children at risk of harm?

Technology has advanced enormously in the 30 years since the introduction of the first Children Act, which shaped the UK's system of child safeguarding. Today a computer-generated analysis – "machine learning" that produces predictive analytics – can help social workers assess the probability of a child coming on to the at-risk register. It can also help show how they might prevent that happening. But with technological advances come dilemmas unimaginable back in 1989. Is it right for social workers to use computers to help promote the welfare of children in need?

identify child, predictive analysis, social worker, (17 more...)

Country:

Oceania > New Zealand (0.05)
Europe > United Kingdom > England > Greater London > London > Barking and Dagenham (0.05)

Industry:

Government > Social Services (0.76)
Information Technology > Security & Privacy (0.50)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Data Science > Data Mining (0.73)

Helfrich, Kyle, Ye, Qiang

Eigenvalue Normalized Recurrent Neural Networks for Short Term Memory

arXiv.org Machine LearningNov-18-2019

The underlying dynamical system carries temporal information from one time step to another and captures potential dependencies among the terms of a sequence. Like other deep neural networks, the weights of an RNN are learned by gradient descent. For the input at a time step to affect the output at a later time step, the gradients must back-propagate through each step. Since a sequence can be quite long, RNNs are prone to suffer from vanishing or exploding gradients as described in (Bengio, Frasconi, and Simard 1993) and (Pas-canu, Mikolov, and Bengio 2013). One consequence of this well-known problem is the difficulty of the network to model input-output dependency over a large number of time steps. There have been many different architectures that are designed to mitigate this problem. The most popular RNN architectures such as LSTMs (Hochreiter and Schmidhu-ber 1997) and GRUs (Cho et al. 2014), incorporate a gating mechanism to explicitly retain or discard information.

eigenvalue, matrix, sequence, (16 more...)

1911.07964

Country:

North America > United States > Kentucky > Fayette County > Lexington (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Subramanian, Shivashankar, Baldini, Ioana, Ravichandran, Sushma, Katz-Rogozhnikov, Dmitriy A., Ramamurthy, Karthikeyan Natesan, Sattigeri, Prasanna, Varshney, Kush R., Wang, Annmarie, Mangalath, Pradeep, Kleiman, Laura B.

Drug Repurposing for Cancer: An NLP Approach to Identify Low-Cost Therapies

arXiv.org Machine LearningNov-18-2019

More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of generic drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs worldwide. Evidence on the efficacy of non-cancer generic drugs being tested for cancer exists in scientific publications, but trying to manually identify and extract such evidence is intractable. In this paper, we introduce a system to automate this evidence extraction from PubMed abstracts. Our primary contribution is to define the natural language processing pipeline required to obtain such evidence, comprising the following modules: querying, filtering, cancer type entity extraction, therapeutic association classification, and study type classification. Using the subject matter expertise on our team, we create our own datasets for these specialized domain-specific tasks. We obtain promising performance in each of the modules by utilizing modern language modeling techniques and plan to treat them as baseline approaches for future improvement of individual components.

cancer, classification, generic drug, (12 more...)

1911.07819

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > Canada (0.04)

Genre: Research Report > Experimental Study (0.72)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)