South America
Learned Weight Sharing for Deep Multi-Task Learning by Natural Evolution Strategy and Stochastic Gradient Descent
Prellberg, Jonas, Kramer, Oliver
In deep multi-task learning, weights of task-specific networks are shared between tasks to improve performance on each single one. Since the question, which weights to share between layers, is difficult to answer, human-designed architectures often share everything but a last task-specific layer. In many cases, this simplistic approach severely limits performance. Instead, we propose an algorithm to learn the assignment between a shared set of weights and task-specific layers. To optimize the non-differentiable assignment and at the same time train the differentiable weights, learning takes place via a combination of natural evolution strategy and stochastic gradient descent. The end result are task-specific networks that share weights but allow independent inference. They achieve lower test errors than baselines and methods from literature on three multi-task learning datasets.
What is Normal, What is Strange, and What is Missing in a Knowledge Graph: Unified Characterization via Inductive Summarization
Belth, Caleb, Zheng, Xinyi, Vreeken, Jilles, Koutra, Danai
Knowledge graphs (KGs) store highly heterogeneous information about the world in the structure of a graph, and are useful for tasks such as question answering and reasoning. However, they often contain errors and are missing information. Vibrant research in KG refinement has worked to resolve these issues, tailoring techniques to either detect specific types of errors or complete a KG. In this work, we introduce a unified solution to KG characterization by formulating the problem as unsupervised KG summarization with a set of inductive, soft rules, which describe what is normal in a KG, and thus can be used to identify what is abnormal, whether it be strange or missing. Unlike first-order logic rules, our rules are labeled, rooted graphs, i.e., patterns that describe the expected neighborhood around a (seen or unseen) node, based on its type, and information in the KG. Stepping away from the traditional support/confidence-based rule mining techniques, we propose KGist, Knowledge Graph Inductive SummarizaTion, which learns a summary of inductive rules that best compress the KG according to the Minimum Description Length principle---a formulation that we are the first to use in the context of KG rule mining. We apply our rules to three large KGs (NELL, DBpedia, and Yago), and tasks such as compression, various types of error detection, and identification of incomplete information. We show that KGist outperforms task-specific, supervised and unsupervised baselines in error detection and incompleteness identification, (identifying the location of up to 93% of missing entities---over 10% more than baselines), while also being efficient for large knowledge graphs.
Composite Monte Carlo Decision Making under High Uncertainty of Novel Coronavirus Epidemic Using Hybridized Deep Learning and Fuzzy Rule Induction
Fong, Simon James, Li, Gloria, Dey, Nilanjan, Crespo, Ruben Gonzalez, Herrera-Viedma, Enrique
In the advent of the novel coronavirus epidemic since December 2019, governments and authorities have been struggling to make critical decisions under high uncertainty at their best efforts. Composite Monte-Carlo (CMC) simulation is a forecasting method which extrapolates available data which are broken down from multiple correlated/casual micro-data sources into many possible future outcomes by drawing random samples from some probability distributions. For instance, the overall trend and propagation of the infested cases in China are influenced by the temporal-spatial data of the nearby cities around the Wuhan city (where the virus is originated from), in terms of the population density, travel mobility, medical resources such as hospital beds and the timeliness of quarantine control in each city etc. Hence a CMC is reliable only up to the closeness of the underlying statistical distribution of a CMC, that is supposed to represent the behaviour of the future events, and the correctness of the composite data relationships. In this paper, a case study of using CMC that is enhanced by deep learning network and fuzzy rule induction for gaining better stochastic insights about the epidemic development is experimented. Instead of applying simplistic and uniform assumptions for a MC which is a common practice, a deep learning-based CMC is used in conjunction of fuzzy rule induction techniques. As a result, decision makers are benefited from a better fitted MC outputs complemented by min-max rules that foretell about the extreme ranges of future possibilities with respect to the epidemic.
5G Commercialization and Trials in Korea
Since Korea has a limited ICT R&D fund compared to other IT global countries, its strategy was essential to achieve its global competence in each generation of mobile communication. Just after the rollout of the world's first 5G service, the government took the next step by announcing the 5G strategy to promote the 5G application to a wide-ranging industry and create a sustainable 5G ecosystem leading to new growth engines. In this article, we focus on the government-industry 5G collaborations, including the R&D roadmap and promotion to the 5G commercialization, the global collaboration, the first 5G experience, and 5G vertical trials to make the 5G-enabled industrial transformation take place in Korea. The development of an electronic digital switching system called TDX in the 1980s, the world's first CDMA mobile service in the 1990s, and the nationwide wired and mobile broad Internet networks in the 2000s are the key advances that made it possible for Korean consumers to easily adopt new technologies such as LTE and 5G. In 2018, the handset penetration rate of South Korea was similar to western Europe, where LTE adaption was 84% with 99.95% coverage and 65Mbps downlink capacity.4
The NII Shonan Meeting in Japan
Japan's National Institute of Informatics (NII) launched its inaugural NII Shonan Meeting in February 2011. It was the first international conference of informatics in Asia, following in the style of the Dagstuhl seminars in Germany, designed to bring together the world's leading researchers and engineers to discuss open problems and challenges. More than 140 meetings have been held since then, and the number of participants totaled approximately 3,500 by November 2019. NII supports all the administrative arrangements for organizers and covers approximately half the fee for every academic participant (including room, board, and meeting fees). Sometimes, we also have summer/winter schools.
Gartner Says Strongest Demand for AI Talent Comes from Non-IT Departments
"High demand and tight labor markets have made candidates with AI skills highly competitive, but hiring techniques and strategies have not kept up," said Peter Krensky, research director at Gartner. "In the recent Gartner AI and Machine Learning Development Strategies Study, respondents ranked "skills of staff" as the No. 1 challenge or barrier to the adoption of AI and machine learning (ML)." Departments recruiting AI talent in high volumes include marketing, sales, customer service, finance, and research and development. These business units are using AI talent for customer churn modeling, customer profitability analysis, customer segmentation, cross-sell and upsell recommendations, demand planning, and risk management. A significant portion of AI use cases are reported from asset-centric industries supporting projects such as predictive maintenance, workflow and production optimization, quality control and supply chain optimization.
Deep Synthetic Minority Over-Sampling Technique
Mansourifar, Hadi, Shi, Weidong
Synthetic Minority Over-sampling Technique (SMOTE) is the most popular over-sampling method. However, its random nature makes the synthesized data and even imbalanced classification results unstable. It means that in case of running SMOTE n different times, n different synthesized in-stances are obtained with n different classification results. To address this problem, we adapt the SMOTE idea in deep learning architecture. In this method, a deep neural network regression model is used to train the inputs and outputs of traditional SMOTE. Inputs of the proposed deep regression model are two randomly chosen data points which are concatenated to form a double size vector. The outputs of this model are corresponding randomly interpolated data points between two randomly chosen vectors with original dimension. The experimental results show that, Deep SMOTE can outperform traditional SMOTE in terms of precision, F1 score and Area Under Curve (AUC) in majority of test cases.
A semi-supervised sparse K-Means algorithm
Vouros, Avgoustinos, Vasilaki, Eleni
We consider the problem of data clustering with unidentified feature quality but the existence of small amount of label data. In the first case a sparse clustering method can be employed in order to detect the subgroup of features necessary for clustering and in the second case a semi-supervised method can use the labelled data to create constraints and enhance the clustering solution. In this paper we propose a K-Means inspired algorithm that employs these techniques. We show that the algorithm maintains the high performance of other similar semi-supervised algorthms as well as keeping the ability to identify informative from uninformative features. We examine the performance of the algorithm on real world data sets with unknown features quality as well as a real world data set with a known uninformative feature. We use a series of scenarios with different number and types of constraints.
Imagination-Augmented Deep Learning for Goal Recognition
Duhamel, Thibault, Maynard, Mariane, Kabanza, Froduald
Being able to infer the goal of people we observe, interact with, or read stories about is one of the hallmarks of human intelligence. A prominent idea in current goal-recognition research is to infer the likelihood of an agent's goal from the estimations of the costs of plans to the different goals the agent might have. Different approaches implement this idea by relying only on handcrafted symbolic representations. Their application to real-world settings is, however, quite limited, mainly because extracting rules for the factors that influence goal-oriented behaviors remains a complicated task. In this paper, we introduce a novel idea of using a symbolic planner to compute plan-cost insights, which augment a deep neural network with an imagination capability, leading to improved goal recognition accuracy in real and synthetic domains compared to a symbolic recognizer or a deep-learning goal recognizer alone.
Tactic Learning and Proving for the Coq Proof Assistant
Blaauwbroek, Lasse, Urban, Josef, Geuvers, Herman
We present a system that utilizes machine learning for tactic proof search in the Coq Proof Assistant. In a similar vein as the TacticToe project for HOL4, our system predicts appropriate tactics and finds proofs in the form of tactic scripts. To do this, it learns from previous tactic scripts and how they are applied to proof states. The performance of the system is evaluated on the Coq Standard Library. Currently, our predictor can identify the correct tactic to be applied to a proof state 23.4% of the time. Our proof searcher can fully automatically prove 39.3% of the lemmas. When combined with the CoqHammer system, the two systems together prove 56.7% of the library's lemmas.