AITopics | Overview

Collaborating Authors

Overview

A Selective Survey on Versatile Knowledge Distillation Paradigm for Neural Network Models

Ku, Jeong-Hoe, Oh, JiHun, Lee, YoungYoon, Pooniwala, Gaurav, Lee, SangJeong

arXiv.org Artificial IntelligenceNov-30-2020

This paper aims to provide a selective survey about knowledge distillation(KD) framework for researchers and practitioners to take advantage of it for developing new optimized models in the deep neural network field. To this end, we give a brief overview of knowledge distillation and some related works including learning using privileged information(LUPI) and generalized distillation(GD). Even though knowledge distillation based on the teacher-student architecture was initially devised as a model compression technique, it has found versatile applications over various frameworks. In this paper, we review the characteristics of knowledge distillation from the hypothesis that the three important ingredients of knowledge distillation are distilled knowledge and loss,teacher-student paradigm, and the distillation process. In addition, we survey the versatility of the knowledge distillation by studying its direct applications and its usage in combination with other deep learning paradigms. Finally we present some future works in knowledge distillation including explainable knowledge distillation where the analytical analysis of the performance gain is studied and the self-supervised learning which is a hot research topic in deep learning community.

distillation, knowledge distillation, student network, (10 more...)

arXiv.org Artificial Intelligence

2011.14554

Country:

North America > United States (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > South Korea > Seoul > Seoul (0.04)

Genre: Overview (1.00)

Industry: Education (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Persistent Reductions in Regularized Loss Minimization for Variable Selection

Jalali, Amin

arXiv.org Machine LearningNov-29-2020

In the context of regularized loss minimization with polyhedral gauges, we show that for a broad class of loss functions (possibly non-smooth and non-convex) and under a simple geometric condition on the input data it is possible to efficiently identify a subset of features which are guaranteed to have zero coefficients in all optimal solutions in all problems with loss functions from said class, before any iterative optimization has been performed for the original problem. This procedure is standalone, takes only the data as input, and does not require any calls to the loss function. Therefore, we term this procedure as a persistent reduction for the aforementioned class of regularized loss minimization problems. This reduction can be efficiently implemented via an extreme ray identification subroutine applied to a polyhedral cone formed from the datapoints. We employ an existing output-sensitive algorithm for extreme ray identification which makes our guarantee and algorithm applicable in ultra-high dimensional problems.

conv, extreme ray, setup 2, (14 more...)

arXiv.org Machine Learning

2011.14549

Country: Europe > Switzerland > Zürich > Zürich (0.04)

Genre:

Overview (0.67)
Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)

Add feedback

A Brief Introduction to Edge Computing and Deep Learning

#artificialintelligenceNov-28-2020, 22:24:47 GMT

Welcome to my first blog on topics in artificial intelligence! Here I will introduce the topic of edge computing, with context in deep learning applications. This blog is largely adapted from a survey paper written by Xiaofei Wang et al.: Convergence of Edge Computing and Deep Learning: A Comprehensive Survey. If you're interested in learning more about any topic covered here, there are plenty of examples, figures, and explanations in the full 35 page survery: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp & arnumber 8976180 Now, before we begin, I'd like to take a moment and motivate why edge computing and deep learning can be very powerful when combined: Deep learning is becoming an increasingly-capable practice in machine learning that allows computers to detect objects, recognize speech, translate languages, and make decisions. More problems in machine learning are solved with the advanced techniques that researchers discover by the day.

application, intelligence, learning, (11 more...)

#artificialintelligence

Country:

North America > United States > New York > Richmond County > New York City (0.04)
North America > United States > California (0.04)

Genre: Overview (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

AI Weekly: The state of machine learning in 2020

#artificialintelligenceNov-28-2020, 08:20:07 GMT

It's hard to believe, but a year in which the unprecedented seemed to happen every day is just weeks from being over. In AI circles, the end of the calendar year means the rollout of annual reports aimed at defining progress, impact, and areas for improvement. The AI Index is due out in the coming weeks, as is CB Insights' assessment of global AI startup activity, but two reports -- both called The State of AI -- have already been released. Last week, McKinsey released its global survey on the state of AI, a report now in its third year. Interviews with executives and a survey of business respondents found a potential widening of the gap between businesses that apply AI and those that do not.

deployment, mckinsey, respondent, (12 more...)

#artificialintelligence

Genre:

Questionnaire & Opinion Survey (0.72)
Overview (0.52)

Industry: Government (0.32)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.55)

Add feedback

MetaGater: Fast Learning of Conditional Channel Gated Networks via Federated Meta-Learning

Lin, Sen, Yang, Li, He, Zhezhi, Fan, Deliang, Zhang, Junshan

arXiv.org Artificial IntelligenceNov-28-2020

While deep learning has achieved phenomenal successes in many AI applications, its enormous model size and intensive computation requirements pose a formidable challenge to the deployment in resource-limited nodes. There has recently been an increasing interest in computationally-efficient learning methods, e.g., quantization, pruning and channel gating. However, most existing techniques cannot adapt to different tasks quickly. In this work, we advocate a holistic approach to jointly train the backbone network and the channel gating which enables dynamical selection of a subset of filters for more efficient local computation given the data input. Particularly, we develop a federated meta-learning approach to jointly learn good meta-initializations for both backbone networks and gating modules, by making use of the model similarity across learning tasks on different nodes. In this way, the learnt meta-gating module effectively captures the important filters of a good meta-backbone network, based on which a task-specific conditional channel gated network can be quickly adapted, i.e., through one-step gradient descent, from the meta-initializations in a two-stage procedure using new samples of that task. The convergence of the proposed federated meta-learning algorithm is established under mild conditions. Experimental results corroborate the effectiveness of our method in comparison to related work.

channel gated network, metagater, module, (12 more...)

arXiv.org Artificial Intelligence

2011.12511

Country:

North America > United States > Virginia (0.04)
North America > United States > Arizona (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre:

Research Report (0.64)
Overview (0.46)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

A Survey on Data Pricing: from Economics to Data Science

Pei, Jian

arXiv.org Artificial IntelligenceNov-27-2020

How can we assess the value of data objectively, systematically and quantitatively? Pricing data, or information goods in general, has been studied and practiced in dispersed areas and principles, such as economics, marketing, electronic commerce, data management, data mining and machine learning. In this article, we present a unified, interdisciplinary and comprehensive overview of this important direction. We examine various motivations behind data pricing, understand the economics of data pricing and review the development and evolution of pricing models according to a series of fundamental principles. We discuss both digital products and data products. We also consider a series of challenges and directions for future work.

auction, pricing, proceedings, (15 more...)

arXiv.org Artificial Intelligence

2009.04462

Country:

North America > United States > New Jersey > Mercer County > Princeton (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(15 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Media (1.00)
Marketing (1.00)
Information Technology > Security & Privacy (1.00)
(4 more...)

Technology:

Information Technology > e-Commerce (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Information Management (1.00)
(6 more...)

Add feedback

When Machine Learning Meets Privacy: A Survey and Outlook

#artificialintelligenceNov-26-2020, 11:13:46 GMT

The newly emerged machine learning (e.g. Meanwhile, privacy has emerged as a big concern in this machine learning-based artificial intelligence era. It is important to note that the problem of privacy preservation in the context of machine learning is quite different from that in traditional data privacy protection, as machine learning can act as both friend and foe. Currently, the work on the preservation of privacy and machine learning (ML) is still in an infancy stage, as most existing solutions only focus on privacy problems during the machine learning process. Therefore, a comprehensive study on the privacy preservation problems and machine learning is required. This paper surveys the state of the art in privacy issues and solutions for machine learning.

machine learning meet privacy, privacy and machine, survey and outlook, (3 more...)

#artificialintelligence

Genre: Overview (0.51)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Meta-learning in natural and artificial intelligence

Wang, Jane X.

arXiv.org Artificial IntelligenceNov-26-2020

Humans are remarkable for continuously learning throughout the entirety of their lives, from acquiring physical reasoning and language skills at a young age [64, 43], to the ability to reason about the detailed complexities inherent in everyday adult life. One key quality of this learning is that it happens at multiple scales, both in terms of time and abstraction, in a process termed meta-learning or learning to learn. The fundamental principle of meta-learning is that learning proceeds faster with more experience, via the acquisition of inductive biases or knowledge that allows for more efficient learning in the future [66, 59, 57]. These favorable properties of meta-learning have recently gained it considerable renewed interest within the deep learning/artificial intelligence community. Despite their tremendous successes in recent years [46, 61], deep learning systems still require many orders of magnitude of data than humans [40, 12]. Although early work demonstrated the feasibility for neural networks to discover their own learning rules [10, 58], it was only recently that the field has experienced a resurgence of new research in meta-learning using deep neural networks. This has demonstrated the wide-ranging potential of neural networks to meta-learn all aspects of the learning process. Deep neural networks are typically trained via backpropagation, which adjusts the weights of the neural network so that given a set of input data, the network outputs match some desired target outputs (e.g., classification labels).

learning, neural network, neuroscience, (12 more...)

arXiv.org Artificial Intelligence

2011.13464

Country:

North America > United States > District of Columbia > Washington (0.04)
Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)

Genre:

Research Report (0.50)
Overview (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Achievements and Challenges in Explaining Deep Learning based Computer-Aided Diagnosis Systems

Lucieri, Adriano, Bajwa, Muhammad Naseer, Dengel, Andreas, Ahmed, Sheraz

arXiv.org Artificial IntelligenceNov-26-2020

Remarkable success of modern image-based AI methods and the resulting interest in their applications in critical decision-making processes has led to a surge in efforts to make such intelligent systems transparent and explainable. The need for explainable AI does not stem only from ethical and moral grounds but also from stricter legislation around the world mandating clear and justifiable explanations of any decision taken or assisted by AI. Especially in the medical context where Computer-Aided Diagnosis can have a direct influence on the treatment and well-being of patients, transparency is of utmost importance for safe transition from lab research to real world clinical practice. This paper provides a comprehensive overview of current state-of-the-art in explaining and interpreting Deep Learning based algorithms in applications of medical research and diagnosis of diseases. We discuss early achievements in development of explainable AI for validation of known disease criteria, exploration of new potential biomarkers, as well as methods for the subsequent correction of AI models. Various explanation methods like visual, textual, post-hoc, ante-hoc, local and global have been thoroughly and critically analyzed. Subsequently, we also highlight some of the remaining challenges that stand in the way of practical applications of AI as a clinical decision support tool and provide recommendations for the direction of future research.

algorithm, arxiv preprint arxiv, explanation, (14 more...)

arXiv.org Artificial Intelligence

2011.13169

Country:

Europe > Germany > Rhineland-Palatinate > Kaiserslautern (0.05)
Europe > Germany > Lower Saxony > Gottingen (0.04)
Europe > France (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Law (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Dermatology (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Add feedback

Modular Structures and Atomic Decomposition in Ontologies

Journal of Artificial Intelligence ResearchNov-25-2020

With the growth of ontologies used in diverse application areas, the need for module extraction and modularisation techniques has risen. The notion of the modular structure of an ontology, which comprises a suitable set of base modules together with their logical dependencies, has the potential to help users and developers in comprehending, sharing, and maintaining an ontology. We have developed a new modular structure, called atomic decomposition (AD), which is based on modules that provide strong logical properties, such as locality-based modules. In this article, we present the theoretical foundations of AD, review its logical and computational properties, discuss its suitability as a modular structure, and report on an experimental evaluation of AD. In addition, we discuss the concept of a modular structure in ontology engineering and provide a survey of existing decomposition approaches.

axiom, module, ontology, (11 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.12151

AI Access Foundation

12151

Journal of Artificial Intelligence Research

Country:

Europe > Germany > Bremen > Bremen (0.27)
Oceania > Australia (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre:

Overview (0.67)
Research Report (0.45)

Industry: Health & Medicine > Therapeutic Area (0.67)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

Add feedback