AITopics | Overview

Collaborating Authors

Overview

A Survey on Neural Speech Synthesis

#artificialintelligenceJul-1-2021, 00:18:36 GMT

Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural speech given text, is a hot research topic in speech, language, and machine learning communities and has broad applications in the industry. As the development of deep learning and artificial intelligence, neural network-based TTS has significantly improved the quality of synthesized speech in recent years. In this paper, we conduct a comprehensive survey on neural TTS, aiming to provide a good understanding of current research and future trends. We focus on the key components in neural TTS, including text analysis, acoustic models and vocoders, and several advanced topics, including fast TTS, low-resource TTS, robust TTS, expressive TTS, and adaptive TTS, etc. We further summarize resources related to TTS (e.g., datasets, opensource implementations) and discuss future research directions.

neural speech synthesis, neural tts, tts

#artificialintelligence

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Synthesis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

A Review on Edge Analytics: Issues, Challenges, Opportunities, Promises, Future Directions, and Applications

Nayak, Sabuzima, Patgiri, Ripon, Waikhom, Lilapati, Ahmed, Arif

arXiv.org Artificial IntelligenceJul-1-2021

Edge technology aims to bring Cloud resources (specifically, the compute, storage, and network) to the closed proximity of the Edge devices, i.e., smart devices where the data are produced and consumed. Embedding computing and application in Edge devices lead to emerging of two new concepts in Edge technology, namely, Edge computing and Edge analytics. Edge analytics uses some techniques or algorithms to analyze the data generated by the Edge devices. With the emerging of Edge analytics, the Edge devices have become a complete set. Currently, Edge analytics is unable to provide full support for the execution of the analytic techniques. The Edge devices cannot execute advanced and sophisticated analytic algorithms following various constraints such as limited power supply, small memory size, limited resources, etc. This article aims to provide a detailed discussion on Edge analytics. A clear explanation to distinguish between the three concepts of Edge technology, namely, Edge devices, Edge computing, and Edge analytics, along with their issues. Furthermore, the article discusses the implementation of Edge analytics to solve many problems in various areas such as retail, agriculture, industry, and healthcare. In addition, the research papers of the state-of-the-art edge analytics are rigorously reviewed in this article to explore the existing issues, emerging challenges, research opportunities and their directions, and applications.

arXiv.org Artificial Intelligence

2107.06835

Country:

Asia (0.28)
North America > United States (0.14)
Europe (0.14)

Genre:

Research Report (1.00)
Overview (0.67)

Industry:

Information Technology > Security & Privacy (1.00)
Energy > Oil & Gas > Upstream (1.00)
Information Technology > Services (0.68)
Health & Medicine > Consumer Health (0.67)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Internet of Things (1.00)
Information Technology > Information Management (1.00)
(12 more...)

Add feedback

Well-calibrated prediction intervals for regression problems

Dewolf, Nicolas, De Baets, Bernard, Waegeman, Willem

arXiv.org Machine LearningJul-1-2021

Over the last few decades, various methods have been proposed for estimating prediction intervals in regression settings, including Bayesian methods, ensemble methods, direct interval estimation methods and conformal prediction methods. An important issue is the calibration of these methods: the generated prediction intervals should have a predefined coverage level, without being overly conservative. In this work, we review the above four classes of methods from a conceptual and experimental point of view. Results on benchmark data sets from various domains highlight large fluctuations in performance from one data set to another. These observations can be attributed to the violation of certain assumptions that are inherent to some classes of methods. We illustrate how conformal prediction can be used as a general calibration procedure for methods that deliver poor results without a calibration step.

estimator, international conference, prediction interval, (15 more...)

arXiv.org Machine Learning

2107.00363

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
(10 more...)

Genre:

Overview (0.87)
Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Improve clinical outcomes with AI-enabled healthcare applications

#artificialintelligenceJun-30-2021, 08:05:39 GMT

Escalating demands in healthcare are rapidly changing the way organizations treat patients, support caregivers and staff, and share information. Global trends like population growth, increasing life expectancy, and widespread need for telehealth services have caused organizations to work harder and stretch their resources between numerous locations. The pressure to meet these challenges is mounting, as rising volumes of patients depend on immediate and personalized care. These patients often require more doctor's visits, sophisticated treatments, and medications, as well as the use of specialized equipment and personal devices which produce troves of medical data. A single patient can generate 80 megabytes of imaging and electronic health record (EHR) data per year.

ai-enabled healthcare application, clinical outcome, medical data, (11 more...)

#artificialintelligence

Genre:

Research Report > Experimental Study (0.45)
Overview (0.36)

Industry: Health & Medicine > Health Care Technology > Telehealth (1.00)

Technology: Information Technology > Artificial Intelligence > Vision (0.31)

Add feedback

The Use of Bandit Algorithms in Intelligent Interactive Recommender Systems

Wang, Qing

arXiv.org Artificial IntelligenceJun-30-2021

This can be naturally modeled constantly explore innovative ways to provide optimal online as contextual bandit problems (e.g., LinUCB [18] and Thompson user experiences for gaining competitive advantages. The great sampling [7]), where each arm corresponds to an item, pulling an needs of developing intelligent interactive recommendation systems item indicates recommending an item, and the reward is the instant are indicated, which could sequentially suggest users the most feedback from a user after the recommendation. Contextual proper items by accurately predicting their preferences, while receiving bandit algorithms have been widely applied in various interactive the up-to-date feedback to refine the recommendation results, recommender systems by achieving an optimal tradeoff between continuosly. Multi-armed bandit algorithms, which have been exploration and exploitation. Based on the preliminary studies [15, widely applied into various online systems, are quite capable of 18, 1], several practical challenges are identified in modern recommender delivering such efficient recommendation services.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2107.00161

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report (1.00)
Overview (0.67)

Industry: Information Technology (0.93)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Applications of the Free Energy Principle to Machine Learning and Neuroscience

Millidge, Beren

arXiv.org Artificial IntelligenceJun-30-2021

In this thesis, we explore and apply methods inspired by the free energy principle to two important areas in machine learning and neuroscience. The free energy principle is a general mathematical theory of the necessary information-theoretic behaviours of systems which maintain a separation from their environment. A core postulate of the theory is that complex systems can be seen as performing variational Bayesian inference and minimizing an information-theoretic quantity called the variational free energy. The free energy principle originated in, and has been extremely influential in theoretical neuroscience, having spawned a number of neurophysiologically realistic process theories, and maintaining close links with Bayesian Brain viewpoints. The thesis is split into three main parts where we apply methods and insights from the free energy principle to understand questions first in perception, then action, and finally learning. Specifically, in the first section, we focus on the theory of predictive coding, a neurobiologically plausible process theory derived from the free energy principle under certain assumptions, which argues that the primary function of the brain is to minimize prediction errors. We focus on scaling up predictive coding architectures and simulate large-scale predictive coding networks for perception on machine learning benchmarks; we investigate predictive coding's relationship to other classical filtering algorithms, and we demonstrate that many biologically implausible aspects of current models of predictive coding can be relaxed without unduly harming the performance of predictive coding models which allows for a potentially more literal translation of predictive coding theory into cortical microcircuits. In the second part of the thesis, we focus on the application of methods deriving from the free energy principle to action. We study the extension of methods of'active inference', a neurobiologically grounded account of action through variational message passing, to utilize deep artificial neural networks, allowing these methods to'scale up' to be competitive with state of the art deep reinforcement learning methods.

biologically implausible one-to-one connectivity, crucial approximate bayesian inference lemma, non-symmetric forward and backward connectivity, (17 more...)

arXiv.org Artificial Intelligence

2107.0014

Country:

Asia > Middle East > Jordan (0.04)
Africa > Mali (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(6 more...)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (1.00)
Overview (1.00)
Instructional Material (0.92)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (1.00)
(4 more...)

Add feedback

Stochastic Gradient Descent-Ascent and Consensus Optimization for Smooth Games: Convergence Analysis under Expected Co-coercivity

Loizou, Nicolas, Berard, Hugo, Gidel, Gauthier, Mitliagkas, Ioannis, Lacoste-Julien, Simon

arXiv.org Machine LearningJun-30-2021

Two of the most prominent algorithms for solving unconstrained smooth games are the classical stochastic gradient descent-ascent (SGDA) and the recently introduced stochastic consensus optimization (SCO) (Mescheder et al., 2017). SGDA is known to converge to a stationary point for specific classes of games, but current convergence analyses require a bounded variance assumption. SCO is used successfully for solving large-scale adversarial problems, but its convergence guarantees are limited to its deterministic variant. In this work, we introduce the expected co-coercivity condition, explain its benefits, and provide the first last-iterate convergence guarantees of SGDA and SCO under this condition for solving a class of stochastic variational inequality problems that are potentially non-monotone. We prove linear convergence of both methods to a neighborhood of the solution when they use constant step-size, and we propose insightful stepsize-switching rules to guarantee convergence to the exact solution. In addition, our convergence guarantees hold under the arbitrary sampling paradigm, and as such, we give insights into the complexity of minibatching.

assumption, convergence, proposition 3, (14 more...)

arXiv.org Machine Learning

2107.00052

Country:

Asia > Middle East > Jordan (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Russia (0.04)
Asia > Russia (0.04)

Genre:

Research Report (0.64)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)

Add feedback

A Search Engine for Scientific Publications: a Cybersecurity Case Study

Oliveira, Nuno, Sousa, Norberto, Praça, Isabel

arXiv.org Artificial IntelligenceJun-30-2021

Cybersecurity is a very challenging topic of research nowadays, as digitalization increases the interaction of people, software and services on the Internet by means of technology devices and networks connected to it. The field is broad and has a lot of unexplored ground under numerous disciplines such as management, psychology, and data science. Its large disciplinary spectrum and many significant research topics generate a considerable amount of information, making it hard for us to find what we are looking for when researching a particular subject. This work proposes a new search engine for scientific publications which combines both information retrieval and reading comprehension algorithms to extract answers from a collection of domain-specific documents. The proposed solution although being applied to the context of cybersecurity exhibited great generalization capabilities and can be easily adapted to perform under other distinct knowledge domains.

application, corpus, search engine, (12 more...)

arXiv.org Artificial Intelligence

2107.00082

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > Portugal > Porto > Porto (0.04)

Genre:

Overview (1.00)
Research Report (0.65)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.96)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Affective Image Content Analysis: Two Decades Review and New Perspectives

Zhao, Sicheng, Yao, Xingxu, Yang, Jufeng, Jia, Guoli, Ding, Guiguang, Chua, Tat-Seng, Schuller, Björn W., Keutzer, Kurt

arXiv.org Artificial IntelligenceJun-30-2021

Images can convey rich semantics and induce various emotions in viewers. Recently, with the rapid advancement of emotional intelligence and the explosive growth of visual data, extensive research efforts have been dedicated to affective image content analysis (AICA). In this survey, we will comprehensively review the development of AICA in the recent two decades, especially focusing on the state-of-the-art methods with respect to three main challenges -- the affective gap, perception subjectivity, and label noise and absence. We begin with an introduction to the key emotion representation models that have been widely employed in AICA and description of available datasets for performing evaluation with quantitative comparison of label noise and dataset bias. We then summarize and compare the representative approaches on (1) emotion feature extraction, including both handcrafted and deep features, (2) learning methods on dominant emotion recognition, personalized emotion prediction, emotion distribution learning, and learning from noisy data or few labels, and (3) AICA based applications. Finally, we discuss some challenges and promising research directions in the future, such as image content and context understanding, group emotion clustering, and viewer-image interaction.

dataset, emotion, recognition, (12 more...)

arXiv.org Artificial Intelligence

2106.16125

Country:

Asia > China > Tianjin Province > Tianjin (0.04)
Asia > Singapore > Central Region > Singapore (0.04)
Asia > China > Heilongjiang Province > Harbin (0.04)
(15 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.34)

Industry:

Media (0.92)
Education > Educational Setting > Higher Education (0.67)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.66)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
(4 more...)

Add feedback

The Threat of Offensive AI to Organizations

Mirsky, Yisroel, Demontis, Ambra, Kotak, Jaidip, Shankar, Ram, Gelei, Deng, Yang, Liu, Zhang, Xiangyu, Lee, Wenke, Elovici, Yuval, Biggio, Battista

arXiv.org Artificial IntelligenceJun-29-2021

AI has provided us with the ability to automate tasks, extract information from vast amounts of data, and synthesize media that is nearly indistinguishable from the real thing. However, positive tools can also be used for negative purposes. In particular, cyber adversaries can use AI (such as machine learning) to enhance their attacks and expand their campaigns. Although offensive AI has been discussed in the past, there is a need to analyze and understand the threat in the context of organizations. For example, how does an AI-capable adversary impact the cyber kill chain? Does AI benefit the attacker more than the defender? What are the most significant AI threats facing organizations today and what will be their impact on the future? In this survey, we explore the threat of offensive AI on organizations. First, we present the background and discuss how AI changes the adversary's methods, strategies, goals, and overall attack model. Then, through a literature review, we identify 33 offensive AI capabilities which adversaries can use to enhance their attacks. Finally, through a user study spanning industry and academia, we rank the AI threats and provide insights on the adversaries.

adversary, international conference, publication date, (14 more...)

arXiv.org Artificial Intelligence

2106.15764

Country:

Europe > Italy > Sardinia > Cagliari (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > Singapore (0.04)
(20 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Overview (1.00)
Research Report > New Finding (0.67)

Industry:

Law Enforcement & Public Safety (1.00)
Law (1.00)
Information Technology > Security & Privacy (1.00)
(2 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Networks (1.00)
(6 more...)

Add feedback