South America
Deep Networks from the Principle of Rate Reduction
Chan, Kwan Ho Ryan, Yu, Yaodong, You, Chong, Qi, Haozhi, Wright, John, Ma, Yi
This work attempts to interpret modern deep (convolutional) networks from the principles of rate reduction and (shift) invariant classification. We show that the basic iterative gradient ascent scheme for optimizing the rate reduction of learned features naturally leads to a multi-layer deep network, one iteration per layer. The layered architectures, linear and nonlinear operators, and even parameters of the network are all explicitly constructed layer-by-layer in a forward propagation fashion by emulating the gradient scheme. All components of this "white box" network have precise optimization, statistical, and geometric interpretation. This principled framework also reveals and justifies the role of multi-channel lifting and sparse coding in early stage of deep networks. Moreover, all linear operators of the so-derived network naturally become multi-channel convolutions when we enforce classification to be rigorously shift-invariant. The derivation also indicates that such a convolutional network is significantly more efficient to construct and learn in the spectral domain. Our preliminary simulations and experiments indicate that so constructed deep network can already learn a good discriminative representation even without any back propagation training. In recent years, various deep (convolution) network architectures such as AlexNet (Krizhevsky et al., 2012), VGG (Simonyan & Zisserman, 2015), ResNet (He et al., 2016), DenseNet (Huang et al., 2017), Recurrent CNN, LSTM (Hochreiter & Schmidhuber, 1997), Capsule Networks (Hinton et al., 2011), etc., have demonstrated very good performance in classification tasks of real-world datasets such as speeches or images. Nevertheless, almost all such networks are developed through years of empirical trial and error, including both their architectures/operators and the ways they are to be effectively trained. Some recent practices even take to the extreme by searching for effective network structures and training strategies through extensive random search techniques, such as Neural Architecture Search (Zoph & Le, 2017; Baker et al., 2017), AutoML (Hutter et al., 2019), and Learning to Learn (Andrychowicz et al., 2016).
Contrastive Representation Learning: A Framework and Review
Le-Khac, Phuc H., Healy, Graham, Smeaton, Alan F.
Contrastive Learning has recently received interest due to its success in self-supervised representation learning in the computer vision domain. However, the origins of Contrastive Learning date as far back as the 1990s and its development has spanned across many fields and domains including Metric Learning and natural language processing. In this paper we provide a comprehensive literature review and we propose a general Contrastive Representation Learning framework that simplifies and unifies many different contrastive learning methods. We also provide a taxonomy for each of the components of contrastive learning in order to summarise it and distinguish it from other forms of machine learning. We then discuss the inductive biases which are present in any contrastive learning system and we analyse our framework under different views from various sub-fields of Machine Learning. Examples of how contrastive learning has been applied in computer vision, natural language processing, audio processing, and others, as well as in Reinforcement Learning are also presented. Finally, we discuss the challenges and some of the most promising future research directions ahead.
Learning Contextualised Cross-lingual Word Embeddings for Extremely Low-Resource Languages Using Parallel Corpora
Wada, Takashi, Iwata, Tomoharu, Matsumoto, Yuji, Baldwin, Timothy, Lau, Jey Han
We propose a new approach for learning contextualised cross-lingual word embeddings based only on a small parallel corpus (e.g. a few hundred sentence pairs). Our method obtains word embeddings via an LSTM-based encoder-decoder model that performs bidirectional translation and reconstruction of the input sentence. Through sharing model parameters among different languages, our model jointly trains the word embeddings in a common multilingual space. We also propose a simple method to combine word and subword embeddings to make use of orthographic similarities across different languages. We base our experiments on real-world data from endangered languages, namely Yongning Na, Shipibo-Konibo and Griko. Our experiments on bilingual lexicon induction and word alignment tasks show that our model outperforms existing methods by a large margin for most language pairs. These results demonstrate that, contrary to common belief, an encoder-decoder translation model is beneficial for learning cross-lingual representations, even in extremely low-resource scenarios.
Conservative Safety Critics for Exploration
Bharadhwaj, Homanga, Kumar, Aviral, Rhinehart, Nicholas, Levine, Sergey, Shkurti, Florian, Garg, Animesh
Safe exploration presents a major challenge in reinforcement learning (RL): when active data collection requires deploying partially trained policies, we must ensure that these policies avoid catastrophically unsafe regions, while still enabling trial and error learning. In this paper, we target the problem of safe exploration in RL by learning a conservative safety estimate of environment states through a critic, and provably upper bound the likelihood of catastrophic failures at every training iteration. We theoretically characterize the tradeoff between safety and policy improvement, show that the safety constraints are likely to be satisfied with high probability during training, derive provable convergence guarantees for our approach, which is no worse asymptotically than standard RL, and demonstrate the efficacy of the proposed approach on a suite of challenging navigation, manipulation, and locomotion tasks. Empirically, we show that the proposed approach can achieve competitive task performance while incurring significantly lower catastrophic failure rates during training than prior methods. Videos are at this url https://sites.google.com/view/ Reinforcement learning (RL) is a powerful framework for learning-based control because it can enable agents to learn to make decisions automatically through trial and error. However, in the real world, the cost of those trials - and those errors - can be quite high: an aerial robot that attempts to fly at high speed might initially crash, and then be unable to attempt further trials due to extensive physical damage. However, learning complex skills without any failures at all is likely impossible.
Intel Powers First Satellite with AI on Board - Edge AI and Vision Alliance
As ubiquitous as artificial intelligence has become in modern life -- from boosting our understanding of the cosmos to surfacing entertaining videos on your phone -- AI hasn't yet found its way into orbit. That is until Sept. 2, when an experimental satellite about the size of a cereal box was ejected from a rocket's dispenser along with 45 other similarly small satellites. The satellite, named PhiSat-1, is now soaring at over 17,000 mph (27,500 kmh) in sun-synchronous orbit about 329 miles (530 km) overhead. PhiSat-1 contains a new hyperspectral-thermal camera and onboard AI processing thanks to an Intel Movidius Myriad 2 Vision Processing Unit (VPU) -- the same chip inside many smart cameras and even a $99 selfie drone here on Earth. PhiSat-1 is actually one of a pair of satellites on a mission to monitor polar ice and soil moisture, while also testing intersatellite communication systems in order to create a future network of federated satellites.
Bayesian Probabilistic Numerical Integration with Tree-Based Models
Zhu, Harrison, Liu, Xing, Kang, Ruya, Shen, Zhichao, Flaxman, Seth, Briol, François-Xavier
Bayesian quadrature (BQ) is a method for solving numerical integration problems in a Bayesian manner, which allows users to quantify their uncertainty about the solution. The standard approach to BQ is based on a Gaussian process (GP) approximation of the integrand. As a result, BQ is inherently limited to cases where GP approximations can be done in an efficient manner, thus often prohibiting very high-dimensional or non-smooth target functions. This paper proposes to tackle this issue with a new Bayesian numerical integration algorithm based on Bayesian Additive Regression Trees (BART) priors, which we call BART-Int. BART priors are easy to tune and well-suited for discontinuous functions. We demonstrate that they also lend themselves naturally to a sequential design setting and that explicit convergence rates can be obtained in a variety of settings. The advantages and disadvantages of this new methodology are highlighted on a set of benchmark tests including the Genz functions, and on a Bayesian survey design problem.
Active learning with RESSPECT: Resource allocation for extragalactic astronomical transients
Kennamer, Noble, Ishida, Emille E. O., Gonzalez-Gaitan, Santiago, de Souza, Rafael S., Ihler, Alexander, Ponder, Kara, Vilalta, Ricardo, Moller, Anais, Jones, David O., Dai, Mi, Krone-Martins, Alberto, Quint, Bruno, Sreejith, Sreevarsha, Malz, Alex I., Galbany, Lluis
The recent increase in volume and complexity of available astronomical data has led to a wide use of supervised machine learning techniques. Active learning strategies have been proposed as an alternative to optimize the distribution of scarce labeling resources. However, due to the specific conditions in which labels can be acquired, fundamental assumptions, such as sample representativeness and labeling cost stability cannot be fulfilled. The Recommendation System for Spectroscopic follow-up (RESSPECT) project aims to enable the construction of optimized training samples for the Rubin Observatory Legacy Survey of Space and Time (LSST), taking into account a realistic description of the astronomical data environment. In this work, we test the robustness of active learning techniques in a realistic simulated astronomical data scenario. Our experiment takes into account the evolution of training and pool samples, different costs per object, and two different sources of budget. Results show that traditional active learning strategies significantly outperform random sampling. Nevertheless, more complex batch strategies are not able to significantly overcome simple uncertainty sampling techniques. Our findings illustrate three important points: 1) active learning strategies are a powerful tool to optimize the label-acquisition task in astronomy, 2) for upcoming large surveys like LSST, such techniques allow us to tailor the construction of the training sample for the first day of the survey, and 3) the peculiar data environment related to the detection of astronomical transients is a fertile ground that calls for the development of tailored machine learning algorithms.
COVID-19 Analysis and Forecasting Using Deep Learning
As of October 2020, the COVID-19 pandemic has claimed over 1 million lives across the world and over 41 million people have been infected. Understanding the factors and policies that influence the spread of the virus can help governments make informed decisions in order to control infections and deaths until a vaccine becomes widely available. The data used for this project can be divided into four different parts, each represented as separate data frames/tables in the code: policy data, mobility data, demographic data, and COVID-19 time-series statistics. The policy data, extracted from the OxCGRT dataset, contains information about the policies implemented by the government in each country to control the spread of COVID-19. The policy data is available for each day after the start of the pandemic.
GPU for Deep Learning Market Study Offers In-depth Insights – TechnoWeekly
We fulfil all your research needs spanning across industry verticals with our huge collection of market research reports. We provide our services to all sizes of organisations and across all industry verticals and markets. Our Research Coordinators have in-depth knowledge of reports as well as publishers and will assist you in making an informed decision by giving you unbiased and deep insights on which reports will satisfy your needs at the best price.
Global Artificial Intelligence Platform Market 2020 Industry Development, Strategy, Survey, Geographical Segmentation And Recent Trends 2024 – PRnews Leader
A new research Titled "Global Artificial Intelligence Platform Market 2020 Research Report" provides the Professional and In-depth evaluation of scope of current and future market and review of Product Specification, market trend, product type and production analysis considering major factors such as Facts and figure, revenue generated from the sales of this Report, market share and growth rate for each type and application, Gross Margin, key factors driving to the market. The Artificial Intelligence Platform market will reach Volume Million USD in 2019 and CAGR xx% 2015-2019. The report Primarly enlists the basic details of industry based on the fundamental overview of Artificial Intelligence Platform market chain structure, and describes industry surroundings, the development of the market through upstream & downstream, industry overall, investment analysis, manufacturing cost structure, industry policies, plans and development, key players will drive key business decisions and makes a scientific prediction for the development industry prospects on the basis of past, present and forecast data related to the Artificial Intelligence Platform market from 2020-2024. The Scope of the global Artificial Intelligence Platform market mainly focuses on globally, it primarily covers the Artificial Intelligence Platform Market in USA, Canada and Mexico, Artificial Intelligence Platform Market in Germany, France, UK, Russia and Italy, global Artificial Intelligence Platform market in China, Japan, Korea, India and Southeast Asia, global Artificial Intelligence Platform market in Brazil, Argentina, Columbia, Global market in Saudi Arabia, UAE, Egypt, Nigeria and South Africa. The firstly global Artificial Intelligence Platform market describes the market overview, Upstream, Technology, Cost Structure.