Overview
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Zhang, Chao, Yang, Zichao, He, Xiaodong, Deng, Li
Deep learning has revolutionized speech recognition, image recognition, and natural language processing since 2010, each involving a single modality in the input signal. However, many applications in artificial intelligence involve more than one modality. It is therefore of broad interest to study the more difficult and complex problem of modeling and learning across multiple modalities. In this paper, a technical review of the models and learning methods for multimodal intelligence is provided. The main focus is the combination of vision and natural language, which has become an important area in both computer vision and natural language processing research communities. This review provides a comprehensive analysis of recent work on multimodal deep learning from three new angles - learning multimodal representations, the fusion of multimodal signals at various levels, and multimodal applications. On multimodal representation learning, we review the key concept of embedding, which unifies the multimodal signals into the same vector space and thus enables cross-modality signal processing. We also review the properties of the many types of embedding constructed and learned for general downstream tasks. On multimodal fusion, this review focuses on special architectures for the integration of the representation of unimodal signals for a particular task. On applications, selected areas of a broad interest in current literature are covered, including caption generation, text-to-image generation, and visual question answering. We believe this review can facilitate future studies in the emerging field of multimodal intelligence for the community.
AI Weekly: Introducing our 'Power in AI' special issue
At VentureBeat, there's a constant internal conversation about how we can keep finding better ways to meet our mission of covering transformative technology. "AI" is usually the key word in those conversations, which increasingly revolve around issues and topics that require deeper thought, greater attention, and accountability from the Fourth Estate. Each one is composed of features that explore a central topic from a variety of angles. Early next week, we'll be publishing our first -- an examination of power in AI. There's been much ink spilled about AI ethics, and for good reason.
Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview
Shah, Deven, Schwartz, H. Andrew, Hovy, Dirk
An increasing number of works in natural language processing have addressed the effect of bias on the predicted outcomes, introducing mitigation techniques that act on different parts of the standard NLP pipeline (data and models). However, these works have been conducted in isolation, without a unifying framework to organize efforts within the field. This leads to repetitive approaches, and puts an undue focus on the effects of bias, rather than on their origins. Research focused on bias symptoms rather than the underlying origins could limit the development of effective countermeasures. In this paper, we propose a unifying conceptualization: the predictive bias framework for NLP . We summarize the NLP literature and propose a general mathematical definition of predictive bias in NLP along with a conceptual framework, differentiating four main origins of biases: label bias, selection bias, model overamplification, and semantic bias . We discuss how past work has countered each bias origin. Our framework serves to guide an introductory overview of predictive bias in NLP, integrating existing work into a single structure and opening avenues for future research.
Meta Label Correction for Learning with Weak Supervision
Zheng, Guoqing, Awadallah, Ahmed Hassan, Dumais, Susan
Leveraging weak or noisy supervision for building effective machine learning models has long been an important research problem. The growing need for large-scale datasets to train deep learning models has increased its importance. Weak or noisy supervision could originate from multiple sources including non-expert annotators or automatic labeling based on heuristics or user interaction signals. Previous work on modeling and correcting weak labels have been focused on various aspects, including loss correction, training instance re-weighting, etc. In this paper, we approach this problem from a novel perspective based on meta-learning. We view the label correction procedure as a meta-process and propose a new meta-learning based framework termed MLC for learning with weak supervision. Experiments with different label noise levels on multiple datasets show that MLC can achieve large improvement over previous methods incorporating weak labels for learning.
Empirical validation of network learning with taxi GPS data from Wuhan, China
Xu, Susan Jia, Xie, Qian, Chow, Joseph Y. J., Liu, Xintao
Many studies have illustrated the import ance to accurately and precisely measure the attributes of an urban transport system. Due to the rise of Big Data and Internet of Things, there are numerous machine learning methods to measur e attributes of the transport system . Chow ( 1) provides an overview of these techniques including several appl ications like Allahviranloo and Recker ( 2) for activity pattern prediction; Cai et al. ( 3) for short - term traffic forecasting; Luque - Baena et al. ( 4) for vehicle detection; Lv et al. ( 5) for t raffic flow prediction; and Ma et al. ( 6) for network congestion prediction. However, generic machine learning techniques are not specifically designed to exploit the unique structure of urban transport networks. As a result, in recent years a theory of inverse problems (see 7) have emerged to capture network structure, dubbed " inverse transportation problems " by Xu et al. ( 8).
Protecting from Malware Obfuscation Attacks through Adversarial Risk Analysis
Redondo, Alberto, Insua, David Rios
Standard algorithms in detection systems perform insufficiently when dealing with malware passed through obfuscation tools. We illustrate this studying in detail an open source metamorphic software, making use of a hybrid framework to obtain the relevant features from binaries. We then provide an improved alternative solution based on adversarial risk analysis which we illustrate describe with an example. KEYWORDS: Adversarial Risk Analysis, Malware Obfuscation, Cybersecurity 1 INTRODUCTION The digital era is bringing along new global threats among which cybersecurity related ones emerge as truly worrisome, see for example the evolution of the Global Risks Map from the World Economic Forum (2017, 2018, 2019). Indeed, the operation of critical cyber infrastructures relies on components which could be cyber attacked, both incidentally and intentionally, suffering major performance degradation, Rao et al. (2016).
Global Big Data Conference
As the rise of e-commerce continues, companies around the globe have become increasingly sensitive to evolving consumer preferences. In a world where instant gratification has come to represent a generation, autonomous technologies are set to make a significant impact. When it comes to consumer shipping, McKinsey reports that 25 percent of all consumers would pay a premium for same-day or instant delivery made possible by autonomous tech. However, this figure is likely to grow, given that 30 percent of younger consumers are willing to pay more for the same shipping options. As industry use cases continue to expand, many have come to define the ecosystem as the autonomous "last-mile."
AI Stats News: Humans Plus AI 20X More Effective In Cybersecurity Defense Than Traditional Methods
Recent surveys, studies, forecasts and other quantitative assessments of the progress of AI highlighted the role of augmented intelligence, combining human intelligence with artificial intelligence to produce better results in cybersecurity defense and in getting more business value from the use of IoT data. Two thirds of UK financial institutions report they use machine learning (ML) in some form; ML is most commonly used in anti-money laundering (AML) and fraud detection as well as in customer-facing applications (e.g. Analysis of about 150 Federal departments and agencies identified 171 different uses of machine learning. Two of the leading agencies were the Securities and Exchange Commission and the Social Security Administration. The SEC uses machine learning to help identify scammers who may engage in insider trading.
FANN-on-MCU: An Open-Source Toolkit for Energy-Efficient Neural Network Inference at the Edge of the Internet of Things
Wang, Xiaying, Magno, Michele, Cavigelli, Lukas, Benini, Luca
The growing number of low-power smart devices in the Internet of Things is coupled with the concept of "Edge Computing", that is moving some of the intelligence, especially machine learning, towards the edge of the network. Enabling machine learning algorithms to run on resource-constrained hardware, typically on low-power smart devices, is challenging in terms of hardware (optimized and energy-efficient integrated circuits), algorithmic and firmware implementations. This paper presents FANN-on-MCU, an open-source toolkit built upon the Fast Artificial Neural Network (FANN) library to run lightweight and energy-efficient neural networks on microcontrollers based on both the ARM Cortex-M series and the novel RISC-V-based Parallel Ultra-Low-Power (PULP) platform. The toolkit takes multi-layer perceptrons trained with FANN and generates code targeted at execution on low-power microcontrollers either with a floating-point unit (i.e., ARM Cortex-M4F and M7F) or without (i.e., ARM Cortex M0-M3 or PULP-based processors). This paper also provides an architectural performance evaluation of neural networks on the most popular ARM Cortex-M family and the parallel RISC-V processor called Mr. Wolf. The evaluation includes experimental results for three different applications using a self-sustainable wearable multi-sensor bracelet. Experimental results show a measured latency in the order of only a few microseconds and a power consumption of few milliwatts while keeping the memory requirements below the limitations of the targeted microcontrollers. In particular, the parallel implementation on the octa-core RISC-V platform reaches a speedup of 22x and a 69% reduction in energy consumption with respect to a single-core implementation on Cortex-M4 for continuous real-time classification.
Advances in Machine Learning for the Behavioral Sciences
Kliegr, Tomáš, Bahník, Štěpán, Fürnkranz, Johannes
This is most apparent when auto-encoders are trained, where a network is trained to map the input data upon itself but is forced to project them into a lower-dimensional embedding space on the way (Vincent et al., 2010). In addition to the conventional fully connected layers, there are various special types of network connections. For example, in computer vision, convolu-tional layers are commonly used, which train multiple sliding windows that move over the image data and process just a part of the image at a time, thereby learning to recognize local features. These layers are subsequently abstracted into more and more complex visual patterns (Krizhevsky et al., 2017). For temporal data, one can use recurrent neural networks, which do not make predictions for individual input vectors, but for a sequence of input vectors. To do so, they allow feeding abstracted information from previous data points forward to the next layers.