Overview
Named Entity Recognition and Classification on Historical Documents: A Survey
Ehrmann, Maud, Hamdi, Ahmed, Pontes, Elvys Linhares, Romanello, Matteo, Doucet, Antoine
After decades of massive digitisation, an unprecedented amount of historical documents is available in digital format, along with their machine-readable texts. While this represents a major step forward with respect to preservation and accessibility, it also opens up new opportunities in terms of content mining and the next fundamental challenge is to develop appropriate technologies to efficiently search, retrieve and explore information from this 'big data of the past'. Among semantic indexing opportunities, the recognition and classification of named entities are in great demand among humanities scholars. Yet, named entity recognition (NER) systems are heavily challenged with diverse, historical and noisy inputs. In this survey, we present the array of challenges posed by historical documents to NER, inventory existing resources, describe the main approaches deployed so far, and identify key priorities for future developments.
Deep Learning for Ultrasound Beamforming
van Sloun, Ruud JG, Ye, Jong Chul, Eldar, Yonina C
Diagnostic imaging plays a critical role in healthcare, serving as a fundamental asset for timely diagnosis, disease staging and management as well as for treatment choice, planning, guidance, and follow-up. Among the diagnostic imaging options, ultrasound imaging is uniquely positioned, being a highly cost-effective modality that offers the clinician an unmatched and invaluable level of interaction, enabled by its real-time nature. Ultrasound probes are becoming increasingly compact and portable, with the market demand for low-cost pocket-sized and (in-body) miniaturized devices expanding. At the same time, there is a strong trend towards 3D imaging and the use of high-frame-rate imaging schemes; both accompanied by dramatically increasing data rates that pose a heavy burden on the probe-system communication and subsequent image reconstruction algorithms. With the demand for high-quality image reconstruction and signal extraction from less (e.g unfocused or parallel) transmissions that facilitate fast imaging, and a push towards compact probes, modern ultrasound imaging leans heavily on innovations in powerful digital receive channel processing. Beamforming, the process of mapping received ultrasound echoes to the spatial image domain, naturally lies at the heart of the ultrasound image formation chain. In this chapter on Deep Learning for Ultrasound Beamforming, we discuss why and when deep learning methods can play a compelling role in the digital beamforming pipeline, and then show how these data-driven systems can be leveraged for improved ultrasound image reconstruction.
A survey of Bayesian Network structure learning
Kitson, Neville K., Constantinou, Anthony C., Guo, Zhigao, Liu, Yang, Chobtham, Kiattikun
Bayesian Networks (BNs) have become increasingly popular over the last few decades as a tool for reasoning under uncertainty in fields as diverse as medicine, biology, epidemiology, economics and the social sciences. This is especially true in real-world areas where we seek to answer complex questions based on hypothetical evidence to determine actions for intervention. However, determining the graphical structure of a BN remains a major challenge, especially when modelling a problem under causal assumptions. Solutions to this problem include the automated discovery of BN graphs from data, constructing them based on expert knowledge, or a combination of the two. This paper provides a comprehensive review of combinatoric algorithms proposed for learning BN structure from data, describing 61 algorithms including prototypical, well-established and state-of-the-art approaches. The basic approach of each algorithm is described in consistent terms, and the similarities and differences between them highlighted. Methods of evaluating algorithms and their comparative performance are discussed including the consistency of claims made in the literature. Approaches for dealing with data noise in real-world datasets and incorporating expert knowledge into the learning process are also covered.
A Survey on Cost Types, Interaction Schemes, and Annotator Performance Models in Selection Algorithms for Active Learning in Classification
Herde, Marek, Huseljic, Denis, Sick, Bernhard, Calma, Adrian
Pool-based active learning (AL) aims to optimize the annotation process (i.e., labeling) as the acquisition of annotations is often time-consuming and therefore expensive. For this purpose, an AL strategy queries annotations intelligently from annotators to train a high-performance classification model at a low annotation cost. Traditional AL strategies operate in an idealized framework. They assume a single, omniscient annotator who never gets tired and charges uniformly regardless of query difficulty. However, in real-world applications, we often face human annotators, e.g., crowd or in-house workers, who make annotation mistakes and can be reluctant to respond if tired or faced with complex queries. Recently, a wide range of novel AL strategies has been proposed to address these issues. They differ in at least one of the following three central aspects from traditional AL: (1) They explicitly consider (multiple) human annotators whose performances can be affected by various factors, such as missing expertise. (2) They generalize the interaction with human annotators by considering different query and annotation types, such as asking an annotator for feedback on an inferred classification rule. (3) They take more complex cost schemes regarding annotations and misclassifications into account. This survey provides an overview of these AL strategies and refers to them as real-world AL. Therefore, we introduce a general real-world AL strategy as part of a learning cycle and use its elements, e.g., the query and annotator selection algorithm, to categorize about 60 real-world AL strategies. Finally, we outline possible directions for future research in the field of AL.
A review of deep learning methods for MRI reconstruction
Following the success of deep learning in a wide range of applications, neural network-based machine-learning techniques have received significant interest for accelerating magnetic resonance imaging (MRI) acquisition and reconstruction strategies. A number of ideas inspired by deep learning techniques for computer vision and image processing have been successfully applied to nonlinear image reconstruction in the spirit of compressed sensing for accelerated MRI. Given the rapidly growing nature of the field, it is imperative to consolidate and summarize the large number of deep learning methods that have been reported in the literature, to obtain a better understanding of the field in general. This article provides an overview of the recent developments in neural-network based approaches that have been proposed specifically for improving parallel imaging. A general background and introduction to parallel MRI is also given from a classical view of k-space based reconstruction methods.
Artificial intelligence (AI) at the edge: 3 key facts
Artificial Intelligence (AI) is moving from the realm of science fiction to widespread enterprise scalability. Even ten years ago, AI workloads were almost exclusively utilized by a small number of very profitable companies that had the resources to experiment and hire an extensive team of data scientists. Today, AI is used in a number of everyday tools, from language recognition to health care prediction and nearly every industry in between. AI is also now deployed at the edge, not just inside massive data processing facilities. That trend will continue in the coming years.
Reports of the Association for the Advancement of Artificial Intelligence's 2020 Fall Symposium Series
The Association for the Advancement of Artificial Intelligence's 2020 Fall Symposium Series was held virtually from November 11-14, 2020, and was collocated with three symposia postponed from March 2020 due to the COVID-19 Pandemic. There were five symposia in the fall program: AI for Social Good, Artificial Intelligence in Government and Public Sector, Conceptual Abstraction and Analogy in Natural and Artificial Intelligence, Physics-Guided AI to Accelerate Scientific Discovery, and Trust and Explainability in Artificial Intelligence for Human-Robot Interaction. Additionally, there were three symposia delayed from spring: AI Welcomes Systems Engineering: Towards the Science of Interdependence for Autonomous Human-Machine Teams, Deep Models and Artificial Intelligence for Defense Applications: Potentials, Theories, Practices, Tools, and Risks, and Towards Responsible AI in Surveillance, Media, and Security through Licensing. Recent developments in big data and computational power are revolutionizing several domains, opening up new opportunities and challenges. In this symposium, we highlighted two specific themes, namely humanitarian relief, and healthcare, where AI could be used for social good to achieve the United Nations (UN) sustainable development goals (SDGs) in those areas, which touch every aspect of human, social, and economic development. The talks at the symposium were focused on identifying the critical needs and pathways for responsible AI solutions to achieve SDGs, which demand holistic thinking on optimizing the trade-off between automation benefits and their potential side-effects, especially in a year that has upended societies globally due to the COVID-19 pandemic. Riding on the success of the AI for Social Good symposium that was held in Washington, DC, in November 2019, we organized the 2020 version of the symposium.
89% of tech execs see synthetic data as a key to staying ahead
The Transform Technology Summits start October 13th with Low-Code/No Code: Enabling Enterprise Agility. Nearly nine in ten (89%) technology decision makers who use vision data agree synthetic data is a new and innovative technology and believe that organizations that fail to adopt synthetic data are at risk of falling behind the curve, according to new research by Synthesis AI in conjunction with Vanson Bourne. Technology leaders agree that synthetic data will be an essential enabling technology and key to staying ahead. Above: Synthetic Data could be a solution to the time consuming and cost prohibitive nature of supervised learning. AI is driven by the speed, diversity, and quality of data.
Towards Energy-Efficient and Secure Edge AI: A Cross-Layer Framework
Shafique, Muhammad, Marchisio, Alberto, Putra, Rachmad Vidya Wicaksana, Hanif, Muhammad Abdullah
The security and privacy concerns along with the amount of data that is required to be processed on regular basis has pushed processing to the edge of the computing systems. Deploying advanced Neural Networks (NN), such as deep neural networks (DNNs) and spiking neural networks (SNNs), that offer state-of-the-art results on resource-constrained edge devices is challenging due to the stringent memory and power/energy constraints. Moreover, these systems are required to maintain correct functionality under diverse security and reliability threats. This paper first discusses existing approaches to address energy efficiency, reliability, and security issues at different system layers, i.e., hardware (HW) and software (SW). Afterward, we discuss how to further improve the performance (latency) and the energy efficiency of Edge AI systems through HW/SW-level optimizations, such as pruning, quantization, and approximation. To address reliability threats (like permanent and transient faults), we highlight cost-effective mitigation techniques, like fault-aware training and mapping. Moreover, we briefly discuss effective detection and protection techniques to address security threats (like model and data corruption). Towards the end, we discuss how these techniques can be combined in an integrated cross-layer framework for realizing robust and energy-efficient Edge AI systems.
A Survey of Text Games for Reinforcement Learning informed by Natural Language
Osborne, Philip, Nõmm, Heido, Freitas, Andre
Reinforcement Learning (RL) has shown human-level performance in solving complex, single setting virtual environments Mnih et al. [2013] & Silver et al. [2016]. However, applications and theory in RL problems have been far less developed and it has been posed that this is due to a wide divide between the empirical methodology associated with virtual environments in RL research and the challenges associated with reality Dulac-Arnold et al. [2019]. Simply put, Text Games provide a safe and data efficient way to learn from environments that mimic language found in real-world scenarios Shridhar et al. [2020]. Natural language (NL) has been introduced as a solution to many of the challenges in RL Luketina et al. [2019], as NL can facilitate the transfer of abstract knowledge to downstream tasks. However, RL approaches on these language driven environments are still limited in their development and therefore a call has been made for an improvement on the evaluation settings where language is a first-class component. Text Games gained wider acceptance as a testbed for NL research following work Figure 1: Sample gameplay from Narasimhan et al. [2015] who leveraged the Deep Q Network (DQN) framework from a fantasy Text Game as for policy learning on a set of synthetic textual games. Text Games are both partially given by Narasimhan et al. observable (as shown in Figure 1) and include outcomes that make reward signals [2015] where the player takes simple to define, making them a suitable problem for Reinforcement Learning to the action'Go East' to cross solve. However, research so far has been performed independently, with many authors the bridge.