Overview
Putting Humans in the Natural Language Processing Loop: A Survey
Wang, Zijie J., Choi, Dongjin, Xu, Shenyu, Yang, Diyi
How can we design Natural Language Processing (NLP) systems that learn from human feedback? There is a growing research body of Human-in-the-loop (HITL) NLP frameworks that continuously integrate human feedback to improve the model itself. HITL NLP research is nascent but multifarious -- solving various NLP problems, collecting diverse feedback from different people, and applying different methods to learn from collected feedback. We present a survey of HITL NLP work from both Machine Learning (ML) and Human-Computer Interaction (HCI) communities that highlights its short yet inspiring history, and thoroughly summarize recent frameworks focusing on their tasks, goals, human interactions, and feedback learning methods. Finally, we discuss future directions for integrating human feedback in the NLP development loop.
New 'AI Festival' unveils impressive line-up of inspiring speakers
Supported by a host of East Anglian based businesses and universities, this virtual event is exploring the possibilities for applied'Artificial Intelligence' Facebook, Google, and BT are just some of the leading technology companies set to share their knowledge and insights at the newly launched AI Festival, on 24 and 25 February 2021. Created by Suffolk based Orbital Global and BT, this virtual event is a unique initiative that will explore the implications for business, skills, and employment in relation to what could be the defining technology of the 21st Century. Taking place online at www.aiglobalfestival.com and accessible globally, the ticketed, two-day event brings together a range of sector specialists to share their experiences and forecasts for the future in a series of inspirational keynote talks, workshops, 'fireside chats', and technology demonstrations. This includes former NASA scientist, Peter Scott, who worked for the space agency's famed Jet Propulsion Laboratory, speaking about the future of AI and technology, Professor Paul Hunter from University of East Anglia will share what the pandemic tells us about future AI and digital based approaches to health, and Daniela Rus from MIT will provide an overview of AI robotic automation and the opportunities this offers the average business. The line-up includes many other world leading representatives from organisations such as PwC, Silicon Valley Bank, Alan Turing Institute, MIT, IQ Capital, Innovate UK, Orbital Global, VirtTuri, Wilkin and Sons Tiptree, University of Essex, and University of Suffolk.
The whole brain architecture approach: Accelerating the development of artificial general intelligence by referring to the brain
The vastness of the design space created by the combination of a large number of computational mechanisms, including machine learning, is an obstacle to creating an artificial general intelligence (AGI). Brain-inspired AGI development, in other words, cutting down the design space to look more like a biological brain, which is an existing model of a general intelligence, is a promising plan for solving this problem. However, it is difficult for an individual to design a software program that corresponds to the entire brain because the neuroscientific data required to understand the architecture of the brain are extensive and complicated. The whole-brain architecture approach divides the brain-inspired AGI development process into the task of designing the brain reference architecture (BRA) -- the flow of information and the diagram of corresponding components -- and the task of developing each component using the BRA. This is called BRA-driven development. Another difficulty lies in the extraction of the operating principles necessary for reproducing the cognitive-behavioral function of the brain from neuroscience data. Therefore, this study proposes the Structure-constrained Interface Decomposition (SCID) method, which is a hypothesis-building method for creating a hypothetical component diagram consistent with neuroscientific findings. The application of this approach has begun for building various regions of the brain. Moving forward, we will examine methods of evaluating the biological plausibility of brain-inspired software. This evaluation will also be used to prioritize different computational mechanisms, which should be merged, associated with the same regions of the brain.
Foundations of Population-Based SHM, Part IV: The Geometry of Spaces of Structures and their Feature Spaces
Tsialiamanis, George, Mylonas, Charilaos, Chatzi, Eleni, Dervilis, Nikolaos, Wagg, David J., Worden, Keith
One of the requirements of the population-based approach to Structural Health Monitoring (SHM) proposed in the earlier papers in this sequence, is that structures be represented by points in an abstract space. Furthermore, these spaces should be metric spaces in a loose sense; i.e. there should be some measure of distance applicable to pairs of points; similar structures should then be close in the metric. However, this geometrical construction is not enough for the framing of problems in data-based SHM, as it leaves undefined the notion of feature spaces. Interpreting the feature values on a structure-by-structure basis as a type of field over the space of structures, it seems sensible to borrow an idea from modern theoretical physics, and define feature assignments as sections in a vector bundle over the structure space. With this idea in place, one can interpret the effect of environmental and operational variations as gauge degrees of freedom, as in modern gauge field theories. This paper will discuss the various geometrical structures required for an abstract theory of feature spaces in SHM, and will draw analogies with how these structures have shown their power in modern physics. In the second part of the paper, the problem of determining the normal condition cross section of a feature bundle is addressed. The solution is provided by the application of Graph Neural Networks (GNN), a versatile non-Euclidean machine learning algorithm which is not restricted to inputs and outputs from vector spaces. In particular, the algorithm is well suited to operating directly on the sort of graph structures which are an important part of the proposed framework for PBSHM. The solution of the normal section problem is demonstrated for a heterogeneous population of truss structures for which the feature of interest is the first natural frequency.
An open-source machine learning framework to carry out systematic reviews
When scientists carry out research on a given topic, they often start by reviewing previous study findings. Conducting systematic literature reviews or meta-analyses can be very challenging and time consuming, as there are often huge amounts of research focusing on different topics, which may not always be relevant to a researcher's work. Researchers at Utrecht University have recently developed a machine learning framework that could significantly speed up this process, by automatically browsing through numerous past studies and compiling high quality literature reviews. This framework, called ASReview, could prove particularly useful for conducting research during the COVID-19 pandemic. "Researchers and experts face a major challenge to stay up-to-date with the latest developments in their field nowadays," Jonathan de Bruin, lead engineer involved in the study, told TechXplore.
Continuous Coordination As a Realistic Scenario for Lifelong Learning
Nekoei, Hadi, Badrinaaraayanan, Akilesh, Courville, Aaron, Chandar, Sarath
Current deep reinforcement learning (RL) algorithms are still highly task-specific and lack the ability to generalize to new environments. Lifelong learning (LLL), however, aims at solving multiple tasks sequentially by efficiently transferring and using knowledge between tasks. Despite a surge of interest in lifelong RL in recent years, the lack of a realistic testbed makes robust evaluation of LLL algorithms difficult. Multi-agent RL (MARL), on the other hand, can be seen as a natural scenario for lifelong RL due to its inherent non-stationarity, since the agents' policies change over time. In this work, we introduce a multi-agent lifelong learning testbed that supports both zero-shot and few-shot settings. Our setup is based on Hanabi -- a partially-observable, fully cooperative multi-agent game that has been shown to be challenging for zero-shot coordination. Its large strategy space makes it a desirable environment for lifelong RL tasks. We evaluate several recent MARL methods, and benchmark state-of-the-art LLL algorithms in limited memory and computation regimes to shed light on their strengths and weaknesses. This continual learning paradigm also provides us with a pragmatic way of going beyond centralized training which is the most commonly used training protocol in MARL. We empirically show that the agents trained in our setup are able to coordinate well with unseen agents, without any additional assumptions made by previous works.
Advances in Multi-turn Dialogue Comprehension: A Survey
Training machines to understand natural language and interact with humans is an elusive and essential task in the field of artificial intelligence. In recent years, a diversity of dialogue systems has been designed with the rapid development of deep learning researches, especially the recent pre-trained language models. Among these studies, the fundamental yet challenging part is dialogue comprehension whose role is to teach the machines to read and comprehend the dialogue context before responding. In this paper, we review the previous methods from the perspective of dialogue modeling. We summarize the characteristics and challenges of dialogue comprehension in contrast to plain-text reading comprehension. Then, we discuss three typical patterns of dialogue modeling that are widely-used in dialogue comprehension tasks such as response selection and conversation question-answering, as well as dialogue-related language modeling techniques to enhance PrLMs in dialogue scenarios. Finally, we highlight the technical advances in recent years and point out the lessons we can learn from the empirical analysis and the prospects towards a new frontier of researches.
AIhub monthly digest: February 2021
Welcome to the second of our monthly digests, designed to keep you up-to-date with the happenings in the AI world. You can catch up with any AIhub stories you may have missed, get the low-down on recent conferences, and generally immerse yourself in all things AI. You may be aware that we are running a focus series on the UN sustainable development goals (SDG). Each month we tackle a different SDG and cover some of the AI research linked to that particular goal. In February it was the turn of climate action.
How will Singapore ensure responsible AI use?
Since 2019, government-sponsored initiatives around AI have proliferated across Asia Pacific. Such initiatives include the setting up of cross-domain AI ethics councils, guidelines and frameworks for the responsible use of AI, and other initiatives such as financial and technology support. The majority of these initiatives builds on the country's respective data privacy and protection acts. This is a clear sign that governments see the need to expand existing regulations when it comes to leveraging AI as a key driver for digital economies. All initiatives to date are voluntary in nature, but there are indications already that existing data privacy and protection laws will be updated and expanded to include AI.
Deep Neural Networks for the Assessment of Surgical Skills: A Systematic Review
Yanik, Erim, Intes, Xavier, Kruger, Uwe, Yan, Pingkun, Miller, David, Van Voorst, Brian, Makled, Basiel, Norfleet, Jack, De, Suvranu
Surgical training in medical school residency programs has followed the apprenticeship model. The learning and assessment process is inherently subjective and time-consuming. Thus, there is a need for objective methods to assess surgical skills. Here, we use the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines to systematically survey the literature on the use of Deep Neural Networks for automated and objective surgical skill assessment, with a focus on kinematic data as putative markers of surgical competency. There is considerable recent interest in deep neural networks (DNN) due to the availability of powerful algorithms, multiple datasets, some of which are publicly available, as well as efficient computational hardware to train and host them. We have reviewed 530 papers, of which we selected 25 for this systematic review. Based on this review, we concluded that DNNs are powerful tools for automated, objective surgical skill assessment using both kinematic and video data. The field would benefit from large, publicly available, annotated datasets that are representative of the surgical trainee and expert demographics and multimodal data beyond kinematics and videos.