Overview
A Brief Overview of AI Governance for Responsible Machine Learning Systems
Gill, Navdeep, Mathur, Abhishek, Conde, Marcos V.
Organizations of all sizes, across all industries and domains are leveraging artificial intelligence (AI) technologies to solve some of their biggest challenges around operations, customer experience, and much more. However, due to the probabilistic nature of AI, the risks associated with it are far greater than traditional technologies. Research has shown that these risks can range anywhere from regulatory, compliance, reputational, and user trust, to financial and even societal risks. Depending on the nature and size of the organization, AI technologies can pose a significant risk, if not used in a responsible way. This position paper seeks to present a brief introduction to AI governance, which is a framework designed to oversee the responsible use of AI with the goal of preventing and mitigating risks. Having such a framework will not only manage risks but also gain maximum value out of AI projects and develop consistency for organization-wide adoption of AI.
AICOM-MP: an AI-based Monkeypox Detector for Resource-Constrained Environments
Yang, Tim Tianyi, Yang, Tom Tianze, Liu, Andrew, Tang, Jie, An, Na, Liu, Shaoshan, Liu, Xue
Under the Autonomous Mobile Clinics (AMCs) initiative, we are developing, open sourcing, and standardizing health AI technologies to enable healthcare access in least developed countries (LDCs). We deem AMCs as the next generation of health care delivery platforms, whereas health AI engines are applications on these platforms, similar to how various applications expand the usage scenarios of smart phones. Facing the recent global monkeypox outbreak, in this article, we introduce AICOM-MP, an AI-based monkeypox detector specially aiming for handling images taken from resource-constrained devices. Compared to existing AI-based monkeypox detectors, AICOM-MP has achieved state-of-the-art (SOTA) performance. We have hosted AICOM-MP as a web service to allow universal access to monkeypox screening technology. We have also open sourced both the source code and the dataset of AICOM-MP to allow health AI professionals to integrate AICOM-MP into their services. Also, through the AICOM-MP project, we have generalized a methodology of developing health AI technologies for AMCs to allow universal access even in resource-constrained environments.
ANALYSIS: Patents Forecast Widespread Reach of AI Tech in 2023
Artificial intelligence is driving important developments in technology, from controlling autonomous vehicles, to developing medical diagnoses, to combating climate change. The global AI market was valued at nearly $59.7 billion in 2021, and is estimated to reach $422.4 billion by 2028. In conjunction with the global AI market growth, the number of patents for AI technology are on an upswing, and a general survey of patents for AI technologies shows just how innovative these industries are becoming. The types and variety of patent filings for AI technologies in the fast-growing FinTech, biology and pharma, clean/green technology, and automotive industries further show that the expansion of AI advancements is inevitable, and next year should see a continuation of this trend in filings. There's also been significant cross-technology development, further driving AI's prevalence in a number of fields.
MEESO: A Multi-objective End-to-End Self-Optimized Approach for Automatically Building Deep Learning Models
Deep learning has been widely used in various applications from different fields such as computer vision, natural language processing, etc. However, the training models are often manually developed via many costly experiments. This manual work usually requires substantial computing resources, time, and experience. To simplify the use of deep learning and alleviate human effort, automated deep learning has emerged as a potential tool that releases the burden for both users and researchers. Generally, an automatic approach should support the diversity of model selection and the evaluation should allow users to decide upon their demands. To that end, we propose a multi-objective end-to-end self-optimized approach for constructing deep learning models automatically. Experimental results on well-known datasets such as MNIST, Fashion, and Cifar10 show that our algorithm can discover various competitive models compared with the state-of-the-art approach. In addition, our approach also introduces multi-objective trade-off solutions for both accuracy and uncertainty metrics for users to make better decisions.
A Survey on Differential Privacy with Machine Learning and Future Outlook
Baraheem, Samah, Yao, Zhongmei
Nowadays, machine learning models and applications have become increasingly pervasive. With this rapid increase in the development and employment of machine learning models, a concern regarding privacy has risen. Thus, there is a legitimate need to protect the data from leaking and from any attacks. One of the strongest and most prevalent privacy models that can be used to protect machine learning models from any attacks and vulnerabilities is differential privacy (DP). DP is strict and rigid definition of privacy, where it can guarantee that an adversary is not capable to reliably predict if a specific participant is included in the dataset or not. It works by injecting a noise to the data whether to the inputs, the outputs, the ground truth labels, the objective functions, or even to the gradients to alleviate the privacy issue and protect the data. To this end, this survey paper presents different differentially private machine learning algorithms categorized into two main categories (traditional machine learning models vs. deep learning models). Moreover, future research directions for differential privacy with machine learning algorithms are outlined.
Federated Learning for Healthcare Domain - Pipeline, Applications and Challenges
Joshi, Madhura, Pal, Ankit, Sankarasubbu, Malaikannan
Federated learning is the process of developing machine learning models over datasets distributed across data centers such as hospitals, clinical research labs, and mobile devices while preventing data leakage. This survey examines previous research and studies on federated learning in the healthcare sector across a range of use cases and applications. Our survey shows what challenges, methods, and applications a practitioner should be aware of in the topic of federated learning. This paper aims to lay out existing research and list the possibilities of federated learning for healthcare industries.
Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Ren, Zhizhou, Liu, Anji, Liang, Yitao, Peng, Jian, Ma, Jianzhu
Learning new task-specific skills from a few trials is a fundamental challenge for artificial intelligence. Meta reinforcement learning (meta-RL) tackles this problem by learning transferable policies that support few-shot adaptation to unseen tasks. Despite recent advances in meta-RL, most existing methods require the access to the environmental reward function of new tasks to infer the task objective, which is not realistic in many practical applications. To bridge this gap, we study the problem of few-shot adaptation in the context of human-in-the-loop reinforcement learning. We develop a meta-RL algorithm that enables fast policy adaptation with preference-based feedback. The agent can adapt to new tasks by querying human's preference between behavior trajectories instead of using per-step numeric rewards. By extending techniques from information theory, our approach can design query sequences to maximize the information gain from human interactions while tolerating the inherent error of non-expert human oracle. In experiments, we extensively evaluate our method, Adaptation with Noisy OracLE (ANOLE), on a variety of meta-RL benchmark tasks and demonstrate substantial improvement over baseline algorithms in terms of both feedback efficiency and error tolerance.
Partial Differential Equations Meet Deep Neural Networks: A Survey
Huang, Shudong, Feng, Wentao, Tang, Chenwei, Lv, Jiancheng
Many problems in science and engineering can be represented by a set of partial differential equations (PDEs) through mathematical modeling. Mechanism-based computation following PDEs has long been an essential paradigm for studying topics such as computational fluid dynamics, multiphysics simulation, molecular dynamics, or even dynamical systems. It is a vibrant multi-disciplinary field of increasing importance and with extraordinary potential. At the same time, solving PDEs efficiently has been a long-standing challenge. Generally, except for a few differential equations for which analytical solutions are directly available, many more equations must rely on numerical approaches such as the finite difference method, finite element method, finite volume method, and boundary element method to be solved approximately. These numerical methods usually divide a continuous problem domain into discrete points and then concentrate on solving the system at each of those points. Though the effectiveness of these traditional numerical methods, the vast number of iterative operations accompanying each step forward significantly reduces the efficiency. Recently, another equally important paradigm, data-based computation represented by deep learning, has emerged as an effective means of solving PDEs. Surprisingly, a comprehensive review for this interesting subfield is still lacking. This survey aims to categorize and review the current progress on Deep Neural Networks (DNNs) for PDEs. We discuss the literature published in this subfield over the past decades and present them in a common taxonomy, followed by an overview and classification of applications of these related methods in scientific research and engineering scenarios. The origin, developing history, character, sort, as well as the future trends in each potential direction of this subfield are also introduced.
An overview on deep learning-based approximation methods for partial differential equations
Beck, Christian, Hutzenthaler, Martin, Jentzen, Arnulf, Kuckuck, Benno
Partial differential equations (PDEs) are ubiquitous in mathematics as tools for modelling processes in nature or in man-made complex systems. PDEs appear, e.g., as Hamilton-Jacobi-Bellman equations in optimal control problems for describing the value function associated to the control problem, as Zakai or Kushner equations in nonlinear filtering problems for describing the conditional probability distribution of the state of the signal process in the nonlinear filtering problem, in models for the approximative valuation of financial products such as financial derivative contracts, and in the approximative description of the distribution of species in ecosystems to model biodiversity under changing climate conditions. The PDEs which appear in the abovenamed applications are often nonlinear and high-dimensional where, e.g., in the case of optimal control problems, the PDE dimension d N = {1,2,3,...} corresponds to the number of agents, particles, or resources in the optimal control problem, where, e.g., in the case of the approximative valuation of financial products, the PDE dimension d N corresponds to the number of financial assets (such as stocks, commodities, exchange rates, and interest rates) in the involved hedging portfolio, and where, e.g., in the case of the approximative description of the distribution of species in ecosystems, the PDE dimension d N corresponds to the number of characteristic traits of the species in the ecosystem under consideration. High-dimensional nonlinear PDEs cannot be solved analytically in nearly all cases and it is one of the most challenging issues in applied mathematics to design and analyze approximation methods for high-dimensional nonlinear PDEs.
Image-text Retrieval: A Survey on Recent Research and Development
Cao, Min, Li, Shiping, Li, Juntao, Nie, Liqiang, Zhang, Min
In the past few years, cross-modal image-text retrieval (ITR) has experienced increased interest in the research community due to its excellent research value and broad real-world application. It is designed for the scenarios where the queries are from one modality and the retrieval galleries from another modality. This paper presents a comprehensive and up-to-date survey on the ITR approaches from four perspectives. By dissecting an ITR system into two processes: feature extraction and feature alignment, we summarize the recent advance of the ITR approaches from these two perspectives. On top of this, the efficiency-focused study on the ITR system is introduced as the third perspective. To keep pace with the times, we also provide a pioneering overview of the cross-modal pre-training ITR approaches as the fourth perspective. Finally, we outline the common benchmark datasets and valuation metric for ITR, and conduct the accuracy comparison among the representative ITR approaches. Some critical yet less studied issues are discussed at the end of the paper.