Overview
Interpretability and accessibility of machine learning in selected food processing, agriculture and health applications
Ranasinghe, N., Ramanan, A., Fernando, S., Hameed, P. N., Herath, D., Malepathirana, T., Suganthan, P., Niranjan, M., Halgamuge, S.
Figure 1 - Contributions of the Paper and Fair, Accessible, Interpretable and Reproducible (FAIR) AI adapted from (Halgamuge S., 2021) Artificial intelligence (AI) has seen an explosive growth over the last 20 years, largely through recent advances in machine learning (ML) - the data-centric branch of AI. A data-centric AI system consists of an AI model (a structure or architecture) and a method or learning algorithm that enables that model to derive usable information from data. Sometimes the data are exploratory, like the genomic data arriving from different parts of the world about constantly mutating viruses. To discover the presence of new variants or labels, we can feed an AI model with such uninterpreted data, so that researchers will be able to use this AI model to assign labels. Such AI models need unsupervised learning (UL) algorithms to extract information from unlabeled and uninterpreted data. We could also ask those researchers themselves to label data with appropriate variant labels, and feed both labels and genomic data to an AI model that can then use a supervised learning algorithm like deep learning (DL), so that it can serve as a predictor for known variants of the virus. If such an AI model of sufficient strength requires it to be large, deep and complex, we call it a deep neural network (DNN). Shallow neural networks, commonly referred to as Neural Networks (NNs) are data driven mathematical models consisting of about three layers of artificial neurons or nodes (several linear and nonlinear processing elements) which are interconnected through weighted connections.
Graph Neural Networks: A Powerful and Versatile Tool for Advancing Design, Reliability, and Security of ICs
Alrahis, Lilas, Knechtel, Johann, Sinanoglu, Ozgur
Graph neural networks (GNNs) have pushed the state-of-the-art (SOTA) for performance in learning and predicting on large-scale data present in social networks, biology, etc. Since integrated circuits (ICs) can naturally be represented as graphs, there has been a tremendous surge in employing GNNs for machine learning (ML)-based methods for various aspects of IC design. Given this trajectory, there is a timely need to review and discuss some powerful and versatile GNN approaches for advancing IC design. In this paper, we propose a generic pipeline for tailoring GNN models toward solving challenging problems for IC design. We outline promising options for each pipeline element, and we discuss selected and promising works, like leveraging GNNs to break SOTA logic obfuscation. Our comprehensive overview of GNNs frameworks covers (i) electronic design automation (EDA) and IC design in general, (ii) design of reliable ICs, and (iii) design as well as analysis of secure ICs. We provide our overview and related resources also in the GNN4IC hub at https://github.com/DfX-NYUAD/GNN4IC. Finally, we discuss interesting open problems for future research.
A survey on multi-player bandits
Boursier, Etienne, Perchet, Vianney
Due mostly to its application to cognitive radio networks, multiplayer bandits gained a lot of interest in the last decade. A considerable progress has been made on its theoretical aspect. However, the current algorithms are far from applicable and many obstacles remain between these theoretical results and a possible implementation of multiplayer bandits algorithms in real cognitive radio networks. This survey contextualizes and organizes the rich multiplayer bandits literature. In light of the existing works, some clear directions for future research appear. We believe that a further study of these different directions might lead to theoretical algorithms adapted to real-world situations.
Holding AI to Account: Challenges for the Delivery of Trustworthy AI in Healthcare
Procter, Rob, Tolmie, Peter, Rouncefield, Mark
The need for AI systems to provide explanations for their behaviour is now widely recognised as key to their adoption. In this paper, we examine the problem of trustworthy AI and explore what delivering this means in practice, with a focus on healthcare applications. Work in this area typically treats trustworthy AI as a problem of Human-Computer Interaction involving the individual user and an AI system. However, we argue here that this overlooks the important part played by organisational accountability in how people reason about and trust AI in socio-technical settings. To illustrate the importance of organisational accountability, we present findings from ethnographic studies of breast cancer screening and cancer treatment planning in multidisciplinary team meetings to show how participants made themselves accountable both to each other and to the organisations of which they are members. We use these findings to enrich existing understandings of the requirements for trustworthy AI and to outline some candidate solutions to the problems of making AI accountable both to individual users and organisationally. We conclude by outlining the implications of this for future work on the development of trustworthy AI, including ways in which our proposed solutions may be re-used in different application settings.
What is Generative AI, and How Will It Disrupt Society?
The concept of generative artificial intelligence (GAI) poses a groundbreaking question that has until recently not been contemplated: at what stage does the relationship between humans and machines evolve from its present-day form into one that is so fundamentally changed that we can no longer regard one as being superior to the other when it comes to creative terms? Humanity stands on the brink of a new technological revolution. It is poised to harness the full potential of AI and machine learning, allowing us to automate many tasks and systems, revolutionise communication, and conserve time and money in our daily lives. Many are concerned that this could be the harbinger of a world full of robot overlords which would rob the human race of its free will. But what about those who will create those machines? In fact, some argue that in developing AI, we are creating a tool to enhance human cognition, giving us new means to think, invent and explore the universe rather than enslave humanity. Let's explore what generative AI is, where it currently stands, and where it could potentially take us in the next years. Generative AI is a branch of computer science that involves unsupervised and semi-supervised algorithms that enable computers to create new content using previously created content, such as text, audio, video, images, and code. It is all about creating authentic-looking artifacts that are completely original. In other words, generative AI is a subset of machine learning that focuses on creating algorithms that can generate new data. Generative models are used in many different application areas, from art and music to computer vision and robotics.
Topic Modeling with BERTopic - Talking Language AI Ep#1
In the first episode of the Talking Language AI series, I spoke with Maarten Grootendorst, author and maintainer of the BERTopic open source package (over 3,000 stars on Github). BERTopic is used to explore collections of text to spot trends and identify the topics in these texts. This is an NLP task called Topic Modeling. It's also embedded in the bottom of this overview. Feel free to post questions or comments in this thread in the Cohere Discord.
FedLesScan: Mitigating Stragglers in Serverless Federated Learning
Elzohairy, Mohamed, Chadha, Mohak, Jindal, Anshul, Grafberger, Andreas, Gu, Jianfeng, Gerndt, Michael, Abboud, Osama
Federated Learning (FL) is a machine learning paradigm that enables the training of a shared global model across distributed clients while keeping the training data local. While most prior work on designing systems for FL has focused on using stateful always running components, recent work has shown that components in an FL system can greatly benefit from the usage of serverless computing and Function-as-a-Service technologies. To this end, distributed training of models with serverless FL systems can be more resource-efficient and cheaper than conventional FL systems. However, serverless FL systems still suffer from the presence of stragglers, i.e., slow clients due to their resource and statistical heterogeneity. While several strategies have been proposed for mitigating stragglers in FL, most methodologies do not account for the particular characteristics of serverless environments, i.e., cold-starts, performance variations, and the ephemeral stateless nature of the function instances. Towards this, we propose FedLesScan, a novel clustering-based semi-asynchronous training strategy, specifically tailored for serverless FL. FedLesScan dynamically adapts to the behaviour of clients and minimizes the effect of stragglers on the overall system. We implement our strategy by extending an open-source serverless FL system called FedLess. Moreover, we comprehensively evaluate our strategy using the 2nd generation Google Cloud Functions with four datasets and varying percentages of stragglers. Results from our experiments show that compared to other approaches FedLesScan reduces training time and cost by an average of 8% and 20% respectively while utilizing clients better with an average increase in the effective update ratio of 17.75%.
Performance Evaluation, Optimization and Dynamic Decision in Blockchain Systems: A Recent Overview
Li, Quan-Lin, Chang, Yan-Xia, Wang, Qing
With rapid development of blockchain technology as well as integration of various application areas, performance evaluation, performance optimization, and dynamic decision in blockchain systems are playing an increasingly important role in developing new blockchain technology. This paper provides a recent systematic overview of this class of research, and especially, developing mathematical modeling and basic theory of blockchain systems. Important examples include (a) performance evaluation: Markov processes, queuing theory, Markov reward processes, random walks, fluid and diffusion approximations, and martingale theory; (b) performance optimization: Linear programming, nonlinear programming, integer programming, and multi-objective programming; (c) optimal control and dynamic decision: Markov decision processes, and stochastic optimal control; and (d) artificial intelligence: Machine learning, deep reinforcement learning, and federated learning. So far, a little research has focused on these research lines. We believe that the basic theory with mathematical methods, algorithms and simulations of blockchain systems discussed in this paper will strongly support future development and continuous innovation of blockchain technology.
A Survey on Conversational Search and Applications in Biomedicine
Adatrao, Naga Sai Krishna, Gadireddy, Gowtham Reddy, Noh, Jiho
This paper aims to provide a radical rundown on Conversation Search (ConvSearch), an approach to enhance the information retrieval method where users engage in a dialogue for the information-seeking tasks. In this survey, we predominantly focused on the human interactive characteristics of the ConvSearch systems, highlighting the operations of the action modules, likely the Retrieval system, Question-Answering, and Recommender system. We labeled various ConvSearch research problems in knowledge bases, natural language processing, and dialogue management systems along with the action modules. We further categorized the framework to ConvSearch and the application is directed toward biomedical and healthcare fields for the utilization of clinical social technology. Finally, we conclude by talking through the challenges and issues of ConvSearch, particularly in Bio-Medicine. Our main aim is to provide an integrated and unified vision of the ConvSearch components from different fields, which benefit the information-seeking process in healthcare systems.
Bayesian Network Models of Causal Interventions in Healthcare Decision Making: Literature Review and Software Evaluation
Velikzhanin, Artem, Wang, Benjie, Kwiatkowska, Marta
This report summarises the outcomes of a systematic literature search to identify Bayesian network models used to support decision making in healthcare. After describing the search methodology, the selected research papers are briefly reviewed, with the view to identify publicly available models and datasets that are well suited to analysis using the causal interventional analysis software tool developed in Wang B, Lyle C, Kwiatkowska M (2021). Finally, an experimental evaluation of applying the software on a selection of models is carried out and preliminary results are reported.