Overview
Fairness in Graph Mining: A Survey
Dong, Yushun, Ma, Jing, Wang, Song, Chen, Chen, Li, Jundong
Abstract--Graph mining algorithms have been playing a significant role in myriad fields over the years. However, despite their promising performance on various graph analytical tasks, most of these algorithms lack fairness considerations. As a consequence, they could lead to discrimination towards certain populations when exploited in human-centered applications. Recently, algorithmic fairness has been extensively studied in graph-based applications. In contrast to algorithmic fairness on independent and identically distributed (i.i.d.) data, fairness in graph mining has exclusive backgrounds, taxonomies, and fulfilling techniques. In this survey, we provide a comprehensive and up-to-date introduction of existing literature under the context of fair graph mining. Specifically, we propose a novel taxonomy of fairness notions on graphs, which sheds light on their connections and differences. We further present an organized summary of existing techniques that promote fairness in graph mining. Finally, we discuss current research challenges and open questions, aiming at encouraging cross-breeding ideas and further advances. Graph-structured data is pervasive in diverse real-world Compared with achieving fairness in the context of independent applications, e.g., E-commerce [102], [121], health care [37], and identically distributed (i.i.d.) data, fulfilling [53], traffic forecasting [72], [100], and drug discovery [15], fairness in graph mining can be non-trivial due to two [172]. The first challenge is to formulate proper have been proposed to gain a deeper understanding of such fairness notions as the criteria to determine the existence of data. These algorithms have shown promising performance unfairness (i.e., bias). Although a vast amount of traditional on graph analytical tasks such as node classification [59], algorithmic fairness notions have been proposed centered [86], [161] and link prediction [4], [103], [109], contributing on i.i.d. For example, the same population can be most of them lack fairness considerations. Consequently, connected with different topologies as in Figure 1a and 1b, they could yield discriminatory results towards certain populations where each node represents an individual, and the color when such algorithms are exploited in humancentered of nodes denotes their demographic subgroup membership, applications [80]. Compared with the graph topology job recommender system may unfavorably recommend in Figure 1a, the topology in Figure 1b has more intra-group fewer job opportunities to individuals of a certain edges than inter-group edges. The dominance of intra-group gender [97] or individuals in an underrepresented ethnic edges in the graph topology is a common type of bias group [150].
A Survey of Resources and Methods for Natural Language Processing of Serbian Language
Marovac, Ulfeta A., Avdiฤ, Aldina R., Miloลกeviฤ, Nikola Lj.
The Serbian language is a Slavic language spoken by over 12 million speakers and well understood by over 15 million people. In the area of natural language processing, it can be considered a low-resourced language. Also, Serbian is considered a high-inflectional language. The combination of many word inflections and low availability of language resources makes natural language processing of Serbian challenging. Nevertheless, over the past three decades, there have been a number of initiatives to develop resources and methods for natural language processing of Serbian, ranging from developing a corpus of free text from books and the internet, annotated corpora for classification and named entity recognition tasks to various methods and models performing these tasks. In this paper, we review the initiatives, resources, methods, and their availability.
A Comprehensive Study on Object Detection Techniques in Unconstrained Environments
Object detection is a crucial task in computer vision that aims to identify and localize objects in images or videos. The recent advancements in deep learning and Convolutional Neural Networks (CNNs) have significantly improved the performance of object detection techniques. This paper presents a comprehensive study of object detection techniques in unconstrained environments, including various challenges, datasets, and state-of-the-art approaches. Additionally, we present a comparative analysis of the methods and highlight their strengths and weaknesses. Finally, we provide some future research directions to further improve object detection in unconstrained environments.
Learning Optimal Fair Scoring Systems for Multi-Class Classification
Rouzot, Julien, Ferry, Julien, Huguet, Marie-Josรฉ
Machine Learning models are increasingly used for decision making, in particular in high-stakes applications such as credit scoring, medicine or recidivism prediction. However, there are growing concerns about these models with respect to their lack of interpretability and the undesirable biases they can generate or reproduce. While the concepts of interpretability and fairness have been extensively studied by the scientific community in recent years, few works have tackled the general multi-class classification problem under fairness constraints, and none of them proposes to generate fair and interpretable models for multi-class classification. In this paper, we use Mixed-Integer Linear Programming (MILP) techniques to produce inherently interpretable scoring systems under sparsity and fairness constraints, for the general multi-class classification setup. Our work generalizes the SLIM (Supersparse Linear Integer Models) framework that was proposed by Rudin and Ustun to learn optimal scoring systems for binary classification. The use of MILP techniques allows for an easy integration of diverse operational constraints (such as, but not restricted to, fairness or sparsity), but also for the building of certifiably optimal models (or sub-optimal models with bounded optimality gap).
A Systematic Survey of Control Techniques and Applications in Connected and Automated Vehicles
Liu, Wei, Hua, Min, Deng, Zhiyun, Meng, Zonglin, Huang, Yanjun, Hu, Chuan, Song, Shunhui, Gao, Letian, Liu, Changsheng, Shuai, Bin, Khajepour, Amir, Xiong, Lu, Xia, Xin
Vehicle control is one of the most critical challenges in autonomous vehicles (AVs) and connected and automated vehicles (CAVs), and it is paramount in vehicle safety, passenger comfort, transportation efficiency, and energy saving. This survey attempts to provide a comprehensive and thorough overview of the current state of vehicle control technology, focusing on the evolution from vehicle state estimation and trajectory tracking control in AVs at the microscopic level to collaborative control in CAVs at the macroscopic level. First, this review starts with vehicle key state estimation, specifically vehicle sideslip angle, which is the most pivotal state for vehicle trajectory control, to discuss representative approaches. Then, we present symbolic vehicle trajectory tracking control approaches for AVs. On top of that, we further review the collaborative control frameworks for CAVs and corresponding applications. Finally, this survey concludes with a discussion of future research directions and the challenges. This survey aims to provide a contextualized and in-depth look at state of the art in vehicle control for AVs and CAVs, identifying critical areas of focus and pointing out the potential areas for further exploration.
Enhancing Image Classification with Data Image Augmentation in Python
Data image augmentation is a technique used in computer vision and deep learning to increase the amount and diversity of data available for training a model. This paper presents an overview of data image augmentation and provides a tutorial on how to perform data image augmentation in Python using the Keras.preprocessing.image The paper also includes a discussion on the benefits and limitations of data image augmentation and provides tips on how to use it effectively. In recent years, computer vision and deep learning have made significant strides in accurately classifying and detecting objects in images. One of the key factors that contribute to the success of these techniques is the availability of large and diverse datasets for training models.
Discharge Summary Hospital Course Summarisation of In Patient Electronic Health Record Text with Clinical Concept Guided Deep Pre-Trained Transformer Models
Searle, Thomas, Ibrahim, Zina, Teo, James, Dobson, Richard
Brief Hospital Course (BHC) summaries are succinct summaries of an entire hospital encounter, embedded within discharge summaries, written by senior clinicians responsible for the overall care of a patient. Methods to automatically produce summaries from inpatient documentation would be invaluable in reducing clinician manual burden of summarising documents under high time-pressure to admit and discharge patients. Automatically producing these summaries from the inpatient course, is a complex, multi-document summarisation task, as source notes are written from various perspectives (e.g. nursing, doctor, radiology), during the course of the hospitalisation. We demonstrate a range of methods for BHC summarisation demonstrating the performance of deep learning summarisation models across extractive and abstractive summarisation scenarios. We also test a novel ensemble extractive and abstractive summarisation model that incorporates a medical concept ontology (SNOMED) as a clinical guidance signal and shows superior performance in 2 real-world clinical data sets.
Drones on the Rise: Exploring the Current and Future Potential of UAVs
Unmanned Aerial Vehicles (UAVs) have become increasingly popular in recent years due to their versatility and affordability. This article provides an overview of the history and development of UAVs, as well as their current and potential applications in various fields. In particular, the article highlights the use of UAVs in aerial photography and videography, surveying and mapping, agriculture and forestry, infrastructure inspection and maintenance, search and rescue operations, disaster management and humanitarian aid, and military applications such as reconnaissance, surveillance, and combat. The article also explores potential advancements in UAV technology and new applications that could emerge in the future, as well as concerns about the impact of UAVs on society, such as privacy, safety, security, job displacement, and environmental impact. Overall, the article aims to provide a comprehensive overview of the current state and future potential of UAV technology, and the benefits and challenges associated with its use in various industries and fields.
AffectMachine-Classical: A novel system for generating affective classical music
Agres, Kat R., Dash, Adyasha, Chua, Phoebe
This work introduces a new music generation system, called AffectMachine-Classical, that is capable of generating affective Classic music in real-time. AffectMachine was designed to be incorporated into biofeedback systems (such as brain-computer-interfaces) to help users become aware of, and ultimately mediate, their own dynamic affective states. That is, this system was developed for music-based MedTech to support real-time emotion self-regulation in users. We provide an overview of the rule-based, probabilistic system architecture, describing the main aspects of the system and how they are novel. We then present the results of a listener study that was conducted to validate the ability of the system to reliably convey target emotions to listeners. The findings indicate that AffectMachine-Classical is very effective in communicating various levels of Arousal ($R^2 = .96$) to listeners, and is also quite convincing in terms of Valence (R^2 = .90). Future work will embed AffectMachine-Classical into biofeedback systems, to leverage the efficacy of the affective music for emotional well-being in listeners.
The Factual Inconsistency Problem in Abstractive Text Summarization: A Survey
Huang, Yichong, Feng, Xiachong, Feng, Xiaocheng, Qin, Bing
Recently, various neural encoder-decoder models pioneered by Seq2Seq framework have been proposed to achieve the goal of generating more abstractive summaries by learning to map input text to output text. At a high level, such neural models can freely generate summaries without any constraint on the words or phrases used. Moreover, their format is closer to human-edited summaries and output is more readable and fluent. However, the neural model's abstraction ability is a double-edged sword. A commonly observed problem with the generated summaries is the distortion or fabrication of factual information in the article. This inconsistency between the original text and the summary has caused various concerns over its applicability, and the previous evaluation methods of text summarization are not suitable for this issue. In response to the above problems, the current research direction is predominantly divided into two categories, one is to design fact-aware evaluation metrics to select outputs without factual inconsistency errors, and the other is to develop new summarization systems towards factual consistency. In this survey, we focus on presenting a comprehensive review of these fact-specific evaluation methods and text summarization models.