Torroni, Paolo
Investigating the Effectiveness of Explainability Methods in Parkinson's Detection from Speech
Mancini, Eleonora, Paissan, Francesco, Torroni, Paolo, Ravanelli, Mirco, Subakan, Cem
Speech impairments in Parkinson's disease (PD) provide significant early indicators for diagnosis. While models for speech-based PD detection have shown strong performance, their interpretability remains underexplored. This study systematically evaluates several explainability methods to identify PD-specific speech features, aiming to support the development of accurate, interpretable models for clinical decision-making in PD diagnosis and monitoring. Our methodology involves (i) obtaining attributions and saliency maps using mainstream interpretability techniques, (ii) quantitatively evaluating the faithfulness of these maps and their combinations obtained via union and intersection through a range of established metrics, and (iii) assessing the information conveyed by the saliency maps for PD detection from an auxiliary classifier. Our results reveal that, while explanations are aligned with the classifier, they often fail to provide valuable information for domain experts.
A Chatbot for Asylum-Seeking Migrants in Europe
Fazzinga, Bettina, Palmieri, Elena, Vestoso, Margherita, Bolognini, Luca, Galassi, Andrea, Furfaro, Filippo, Torroni, Paolo
We present ACME: A Chatbot for asylum-seeking Migrants tool that goes beyond the checklists used for handling well-defined, in Europe. ACME relies on computational argumentation and simple procedures since there is not only a problem of evaluating aims to help migrants identify the highest level of protection they legal and factual data, but there is also an issue with understanding can apply for. This would contribute to a more sustainable migration which procedures are relevant. Indeed, there is not only one type of by reducing the load on territorial commissions, Courts, and humanitarian protection but several ones. Importantly, since applicants may be political organizations supporting asylum applicants. We describe the refugees and victims of abuse, discrimination, and persecution, context, system architectures, technologies, and the case study used the collection and processing of their personal data for immigration to run the demonstration.
Let Guidelines Guide You: A Prescriptive Guideline-Centered Data Annotation Methodology
Ruggeri, Federico, Misino, Eleonora, Muti, Arianna, Korre, Katerina, Torroni, Paolo, Barrón-Cedeño, Alberto
We introduce the Guideline-Centered annotation process, a novel data annotation methodology focused on reporting the annotation guidelines associated with each data sample. We identify three main limitations of the standard prescriptive annotation process and describe how the Guideline-Centered methodology overcomes them by reducing the loss of information in the annotation process and ensuring adherence to guidelines. Additionally, we discuss how the Guideline-Centered enables the reuse of annotated data across multiple tasks at the cost of a single human-annotation process.
Dynamic Few-Shot Learning for Knowledge Graph Question Answering
D'Abramo, Jacopo, Zugarini, Andrea, Torroni, Paolo
Large language models present opportunities for innovative Question Answering over Knowledge Graphs (KGQA). However, they are not inherently designed for query generation. To bridge this gap, solutions have been proposed that rely on fine-tuning or ad-hoc architectures, achieving good results but limited out-of-domain distribution generalization. In this study, we introduce a novel approach called Dynamic Few-Shot Learning (DFSL). DFSL integrates the efficiency of in-context learning and semantic similarity and provides a generally applicable solution for KGQA with state-of-the-art performance. We run an extensive evaluation across multiple benchmark datasets and architecture configurations.
Promoting Fairness and Diversity in Speech Datasets for Mental Health and Neurological Disorders Research
Mancini, Eleonora, Tanevska, Ana, Galassi, Andrea, Galatolo, Alessio, Ruggeri, Federico, Torroni, Paolo
Current research in machine learning and artificial intelligence is largely centered on modeling and performance evaluation, less so on data collection. However, recent research demonstrated that limitations and biases in data may negatively impact trustworthiness and reliability. These aspects are particularly impactful on sensitive domains such as mental health and neurological disorders, where speech data are used to develop AI applications aimed at improving the health of patients and supporting healthcare providers. In this paper, we chart the landscape of available speech datasets for this domain, to highlight possible pitfalls and opportunities for improvement and promote fairness and diversity. We present a comprehensive list of desiderata for building speech datasets for mental health and neurological disorders and distill it into a checklist focused on ethical concerns to foster more responsible research.
Fast Vocabulary Transfer for Language Model Compression
Gee, Leonidas, Zugarini, Andrea, Rigutini, Leonardo, Torroni, Paolo
Real-world business applications require a trade-off between language model performance and size. We propose a new method for model compression that relies on vocabulary transfer. We evaluate the method on various vertical domains and downstream tasks. Our results indicate that vocabulary transfer can be effectively used in combination with other compression techniques, yielding a significant reduction in model size and inference time while marginally compromising on performance.
Tree-Constrained Graph Neural Networks For Argument Mining
Ruggeri, Federico, Lippi, Marco, Torroni, Paolo
Graph Neural Networks (GNNs) are currently a hot topic in artificial intelligence, with a huge amount of applications in many domains, ranging from bioinformatics to computer vision, from social network analysis to natural language processing [1]. First introduced in [2], and then far and wide extended with a large number of variants, GNNs can learn embedding representations of generic graphs, by exploiting aggregation functions based on propagation and pooling layers. These building blocks are frequently stacked into a deep network, and the resulting embeddings can be exploited in any high-level task. This kind of architecture has rapidly become the state-of-the-art, or at least a strong competitor, in many application domains dealing with structured data. Historically, in natural language processing (NLP) as well as in other domains, Tree Kernels (TKs) have long been one of the most widely employed technique to handle structured data in the form of trees [3]. A TK is basically a similarity function that captures the degree of similarity of two trees by looking at common fragments within their substructures.
MemBERT: Injecting Unstructured Knowledge into BERT
Ruggeri, Federico, Lippi, Marco, Torroni, Paolo
Transformers changed modern NLP in many ways. However, they can hardly exploit domain knowledge, and like other blackbox models, they lack interpretability. Unfortunately, structured knowledge injection, in the long run, risks to suffer from a knowledge acquisition bottleneck. We thus propose a memory enhancement of transformer models that makes use of unstructured domain knowledge expressed in plain natural language. An experimental evaluation conducted on two challenging NLP tasks demonstrates that our approach yields better performance and model interpretability than baseline transformer-based architectures.
An Argumentative Dialogue System for COVID-19 Vaccine Information
Fazzinga, Bettina, Galassi, Andrea, Torroni, Paolo
Dialogue systems are widely used in AI to support timely and interactive communication with users. We propose a general-purpose dialogue system architecture that leverages computational argumentation to perform reasoning and provide consistent and explainable answers. We illustrate the system using a COVID-19 vaccine information case study.
Multi-Task Attentive Residual Networks for Argument Mining
Galassi, Andrea, Lippi, Marco, Torroni, Paolo
We explore the use of residual networks and neural attention for argument mining and in particular link prediction. The method we propose makes no assumptions on document or argument structure. We propose a residual architecture that exploits attention, multi-task learning, and makes use of ensemble. We evaluate it on a challenging data set consisting of user-generated comments, as well as on two other datasets consisting of scientific publications. On the user-generated content dataset, our model outperforms state-of-the-art methods that rely on domain knowledge. On the scientific literature datasets it achieves results comparable to those yielded by BERT-based approaches but with a much smaller model size.