AITopics | Aarau

Collaborating Authors

Aarau

Pathologist-like explainable AI for interpretable Gleason grading in prostate cancer

Mittmann, Gesa, Laiouar-Pedari, Sara, Mehrtens, Hendrik A., Haggenmüller, Sarah, Bucher, Tabea-Clara, Chanda, Tirtha, Gaisa, Nadine T., Wagner, Mathias, Klamminger, Gilbert Georg, Rau, Tilman T., Neppl, Christina, Compérat, Eva Maria, Gocht, Andreas, Hämmerle, Monika, Rupp, Niels J., Westhoff, Jula, Krücken, Irene, Seidl, Maximillian, Schürch, Christian M., Bauer, Marcus, Solass, Wiebke, Tam, Yu Chun, Weber, Florian, Grobholz, Rainer, Augustyniak, Jaroslaw, Kalinski, Thomas, Hörner, Christian, Mertz, Kirsten D., Döring, Constanze, Erbersdobler, Andreas, Deubler, Gabriele, Bremmer, Felix, Sommer, Ulrich, Brodhun, Michael, Griffin, Jon, Lenon, Maria Sarah L., Trpkov, Kiril, Cheng, Liang, Chen, Fei, Levi, Angelique, Cai, Guoping, Nguyen, Tri Q., Amin, Ali, Cimadamore, Alessia, Shabaik, Ahmed, Manucha, Varsha, Ahmad, Nazeel, Messias, Nidia, Sanguedolce, Francesca, Taheri, Diana, Baraban, Ezra, Jia, Liwei, Shah, Rajal B., Siadat, Farshid, Swarbrick, Nicole, Park, Kyung, Hassan, Oudai, Sakhaie, Siamak, Downes, Michelle R., Miyamoto, Hiroshi, Williamson, Sean R., Holland-Letz, Tim, Schneider, Carolin V., Kather, Jakob Nikolas, Tolkach, Yuri, Brinker, Titus J.

arXiv.org Artificial IntelligenceOct-19-2024

The aggressiveness of prostate cancer, the most common cancer in men worldwide, is primarily assessed based on histopathological data using the Gleason scoring system. While artificial intelligence (AI) has shown promise in accurately predicting Gleason scores, these predictions often lack inherent explainability, potentially leading to distrust in human-machine interactions. To address this issue, we introduce a novel dataset of 1,015 tissue microarray core images, annotated by an international group of 54 pathologists. The annotations provide detailed localized pattern descriptions for Gleason grading in line with international guidelines. Utilizing this dataset, we develop an inherently explainable AI system based on a U-Net architecture that provides predictions leveraging pathologists' terminology. This approach circumvents post-hoc explainability methods while maintaining or exceeding the performance of methods trained directly for Gleason pattern segmentation (Dice score: 0.713 $\pm$ 0.003 trained on explanations vs. 0.691 $\pm$ 0.010 trained on Gleason patterns). By employing soft labels during training, we capture the intrinsic uncertainty in the data, yielding strong results in Gleason pattern segmentation even in the context of high interobserver variability. With the release of this dataset, we aim to encourage further research into segmentation in medical tasks with high levels of subjectivity and to advance the understanding of pathologists' reasoning processes.

explanation, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2410.15012

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > Washington > King County > Seattle (0.14)
(46 more...)

Genre: Research Report > Experimental Study (0.68)

Industry:

Health & Medicine > Therapeutic Area > Urology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Prostate Cancer (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.89)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.70)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Generative AI Voting: Fair Collective Choice is Resilient to LLM Biases and Inconsistencies

Majumdar, Srijoni, Elkind, Edith, Pournaras, Evangelos

arXiv.org Artificial IntelligenceMay-30-2024

Scaling up deliberative and voting participation is a longstanding endeavor -- a cornerstone for direct democracy and legitimate collective choice. Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) provide unprecedented opportunities, but also alerting risks for digital democracy. AI personal assistants can overcome cognitive bandwidth limitations of humans, providing decision support capabilities or even direct AI representation of human voters at large scale. However, the quality of this representation and what underlying biases manifest when delegating collective decision making to LLMs is an alarming and timely challenge to tackle. By rigorously emulating with high realism more than >50K LLM voting personas in 81 real-world voting elections, we show that different LLMs (GPT 3, GPT 3.5, and Llama2) come with biases and significant inconsistencies in complex preferential ballot formats, compared to simpler and more consistent majoritarian elections. Strikingly, fair voting aggregation methods, such as equal shares, prove to be a win-win: fairer voting outcomes for humans with fairer AI representation. This novel underlying relationship proves paramount for democratic resilience in progressives scenarios with low voters turnout and voter fatigue supported by AI representatives: abstained voters are mitigated by recovering highly representative voting outcomes that are fairer. These insights provide remarkable foundations for science, policymakers and citizens in explaining and mitigating AI risks in democratic innovations.

consistency, language model, voter, (13 more...)

arXiv.org Artificial Intelligence

2406.11871

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > Estonia (0.14)
Europe > Switzerland > Aargau > Aarau (0.05)
(10 more...)

Genre:

Research Report > New Finding (0.69)
Research Report > Experimental Study (0.47)

Industry:

Government > Voting & Elections (1.00)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.85)

Add feedback

NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional Stimuli

Wang, Xu, Li, Cheng, Chang, Yi, Wang, Jindong, Wu, Yuan

arXiv.org Artificial IntelligenceMay-12-2024

Large Language Models (LLMs) have become integral to a wide spectrum of applications, ranging from traditional computing tasks to advanced artificial intelligence (AI) applications. This widespread adoption has spurred extensive research into LLMs across various disciplines, including the social sciences. Notably, studies have revealed that LLMs possess emotional intelligence, which can be further developed through positive emotional stimuli. This discovery raises an intriguing question: can negative emotions similarly influence LLMs, potentially enhancing their performance? In response to this question, we introduce NegativePrompt, a novel approach underpinned by psychological principles, involving ten specifically designed negative emotional stimuli. We embark on rigorous experimental evaluations of five LLMs including Flan-T5-Large, Vicuna, Llama 2, ChatGPT, and GPT-4, across a set of 45 tasks. The results are revealing: NegativePrompt markedly enhances the performance of LLMs, evidenced by relative improvements of 12.89% in Instruction Induction tasks and 46.25% in BIG-Bench tasks. Moreover, we conduct attention visualization experiments to decipher the underlying mechanisms of NegativePrompt's influence. Our research contributes significantly to the understanding of LLMs and emotion interaction, demonstrating the practical efficacy of NegativePrompt as an emotion-driven method and offering novel insights for the enhancement of LLMs in real-world applications. The code is available at https://github.com/wangxu0820/NegativePrompt.

emotional stimuli, negativeprompt, stimuli, (13 more...)

arXiv.org Artificial Intelligence

2405.02814

Country:

North America > United States (0.28)
Europe > Switzerland > Zürich > Zürich (0.04)
Asia > Japan (0.04)
Europe > Switzerland > Aargau > Aarau (0.04)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)
Health & Medicine > Consumer Health (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Fair Voting Outcomes with Impact and Novelty Compromises? Unraveling Biases of Equal Shares in Participatory Budgeting

Maharjan, Sajan, Majumdar, Srijoni, Pournaras, Evangelos

arXiv.org Artificial IntelligenceMay-9-2024

Participatory budgeting, as a paradigm for democratic innovations, engages citizens in the distribution of a public budget to projects, which they propose and vote for implementation. So far, voting algorithms have been devised and studied in social choice literature to elect projects that are popular, while others prioritize on a proportional representation of voters' preferences, for instance, equal shares. However, the anticipated impact and novelty in the broader society by the winning projects, as selected by different algorithms, remains totally under-explored, lacking both a universal theory of impact for voting and a rigorous framework for impact and novelty assessments. This papers tackles this grand challenge towards new axiomatic foundations for designing effective and fair voting methods. This is via new and striking insights derived from a large-scale analysis of biases over 345 real-world voting outcomes, characterized for the first time by a novel portfolio of impact and novelty metrics. We find strong causal evidence that equal shares comes with impact loss in several infrastructural projects of different cost levels that have been so far over-represented. However, it also comes with a novel, yet over-represented, impact gain in welfare, education and culture. We discuss broader implications of these results and how impact loss can be mitigated at the stage of campaign design and project ideation.

equal share, impact area, impact loss, (16 more...)

arXiv.org Artificial Intelligence

2405.05085

Country:

Europe > Switzerland > Aargau > Aarau (0.05)
Europe > Poland (0.05)
Europe > United Kingdom > England > West Yorkshire > Leeds (0.04)
(7 more...)

Genre: Research Report (1.00)

Industry: Government > Voting & Elections (0.34)

Technology: Information Technology > Artificial Intelligence (0.48)

Add feedback

Classification of Prostate Cancer in 3D Magnetic Resonance Imaging Data based on Convolutional Neural Networks

Rippa, Malte, Schulze, Ruben, Himstedt, Marian, Burn, Felice

arXiv.org Artificial IntelligenceApr-16-2024

Prostate cancer is a commonly diagnosed cancerous disease among men world-wide. Even with modern technology such as multi-parametric magnetic resonance tomography and guided biopsies, the process for diagnosing prostate cancer remains time consuming and requires highly trained professionals. In this paper, different convolutional neural networks (CNN) are evaluated on their abilities to reliably classify whether an MRI sequence contains malignant lesions. Implementations of a ResNet, a ConvNet and a ConvNeXt for 3D image data are trained and evaluated. The models are trained using different data augmentation techniques, learning rates, and optimizers. The data is taken from a private dataset, provided by Cantonal Hospital Aarau. The best result was achieved by a ResNet3D, yielding an average precision score of 0.4583 and AUC ROC score of 0.6214.

auc roc, classification, prostate cancer, (13 more...)

arXiv.org Artificial Intelligence

2404.10548

Country:

Europe > Switzerland > Aargau > Aarau (0.26)
Europe > Switzerland > Basel-City > Basel (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Urology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Prostate Cancer (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)

Add feedback

Causal thinking for decision making on Electronic Health Records: why and how

Doutreligne, Matthieu, Struja, Tristan, Abecassis, Judith, Morgand, Claire, Celi, Leo Anthony, Varoquaux, Gaël

arXiv.org Machine LearningSep-7-2023

Accurate predictions, as with machine learning, may not suffice to provide optimal healthcare for every patient. Indeed, prediction can be driven by shortcuts in the data, such as racial biases. Causal thinking is needed for data-driven decisions. Here, we give an introduction to the key elements, focusing on routinely-collected data, electronic health records (EHRs) and claims data. Using such data to assess the value of an intervention requires care: temporal dependencies and existing practices easily confound the causal effect. We present a step-by-step framework to help build valid decision making from real-life patient records by emulating a randomized trial before individualizing decisions, eg with machine learning. Our framework highlights the most important pitfalls and considerations in analysing EHRs or claims data to draw causal conclusions. We illustrate the various choices in studying the effect of albumin on sepsis mortality in the Medical Information Mart for Intensive Care database (MIMIC-IV). We study the impact of various choices at every step, from feature extraction to causal-estimator selection. In a tutorial spirit, the code and the data are openly available.

artificial intelligence, data mining, machine learning, (18 more...)

arXiv.org Machine Learning

2308.01605

Country:

North America > Greenland (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > Switzerland > Aargau > Aarau (0.04)
(5 more...)

Genre:

Research Report > Strength High (1.00)
Research Report > New Finding (1.00)
Research Report > Experimental Study > Negative Result (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Health Care Technology > Medical Record (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation

Alam, Md Mahfuz Ibn, Ahmadi, Sina, Anastasopoulos, Antonios

arXiv.org Artificial IntelligenceMay-26-2023

Neural machine translation (NMT) systems exhibit limited robustness in handling source-side linguistic variations. Their performance tends to degrade when faced with even slight deviations in language usage, such as different domains or variations introduced by second-language speakers. It is intuitive to extend this observation to encompass dialectal variations as well, but the work allowing the community to evaluate MT systems on this dimension is limited. To alleviate this issue, we compile and release \dataset, a contrastive dialectal benchmark encompassing 882 different variations from nine different languages. We also quantitatively demonstrate the challenges large MT models face in effectively translating dialectal variants. We are releasing all code and data.

artificial intelligence, machine translation, natural language, (18 more...)

arXiv.org Artificial Intelligence

2305.17267

Country:

Europe > Germany (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Italy > Veneto (0.04)
(67 more...)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

2nd Swiss German Speech to Standard German Text Shared Task at SwissText 2022

Plüss, Michel, Schraner, Yanick, Scheller, Christian, Vogel, Manfred

arXiv.org Artificial IntelligenceJan-17-2023

We present the results and findings of the 2nd Swiss German speech to Standard German text shared task at SwissText 2022. Participants were asked to build a sentence-level Swiss German speech to Standard German text system specialized on the Grisons dialect. The objective was to maximize the BLEU score on a test set of Grisons speech. 3 teams participated, with the best-performing system achieving a BLEU score of 70.1.

artificial intelligence, natural language, speech, (11 more...)

arXiv.org Artificial Intelligence

2301.0679

Country:

Europe > Switzerland > Zürich > Zürich (0.05)
Europe > Switzerland > Aargau > Aarau (0.05)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.70)

Add feedback

Tackling Low-Resourced Sign Language Translation: UPC at WMT-SLT 22

Tarrés, Laia, Gàllego, Gerard I., Giró-i-Nieto, Xavier, Torres, Jordi

arXiv.org Artificial IntelligenceDec-2-2022

This paper describes the system developed at the Universitat Polit\`ecnica de Catalunya for the Workshop on Machine Translation 2022 Sign Language Translation Task, in particular, for the sign-to-text direction. We use a Transformer model implemented with the Fairseq modeling toolkit. We have experimented with the vocabulary size, data augmentation techniques and pretraining the model with the PHOENIX-14T dataset. Our system obtains 0.50 BLEU score for the test set, improving the organizers' baseline by 0.38 BLEU. We remark the poor results for both the baseline and our system, and thus, the unreliability of our findings.

artificial intelligence, machine translation, natural language, (16 more...)

arXiv.org Artificial Intelligence

2212.0114

Country:

Europe > Spain (0.14)
Europe > Switzerland > Aargau > Aarau (0.04)
North America > United States > Pennsylvania (0.04)
(5 more...)

Genre: Research Report > New Finding (0.34)

Industry: Education > Curriculum > Subject-Specific Education (0.66)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Satellite-derived solar radiation for intra-hour and intra-day applications: Biases and uncertainties by season and altitude

Carpentieri, Alberto, Folini, Doris, Wild, Martin, Vuilleumier, Laurent, Meyer, Angela

arXiv.org Artificial IntelligenceOct-5-2022

Accurate estimates of the surface solar radiation (SSR) are a prerequisite for intra-day forecasts of solar resources and photovoltaic power generation. Intra-day SSR forecasts are of interest to power traders and to operators of solar plants and power grids who seek to optimize their revenues and maintain the grid stability by matching power supply and demand. Our study analyzes systematic biases and the uncertainty of SSR estimates derived from Meteosat with the SARAH-2 and HelioMont algorithms at intra-hour and intra-day time scales. The satellite SSR estimates are analyzed based on 136 ground stations across altitudes from 200 m to 3570 m Switzerland in 2018. We find major biases and uncertainties in the instantaneous, hourly and daily-mean SSR. In peak daytime periods, the instantaneous satellite SSR deviates from the ground-measured SSR by a mean absolute deviation (MAD) of 110.4 and 99.6 W/m2 for SARAH-2 and HelioMont, respectively. For the daytime SSR, the instantaneous, hourly and daily-mean MADs amount to 91.7, 81.1, 50.8 and 82.5, 66.7, 42.9 W/m2 for SARAH-2 and HelioMont, respectively. Further, the SARAH-2 instantaneous SSR drastically underestimates the solar resources at altitudes above 1000 m in the winter half year. A possible explanation in line with the seasonality of the bias is that snow cover may be misinterpreted as clouds at higher altitudes.

artificial intelligence, ground station, ssr, (16 more...)

arXiv.org Artificial Intelligence

2212.11745

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Norway (0.14)
Europe > Sweden (0.14)
(16 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Power Industry (1.00)

Technology: Information Technology > Artificial Intelligence (0.46)

Add feedback