Goto

Collaborating Authors

 microbiology


AI-driven Generation of MALDI-TOF MS for Microbial Characterization

Schmidt-Santiago, Lucía, Rodríguez-Temporal, David, Sevilla-Salcedo, Carlos, Gómez-Verdejo, Vanessa

arXiv.org Artificial Intelligence

Matrix-Assisted Laser Desorption/Ionization Time-of-Flight Mass Spectrometry (MALDI-TOF MS) has become a cornerstone technology in clinical microbiology, enabling rapid and accurate microbial identification. However, the development of data-driven diagnostic models remains limited by the lack of sufficiently large, balanced, and standardized spectral datasets. This study investigates the use of deep generative models to synthesize realistic MALDI-TOF MS spectra, aiming to overcome data scarcity and support the development of robust machine learning tools in microbiology. We adapt and evaluate three generative models, Variational Autoencoders (MALDIVAEs), Generative Adversarial Networks (MALDIGANs), and Denoising Diffusion Probabilistic Model (MALDIffusion), for the conditional generation of microbial spectra guided by species labels. Generation is conditioned on species labels, and spectral fidelity and diversity are assessed using diverse metrics. Our experiments show that synthetic data generated by MALDIVAE, MALDIGAN, and MALDIffusion are statistically and diagnostically comparable to real measurements, enabling classifiers trained exclusively on synthetic samples to reach performance levels similar to those trained on real data. While all models faithfully reproduce the peak structure and variability of MALDI-TOF spectra, MALDIffusion obtains this fidelity at a substantially higher computational cost, and MALDIGAN shows competitive but slightly less stable behaviour. In contrast, MALDIVAE offers the most favorable balance between realism, stability, and efficiency. Furthermore, augmenting minority species with synthetic spectra markedly improves classification accuracy, effectively mitigating class imbalance and domain mismatch without compromising the authenticity of the generated data.


Predicting Microbial Interactions Using Graph Neural Networks

Gholamzadeh, Elham, Singla, Kajal, Scherf, Nico

arXiv.org Artificial Intelligence

Predicting interspecies interactions is a key challenge in microbial ecology, as these interactions are critical to determining the structure and activity of microbial communities. In this work, we used data on monoculture growth capabilities, interactions with other species, and phylogeny to predict a negative or positive effect of interactions. More precisely, we used one of the largest available pairwise interaction datasets to train our models, comprising over 7,500 interactions be- tween 20 species from two taxonomic groups co-cultured under 40 distinct carbon conditions, with a primary focus on the work of Nestor et al.[28 ]. In this work, we propose Graph Neural Networks (GNNs) as a powerful classifier to predict the direction of the effect. We construct edge-graphs of pairwise microbial interactions in order to leverage shared information across individual co-culture experiments, and use GNNs to predict modes of interaction. Our model can not only predict binary interactions (positive/negative) but also classify more complex interaction types such as mutualism, competition, and parasitism. Our initial results were encouraging, achieving an F1-score of 80.44%. This significantly outperforms comparable methods in the literature, including conventional Extreme Gradient Boosting (XGBoost) models, which reported an F1-score of 72.76%.


Friend or Foe

Cherendichenko, Oleksandr, Solowiej-Wedderburn, Josephine, Carroll, Laura M., Libby, Eric

arXiv.org Artificial Intelligence

A fundamental challenge in microbial ecology is determining whether bacteria compete or cooperate in different environmental conditions. With recent advances in genome-scale metabolic models, we are now capable of simulating interactions between thousands of pairs of bacteria in thousands of different environmental settings at a scale infeasible experimentally. These approaches can generate tremendous amounts of data that can be exploited by state-of-the-art machine learning algorithms to uncover the mechanisms driving interactions. Here, we present Friend or Foe, a compendium of 64 tabular environmental datasets, consisting of more than 26M shared environments for more than 10K pairs of bacteria sampled from two of the largest collections of metabolic models. The Friend or Foe datasets are curated for a wide range of machine learning tasks -- supervised, unsupervised, and generative -- to address specific questions underlying bacterial interactions. We benchmarked a selection of the most recent models for each of these tasks and our results indicate that machine learning can be successful in this application to microbial ecology. Going beyond, analyses of the Friend or Foe compendium can shed light on the predictability of bacterial interactions and highlight novel research directions into how bacteria infer and navigate their relationships.


AI-Driven Rapid Identification of Bacterial and Fungal Pathogens in Blood Smears of Septic Patients

Sroka-Oleksiak, Agnieszka, Pardyl, Adam, Rymarczyk, Dawid, Olechowska-Jarząb, Aldona, Biegun-Drożdż, Katarzyna, Ochońska, Dorota, Wronka, Michał, Borowa, Adriana, Gosiewski, Tomasz, Adamczyk, Miłosz, Telega, Henryk, Zieliński, Bartosz, Brzychczy-Włoch, Monika

arXiv.org Artificial Intelligence

Sepsis is a life-threatening condition which requires rapid diagnosis and treatment. Traditional microbiological methods are time-consuming and expensive. In response to these challenges, deep learning algorithms were developed to identify 14 bacteria species and 3 yeast-like fungi from microscopic images of Gram-stained smears of positive blood samples from sepsis patients. A total of 16,637 Gram-stained microscopic images were used in the study. The analysis used the Cellpose 3 model for segmentation and Attention-based Deep Multiple Instance Learning for classification. Our model achieved an accuracy of 77.15% for bacteria and 71.39% for fungi, with ROC AUC of 0.97 and 0.88, respectively. The highest values, reaching up to 96.2%, were obtained for Cutibacterium acnes, Enterococcus faecium, Stenotrophomonas maltophilia and Nakaseomyces glabratus. Classification difficulties were observed in closely related species, such as Staphylococcus hominis and Staphylococcus haemolyticus, due to morphological similarity, and within Candida albicans due to high morphotic diversity. The study confirms the potential of our model for microbial classification, but it also indicates the need for further optimisation and expansion of the training data set. In the future, this technology could support microbial diagnosis, reducing diagnostic time and improving the effectiveness of sepsis treatment due to its simplicity and accessibility. Part of the results presented in this publication was covered by a patent application at the European Patent Office EP24461637.1 "A computer implemented method for identifying a microorganism in a blood and a data processing system therefor".


Virtual Gram staining of label-free bacteria using darkfield microscopy and deep learning

Isil, Cagatay, Koydemir, Hatice Ceylan, Eryilmaz, Merve, de Haan, Kevin, Pillar, Nir, Mentesoglu, Koray, Unal, Aras Firat, Rivenson, Yair, Chandrasekaran, Sukantha, Garner, Omai B., Ozcan, Aydogan

arXiv.org Artificial Intelligence

Gram staining has been one of the most frequently used staining protocols in microbiology for over a century, utilized across various fields, including diagnostics, food safety, and environmental monitoring. Its manual procedures make it vulnerable to staining errors and artifacts due to, e.g., operator inexperience and chemical variations. Here, we introduce virtual Gram staining of label-free bacteria using a trained deep neural network that digitally transforms darkfield images of unstained bacteria into their Gram-stained equivalents matching brightfield image contrast. After a one-time training effort, the virtual Gram staining model processes an axial stack of darkfield microscopy images of label-free bacteria (never seen before) to rapidly generate Gram staining, bypassing several chemical steps involved in the conventional staining process. We demonstrated the success of the virtual Gram staining workflow on label-free bacteria samples containing Escherichia coli and Listeria innocua by quantifying the staining accuracy of the virtual Gram staining model and comparing the chromatic and morphological features of the virtually stained bacteria against their chemically stained counterparts. This virtual bacteria staining framework effectively bypasses the traditional Gram staining protocol and its challenges, including stain standardization, operator errors, and sensitivity to chemical variations.


Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity

Li, Zhufeng, Cranganore, Sandeep S, Youngblut, Nicholas, Kilbertus, Niki

arXiv.org Artificial Intelligence

Leveraging the vast genetic diversity within microbiomes offers unparalleled insights into complex phenotypes, yet the task of accurately predicting and understanding such traits from genomic data remains challenging. We propose a framework taking advantage of existing large models for gene vectorization to predict habitat specificity from entire microbial genome sequences. Based on our model, we develop attribution techniques to elucidate gene interaction effects that drive microbial adaptation to diverse environments. We train and validate our approach on a large dataset of high quality microbiome genomes from different habitats. We not only demonstrate solid predictive performance, but also how sequence-level information of entire genomes allows us to identify gene associations underlying complex phenotypes. Our attribution recovers known important interaction networks and proposes new candidates for experimental follow up.


Performing Video Frame Prediction of Microbial Growth with a Recurrent Neural Network

Robertson, Connor, Wilmoth, Jared L., Retterer, Scott, Fuentes-Cabrera, Miguel

arXiv.org Artificial Intelligence

A Recurrent Neural Network (RNN) was used to perform video frame prediction of microbial growth for a population of two mutants of Pseudomonas aeruginosa. The RNN was trained on videos of 20 frames that were acquired using fluorescence microscopy and microfluidics. The network predicted the last 10 frames of each video, and the accuracy's of the predictions was assessed by comparing raw images, population curves, and the number and size of individual colonies. Overall, we found the predictions to be accurate using this approach. The implications this result has on designing autonomous experiments in microbiology, and the steps that can be taken to make the predictions even more accurate, are discussed.


Artificial intelligence can help predict the bacteria responsible for pneumonia in emergency rooms

#artificialintelligence

A team of researchers showed that artificial intelligence (AI) could help predict the type of bacteria that caused the infection in patients with pneumonia. The research is presented at ASM Microbe Online, the annual meeting of the American Society for Microbiology. "This research highlights the potential of AI as a supplementary tool for physicians in identifying causal pathogens of pneumonia, even before sputum culture results are available," said Joowhan Sung, M.D., hospitalist at MedStar Southern Maryland Hospital. "We demonstrated that physicians could be assisted by AI to decide appropriate antibiotics." In the study, investigators showed that AI could use the information available in the emergency room and predict if the patient has MRSA or pseudomonas so that physicians can immediately prescribe specific antibiotics targeting specific bacteria.


Virus test results in minutes? Scientists question accuracy

The Japan Times

MADRID – Some political leaders are hailing a potential breakthrough in the fight against COVID-19: simple pin-prick blood tests or nasal swabs that can determine within minutes if someone has, or previously had, the virus. The tests could reveal the true extent of the outbreak and help separate the healthy from the sick. But some scientists have challenged their accuracy. Hopes are hanging on two types of quick tests: antigen tests that use a nose or throat swab to look for the virus, and antibody tests that look in the blood for evidence someone had the virus and recovered. The tests are in short supply, and some of them are considered unreliable.


From Microbiology to Machine Learning with Springboard

#artificialintelligence

Microbiology and MBA grad JK started to learn about big data and machine learning in his job, but wanted to learn more about data science in a structured environment. He enrolled in Springboard's Machine Learning Career Track to learn about ML and AI online. JK tells us how he balanced his full-time job with the Springboard bootcamp (hint: he didn't sleep much), and how networking at conferences helped him land his new job as a Data Engineer at KPMG! What is your educational and career background? I didn't come from a computer science (CS) background. My undergrad was in microbiology, immunology and molecular genetics. I then completed an MBA with a concentration in Accounting and Finance, working at the Australian Chamber of Commerce in Korea. And that's where I got a taste of some CS database work.