Goto

Collaborating Authors

 indianapolis


Classifying Histopathologic Glioblastoma Sub-regions with EfficientNet

Adap, Sanyukta, Baid, Ujjwal, Bakas, Spyridon

arXiv.org Artificial Intelligence

Glioblastoma (GBM) is the most common aggressive, fast-growing brain tumor, with a grim prognosis. Despite clinical diagnostic advancements, there have not been any substantial improvements to patient prognosis. Histopathological assessment of excised tumors is the first line of clinical diagnostic routine. We hypothesize that automated, robust, and accurate identification of distinct histological sub-regions within GBM could contribute to morphologically understanding this disease at scale. In this study, we designed a four-step deep learning approach to classify six (6) histopathological regions and quantitatively evaluated it on the BraTS-Path 2024 challenge dataset, which includes digitized Hematoxylin \& Eosin (H\&E) stained GBM tissue sections annotated for six distinct regions. We used the challenge's publicly available training dataset to develop and evaluate the effectiveness of several variants of EfficientNet architectures (i.e., B0, B1, B2, B3, B4). EfficientNet-B1 and EfficientNet-B4 achieved the best performance, achieving an F1 score of 0.98 in a 5-fold cross-validation configuration using the BraTS-Path training set. The quantitative performance evaluation of our proposed approach with EfficientNet-B1 on the BraTS-Path hold-out validation data and the final hidden testing data yielded F1 scores of 0.546 and 0.517, respectively, for the associated 6-class classification task. The difference in the performance on training, validation, and testing data highlights the challenge of developing models that generalize well to new data, which is crucial for clinical applications. The source code of the proposed approach can be found at the GitHub repository of Indiana University Division of Computational Pathology: https://github.com/IUCompPath/brats-path-2024-enet.


Indiana senator calls on WNBA, Fever to apologize to fans after accusations of racism: 'So demeaning'

FOX News

Republican Sen. Jim Banks explains why Indiana Fever fans deserve an apology after the league's latest investigation during an appearance on OutKick's'Don't @ Me with Dan Dakich.' U.S. Sen. Jim Banks, R-Ind., called on the WNBA and the Indiana Fever to apologize to Fever fans after the league's investigation failed to find evidence that corroborated allegations of racial comments directed at Angel Reese during a recent game. The league investigated the allegations involving the Chicago Sky star last month after a May 17 game hosted by the Fever. Chicago Sky forward Angel Reese (5) reacts to a flagrant foul from Indiana Fever guard Caitlin Clark (22) May 17, 2025, at Gainbridge Fieldhouse in Indianapolis. "Based on information gathered to date, including from relevant fans, team and arena staff, as well as audio and video review of the game, we have not substantiated [the report,]" the league said in a statement.


Interpretable Hierarchical Attention Network for Medical Condition Identification

Fang, Dongping, Duan, Lian, Yuan, Xiaojing, Klunder, Allyn, Tan, Kevin, Cao, Suiting, Ji, Yeqing, Xu, Mike

arXiv.org Artificial Intelligence

Accurate prediction of medical conditions with straight past clinical evidence is a long-sought topic in the medical management and health insurance field. Although great progress has been made with machine learning algorithms, the medical community is still skeptical about the model accuracy and interpretability. This paper presents an innovative hierarchical attention deep learning model to achieve better prediction and clear interpretability that can be easily understood by medical professionals. This paper developed an Interpretable Hierarchical Attention Network (IHAN). IHAN uses a hierarchical attention structure that matches naturally with the medical history data structure and reflects patients encounter (date of service) sequence. The model attention structure consists of 3 levels: (1) attention on the medical code types (diagnosis codes, procedure codes, lab test results, and prescription drugs), (2) attention on the sequential medical encounters within a type, (3) attention on the individual medical codes within an encounter and type. This model is applied to predict the occurrence of stage 3 chronic kidney disease (CKD), using three years medical history of Medicare Advantage (MA) members from an American nationwide health insurance company. The model takes members medical events, both claims and Electronic Medical Records (EMR) data, as input, makes a prediction of stage 3 CKD and calculates contribution from individual events to the predicted outcome.


GaNDLF-Synth: A Framework to Democratize Generative AI for (Bio)Medical Imaging

Pati, Sarthak, Mazurek, Szymon, Bakas, Spyridon

arXiv.org Artificial Intelligence

Generative Artificial Intelligence (GenAI) is a field of AI that creates new data samples from existing ones. It utilizing deep learning to overcome the scarcity and regulatory constraints of healthcare data by generating new data points that integrate seamlessly with original datasets. This paper explores the background and motivation for GenAI, and introduces the Generally Nuanced Deep Learning Framework for Synthesis (GaNDLF-Synth) to address a significant gap in the literature and move towards democratizing the implementation and assessment of image synthesis tasks in healthcare. GaNDLF-Synth describes a unified abstraction for various synthesis algorithms, including autoencoders, generative adversarial networks, and diffusion models. Leveraging the GANDLF-core framework, it supports diverse data modalities and distributed computing, ensuring scalability and reproducibility through extensive unit testing. The aim of GaNDLF-Synth is to lower the entry barrier for GenAI, and make it more accessible and extensible by the wider scientific community.


Brain Tumor Segmentation (BraTS) Challenge 2024: Meningioma Radiotherapy Planning Automated Segmentation

LaBella, Dominic, Schumacher, Katherine, Mix, Michael, Leu, Kevin, McBurney-Lin, Shan, Nedelec, Pierre, Villanueva-Meyer, Javier, Shapey, Jonathan, Vercauteren, Tom, Chia, Kazumi, Al-Salihi, Omar, Leu, Justin, Halasz, Lia, Velichko, Yury, Wang, Chunhao, Kirkpatrick, John, Floyd, Scott, Reitman, Zachary J., Mullikin, Trey, Bagci, Ulas, Sachdev, Sean, Hattangadi-Gluth, Jona A., Seibert, Tyler, Farid, Nikdokht, Puett, Connor, Pease, Matthew W., Shiue, Kevin, Anwar, Syed Muhammad, Faghani, Shahriar, Haider, Muhammad Ammar, Warman, Pranav, Albrecht, Jake, Jakab, András, Moassefi, Mana, Chung, Verena, Aristizabal, Alejandro, Karargyris, Alexandros, Kassem, Hasan, Pati, Sarthak, Sheller, Micah, Huang, Christina, Coley, Aaron, Ghanta, Siddharth, Schneider, Alex, Sharp, Conrad, Saluja, Rachit, Kofler, Florian, Lohmann, Philipp, Vollmuth, Phillipp, Gagnon, Louis, Adewole, Maruf, Li, Hongwei Bran, Kazerooni, Anahita Fathi, Tahon, Nourel Hoda, Anazodo, Udunna, Moawad, Ahmed W., Menze, Bjoern, Linguraru, Marius George, Aboian, Mariam, Wiestler, Benedikt, Baid, Ujjwal, Conte, Gian-Marco, Rauschecker, Andreas M. T., Nada, Ayman, Abayazeed, Aly H., Huang, Raymond, de Verdier, Maria Correia, Rudie, Jeffrey D., Bakas, Spyridon, Calabrese, Evan

arXiv.org Artificial Intelligence

The 2024 Brain Tumor Segmentation Meningioma Radiotherapy (BraTS-MEN-RT) challenge aims to advance automated segmentation algorithms using the largest known multi-institutional dataset of radiotherapy planning brain MRIs with expert-annotated target labels for patients with intact or post-operative meningioma that underwent either conventional external beam radiotherapy or stereotactic radiosurgery. Each case includes a defaced 3D post-contrast T1-weighted radiotherapy planning MRI in its native acquisition space, accompanied by a single-label "target volume" representing the gross tumor volume (GTV) and any at-risk post-operative site. Target volume annotations adhere to established radiotherapy planning protocols, ensuring consistency across cases and institutions. For pre-operative meningiomas, the target volume encompasses the entire GTV and associated nodular dural tail, while for post-operative cases, it includes at-risk resection cavity margins as determined by the treating institution. Case annotations were reviewed and approved by expert neuroradiologists and radiation oncologists. Participating teams will develop, containerize, and evaluate automated segmentation models using this comprehensive dataset. Model performance will be assessed using the lesion-wise Dice Similarity Coefficient and the 95% Hausdorff distance. The top-performing teams will be recognized at the Medical Image Computing and Computer Assisted Intervention Conference in October 2024. BraTS-MEN-RT is expected to significantly advance automated radiotherapy planning by enabling precise tumor segmentation and facilitating tailored treatment, ultimately improving patient outcomes.


Pipeline and Dataset Generation for Automated Fact-checking in Almost Any Language

Drchal, Jan, Ullrich, Herbert, Mlynář, Tomáš, Moravec, Václav

arXiv.org Artificial Intelligence

This article presents a pipeline for automated fact-checking leveraging publicly available Language Models and data. The objective is to assess the accuracy of textual claims using evidence from a ground-truth evidence corpus. The pipeline consists of two main modules -- the evidence retrieval and the claim veracity evaluation. Our primary focus is on the ease of deployment in various languages that remain unexplored in the field of automated fact-checking. Unlike most similar pipelines, which work with evidence sentences, our pipeline processes data on a paragraph level, simplifying the overall architecture and data requirements. Given the high cost of annotating language-specific fact-checking training data, our solution builds on the Question Answering for Claim Generation (QACG) method, which we adapt and use to generate the data for all models of the pipeline. Our strategy enables the introduction of new languages through machine translation of only two fixed datasets of moderate size. Subsequently, any number of training samples can be generated based on an evidence corpus in the target language. We provide open access to all data and fine-tuned models for Czech, English, Polish, and Slovak pipelines, as well as to our codebase that may be used to reproduce the results.We comprehensively evaluate the pipelines for all four languages, including human annotations and per-sample difficulty assessment using Pointwise V-information. The presented experiments are based on full Wikipedia snapshots to promote reproducibility. To facilitate implementation and user interaction, we develop the FactSearch application featuring the proposed pipeline and the preliminary feedback on its performance.


C.J. Stroud dismisses low cognitive test results: 'I'm not a test taker, I play football'

FOX News

Fox News Flash top sports headlines are here. Check out what's clicking on Foxnews.com. Former Ohio State standout C.J. Stroud is widely regarded as one the top prospects heading into this week's NFL Draft. Teams throughout the league used to routinely use the Wonderlic Test to assess the competence in decision-making of NFL prospects. That test was recently replaced with a new test called the S2 Cognition Test.


Data Architect, Internal IT (Remote in US) at Resultant - Indianapolis, IN, United States

#artificialintelligence

Resultant is a modern consulting firm with a radically different approach to solving problems. We don't solve problems for our clients. We solve problems with them. Through outcomes driven by data analytics, technology solutions, digital transformation, and beyond, our team works with clients in both the public and private sectors to solve their most complex challenges. We start by learning as much as we can about who they are, how they work, and what they're striving for so we can feel their problems as our own.


SceNDD: A Scenario-based Naturalistic Driving Dataset

Prabu, Avinash, Ranjan, Nitya, Li, Lingxi, Tian, Renran, Chien, Stanley, Chen, Yaobin, Sherony, Rini

arXiv.org Artificial Intelligence

In this paper, we propose SceNDD: a scenario-based naturalistic driving dataset that is built upon data collected from an instrumented vehicle in downtown Indianapolis. The data collection was completed in 68 driving sessions with different drivers, where each session lasted about 20--40 minutes. The main goal of creating this dataset is to provide the research community with real driving scenarios that have diverse trajectories and driving behaviors. The dataset contains ego-vehicle's waypoints, velocity, yaw angle, as well as non-ego actor's waypoints, velocity, yaw angle, entry-time, and exit-time. Certain flexibility is provided to users so that actors, sensors, lanes, roads, and obstacles can be added to the existing scenarios. We used a Joint Probabilistic Data Association (JPDA) tracker to detect non-ego vehicles on the road. We present some preliminary results of the proposed dataset and a few applications associated with it. The complete dataset is expected to be released by early 2023.


Having rich childhood friends is linked to a higher salary as an adult

New Scientist

Children who grow up in low-income households but who make friends that come from higher-income homes are more likely to have higher salaries in adulthood than those who have fewer such friends. "There's been a lot of speculation… that the individuals' access to social capital, their social networks and the community they live in might matter a lot for a child's chance to rise out of poverty," says Raj Chetty at Harvard University. To find out how if that holds up, he and his colleagues analysed anonymised Facebook data belonging to 72.2 million people in the US between the ages of 25 and 44, accounting for 84 per cent of the age group's US population. It is relatively nationally representative of that age group, he says. The team used a machine learning algorithm to determine each person's socioeconomic status (SES), combining data such as the median income of people who live in the same region, the person's age, sex and the value of their phone model as a proxy for individual income.