Goto

Collaborating Authors

 philadelphia


5 new quarters commemorate 250 years of American independence

Popular Science

The new designs honor the Constitution, Civil War, and more. Breakthroughs, discoveries, and DIY tips sent every weekday. While we've said goodbye to both the year 2025 and the penny, five new United States quarters will be finding their way into your pocket soon enough. The designs of each new quarter will honor the country's 250th anniversary (aka its semiquincentennial). According to a press release from the U.S. Mint, the coins "commemorate 250 years of American Liberty by reflecting our country's founding principles and honoring our Nation's history."


PRIMRose: Insights into the Per-Residue Energy Metrics of Proteins with Double InDel Mutations using Deep Learning

Brown, Stella, Preisig, Nicolas, Davis, Autumn, Hutchinson, Brian, Jagodzinski, Filip

arXiv.org Artificial Intelligence

Understanding how protein mutations affect protein structure is essential for advancements in computational biology and bioinformatics. We introduce PRIMRose, a novel approach that predicts energy values for each residue given a mutated protein sequence. Unlike previous models that assess global energy shifts, our method analyzes the localized energetic impact of double amino acid insertions or deletions (InDels) at the individual residue level, enabling residue-specific insights into structural and functional disruption. We implement a Convolutional Neural Network architecture to predict the energy changes of each residue in a protein mutation. We train our model on datasets constructed from nine proteins, grouped into three categories: one set with exhaustive double InDel mutations, another with approximately 145k randomly sampled double InDel mutations, and a third with approximately 80k randomly sampled double InDel mutations. Our model achieves high predictive accuracy across a range of energy metrics as calculated by the Rosetta molecular modeling suite and reveals localized patterns that influence model performance, such as solvent accessibility and secondary structure context. This per-residue analysis offers new insights into the mutational tolerance of specific regions within proteins and provides higher interpretable and biologically meaningful predictions of InDels' effects.


Is "Six Seven" Really Brain Rot?

The New Yorker

Is "Six Seven" Really Brain Rot? The viral phrase is easy to dismiss, but its ubiquity suggests something crucial about human nature. Recently, my wife was texting with a friend who lives in Singapore. The news from the other side of the world turned out to be that kids there had discovered "six seven." On Halloween, our friend reported, a boy with a handmade "six seven" jersey had earned applause as he made his way through her neighborhood--a place that's a long way from Sixty-seventh Street in Philadelphia, which the rapper Skrilla may have been referencing in his song "Doot Doot (6 7)," which came out last December.


Philly's 'transit vigilante' created a real-time bus tracker for his neighbors

Popular Science

Philly's'transit vigilante' created a real-time bus tracker for his neighbors With a sports timer and some clever coding, Max Goldberg built a DIY display that tells South Philly commuters exactly when their next bus will arrive. Breakthroughs, discoveries, and DIY tips sent every weekday. Philadelphia's mass transit system has had a rough go of it lately. The Pennsylvania city's main public transit provider, SEPTA, has been dealing with massive service cuts, including the elimination of entire bus routes. But South Philly resident Max Goldberg is undeterred.


Enabling Few-Shot Alzheimer's Disease Diagnosis on Biomarker Data with Tabular LLMs

Kearney, Sophie, Yang, Shu, Wen, Zixuan, Hou, Bojian, Duong-Tran, Duy, Chen, Tianlong, Moore, Jason, Ritchie, Marylyn, Shen, Li

arXiv.org Artificial Intelligence

Early and accurate diagnosis of Alzheimer's disease (AD), a complex neurodegenerative disorder, requires analysis of heterogeneous biomarkers (e.g., neuroimaging, genetic risk factors, cognitive tests, and cerebrospinal fluid proteins) typically represented in a tabular format. With flexible few-shot reasoning, multimodal integration, and natural-language-based interpretability, large language models (LLMs) offer unprecedented opportunities for prediction with structured biomedical data. We propose a novel framework called TAP-GPT, Tabular Alzheimer's Prediction GPT, that adapts TableGPT2, a multimodal tabular-specialized LLM originally developed for business intelligence tasks, for AD diagnosis using structured biomarker data with small sample sizes. Our approach constructs few-shot tabular prompts using in-context learning examples from structured biomedical data and finetunes TableGPT2 using the parameter-efficient qLoRA adaption for a clinical binary classification task of AD or cognitively normal (CN). The TAP-GPT framework harnesses the powerful tabular understanding ability of TableGPT2 and the encoded prior knowledge of LLMs to outperform more advanced general-purpose LLMs and a tabular foundation model (TFM) developed for prediction tasks. To our knowledge, this is the first application of LLMs to the prediction task using tabular biomarker data, paving the way for future LLM-driven multi-agent frameworks in biomedical informatics.


A rare coin treasure hunt kicks off in 4 American cities

Popular Science

Coin collectors, go dust off those running shoes. Breakthroughs, discoveries, and DIY tips sent every weekday. For Americans wishing they could participate in The Great Canadian Treasure Hunt, there is a new opportunity stateside. This month, the rare coin dealer and auction house Stack's Bowers Galleries is inviting the public to join in on a treasure hunt to celebrate the 90th anniversary of the firm's first auction. Certificates for rare coins and banknotes will be hidden in New York City, Boston, Philadelphia, and Miami, all cities where the auction house has retail store fronts.


Fair CCA for Fair Representation Learning: An ADNI Study

Hou, Bojian, Wang, Zhanliang, Zhou, Zhuoping, Tong, Boning, Wang, Zexuan, Bao, Jingxuan, Duong-Tran, Duy, Long, Qi, Shen, Li

arXiv.org Artificial Intelligence

Canonical correlation analysis (CCA) is a technique for finding correlations between different data modalities and learning low-dimensional representations. As fairness becomes crucial in machine learning, fair CCA has gained attention. However, previous approaches often overlook the impact on downstream classification tasks, limiting applicability. We propose a novel fair CCA method for fair representation learning, ensuring the projected features are independent of sensitive attributes, thus enhancing fairness without compromising accuracy. We validate our method on synthetic data and real-world data from the Alzheimer's Disease Neuroimaging Initiative (ADNI), demonstrating its ability to maintain high correlation analysis performance while improving fairness in classification tasks. Our work enables fair machine learning in neuroimaging studies where unbiased analysis is essential. Code is available in https://github.com/ZhanliangAaronWang/FR-CCA-ADNI.


Graph-Based Spatio-temporal Attention and Multi-Scale Fusion for Clinically Interpretable, High-Fidelity Fetal ECG Extraction

Wang, Chang, Zhu, Ming, Latifi, Shahram, Dawn, Buddhadeb, Zhai, Shengjie

arXiv.org Artificial Intelligence

Congenital Heart Disease (CHD) is the most common neonatal anomaly, highlighting the urgent need for early detection to improve outcomes. Yet, fetal ECG (fECG) signals in abdominal ECG (aECG) are often masked by maternal ECG and noise, challenging conventional methods under low signal-to-noise ratio (SNR) conditions. We propose FetalHealthNet (FHNet), a deep learning framework that integrates Graph Neural Networks with a multi-scale enhanced transformer to dynamically model spatiotemporal inter-lead correlations and extract clean fECG signals. On benchmark aECG datasets, FHNet consistently outperforms long short-term memory (LSTM) models, standard transformers, and state-of-the-art models, achieving R2>0.99 and RMSE = 0.015 even under severe noise. Interpretability analyses highlight physiologically meaningful temporal and lead contributions, supporting model transparency and clinical trust. FHNet illustrates the potential of AI-driven modeling to advance fetal monitoring and enable early CHD screening, underscoring the transformative impact of next-generation biomedical signal processing.


Benchmarking Dimensionality Reduction Techniques for Spatial Transcriptomics

Mahmud, Md Ishtyaq, Kochat, Veena, Satpati, Suresh, Dwarampudi, Jagan Mohan Reddy, Rai, Kunal, Banerjee, Tania

arXiv.org Artificial Intelligence

We introduce a unified framework for evaluating dimensionality reduction techniques in spatial transcriptomics beyond standard PCA approaches. We benchmark six methods PCA, NMF, autoencoder, VAE, and two hybrid embeddings on a cholangiocarcinoma Xenium dataset, systematically varying latent dimensions ($k$=5-40) and clustering resolutions ($ρ$=0.1-1.2). Each configuration is evaluated using complementary metrics including reconstruction error, explained variance, cluster cohesion, and two novel biologically-motivated measures: Cluster Marker Coherence (CMC) and Marker Exclusion Rate (MER). Our results demonstrate distinct performance profiles: PCA provides a fast baseline, NMF maximizes marker enrichment, VAE balances reconstruction and interpretability, while autoencoders occupy a middle ground. We provide systematic hyperparameter selection using Pareto optimal analysis and demonstrate how MER-guided reassignment improves biological fidelity across all methods, with CMC scores improving by up to 12\% on average. This framework enables principled selection of dimensionality reduction methods tailored to specific spatial transcriptomics analyses.


Atherosclerosis through Hierarchical Explainable Neural Network Analysis

Adam, Irsyad, Swee, Steven, Yilin, Erika, Ji, Ethan, Speier, William, Wang, Dean, Bui, Alex, Wang, Wei, Watson, Karol, Ping, Peipei

arXiv.org Artificial Intelligence

In this work, we study the problem pertaining to personalized classification of subclinical atherosclerosis by developing a hierarchical graph neural network framework to leverage two characteristic modalities of a patient: clinical features within the context of the cohort, and molecular data unique to individual patients. Current graph-based methods for disease classification detect patient-specific molecular fingerprints, but lack consistency and comprehension regarding cohort-wide features, which are an essential requirement for understanding pathogenic phenotypes across diverse atherosclerotic trajectories. Furthermore, understanding patient subtypes often considers clinical feature similarity in isolation, without integration of shared pathogenic interdependencies among patients. To address these challenges, we introduce ATHENA: Atherosclerosis Through Hierarchical Explainable Neural Network Analysis, which constructs a novel hierarchical network representation through integrated modality learning; subsequently, it optimizes learned patient-specific molecular fingerprints that reflect individual omics data, enforcing consistency with cohort-wide patterns. With a primary clinical dataset of 391 patients, we demonstrate that this heterogeneous alignment of clinical features with molecular interaction patterns has significantly boosted subclinical atherosclerosis classification performance across various baselines by up to 13% in area under the receiver operating curve (AUC) and 20% in F1 score. Taken together, ATHENA enables mechanistically-informed patient subtype discovery through explainable AI (XAI)-driven subnetwork clustering; this novel integration framework strengthens personalized intervention strategies, thereby improving the prediction of atherosclerotic disease progression and management of their clinical actionable outcomes.