phd
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Europe > Germany > Brandenburg > Potsdam (0.04)
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story
Pedashenko, Vladislav, Kushnareva, Laida, Nibal, Yana Khassan, Tulchinskii, Eduard, Kuznetsov, Kristian, Zharchinskii, Vladislav, Maximov, Yury, Piontkovskaya, Irina
Intrinsic dimension (ID) is an important tool in modern LLM analysis, informing studies of training dynamics, scaling behavior, and dataset structure, yet its textual determinants remain underexplored. We provide the first comprehensive study grounding ID in interpretable text properties through cross-encoder analysis, linguistic features, and sparse autoencoders (SAEs). In this work, we establish three key findings. First, ID is complementary to entropy-based metrics: after controlling for length, the two are uncorrelated, with ID capturing geometric complexity orthogonal to prediction quality. Second, ID exhibits robust genre stratification: scientific prose shows low ID (~8), encyclopedic content medium ID (~9), and creative/opinion writing high ID (~10.5) across all models tested. This reveals that contemporary LLMs find scientific text "representationally simple" while fiction requires additional degrees of freedom. Third, using SAEs, we identify causal features: scientific signals (formal tone, report templates, statistics) reduce ID; humanized signals (personalization, emotion, narrative) increase it. Steering experiments confirm these effects are causal. Thus, for contemporary models, scientific writing appears comparatively "easy", whereas fiction, opinion, and affect add representational degrees of freedom. Our multi-faceted analysis provides practical guidance for the proper use of ID and the sound interpretation of ID-based results.
- Europe > Austria > Vienna (0.14)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
- (6 more...)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Europe > Russia (0.04)
- Asia > Russia (0.04)
- (10 more...)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
- (2 more...)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- Europe > Netherlands > North Holland > Amsterdam (0.10)
- Europe > Netherlands > South Holland > Rotterdam (0.05)
- (16 more...)
Diffusion-Driven Generation of Minimally Preprocessed Brain MRI
Remedios, Samuel W., Carass, Aaron, Prince, Jerry L., Dewey, Blake E.
The purpose of this study is to present and compare three denoising diffusion probabilistic models (DDPMs) that generate 3D $T_1$-weighted MRI human brain images. Three DDPMs were trained using 80,675 image volumes from 42,406 subjects spanning 38 publicly available brain MRI datasets. These images had approximately 1 mm isotropic resolution and were manually inspected by three human experts to exclude those with poor quality, field-of-view issues, and excessive pathology. The images were minimally preprocessed to preserve the visual variability of the data. Furthermore, to enable the DDPMs to produce images with natural orientation variations and inhomogeneity, the images were neither registered to a common coordinate system nor bias field corrected. Evaluations included segmentation, Frechet Inception Distance (FID), and qualitative inspection. Regarding results, all three DDPMs generated coherent MR brain volumes. The velocity and flow prediction models achieved lower FIDs than the sample prediction model. However, all three models had higher FIDs compared to real images across multiple cohorts. In a permutation experiment, the generated brain regional volume distributions differed statistically from real data. However, the velocity and flow prediction models had fewer statistically different volume distributions in the thalamus and putamen. In conclusion this work presents and releases the first 3D non-latent diffusion model for brain data without skullstripping or registration. Despite the negative results in statistical testing, the presented DDPMs are capable of generating high-resolution 3D $T_1$-weighted brain images. All model weights and corresponding inference code are publicly available at https://github.com/piksl-research/medforj .
- North America > United States > California > San Francisco County > San Francisco (0.14)
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > Massachusetts (0.05)
- (21 more...)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.68)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Europe > Russia (0.04)
- Asia > Russia (0.04)
- (10 more...)
Interview with Janice Anta Zebaze: using AI to address energy supply challenges
In this interview series, we're meeting some of the AAAI/SIGAI Doctoral Consortium participants to find out more about their research. Janice Anta Zebaze is using AI to address energy supply challenges and she told us more about the research she's carried our so far, her plans for further investigations, and what inspired her to pursue a PhD in the field. Tell us a bit about your PhD - where are you studying, and what is the topic of your research? I am currently pursuing my PhD in Physics at the University of Yaounde I in Cameroon, with a focus on renewable energy systems, tribology, and artificial intelligence. The aim of my research is to address energy supply challenges in developing countries by leveraging AI to evaluate resource availability and optimize energy systems.
- Energy > Renewable (1.00)
- Leisure & Entertainment > Sports > Soccer (0.31)
Interview with Zahra Ghorrati: developing frameworks for human activity recognition using wearable sensors
In this interview series, we're meeting some of the AAAI/SIGAI Doctoral Consortium participants to find out more about their research. Zahra Ghorrati is developing frameworks for human activity recognition using wearable sensors. We caught up with Zahra to find out more about this research, the aspects she has found most interesting, and her advice for prospective PhD students. Tell us a bit about your PhD - where are you studying, and what is the topic of your research? I am pursuing my PhD at Purdue University, where my dissertation focuses on developing scalable and adaptive deep learning frameworks for human activity recognition (HAR) using wearable sensors.
- Health & Medicine (0.50)
- Information Technology (0.48)
Temporally-Aware Diffusion Model for Brain Progression Modelling with Bidirectional Temporal Regularisation
Litrico, Mattia, Guarnera, Francesco, Giuffrida, Mario Valerio, Ravì, Daniele, Battiato, Sebastiano
Generating realistic MRIs to accurately predict future changes in the structure of brain is an invaluable tool for clinicians in assessing clinical outcomes and analysing the disease progression at the patient level. However, current existing methods present some limitations: (i) some approaches fail to explicitly capture the relationship between structural changes and time intervals, especially when trained on age-imbalanced datasets; (ii) others rely only on scan interpolation, which lack clinical utility, as they generate intermediate images between timepoints rather than future pathological progression; and (iii) most approaches rely on 2D slice-based architectures, thereby disregarding full 3D anatomical context, which is essential for accurate longitudinal predictions. We propose a 3D Temporally-Aware Diffusion Model (TADM-3D), which accurately predicts brain progression on MRI volumes. To better model the relationship between time interval and brain changes, TADM-3D uses a pre-trained Brain-Age Estimator (BAE) that guides the diffusion model in the generation of MRIs that accurately reflect the expected age difference between baseline and generated follow-up scans. Additionally, to further improve the temporal awareness of TADM-3D, we propose the Back-In-Time Regularisation (BITR), by training TADM-3D to predict bidirectionally from the baseline to follow-up (forward), as well as from the follow-up to baseline (backward). Although predicting past scans has limited clinical applications, this regularisation helps the model generate temporally more accurate scans. We train and evaluate TADM-3D on the OASIS-3 dataset, and we validate the generalisation performance on an external test set from the NACC dataset. The code will be available upon acceptance.
- Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.14)
- Europe > Switzerland (0.04)
- Europe > Italy (0.04)
- (2 more...)
- Health & Medicine > Health Care Technology (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
- Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (0.94)
Interview with Benyamin Tabarsi: Computing education and generative AI
In this interview series, we're meeting some of the AAAI/SIGAI Doctoral Consortium participants to find out more about their research. In this latest interview, Benyamin Tabarsi tells us about his research at the intersection of generative AI and computing education. We find out more about what he's investigated so far during his PhD, what is particularly interesting about this research area, and what inspired him to undertake a PhD in the field. I'm a computer science student at North Carolina (NC) State University, and my research focuses on computing education and generative AI. I've always been passionate about finding ways to make learning easier for students and teaching more efficient for instructors, especially in computer science.