Goto

Collaborating Authors

 charlottesville


"Monuments," Reviewed: The Confederacy Surrenders to a Truer American Past

The New Yorker

As the Trump Administration tries to rescue symbols of the Lost Cause, an exhibition in Los Angeles, led by Kara Walker, finds meaning in their desecration. Kara Walker's "Unmanned Drone" (2023) transforms a Stonewall Jackson statue. The first thing you see is a horse's ass, protruding, upside down, from the thorax of a monster. A man's arm descends from the beast's stomach, his gloved hand clutching the blade of a fallen sabre. Every part of the work comes from a statue of the Confederate general Stonewall Jackson that was removed from Charlottesville, Virginia, in 2021.


Optimizing Storytelling, Improving Audience Retention, and Reducing Waste in the Entertainment Industry

Cornfeld, Andrew, Miller, Ashley, Mora-Figueroa, Mercedes, Samuels, Kurt, Palomba, Anthony

arXiv.org Artificial Intelligence

Television networks face high financial risk when making programming decisions, often relying on limited historical data to forecast episodic viewership. This study introduces a machine learning framework that integrates natural language processing (NLP) features from over 25000 television episodes with traditional viewership data to enhance predictive accuracy. By extracting emotional tone, cognitive complexity, and narrative structure from episode dialogue, we evaluate forecasting performance using SARIMAX, rolling XGBoost, and feature selection models. While prior viewership remains a strong baseline predictor, NLP features contribute meaningful improvements for some series. We also introduce a similarity scoring method based on Euclidean distance between aggregate dialogue vectors to compare shows by content. Tested across diverse genres, including Better Call Saul and Abbott Elementary, our framework reveals genre-specific performance and offers interpretable metrics for writers, executives, and marketers seeking data-driven insight into audience behavior.


Rubric Is All You Need: Enhancing LLM-based Code Evaluation With Question-Specific Rubrics

Pathak, Aditya, Gandhi, Rachit, Uttam, Vaibhav, Devansh, null, Nakka, Yashwanth, Jindal, Aaryan Raj, Ghosh, Pratyush, Ramamoorthy, Arnav, Verma, Shreyash, Mittal, Aditya, Ased, Aashna, Khatri, Chirag, Challa, Jagat Sesh, Kumar, Dhruv

arXiv.org Artificial Intelligence

Since the disruption in LLM technology brought about by the release of GPT-3 and ChatGPT, LLMs have shown remarkable promise in programming-related tasks. While code generation remains a popular field of research, code evaluation using LLMs remains a problem with no conclusive solution. In this paper, we focus on LLM-based code evaluation and attempt to fill in the existing gaps. We propose multi-agentic novel approaches using question-specific rubrics tailored to the problem statement, arguing that these perform better for logical assessment than the existing approaches that use question-agnostic rubrics. To address the lack of suitable evaluation datasets, we introduce two datasets: a Data Structures and Algorithms dataset containing 150 student submissions from a popular Data Structures and Algorithms practice website, and an Object Oriented Programming dataset comprising 80 student submissions from undergraduate computer science courses. In addition to using standard metrics (Spearman Correlation, Cohen's Kappa), we additionally propose a new metric called as Leniency, which quantifies evaluation strictness relative to expert assessment. Our comprehensive analysis demonstrates that question-specific rubrics significantly enhance logical assessment of code in educational settings, providing better feedback aligned with instructional goals beyond mere syntactic correctness.


An Overview of MLCommons Cloud Mask Benchmark: Related Research and Data

von Laszewski, Gregor, Gu, Ruochen

arXiv.org Artificial Intelligence

Cloud masking is a crucial task that is well-motivated for meteorology and its applications in environmental and atmospheric sciences. Its goal is, given satellite images, to accurately generate cloud masks that identify each pixel in image to contain either cloud or clear sky. In this paper, we summarize some of the ongoing research activities in cloud masking, with a focus on the research and benchmark currently conducted in MLCommons Science Working Group. This overview is produced with the hope that others will have an easier time getting started and collaborate on the activities related to MLCommons Cloud Mask Benchmark.


Population Age Group Sensitivity for COVID-19 Infections with Deep Learning

Islam, Md Khairul, Valentine, Tyler, Wang, Royal, Davis, Levi, Manner, Matt, Fox, Judy

arXiv.org Artificial Intelligence

The COVID-19 pandemic has created unprecedented challenges for governments and healthcare systems worldwide, highlighting the critical importance of understanding the factors that contribute to virus transmission. This study aimed to identify the most influential age groups in COVID-19 infection rates at the US county level using the Modified Morris Method and deep learning for time series. Our approach involved training the state-of-the-art time-series model Temporal Fusion Transformer on different age groups as a static feature and the population vaccination status as the dynamic feature. We analyzed the impact of those age groups on COVID-19 infection rates by perturbing individual input features and ranked them based on their Morris sensitivity scores, which quantify their contribution to COVID-19 transmission rates. The findings are verified using ground truth data from the CDC and US Census, which provide the true infection rates for each age group. The results suggest that young adults were the most influential age group in COVID-19 transmission at the county level between March 1, 2020, and November 27, 2021. Using these results can inform public health policies and interventions, such as targeted vaccination strategies, to better control the spread of the virus. Our approach demonstrates the utility of feature sensitivity analysis in identifying critical factors contributing to COVID-19 transmission and can be applied in other public health domains.


Shape Analysis for Pediatric Upper Body Motor Function Assessment

Kumar, Shashwat, Gutierez, Robert, Datta, Debajyoti, Tolman, Sarah, McCrady, Allison, Blemker, Silvia, Scharf, Rebecca J., Barnes, Laura

arXiv.org Artificial Intelligence

Neuromuscular disorders, such as Spinal Muscular Atrophy (SMA) and Duchenne Muscular Dystrophy (DMD), cause progressive muscular degeneration and loss of motor function for 1 in 6,000 children. Traditional upper limb motor function assessments do not quantitatively measure patient-performed motions, which makes it difficult to track progress for incremental changes. Assessing motor function in children with neuromuscular disorders is particularly challenging because they can be nervous or excited during experiments, or simply be too young to follow precise instructions. These challenges translate to confounding factors such as performing different parts of the arm curl slower or faster (phase variability) which affects the assessed motion quality. This paper uses curve registration and shape analysis to temporally align trajectories while simultaneously extracting a mean reference shape. Distances from this mean shape are used to assess the quality of motion. The proposed metric is invariant to confounding factors, such as phase variability, while suggesting several clinically relevant insights. First, there are statistically significant differences between functional scores for the control and patient populations (p$=$0.0213$\le$0.05). Next, several patients in the patient cohort are able to perform motion on par with the healthy cohort and vice versa. Our metric, which is computed based on wearables, is related to the Brooke's score ((p$=$0.00063$\le$0.05)), as well as motor function assessments based on dynamometry ((p$=$0.0006$\le$0.05)). These results show promise towards ubiquitous motion quality assessment in daily life.


AI/ML, Data Science Jobs #hiring

#artificialintelligence

The University of Virginia School of Data Science--the first of its kind in the nation--is guided by common goals: to further discovery, share knowledge, and make a positive impact on society through collaborative, open, and responsible data science research and education. Founded in fall 2019 through the largest gift in UVA history.


Improving mathematical questioning in teacher training

Datta, Debajyoti, Phillips, Maria, Bywater, James P, Chiu, Jennifer, Watson, Ginger S., Barnes, Laura E., Brown, Donald E

arXiv.org Artificial Intelligence

High-fidelity, AI-based simulated classroom systems enable teachers to rehearse effective teaching strategies. However, dialogue-oriented open-ended conversations such as teaching a student about scale factors can be difficult to model. This paper builds a text-based interactive conversational agent to help teachers practice mathematical questioning skills based on the well-known Instructional Quality Assessment. We take a human-centered approach to designing our system, relying on advances in deep learning, uncertainty quantification, and natural language processing while acknowledging the limitations of conversational agents for specific pedagogical needs. Using experts' input directly during the simulation, we demonstrate how conversation success rate and high user satisfaction can be achieved.


Evaluation of mathematical questioning strategies using data collected through weak supervision

Datta, Debajyoti, Phillips, Maria, Bywater, James P, Chiu, Jennifer, Watson, Ginger S., Barnes, Laura E., Brown, Donald E

arXiv.org Artificial Intelligence

A large body of research demonstrates how teachers' questioning strategies can improve student learning outcomes. However, developing new scenarios is challenging because of the lack of training data for a specific scenario and the costs associated with labeling. This paper presents a high-fidelity, AI-based classroom simulator to help teachers rehearse research-based mathematical questioning skills. Using a human-in-the-loop approach, we collected a high-quality training dataset for a mathematical questioning scenario. Using recent advances in uncertainty quantification, we evaluated our conversational agent for usability and analyzed the practicality of incorporating a human-in-the-loop approach for data collection and system evaluation for a mathematical questioning scenario.


RxSwift Unit Testing and Machine Learning on iOS Devices

#artificialintelligence

Venturing the world of RxSwift unit testing Reactive programming is an emerging discipline that allows to write declarative, asynchronous and concurrent code in a functional way and is continuously gaining popularity and adoption. In this talk we will wander in the unexplored pathways of RxSwift testing infrastructure. Specifically, we will look into the key aspects of testing RxSwift code and we will analyze the different ways to unit test observable streams through a simple sign in form. Eleni Papanikolopoulou, iOS Developer @ Workable: l am an iOS Developer based in Athens. I have been working at Workable, the recruiting software company, for the past three years and hold a Master's degree in Computer Science from University of Manchester, UK.