Goto

Collaborating Authors

 automated


Automated Labeling of Intracranial Arteries with Uncertainty Quantification Using Deep Learning

arXiv.org Artificial Intelligence

Accurate anatomical labeling of intracranial arteries is essential for cerebrovascular diagnosis and hemodynamic analysis but remains time-consuming and subject to interoperator variability. We present a deep learning-based framework for automated artery labeling from 3D Time-of-Flight Magnetic Resonance Angiography (3D ToF-MRA) segmentations (n=35), incorporating uncertainty quantification to enhance interpretability and reliability. We evaluated three convolutional neural network architectures: (1) a UNet with residual encoder blocks, reflecting commonly used baselines in vascular labeling; (2) CS-Net, an attention-augmented UNet incorporating channel and spatial attention mechanisms for enhanced curvilinear structure recognition; and (3) nnUNet, a self-configuring framework that automates preprocessing, training, and architectural adaptation based on dataset characteristics. Among these, nnUNet achieved the highest labeling performance (average Dice score: 0.922; average surface distance: 0.387 mm), with improved robustness in anatomically complex vessels. To assess predictive confidence, we implemented test-time augmentation (TT A) and introduced a novel coordinate-guided strategy to reduce interpolation errors during augmented inference. The resulting uncertainty maps reliably indicated regions of anatomical ambiguity, pathological variation, or manual labeling inconsistency. We further validated clinical utility by comparing flow velocities derived from automated and manual labels in co-registered 4D Flow MRI datasets, observing close agreement with no statistically significant differences. Our framework offers a scalable, accurate, and uncertainty-aware solution for automated cerebrovascular labeling, supporting downstream hemodynamic analysis and facilitating clinical integration. Introduction The intracranial arterial system plays a critical role in brain perfusion to maintain normal cognitive function.


A Survey of Cognitive Distortion Detection and Classification in NLP

arXiv.org Artificial Intelligence

As interest grows in applying natural language processing (NLP) techniques to mental health, an expanding body of work explores the automatic detection and classification of cognitive distortions (CDs). CDs are habitual patterns of negatively biased or flawed thinking that distort how people perceive events, judge themselves, and react to the world. Identifying and addressing them is a central goal of therapy. Despite this momentum, the field remains fragmented, with inconsistencies in CD taxonomies, task formulations, and evaluation practices limiting comparability across studies. This survey presents the first comprehensive review of 38 studies spanning two decades, mapping how CDs have been implemented in computational research and evaluating the methods applied. We provide a consolidated CD taxonomy reference, summarise common task setups, and highlight persistent challenges to support more coherent and reproducible research. Alongside our review, we introduce practical resources, including curated evaluation metrics from surveyed papers, a standardised datasheet template, and an ethics flowchart, available online.


The ALCHEmist: Automated Labeling 500x CHEaper than LLM Data Annotators

Neural Information Processing Systems

Large pretrained models can be used as annotators, helping replace or augment crowdworkers and enabling distilling generalist models into smaller specialist models. Unfortunately, this comes at a cost: employing top-of-the-line models often requires paying thousands of dollars for API calls, while the resulting datasets are static and challenging to audit. To address these challenges, we propose a simple alternative: rather than directly querying labels from pretrained models, we task models to generate programs that can produce labels. These programs can be stored and applied locally, re-used and extended, and cost orders of magnitude less. Our system, \textbf{Alchemist}, obtains comparable to or better performance than large language model-based annotation in a range of tasks for a fraction of the cost: on average, improvements amount to a \textbf{12.9}


ChatGPT for automated grading of short answer questions in mechanical ventilation

arXiv.org Artificial Intelligence

Standardised tests using short answer questions (SAQs) are common in postgraduate education. Large language models (LLMs) simulate conversational language and interpret unstructured free-text responses in ways aligning with applying SAQ grading rubrics, making them attractive for automated grading. We evaluated ChatGPT 4o to grade SAQs in a postgraduate medical setting using data from 215 students (557 short-answer responses) enrolled in an online course on mechanical ventilation (2020--2024). Deidentified responses to three case-based scenarios were presented to ChatGPT with a standardised grading prompt and rubric. Outputs were analysed using mixed-effects modelling, variance component analysis, intraclass correlation coefficients (ICCs), Cohen's kappa, Kendall's W, and Bland--Altman statistics. ChatGPT awarded systematically lower marks than human graders with a mean difference (bias) of -1.34 on a 10-point scale. ICC values indicated poor individual-level agreement (ICC1 = 0.086), and Cohen's kappa (-0.0786) suggested no meaningful agreement. Variance component analysis showed minimal variability among the five ChatGPT sessions (G-value = 0.87), indicating internal consistency but divergence from the human grader. The poorest agreement was observed for evaluative and analytic items, whereas checklist and prescriptive rubric items had less disagreement. We caution against the use of LLMs in grading postgraduate coursework. Over 60% of ChatGPT-assigned grades differed from human grades by more than acceptable boundaries for high-stakes assessments.


Neural Automated Writing Evaluation with Corrective Feedback

arXiv.org Artificial Intelligence

The utilization of technology in second language learning and teaching has become ubiquitous. For the assessment of writing specifically, automated writing evaluation (AWE) and grammatical error correction (GEC) have become immensely popular and effective methods for enhancing writing proficiency and delivering instant and individualized feedback to learners. By leveraging the power of natural language processing (NLP) and machine learning algorithms, AWE and GEC systems have been developed separately to provide language learners with automated corrective feedback and more accurate and unbiased scoring that would otherwise be subject to examiners. In this paper, we propose an integrated system for automated writing evaluation with corrective feedback as a means of bridging the gap between AWE and GEC results for second language learners. This system enables language learners to simulate the essay writing tests: a student writes and submits an essay, and the system returns the assessment of the writing along with suggested grammatical error corrections. Given that automated scoring and grammatical correction are more efficient and cost-effective than human grading, this integrated system would also alleviate the burden of manually correcting innumerable essays.


Your Fast Food Is Already Automated

The Atlantic - Technology

A clawlike contraption lurched forward, like a bird pecking at feed, to snatch dishes holding a faux-chicken cutlet and potatoes, then inserted them onto a metal track that snakes through a 650-degree-Fahrenheit oven. Seven minutes, some automatic food dispensers, and two conveyor belts later (with a healthy assist from human hands), my meal was sitting on a shelf of mint-green cubbies. It was a vegan fried-chicken sandwich, a cucumber salad, crispy potatoes, and a smattering of other sides. This is Kernel, a fast-casual venture that opened its first store, in Manhattan, this February. Its founder, Steve Ells, kicked off the lunch-bowl boom when he started Chipotle in 1993.


Machine Learning for Flow Cytometry Data Analysis

arXiv.org Artificial Intelligence

Flow cytometry mainly used for detecting the characteristics of a number of biochemical substances based on the expression of specific markers in cells. It is particularly useful for detecting membrane surface receptors, antigens, ions, or during DNA/RNA expression. Not only can it be employed as a biomedical research tool for recognising distinctive types of cells in mixed populations, but it can also be used as a diagnostic tool for classifying abnormal cell populations connected with disease. Modern flow cytometers can rapidly analyse tens of thousands of cells at the same time while also measuring multiple parameters from a single cell. However, the rapid development of flow cytometers makes it challenging for conventional analysis methods to interpret flow cytometry data. Researchers need to be able to distinguish interesting-looking cell populations manually in multi-dimensional data collected from millions of cells. Thus, it is essential to find a robust approach for analysing flow cytometry data automatically, specifically in identifying cell populations automatically. This thesis mainly concerns discover the potential shortcoming of current automated-gating algorithms in both real datasets and synthetic datasets. Three representative automated clustering algorithms are selected to be applied, compared and evaluated by completely and partially automated gating. A subspace clustering ProClus also implemented in this thesis. The performance of ProClus in flow cytometry is not well, but it is still a useful algorithm to detect noise.


AI in the Tax Industry: Can Everything Be Automated? - Unite.AI

#artificialintelligence

The tax industry is an exciting place for artificial intelligence (AI) and automation to flourish. Tax professionals must prepare and file millions of returns each year, but people aren't the best at repeating things meticulously. They want to be creative and not just follow a script.


Evidence Partners Raises $20M for Automated, Evidence-Based Research

#artificialintelligence

DistillerSR is a web-based platform that allows researchers to collaborate concurrently on the same projects from anywhere, without blocking or overwriting one another. Bootstrapped since its formation in 2008, Evidence Partners pioneered the development of AI-enabled literature review software through the development of DistillerSR, which has had double-digit growth since the platform's launch in 2009. Literature reviews are the cornerstone of evidence-based research, but their production has traditionally been highly manual, time consuming, and error prone. Today, more than 300 of the world's leading research organizations, including more than 60 percent of the largest pharmaceutical and medical device companies, trust DistillerSR to securely produce transparent, audit-ready, and regulatory compliant literature reviews faster and more accurately than any other method. With more organizations using DistillerSR to automate their systematic reviews, healthcare researchers can make more informed and time-sensitive health policy decisions, clinical practice guidelines, regulatory submissions, and deliver better overall research.


Will Marketing Be Automated by AI?

#artificialintelligence

Paul Roetzer (@paulroetzer) is founder of PR 20/20, author of The Marketing Performance Blueprint and The Marketing Agency Blueprint, and creator of The Marketing Artificial Intelligence Institute and Marketing Score.