turbo
- North America > United States > Minnesota (0.04)
- North America > United States > Connecticut (0.04)
- North America > United States > Washington > King County > Redmond (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Florida > Alachua County > Gainesville (0.04)
- (2 more...)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.82)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
- North America > United States > New Mexico > Los Alamos County > Los Alamos (0.05)
- North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
- (2 more...)
AI-Enabled grading with near-domain data for scaling feedback with human-level accuracy
Agarwal, Shyam, Moghimi, Ali, Haudek, Kevin C.
Constructed-response questions are crucial to encourage generative processing and test a learner's understanding of core concepts. However, the limited availability of instructor time, large class sizes, and other resource constraints pose significant challenges in providing timely and detailed evaluation, which is crucial for a holistic educational experience. In addition, providing timely and frequent assessments is challenging since manual grading is labor intensive, and automated grading is complex to generalize to every possible response scenario. This paper proposes a novel and practical approach to grade short-answer constructed-response questions. We discuss why this problem is challenging, define the nature of questions on which our method works, and finally propose a framework that instructors can use to evaluate their students' open-responses, utilizing near-domain data like data from similar questions administered in previous years. The proposed method outperforms the state of the art machine learning models as well as non-fine-tuned large language models like GPT 3.5, GPT 4, and GPT 4o by a considerable margin of over 10-20% in some cases, even after providing the LLMs with reference/model answers. Our framework does not require pre-written grading rubrics and is designed explicitly with practical classroom settings in mind. Our results also reveal exciting insights about learning from near-domain data, including what we term as accuracy and data advantages using human-labeled data, and we believe this is the first work to formalize the problem of automated short answer grading based on the near-domain data.
- North America > United States > Michigan (0.04)
- North America > United States > California > Yolo County > Davis (0.04)
- Oceania > Australia > Victoria > Melbourne (0.04)
- (5 more...)
- Research Report > New Finding (1.00)
- Instructional Material (1.00)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
- Education > Curriculum (1.00)
- Education > Educational Setting > Online (0.93)
- (3 more...)
- North America > United States > Washington > King County > Redmond (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Florida > Alachua County > Gainesville (0.04)
- (2 more...)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.82)
Porsche Reveals Everything About Its Cayenne Electric--Except for One Vital Thing
The automaker has taken the covers off its Cayenne Electric and Cayenne Turbo Electric, the most powerful production Porsches ever. But it won't confirm a key AI feature of its first fully electric SUV. In the first nine months of 2025, Porsche's operating profit plummeted by 99 percent compared to the same stint the year before. Profit has tanked for the auto brand with a track record of making billions. The reasons for Porsche's misfortune are no secret.
- South America > French Guiana > Guyane > Cayenne (0.88)
- Europe > United Kingdom (0.05)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- (3 more...)
- Transportation > Ground > Road (1.00)
- Automobiles & Trucks > Manufacturer (1.00)
Safer in Translation? Presupposition Robustness in Indic Languages
Palnitkar, Aadi, Suresh, Arjun, Rajesh, Rishi, Puli, Puneet
Increasingly, more and more people are turning to large language models (LLMs) for healthcare advice and consultation, making it important to gauge the efficacy and accuracy of the responses of LLMs to such queries. While there are pre-existing medical benchmarks literature which seeks to accomplish this very task, these benchmarks are almost universally in English, which has led to a notable gap in existing literature pertaining to multilingual LLM evaluation. Within this work, we seek to aid in addressing this gap with Cancer-Myth-Indic, an Indic language benchmark built by translating a 500-item subset of Cancer-Myth, sampled evenly across its original categories, into five under-served but widely used languages from the subcontinent (500 per language; 2,500 translated items total). Native-speaker translators followed a style guide for preserving implicit presuppositions in translation; items feature false presuppositions relating to cancer. We evaluate several popular LLMs under this presupposition stress.
- Asia > India (0.15)
- North America > United States > Maryland > Prince George's County > College Park (0.14)
- Asia > Indonesia > Bali (0.04)