AITopics | attainment

Collaborating Authors

attainment

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Evaluating undergraduate mathematics examinations in the era of generative AI: a curriculum-level case study

Walker, Benjamin J., Kalaydzhieva, Nikoleta, Lameda, Beatriz Navarro, Reynolds, Ruth A.

arXiv.org Artificial IntelligenceSep-30-2025

Generative artificial intelligence (GenAI) tools such as OpenAI's ChatGPT are transforming the educational landscape, prompting reconsideration of traditional assessment practices. In parallel, universities are exploring alternatives to in-person, closed-book examinations, raising concerns about academic integrity and pedagogical alignment in uninvigilated settings. This study investigates whether traditional closed-book mathematics examinations retain their pedagogical relevance when hypothetically administered in uninvigilated, open-book settings with GenAI access. Adopting an empirical approach, we generate, transcribe, and blind-mark GenAI submissions to eight undergraduate mathematics examinations at a Russell Group university, spanning the entirety of the first-year curriculum. By combining independent GenAI responses to individual questions, we enable a meaningful evaluation of GenAI performance, both at the level of modules and across the first-year curriculum. We find that GenAI attainment is at the level of a first-class degree, though current performance can vary between modules. Further, we find that GenAI performance is remarkably consistent when viewed across the entire curriculum, significantly more so than that of students in invigilated examinations. Our findings evidence the need for redesigning assessments in mathematics for unsupervised settings, and highlight the potential reduction in pedagogical value of current standards in the era of generative artificial intelligence.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2509.13359

Country: Europe > United Kingdom > England (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study > Negative Result (0.46)

Industry: Education > Educational Setting > Higher Education (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

HyperFlexis: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling

Yousefijamarani, Zahra, Wang, Xinglu, Wang, Qian, Heisler, Morgan Lindsay, Shabani, Taha, Gholipour, Niloofar, Yassini, Parham, Chang, Hong, Chen, Kan, Zhang, Qiantao, Bai, Xiaolong, Wang, Jiannan, Xiong, Ying, Zhang, Yong, Fan, Zhenan

arXiv.org Artificial IntelligenceSep-26-2025

Modern large language model (LLM) serving systems face challenges from highly variable requests with diverse lengths, priorities, and stage-specific service-level objectives (SLOs). Meeting these requires real-time scheduling, rapid and cost-effective scaling, and support for both collocated and disaggregated Prefill/Decode (P/D) architectures. We present HyperFlexis, a unified LLM serving system that integrates algorithmic and system-level innovations to jointly optimize scheduling and scaling under multiple SLOs. It features a multi-SLO-aware scheduler that leverages budget estimation and request prioritization to ensure proactive SLO compliance for both new and ongoing requests. The system supports prefill- and decode-stage multi-SLO scheduling for P/D-disaggregated architectures and KV cache transfers. It also enables cost-effective scaling decisions, prefill-decode instance linking during scaling, and rapid P/D role transitions. To accelerate scaling and reduce cold-start latency, a device-to-device (D2D) weight transfer mechanism is proposed that lowers weight loading overhead by up to 19.39$\times$. These optimizations allow the system to achieve up to 4.44$\times$ higher SLO attainment, 65.82% lower request latency, and cost parity with state-of-the-art baselines. The code will be released soon.

hyperflexis, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2508.15919

Country: North America > United States (0.14)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

On the attainment of the Wasserstein--Cramer--Rao lower bound

Nishimori, Hayato, Matsuda, Takeru

arXiv.org Machine LearningJun-18-2025

Recently, a Wasserstein analogue of the Cramer--Rao inequality has been developed using the Wasserstein information matrix (Otto metric). This inequality provides a lower bound on the Wasserstein variance of an estimator, which quantifies its robustness against additive noise. In this study, we investigate conditions for an estimator to attain the Wasserstein--Cramer--Rao lower bound (asymptotically), which we call the (asymptotic) Wasserstein efficiency. We show a condition under which Wasserstein efficient estimators exist for one-parameter statistical models. This condition corresponds to a recently proposed Wasserstein analogue of one-parameter exponential families (e-geodesics). We also show that the Wasserstein estimator, a Wasserstein analogue of the maximum likelihood estimator based on the Wasserstein score function, is asymptotically Wasserstein efficient in location-scale families.

artificial intelligence, estimator, machine learning, (13 more...)

arXiv.org Machine Learning

2506.12732

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.35)

Add feedback

Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving

Yu, Shan, Xing, Jiarong, Qiao, Yifan, Ma, Mingyuan, Li, Yangmin, Wang, Yang, Yang, Shuo, Xie, Zhiqiang, Cao, Shiyi, Bao, Ke, Stoica, Ion, Xu, Harry, Sheng, Ying

arXiv.org Artificial IntelligenceMay-14-2025

Serving large language models (LLMs) is expensive, especially for providers hosting many models, making cost reduction essential. The unique workload patterns of serving multiple LLMs (i.e., multi-LLM serving) create new opportunities and challenges for this task. The long-tail popularity of models and their long idle periods present opportunities to improve utilization through GPU sharing. However, existing GPU sharing systems lack the ability to adjust their resource allocation and sharing policies at runtime, making them ineffective at meeting latency service-level objectives (SLOs) under rapidly fluctuating workloads. This paper presents Prism, a multi-LLM serving system that unleashes the full potential of GPU sharing to achieve both cost efficiency and SLO attainment. At its core, Prism tackles a key limitation of existing systems$\unicode{x2014}$the lack of $\textit{cross-model memory coordination}$, which is essential for flexibly sharing GPU memory across models under dynamic workloads. Prism achieves this with two key designs. First, it supports on-demand memory allocation by dynamically mapping physical to virtual memory pages, allowing flexible memory redistribution among models that space- and time-share a GPU. Second, it improves memory efficiency through a two-level scheduling policy that dynamically adjusts sharing strategies based on models' runtime demands. Evaluations on real-world traces show that Prism achieves more than $2\times$ cost savings and $3.3\times$ SLO attainment compared to state-of-the-art systems.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2505.04021

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Industry: Information Technology (0.47)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

SLOs-Serve: Optimized Serving of Multi-SLO LLMs

Chen, Siyuan, Jia, Zhipeng, Khan, Samira, Krishnamurthy, Arvind, Gibbons, Phillip B.

arXiv.org Artificial IntelligenceApr-15-2025

This paper introduces SLOs-Serve, a system designed for serving multi-stage large language model (LLM) requests with application- and stage-specific service level objectives (SLOs). The key idea behind SLOs-Serve is to customize the allocation of tokens to meet these SLO requirements. SLOs-Serve uses a multi-SLO dynamic programming-based algorithm to continuously optimize token allocations under SLO constraints by exploring the full design space of chunked prefill and (optional) speculative decoding. Leveraging this resource planning algorithm, SLOs-Serve effectively supports multi-SLOs and multi-replica serving with dynamic request routing while being resilient to bursty arrivals. Our evaluation across 6 LLM application scenarios (including summarization, coding, chatbot, tool calling, and reasoning) demonstrates that SLOs-Serve improves per-GPU serving capacity by 2.2x on average compared to prior state-of-the-art systems.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2504.08784

Country: North America > United States (1.00)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Efficiently Serving LLM Reasoning Programs with Certaindex

Fu, Yichao, Chen, Junda, Zhu, Siqi, Fu, Zheyu, Dai, Zhongdongming, Qiao, Aurick, Zhang, Hao

arXiv.org Artificial IntelligenceDec-30-2024

The rapid evolution of large language models (LLMs) has unlocked their capabilities in advanced reasoning tasks like mathematical problem-solving, code generation, and legal analysis. Central to this progress are inference-time reasoning algorithms, which refine outputs by exploring multiple solution paths, at the cost of increasing compute demands and response latencies. Existing serving systems fail to adapt to the scaling behaviors of these algorithms or the varying difficulty of queries, leading to inefficient resource use and unmet latency targets. We present Dynasor, a system that optimizes inference-time compute for LLM reasoning queries. Unlike traditional engines, Dynasor tracks and schedules requests within reasoning queries and uses Certaindex, a proxy that measures statistical reasoning progress based on model certainty, to guide compute allocation dynamically. Dynasor co-adapts scheduling with reasoning progress: it allocates more compute to hard queries, reduces compute for simpler ones, and terminates unpromising queries early, balancing accuracy, latency, and cost. On diverse datasets and algorithms, Dynasor reduces compute by up to 50% in batch processing and sustaining 3.3x higher query rates or 4.7x tighter latency SLOs in online serving.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2412.20993

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Granularity at Scale: Estimating Neighborhood Socioeconomic Indicators from High-Resolution Orthographic Imagery and Hybrid Learning

Brewer, Ethan, Valdrighi, Giovani, Solunke, Parikshit, Rulff, Joao, Piadyk, Yurii, Lv, Zhonghui, Poco, Jorge, Silva, Claudio

arXiv.org Artificial IntelligenceDec-12-2023

Many areas of the world are without basic information on the socioeconomic well-being of the residing population due to limitations in existing data collection methods. Overhead images obtained remotely, such as from satellite or aircraft, can help serve as windows into the state of life on the ground and help "fill in the gaps" where community information is sparse, with estimates at smaller geographic scales requiring higher resolution sensors. Concurrent with improved sensor resolutions, recent advancements in machine learning and computer vision have made it possible to quickly extract features from and detect patterns in image data, in the process correlating these features with other information. In this work, we explore how well two approaches, a supervised convolutional neural network and semi-supervised clustering based on bag-of-visual-words, estimate population density, median household income, and educational attainment of individual neighborhoods from publicly available high-resolution imagery of cities throughout the United States. Results and analyses indicate that features extracted from the imagery can accurately estimate the density (R$^2$ up to 0.81) of neighborhoods, with the supervised approach able to explain about half the variation in a population's income and education. In addition to the presented approaches serving as a basis for further geographic generalization, the novel semi-supervised approach provides a foundation for future work seeking to estimate fine-scale information from aerial imagery without the need for label data.

imagery, learning, neighborhood, (17 more...)

arXiv.org Artificial Intelligence

2309.16808

Country:

Asia > Bangladesh (0.04)
South America > Peru (0.04)
North America > United States > New York > New York County > New York City (0.04)
(31 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Food & Agriculture > Agriculture (1.00)
(2 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

AI Designs Decisions

#artificialintelligenceJun-18-2021, 12:05:05 GMT

Havelock Ellis said it is not the attainment of the goal that matters, it is the things met with by the way. He was speaking of philosophy. In business AI is all about goal attainment. The things met along the way are decisions. Decisions constitute a focus of the recent survey by Signal AI of 1,000 C-suite executives in an attempt to estimate the impact of AI on the U.S. economy.

ai design decision, attainment, role expectation, (14 more...)

#artificialintelligence

Industry: Banking & Finance > Economy (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.38)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.36)

Add feedback

Priority to unemployed immigrants? A causal machine learning evaluation of training in Belgium

Cockx, Bart, Lechner, Michael, Bollens, Joost

arXiv.org Machine LearningDec-30-2019

We investigate heterogenous employment effects of Flemish training programmes. Based on administrative individual data, we analyse programme effects at various aggregation levels using Modified Causal Forests (MCF), a causal machine learning estimator for multiple programmes. While all programmes have positive effects after the lock-in period, we find substantial heterogeneity across programmes and types of unemployed. Simulations show that assigning unemployed to programmes that maximise individual gains as identified in our estimation can considerably improve effectiveness. Simplified rules, such as one giving priority to unemployed with low employability, mostly recent migrants, lead to about half of the gains obtained by more sophisticated rules.

current unemployment spell, interaction, unemployment spell, (14 more...)

arXiv.org Machine Learning

1912.12864

Country:

Asia > Middle East > Republic of Türkiye (0.04)
Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)
Africa > Middle East > Morocco (0.04)
(11 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Education > Educational Setting (1.00)
Government > Regional Government (0.92)
Consumer Products & Services (0.92)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Automation and Artificial Intelligence: How machines are affecting people and places

#artificialintelligenceJan-26-2019, 19:33:58 GMT

At first, technologists issued dystopian alarms about the power of automation and artificial intelligence (AI) to destroy jobs. Then came a correction, with a wave of reassurances. Now, the discourse appears to be arriving at a more complicated understanding, suggesting that automation will bring neither apocalypse nor utopia, but instead both benefits and stress alike. Such is the ambiguous and sometimes disembodied nature of the "future of work" discussion. Hence the analysis presented here.

artificial intelligence, automation, occupation, (11 more...)

#artificialintelligence

Industry: Education (0.49)

Technology:

Information Technology > Artificial Intelligence > Robots (0.36)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.36)

Add feedback