AITopics | Education

CANDI: Hybrid Discrete-Continuous Diffusion Models

Pynadath, Patrick, Shi, Jiaxin, Zhang, Ruqi

arXiv.org Machine Learning | Oct-30-2025

While continuous diffusion has shown remarkable success in continuous domains such as image generation, its direct application to discrete data has underperformed compared to purely discrete formulations. This gap is counterintuitive, given that continuous diffusion learns score functions that enable joint evolution across multiple positions. To understand this gap, we introduce token identifiability as an analytical framework for understanding how Gaussian noise corrupts discrete data through two mechanisms: discrete identity corruption and continuous rank degradation. We reveal that these mechanisms scale differently with vocabulary size, creating a temporal dissonance: at noise levels where discrete corruption preserves enough structure for conditional learning, continuous denoising is trivial; at noise levels where continuous denoising is meaningful, discrete corruption destroys nearly all conditional structure. To solve this, we propose CANDI (Continuous ANd DIscrete diffusion), a hybrid framework that decouples discrete and continuous corruption, enabling simultaneous learning of both conditional structure and continuous geometry. We empirically validate the temporal dissonance phenomenon and demonstrate that CANDI successfully avoids it. This unlocks the benefits of continuous diffusion for discrete spaces: on controlled generation, CANDI enables classifier-based guidance with off-the-shelf classifiers through simple gradient addition; on text generation, CANDI outperforms masked diffusion at low NFE, demonstrating the value of learning continuous gradients for discrete spaces. We include the code on the project page available here: https://patrickpynadath1.github.io/candi-lander
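
The claim that Gaussian noise corrupts token identity faster as the vocabulary grows can be checked with a small simulation. The sketch below is an illustration, not the paper's code: it assumes tokens are one-hot vectors with independent Gaussian noise added to every coordinate, and it treats "identity preserved" as the argmax still pointing at the true token. The function name `identity_preserved_rate` and its parameters are hypothetical.

```python
import random


def identity_preserved_rate(vocab_size, sigma, trials=2000, seed=0):
    """Estimate how often the argmax of a noised one-hot vector is still the true token.

    Assumption (not from the paper): the token is a one-hot vector and each
    coordinate receives independent N(0, sigma^2) noise.
    """
    rng = random.Random(seed)
    kept = 0
    for _ in range(trials):
        # The true coordinate carries the signal 1.0 plus noise.
        true_value = 1.0 + rng.gauss(0.0, sigma)
        # Every other coordinate is pure noise; the identity survives only
        # if none of them exceeds the true coordinate.
        if all(rng.gauss(0.0, sigma) < true_value for _ in range(vocab_size - 1)):
            kept += 1
    return kept / trials


if __name__ == "__main__":
    for vocab_size in (10, 100, 1000):
        rate = identity_preserved_rate(vocab_size, sigma=0.5, trials=500)
        print(vocab_size, rate)
```

At a fixed noise level the estimated rate falls as `vocab_size` grows, which matches the abstract's point that discrete corruption scales with vocabulary size.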

large language model, machine learning, natural language, (17 more...)

arXiv.org Machine Learning

2510.2251

Country:

Asia (0.93)
Europe > United Kingdom > England (0.28)
North America > United States > Indiana (0.28)

Genre: Research Report > New Finding (0.45)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Transportation (0.93)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

Nvidia becomes first $5 trillion firm as AI rally picks up steam

The Japan Times | Oct-29-2025, 23:54:00 GMT

Nvidia CEO Jensen Huang speaks during an event in Washington on Tuesday. Nvidia achieved a historic $5 trillion market capitalization on Wednesday as CEO Jensen Huang's spree of deals catapults the artificial intelligence frenzy to new heights. The shares closed 3.1% higher at $207.16, propelling Nvidia just over the milestone. It's only been four months since the company cracked the $4 trillion barrier, and the rally has accelerated as Huang forges new agreements to supply companies from Nokia Oyj to Samsung Electronics and Hyundai Motor Group with chips. Nvidia has become the most-important stock in a bull market that's been driven by optimism for AI to revolutionize the global economy.

crime & legal science, nvidia, politics crime & legal science, (5 more...)

Country:

North America > United States (0.16)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.12)
Asia > Middle East > Palestine > Gaza Strip > Gaza Governorate > Gaza (0.05)
(2 more...)

Industry:

Law (1.00)
Information Technology > Hardware (1.00)
Banking & Finance > Trading (1.00)
Education > Educational Setting > K-12 Education (0.31)

Technology:

Information Technology > Artificial Intelligence (0.91)
Information Technology > Communications > Social Media (0.78)

The AI job cuts are here - or are they?

BBC News | Oct-29-2025, 00:09:22 GMT

Amazon's move this week to slash thousands of corporate jobs fed into a longstanding anxiety: that Artificial Intelligence is starting to replace workers. The tech giant joined a growing list of companies in the US that have pointed to AI technology as a reason behind layoffs. But some question whether AI is fully to blame - and have voiced scepticism that recent high-profile layoffs are a telling sign of the technology's effect on employment. Chegg, the online education firm, cited the new realities of AI as it announced a 45% reduction in workforce on Monday.

ai job cut, amazon, gimbel, (16 more...)

Country:

Asia > South Korea (0.16)
South America (0.15)
North America > Central America (0.15)
(16 more...)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Education (1.00)
Banking & Finance > Economy (1.00)
Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

DiNo and RanBu: Lightweight Predictions from Shallow Random Forests

Santos, Tiago Mendonça dos, Izbicki, Rafael, Esteves, Luís Gustavo

arXiv.org Machine Learning | Oct-29-2025

Random Forest ensembles are a strong baseline for tabular prediction tasks, but their reliance on hundreds of deep trees often results in high inference latency and memory demands, limiting deployment in latency-sensitive or resource-constrained environments. We introduce DiNo (Distance with Nodes) and RanBu (Random Bushes), two shallow-forest methods that convert a small set of depth-limited trees into efficient, distance-weighted predictors. DiNo measures cophenetic distances via the most recent common ancestor of observation pairs, while RanBu applies kernel smoothing to Breiman's classical proximity measure. Both approaches operate entirely after forest training: no additional trees are grown, and tuning of the single bandwidth parameter $h$ requires only lightweight matrix-vector operations. Across three synthetic benchmarks and 25 public datasets, RanBu matches or exceeds the accuracy of full-depth random forests, particularly in high-noise settings, while reducing training plus inference time by up to 95%. DiNo achieves the best bias-variance trade-off in low-noise regimes at a modest computational cost. Both methods extend directly to quantile regression, maintaining accuracy with substantial speed gains. The implementation is available as an open-source R/C++ package at https://github.com/tiagomendonca/dirf. We focus on structured tabular random samples (i.i.d.), leaving extensions to other modalities for future work.
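
The idea of kernel smoothing over Breiman's proximity can be sketched in a few lines. Breiman's proximity between two observations is the fraction of trees in which they fall in the same leaf. Everything else below is an assumption made for illustration: the exponential kernel `exp(-(1 - proximity) / h)`, the function names, and the representation of each observation as a list of leaf ids (one per tree) are hypothetical and are not taken from the authors' R/C++ package.

```python
import math


def breiman_proximity(leaves_a, leaves_b):
    """Fraction of trees in which two observations land in the same leaf."""
    shared = sum(1 for a, b in zip(leaves_a, leaves_b) if a == b)
    return shared / len(leaves_a)


def kernel_smoothed_predict(query_leaves, train_leaves, train_targets, h):
    """Weighted average of training targets, weighted by a kernel on proximity.

    Assumed kernel (illustrative only): exp(-(1 - proximity) / h), where h is
    the bandwidth. Small h trusts only near-identical leaf patterns; large h
    approaches a plain average.
    """
    weights = [
        math.exp(-(1.0 - breiman_proximity(query_leaves, row)) / h)
        for row in train_leaves
    ]
    total = sum(weights)
    return sum(w * y for w, y in zip(weights, train_targets)) / total


if __name__ == "__main__":
    # Leaf ids for three training observations across three shallow trees.
    train_leaves = [[1, 2, 3], [1, 2, 3], [4, 5, 6]]
    train_targets = [10.0, 10.0, 0.0]
    print(kernel_smoothed_predict([1, 2, 3], train_leaves, train_targets, h=0.05))
```

Because the leaf assignments are fixed once the forest is trained, changing `h` only re-weights existing proximities, which is consistent with the abstract's statement that tuning needs no additional trees.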

artificial intelligence, machine learning, mrca, (18 more...)

arXiv.org Machine Learning

2510.23624

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > South Korea > Seoul > Seoul (0.04)
South America > Brazil > São Paulo (0.04)
(5 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Education (0.67)
Banking & Finance > Real Estate (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.92)

CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?

Zong, Qing, Liu, Jiayu, Zheng, Tianshi, Li, Chunyang, Xu, Baixuan, Shi, Haochen, Wang, Weiqi, Wang, Zhaowei, Chan, Chunkit, Song, Yangqiu

arXiv.org Artificial Intelligence | Oct-29-2025

Accurate confidence calibration in Large Language Models (LLMs) is critical for safe use in high-stakes domains, where clear verbalized confidence enhances user trust. Traditional methods that mimic reference confidence expressions often fail to capture the reasoning needed for accurate confidence assessment. We propose natural language critiques as a solution, ideally suited for confidence calibration, as precise gold confidence labels are hard to obtain and often require multiple generations. This paper studies how natural language critiques can enhance verbalized confidence, addressing: (1) What to critique: uncertainty (question-focused) or confidence (answer-specific)? Analysis shows confidence suits multiple-choice tasks, while uncertainty excels in open-ended scenarios. (2) How to critique: self-critique or critique calibration training? We propose Self-Critique, enabling LLMs to critique and optimize their confidence beyond mere accuracy, and CritiCal, a novel Critique Calibration training method that leverages natural language critiques to improve confidence calibration, moving beyond direct numerical optimization. Experiments show that CritiCal significantly outperforms Self-Critique and other competitive baselines, even surpassing its teacher model, GPT-4o, in complex reasoning tasks. CritiCal also shows robust generalization in out-of-distribution settings, advancing the reliability of LLMs.

calibration, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2510.24505

Country:

Asia (0.68)
Europe > Austria > Vienna (0.16)
North America > United States > Florida > Miami-Dade County > Miami (0.14)

Genre: Research Report (1.00)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Local Performance vs. Out-of-Distribution Generalization: An Empirical Analysis of Personalized Federated Learning in Heterogeneous Data Environments

Hussaini, Mortesa, Theiß, Jan, Stein, Anthony

arXiv.org Artificial Intelligence | Oct-29-2025

In the context of Federated Learning with heterogeneous data environments, local models tend to converge to their own local optima during local training steps, deviating from the overall data distribution. Aggregation of these local updates, e.g., with FedAvg, often does not align with the global model optimum (client drift), resulting in an update that is suboptimal for most clients. Personalized Federated Learning approaches address this challenge by focusing exclusively on the average local performance of clients' models on their own data distributions. Generalization to out-of-distribution samples, which is a substantial benefit of FedAvg and represents a significant component of robustness, appears to be inadequately incorporated into the assessment and evaluation processes. This study involves a thorough evaluation of Federated Learning approaches, encompassing both their local performance and their generalization capabilities. To this end, we examine different stages within a single communication round to enable a more nuanced understanding of the considered metrics. Furthermore, we propose and incorporate a modified version of FedAvg, designated as Federated Learning with Individualized Updates (FLIU), which extends the algorithm with a straightforward individualization step using an adaptive personalization factor. We evaluate and compare the approaches empirically using MNIST and CIFAR-10 under various distributional conditions, including benchmark IID and pathological non-IID, as well as additional novel test environments with Dirichlet distribution specifically developed to stress the algorithms on complex data heterogeneity.
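
The relationship between FedAvg aggregation and a per-client individualization step can be sketched as follows. FedAvg's sample-size-weighted parameter average is standard. The individualization shown here, a linear blend of global and local parameters controlled by `alpha`, is an assumption for illustration: the abstract does not say how FLIU computes or adapts its personalization factor, and the function names are hypothetical.

```python
def fedavg(client_weights, client_sizes):
    """Sample-size-weighted average of client parameter vectors (FedAvg)."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]


def individualize(global_weights, local_weights, alpha):
    """Blend global and local parameters for one client.

    Assumed form (illustrative only): alpha = 0 keeps the aggregated global
    model, alpha = 1 keeps the client's own local model.
    """
    return [(1 - alpha) * g + alpha * l for g, l in zip(global_weights, local_weights)]


if __name__ == "__main__":
    clients = [[0.0, 2.0], [4.0, 6.0]]
    sizes = [1, 3]
    global_model = fedavg(clients, sizes)
    print(global_model)
    print(individualize(global_model, clients[0], alpha=0.5))
```

A factor near 0 favors out-of-distribution generalization through the shared model, while a factor near 1 favors local performance, which is the trade-off the study evaluates.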

artificial intelligence, generalization, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2510.24503

Genre: Research Report > New Finding (0.93)

Industry: Education (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

AdaRewriter: Unleashing the Power of Prompting-based Conversational Query Reformulation via Test-Time Adaptation

Lai, Yilong, Wu, Jialong, Wang, Zhenglin, Zhou, Deyu

arXiv.org Artificial Intelligence | Oct-29-2025

Prompting-based conversational query reformulation has emerged as a powerful approach for conversational search, refining ambiguous user queries into standalone search queries. Best-of-N reformulation over candidates generated via prompting shows promising scaling potential. However, neither previous tuning methods (training time) nor adaptation approaches (test time) fully realize these benefits. In this paper, we propose AdaRewriter, a novel framework for query reformulation using an outcome-supervised reward model via test-time adaptation. By training a lightweight reward model with contrastive ranking loss, AdaRewriter selects the most promising reformulation during inference. Notably, it can operate effectively in black-box systems, including commercial LLM APIs. Experiments on five conversational search datasets show that AdaRewriter significantly outperforms the existing methods across most settings, demonstrating the potential of test-time adaptation for conversational query reformulation.
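
Best-of-N selection with a learned scorer reduces to scoring each candidate and keeping the highest. The sketch below assumes a generic `score_fn` standing in for the reward model and uses a pairwise logistic ranking loss as a stand-in for the contrastive ranking loss; the abstract does not give the exact loss, so both the loss form and the function names are hypothetical.

```python
import math


def select_best(candidates, score_fn):
    """Return the candidate reformulation with the highest reward score."""
    return max(candidates, key=score_fn)


def pairwise_ranking_loss(score_better, score_worse):
    """Logistic ranking loss for one (better, worse) candidate pair.

    Assumed stand-in for the paper's contrastive ranking loss: the loss is
    small when the better candidate is scored well above the worse one.
    """
    return math.log(1.0 + math.exp(-(score_better - score_worse)))


if __name__ == "__main__":
    candidates = [
        "weather",
        "weather in Paris",
        "weather in Paris tomorrow morning",
    ]
    # Toy scorer for demonstration only: longer rewrites score higher.
    print(select_best(candidates, score_fn=len))
    print(pairwise_ranking_loss(2.0, 0.5))
```

Since selection only needs candidate texts and their scores, this style of test-time adaptation works with black-box generators, as the abstract notes for commercial LLM APIs.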

computational linguistic, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2506.01381

Country:

North America > United States (1.00)
Asia > Middle East > UAE (0.46)

Genre: Research Report > New Finding (0.93)

Industry:

Government (0.68)
Education (0.46)
Transportation > Air (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions

Luo, Cheng, Wang, Jianghui, Li, Bing, Song, Siyang, Ghanem, Bernard

arXiv.org Artificial Intelligence | Oct-29-2025

In this paper, we introduce Online Multimodal Conversational Response Generation (OMCRG), a novel task designed to produce synchronized verbal and non-verbal listener feedback online, based on the speaker's multimodal inputs. OMCRG captures natural dyadic interactions and introduces new challenges in aligning generated audio with listeners' facial responses. To tackle these challenges, we incorporate text as an intermediate modality to connect audio and facial responses. We propose OmniResponse, a Multimodal Large Language Model (MLLM) that autoregressively generates accurate multimodal listener responses. OmniResponse leverages a pretrained LLM enhanced with two core components: Chrono-Text Markup, which precisely timestamps generated text tokens, and TempoVoice, a controllable online text-to-speech (TTS) module that outputs speech synchronized with facial responses. To advance OMCRG research, we offer ResponseNet, a dataset of 696 detailed dyadic interactions featuring synchronized split-screen videos, multichannel audio, transcripts, and annotated facial behaviors. Comprehensive evaluations on ResponseNet demonstrate that OmniResponse outperforms baseline models in terms of semantic speech content, audio-visual synchronization, and generation quality. Our dataset, code, and models are publicly available.

arxiv preprint arxiv, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2505.21724

Genre:

Research Report (1.00)
Instructional Material (0.68)

Industry:

Information Technology (0.68)
Health & Medicine (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)
Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Hybrid Deep Learning Model to Estimate Cognitive Effort from fNIRS Signals

Sharmin, Shayla, Barmaki, Roghayeh Leila

arXiv.org Artificial Intelligence | Oct-29-2025

This study estimates cognitive effort based on functional near-infrared spectroscopy data and performance scores using a hybrid DeepNet model. The estimation of cognitive effort enables educators to modify material to enhance learning effectiveness and student engagement. In this study, we collected oxygenated hemoglobin using functional near-infrared spectroscopy during an educational quiz game. Participants (n=16) responded to 16 questions in a Unity-based educational game, each within a 30-second response time limit. We used DeepNet models to predict the performance score from the oxygenated hemoglobin, and compared traditional machine learning and DeepNet models to determine which approach provides better accuracy in predicting performance scores. The results show that the proposed CNN-GRU model performs best, reaching 73% and outperforming the other models. After the prediction, we used the predicted score and the oxygenated hemoglobin to observe cognitive effort by calculating relative neural efficiency and involvement in our test cases. Our results show that even with moderate accuracy, the predicted cognitive effort closely follows the actual trends. These findings can be helpful in designing and improving learning environments and provide valuable insights into learning materials.
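
Relative neural efficiency and involvement are commonly computed from standardized performance and effort scores. The sketch below assumes that common z-score formulation, efficiency = (zP - zE) / sqrt(2) and involvement = (zP + zE) / sqrt(2), with oxygenated hemoglobin as the effort measure. The abstract does not state the formulas the authors used, so treat this as an illustration; the function names are hypothetical.

```python
import math
import statistics


def z_scores(values):
    """Standardize values using the sample mean and sample standard deviation."""
    mean = statistics.mean(values)
    spread = statistics.stdev(values)
    return [(v - mean) / spread for v in values]


def efficiency_and_involvement(performance, effort):
    """Return (efficiency, involvement) lists under the assumed z-score formulation."""
    z_p = z_scores(performance)
    z_e = z_scores(effort)
    # High performance with low effort gives high efficiency.
    efficiency = [(p - e) / math.sqrt(2) for p, e in zip(z_p, z_e)]
    # High performance with high effort gives high involvement.
    involvement = [(p + e) / math.sqrt(2) for p, e in zip(z_p, z_e)]
    return efficiency, involvement


if __name__ == "__main__":
    scores = [1, 2, 3]          # toy performance scores
    hemoglobin = [3, 2, 1]      # toy effort measurements
    print(efficiency_and_involvement(scores, hemoglobin))
```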

artificial intelligence, cognitive effort, machine learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3747327.3764901

2504.13883

Country: North America > United States > Delaware > New Castle County > Newark (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis

Chen, Xuanzhong, Qiao, Zile, Chen, Guoxin, Su, Liangcai, Zhang, Zhen, Wang, Xinyu, Xie, Pengjun, Huang, Fei, Zhou, Jingren, Jiang, Yong

arXiv.org Artificial Intelligence | Oct-29-2025

Training large language model agents on tasks at the frontier of their capabilities is key to unlocking advanced reasoning. We introduce a data synthesis approach inspired by the educational theory of the Zone of Proximal Development (ZPD), which defines this frontier as tasks an LLM cannot solve alone but can master with guidance. To operationalize this, we present the AgentFrontier Engine, an automated pipeline that synthesizes high-quality, multidisciplinary data situated precisely within the LLM's ZPD. This engine supports both continued pre-training with knowledge-intensive data and targeted post-training on complex reasoning tasks. From the same framework, we derive the ZPD Exam, a dynamic and automated benchmark designed to evaluate agent capabilities on these frontier tasks. We train the AgentFrontier-30B-A3B model on our synthesized data; it achieves state-of-the-art results on demanding benchmarks like Humanity's Last Exam, even surpassing some leading proprietary agents. Our work demonstrates that a ZPD-guided approach to data synthesis offers a scalable and effective path toward building more capable LLM agents.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2510.24695

Country: Europe > Austria (0.28)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Musculoskeletal (1.00)
Health & Medicine > Diagnostic Medicine (1.00)
Education (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
