AITopics

The automatic curation of discussion forums in online courses requires constant updates, making frequent retraining of Large Language Models (LLMs) a resource-intensive process. To circumvent the need for costly fine-tuning, this paper proposes and evaluates the use of Bayesian fusion. The approach combines the multidimensional classification scores of a pre-trained generic LLM with those of a classifier trained on local data. The performance comparison demonstrated that the proposed fusion improves the results compared to each classifier individually, and is competitive with the LLM fine-tuning approach.
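The abstract above does not state the fusion rule, so the following is only a minimal sketch of one common form of Bayesian fusion, not the paper's method. It assumes the two classifiers are conditionally independent given the class, so that the fused posterior is proportional to the product of the two posteriors divided by the class prior. The function name `bayesian_fusion`, its parameters, and the example numbers are hypothetical.

```python
import math


def bayesian_fusion(p_llm, p_local, prior, eps=1e-12):
    """Fuse two per-class posterior vectors (illustrative assumption, not the paper's rule).

    Under conditional independence of the two classifiers given the class:
        fused(c) is proportional to p_llm(c) * p_local(c) / prior(c)
    """
    # Work in log space for numerical stability; eps guards against log(0).
    log_scores = [
        math.log(a + eps) + math.log(b + eps) - math.log(c + eps)
        for a, b, c in zip(p_llm, p_local, prior)
    ]
    # Subtract the maximum before exponentiating, then normalize to sum to 1.
    top = max(log_scores)
    unnormalized = [math.exp(s - top) for s in log_scores]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


# Hypothetical scores for a two-class problem with a uniform prior.
fused = bayesian_fusion(p_llm=[0.6, 0.4], p_local=[0.7, 0.3], prior=[0.5, 0.5])
print(fused)
```

When both classifiers lean toward the same class, the fused posterior is sharper than either input; when both simply return the prior, the fusion returns the prior unchanged.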

classification, large language model, machine learning, (16 more...)

2508.10008

Country:

South America (0.47)
North America > United States (0.28)

Genre:

Instructional Material > Course Syllabus & Notes (0.66)
Research Report > New Finding (0.46)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)
(2 more...)

From Answers to Questions: EQGBench for Evaluating LLMs' Educational Question Generation

Zhou, Chengliang, Wang, Mei, Zhang, Ting, Zhu, Qiannan, Li, Jian, Huang, Hua

Large Language Models (LLMs) have demonstrated remarkable capabilities in mathematical problem-solving. However, the transition from providing answers to generating high-quality educational questions presents significant challenges that remain underexplored. To advance Educational Question Generation (EQG) and facilitate LLMs in generating pedagogically valuable and educationally effective questions, we introduce EQGBench, a comprehensive benchmark specifically designed for evaluating LLMs' performance in Chinese EQG. EQGBench establishes a five-dimensional evaluation framework supported by a dataset of 900 evaluation samples spanning three fundamental middle school disciplines: mathematics, physics, and chemistry. The dataset incorporates user queries with varying knowledge points, difficulty gradients, and question type specifications to simulate realistic educational scenarios. Through systematic evaluation of 46 mainstream large models, we reveal significant room for development in generating questions that reflect educational value and foster students' comprehensive abilities.

large language model, machine learning, question generation, (19 more...)

2508.10005

Country:

Europe (0.46)
North America > United States (0.28)

Genre:

Research Report (0.82)
Instructional Material (0.68)

Industry: Education > Educational Setting > K-12 Education > Middle School (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Thematic and Task-Based Categorization of K-12 GenAI Usages with Hierarchical Topic Modeling

Schneider, Johannes, Hasler, Béatrice S., Varrone, Michaela, Hoya, Fabian, Schroffenegger, Thomas, Mah, Dana-Kristin, Peböck, Karl

We analyze anonymous interaction data of minors in classrooms spanning several months, schools, and subjects employing a novel, simple topic modeling approach. Specifically, we categorize more than 17,000 messages generated by students, teachers, and ChatGPT in two dimensions: content (such as nature and people) and tasks (such as writing and explaining). Our hierarchical categorization, done separately for each dimension, includes exemplary prompts and provides both a high-level overview and tangible insights. Prior works mostly lack a content or thematic categorization. While task categorizations are more prevalent in education, most have not been supported by real-world data for K-12. In turn, it is not surprising that our analysis yielded a number of novel applications. In deriving these insights, we found that many of the well-established classical and emerging computational methods, i.e., topic modeling, for analysis of large amounts of text underperform, leading us to directly apply state-of-the-art LLMs with adequate pre-processing to achieve hierarchical topic structures with better human alignment through explicit instructions than prior approaches. Our findings support fellow researchers, teachers and students in enriching the usage of GenAI, while our discussion also highlights a number of concerns and open questions for future research.

large language model, machine learning, natural language, (21 more...)

2508.09997

Country:

Europe (1.00)
North America > Mexico (0.28)

Genre:

Instructional Material > Course Syllabus & Notes (0.93)
Research Report > New Finding (0.88)

Industry:

Education > Curriculum > Subject-Specific Education (1.00)
Media (0.93)
Education > Educational Setting > K-12 Education > Secondary School (0.69)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.31)

PromptTSS: A Prompting-Based Approach for Interactive Multi-Granularity Time Series Segmentation

Chang, Ching, Lo, Ming-Chih, Peng, Wen-Chih, Chen, Tien-Fu

Multivariate time series data, collected across various fields such as manufacturing and wearable technology, exhibit states at multiple levels of granularity, from coarse-grained system behaviors to fine-grained, detailed events. Effectively segmenting and integrating states across these different granularities is crucial for tasks like predictive maintenance and performance optimization. However, existing time series segmentation methods face two key challenges: (1) the inability to handle multiple levels of granularity within a unified model, and (2) limited adaptability to new, evolving patterns in dynamic environments. To address these challenges, we propose PromptTSS, a novel framework for time series segmentation with multi-granularity states. PromptTSS uses a unified model with a prompting mechanism that leverages label and boundary information to guide segmentation, capturing both coarse- and fine-grained patterns while adapting dynamically to unseen patterns. Experiments show PromptTSS improves accuracy by 24.49% in multi-granularity segmentation, 17.88% in single-granularity segmentation, and up to 599.24% in transfer learning, demonstrating its adaptability to hierarchical states and evolving time series dynamics. Our code is available at https://github.com/blacksnail789521/PromptTSS.

large language model, machine learning, segmentation, (19 more...)

doi: 10.1145/3746252.3761142

2506.1117

Country:

Asia (0.29)
North America > United States (0.28)
Europe > Austria (0.28)

Genre:

Research Report (0.50)
Instructional Material (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Human Computer Interaction (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Neural Information Processing Systems Aug-14-2025, 23:37:17 GMT

Byzantine-tolerant federated Gaussian process regression for streaming data

We also characterize the lower and upper bounds of the predictive variance.

agent, algorithm, gpr, (12 more...)

Country:

North America > United States > Pennsylvania (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Instructional Material (0.46)

Industry: Information Technology (0.95)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)

Neural Information Processing Systems Aug-14-2025, 19:43:01 GMT

Impression learning: Online representation learning with synaptic plasticity (Appendices)

Our derivation of the update for IL (Eq. 3) is based on an expansion of log We examine the consequences of this bias formula for our specific model. Note that the update term in Eq. (S1) is However, we will show in Appendix C that these updates may have high variance. 'reparameterization trick,' in which a change of variables allows the use of stochastic gradient descent It is worth noting that this 'reparameterization' will work only for additive Gaussian noise. As already mentioned, WS can be viewed as a special case of IL. Since WS is a special case of IL, the bias properties of its individual samples are identical.

artificial intelligence, machine learning, variance, (18 more...)

Genre: Instructional Material > Online (0.40)

Industry:

Energy > Oil & Gas (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)

Neural Information Processing Systems Aug-14-2025, 19:42:57 GMT

Impression learning: Online representation learning with synaptic plasticity

Because the true underlying structure of the data is rarely accessible, this "representation learning" must be

inference, neuron, stimuli, (17 more...)

Country:

North America > United States (0.14)
Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Instructional Material > Online (0.40)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.95)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Neural Information Processing Systems Aug-14-2025, 15:48:45 GMT

End-to-End Stochastic Optimization with Energy-Based Model

Many real-life decision making tasks are stochastic optimization problems, where one needs to make decisions to minimize a cost function that involves stochastic parameters.

artificial intelligence, machine learning, optimization problem, (16 more...)

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Hawaii (0.04)
Asia > China > Hubei Province > Wuhan (0.04)

Genre: Instructional Material (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Epidemiology (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.95)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Neural Information Processing Systems Aug-14-2025, 11:14:07 GMT

503e7dbbd6217b9a591f3322f39b5a6c-Paper.pdf

artificial intelligence, curriculum, machine learning, (17 more...)

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Instructional Material (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing Systems Aug-14-2025, 05:18:51 GMT

A Provably Efficient Sample Collection Strategy for Reinforcement Learning

One of the challenges in online reinforcement learning (RL) is that the agent needs to trade off the exploration of the environment and the exploitation of the samples to optimize its behavior. Whether we optimize for regret, sample complexity, state-space coverage or model estimation, we need to strike a different exploration-exploitation trade-off.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Country: North America > United States (0.46)

Genre: Instructional Material (0.34)

Industry: Energy > Oil & Gas > Upstream (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)