AITopics | Education

Collaborating Authors

Education

AI-Powered Citation Auditing: A Zero-Assumption Protocol for Systematic Reference Verification in Academic Research

arXiv.org Artificial IntelligenceNov-10-2025

Academic citation integrity faces persistent challenges, with research indicating 20% of citations contain errors and manual verification requiring months of expert time. This paper presents a novel AI-powered methodology for systematic, comprehensive reference auditing using agentic AI with tool-use capabilities. We develop a zero-assumption verification protocol that independently validates every reference against multiple academic databases (Semantic Scholar, Google Scholar, CrossRef) without assuming any citation is correct. The methodology was validated across 30 academic documents (2,581 references) spanning undergraduate projects to doctoral theses and peer-reviewed publications. Results demonstrate 91.7% average verification rate on published PLOS papers, with successful detection of fabricated references, retracted articles, orphan citations, and predatory journals. Time efficiency improved dramatically: 90-minute audits for 916-reference doctoral theses versus months of manual review. The system achieved <0.5% false positive rate while identifying critical issues manual review might miss. This work establishes the first validated AI-agent methodology for academic citation integrity, demonstrating practical applicability for supervisors, students, and institutional quality assurance.

artificial intelligence, machine learning, verification, (15 more...)

arXiv.org Artificial Intelligence

2511.04683

Genre: Research Report > New Finding (0.49)

Industry:

Education (0.48)
Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Ling Team, null, Li, Ang, Liu, Ben, Hu, Binbin, Li, Bing, Zeng, Bingwei, Ye, Borui, Tang, Caizhi, Tian, Changxin, Huang, Chao, Zhang, Chao, Qian, Chen, Ju, Chenchen, Li, Chenchen, Tang, Chengfu, Fu, Chilin, Ren, Chunshao, Wu, Chunwei, Zhang, Cong, Peng, Cunyin, Xu, Dafeng, Wang, Daixin, Zhang, Dalong, Jin, Dingnan, Zhu, Dingyuan, Hu, Dongke, Zhao, Fangzheng, Wu, Feifan, Zhu, Feng, Wang, Gangshan, Zhang, Haitao, Zhao, Hailin, Zhang, Hanxiao, Wang, Hanzi, Qian, Hao, Yu, Haoyi, Zhang, Heng, Zhang, Hongliang, Luan, Hongzhi, Dong, Huirong, Li, Huizhong, Li, Jia, Liu, Jia, Zhu, Jialong, Sha, Jian, Wei, Jianping, Yang, Jiaolong, Ma, Jieyue, Wu, Jiewei, Huang, Jinjing, Tian, Jingyun, Zhang, Jingyuan, Sun, Jinquan, Tu, Juanhui, Liu, Jun, Xu, Jun, Zhou, Jun, Ou, Junjie, Fang, Junpeng, Zhang, Kaihong, Hu, Kaiqin, Shi, Ke, Tang, Kun, Chen, Kunlong, Mei, Lanyin, Liang, Lei, Xu, Lei, Zhang, Libo, Ju, Lin, Yuan, Lin, Zhong, Ling, Ma, Lintao, Liu, Lu, Yu, Lu, Cai, Lun, Zhu, Meiqi, Li, Mengying, Chen, Min, Xue, Minghao, Cai, Minghong, Yin, Mingming, Jiang, Peijie, Zhao, Peilong, Liu, Pingping, Zhao, Qian, Cui, Qing, Huang, Qingxiang, Yang, Qingyuan, Yu, Quankun, Wei, Shaowei, Lian, Shijie, Zheng, Shoujian, Song, Shun, Zhang, Shungen, Zhang, Shuo, Li, Siyuan, Liu, Song, Guo, Ting, Zhao, Tong, Gu, Wanli, Wu, Weichang, Han, Weiguang, Fang, Wenjing, Wang, Wubin, Shu, Xiang, Shi, Xiao, Lan, Xiaoshun, Zhang, Xiaolu, Sun, Xiaqing, Zhao, Xin, Lu, Xingyu, Xu, Xiong, Wang, Xudong, Wang, Xudong, Yang, Xuemin, Yang, Yajie, Xiang, Yang, Li, Yanzhe, Zhang, Yi, Wang, Yilong, Li, Yingxue, Guo, Yongzhen, Fu, Yuzhuo, Wang, Yuanyuan, Yang, Yue, Yu, Yue, Deng, Yufeng, Zhang, Yun, Yu, Yunfei, Zhang, Yuqi, He, Yuxiao, Gui, Zengke, Huan, Zhaoxin, Wang, Zhaoyang, Zhu, Zhibo, Wang, Zhihao, Zhang, Zhiqiang, Wang, Zhoufei, Zeng, Zihang, Liu, Ziqi, Xuan, Zitao, Tang, Zuoli

arXiv.org Artificial IntelligenceNov-10-2025

We introduce Ling 2.0, a series reasoning-oriented language foundation built upon the principle that every activation boosts reasoning capability. Designed to scale from tens of billions to one trillion parameters under a unified Mixture-of-Experts (MoE) paradigm, Ling 2.0 emphasizes high sparsity, cross-scale consistency, and efficiency guided by empirical scaling laws. The series includes three non-thinking (instruct) models - Ling-mini-2.0, Ling-flash-2.0, and Ling-1T - ranging from 16B to 1T total parameters and achieving up to 7-fold active-compute efficiency compared with dense counterparts. Ling 2.0 integrates coordinated innovations across model architecture, pre-training, post-training, and infrastructure: a high-sparsity MoE with MTP for efficient reasoning, reasoning-oriented data and mid-training CoT activation, reinforcement-based fine-tuning (DFT, Evo-CoT), and full-scale FP8 training with fine-grained heterogeneous pipelines. At the trillion scale, Ling-1T establishes a new Pareto frontier of reasoning accuracy versus computational efficiency, demonstrating that sparse activation, when properly aligned with reasoning objectives, enables scalable and efficient intelligence. Collectively, Ling 2.0 provides a coherent, open, and efficient foundation for advancing future reasoning and thinking models, including the Ring series built upon the same base.

data mining, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2510.22115

Country:

Asia (0.93)
Europe > Austria (0.28)
North America > United States (0.28)

Genre: Research Report > New Finding (0.67)

Industry: Education > Educational Setting > Online (0.48)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Add feedback

DRQA: Dynamic Reasoning Quota Allocation for Controlling Overthinking in Reasoning Large Language Models

Yan, Kaiwen, Shi, Xuanqing, Guo, Hongcheng, Wang, Wenxuan, Zhang, Zhuosheng, Qin, Chengwei

arXiv.org Artificial IntelligenceNov-10-2025

Reasoning large language models (RLLMs), such as OpenAI-O3 and DeepSeek-R1, have recently demonstrated remarkable capabilities by performing structured and multi-step reasoning. However, recent studies reveal that RLLMs often suffer from overthinking, i.e., producing unnecessarily lengthy reasoning chains even for simple questions, leading to excessive token consumption and computational inefficiency. Interestingly, we observe that when processing multiple questions in batch mode, RLLMs exhibit more resource-efficient behavior by dynamically compressing reasoning steps for easier problems, due to implicit resource competition. Inspired by this, we propose Dynamic Reasoning Quota Allocation (DRQA), a novel method that transfers the benefits of resource competition from batch processing to single-question inference. Specifically, DRQA leverages batch-generated preference data and reinforcement learning to train the model to allocate reasoning resources adaptively. By encouraging the model to internalize a preference for responses that are both accurate and concise, DRQA enables it to generate concise answers for simple questions while retaining sufficient reasoning depth for more challenging ones. Extensive experiments on a wide range of mathematical and scientific reasoning benchmarks demonstrate that DRQA significantly reduces token usage while maintaining, and in many cases improving, answer accuracy. By effectively mitigating the overthinking problem, DRQA offers a promising direction for more efficient and scalable deployment of RLLMs, and we hope it inspires further exploration into fine-grained control of reasoning behaviors.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2508.17803

Country: Asia > China (0.46)

Genre:

Research Report > New Finding (0.68)
Research Report > Promising Solution (0.48)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)

Add feedback

Trained on Tokens, Calibrated on Concepts: The Emergence of Semantic Calibration in LLMs

Nakkiran, Preetum, Bradley, Arwen, Goliński, Adam, Ndiaye, Eugene, Kirchhof, Michael, Williamson, Sinead

arXiv.org Machine LearningNov-10-2025

Large Language Models (LLMs) often lack meaningful confidence estimates for their outputs. While base LLMs are known to exhibit next-token calibration, it remains unclear whether they can assess confidence in the actual meaning of their responses beyond the token level. We find that, when using a certain sampling-based notion of semantic calibration, base LLMs are remarkably well-calibrated: they can meaningfully assess confidence in open-domain question-answering tasks, despite not being explicitly trained to do so. Our main theoretical contribution establishes a mechanism for why semantic calibration emerges as a byproduct of next-token prediction, leveraging a recent connection between calibration and local loss optimality. The theory relies on a general definition of "B-calibration," which is a notion of calibration parameterized by a choice of equivalence classes (semantic or otherwise). This theoretical mechanism leads to a testable prediction: base LLMs will be semantically calibrated when they can easily predict their own distribution over semantic answer classes before generating a response. We state three implications of this prediction, which we validate through experiments: (1) Base LLMs are semantically calibrated across question-answering tasks, (2) RL instruction-tuning systematically breaks this calibration, and (3) chain-of-thought reasoning breaks calibration. To our knowledge, our work provides the first principled explanation of when and why semantic calibration emerges in LLMs.

calibration, large language model, machine learning, (19 more...)

arXiv.org Machine Learning

2511.04869

Country: North America > United States > Washington > King County > Seattle (0.27)

Genre: Research Report (1.00)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

The Hidden Math of Ocean Waves

WIREDNov-9-2025, 07:00:00 GMT

The math behind even the simplest ocean waves is notoriously uncooperative. A team of Italian mathematicians has made major advances toward understanding it. The best perk of Alberto Maspero's job, he says, is the view from his window. Situated on a hill above the ancient port city of Trieste, Italy, his office at the International School for Advanced Studies overlooks a broad bay at the northern tip of the Adriatic Sea. "It's very inspiring," the mathematician said. "For sure the most beautiful view I've ever had." When the bora is strong enough, it drives the waves into reverse. But they never actually get there.

equation, instability, mathematician, (16 more...)

WIRED

Country:

Europe > Italy > Friuli Venezia Giulia > Trieste Province > Trieste (0.25)
Atlantic Ocean > Mediterranean Sea > Adriatic Sea (0.24)
Asia > Nepal (0.14)
(5 more...)

Industry:

Health & Medicine (0.46)
Education (0.34)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Kim Kardashian misses the mark on the California bar exam, vows to keep trying

Los Angeles TimesNov-8-2025, 23:44:14 GMT

Things to Do in L.A. Tap to enable a layout that focuses on the article. After deciding in 2018 that she wanted to study law, Kim Kardashian has failed the California bar exam on her first attempt. This is read by an automated voice. Please report any issues or inconsistencies here . Shapewear mogul Kim Kardashian announced Saturday that she has failed the California bar exam, seven years after embarking on her law studies.

california bar exam, kardashian, kim kardashian, (12 more...)

Los Angeles Times

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.07)
North America > United States > Texas (0.05)
North America > United States > New York (0.05)
(5 more...)

Genre: Personal (0.48)

Industry:

Leisure & Entertainment (1.00)
Health & Medicine (1.00)
Education (0.71)
(4 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (0.97)

Add feedback

High-dimensional limit theorems for SGD: Momentum and Adaptive Step-sizes

Jagannath, Aukosh, Jones-McCormick, Taj, Sarangian, Varnan

arXiv.org Machine LearningNov-7-2025

We develop a high-dimensional scaling limit for Stochastic Gradient Descent with Polyak Momentum (SGD-M) and adaptive step-sizes. This provides a framework to rigourously compare online SGD with some of its popular variants. We show that the scaling limits of SGD-M coincide with those of online SGD after an appropriate time rescaling and a specific choice of step-size. However, if the step-size is kept the same between the two algorithms, SGD-M will amplify high-dimensional effects, potentially degrading performance relative to online SGD. We demonstrate our framework on two popular learning problems: Spiked Tensor PCA and Single Index Models. In both cases, we also examine online SGD with an adaptive step-size based on normalized gradients. In the high-dimensional regime, this algorithm yields multiple benefits: its dynamics admit fixed points closer to the population minimum and widens the range of admissible step-sizes for which the iterates converge to such solutions. These examples provide a rigorous account, aligning with empirical motivation, of how early preconditioners can stabilize and improve dynamics in settings where online SGD fails.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Machine Learning

2511.03952

Country:

North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)
Europe > Russia (0.04)
Asia > Russia (0.04)
(4 more...)

Genre: Research Report (0.50)

Industry: Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.45)

Add feedback

From Static to Dynamic: Enhancing Offline-to-Online Reinforcement Learning via Energy-Guided Diffusion Stratification

Zu, Lipeng, Zhou, Hansong, Zhang, Xiaonan

arXiv.org Artificial IntelligenceNov-7-2025

Transitioning from offline to online reinforcement learning (RL) poses critical challenges due to distributional shifts between the fixed behavior policy in the offline dataset and the evolving policy during online learning. Although this issue is widely recognized, few methods attempt to explicitly assess or utilize the distributional structure of the offline data itself, leaving a research gap in adapting learning strategies to different types of samples. To address this challenge, we propose an innovative method, Energy-Guided Diffusion Stratification (StratDiff), which facilitates smoother transitions in offline-to-online RL. StratDiff deploys a diffusion model to learn prior knowledge from the offline dataset. It then refines this knowledge through energy-based functions to improve policy imitation and generate offline-like actions during online fine-tuning. The KL divergence between the generated action and the corresponding sampled action is computed for each sample and used to stratify the training batch into offline-like and online-like subsets. Offline-like samples are updated using offline objectives, while online-like samples follow online learning strategies. We demonstrate the effectiveness of StratDiff by integrating it with off-the-shelf methods Cal-QL and IQL. Extensive empirical evaluations on D4RL benchmarks show that StratDiff significantly outperforms existing methods, achieving enhanced adaptability and more stable performance across diverse RL settings.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

2511.03828

Genre: Research Report (0.83)

Industry: Education > Educational Setting > Online (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)

Add feedback

A Characterization of List Language Identification in the Limit

Charikar, Moses, Pabbaraju, Chirag, Tewari, Ambuj

arXiv.org Artificial IntelligenceNov-7-2025

We study the problem of language identification in the limit, where given a sequence of examples from a target language, the goal of the learner is to output a sequence of guesses for the target language such that all the guesses beyond some finite time are correct. Classical results of Gold showed that language identification in the limit is impossible for essentially any interesting collection of languages. Later, Angluin gave a precise characterization of language collections for which this task is possible. Motivated by recent positive results for the related problem of language generation, we revisit the classic language identification problem in the setting where the learner is given the additional power of producing a list of $k$ guesses at each time step. The goal is to ensure that beyond some finite time, one of the guesses is correct at each time step. We give an exact characterization of collections of languages that can be $k$-list identified in the limit, based on a recursive version of Angluin's characterization (for language identification with a list of size $1$). This further leads to a conceptually appealing characterization: A language collection can be $k$-list identified in the limit if and only if the collection can be decomposed into $k$ collections of languages, each of which can be identified in the limit (with a list of size $1$). We also use our characterization to establish rates for list identification in the statistical setting where the input is drawn as an i.i.d. stream from a distribution supported on some language in the collection. Our results show that if a collection is $k$-list identifiable in the limit, then the collection can be $k$-list identified at an exponential rate, and this is best possible. On the other hand, if a collection is not $k$-list identifiable in the limit, then it cannot be $k$-list identified at any rate that goes to zero.

artificial intelligence, identification, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2511.04103

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.54)

Industry: Education (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

PLLuM: A Family of Polish Large Language Models

Kocoń, Jan, Piasecki, Maciej, Janz, Arkadiusz, Ferdinan, Teddy, Radliński, Łukasz, Koptyra, Bartłomiej, Oleksy, Marcin, Woźniak, Stanisław, Walkowiak, Paweł, Wojtasik, Konrad, Moska, Julia, Naskręt, Tomasz, Walkowiak, Bartosz, Gniewkowski, Mateusz, Szyc, Kamil, Motyka, Dawid, Banach, Dawid, Dalasiński, Jonatan, Rudnicka, Ewa, Alberski, Bartłomiej, Walkowiak, Tomasz, Szczęsny, Aleksander, Markiewicz, Maciej, Bernaś, Tomasz, Mazur, Hubert, Żyta, Kamil, Tykierko, Mateusz, Chodak, Grzegorz, Kajdanowicz, Tomasz, Kazienko, Przemysław, Karlińska, Agnieszka, Seweryn, Karolina, Kołos, Anna, Chrabąszcz, Maciej, Lorenc, Katarzyna, Krasnodębska, Aleksandra, Wilczek, Artur, Dziewulska, Katarzyna, Betscher, Paula, Cieślińska, Zofia, Kowol, Katarzyna, Mikoś, Daria, Trzciński, Maciej, Krutul, Dawid, Kozłowski, Marek, Dadas, Sławomir, Poświata, Rafał, Perełkiewicz, Michał, Grębowiec, Małgorzata, Kazuła, Maciej, Białas, Marcin, Roszko, Roman, Roszko, Danuta, Vaičenonienė, Jurgita, Utka, Andrius, Levchuk, Paweł, Kowalski, Paweł, Prawdzic-Jankowska, Irena, Ogrodniczuk, Maciej, Borys, Monika, Bulińska, Anna, Gumienna, Wiktoria, Kieraś, Witold, Komosińska, Dorota, Krasnowska-Kieraś, Katarzyna, Kobyliński, Łukasz, Lewandowska, Martyna, Łaziński, Marek, Łątkowski, Mikołaj, Mastalerz, Dawid, Milewicz, Beata, Mykowiecka, Agnieszka Anna, Peljak-Łapińska, Angelika, Penno, Sandra, Przybysz, Zuzanna, Rudolf, Michał, Rybak, Piotr, Saputa, Karolina, Tomaszewska, Aleksandra, Wawer, Aleksander, Woliński, Marcin, Wołoszyn, Joanna, Wróblewska, Alina, Żuk, Bartosz, Żarnecki, Filip, Kaczyński, Konrad, Cichosz, Anna, Deckert, Zuzanna, Garnys, Monika, Grabarczyk, Izabela, Janowski, Wojciech, Karasińska, Sylwia, Kujawiak, Aleksandra, Misztela, Piotr, Szymańska, Maria, Walkusz, Karolina, Siek, Igor, Kwiatkowski, Jakub, Pęzik, Piotr

arXiv.org Artificial IntelligenceNov-7-2025

Large Language Models (LLMs) play a central role in modern artificial intelligence, yet their development has been primarily focused on English, resulting in limited support for other languages. We present PLLuM (Polish Large Language Model), the largest open-source family of foundation models tailored specifically for the Polish language. Developed by a consortium of major Polish research institutions, PLLuM addresses the need for high-quality, transparent, and culturally relevant language models beyond the English-centric commercial landscape. We describe the development process, including the construction of a new 140-billion-token Polish text corpus for pre-training, a 77k custom instructions dataset, and a 100k preference optimization dataset. A key component is a Responsible AI framework that incorporates strict data governance and a hybrid module for output correction and safety filtering. We detail the models' architecture, training procedures, and alignment techniques for both base and instruction-tuned variants, and demonstrate their utility in a downstream task within public administration. By releasing these models publicly, PLLuM aims to foster open research and strengthen sovereign AI technologies in Poland.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2511.03823

Country:

North America (1.00)
Europe > Poland (1.00)
Asia (1.00)

Genre:

Overview (1.00)
Research Report > New Finding (0.92)

Industry:

Law > Intellectual Property & Technology Law (1.00)
Law Enforcement & Public Safety (1.00)
Information Technology > Security & Privacy (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.92)

Add feedback