Goto

Collaborating Authors

 languagemodel


Temporal Predictors of Outcome in Reasoning Language Models

arXiv.org Artificial Intelligence

The chain-of-thought (CoT) paradigm uses the elicitation of step-by-step rationales as a proxy for reasoning, gradually refining the model's latent representation of a solution. However, it remains unclear just how early a Large Language Model (LLM) internally commits to an eventual outcome. We probe this by training linear classifiers on hidden states after the first t reasoning tokens, showing that eventual correctness is highly predictable after only a few tokens, even when longer outputs are needed to reach a definite answer. We show that, for harder questions, a drop in predictive accuracy highlights a selection artifact: hard items are disproportionately represented in long CoTs. Overall, our results imply that for reasoning models, internal self-assessment of success tends to emerge after only a few tokens, with implications for interpretability and for inference-time control.


SCIRGC: Multi-Granularity Citation Recommendation and Citation Sentence Preference Alignment

arXiv.org Artificial Intelligence

Citations are crucial in scientific research articles as they highlight the connection between the current study and prior work. However, this process is often time-consuming for researchers. In this study, we propose the SciRGC framework, which aims to automatically recommend citation articles and generate citation sentences for citation locations within articles. The framework addresses two key challenges in academic citation generation: 1) how to accurately identify the author's citation intent and find relevant citation papers, and 2) how to generate high-quality citation sentences that align with human preferences. We enhance citation recommendation accuracy in the citation article recommendation module by incorporating citation networks and sentiment intent, and generate reasoning-based citation sentences in the citation sentence generation module by using the original article abstract, local context, citation intent, and recommended articles as inputs. Additionally, we propose a new evaluation metric to fairly assess the quality of generated citation sentences. Through comparisons with baseline models and ablation experiments, the SciRGC framework not only improves the accuracy and relevance of citation recommendations but also ensures the appropriateness of the generated citation sentences in context, providing a valuable tool for interdisciplinary researchers.


Pinaki Laskar on LinkedIn: #llms #languagemodels #machinelearning #chatgpt #gpt3

#artificialintelligence

Are Large Language Models as Stochastic Parroting without any Intellect? Language models may be categorized as probabilistic methods and neural network-based modern language models. A simple probabilistic language model that calculates n-gram probabilities has significant drawbacks. The major one is the context problem. Complicated texts have deep context influencing the choice of the next word.