AITopics | lollipop

Collaborating Authors

lollipop

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The magic of making candy canes by hand

How the candy makers at Hammond's Candies have made the sweet treats for over 100 years. Decembmer 26 is National Candy Cane Day. Breakthroughs, discoveries, and DIY tips sent every weekday. Candy canes are a holiday staple with roots dating back to the 1600s. The story suggests that in 1670, a choirmaster in Cologne, Germany, gave children these sugary sticks shaped like a shepherd's staff for the long nativity church service.

candy cane, hammond, popular science team, (9 more...)

Popular Science

Country:

Europe > Germany > North Rhine-Westphalia > Cologne Region > Cologne (0.25)
North America > United States > Colorado > Denver County > Denver (0.05)

Genre: Research Report > New Finding (0.36)

Technology: Information Technology > Artificial Intelligence (0.36)

Add feedback

9d5609613524ecf4f15af0f7b31abca4-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 06:40:04 GMT

exemplar, large language model, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania (0.04)
Asia > Vietnam (0.04)
North America > Mexico (0.04)

Industry:

Media (0.93)
Consumer Products & Services (0.67)
Leisure & Entertainment > Sports > Basketball (0.46)
Leisure & Entertainment > Sports > Football (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

8bb0d291acd4acf06ef112099c16f326-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-16-2025, 20:00:57 GMT

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.04)
North America > United States (0.04)
North America > Mexico (0.04)

Industry:

Leisure & Entertainment (1.00)
Health & Medicine > Consumer Health (0.67)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.77)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.49)

Add feedback

How Chain-of-Thought Works? Tracing Information Flow from Decoding, Projection, and Activation

Yang, Hao, Zhao, Qinghua, Li, Lei

arXiv.org Artificial IntelligenceJul-29-2025

Chain-of-Thought (CoT) prompting significantly enhances model reasoning, yet its internal mechanisms remain poorly understood. We analyze CoT's operational principles by reversely tracing information flow across decoding, projection, and activation phases. Our quantitative analysis suggests that CoT may serve as a decoding space pruner, leveraging answer templates to guide output generation, with higher template adherence strongly correlating with improved performance. Furthermore, we surprisingly find that CoT modulates neuron engagement in a task-dependent manner: reducing neuron activation in open-domain tasks, yet increasing it in closed-domain scenarios. These findings offer a novel mechanistic interpretability framework and critical insights for enabling targeted CoT interventions to design more efficient and robust prompts. We released our code and data at https://anonymous.4open.science/r/cot-D247.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2507.20758

Country:

Asia (0.69)
North America > United States (0.46)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Sports > Football (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)

Add feedback

Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences

Liu, Xiangyang, He, Junliang, Qiu, Xipeng

arXiv.org Artificial IntelligenceApr-1-2025

Large language models (LLMs) can perform complex reasoning by generating intermediate thoughts under zero-shot or few-shot settings. However, zero-shot prompting always encounters low performance, and the superior performance of few-shot prompting hinges on the manual-crafted demonstrations. In this paper, we present RoSE (Reasoning with Orchestrated Streaming Experiences), a general framework for solving reasoning tasks that can self-improve without complex external efforts. To enable RoSE, we describe an architecture that extends an LLM to store all answered questions and their thoughts in a streaming experience pool then orchestrates helpful questions from the pool to assist in answering new questions. To set up a question-aware orchestration mechanism, RoSE first calculates the similarity of each question in the pool with a new test question. Since the solution to each answered question is not always correct, RoSE will sort the questions according to their similarity with the new question, and then uniformly divide them into multiple buckets. It finally extracts one question from each bucket to make these extracted questions more diverse. To make these extracted questions help RoSE answer new questions as much as possible, we introduce two other attributes of uncertainty and complexity for each question. RoSE will preferentially select the questions with low uncertainty and high complexity from each bucket. We evaluate the versatility of RoSE in various reasoning tasks, LLMs, and CoT methods.

demonstration, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2504.00473

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Africa > Rwanda > Kigali > Kigali (0.05)
North America > United States > Pennsylvania (0.04)
(8 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Chain of Draft: Thinking Faster by Writing Less

Xu, Silei, Xie, Wenhao, Zhao, Lingxiao, He, Pengcheng

arXiv.org Artificial IntelligenceMar-3-2025

Large Language Models (LLMs) have demonstrated remarkable performance in solving complex reasoning tasks through mechanisms like Chain-of-Thought (CoT) prompting, which emphasizes verbose, step-by-step reasoning. However, humans typically employ a more efficient strategy: drafting concise intermediate thoughts that capture only essential information. In this work, we propose Chain of Draft (CoD), a novel paradigm inspired by human cognitive processes, where LLMs generate minimalistic yet informative intermediate reasoning outputs while solving tasks. By reducing verbosity and focusing on critical insights, CoD matches or surpasses CoT in accuracy while using as little as only 7.6% of the tokens, significantly reducing cost and latency across various reasoning tasks. Our code and data are available at https://github.com/sileix/chain-of-draft.

arxiv preprint arxiv, language model, reasoning, (15 more...)

arXiv.org Artificial Intelligence

2502.186

Country: North America > United States (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

Hint Marginalization for Improved Reasoning in Large Language Models

Pal, Soumyasundar, Chételat, Didier, Zhang, Yingxue, Coates, Mark

arXiv.org Artificial IntelligenceDec-17-2024

Large Language Models (LLMs) have exhibited an impressive capability to perform reasoning tasks, especially if they are encouraged to generate a sequence of intermediate steps. Reasoning performance can be improved by suitably combining multiple LLM responses, generated either in parallel in a single query, or via sequential interactions with LLMs throughout the reasoning process. Existing strategies for combination, such as self-consistency and progressive-hint-prompting, make inefficient usage of the LLM responses. We present Hint Marginalization, a novel and principled algorithmic framework to enhance the reasoning capabilities of LLMs. Our approach can be viewed as an iterative sampling strategy for forming a Monte Carlo approximation of an underlying distribution of answers, with the goal of identifying the mode -- the most likely answer. Empirical evaluation on several benchmark datasets for arithmetic reasoning demonstrates the superiority of the proposed approach.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2412.13292

Country:

North America > Canada > Quebec > Montreal (0.14)
Europe > Monaco (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Add feedback

Zero-to-Strong Generalization: Eliciting Strong Capabilities of Large Language Models Iteratively without Gold Labels

Liu, Chaoqun, Chao, Qin, Zhang, Wenxuan, Wu, Xiaobao, Li, Boyang, Luu, Anh Tuan, Bing, Lidong

arXiv.org Artificial IntelligenceSep-18-2024

Large Language Models (LLMs) have demonstrated remarkable performance through supervised fine-tuning or in-context learning using gold labels. However, this paradigm is limited by the availability of gold labels, while in certain scenarios, LLMs may need to perform tasks that are too complex for humans to provide such labels. To tackle this challenge, this study explores whether solely utilizing unlabeled data can elicit strong model capabilities. We propose a new paradigm termed zero-to-strong generalization. We iteratively prompt LLMs to annotate unlabeled data and retain high-quality labels by filtering. Surprisingly, we obverse that this iterative process gradually unlocks LLMs' potential on downstream tasks. Our experiments on extensive classification and reasoning tasks confirm the effectiveness of our proposed framework. Our analysis indicates that this paradigm is effective for both in-context learning and fine-tuning, and for various model sizes.

classification task, demonstration, iteration, (13 more...)

arXiv.org Artificial Intelligence

2409.12425

Country:

North America > United States > Washington > King County > Seattle (0.14)
Asia > Singapore (0.05)
North America > United States > New York > New York County > New York City (0.04)
(7 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Self-Harmonized Chain of Thought

Jin, Ziqi, Lu, Wei

arXiv.org Artificial IntelligenceSep-6-2024

Chain-of-Thought (CoT) prompting reveals that large language models are capable of performing complex reasoning via intermediate steps. CoT prompting is primarily categorized into three approaches. The first approach utilizes straightforward prompts like ``Let's think step by step'' to generate a sequential thought process before yielding an answer. The second approach makes use of human-crafted, step-by-step demonstrations to guide the model's reasoning process. The third automates the generation of reasoned demonstrations with the 'Let's think step by step'.This approach sometimes leads to reasoning errors, highlighting the need to diversify demonstrations to mitigate its misleading effects. However, diverse demonstrations pose challenges for effective representations. In this work, we propose ECHO, a self-harmonized chain-of-thought prompting method. It consolidates diverse solution paths into a uniform and effective solution pattern.ECHO demonstrates the best overall performance across three reasoning domains.

demonstration, let, wrapper, (15 more...)

arXiv.org Artificial Intelligence

2409.04057

Country:

Asia > Japan (0.04)
North America > United States > Virginia (0.04)
North America > United States > California (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (0.93)
Education (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs

Rai, Daking, Yao, Ziyu

arXiv.org Artificial IntelligenceJun-18-2024

Large language models (LLMs) have shown strong arithmetic reasoning capabilities when prompted with Chain-of-Thought (CoT) prompts. However, we have only a limited understanding of how they are processed by LLMs. To demystify it, prior work has primarily focused on ablating different components in the CoT prompt and empirically observing their resulting LLM performance change. Yet, the reason why these components are important to LLM reasoning is not explored. To fill this gap, in this work, we investigate ``neuron activation'' as a lens to provide a unified explanation to observations made by prior work. Specifically, we look into neurons within the feed-forward layers of LLMs that may have activated their arithmetic reasoning capabilities, using Llama2 as an example. To facilitate this investigation, we also propose an approach based on GPT-4 to automatically identify neurons that imply arithmetic reasoning. Our analyses revealed that the activation of reasoning neurons in the feed-forward layers of an LLM can explain the importance of various components in a CoT prompt, and future research can extend it for a more complete understanding.

activation, neuron, reasoning, (16 more...)

arXiv.org Artificial Intelligence

2406.12288

Country:

North America > United States > Virginia > Fairfax County > Fairfax (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback