AITopics | Large Language Model

Large Language Model

D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models

Neural Information Processing Systems | Dec-27-2025, 18:48:12 GMT

Large language models have shown an impressive societal impact owing to their excellent understanding and logical reasoning skills. However, such strong ability relies on a huge amount of computing resources, which makes it difficult to deploy LLMs on computing resource-constrained platforms. Currently, LLMs process each token equivalently, but we argue that not every word is equally important. Some words should not be allocated excessive computing resources, particularly for dispensable terms in simple questions. In this paper, we propose a novel dynamic inference paradigm for LLMs, namely D-LLMs, which adaptively allocate computing resources in token processing. We design a dynamic decision module for each transformer layer that decides whether a network unit should be executed or skipped. Moreover, we tackle the issue of adapting D-LLMs to real-world applications, specifically concerning the missing KV-cache when layers are skipped. To overcome this, we propose a simple yet effective eviction policy to exclude the skipped layers from subsequent attention calculations. The eviction policy not only enables D-LLMs to be compatible with prevalent applications but also reduces considerable storage resources.
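
The per-layer skip decision and the cache eviction described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the `Layer` class, the threshold-based `should_execute` rule, and the scalar "hidden state" are all assumptions made for illustration, whereas the paper describes a learned decision module inside real transformer layers.

```python
from dataclasses import dataclass, field


@dataclass
class Layer:
    """Toy stand-in for a transformer layer with its own KV cache (hypothetical)."""
    scale: float
    # Positions of the tokens that actually executed this layer.
    kv_cache: list = field(default_factory=list)

    def forward(self, hidden: float, position: int) -> float:
        # Executing the layer writes a cache entry for this token.
        self.kv_cache.append(position)
        return hidden * self.scale + 1.0


def should_execute(hidden: float, threshold: float) -> bool:
    """Hypothetical decision module: a fixed threshold instead of a learned gate."""
    return abs(hidden) >= threshold


def run_token(layers, hidden, position, threshold):
    """Pass one token through the stack, skipping layers the decision module rejects."""
    executed = []
    for idx, layer in enumerate(layers):
        if should_execute(hidden, threshold):
            hidden = layer.forward(hidden, position)
            executed.append(idx)
        # Skipped: the hidden state passes through unchanged and no cache
        # entry is written, so later tokens cannot attend to this token
        # at this layer (the "eviction" idea from the abstract).
    return hidden, executed
```

In this sketch a skipped token simply leaves no entry in that layer's cache, which is one simple way to read "exclude the skipped layers from subsequent attention calculations"; it also shows why skipping reduces cache storage.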

computational cost, d-llm, language model, (16 more...)

Country: Asia > China > Hong Kong (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Efficient Adversarial Training in LLMs with Continuous Attacks

Neural Information Processing Systems | Dec-27-2025, 17:47:37 GMT

The authors use Greedy Coordinate Gradient (GCG) to generate discrete adversarial suffixes in natural language.

adversarial attack, adversarial training, robustness, (16 more...)

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > China (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry:

Information Technology > Security & Privacy (0.48)
Government (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

AVIS: Autonomous Visual Information Seeking with Large Language Model Agent

Neural Information Processing Systems | Dec-27-2025, 17:46:42 GMT

In this paper, we propose an autonomous information seeking visual question answering framework, AVIS. Our method leverages a Large Language Model (LLM)

arxiv preprint arxiv, information, visual question, (13 more...)

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(3 more...)

Genre: Workflow (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

02fd91a387a6a5a5751e81b58a75af90-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing Systems | Dec-27-2025, 17:38:39 GMT

dataset, helpsteer2, reward model, (15 more...)

Country:

Asia > Middle East > Jordan (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
South America > Colombia > Meta Department > Villavicencio (0.04)
(7 more...)

Genre: Research Report (0.92)

Industry:

Leisure & Entertainment > Sports (0.46)
Education > Educational Setting > K-12 Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

02ee6b7295f720407b56c457b34c54d5-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing Systems | Dec-27-2025, 17:36:45 GMT

arxiv preprint arxiv, dataset, language model, (15 more...)

Country:

Asia > China (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > California (0.04)
Europe > Germany > Berlin (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education > Curriculum > Subject-Specific Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

From shrimp Jesus to erotic tractors: how viral AI slop took over the internet

The Guardian | Dec-27-2025, 17:00:44 GMT

Clockwise from top left: Shrimp Jesus, Nayib Bukele, Justin Bieber and Super Cat League. In the algorithm-driven economy of 2025, one man's shrimp Jesus is another man's side hustle. AI slop - the low-quality, surreal content flooding social media platforms, designed to farm views - is a phenomenon, some would say the phenomenon of the 2024 and 2025 internet. Merriam-Webster's word of the year this year is "slop", referring exclusively to the internet variety.

ai slop, shrimp jesus, video, (11 more...)

Country:

North America > United States (0.30)
Oceania > Australia (0.05)
Europe > Ukraine > Donetsk Oblast > Mariupol (0.05)
(4 more...)

Industry:

Media (1.00)
Leisure & Entertainment > Sports (0.96)
Government > Regional Government > North America Government > United States Government (0.30)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.49)

Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models

Neural Information Processing Systems | Dec-27-2025, 16:35:02 GMT

The proposed method is dubbed UniHOI.

detection, interaction, proceedings, (8 more...)

Country: Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Open LLMs are Necessary for Current Private Adaptations and Outperform their Closed Alternatives

Neural Information Processing Systems | Dec-27-2025, 15:56:28 GMT

Recently, various new methods have been proposed to adapt closed LLMs to private data without leaking private information to third parties and/or the LLM provider.

adaptation, dataset, llm, (16 more...)

Country:

Europe > Austria > Vienna (0.14)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
(7 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Large language models transition from integrating across position-yoked, exponential windows to structure-yoked, power-law windows

Neural Information Processing Systems | Dec-27-2025, 15:55:31 GMT

Prior work suggests that human brain responses to language exhibit hierarchically organized "integration windows" that substantially constrain the

boundary, integration window, structure-yoked integration, (15 more...)

Country:

Europe > Italy > Tuscany > Florence (0.04)
North America > United States > New York > Monroe County > Rochester (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report > New Finding (0.94)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)

Discovering Sparsity Allocation for Layer-wise Pruning of Large Language Models

Neural Information Processing Systems | Dec-27-2025, 14:39:13 GMT

In this paper, we present DSA, the first automated framework for discovering sparsity allocation schemes for layer-wise pruning in Large Language Models (LLMs). LLMs have become increasingly powerful, but their large parameter counts make them computationally expensive. Existing pruning methods for compressing LLMs primarily focus on evaluating redundancies and removing element-wise weights. However, these methods fail to allocate adaptive layer-wise sparsities, leading to performance degradation in challenging tasks. We observe that per-layer importance statistics can serve as allocation indications, but their effectiveness depends on the allocation function between layers.
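
To make "allocation function" concrete, here is a minimal sketch of one hand-written way to map per-layer importance scores to per-layer sparsities. It is not DSA, which the abstract describes as discovering such schemes automatically; the linear rule, the `spread` parameter, and the function name `allocate_sparsity` are assumptions chosen only for illustration.

```python
def allocate_sparsity(importance, target, spread=0.2):
    """Map per-layer importance scores to per-layer sparsity ratios.

    Hypothetical linear rule: layers with above-average importance are
    pruned less than `target`, layers with below-average importance are
    pruned more, by at most `spread`. Before clipping, the mean of the
    returned sparsities equals `target`.
    """
    n = len(importance)
    mean_imp = sum(importance) / n
    # Largest deviation from the mean; fall back to 1.0 if all scores are equal.
    max_dev = max(abs(s - mean_imp) for s in importance) or 1.0
    raw = [target - spread * (s - mean_imp) / max_dev for s in importance]
    # Clip to the valid range for a sparsity ratio.
    return [min(1.0, max(0.0, r)) for r in raw]
```

A different allocation function (for example, one that is nonlinear in the importance score) would distribute the same overall budget differently across layers, which is the dependency the abstract points to.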

large language model, natural language, proceedings, (4 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)