AITopics | alp

Collaborating Authors

alp

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models

Neural Information Processing SystemsMar-19-2026, 22:44:25 GMT

The impressive performance of Large Language Models (LLMs) across various natural language processing tasks comes at the cost of vast computational resources and storage requirements. One-shot pruning techniques offer a way to alleviate these burdens by removing redundant weights without the need for retraining. Yet, the massive scale of LLMs often forces current pruning approaches to rely on heuristics instead of optimization-based techniques, potentially resulting in suboptimal compression. In this paper, we introduce ALPS, an optimization-based framework that tackles the pruning problem using the operator splitting technique and a preconditioned conjugate gradient-based post-processing step. Our approach incorporates novel techniques to accelerate and theoretically guarantee convergence while leveraging vectorization and GPU parallelism for efficiency. ALPS substantially outperforms state-of-the-art methods in terms of the pruning objective and perplexity reduction, particularly for highly sparse models. On the LLaMA3-8B model with 70\% sparsity, ALPS achieves a 29\% reduction in test perplexity on the WikiText dataset and a 8\% improvement in zero-shot benchmark performance compared to existing methods.

large language model, natural language, proceedings, (5 more...)

Neural Information Processing Systems

Genre: Research Report > Promising Solution (0.60)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Ancient bone may prove legendary war elephant crossing of Alps

BBC NewsFeb-16-2026, 17:26:49 GMT

An elephant foot bone found by archaeologists digging in southern Spain may be evidence that a troop of war elephants stomped through ancient Europe. It would be the first concrete proof of the legendary Carthaginian General Hannibal's troop of battle elephants, according to academics. Drawings of Hannibal's war against the Romans had long suggested that the beasts were used in fighting, but no hard evidence backed up the theories. Now the creatures' skeletal remains appear to have been found in an Iron Age dig near Cordoba. Beyond ivory, the discovery of elephant remains in European archaeological contexts is exceptionally rare, says the team of scientists in a paper published in Journal of Archaeological Science: Reports.

artificial intelligence, bone, elephant, (12 more...)

BBC News

Country:

North America (1.00)
Europe > United Kingdom > England (0.15)

Genre: Research Report (0.35)

Industry:

Leisure & Entertainment (0.75)
Media > Film (0.30)

Technology: Information Technology > Artificial Intelligence (0.30)

Add feedback

ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models

Neural Information Processing SystemsFeb-11-2026, 22:56:31 GMT

One-shot pruning techniques offer a way to alleviate these burdens by removing redundant weights without the need for retraining. Y et, the massive scale of LLMs often forces current pruning approaches to rely on heuristics instead of optimization-based techniques, potentially resulting in suboptimal compression.

large language model, machine learning, pruning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Italy (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.92)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

c1619d2ad66f7629c12c87fe21d32a58-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 00:16:47 GMT

active learning, algorithm, learning, (15 more...)

Neural Information Processing Systems

Country:

Europe > Denmark > Capital Region > Copenhagen (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Add feedback

LearningPhysicsConstrainedDynamicsUsing Autoencoders

Neural Information Processing SystemsFeb-9-2026, 16:05:49 GMT

Recent workhasshown that neural networks can exhibit an inductive bias that is often introduced via designing specific structures[4].

artificial intelligence, inaddition, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Africa > Ethiopia (0.05)
North America > United States > California (0.04)
(5 more...)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models

Neural Information Processing SystemsOct-10-2025, 00:36:31 GMT

arxiv preprint arxiv, pruning, sparsity level, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Italy (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.92)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Learning Physics Constrained Dynamics Using Autoencoders Tsung-Y en Y ang

Neural Information Processing SystemsOct-9-2025, 15:58:27 GMT

We consider the problem of estimating states ( e.g., position and velocity) and physical parameters ( e.g., friction, elasticity) from a sequence of observations

fourier feature mapping, neural network, physical parameter, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
(9 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Energy (0.67)
Education > Curriculum > Subject-Specific Education (0.40)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

When Inverse Data Outperforms: Exploring the Pitfalls of Mixed Data in Multi-Stage Fine-Tuning

Deng, Mengyi, Li, Xin, Zhu, Tingyu, Yang, Zhicheng, Guo, Zhijiang, Wang, Wei

arXiv.org Artificial IntelligenceSep-17-2025

Existing work has shown that o1-level performance can be achieved with limited data distillation, but most existing methods focus on unidirectional supervised fine-tuning (SFT), overlooking the intricate interplay between diverse reasoning patterns. In this paper, we construct r1k, a high-quality reverse reasoning dataset derived by inverting 1,000 forward examples from s1k, and examine how SFT and Direct Preference Optimization (DPO) affect alignment under bidirectional reasoning objectives. SFT on r1k yields a 1.6%--6.8% accuracy improvement over s1k across evaluated benchmarks. However, naively mixing forward and reverse data during SFT weakens the directional distinction. Although DPO can partially recover this distinction, it also suppresses less preferred reasoning paths by shifting the probability mass toward irrelevant outputs. These findings suggest that mixed reasoning data introduce conflicting supervision signals, underscoring the need for robust and direction-aware alignment strategies.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2509.13079

Country: Asia > China (0.28)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.97)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Online Active Learning with Surrogate Loss Functions

Neural Information Processing SystemsAug-17-2025, 05:07:10 GMT

In this paper, we are specifically interested in binary classification problems in the so-called streaming (or online) setting of active learning.

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > Denmark > Capital Region > Copenhagen (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Add feedback

GETALP@AutoMin 2025: Leveraging RAG to Answer Questions based on Meeting Transcripts

Kang, Jeongwoo, Vartampetian, Markarit, Herron, Felix, Zhou, Yongxin, Fabre, Diandra, Gonzalez-Saez, Gabriela

arXiv.org Artificial IntelligenceAug-4-2025

This paper documents GETALP's submission to the Third Run of the Automatic Minuting Shared Task at SIGDial 2025. We participated in Task B: question-answering based on meeting transcripts. Our method is based on a retrieval augmented generation (RAG) system and Abstract Meaning Representations (AMR). We propose three systems combining these two approaches. Our results show that incorporating AMR leads to high-quality responses for approximately 35% of the questions and provides notable improvements in answering questions that involve distinguishing between different participants (e.g., who questions).

computational linguistic, large language model, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2508.00476

Country:

Europe > France (0.29)
North America > Mexico (0.28)
Asia > Middle East > UAE (0.28)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback