AITopics | lsr

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Dense Associative Memory with Epanechnikov Energy

Neural Information Processing Systems, Jun-13-2026, 19:22:52 GMT

We propose a novel energy function for Dense Associative Memory (DenseAM) networks, the log-sum-ReLU (LSR), inspired by optimal kernel density estimation. Unlike the common log-sum-exponential (LSE) function, LSR is based on the Epanechnikov kernel and enables exact memory retrieval with exponential capacity without requiring exponential separation functions. Uniquely, it introduces abundant additional emergent local minima while preserving perfect pattern recovery, a characteristic previously unseen in the DenseAM literature. Empirical results show that the LSR energy has significantly more local minima (memories) with log-likelihood comparable to that of LSE-based models. Analysis of LSR's emergent memories on image datasets reveals a degree of creativity and novelty, hinting at this method's potential for both large-scale memory storage and generative tasks.

artificial intelligence, name change, proceedings, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Cognitive Science (0.66)
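
To make the contrast in the Dense Associative Memory abstract above concrete, here is a minimal sketch of an LSE energy next to a ReLU-based (Epanechnikov-style) energy. The exact functional forms, the scaling by `beta`, and the small constant `eps` are assumptions chosen for illustration; the abstract does not give the paper's formulas, so this should not be read as the authors' definition.

```python
import math

def sq_dist(x, p):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((a - b) ** 2 for a, b in zip(x, p))

def lse_energy(x, patterns, beta):
    """Assumed LSE form: -(1/beta) * log sum_mu exp(-beta/2 * ||x - xi_mu||^2)."""
    terms = [-0.5 * beta * sq_dist(x, p) for p in patterns]
    m = max(terms)  # subtract the max for a numerically stable log-sum-exp
    return -(m + math.log(sum(math.exp(t - m) for t in terms))) / beta

def lsr_energy(x, patterns, beta, eps=1e-6):
    """Assumed LSR form: -(1/beta) * log(eps + sum_mu ReLU(1 - beta/2 * ||x - xi_mu||^2)).

    Each term is an Epanechnikov-style kernel with compact support: it is
    exactly zero once ||x - xi_mu|| >= sqrt(2 / beta). `eps` (hypothetical)
    keeps the logarithm finite outside every support.
    """
    total = sum(max(0.0, 1.0 - 0.5 * beta * sq_dist(x, p)) for p in patterns)
    return -math.log(eps + total) / beta
```

With two stored patterns such as `(0.0, 0.0)` and `(3.0, 0.0)` and `beta = 1.0`, the kernel supports do not overlap, so the ReLU-based energy near one pattern is determined by that pattern alone, whereas every pattern contributes to the LSE energy everywhere.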

Credit-assigned Policy Gradient for Early Stage Retrieval in Two-stage Ranking

Kiyohara, Haruka, Curmei, Mihaela, Evnine, Ariel, Kalyanaraman, Shankar, Nir, Israel, Pop, Ana-Roxana, Razin, Nitzan, Dean, Sarah, Joachims, Thorsten, Weinsberg, Udi

arXiv.org Machine Learning, May-27-2026

Large-scale search, recommendation, and retrieval-augmented generation (RAG) systems typically employ a two-stage architecture: an early-stage ranker (ESR) generates a candidate set, which is subsequently re-ranked by a late-stage ranker (LSR). While there are many reinforcement learning (RL) methods for training the LSR, end-to-end training of the ESR has proven challenging. In particular, naive application of "vanilla" policy gradient (V-PG) is not scalable for candidate-set sizes relevant for practical use due to exploding variance. This issue arises because V-PG propagates the gradient to the joint probability of the candidate sets, ignoring the contribution of each specific item in the candidate set to the reward. To mitigate this issue, we propose a novel "credit-assigned" policy gradient (CA-PG), which computes gradients with respect to the probability that the target item is chosen in any candidate set, i.e. marginalizing over all candidate sets that contain it. Our theoretical analysis reveals that CA-PG significantly reduces the variance of V-PG by marginalizing over the specific composition of the candidate set, while preserving the ability to learn the correct ranking of items under a reasonably aligned LSR policy. Experiments on both synthetic and real-world data demonstrate that CA-PG improves the convergence speed and training stability for ESRs utilizing the canonical Plackett-Luce model, especially when the candidate-set size is large.

large language model, machine learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

2605.26385

Country: Asia (0.46)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.66)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
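
The quantity that the CA-PG abstract above differentiates, the probability that a target item lands in any candidate set, can be illustrated by brute force for a toy Plackett-Luce model. This sketch only shows the marginalization itself; it is not the paper's estimator, and the function names and the enumeration approach are assumptions made for illustration. Enumeration is exponential in the candidate-set size, so it is only viable for very small problems.

```python
from itertools import permutations

def pl_sequence_prob(seq, weights):
    """Plackett-Luce probability of drawing the ordered sequence `seq`
    without replacement, where `weights[i]` is the positive score of item i."""
    remaining = sum(weights)
    prob = 1.0
    for i in seq:
        prob *= weights[i] / remaining
        remaining -= weights[i]
    return prob

def inclusion_prob(item, weights, k):
    """Probability that `item` appears in a size-k candidate set, obtained by
    summing over every ordered k-sequence that contains it."""
    n = len(weights)
    return sum(
        pl_sequence_prob(seq, weights)
        for seq in permutations(range(n), k)
        if item in seq
    )
```

A useful sanity check is that the inclusion probabilities of all items sum to `k`, because every sampled candidate set contains exactly `k` items.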

main

Sadhika Malladi

Neural Information Processing Systems, Feb-9-2026, 04:44:09 GMT

accuracy, approximation, noise, (15 more...)

Neural Information Processing Systems

Genre: Workflow (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

SMART-3D: Three-Dimensional Self-Morphing Adaptive Replanning Tree

Agrawal, Priyanshu, Gupta, Shalabh, Shen, Zongyuan

arXiv.org Artificial Intelligence, Sep-23-2025

This paper presents SMART-3D, an extension of the SMART algorithm to 3D environments. SMART-3D is a tree-based adaptive replanning algorithm for dynamic environments with fast-moving obstacles. SMART-3D morphs the underlying tree to find a new path in real time whenever the current path is blocked by obstacles. SMART-3D removes the grid decomposition requirement of the SMART algorithm by replacing the concept of hot-spots with that of hot-nodes, thus making it computationally efficient and scalable to 3D environments. The hot-nodes are nodes that allow for efficient reconnections to morph the existing tree and find a new safe and reliable path. The performance of SMART-3D is evaluated by extensive simulations in 2D and 3D environments populated with randomly moving dynamic obstacles. The results show that SMART-3D achieves high success rates and low replanning times, thus highlighting its suitability for real-time onboard applications. Recent decades have seen significant growth of autonomous robots in supporting a diverse range of human operations.

artificial intelligence, dynamic obstacle, obstacle, (16 more...)

arXiv.org Artificial Intelligence

2509.16812

Country: North America > United States > Connecticut (0.28)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.46)

main

Sadhika Malladi

Neural Information Processing Systems, Aug-14-2025, 23:32:32 GMT

accuracy, approximation, noise, (17 more...)

Neural Information Processing Systems

Genre: Workflow (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

3DGS_LSR:Large_Scale Relocation for Autonomous Driving Based on 3D Gaussian Splatting

Lu, Haitao, Chen, Haijier, Liu, Haoze, Zhang, Shoujian, Xu, Bo, Liu, Ziao

arXiv.org Artificial Intelligence, Jul-9-2025

In autonomous robotic systems, precise localization is a prerequisite for safe navigation. However, in complex urban environments, GNSS positioning often suffers from signal occlusion and multipath effects, leading to unreliable absolute positioning. Traditional mapping approaches are constrained by storage requirements and computational inefficiency, limiting their applicability to resource-constrained robotic platforms. To address these challenges, we propose 3DGS-LSR: a large-scale relocalization framework leveraging 3D Gaussian Splatting (3DGS), enabling centimeter-level positioning using only a single monocular RGB image on the client side. We combine multi-sensor data to construct high-accuracy 3DGS maps in large outdoor scenes, while the robot-side localization requires just a standard camera input. Using SuperPoint and SuperGlue for feature extraction and matching, our core innovation is an iterative optimization strategy that refines localization results through step-by-step rendering, making it suitable for real-time autonomous navigation. Experimental validation on the KITTI dataset demonstrates our 3DGS-LSR achieves average positioning accuracies of 0.026m, 0.029m, and 0.081m in town roads, boulevard roads, and traffic-dense highways respectively, significantly outperforming other representative methods while requiring only monocular RGB input. This approach provides autonomous robots with reliable localization capabilities even in challenging urban environments where GNSS fails.

artificial intelligence, autonomous driving, gaussian splatting, (2 more...)

arXiv.org Artificial Intelligence

2507.05661

Genre: Research Report (0.40)

Industry:

Transportation > Ground > Road (0.40)
Information Technology > Robotics & Automation (0.40)
Automobiles & Trucks (0.40)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.40)

From Haystack to Needle: Label Space Reduction for Zero-shot Classification

Vandemoortele, Nathan, Steenwinckel, Bram, Ongenae, Femke, Van Hoecke, Sofie

arXiv.org Artificial Intelligence, Feb-12-2025

We present Label Space Reduction (LSR), a novel method for improving zero-shot classification performance of Large Language Models (LLMs). LSR iteratively refines the classification label space by systematically ranking and reducing candidate classes, enabling the model to concentrate on the most relevant options. By leveraging unlabeled data with the statistical learning capabilities of data-driven models, LSR dynamically optimizes the label space representation at test time. Our experiments across seven benchmarks demonstrate that LSR improves macro-F1 scores by an average of 7.0% (up to 14.2%) with Llama-3.1-70B and 3.3% (up to 11.1%) with Claude-3.5-Sonnet compared to standard zero-shot classification baselines. To reduce the computational overhead of LSR, which requires an additional LLM call at each iteration, we propose distilling the model into a probabilistic classifier, allowing for efficient inference.

classification, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2502.08436

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.46)
Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)
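
The iterative "rank, then reduce" loop described in the Label Space Reduction abstract above can be sketched generically. Everything here is hypothetical: `rank_labels` stands in for whatever ranking step is used (the abstract mentions data-driven models and LLM calls but gives no interface), and `keep_fraction` and `min_labels` are illustrative parameters, not values from the paper.

```python
def reduce_label_space(labels, rank_labels, keep_fraction=0.5, min_labels=2):
    """Repeatedly rank the candidate labels and keep only the top share.

    `rank_labels` (hypothetical) takes a list of labels and returns the same
    labels ordered from most to least relevant.
    """
    candidates = list(labels)
    while len(candidates) > min_labels:
        ranked = rank_labels(candidates)
        keep = max(min_labels, int(len(ranked) * keep_fraction))
        if keep >= len(candidates):
            break  # no further reduction possible with these settings
        candidates = ranked[:keep]
    return candidates
```

With a toy ranker that sorts labels by a fixed score table, eight labels shrink to four and then to two; in a real pipeline the ranker would be re-evaluated on the reduced set at every iteration, which is where the extra cost per iteration mentioned in the abstract comes from.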

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Chen, Yu, Zhang, Xiangcheng, Wang, Siwei, Huang, Longbo

arXiv.org Artificial Intelligence, Feb-28-2024

Reinforcement learning (RL) [43] has emerged as a powerful framework for sequential decision-making in dynamic and uncertain environments. While traditional RL methods, predominantly focused on maximizing the expected return, have seen significant advancements through approaches such as Q-learning [37, 25] and policy gradients [28, 10], they often fall short in real-world scenarios demanding strict risk control, such as financial investment [9], medical treatment [16], and autonomous driving [11]. The significance of comprehending risk management in RL has led to the emergence of Risk-Sensitive RL (RSRL). Unlike risk-neutral RL, which primarily focuses on maximizing expected returns, RSRL seeks to optimize risk metrics of the possible cumulative reward, such as entropy risk measures (ERM) [17, 18] or conditional value-at-risk (CVaR) [46], which emphasize its distributional characteristics. However, the traditional RL framework based on Q-learning, which typically considers the mean of the reward-to-go and the corresponding Bellman equation, cannot efficiently capture the characteristics of the cumulative reward's distribution. Therefore, there has been an upsurge of interest in Distributional RL (DisRL) due to its capacity to understand the intrinsic distributional attributes of cumulative rewards, which has already achieved significant empirical success in risk-sensitive tasks [8, 14, 30, 45, 34].

function approximation, probability, risk measure, (12 more...)

arXiv.org Artificial Intelligence

2402.18159

Country:

Asia > Middle East > Jordan (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.87)
Information Technology > Security & Privacy (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.42)
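
The two risk metrics named in the risk-sensitive RL snippet above, CVaR and the entropic risk measure, can be estimated from sampled cumulative rewards as follows. This is a minimal sketch under common textbook conventions (lower-tail CVaR on returns, and entropic risk as a scaled log-moment-generating function); conventions differ between papers, and the function names and the empirical estimators are assumptions rather than the cited work's definitions.

```python
import math

def cvar(returns, alpha):
    """Empirical lower-tail CVaR: the mean of the worst ceil(alpha * n)
    sampled returns, for 0 < alpha <= 1."""
    ordered = sorted(returns)
    m = max(1, math.ceil(alpha * len(ordered)))
    return sum(ordered[:m]) / m

def entropic_risk(returns, beta):
    """Empirical entropic risk (1/beta) * log E[exp(beta * X)] for beta != 0.

    Negative beta penalizes bad outcomes (risk-averse); positive beta is
    risk-seeking. The max is subtracted for numerical stability.
    """
    scaled = [beta * r for r in returns]
    m = max(scaled)
    log_mean = m + math.log(sum(math.exp(s - m) for s in scaled) / len(scaled))
    return log_mean / beta
```

Both measures reduce toward the plain mean in the risk-neutral limit: `cvar(returns, 1.0)` is exactly the sample mean, and `entropic_risk` approaches it as `beta` tends to zero. A risk-averse setting scores the same samples lower than the mean, which is the behavior a distributional method needs access to the full return distribution to optimize.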

Time-Optimal Path Planning in a Constant Wind for Uncrewed Aerial Vehicles using Dubins Set Classification

Moon, Brady, Sachdev, Sagar, Yuan, Junbin, Scherer, Sebastian

arXiv.org Artificial Intelligence, Nov-2-2023

Time-optimal path planning in high winds for a turning-rate constrained UAV is a challenging problem to solve and is important for deployment and field operations. Previous works have used trochoidal path segments, comprising straight and maximum-rate turn segments, as optimal extremal paths in uniform wind conditions. Current methods iterate over all candidate trochoidal trajectory types and select the one that is time-optimal; however, this exhaustive search can be computationally slow. In this paper, we introduce a method to decrease the computation time. This is achieved by reducing the number of candidate trochoidal trajectory types by framing the problem in the air-relative frame and bounding the solution within a subset of candidate trajectories. Our method reduces overall computation by 37.4% compared to pre-existing methods in Bang-Straight-Bang trajectories, freeing up computation for other onboard processes, and can lead to significant total computational reductions when solving many trochoidal paths. When used within the framework of a global path planner, faster state expansions help find solutions faster or compute higher-quality paths. We also release our open-source codebase as a C++ package. The website and demo can be found at https://bradymoon.com/trochoids, codebase at https://github.com/castacks/trochoids, and video at https://youtu.be/qOU5gI7JshI.

decision table, trajectory, transition point, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/LRA.2023.3333167

2306.11845

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
Europe > Netherlands > South Holland > Dordrecht (0.04)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.72)
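
The idea of working in the air-relative frame, mentioned in the trochoid abstract above, rests on a simple kinematic fact: a constant-rate turn is a circular arc relative to the air mass, and the ground track adds the wind drift on top. The sketch below shows only that relationship; the function name, argument layout, and sign conventions are assumptions, and it is unrelated to the API of the authors' C++ package.

```python
import math

def turn_in_wind(x0, y0, psi0, airspeed, turn_rate, wind, t):
    """Ground-frame position after turning at a constant nonzero `turn_rate`
    (rad/s) for `t` seconds, starting at (x0, y0) with heading `psi0` (rad).

    `wind` is a constant (wx, wy) velocity of the air mass.
    """
    wx, wy = wind
    radius = airspeed / turn_rate  # signed turn radius in the air frame
    # Air-relative displacement: integral of airspeed * (cos psi, sin psi).
    x_air = radius * (math.sin(psi0 + turn_rate * t) - math.sin(psi0))
    y_air = -radius * (math.cos(psi0 + turn_rate * t) - math.cos(psi0))
    # The air mass drifts with the wind, which turns the arc into a trochoid.
    return x0 + x_air + wx * t, y0 + y_air + wy * t
```

After one full turn period `2 * math.pi / abs(turn_rate)` the air-relative displacement is zero, so the vehicle has moved by exactly the wind drift over that time; with zero wind it returns to its starting point.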

Dubins Curve Based Continuous-Curvature Trajectory Planning for Autonomous Mobile Robots

Huang, Xuanhao, Yan, Chao-Bo

arXiv.org Artificial Intelligence, Sep-14-2023

Autonomous mobile robots (AMRs) are widely used in factories to replace manual labor, reduce costs, and improve efficiency. However, it is often difficult for logistics robots to plan the optimal trajectory, and unreasonable trajectory planning can lead to low transport efficiency and high energy consumption. In this paper, we propose a method to directly calculate the optimal trajectory for short distances on the basis of the Dubins set, which completes the calculation of the Dubins path. Additionally, as an improvement on the Dubins path, we smooth it using clothoids, which makes the curvature vary linearly. An AMR can adjust its steering wheels while following this trajectory. The experiments show that the Dubins path can be calculated quickly and smoothed well.

dubin path, optimal path, trajectory, (15 more...)

arXiv.org Artificial Intelligence

2309.07565

Country:

Asia > China > Shaanxi Province > Xi'an (0.04)
North America > United States > Missouri > St. Louis County > St. Louis (0.04)
Europe > Slovakia (0.04)
(2 more...)

Genre: Research Report (0.40)

Industry: Transportation (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.47)
Information Technology > Artificial Intelligence > Robots > Locomotion (0.41)
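
The clothoid smoothing mentioned in the Dubins-curve abstract above relies on curvature that changes linearly with arc length, so the heading is a quadratic in arc length. The sketch below integrates such a segment numerically with the midpoint rule. The function name, the parameter `sharpness` (rate of curvature change), and the integration scheme are assumptions for illustration, not the paper's procedure.

```python
import math

def clothoid_end_pose(length, sharpness, kappa0=0.0, theta0=0.0, steps=1000):
    """Return (x, y, theta) at the end of a segment starting at the origin
    whose curvature is kappa(s) = kappa0 + sharpness * s.

    The heading is theta(s) = theta0 + kappa0 * s + sharpness * s**2 / 2;
    the position is integrated with the midpoint rule over `steps` slices.
    """
    ds = length / steps
    x = y = 0.0
    for i in range(steps):
        s_mid = (i + 0.5) * ds
        theta = theta0 + kappa0 * s_mid + 0.5 * sharpness * s_mid ** 2
        x += math.cos(theta) * ds
        y += math.sin(theta) * ds
    theta_end = theta0 + kappa0 * length + 0.5 * sharpness * length ** 2
    return x, y, theta_end
```

Setting `sharpness` to zero recovers the two Dubins primitives: a straight line when `kappa0` is zero and a circular arc otherwise. A nonzero `sharpness` ramps the curvature between them, which avoids the curvature jump at the junction of a line and an arc and gives the steering time to adjust.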
