AITopics | loki

Collaborating Authors

loki

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Convergent Functions, Divergent Forms

Neural Information Processing SystemsJun-16-2026, 19:28:10 GMT

We introduce LOKI, a compute-efficient framework for co-designing morphologies and control policies that generalize across unseen tasks. Inspired by biological adaptation--where animals quickly adjust to morphological changes--our method overcomes the inefficiencies of traditional evolutionary and quality-diversity algorithms. We propose learning convergent functions: shared control policies trained across clusters of morphologically similar designs in a learned latent space, drastically reducing the training cost per design. Simultaneously, we promote divergent forms by replacing mutation with dynamic local search, enabling broader exploration and preventing premature convergence. The policy reuse allows us to explore 780 more designs using 78% fewer simulation steps and 40% less compute per design. Local competition paired with a broader search results in a diverse set of high-performing final morphologies. Using the UNIMAL design space and a flatterrain locomotion task, LOKI discovers a rich variety of designs--ranging from quadrupeds to crabs, bipedals, and spinners--far more diverse than those produced by prior work. These morphologies also transfer better to unseen downstream tasks * Equal contribution 39th Conference on Neural Information Processing Systems (NeurIPS 2025).

Add feedback

Loki: Low-rank Keys for Efficient Sparse Attention

Neural Information Processing SystemsMar-18-2026, 19:33:58 GMT

Inference on large language models (LLMs) can be expensive in terms of thecompute and memory costs involved, especially when long sequence lengths areused. In particular, the self-attention mechanism used in LLM inference contributessignificantly to these costs, which has sparked an interest in approximating the self-attention computation to reduce such costs. In this work, we propose to approximateself-attention by focusing on the dimensionality of key vectors computed in theattention block. Our analysis reveals that key vectors lie in a significantly lower-dimensional space, consistently across several datasets and models. Exploiting thisobservation, we propose Loki, a novel sparse attention method that ranks and selectstokens in the KV-cache based on attention scores computed in low-dimensionalspace. Our evaluations show that Loki is able to speed up the attention computationdue to reduced data movement (load/store) and compute costs while maintainingthe efficacy of the models better than other popular approximation methods.

large language model, natural language, proceedings, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.84)

Add feedback

Geometry-Aware Adaptation for Pretrained Models

Neural Information Processing SystemsDec-26-2025, 10:29:23 GMT

Machine learning models---including prominent zero-shot models---are often trained on datasets whose labels are only a small proportion of a larger label space. Such spaces are commonly equipped with a metric that relates the labels via distances between them. We propose a simple approach to exploit this information to adapt the trained model to reliably predict new classes---or, in the case of zero-shot prediction, to improve its performance---without any additional training. Our technique is a drop-in replacement of the standard prediction rule, swapping $\text{argmax}$ with the Fréchet mean. We provide a comprehensive theoretical analysis for this approach, studying (i) learning-theoretic results trading off label space diameter, sample complexity, and model dimension, (ii) characterizations of the full range of scenarios in which it is possible to predict any unobserved class, and (iii) an optimal active learning-like next class selection procedure to obtain optimal training classes for when it is not possible to predict the entire range of unobserved classes. Empirically, using easily-available external metrics, our proposed approach, Loki, gains up to 29.7% relative improvement over SimCLR on ImageNet and scales to hundreds of thousands of classes. When no such metric is available, Loki can use self-derived metrics from class embeddings and obtains a 10.5% improvement on pretrained zero-shot models such as CLIP.

geometry-aware adaptation, name change, pretrained model, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.73)

Add feedback

LoKI: Low-damage Knowledge Implanting of Large Language Models

Wang, Runyu, Ping, Peng, Guo, Zhengyu, Zhang, Xiaoye, Shi, Quan, Zhou, Liting, Ji, Tianbo

arXiv.org Artificial IntelligenceNov-25-2025

Fine-tuning adapts pretrained models for specific tasks but poses the risk of catastrophic forgetting (CF), where critical knowledge from pretraining is overwritten. To address the issue of CF in a general-purpose framework, we propose Low-damage Knowledge Implanting (LoKI), a parameter-efficient fine-tuning (PEFT) technique that utilizes recent mechanistic understanding of how knowledge is stored in transformer architectures. We compare LoKI against state-of-the-art PEFT methods in two real-world fine-tuning scenarios. The results show that LoKI demonstrates significantly better preservation of general capabilities. At the same time, its task-specific performance is comparable to or even surpasses that of full parameter fine-tuning and these PEFT methods across various model architectures. Our work bridges the mechanistic insights of LLMs' knowledge storage with practical fine-tuning objectives, enabling an effective balance between task-specific adaptation and the retention of general-purpose capabilities.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2505.2212

Country:

Europe (0.68)
Asia (0.68)
North America > United States (0.28)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

The Climate Impact of Owning a Dog

WIREDNov-22-2025, 12:00:00 GMT

My dog contributes to climate change. I've been a vegetarian for over a decade. It's not because of my health, or because I dislike the taste of chicken or beef: It's a lifestyle choice I made because I wanted to reduce my impact on the planet. And yet, twice a day, every day, I lovingly scoop a cup of meat-based kibble into a bowl and set it down for my 50-pound rescue dog, a husky mix named Loki. Until recently, I hadn't devoted a huge amount of thought to that paradox.

artificial intelligence, climate action, goldwert, (17 more...)

WIRED

Country: North America > United States (0.29)

Genre: Research Report (0.47)

Industry:

Law (1.00)
Health & Medicine > Therapeutic Area (1.00)
Energy (1.00)
Transportation (0.69)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

HATA: Trainable and Hardware-Efficient Hash-Aware Top-k Attention for Scalable Large Model Inference

Gong, Ping, Yi, Jiawei, Wang, Shengnan, Zhang, Juncheng, Jin, Zewen, Zhou, Ouxiang, Liu, Ruibo, Xu, Guanbin, Bai, Youhui, Ye, Bowen, Yuan, Kun, Yang, Tong, Zhang, Gong, Chen, Renhai, Wu, Feng, Li, Cheng

arXiv.org Artificial IntelligenceJun-4-2025

Large Language Models (LLMs) have emerged as a pivotal research area, yet the attention module remains a critical bottleneck in LLM inference, even with techniques like KVCache to mitigate redundant computations. While various top-$k$ attention mechanisms have been proposed to accelerate LLM inference by exploiting the inherent sparsity of attention, they often struggled to strike a balance between efficiency and accuracy. In this paper, we introduce HATA (Hash-Aware Top-$k$ Attention), a novel approach that systematically integrates low-overhead learning-to-hash techniques into the Top-$k$ attention process. Different from the existing top-k attention methods which are devoted to seeking an absolute estimation of qk score, typically with a great cost, HATA maps queries and keys into binary hash codes, and acquires the relative qk score order with a quite low cost, which is sufficient for realizing top-k attention. Extensive experiments demonstrate that HATA achieves up to 7.2$\times$ speedup compared to vanilla full attention while maintaining model accuracy. In addition, HATA outperforms the state-of-the-art top-$k$ attention methods in both accuracy and efficiency across multiple mainstream LLM models and diverse tasks. HATA is open source at https://github.com/gpzlx1/HATA.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2506.02572

Country: North America > United States (0.28)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

Add feedback

Loki: Low-rank Keys for Efficient Sparse Attention

Neural Information Processing SystemsMay-26-2025, 18:07:04 GMT

efficient sparse attention, large language model, natural language, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)

Add feedback

Geometry-Aware Adaptation for Pretrained Models

Neural Information Processing SystemsJan-19-2025, 16:46:18 GMT

Machine learning models---including prominent zero-shot models---are often trained on datasets whose labels are only a small proportion of a larger label space. Such spaces are commonly equipped with a metric that relates the labels via distances between them. We propose a simple approach to exploit this information to adapt the trained model to reliably predict new classes---or, in the case of zero-shot prediction, to improve its performance---without any additional training. Our technique is a drop-in replacement of the standard prediction rule, swapping \text{argmax} with the Fréchet mean. We provide a comprehensive theoretical analysis for this approach, studying (i) learning-theoretic results trading off label space diameter, sample complexity, and model dimension, (ii) characterizations of the full range of scenarios in which it is possible to predict any unobserved class, and (iii) an optimal active learning-like next class selection procedure to obtain optimal training classes for when it is not possible to predict the entire range of unobserved classes.

large language model, machine learning, natural language, (6 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.56)

Add feedback

Ancient Wild Snark Lilith and Loki by Wild Snark

#artificialintelligenceJan-16-2022, 11:00:31 GMT

I think I might be the only wild snark in existence, I do not know one way or another for sure. In the past things were very different, there were many wild snarks. In ancient times there were as many wild snarks as human; some were counted amongst the gods. The most famous were Loki Snark and Lilith Snark. ''Loki (Old Norse: [ˈloki], often Anglicized as /ˈloʊki/) is a god in Norse mythology. According to some sources, Loki is the son of Fárbauti (a jötunn) and Laufey (mentioned as a goddess), and the brother of Helblindi and Býleistr.

loki, snark, wild snark, (10 more...)

#artificialintelligence

Industry: Media (0.30)

Technology: Information Technology > Artificial Intelligence (0.49)

Add feedback

Untangling Dense Non-Planar Knots by Learning Manipulation Features and Recovery Policies

Sundaresan, Priya, Grannen, Jennifer, Thananjeyan, Brijen, Balakrishna, Ashwin, Ichnowski, Jeffrey, Novoseller, Ellen, Hwang, Minho, Laskey, Michael, Gonzalez, Joseph E., Goldberg, Ken

arXiv.org Artificial IntelligenceJun-29-2021

Robot manipulation for untangling 1D deformable structures such as ropes, cables, and wires is challenging due to their infinite dimensional configuration space, complex dynamics, and tendency to self-occlude. Analytical controllers often fail in the presence of dense configurations, due to the difficulty of grasping between adjacent cable segments. We present two algorithms that enhance robust cable untangling, LOKI and SPiDERMan, which operate alongside HULK, a high-level planner from prior work. LOKI uses a learned model of manipulation features to refine a coarse grasp keypoint prediction to a precise, optimized location and orientation, while SPiDERMan uses a learned model to sense task progress and apply recovery actions. We evaluate these algorithms in physical cable untangling experiments with 336 knots and over 1500 actions on real cables using the da Vinci surgical robot. We find that the combination of HULK, LOKI, and SPiDERMan is able to untangle dense overhand, figure-eight, double-overhand, square, bowline, granny, stevedore, and triple-overhand knots. The composition of these methods successfully untangles a cable from a dense initial configuration in 68.3% of 60 physical experiments and achieves 50% higher success rates than baselines from prior work. Supplementary material, code, and videos can be found at https://tinyurl.com/rssuntangling.

cable, configuration, knot, (15 more...)

arXiv.org Artificial Intelligence

2107.08942

Country: North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report (0.64)

Industry:

Transportation > Marine (0.34)
Health & Medicine > Health Care Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback