Goto

Collaborating Authors

 lite




Supplementary Materialfor " K - LITE: Learning Transferable Visual Modelswith External Knowledge "

Neural Information Processing Systems

In many of the species the breeding lumage of the males is ightly oloured(Wiki knowledge improves performance) t and sour soup Any one of several oups, served in various Asian cuisines, hich are both spicy and sourobster bisque: thick reamy soup ade from ish, shellfish, meat or egetablesafflesflat pastry pressed with rid pattern often eaten hot with utter and / or honey or syruphicken quesadilla A Mexican dish made y filling a tortilla with cheese and ometimes other ingredients and then ooking it until the cheese is melted.pets Ukraine: A country in Eastern Europe; was long part of the Russian Empire and Austro-Hungarian Empire, then of the Soviet Union. China: a communist nation that covers a vast territory in eastern Asia; the most populous country in the world1: the smallest whole number or a numeral representing this number0: a mathematical element that when added to another number yields the same number Lymph node containing metastatic tumor tissue: Thin, woven, gauze-like fabric.Lymph node: Each of the small oval bodies of the lymphatic system, distributed along the lymphatic vessels, that are clustered in the armpits, groin, neck, chest and abdomen. They act as filters, with an internal honeycomb of connective tissue filled with lymphocytes and macrophages that collect and destroy bacteria, viruses and foreign matter from lymph.


Memory Efficient Meta-Learning with Large Images

Neural Information Processing Systems

Meta learning approaches to few-shot classification are computationally efficient at test time, requiring just a few optimization steps or single forward pass to learn a new task, but they remain highly memory-intensive to train. This limitation arises because a task's entire support set, which can contain up to 1000 images, must be processed before an optimization step can be taken. Harnessing the performance gains offered by large images thus requires either parallelizing the meta-learner across multiple GPUs, which may not be available, or trade-offs between task and image size when memory constraints apply. We improve on both options by proposing LITE, a general and memory efficient episodic training scheme that enables meta-training on large tasks composed of large images on a single GPU. We achieve this by observing that the gradients for a task can be decomposed into a sum of gradients over the task's training images. This enables us to perform a forward pass on a task's entire training set but realize significant memory savings by back-propagating only a random subset of these images which we show is an unbiased approximation of the full gradient. We use LITE to train meta-learners and demonstrate new state-of-the-art accuracy on the real-world ORBIT benchmark and 3 of the 4 parts of the challenging VTAB+MD benchmark relative to leading meta-learners. LITE also enables meta-learners to be competitive with transfer learning approaches but at a fraction of the test-time computational cost, thus serving as a counterpoint to the recent narrative that transfer learning is all you need for few-shot classification.


Gear News of the Week: Withings Launches Its Pee Scanner, and Samsung Shows Off a Trifold Phone

WIRED

Plus: Supercute kei cars from Honda and BYD, Insta360 has a cheaper 360 camera, and Nothing's latest phone won't be coming to the US, while the OnePlus 15 gets a launch date. All products featured on WIRED are independently selected by our editors. However, we may receive compensation from retailers and/or from purchases of products through these links. A few weeks ago, bathroom and plumbing company Kohler debuted the Dekoda, a health and wellness sensor that lives on your toilet bowl and records signs of your gut health and hydration. Now, Withings has launched the U-Scan.



UniMIC: Token-Based Multimodal Interactive Coding for Human-AI Collaboration

Mao, Qi, Yang, Tinghan, Li, Jiahao, Li, Bin, Jin, Libiao, Lu, Yan

arXiv.org Artificial Intelligence

The rapid progress of Large Multimodal Models (LMMs) and cloud-based AI agents is transforming human-AI collaboration into bidirectional, multimodal interaction. However, existing codecs remain optimized for unimodal, one-way communication, resulting in repeated degradation under conventional compress-transmit-reconstruct pipelines. To address this limitation, we propose UniMIC, a Unified token-based Multimodal Interactive Coding framework that bridges edge devices and cloud AI agents. Instead of transmitting raw pixels or plain text, UniMIC employs compact tokenized representations as the communication medium, enabling efficient low-bitrate transmission while maintaining compatibility with LMMs. To further enhance compression, lightweight Transformer-based entropy models with scenario-specific designs-generic, masked, and text-conditioned-effectively minimize inter-token redundancy. Extensive experiments on text-to-image generation, text-guided inpainting, outpainting, and visual question answering show that UniMIC achieves substantial bitrate savings and remains robust even at ultra-low bitrates (<0.05bpp), without compromising downstream task performance. These results establish UniMIC as a practical and forward-looking paradigm for next-generation multimodal interactive communication.


Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems

Martinez, Matias, Franch, Xavier

arXiv.org Artificial Intelligence

The rapid progress in Automated Program Repair (APR) has been driven by advances in AI, particularly large language models (LLMs) and agent-based systems. SWE-Bench is a recent benchmark designed to evaluate LLM-based repair systems using real issues and pull requests mined from 12 popular open-source Python repositories. Its public leaderboards -- SWE-Bench Lite and SWE-Bench Verified -- have become central platforms for tracking progress and comparing solutions. However, because the submission process does not require detailed documentation, the architectural design and origin of many solutions remain unclear. In this paper, we present the first comprehensive study of all submissions to the SWE-Bench Lite (79 entries) and Verified (99 entries) leaderboards, analyzing 80 unique approaches across dimensions such as submitter type, product availability, LLM usage, and system architecture. Our findings reveal the dominance of proprietary LLMs (especially Claude 3.5), the presence of both agentic and non-agentic designs, and a contributor base spanning from individual developers to large tech companies.



LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments

Chen, Junhao, Zhang, Zhen, Zhu, Chengrui, Hou, Xiaojun, Hu, Tianyang, Wu, Huifeng, Liu, Yong

arXiv.org Artificial Intelligence

-- This work focuses on multi-floor indoor exploration, which remains an open area of research. Compared to traditional methods, recent learning-based explorers have demonstrated significant potential due to their robust environmental learning and modeling capabilities, but most are restricted to 2D environments. In this paper, we proposed a learning-integrated topological explorer, LITE, for multi-floor indoor environments. As we incrementally build floor-stair topology in exploration using YOLO11-based instance segmentation model, the agent can transition between floors through a finite state machine. Additionally, we implement an attention-based 2D exploration policy that utilizes an attention mechanism to capture spatial dependencies between different regions, thereby determining the next global goal for more efficient exploration. Extensive comparison and ablation studies conducted on the HM3D and MP3D datasets demonstrate that our proposed 2D exploration policy significantly outperforms all baseline explorers in terms of exploration efficiency. Furthermore, experiments in several 3D multi-floor environments indicate that our framework is compatible with various 2D exploration methods, facilitating effective multi-floor indoor exploration. I. INTRODUCTION Autonomous exploration is a fundamental problem in the development of embodied intelligence and plays a crucial role in uncertain scenarios such as search and rescue [1], scene reconstruction [2], and extraterrestrial planetary exploration [3].