AITopics | capt

Collaborating Authors

capt

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

All You Need is One: Capsule Prompt Tuning with a Single Vector

Neural Information Processing SystemsJun-19-2026, 00:15:59 GMT

Prompt-based learning has emerged as a parameter-efficient finetuning (PEFT) approach to facilitate Large Language Model (LLM) adaptation to downstream tasks by conditioning generation with task-aware guidance. Despite its successes, current prompt-based learning methods heavily rely on laborious grid searching for optimal prompt length and typically require considerable number of prompts, introducing additional computational burden. Worse yet, our pioneer findings indicate that the task-aware prompt design is inherently limited by its absence of instance-aware information, leading to a subtle attention interplay with the input sequence. In contrast, simply incorporating instance-aware information as a part of the guidance can enhance the prompt-tuned model performance without additional fine-tuning. Moreover, we find an interesting phenomenon, namely "attention anchor," that incorporating instance-aware tokens at the earliest position of the sequence can successfully preserve strong attention to critical structural information and exhibit more active attention interaction with all input tokens. In light of our observation, we introduce Capsule Prompt-Tuning (CaPT), an efficient and effective solution that leverages off-the-shelf, informative instance semantics into prompt-based learning. Our approach innovatively integrates both instanceaware and task-aware information in a nearly parameter-free manner (i.e., one single capsule prompt). Empirical results demonstrate that our method can exhibit superior performance across various language tasks (e.g., 84.03% average accuracy on T5-Large), serving as an "attention anchor," while enjoying high parameter efficiency (e.g., 0.003% of model parameters on Llama3.2-1B).

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: North America > United States (1.00)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

All You Need is One: Capsule Prompt Tuning with a Single Vector

Liu, Yiyang, Liang, James C., Fan, Heng, Yang, Wenhao, Cui, Yiming, Han, Xiaotian, Huang, Lifu, Liu, Dongfang, Wang, Qifan, Han, Cheng

arXiv.org Artificial IntelligenceOct-21-2025

Prompt-based learning has emerged as a parameter-efficient finetuning (PEFT) approach to facilitate Large Language Model (LLM) adaptation to downstream tasks by conditioning generation with task-aware guidance. Despite its successes, current prompt-based learning methods heavily rely on laborious grid searching for optimal prompt length and typically require considerable number of prompts, introducing additional computational burden. Worse yet, our pioneer findings indicate that the task-aware prompt design is inherently limited by its absence of instance-aware information, leading to a subtle attention interplay with the input sequence. In contrast, simply incorporating instance-aware information as a part of the guidance can enhance the prompt-tuned model performance without additional fine-tuning. Moreover, we find an interesting phenomenon, namely "attention anchor", that incorporating instance-aware tokens at the earliest position of the sequence can successfully preserve strong attention to critical structural information and exhibit more active attention interaction with all input tokens. In light of our observation, we introduce Capsule Prompt-Tuning (CaPT), an efficient and effective solution that leverages off-the-shelf, informative instance semantics into prompt-based learning. Our approach innovatively integrates both instance-aware and task-aware information in a nearly parameter-free manner (i.e., one single capsule prompt). Empirical results demonstrate that our method can exhibit superior performance across various language tasks (e.g., 84.03\% average accuracy on T5-Large), serving as an "attention anchor," while enjoying high parameter efficiency (e.g., 0.003\% of model parameters on Llama3.2-1B).

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2510.1667

Country: North America > United States (1.00)

Genre: Research Report > New Finding (1.00)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Mitigating Spurious Correlations in LLMs via Causality-Aware Post-Training

Gui, Shurui, Ji, Shuiwang

arXiv.org Artificial IntelligenceJun-12-2025

While large language models (LLMs) have demonstrated remarkable capabilities in language modeling, recent studies reveal that they often fail on out-of-distribution (OOD) samples due to spurious correlations acquired during pre-training. Here, we aim to mitigate such spurious correlations through causality-aware post-training (CAPT). By decomposing a biased prediction into two unbiased steps, known as \textit{event estimation} and \textit{event intervention}, we reduce LLMs' pre-training biases without incurring additional fine-tuning biases, thus enhancing the model's generalization ability. Experiments on the formal causal inference benchmark CLadder and the logical reasoning dataset PrOntoQA show that 3B-scale language models fine-tuned with CAPT can outperform both traditional SFT and larger LLMs on in-distribution (ID) and OOD tasks using only 100 ID fine-tuning samples, demonstrating the effectiveness and sample efficiency of CAPT.

correlation, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2506.09433

Country: North America > United States > Texas (0.28)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

CAPT: Class-Aware Prompt Tuning for Federated Long-Tailed Learning with Vision-Language Model

Hou, Shihao, Shang, Xinyi, Gowda, Shreyank N, Lu, Yang, Wu, Chao, Yan, Yan, Wang, Hanzi

arXiv.org Artificial IntelligenceMar-10-2025

Effectively handling the co-occurrence of non-IID data and long-tailed distributions remains a critical challenge in federated learning. While fine-tuning vision-language models (VLMs) like CLIP has shown to be promising in addressing non-IID data challenges, this approach leads to severe degradation of tail classes in federated long-tailed scenarios. Under the composite effects of strong non-IID data distribution and long-tailed class imbalances, VLM fine-tuning may even fail to yield any improvement. To address this issue, we propose Class-Aware Prompt Learning for Federated Long-tailed Learning (CAPT), a novel framework that leverages a pre-trained VLM to effectively handle both data heterogeneity and long-tailed distributions. CAPT introduces a dual-prompt mechanism that synergizes general and class-aware prompts, enabling the framework to capture global trends while preserving class-specific knowledge. To better aggregate and share knowledge across clients, we introduce a heterogeneity-aware client clustering strategy that groups clients based on their data distributions, enabling efficient collaboration and knowledge sharing. Extensive experiments on various long-tailed datasets with different levels of data heterogeneity demonstrate that CAPT significantly improves tail class performance without compromising overall accuracy, outperforming state-of-the-art methods in federated long-tailed learning scenarios.

capt, learning, tail class, (15 more...)

arXiv.org Artificial Intelligence

2503.06993

Country:

Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.04)
Europe > Austria (0.04)
Asia > China > Fujian Province > Xiamen (0.04)

Genre:

Research Report > Promising Solution (0.66)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.68)

Add feedback

Collision-Affording Point Trees: SIMD-Amenable Nearest Neighbors for Fast Collision Checking

Ramsey, Clayton W., Kingston, Zachary, Thomason, Wil, Kavraki, Lydia E.

arXiv.org Artificial IntelligenceJun-4-2024

Motion planning against sensor data is often a critical bottleneck in real-time robot control. For sampling-based motion planners, which are effective for high-dimensional systems such as manipulators, the most time-intensive component is collision checking. We present a novel spatial data structure, the collision-affording point tree (CAPT): an exact representation of point clouds that accelerates collision-checking queries between robots and point clouds by an order of magnitude, with an average query time of less than 10 nanoseconds on 3D scenes comprising thousands of points. With the CAPT, sampling-based planners can generate valid, high-quality paths in under a millisecond, with total end-to-end computation time faster than 60 FPS, on a single thread of a consumer-grade CPU. We also present a point cloud filtering algorithm, based on space-filling curves, which reduces the number of points in a point cloud while preserving structure. Our approach enables robots to plan at real-time speeds in sensed environments, opening up potential uses of planning for high-dimensional systems in dynamic, changing, and unmodeled environments.

artificial intelligence, planning & scheduling, point cloud, (16 more...)

arXiv.org Artificial Intelligence

2406.02807

Country: North America > Canada (0.28)

Genre: Research Report (0.82)

Industry: Energy > Oil & Gas > Upstream (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.91)

Add feedback

CAPT: Category-level Articulation Estimation from a Single Point Cloud Using Transformer

Fu, Lian, Ishikawa, Ryoichi, Sato, Yoshihiro, Oishi, Takeshi

arXiv.org Artificial IntelligenceFeb-27-2024

The ability to estimate joint parameters is essential for various applications in robotics and computer vision. In this paper, we propose CAPT: category-level articulation estimation from a point cloud using Transformer. CAPT uses an end-to-end transformer-based architecture for joint parameter and state estimation of articulated objects from a single point cloud. The proposed CAPT methods accurately estimate joint parameters and states for various articulated objects with high precision and robustness. The paper also introduces a motion loss approach, which improves articulation estimation performance by emphasizing the dynamic features of articulated objects. Additionally, the paper presents a double voting strategy to provide the framework with coarse-to-fine parameter estimation. Experimental results on several category datasets demonstrate that our methods outperform existing alternatives for articulation estimation. Our research provides a promising solution for applying Transformer-based architectures in articulated object analysis.

estimation, point cloud, transformer, (13 more...)

arXiv.org Artificial Intelligence

2402.1736

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues

Ou, Jiao, Zhang, Jinchao, Feng, Yang, Zhou, Jie

arXiv.org Artificial IntelligenceOct-30-2022

The construction of open-domain dialogue systems requires high-quality dialogue datasets. The dialogue data admits a wide variety of responses for a given dialogue history, especially responses with different semantics. However, collecting high-quality such a dataset in most scenarios is labor-intensive and time-consuming. In this paper, we propose a data augmentation method to automatically augment high-quality responses with different semantics by counterfactual inference. Specifically, given an observed dialogue, our counterfactual generation model first infers semantically different responses by replacing the observed reply perspective with substituted ones. Furthermore, our data selection method filters out detrimental augmented responses. Experimental results show that our data augmentation method can augment high-quality responses with different semantics for a given dialogue history, and can outperform competitive baselines on multiple downstream tasks.

artificial intelligence, computational linguistic, natural language, (20 more...)

arXiv.org Artificial Intelligence

2210.16838

Country:

Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > China (0.04)
North America > United States > New York > New York County > New York City (0.04)
(5 more...)

Genre: Research Report > Experimental Study (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.46)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)

Add feedback

William Shatner, TV's Capt. Kirk, blasts into space

Boston HeraldOct-13-2021, 23:10:07 GMT

bezos, shatner, star trek, (5 more...)

Boston Herald

Country: North America > United States > Texas > Culberson County > Van Horn (0.06)

Industry: Consumer Products & Services (0.34)

Technology: Information Technology > Artificial Intelligence (0.40)

Add feedback

William Shatner, TV's Capt. Kirk, blasts into space

Associated PressOct-13-2021, 16:04:05 GMT

Hollywood's Captain Kirk, 90-year-old William Shatner, blasted into space Wednesday in a convergence of science fiction and science reality, reaching the final frontier aboard a ship built by Jeff Bezos' Blue Origin company. The "Star Trek" hero and three fellow passengers hurtled to an estimated 66 miles (106 kilometers) over the West Texas desert in the fully automated capsule, then safely parachuted back to Earth in a flight that lasted just over 10 minutes. ""You have done something," an exhilarated Shatner told Bezos as he emerged from the capsule, the words spilling from him in a torrent. "What you have given me is the most profound experience." He added: "I hope I never recover from this." He said that going from the blue sky to the blackness of space was a moving experience that made him wonder, "Is that the way death is?" Shatner became the oldest person in space, eclipsing the previous record -- set by a passenger on a similar jaunt on a Bezos spaceship in July -- by eight years. The flight included about three minutes of weightlessness and a view of the curvature of the Earth. Sci-fi fans reveled in the opportunity to see the man best known as the stalwart Capt. James T. Kirk of the starship Enterprise boldly go where no star of American TV has gone before. "This is a pinch-me moment for all of us to see Capt.

artificial intelligence, shatner, william shatner, (12 more...)

Associated Press

Country:

North America > United States > Texas > Culberson County > Van Horn (0.05)
North America > United States > Florida > Brevard County > Cape Canaveral (0.05)
North America > United States > California > Los Angeles County > Los Angeles (0.05)

Industry:

Transportation > Passenger (0.49)
Transportation > Air (0.31)

Technology: Information Technology > Artificial Intelligence (0.77)

Add feedback

William Shatner, TV's Capt. Kirk, blasts into space

Boston HeraldOct-13-2021, 15:45:05 GMT

VAN HORN, Texas (AP) -- Hollywood's Captain Kirk, 90-year-old William Shatner, blasted into space Wednesday in a convergence of science fiction and science reality, reaching the final frontier aboard a ship built by Jeff Bezos' Blue Origin company. The "Star Trek" hero and three fellow passengers hurtled to an estimated 66 miles (106 kilometers) over the West Texas desert in the fully automated capsule, then safely parachuted back to Earth in a flight that lasted just over 10 minutes. "You have done something," an exhilarated Shatner told Bezos as he emerged from the capsule, the words spilling from him in a torrent. "What you have given me is the most profound experience." He added: "I hope I never recover from this."

shatner, star trek, william shatner, (11 more...)

Boston Herald

Country:

North America > United States > Texas > Culberson County > Van Horn (0.25)
North America > United States > Florida > Brevard County > Cape Canaveral (0.05)
North America > United States > California > Los Angeles County > Los Angeles (0.05)

Industry: Transportation > Air (0.31)

Technology: Information Technology > Artificial Intelligence (0.37)

Add feedback