AITopics | ftr

Collaborating Authors

ftr

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Focus-Then-Reuse: Fast Adaptation in Visual Perturbation Environments

Neural Information Processing SystemsJun-15-2026, 19:27:01 GMT

Visual reinforcement learning has shown promise in various real-world applications. However, deploying policies in complex real-world environments with visual perturbations remains a significant challenge. We notice that humans tend to filter information at the object level prior to decision-making, facilitating efficient skill transfer across different contexts. Inspired by this, we introduce Focus-ThenReuse (FTR), a method utilizing a novel object selection mechanism to focus on task-relevant objects, and directly reuse the simulation-trained policy on them.

large language model, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Genre: Research Report (0.93)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

Unleashing the True Potential of LLMs: A Feedback-Triggered Self-Correction with Long-Term Multipath Decoding

Li, Jipeng, Gao, Zeyu, Qi, Yubin, Dong, Hande, Chen, Weijian, Lin, Qiang

arXiv.org Artificial IntelligenceSep-10-2025

Large Language Models (LLMs) have achieved remarkable performance across diverse tasks, yet their susceptibility to generating incorrect content during inference remains a critical unsolved challenge. While self-correction methods offer potential solutions, their effectiveness is hindered by two inherent limitations: (1) the absence of reliable guidance signals for error localization, and (2) the restricted reasoning depth imposed by conventional next-token decoding paradigms. To address these issues, we propose Feedback-Triggered Regeneration (FTR), a novel framework that synergizes user feedback with enhanced decoding dynamics. Specifically, FTR activates response regeneration only upon receiving negative user feedback, thereby circumventing error propagation from faulty self-assessment while preserving originally correct outputs. Furthermore, we introduce Long-Term Multipath (LTM) decoding, which enables systematic exploration of multiple reasoning trajectories through delayed sequence evaluation, effectively overcoming the myopic decision-making characteristic of standard next-token prediction. Extensive experiments on mathematical reasoning and code generation benchmarks demonstrate that our framework achieves consistent and significant improvements over state-of-the-art prompt-based self-correction methods.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2509.07676

Genre: Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Add feedback

An Empirical Study of Causal Relation Extraction Transfer: Design and Data

Anuyah, Sydney, Vanschaik, Jack, Jain, Palak, Lehman, Sawyer, Chakraborty, Sunandan

arXiv.org Artificial IntelligenceMar-8-2025

We conduct an empirical analysis of neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that a relatively straightforward BioBERT-BiGRU relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. Furthermore, we introduce a metric for evaluating transfer performance, $F1_{phrase}$ that emphasizes noun phrase localization rather than directly matching target tags. Using this metric, we can conduct data transfer experiments, ultimately revealing that augmentation with data with varying domains and annotation styles can improve performance. Data augmentation is especially beneficial when an adequate proportion of implicitly and explicitly causal sentences are included.

dataset, extraction, relation extraction, (13 more...)

arXiv.org Artificial Intelligence

2503.06076

Country: North America > United States > Indiana > Marion County > Indianapolis (0.04)

Genre: Research Report > Experimental Study (0.47)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Ranking Large Language Models without Ground Truth

Dhurandhar, Amit, Nair, Rahul, Singh, Moninder, Daly, Elizabeth, Ramamurthy, Karthikeyan Natesan

arXiv.org Artificial IntelligenceJun-10-2024

Evaluation and ranking of large language models (LLMs) has become an important problem with the proliferation of these models and their impact. Evaluation methods either require human responses which are expensive to acquire or use pairs of LLMs to evaluate each other which can be unreliable. In this paper, we provide a novel perspective where, given a dataset of prompts (viz. questions, instructions, etc.) and a set of LLMs, we rank them without access to any ground truth or reference responses. Inspired by real life where both an expert and a knowledgeable person can identify a novice our main idea is to consider triplets of models, where each one of them evaluates the other two, correctly identifying the worst model in the triplet with high probability. We also analyze our idea and provide sufficient conditions for it to succeed. Applying this idea repeatedly, we propose two methods to rank LLMs. In experiments on different generative tasks (summarization, multiple-choice, and dialog), our methods reliably recover close to true rankings without reference data. This points to a viable low-resource mechanism for practical use.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2402.1486

Country:

North America > United States (0.04)
North America > Canada > Ontario > Toronto (0.04)
North America > Canada > Ontario > National Capital Region > Ottawa (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Okay, Let's Do This! Modeling Event Coreference with Generated Rationales and Knowledge Distillation

Nath, Abhijnan, Manafi, Shadi, Chelle, Avyakta, Krishnaswamy, Nikhil

arXiv.org Artificial IntelligenceApr-4-2024

In NLP, Event Coreference Resolution (ECR) is the task of connecting event clusters that refer to the same underlying real-life event, usually via neural systems. In this work, we investigate using abductive free-text rationales (FTRs) generated by modern autoregressive LLMs as distant supervision of smaller student models for cross-document coreference (CDCR) of events. We implement novel rationale-oriented event clustering and knowledge distillation methods for event coreference scoring that leverage enriched information from the FTRs for improved CDCR without additional annotation or expensive document clustering. Our model using coreference specific knowledge distillation achieves SOTA B3 F1 on the ECB+ and GVC corpora and we establish a new baseline on the AIDA Phase 1 corpus. Our code can be found at https://github.com/csu-signal/llama_cdcr

computational linguistic, information, rationale, (15 more...)

arXiv.org Artificial Intelligence

2404.03196

Country:

Europe > Ukraine (0.14)
Asia > Russia (0.14)
North America > United States > Washington > King County > Seattle (0.04)
(14 more...)

Genre: Research Report (0.82)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.68)
Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion

Choi, Ha-Yeong, Lee, Sang-Hoon, Lee, Seong-Whan

arXiv.org Artificial IntelligenceMay-25-2023

Diffusion-based generative models have exhibited powerful generative performance in recent years. However, as many attributes exist in the data distribution and owing to several limitations of sharing the model parameters across all levels of the generation process, it remains challenging to control specific styles for each attribute. To address the above problem, this paper presents decoupled denoising diffusion models (DDDMs) with disentangled representations, which can control the style for each attribute in generative models. We apply DDDMs to voice conversion (VC) tasks to address the challenges of disentangling and controlling each speech attribute (e.g., linguistic information, intonation, and timbre). First, we use a self-supervised representation to disentangle the speech representation. Subsequently, the DDDMs are applied to resynthesize the speech from the disentangled representations for denoising with respect to each attribute. Moreover, we also propose the prior mixup for robust voice style transfer, which uses the converted representation of the mixed style as a prior distribution for the diffusion models. The experimental results reveal that our method outperforms publicly available VC models. Furthermore, we show that our method provides robust generative performance regardless of the model size. Audio samples are available https://hayeong0.github.io/DDDM-VC-demo/.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2305.15816

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report (0.81)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

KNIFE: Distilling Reasoning Knowledge From Free-Text Rationales

Chan, Aaron, Zeng, Zhiyuan, Lake, Wyatt, Joshi, Brihi, Chen, Hanjie, Ren, Xiang

arXiv.org Artificial IntelligenceMay-21-2023

Language models (LMs) have yielded impressive results on many language reasoning tasks, but their unexpected errors raise doubts about their reasoning abilities. In light of this, there is growing interest in finetuning/prompting LMs with both task instances and their associated free-text rationales (FTRs), which explain the correct reasoning process for predicting the correct task output (i.e., how to be "right for the right reasons"). However, existing finetuning methods fail to improve LM performance, while prompting needs prohibitively large (i.e., >50B) LMs to work well. We propose KNIFE, which shows that reasoning knowledge can be effectively distilled from FTRs into a small (i.e., <1B) LM and improve the LM's performance. First, KNIFE finetunes a teacher LM (given task input and FTR) to predict the task output, transferring reasoning knowledge from the FTRs to the teacher's hidden states. Second, KNIFE finetunes a student LM (given task input only) such that its hidden states are aligned with the teacher's. Thus, the student is endowed with reasoning knowledge but can be used for inference without direct FTR input. On two question-answering datasets, KNIFE outperforms various finetuning and prompting baselines in fully-supervised and low-resource settings. Also, we observe that FTR quality is crucial to KNIFE's performance.

ftr, knowledge, reasoning knowledge, (14 more...)

arXiv.org Artificial Intelligence

2212.09721

Country:

North America > United States > California (0.14)
North America > United States > Virginia (0.04)

Genre: Research Report (0.40)

Industry: Education (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

DeepBrain AI Joins AWS Partner Network

#artificialintelligenceFeb-21-2023, 13:10:20 GMT

PALO ALTO, CA, Feb 21, 2023 – DeepBrain AI, a deep-learning based video synthesis startup company, announced that its solutions, AI Human and AI Studios, have successfully completed Amazon Web Services (AWS) Foundational Technical Review (FTR) and the company has joined the AWS Partner Network (APN). DeepBrain AI has successfully undergone the AWS Foundational Technical Review (FTR), which enables members of the Amazon Web Services (AWS) Partner Network (APN) to detect and address potential vulnerabilities in their solutions by utilizing the AWS Well-Architected Framework. The qualified solution AI Human helps customers utilize conversational virtual AI human in their business such as AI banker, AI tutor, etc. This solution is based on various interactive AI technologies that combines voice and video synthesis, voice recognition technologies, and natural language processing (NLP). AI Studios is a SaaS based text-to-video production tool that allows users to create interactive AI human video by texting, without studio, lighting, camera, set-staff, and even the video host.

aw partner network, deepbrain ai, partner network, (8 more...)

#artificialintelligence

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.26)
Asia > South Korea (0.18)

Genre: Press Release (0.97)

Industry: Information Technology (0.83)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Communications > Web (0.83)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)

Add feedback