Kruengkrai, Canasai
Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression
Yoshida, Kai, Mizukami, Masahiro, Kawano, Seiya, Kruengkrai, Canasai, Sugiyama, Hiroaki, Yoshino, Koichiro
To improve user engagement during conversations with dialogue systems, we must improve not only individual dialogue responses but also dialogue impressions such as consistency, personality, and empathy throughout the entire dialogue. While such dialogue systems have developed rapidly with the help of large language models (LLMs), reinforcement learning from AI feedback (RLAIF) has attracted attention as a way to align LLM-based dialogue models with such dialogue impressions. In RLAIF, a reward model based on another LLM creates a training signal for the LLM-based dialogue model using zero-shot/few-shot prompting techniques. However, evaluating an entire dialogue only by prompting LLMs is challenging. In this study, we prepared reward models for evaluating dialogue responses by supervised fine-tuning (SFT) of LLMs on 12 metrics related to the impression of the entire dialogue. We then tuned our dialogue models using the reward model signals as feedback to improve the impression of the system. The results of automatic and human evaluations showed that tuning the dialogue model with our dialogue-impression reward models improved both the individual metrics and the naturalness of the dialogue responses.
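As an illustration of how per-metric reward signals might be combined into a single training signal, here is a minimal Python sketch, assuming a simple weighted-sum aggregation; the metric names and weighting scheme are illustrative assumptions, not the paper's specification.

```python
# Minimal sketch (not the authors' code): collapsing per-metric reward-model
# scores into one scalar reward for policy optimization. The paper uses 12
# impression metrics; three are shown here for brevity.

from typing import Dict, Optional

def aggregate_reward(scores: Dict[str, float],
                     weights: Optional[Dict[str, float]] = None) -> float:
    """Combine per-metric reward-model scores into one scalar reward."""
    if weights is None:
        # Default to a uniform weighting over the available metrics.
        weights = {m: 1.0 / len(scores) for m in scores}
    return sum(weights[m] * s for m, s in scores.items())

# Example: scores produced by SFT-tuned reward models for one dialogue.
scores = {"consistency": 0.8, "personality": 0.6, "empathy": 0.7}
print(aggregate_reward(scores))  # -> ~0.70
```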
Bridging Textual and Tabular Worlds for Fact Verification: A Lightweight, Attention-Based Model
Varnosfaderani, Shirin Dabbaghi, Kruengkrai, Canasai, Yahyapour, Ramin, Yamagishi, Junichi
FEVEROUS is a benchmark and research initiative focused on fact extraction and verification tasks involving unstructured text and structured tabular data. Existing works on FEVEROUS often rely on extensive preprocessing and rule-based transformations of the data, leading to potential context loss or misleading encodings. This paper introduces a simple yet powerful model that eliminates the need for modality conversion, thereby preserving the context of the original evidence. By leveraging models pre-trained on diverse text and tabular datasets and incorporating a lightweight attention-based mechanism, our approach efficiently exploits latent connections between the different data types, yielding comprehensive and reliable verdict predictions. The model's modular structure adeptly manages multi-modal information, ensuring that the integrity and authenticity of the original evidence remain uncompromised. Comparative analyses show that our approach performs competitively, closely matching the top-tier models on the FEVEROUS benchmark.
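To make the fusion idea concrete, the following is a minimal PyTorch sketch, assuming one cross-attention layer in which text-encoder states attend over table-encoder states before pooling and classification; the dimensions, pooling choice, and label set are assumptions, not the paper's exact architecture.

```python
# Minimal sketch (illustrative, not the paper's model): fusing a text
# encoder's hidden states with a table encoder's hidden states via a
# lightweight cross-attention layer, then predicting a verdict.

import torch
import torch.nn as nn

class AttentionFusionClassifier(nn.Module):
    def __init__(self, dim: int = 768, num_heads: int = 8, num_labels: int = 3):
        super().__init__()
        # Text tokens (queries) attend over table tokens (keys/values).
        self.cross_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.classifier = nn.Linear(dim, num_labels)

    def forward(self, text_hidden: torch.Tensor, table_hidden: torch.Tensor):
        # text_hidden:  (B, T_text, dim)  from a text model (e.g., RoBERTa)
        # table_hidden: (B, T_table, dim) from a table model (e.g., TAPAS)
        fused, _ = self.cross_attn(text_hidden, table_hidden, table_hidden)
        pooled = fused.mean(dim=1)      # simple mean pooling over text tokens
        return self.classifier(pooled)  # SUPPORTS / REFUTES / NEI logits

model = AttentionFusionClassifier()
logits = model(torch.randn(2, 16, 768), torch.randn(2, 32, 768))
print(logits.shape)  # torch.Size([2, 3])
```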
XFEVER: Exploring Fact Verification across Languages
Chang, Yi-Chen, Kruengkrai, Canasai, Yamagishi, Junichi
This paper introduces the Cross-lingual Fact Extraction and VERification (XFEVER) dataset, designed for benchmarking fact verification models across different languages. We constructed it by translating the claim and evidence texts of the Fact Extraction and VERification (FEVER) dataset into six languages. The training and development sets were translated by machine translation, whereas the test set includes both professionally translated and machine-translated texts. Using the XFEVER dataset, we define two cross-lingual fact verification scenarios, zero-shot learning and translate-train learning, and propose baseline models for each. Experimental results show that a multilingual language model can be used to build fact verification models in different languages efficiently. However, performance varies by language and is somewhat inferior to the English case. We also found that model miscalibration can be effectively mitigated by considering the prediction similarity between English and the target language. The XFEVER dataset, code, and model checkpoints are available at https://github.com/nii-yamagishilab/xfever.
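As a concrete illustration of the zero-shot scenario, here is a minimal sketch that scores a claim-evidence pair with a multilingual encoder; the checkpoint name is a placeholder (the model would first be fine-tuned on English FEVER), and this is not the released XFEVER code.

```python
# Minimal sketch (assumptions, not the released code): in zero-shot
# learning, a model fine-tuned on English claim-evidence pairs is applied
# directly to a target language. The classification head here is untrained.

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

name = "xlm-roberta-base"  # placeholder; fine-tune on English FEVER first
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForSequenceClassification.from_pretrained(name, num_labels=3)

claim = "La tour Eiffel se trouve à Paris."       # French claim
evidence = "La tour Eiffel est située à Paris."   # French evidence
inputs = tokenizer(claim, evidence, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits
print(logits.softmax(dim=-1))  # P(SUPPORTS / REFUTES / NEI)
```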
DAGA: Data Augmentation with a Generation Approach for Low-resource Tagging Tasks
Ding, Bosheng, Liu, Linlin, Bing, Lidong, Kruengkrai, Canasai, Nguyen, Thien Hai, Joty, Shafiq, Si, Luo, Miao, Chunyan
Data augmentation techniques have been widely used to improve machine learning performance because they enhance the generalization capability of models. In this work, to generate high-quality synthetic data for low-resource tagging tasks, we propose a novel augmentation method that trains language models on linearized labeled sentences. Our method is applicable to both supervised and semi-supervised settings. For the supervised setting, we conduct extensive experiments on named entity recognition (NER), part-of-speech (POS) tagging, and end-to-end target-based sentiment analysis (E2E-TBSA). For the semi-supervised setting, we evaluate our method on NER given unlabeled data only, and given unlabeled data plus a knowledge base. The results show that our method consistently outperforms the baselines, particularly when the amount of gold training data is small.
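The core linearization step can be illustrated with a short sketch, based on our reading of the method: label tokens are interleaved with words so that an ordinary language model can be trained on the resulting sequences, and sampled sequences are de-linearized back into (word, tag) pairs; the exact tag-placement convention is an assumption.

```python
# Minimal sketch of DAGA-style linearization/de-linearization. Tags are
# inserted before their words; the trained LM samples such sequences,
# which are then converted back into labeled sentences.

def linearize(words, tags):
    """Insert each non-O tag token before its word, e.g. 'B-PER John'."""
    out = []
    for w, t in zip(words, tags):
        if t != "O":
            out.append(t)
        out.append(w)
    return out

def delinearize(tokens, tagset):
    """Recover (word, tag) pairs from a sampled token sequence."""
    pairs, pending = [], "O"
    for tok in tokens:
        if tok in tagset:
            pending = tok      # remember the tag for the next word
        else:
            pairs.append((tok, pending))
            pending = "O"
    return pairs

words = ["John", "lives", "in", "Paris"]
tags = ["B-PER", "O", "O", "B-LOC"]
lin = linearize(words, tags)
print(lin)  # ['B-PER', 'John', 'lives', 'in', 'B-LOC', 'Paris']
print(delinearize(lin, {"B-PER", "B-LOC"}))
# [('John', 'B-PER'), ('lives', 'O'), ('in', 'O'), ('Paris', 'B-LOC')]
```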
Semi-Distantly Supervised Neural Model for Generating Compact Answers to Open-Domain Why Questions
Ishida, Ryo, Torisawa, Kentaro, Oh, Jong-Hoon, Iida, Ryu, Kruengkrai, Canasai, Kloetzer, Julien (National Institute of Information and Communications Technology)
This paper proposes a neural network-based method for generating compact answers to open-domain why-questions (e.g., "Why was Mr. Trump elected as the president of the US?"). Unlike factoid question answering methods that provide short text spans as answers, existing work on why-question answering has aimed at answering questions by retrieving relatively long text passages, each of which often consists of several sentences, from a text archive. While the actual answer to a why-question may be expressed over several consecutive sentences, these often contain redundant and/or unrelated parts. Such answers are not suitable for spoken dialogue systems and smart speakers such as Amazon Echo, which have recently received much attention. In this work, we generate non-redundant compact answers to why-questions from answer passages retrieved from a very large web corpus (4 billion web pages) by an existing open-domain why-question answering system, using a novel neural network obtained by extending existing summarization methods. We also automatically generate training data using a large number of causal relations extracted from the same 4 billion web pages by an existing supervised causality recognizer. This data is used to train our neural network together with manually created training data. Through a series of experiments, we show that both our novel neural network and the auto-generated training data improve the quality of the generated answers in both ROUGE score and a subjective evaluation.
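A hypothetical sketch of the data-generation idea follows, assuming each extracted causal relation is turned into a silver (question, passage, compact answer) triple for training the answer generator; the field names and templates are invented for illustration and are not the authors' pipeline.

```python
# Minimal sketch (an assumption about the data-generation idea): turning an
# automatically extracted causal relation into one silver training example
# for a summarization-style compact-answer generator.

def make_silver_example(question_topic, cause, effect, passage):
    """Build one (question, passage, compact answer) training triple."""
    question = f"Why {question_topic}?"
    compact_answer = f"{effect} because {cause}."
    return {"question": question, "passage": passage, "answer": compact_answer}

ex = make_silver_example(
    question_topic="do people die of lung cancer",
    cause="they smoke cigarettes",
    effect="People die of lung cancer",
    passage="Smoking damages lung tissue. People die of lung cancer because "
            "they smoke cigarettes, among other risk factors.",
)
print(ex["answer"])  # People die of lung cancer because they smoke cigarettes.
```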
Improving Event Causality Recognition with Multiple Background Knowledge Sources Using Multi-Column Convolutional Neural Networks
Kruengkrai, Canasai, Torisawa, Kentaro, Hashimoto, Chikara, Kloetzer, Julien, Oh, Jong-Hoon, Tanaka, Masahiro (National Institute of Information and Communications Technology)
We propose a method for recognizing event causalities such as "smoke cigarettes" → "die of lung cancer" using background knowledge taken from web texts as well as the original sentences from which the causality candidates were extracted. We retrieve texts related to our event causality candidates from four billion web pages by three distinct methods, including a why-question answering system, and feed them to our multi-column convolutional neural networks. This allows us to identify useful background knowledge scattered in web texts and effectively exploit the identified knowledge to recognize event causalities. We empirically show that the combination of our neural network architecture and background knowledge significantly improves average precision, while the previous state-of-the-art method gains only a small benefit from such background knowledge.
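For illustration, a minimal PyTorch sketch of a multi-column CNN follows, assuming one column per input text (the original sentence plus background texts from the different retrieval methods), max-over-time pooling per column, and concatenation for a binary causality decision; the dimensions and column count are illustrative, not the paper's exact configuration.

```python
# Minimal sketch (illustrative, not the paper's exact network): each column
# convolves one embedded input sequence; pooled column features are
# concatenated and classified as causal / non-causal.

import torch
import torch.nn as nn

class Column(nn.Module):
    def __init__(self, emb_dim=100, filters=64, kernel=3):
        super().__init__()
        self.conv = nn.Conv1d(emb_dim, filters, kernel)

    def forward(self, x):                        # x: (B, seq_len, emb_dim)
        h = self.conv(x.transpose(1, 2))         # (B, filters, L')
        return torch.relu(h).max(dim=2).values   # max-over-time pooling

class MultiColumnCNN(nn.Module):
    def __init__(self, num_columns=4, filters=64):
        super().__init__()
        self.columns = nn.ModuleList(Column(filters=filters)
                                     for _ in range(num_columns))
        self.out = nn.Linear(num_columns * filters, 2)  # causal / non-causal

    def forward(self, inputs):                   # one embedded seq per column
        feats = [col(x) for col, x in zip(self.columns, inputs)]
        return self.out(torch.cat(feats, dim=1))

model = MultiColumnCNN()
batch = [torch.randn(8, 50, 100) for _ in range(4)]
print(model(batch).shape)  # torch.Size([8, 2])
```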