AITopics | Tseng, Bo-Hsiang

Collaborating Authors

Tseng, Bo-Hsiang

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking

Kulkarni, Atharva, Tseng, Bo-Hsiang, Moniz, Joel Ruben Antony, Piraviperumal, Dhivya, Yu, Hong, Bhargava, Shruti

arXiv.org Artificial IntelligenceFeb-3-2024

In-context learning with Large Language Models (LLMs) has emerged as a promising avenue of research in Dialog State Tracking (DST). However, the best-performing in-context learning methods involve retrieving and adding similar examples to the prompt, requiring access to labeled training data. Procuring such training data for a wide range of domains and applications is time-consuming, expensive, and, at times, infeasible. While zero-shot learning requires no training data, it significantly lags behind the few-shot setup. Thus, `\textit{Can we efficiently generate synthetic data for any dialogue schema to enable few-shot prompting?}' Addressing this question, we propose \method, a data generation framework tailored for DST, utilizing LLMs. Our approach only requires the dialogue schema and a few hand-crafted dialogue templates to synthesize natural, coherent, and free-flowing dialogues with DST annotations. Few-shot learning using data from {\method} results in $4-5%$ improvement in Joint Goal Accuracy over the zero-shot baseline on MultiWOZ 2.1 and 2.4. Remarkably, our few-shot learning approach recovers nearly $98%$ of the performance compared to the few-shot setup using human-annotated training data. Our synthetic data and code can be accessed at https://github.com/apple/ml-synthdst

computational linguistic, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2402.02285

Country:

Europe (1.00)
Asia > Middle East > UAE (0.14)
North America > United States > Pennsylvania (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Can Large Language Models Understand Context?

Zhu, Yilun, Moniz, Joel Ruben Antony, Bhargava, Shruti, Lu, Jiarui, Piraviperumal, Dhivya, Li, Site, Zhang, Yuan, Yu, Hong, Tseng, Bo-Hsiang

arXiv.org Artificial IntelligenceFeb-1-2024

Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent. However, though the evaluation of LLMs encompasses various domains within the realm of Natural Language Processing, limited attention has been paid to probing their linguistic capability of understanding contextual features. This paper introduces a context understanding benchmark by adapting existing datasets to suit the evaluation of generative models. This benchmark comprises of four distinct tasks and nine datasets, all featuring prompts designed to assess the models' ability to understand context. First, we evaluate the performance of LLMs under the in-context learning pretraining scenario. Experimental results indicate that pre-trained dense models struggle with understanding more nuanced contextual features when compared to state-of-the-art fine-tuned models. Second, as LLM compression holds growing significance in both research and real-world applications, we assess the context understanding of quantized models under in-context-learning settings. We find that 3-bit post-training quantization leads to varying degrees of performance reduction on our benchmark. We conduct an extensive analysis of these scenarios to substantiate our experimental results.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2402.00858

Country:

Europe (1.00)
Asia (1.00)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (1.00)

Industry:

Education (0.66)
Consumer Products & Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

MARRS: Multimodal Reference Resolution System

Ates, Halim Cagri, Bhargava, Shruti, Li, Site, Lu, Jiarui, Maddula, Siddhardha, Moniz, Joel Ruben Antony, Nalamalapu, Anil Kumar, Nguyen, Roman Hoang, Ozyildirim, Melis, Patel, Alkesh, Piraviperumal, Dhivya, Renkens, Vincent, Samal, Ankit, Tran, Thy, Tseng, Bo-Hsiang, Yu, Hong, Zhang, Yuan, Zou, Rong

arXiv.org Artificial IntelligenceNov-2-2023

Successfully handling context is essential for any dialog understanding task. This context maybe be conversational (relying on previous user queries or system responses), visual (relying on what the user sees, for example, on their screen), or background (based on signals such as a ringing alarm or playing music). In this work, we present an overview of MARRS, or Multimodal Reference Resolution System, an on-device framework within a Natural Language Understanding system, responsible for handling conversational, visual and background context. In particular, we present different machine learning models to enable handing contextual queries; specifically, one to enable reference resolution, and one to handle context via query rewriting. We also describe how these models complement each other to form a unified, coherent, lightweight system that can understand context while preserving user privacy.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2023.crac-main.7

2311.0165

Country: North America > United States (0.46)

Genre:

Overview (0.55)
Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.30)

Add feedback

Grounding Description-Driven Dialogue State Trackers with Knowledge-Seeking Turns

Coca, Alexandru, Tseng, Bo-Hsiang, Chen, Jinghong, Lin, Weizhe, Zhang, Weixuan, Anders, Tisha, Byrne, Bill

arXiv.org Artificial IntelligenceSep-23-2023

Schema-guided dialogue state trackers can generalise to new domains without further training, yet they are sensitive to the writing style of the schemata. Augmenting the training set with human or synthetic schema paraphrases improves the model robustness to these variations but can be either costly or difficult to control. We propose to circumvent these issues by grounding the state tracking model in knowledge-seeking turns collected from the dialogue corpus as well as the schema. Including these turns in prompts during finetuning and inference leads to marked improvements in model robustness, as demonstrated by large average joint goal accuracy and schema sensitivity improvements on SGD and SGD-X.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2309.13448

Country:

North America > United States (0.68)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Genre:

Research Report (1.00)
Overview (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.86)

Add feedback

5IDER: Unified Query Rewriting for Steering, Intent Carryover, Disfluencies, Entity Carryover and Repair

Lu, Jiarui, Tseng, Bo-Hsiang, Moniz, Joel Ruben Antony, Li, Site, Zhu, Xueyun, Yu, Hong, Akbacak, Murat

arXiv.org Artificial IntelligenceJun-2-2023

Providing voice assistants the ability to navigate multi-turn conversations is a challenging problem. Handling multi-turn interactions requires the system to understand various conversational use-cases, such as steering, intent carryover, disfluencies, entity carryover, and repair. The complexity of this problem is compounded by the fact that these use-cases mix with each other, often appearing simultaneously in natural language. This work proposes a non-autoregressive query rewriting architecture that can handle not only the five aforementioned tasks, but also complex compositions of these use-cases. We show that our proposed model has competitive single task performance compared to the baseline approach, and even outperforms a fine-tuned T5 model in use-case compositions, despite being 15 times smaller in parameters and 25 times faster in latency.

artificial intelligence, entity carryover and repair, unified query rewriting, (3 more...)

arXiv.org Artificial Intelligence

doi: 10.21437/Interspeech.2023-1038

2306.01855

Genre: Research Report (0.40)

Technology:

Information Technology > Information Management > Search (0.60)
Information Technology > Artificial Intelligence (0.53)

Add feedback

Transferable Dialogue Systems and User Simulators

Tseng, Bo-Hsiang, Dai, Yinpei, Kreyssig, Florian, Byrne, Bill

arXiv.org Artificial IntelligenceJul-25-2021

One of the difficulties in training dialogue systems is the lack of training data. We explore the possibility of creating dialogue data through the interaction between a dialogue system and a user simulator. Our goal is to develop a modelling framework that can incorporate new dialogue scenarios through self-play between the two agents. In this framework, we first pre-train the two agents on a collection of source domain dialogues, which equips the agents to converse with each other via natural language. With further fine-tuning on a small amount of target domain data, the agents continue to interact with the aim of improving their behaviors using reinforcement learning with structured reward functions. In experiments on the MultiWOZ dataset, two practical transfer learning problems are investigated: 1) domain adaptation and 2) single-to-multiple domain transfer. We demonstrate that the proposed framework is highly effective in bootstrapping the performance of the two agents in transfer learning. We also show that our method leads to improvements in dialogue system performance on complete datasets.

deep learning, dialogue, neural network, (19 more...)

arXiv.org Artificial Intelligence

2107.11904

Country:

Europe (1.00)
North America > United States (0.68)

Genre: Research Report (0.64)

Industry: Consumer Products & Services > Restaurants (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Add feedback

CREAD: Combined Resolution of Ellipses and Anaphora in Dialogues

Tseng, Bo-Hsiang, Bhargava, Shruti, Lu, Jiarui, Moniz, Joel Ruben Antony, Piraviperumal, Dhivya, Li, Lin, Yu, Hong

arXiv.org Artificial IntelligenceMay-20-2021

Anaphora and ellipses are two common phenomena in dialogues. Without resolving referring expressions and information omission, dialogue systems may fail to generate consistent and coherent responses. Traditionally, anaphora is resolved by coreference resolution and ellipses by query rewrite. In this work, we propose a novel joint learning framework of modeling coreference resolution and query rewriting for complex, multi-turn dialogue understanding. Given an ongoing dialogue between a user and a dialogue assistant, for the user query, our joint learning model first predicts coreference links between the query and the dialogue context, and then generates a self-contained rewritten user query. To evaluate our model, we annotate a dialogue based coreference resolution dataset, MuDoCo, with rewritten queries. Results show that the performance of query rewrite can be substantially boosted (+2.3% F1) with the aid of coreference modeling. Furthermore, our joint model outperforms the state-of-the-art coreference resolution model (+2% F1) on this dataset.

artificial intelligence, coreference resolution, natural language, (16 more...)

arXiv.org Artificial Intelligence

2105.09914

Country:

Europe > Italy (0.14)
Europe > France (0.14)
Asia > Japan (0.14)
Asia > China (0.14)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

A Generative Model for Joint Natural Language Understanding and Generation

Tseng, Bo-Hsiang, Cheng, Jianpeng, Fang, Yimai, Vandyke, David

arXiv.org Artificial IntelligenceJun-12-2020

Natural language understanding (NLU) and natural language generation (NLG) are two fundamental and related tasks in building task-oriented dialogue systems with opposite objectives: NLU tackles the transformation from natural language to formal representations, whereas NLG does the reverse. A key to success in either task is parallel training data which is expensive to obtain at a large scale. In this work, we propose a generative model which couples NLU and NLG through a shared latent variable. This approach allows us to explore both spaces of natural language and formal representations, and facilitates information sharing through the latent space to eventually benefit NLU and NLG. Our model achieves state-of-the-art performance on two dialogue datasets with both flat and tree-structured formal representations. We also show that the model can be trained in a semi-supervised fashion by utilising unlabelled data to boost its performance.

deep learning, neural network, representation, (21 more...)

arXiv.org Artificial Intelligence

2006.07499

Country:

Europe (0.93)
North America > United States (0.28)

Genre: Research Report (0.82)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Variational Cross-domain Natural Language Generation for Spoken Dialogue Systems

Tseng, Bo-Hsiang, Kreyssig, Florian, Budzianowski, Pawel, Casanueva, Inigo, Wu, Yen-Chen, Ultes, Stefan, Gasic, Milica

arXiv.org Artificial IntelligenceDec-20-2018

Cross-domain natural language generation (NLG) is still a difficult task within spoken dialogue modelling. Given a semantic representation provided by the dialogue manager, the language generator should generate sentences that convey desired information. Traditional template-based generators can produce sentences with all necessary information, but these sentences are not sufficiently diverse. With RNN-based models, the diversity of the generated sentences can be high, however, in the process some information is lost. In this work, we improve an RNN-based generator by considering latent information at the sentence level during generation using the conditional variational autoencoder architecture. We demonstrate that our model outperforms the original RNN-based generator, while yielding highly diverse sentences. In addition, our model performs better when the training data is limited.

deep learning, information, neural network, (20 more...)

arXiv.org Artificial Intelligence

1812.08879

Country:

North America > Canada (0.14)
Europe (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Nearly Zero-Shot Learning for Semantic Decoding in Spoken Dialogue Systems

Rojas-Barahona, Lina M., Ultes, Stefan, Budzianowski, Pawel, Casanueva, Iñigo, Gasic, Milica, Tseng, Bo-Hsiang, Young, Steve

arXiv.org Artificial IntelligenceJun-21-2018

This paper presents two ways of dealing with scarce data in semantic decoding using N-Best speech recognition hypotheses. First, we learn features by using a deep learning architecture in which the weights for the unknown and known categories are jointly optimised. Second, an unsupervised method is used for further tuning the weights. Sharing weights injects prior knowledge to unknown categories. The unsupervised tuning (i.e. the risk minimisation) improves the F-Measure when recognising nearly zero-shot data on the DSTC3 corpus. This unsupervised method can be applied subject to two assumptions: the rank of the class marginal is assumed to be known and the class-conditional scores of the classifier are assumed to follow a Gaussian distribution.

deep learning, neural network, risk minimisation, (18 more...)

arXiv.org Artificial Intelligence

1806.05484

Country:

North America > United States (0.47)
Europe (0.29)
Asia (0.28)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback