AITopics | Zhao, Zhongzhou


Zhao, Zhongzhou


VK-G2T: Vision and Context Knowledge enhanced Gloss2Text

Jing, Liqiang, Song, Xuemeng, Zu, Xinxing, Zheng, Na, Zhao, Zhongzhou, Nie, Liqiang

arXiv.org Artificial Intelligence, Dec-15-2023

Existing sign language translation methods follow a two-stage pipeline: first converting the sign language video to a gloss sequence (i.e., Sign2Gloss) and then translating the generated gloss sequence into a spoken language sentence (i.e., Gloss2Text). While previous studies have focused on boosting the performance of the Sign2Gloss stage, we emphasize the optimization of the Gloss2Text stage. However, this task is non-trivial due to two distinct features of Gloss2Text: (1) isolated gloss input and (2) low-capacity gloss vocabulary. To address these issues, we propose a vision and context knowledge enhanced Gloss2Text model, named VK-G2T, which leverages the visual content of the sign language video to learn the properties of the target sentence and exploits context knowledge to facilitate the adaptive translation of gloss words. Extensive experiments conducted on a Chinese benchmark validate the superiority of our model.

machine learning, natural language, translation, (18 more...)

arXiv.org Artificial Intelligence

2312.1021

Country: Asia > China (0.28)

Genre: Research Report (0.40)

Industry: Education (0.81)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)


CASEIN: Cascading Explicit and Implicit Control for Fine-grained Emotion Intensity Regulation

Cui, Yuhao, Wang, Xiongwei, Zhao, Zhongzhou, Zhou, Wei, Chen, Haiqing

arXiv.org Artificial Intelligence, Jun-27-2023

Existing fine-grained intensity regulation methods rely on explicit control through predicted emotion probabilities. However, these high-level semantic probabilities are often inaccurate and not smooth at the phoneme level, leading to bias in learning. The problem is especially pronounced when we attempt to mix multiple emotion intensities for specific phonemes, which markedly reduces the controllability and naturalness of the synthesis. To address this issue, we propose the CAScaded Explicit and Implicit coNtrol framework (CASEIN), which leverages accurate disentanglement of emotion manifolds from the reference speech to learn the implicit representation at a lower semantic level. This representation bridges the semantic gap between explicit probabilities and the synthesis model, reducing bias in learning. In experiments, CASEIN surpasses existing methods in both controllability and naturalness. Notably, we are the first to achieve fine-grained control over the mixed intensity of multiple emotions.

artificial intelligence, emotion, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2307.0002

Country:

Europe (1.00)
North America > United States (0.28)
North America > Canada (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)


Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain

Jing, Liqiang, Song, Xuemeng, Lin, Xuming, Zhao, Zhongzhou, Zhou, Wei, Nie, Liqiang

arXiv.org Artificial Intelligence, May-4-2023

Existing data-to-text generation efforts mainly focus on generating a coherent text from non-linguistic input data, such as tables and attribute-value pairs, but overlook that different application scenarios may require texts of different styles. Inspired by this, we define a new task, namely stylized data-to-text generation, whose aim is to generate coherent text for the given non-linguistic data according to a specific style. This task is non-trivial, due to three challenges: the logic of the generated text, unstructured style reference, and biased training samples. To address these challenges, we propose a novel stylized data-to-text generation model, named StyleD2T, comprising three components: logic planning-enhanced data embedding, mask-based style embedding, and unbiased stylized text generation. In the first component, we introduce a graph-guided logic planner for attribute organization to ensure the logic of generated text. In the second component, we devise feature-level mask-based style embedding to extract the essential style signal from the given unstructured style reference. In the last one, pseudo triplet augmentation is utilized to achieve unbiased text generation, and a multi-condition based confidence assignment function is designed to ensure the quality of pseudo samples. Extensive experiments on a newly collected dataset from Taobao have been conducted, and the results show the superiority of our model over existing methods.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2305.03256

Country:

Asia > China > Zhejiang Province (0.14)
Asia > China > Shandong Province (0.14)
Asia > China > Guangdong Province (0.14)

Genre: Research Report > New Finding (0.66)

Industry: Information Technology > Services > e-Commerce Services (0.51)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)


Towards Zero-Shot Personalized Table-to-Text Generation with Contrastive Persona Distillation

Zhan, Haolan, Lin, Xuming, Cui, Shaobo, Zhao, Zhongzhou, Zhou, Wei, Chen, Haiqing

arXiv.org Artificial Intelligence, Apr-18-2023

Existing neural methods have shown great potential for generating informative text from structured tabular data while maintaining high content fidelity. However, few of them address the generation of personalized expressions, which often requires well-aligned persona-table-text datasets that are difficult to obtain. To overcome these obstacles, we explore personalized table-to-text generation under a zero-shot setting, in which no well-aligned persona-table-text triples are required during training. To this end, we first collect a set of unpaired persona information and then propose a semi-supervised approach with contrastive persona distillation (S2P-CPD) to generate personalized context. Specifically, tabular data and persona information are first represented separately as latent variables. Then, we devise a latent space fusion technique to distill persona information into the table representation. In addition, a contrastive-based discriminator is employed to guarantee style consistency between the generated context and its corresponding persona. Experimental results on two benchmarks demonstrate S2P-CPD's ability to maintain both content fidelity and personalized expressions.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2304.08911

Country:

Asia > China (0.14)
North America > United States > Texas (0.14)
Europe > Spain (0.14)

Genre: Research Report (0.64)

Industry: Media (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)


Digital Human Interactive Recommendation Decision-Making Based on Reinforcement Learning

Junwu, Xiong, Feng, Xiaoyun, Shi, YunZhou, Zhang, James, Zhao, Zhongzhou, Zhou, Wei

arXiv.org Artificial Intelligence, Nov-3-2022

Digital human recommendation systems have been developed to help customers find their favorite products and are playing an active role in various recommendation contexts. Capturing and learning the dynamics of customers' preferences in a timely manner, while meeting their exact requirements, is crucial in the digital human recommendation domain. We design a novel, practical digital human interactive recommendation agent framework based on Reinforcement Learning (RL) to improve the efficiency of interactive recommendation decision-making by leveraging both the digital human features and the superior flexibility of RL. Our proposed framework learns dynamically from real-time interactions between the digital human and customers using state-of-the-art RL algorithms, combined with multi-modal embedding and graph embedding, to improve the accuracy of personalization and thus enable the digital human agent to catch the customer's attention in a timely manner. Experiments on real business data demonstrate that our framework can provide better personalized customer engagement and better customer experiences.

customer, machine learning, reinforcement learning, (14 more...)

arXiv.org Artificial Intelligence

2210.10638

Genre: Research Report (0.64)

Industry: Information Technology > Services (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)


AliMe MKG: A Multi-modal Knowledge Graph for Live-streaming E-commerce

Xu, Guohai, Chen, Hehong, Li, Feng-Lin, Sun, Fu, Shi, Yunzhou, Zeng, Zhixiong, Zhou, Wei, Zhao, Zhongzhou, Zhang, Ji

arXiv.org Artificial Intelligence, Sep-13-2021

Live streaming is becoming an increasingly popular form of sales in e-commerce. The core of live-streaming sales is to encourage customers to purchase in an online broadcasting room. To enable customers to better understand a product without leaving the room, we propose AliMe MKG, a multi-modal knowledge graph that aims to provide a cognitive profile for products, through which customers can seek information about and understand a product. Based on the MKG, we build an online live assistant that highlights product search, product exhibition, and question answering, allowing customers to skim the item list, view item details, and ask item-related questions. Our system has been launched online in the Taobao app, and currently serves hundreds of thousands of customers per day.

artificial intelligence, customer, information technology services, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3459637.3481983

2109.07411

Country:

North America > United States (0.14)
Africa > Ethiopia (0.14)

Genre: Research Report (0.41)

Industry: Information Technology > Services > e-Commerce Services (0.73)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.64)


Memory-augmented Dialogue Management for Task-oriented Dialogue Systems

Zhang, Zheng, Huang, Minlie, Zhao, Zhongzhou, Ji, Feng, Chen, Haiqing, Zhu, Xiaoyan

arXiv.org Artificial Intelligence, Apr-30-2018

Dialogue management (DM) decides the next action of a dialogue system according to the current dialogue state, and thus plays a central role in task-oriented dialogue systems. Since dialogue management requires access not only to local utterances but also to the global semantics of the entire dialogue session, modeling long-range history information is a critical issue. To this end, we propose a novel Memory-Augmented Dialogue management model (MAD), which employs a memory controller and two additional memory structures, i.e., a slot-value memory and an external memory. The slot-value memory tracks the dialogue state by memorizing and updating the values of semantic slots (for instance, cuisine, price, and location), and the external memory augments the representation of hidden states of traditional recurrent neural networks by storing more context information. To update the dialogue state efficiently, we also propose slot-level attention on user utterances to extract specific semantic information for each slot. Experiments show that our model achieves state-of-the-art performance and outperforms existing baselines.
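The idea of a slot-value memory updated through slot-level attention can be illustrated with a small toy sketch. This is not the MAD model from the paper, which uses learned neural components; the function names (`slot_attention`, `update_slot_memory`), the hand-set vectors, and the `threshold` parameter are all assumptions made here for illustration only.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def slot_attention(slot_query, token_vectors):
    """Dot-product attention of one slot query over utterance token vectors.

    Returns the attention weights and the attention-weighted summary vector.
    """
    scores = [sum(q * t for q, t in zip(slot_query, vec)) for vec in token_vectors]
    weights = softmax(scores)
    dim = len(slot_query)
    summary = [sum(w * vec[i] for w, vec in zip(weights, token_vectors))
               for i in range(dim)]
    return weights, summary

def update_slot_memory(memory, tokens, token_vectors, slot_queries, threshold=0.5):
    """Return a copy of the memory in which each slot is overwritten by its
    most-attended token, but only when that token's weight reaches the
    (hypothetical) threshold; otherwise the old value is kept."""
    updated = dict(memory)
    for slot, query in slot_queries.items():
        weights, _ = slot_attention(query, token_vectors)
        best = max(range(len(tokens)), key=lambda i: weights[i])
        if weights[best] >= threshold:
            updated[slot] = tokens[best]
    return updated

# Toy utterance with hand-set (not learned) token vectors and slot queries.
tokens = ["cheap", "italian", "downtown"]
token_vectors = [[4.0, 0.0, 0.0], [0.0, 4.0, 0.0], [0.0, 0.0, 4.0]]
slot_queries = {
    "price": [1.0, 0.0, 0.0],
    "cuisine": [0.0, 1.0, 0.0],
    "location": [0.0, 0.0, 1.0],
}
memory = {"price": None, "cuisine": None, "location": None}
memory = update_slot_memory(memory, tokens, token_vectors, slot_queries)
print(memory)
```

In this sketch each slot attends over the same utterance with its own query, so a single user turn can fill several slots at once, while slots whose attention is too diffuse keep their previous value.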

deep learning, dialogue management, neural network, (17 more...)

arXiv.org Artificial Intelligence

1805.0015

Country:

Asia > China (0.28)
Europe > Spain (0.28)
North America > United States > Texas (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
