
Collaborating Authors

 Zhang, Xinyun


Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts

arXiv.org Artificial Intelligence

Although numerous achievements have been made in well-structured text abstractive summarization (Zhang et al., 2020a; Liu* et al., 2018; Lewis et al., 2020), research on meeting summarization is still stretched to its limit. There are some outstanding challenges in this field, including 1) much noise brought by automated speech recognition models; 2) lengthy meeting transcripts consisting of casual conversations, content redundancy, and scattered information. Based on this understanding, we propose a two-step meeting summarization framework, Reconstruct before Summarize (RbS), to address the challenge of scattered information in meetings. RbS adopts a reconstructor to reconstruct the responses in the meeting; it also synchronously traces out which texts in the meeting drove the responses and marks them as essential content. Therefore, salient information is captured and annotated as anchor tokens in RbS.
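The abstract describes the tracing step only at a high level. As a rough, non-authoritative illustration of the idea, the sketch below scores which context tokens most influenced a reconstructed response by averaging a generic encoder-decoder model's cross-attention, and marks the top-scoring tokens as anchors. The model choice, the attention averaging, and the trace_anchors helper are assumptions made here for illustration, not the paper's implementation.

```python
# Hypothetical sketch: trace which context tokens most influence a reconstructed
# response via the decoder's cross-attention, and mark them as "anchor" tokens.
# Model, aggregation scheme, and top-k cutoff are illustrative assumptions.
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

MODEL_NAME = "facebook/bart-base"  # assumption: any encoder-decoder PLM could stand in here
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_NAME)

def trace_anchors(context: str, response: str, top_k: int = 5):
    """Return the top-k context tokens the decoder attends to most
    while reconstructing `response` from `context`."""
    enc = tokenizer(context, return_tensors="pt", truncation=True)
    dec = tokenizer(response, return_tensors="pt", truncation=True)
    with torch.no_grad():
        out = model(
            input_ids=enc.input_ids,
            attention_mask=enc.attention_mask,
            labels=dec.input_ids,
            output_attentions=True,
        )
    # cross_attentions: one (batch, heads, tgt_len, src_len) tensor per decoder layer
    cross = torch.stack(out.cross_attentions)       # (layers, batch, heads, tgt, src)
    scores = cross.mean(dim=(0, 2, 3)).squeeze(0)   # average over layers/heads/targets -> (src_len,)
    top = scores.topk(min(top_k, scores.numel())).indices.tolist()
    return [tokenizer.convert_ids_to_tokens(enc.input_ids[0, i].item()) for i in top]

# Example: mark salient tokens in a short exchange
print(trace_anchors("Alice: the deadline moved to Friday because QA found a bug.",
                    "Okay, I will update the release plan for Friday."))
```

In the paper's framework such anchors would then guide the second-step summarizer; here they are simply printed.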


Towards Versatile and Efficient Visual Knowledge Integration into Pre-trained Language Models with Cross-Modal Adapters

arXiv.org Artificial Intelligence

Humans learn language via multi-modal knowledge. However, due to the text-only pre-training scheme, most existing pre-trained language models (PLMs) cannot benefit from multi-modal information. To inject visual knowledge into PLMs, existing methods incorporate either the text or image encoder of vision-language models (VLMs) to encode the visual information and update all the original parameters of PLMs for knowledge fusion. In this paper, we propose a new plug-and-play module, X-adapter, to flexibly leverage the aligned visual and textual knowledge learned in pre-trained VLMs and efficiently inject it into PLMs. Specifically, we insert X-adapters into PLMs, and only the added parameters are updated during adaptation. To fully exploit the potential of VLMs, X-adapters consist of two sub-modules, V-expert and T-expert, which fuse VLMs' image and text representations, respectively. Different sub-modules can be activated depending on the downstream task. Experimental results show that our method significantly improves performance on object-color reasoning and natural language understanding (NLU) tasks compared with PLM baselines.
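To make the adapter idea concrete, the sketch below shows one way a bottleneck module with a V-expert and a T-expert could fuse frozen VLM features into a frozen PLM's hidden states, with only the adapter parameters left trainable. The dimensions, the cross-attention fusion, and the CrossModalAdapter class are assumptions for illustration; they are in the spirit of the abstract, not the paper's exact design.

```python
# Hypothetical sketch of an X-adapter-style module: a bottleneck inserted into a
# frozen PLM layer, with two "experts" that cross-attend to externally computed
# VLM image (V-expert) and text (T-expert) representations.
import torch
import torch.nn as nn

class CrossModalAdapter(nn.Module):
    def __init__(self, hidden_size=768, vlm_dim=512, bottleneck=64, n_heads=8):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)   # project PLM states down
        self.up = nn.Linear(bottleneck, hidden_size)     # project back up
        # V-expert and T-expert: PLM states attend to VLM image/text features
        self.v_expert = nn.MultiheadAttention(bottleneck, n_heads, kdim=vlm_dim,
                                              vdim=vlm_dim, batch_first=True)
        self.t_expert = nn.MultiheadAttention(bottleneck, n_heads, kdim=vlm_dim,
                                              vdim=vlm_dim, batch_first=True)
        self.norm = nn.LayerNorm(hidden_size)

    def forward(self, plm_hidden, image_feats=None, text_feats=None):
        h = torch.relu(self.down(plm_hidden))
        # Activate only the expert(s) needed for the downstream task
        if image_feats is not None:
            h = h + self.v_expert(h, image_feats, image_feats)[0]
        if text_feats is not None:
            h = h + self.t_expert(h, text_feats, text_feats)[0]
        return self.norm(plm_hidden + self.up(h))        # residual back into the PLM

# Usage: only adapter parameters would be trained; PLM and VLM stay frozen.
adapter = CrossModalAdapter()
plm_hidden = torch.randn(2, 16, 768)   # (batch, seq, hidden) from a frozen PLM layer
image_feats = torch.randn(2, 50, 512)  # e.g. patch features from a frozen VLM image encoder
print(adapter(plm_hidden, image_feats=image_feats).shape)  # torch.Size([2, 16, 768])
```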


ChatEDA: A Large Language Model Powered Autonomous Agent for EDA

arXiv.org Artificial Intelligence

The integration of a complex set of Electronic Design Automation (EDA) tools to enhance interoperability is a critical concern for circuit designers. Recent advancements in large language models (LLMs) have showcased their exceptional capabilities in natural language processing and comprehension, offering a novel approach to interfacing with EDA tools. This research paper introduces ChatEDA, an autonomous agent for EDA empowered by a large language model, AutoMage, complemented by EDA tools serving as executors. ChatEDA streamlines the design flow from the Register-Transfer Level (RTL) to the Graphic Data System Version II (GDSII) by effectively managing task planning, script generation, and task execution. Through comprehensive experimental evaluations, ChatEDA has demonstrated its proficiency in handling diverse requirements, and our fine-tuned AutoMage model has exhibited superior performance compared to GPT-4 and other similar LLMs.
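As a rough illustration of the plan/generate/execute loop described above, the sketch below wires an LLM call into task planning, script generation, and execution. The call_llm placeholder, the tool names, and the file paths are assumptions for illustration only; they are not AutoMage's interface or a real RTL-to-GDSII flow.

```python
# Hypothetical sketch of a ChatEDA-style agent loop: an LLM decomposes a natural-
# language request into ordered EDA tasks, generates a script per task, and an
# executor runs each script. All names here are placeholders.
import json
import subprocess

def call_llm(prompt: str) -> str:
    """Placeholder for a call to a fine-tuned LLM (e.g. an OpenAI-compatible endpoint)."""
    raise NotImplementedError("wire this to your model server")

def plan_tasks(request: str) -> list[dict]:
    prompt = (
        "Decompose the following EDA request into an ordered JSON list of tasks "
        "with fields 'tool' and 'goal'. Tools: synthesis, placement, routing, signoff.\n"
        f"Request: {request}"
    )
    return json.loads(call_llm(prompt))          # task planning

def generate_script(task: dict) -> str:
    prompt = f"Write a shell-invocable script for tool '{task['tool']}' to: {task['goal']}"
    return call_llm(prompt)                      # script generation

def run_agent(request: str) -> None:
    for task in plan_tasks(request):
        script = generate_script(task)
        path = f"/tmp/{task['tool']}.sh"
        with open(path, "w") as f:
            f.write(script)
        subprocess.run(["bash", path], check=True)  # task execution

# run_agent("Take my RTL through synthesis and place-and-route, then report timing.")
```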