AITopics | Li, Zhoubo

Collaborating Authors

Li, Zhoubo

Editing Large Language Models: Problems, Methods, and Opportunities

Yao, Yunzhi, Wang, Peng, Tian, Bozhong, Cheng, Siyuan, Li, Zhoubo, Deng, Shumin, Chen, Huajun, Zhang, Ningyu

arXiv.org Artificial Intelligence, Nov-30-2023

Despite the ability to train capable LLMs, the methodology for maintaining their relevancy and rectifying errors remains elusive. To this end, the past few years have witnessed a surge in techniques for editing LLMs, the objective of which is to efficiently alter the behavior of LLMs within a specific domain without negatively impacting performance across other inputs. This paper embarks on a deep exploration of the problems, methods, and opportunities related to model editing for LLMs. In particular, we provide an exhaustive overview of the task definition and challenges associated with model editing, along with an in-depth empirical analysis of the most progressive methods currently at our disposal. We also build a new benchmark dataset to facilitate a more robust evaluation and pinpoint enduring issues intrinsic to existing techniques. Our objective is to provide valuable insights into the effectiveness and feasibility of each editing technique, thereby assisting the community in making informed decisions on the selection of the most appropriate method for a specific task or context. Code and datasets are available at https://github.com/zjunlp/EasyEdit.

large language model, machine learning, natural language, (19 more...)

arXiv: 2305.13172

Country:

Asia (0.68)
Europe (0.67)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (0.68)
Information Technology (0.68)
Leisure & Entertainment > Sports (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
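
The abstract of this entry describes the goal of model editing as changing behavior on a targeted set of inputs without harming behavior on other inputs. A minimal sketch of how those two properties could be scored is shown here. The function names `edit_success` and `locality`, the `Model` type, and the toy lookup-table models are all hypothetical and chosen for illustration; they are not taken from the paper or from the EasyEdit code.

```python
from typing import Callable, Sequence

# Hypothetical stand-in for an LLM: maps a prompt to an answer string.
Model = Callable[[str], str]


def edit_success(post: Model, prompts: Sequence[str], target: str) -> float:
    """Fraction of in-scope prompts for which the edited model returns the target."""
    return sum(post(p) == target for p in prompts) / len(prompts)


def locality(pre: Model, post: Model, prompts: Sequence[str]) -> float:
    """Fraction of out-of-scope prompts whose answer is unchanged by the edit."""
    return sum(pre(p) == post(p) for p in prompts) / len(prompts)


# Toy lookup tables standing in for a model before and after one edit (assumed data).
pre_answers = {"capital of X?": "A", "X's capital is?": "A", "capital of Y?": "C"}
post_answers = {"capital of X?": "B", "X's capital is?": "A", "capital of Y?": "C"}


def pre_model(prompt: str) -> str:
    return pre_answers[prompt]


def post_model(prompt: str) -> str:
    return post_answers[prompt]


in_scope = ["capital of X?", "X's capital is?"]  # prompts the edit should change
out_of_scope = ["capital of Y?"]                 # prompts the edit should leave alone

success_rate = edit_success(post_model, in_scope, "B")
locality_rate = locality(pre_model, post_model, out_of_scope)
```

In this toy setup the edit reaches only one of the two in-scope phrasings, so the success rate is lower than the locality score; a real evaluation would replace the lookup tables with model calls.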

Unveiling the Pitfalls of Knowledge Editing for Large Language Models

Li, Zhoubo, Zhang, Ningyu, Yao, Yunzhi, Wang, Mengru, Chen, Xi, Chen, Huajun

arXiv.org Artificial Intelligence, Nov-21-2023

As the cost associated with fine-tuning Large Language Models (LLMs) continues to rise, recent research efforts have pivoted towards developing methodologies to edit implicit knowledge embedded within LLMs. Yet a dark cloud still lingers overhead: will knowledge editing trigger a butterfly effect? It remains unclear whether knowledge editing introduces side effects that pose potential risks. This paper pioneers the investigation into the potential pitfalls associated with knowledge editing for LLMs. To achieve this, we introduce new benchmark datasets and propose innovative evaluation metrics. Our results underline two pivotal concerns: (1) Knowledge Conflict: Editing groups of facts that logically clash can magnify the inherent inconsistencies in LLMs, a facet neglected by previous methods. (2) Knowledge Distortion: Altering parameters with the aim of editing factual knowledge can irrevocably warp the innate knowledge structure of LLMs. Experimental results vividly demonstrate that knowledge editing might inadvertently cast a shadow of unintended consequences on LLMs, which warrants attention and effort in future work. Code is available at https://github.com/zjunlp/PitfallsKnowledgeEditing.

large language model, machine learning, natural language, (15 more...)

arXiv: 2310.02129

Country:

Asia > Middle East > UAE (0.14)
North America > United States > Maryland (0.14)

Genre: Research Report > New Finding (0.66)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)

DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Zhang, Ningyu, Xu, Xin, Tao, Liankuan, Yu, Haiyang, Ye, Hongbin, Qiao, Shuofei, Xie, Xin, Chen, Xiang, Li, Zhoubo, Li, Lei, Liang, Xiaozhuan, Yao, Yunzhi, Deng, Shumin, Wang, Peng, Zhang, Wen, Zhang, Zhenru, Tan, Chuanqi, Chen, Qiang, Xiong, Feiyu, Huang, Fei, Zheng, Guozhou, Chen, Huajun

arXiv.org Artificial Intelligence, Sep-18-2023

We present DeepKE, an open-source and extensible knowledge extraction toolkit that supports complicated low-resource, document-level and multimodal scenarios in knowledge base population. DeepKE implements various information extraction tasks, including named entity recognition, relation extraction and attribute extraction. With a unified framework, DeepKE allows developers and researchers to customize datasets and models to extract information from unstructured data according to their requirements. Specifically, DeepKE not only provides various functional modules and model implementations for different tasks and scenarios but also organizes all components in consistent frameworks to maintain sufficient modularity and extensibility. We release the source code on GitHub at https://github.com/zjunlp/DeepKE with Google Colab tutorials and comprehensive documentation for beginners. In addition, we present an online system at http://deepke.openkg.cn/EN/re_doc_show.html for real-time extraction across various tasks, and a demo video.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv: 2201.03335

Country:

Europe (0.93)
Asia (0.93)
North America > United States > Maryland > Carroll County > Eldersburg (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph Embeddings

Xie, Xin, Li, Zhoubo, Wang, Xiaohan, Xi, Zekun, Zhang, Ningyu

arXiv.org Artificial Intelligence, Sep-14-2023

Knowledge Graphs (KGs) often have two characteristics: heterogeneous graph structure and text-rich entity/relation information. Text-based KG embeddings can represent entities by encoding descriptions with pre-trained language models (PLMs), but at present no open-source library is specifically designed for KGs with PLMs. In this paper, we present LambdaKG, a library for knowledge graph embeddings (KGE) that is equipped with many pre-trained language models (e.g., BERT, BART, T5, GPT-3) and supports various tasks (e.g., knowledge graph completion, question answering, recommendation, and knowledge probing). LambdaKG is publicly open-sourced at https://github.com/zjunlp/PromptKG/tree/main/lambdaKG, with a demo video at http://deepke.zjukg.cn/lambdakg.mp4 and long-term maintenance.

computational linguistic, large language model, machine learning, (18 more...)

arXiv: 2210.00305

Country:

Europe (1.00)
Asia (0.68)
North America > United States > Louisiana (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.84)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.71)

Construction and Applications of Billion-Scale Pre-Trained Multimodal Business Knowledge Graph

Deng, Shumin, Wang, Chengming, Li, Zhoubo, Zhang, Ningyu, Dai, Zelin, Chen, Hehong, Xiong, Feiyu, Yan, Ming, Chen, Qiang, Chen, Mosha, Chen, Jiaoyan, Pan, Jeff Z., Hooi, Bryan, Chen, Huajun

arXiv.org Artificial Intelligence, Mar-19-2023

Business Knowledge Graphs (KGs) are important to many enterprises today, providing factual knowledge and structured data that steer many products and make them more intelligent. Despite their promising benefits, building a business KG necessitates solving prohibitive issues of deficient structure and multiple modalities. In this paper, we advance the understanding of the practical challenges related to building KGs in non-trivial real-world systems. We introduce the process of building an open business knowledge graph (OpenBG) derived from a well-known enterprise, Alibaba Group. Specifically, we define a core ontology to cover various abstract products and consumption demands, with fine-grained taxonomy and multimodal facts in deployed applications. OpenBG is an open business KG of unprecedented scale: 2.6 billion triples with more than 88 million entities covering over 1 million core classes/concepts and 2,681 types of relations. We release all the open resources (OpenBG benchmarks) derived from it for the community and report experimental results of KG-centric tasks. We also run an online competition based on the OpenBG benchmarks, which has attracted thousands of teams. We further pre-train OpenBG and apply it to many KG-enhanced downstream tasks in business scenarios, demonstrating the effectiveness of billion-scale multimodal knowledge for e-commerce. All resources and code have been released at https://github.com/OpenBGBenchmark/OpenBG.

artificial intelligence, natural language, text processing, (17 more...)

arXiv: 2209.15214

Country:

North America > United States (1.00)
Europe (1.00)
Asia > China (0.94)

Genre: Research Report (1.00)

Industry:

Information Technology (0.68)
Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

From Discrimination to Generation: Knowledge Graph Completion with Generative Transformer

Xie, Xin, Zhang, Ningyu, Li, Zhoubo, Deng, Shumin, Chen, Hui, Xiong, Feiyu, Chen, Mosha, Chen, Huajun

arXiv.org Artificial Intelligence, Mar-14-2023

Knowledge graph completion aims to address the problem of extending a KG with missing triples. In this paper, we present an approach, GenKGC, which converts knowledge graph completion into a sequence-to-sequence generation task with a pre-trained language model. We further introduce relation-guided demonstration and entity-aware hierarchical decoding for better representation learning and fast inference. Experimental results on three datasets show that our approach obtains performance better than or comparable to baselines and achieves faster inference than previous methods based on pre-trained language models. We also release a new large-scale Chinese knowledge graph dataset, AliopenKG500, for research purposes. Code and datasets are available at https://github.com/zjunlp/PromptKG/tree/main/GenKGC.

artificial intelligence, natural language, relation, (16 more...)

doi: 10.1145/3487553.3524238

arXiv: 2202.02113

Country: North America > United States > California (0.98)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
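
The abstract of this entry frames knowledge graph completion as sequence-to-sequence generation: the known head entity and relation form the input text, and the missing tail entity is the text to generate. A minimal sketch of that reformulation follows. The `Triple` type, the `to_seq2seq` helper, the separator, and the `?` placeholder are assumptions made for illustration and are not GenKGC's actual input format; relation-guided demonstration and entity-aware hierarchical decoding are not shown.

```python
from typing import NamedTuple, Tuple


class Triple(NamedTuple):
    """A knowledge graph fact: (head entity, relation, tail entity)."""
    head: str
    relation: str
    tail: str


SEP = " | "  # hypothetical separator between fields of the source string


def to_seq2seq(triple: Triple) -> Tuple[str, str]:
    """Build a (source, target) text pair for tail prediction.

    The source holds the head and relation with a placeholder for the
    missing tail; the target is the tail entity's text.
    """
    source = f"{triple.head}{SEP}{triple.relation}{SEP}?"
    return source, triple.tail


example = Triple("Hangzhou", "located in", "China")
source, target = to_seq2seq(example)
```

A sequence-to-sequence model trained on such pairs would read `source` and be asked to generate `target`, rather than scoring every candidate entity in turn.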