AITopics | Zhou, Ganbin

Collaborating Authors

Zhou, Ganbin

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Extracting Variable-Depth Logical Document Hierarchy from Long Documents: Method, Evaluation, and Application

Cao, Rongyu, Cao, Yixuan, Zhou, Ganbin, Luo, Ping

arXiv.org Artificial IntelligenceMay-14-2021

In this paper, we study the problem of extracting variable-depth "logical document hierarchy" from long documents, namely organizing the recognized "physical document objects" into hierarchical structures. The discovery of logical document hierarchy is the vital step to support many downstream applications. However, long documents, containing hundreds or even thousands of pages and variable-depth hierarchy, challenge the existing methods. To address these challenges, we develop a framework, namely Hierarchy Extraction from Long Document (HELD), where we "sequentially" insert each physical object at the proper on of the current tree. Determining whether each possible position is proper or not can be formulated as a binary classification problem. To further improve its effectiveness and efficiency, we study the design variants in HELD, including traversal orders of the insertion positions, heading extraction explicitly or implicitly, tolerance to insertion errors in predecessor steps, and so on. The empirical experiments based on thousands of long documents from Chinese, English financial market and English scientific publication show that the HELD model with the "root-to-leaf" traversal order and explicit heading extraction is the best choice to achieve the tradeoff between effectiveness and efficiency with the accuracy of 0.9726, 0.7291 and 0.9578 in Chinese financial, English financial and arXiv datasets, respectively. Finally, we show that logical document hierarchy can be employed to significantly improve the performance of the downstream passage retrieval task. In summary, we conduct a systematic study on this task in terms of methods, evaluations, and applications.

dataset, deep learning, neural network, (22 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s11390-021-1076-7

2105.09297

Country: Asia > China (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Banking & Finance (1.00)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents

Zhou, Ganbin, Cao, Rongyu, Ao, Xiang, Luo, Ping, Lin, Fen, Lin, Leyu, He, Qing

arXiv.org Artificial IntelligenceAug-22-2018

In this study, we focus on extracting knowledgeable snippets and annotating knowledgeable documents from Web corpus, consisting of the documents from social media and We-media. Informally, knowledgeable snippets refer to the text describing concepts, properties of entities, or relations among entities, while knowledgeable documents are the ones with enough knowledgeable snippets. These knowledgeable snippets and documents could be helpful in multiple applications, such as knowledge base construction and knowledge-oriented service. Previous studies extracted the knowledgeable snippets using the pattern-based method. Here, we propose the semantic-based method for this task. Specifically, a CNN based model is developed to extract knowledgeable snippets and annotate knowledgeable documents simultaneously. Additionally, a "low-level sharing, high-level splitting" structure of CNN is designed to handle the documents from different content domains. Compared with building multiple domain-specific CNNs, this joint model not only critically saves the training time, but also improves the prediction accuracy visibly. The superiority of the proposed method is demonstrated in a real dataset from Wechat public platform.

deep learning, knowledgeable document, neural network, (17 more...)

arXiv.org Artificial Intelligence

1808.07228

Country: Asia > China (0.29)

Genre: Research Report > New Finding (0.48)

Industry:

Transportation > Ground > Road (0.48)
Information Technology (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Elastic Responding Machine for Dialog Generation with Dynamically Mechanism Selecting

Zhou, Ganbin (Institute of Computing Technology, Chinese Academy of Sciences) | Luo, Ping (Institute of Computing Technology, Chinese Academy of Sciences) | Xiao, Yijun (University of California Santa Barbara) | Lin, Fen (WeChat, Tencent) | Chen, Bo (WeChat, Tencent) | He, Qing (Institute of Computing Technology, Chinese Academy of Sciences)

AAAI ConferencesFeb-8-2018

Neural models aiming at generating meaningful and diverse response is attracting increasing attention over recent years. For a given post, the conventional encoder-decoder models tend to learn high-frequency but trivial responses, or are difficult to determine which speaking styles are suitable to generate responses. To address this issue, we propose the elastic responding machine (ERM), which is based on a proposed encoder-diverter-filter-decoder framework. ERM models the multiple responding mechanisms to not only generate acceptable responses for a given post but also improve the diversity of responses. Here, the mechanisms could be regraded as some latent variables, and for a given post different responses may be generated by different mechanisms. The experiments demonstrate the quality and diversity of the generated responses, intuitively show how the learned model controls response mechanism when responding, and reveal some underlying relationship between mechanism and language style.

deep learning, mechanism, neural network, (22 more...)

AAAI Conferences

Thirty-Second AAAI Conference on Artificial Intelligence

Country: North America > United States > California > Santa Barbara County > Santa Barbara (0.14)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Tree-Structured Neural Machine for Linguistics-Aware Sentence Generation

Zhou, Ganbin (Institute of Computing Technology, Chinese Academy of Sciences) | Luo, Ping (Institute of Computing Technology, Chinese Academy of Sciences) | Cao, Rongyu (Institute of Computing Technology, Chinese Academy of Sciences) | Xiao, Yijun (Department of Computer Science, University of California Santa Barbara) | Lin, Fen (WeChat Search Application Department, Tencent) | Chen, Bo (WeChat Search Application Department, Tencent) | He, Qing (Institute of Computing Technology, Chinese Academy of Sciences)

AAAI ConferencesFeb-8-2018

Different from other sequential data, sentences in natural language are structured by linguistic grammars. Previous generative conversational models with chain-structured decoder ignore this structure in human language and might generate plausible responses with less satisfactory relevance and fluency. In this study, we aim to incorporate the results from linguistic analysis into the process of sentence generation for high-quality conversation generation. Specifically, we use a dependency parser to transform each response sentence into a dependency tree and construct a training corpus of sentence-tree pairs. A tree-structured decoder is developed to learn the mapping from a sentence to its tree, where different types of hidden states are used to depict the local dependencies from an internal tree node to its children. For training acceleration, we propose a tree canonicalization method, which transforms trees into equivalent ternary trees. Then, with a proposed tree-structured search method, the model is able to generate the most probable responses in the form of dependency trees, which are finally flattened into sequences as the system output. Experimental results demonstrate that the proposed X2Tree framework outperforms baseline methods over 11.15% increase of acceptance ratio.

deep learning, neural network, node, (19 more...)

AAAI Conferences

Thirty-Second AAAI Conference on Artificial Intelligence

Country: North America > United States > California > Santa Barbara County > Santa Barbara (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.67)

Add feedback

Tree-Structured Neural Machine for Linguistics-Aware Sentence Generation

Zhou, Ganbin, Luo, Ping, Cao, Rongyu, Xiao, Yijun, Lin, Fen, Chen, Bo, He, Qing

arXiv.org Artificial IntelligenceJan-3-2018

deep learning, neural network, node, (20 more...)

arXiv.org Artificial Intelligence

1705.00321

Country: North America > United States > California > Santa Barbara County > Santa Barbara (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.65)

Add feedback

Mechanism-Aware Neural Machine for Dialogue Response Generation

Zhou, Ganbin (Institute of Computing Technology, Chinese Academy of Sciences) | Luo, Ping (Institute of Computing Technology, Chinese Academy of Sciences) | Cao, Rongyu (Institute of Computing Technology, Chinese Academy of Sciences) | Lin, Fen (Tencent) | Chen, Bo (Tencent) | He, Qing (Institute of Computing Technology, Chinese Academy of Sciences)

AAAI ConferencesFeb-14-2017

To the same utterance, people's responses in everyday dialogue may be diverse largely in terms of content semantics, speaking styles, communication intentions and so on. Previous generative conversational models ignore these 1-to-n relationships between a post to its diverse responses, and tend to return high-frequency but meaningless responses. In this study we propose a mechanism-aware neural machine for dialogue response generation. It assumes that there exists some latent responding mechanisms, each of which can generate different responses for a single input post. With this assumption we model different responding mechanisms as latent embeddings, and develop a encoder-diverter-decoder framework to train its modules in an end-to-end fashion. With the learned latent mechanisms, for the first time these decomposed modules can be used to encode the input into mechanism-aware context, and decode the responses with the controlled generation styles and topics. Finally, the experiments with human judgements, intuitive examples, detailed discussions demonstrate the quality and diversity of the generated responses with 9.80% increase of acceptable ratio over the best of six baseline methods.

deep learning, mechanism, neural network, (22 more...)

AAAI Conferences

Thirty-First AAAI Conference on Artificial Intelligence

Country: Asia > China (0.15)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback