AITopics | Yamamoto, Akihiro

Collaborating Authors

Yamamoto, Akihiro

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

BicliqueEncoder: An Efficient Method for Link Prediction in Bipartite Networks using Formal Concept Analysis and Transformer Encoder

Yang, Hongyuan, Peng, Siqi, Yamamoto, Akihiro

arXiv.org Artificial IntelligenceMar-20-2025

We propose a novel and efficient method for link prediction in bipartite networks, using \textit{formal concept analysis} (FCA) and the Transformer encoder. Link prediction in bipartite networks finds practical applications in various domains such as product recommendation in online sales, and prediction of chemical-disease interaction in medical science. Since for link prediction, the topological structure of a network contains valuable information, many approaches focus on extracting structural features and then utilizing them for link prediction. Bi-cliques, as a type of structural feature of bipartite graphs, can be utilized for link prediction. Although several link prediction methods utilizing bi-cliques have been proposed and perform well in rather small datasets, all of them face challenges with scalability when dealing with large datasets since they demand substantial computational resources. This limits the practical utility of these approaches in real-world applications. To overcome the limitation, we introduce a novel approach employing iceberg concept lattices and the Transformer encoder. Our method requires fewer computational resources, making it suitable for large-scale datasets while maintaining high prediction performance. We conduct experiments on five large real-world datasets that exceed the capacity of previous bi-clique-based approaches to demonstrate the efficacy of our method. Additionally, we perform supplementary experiments on five small datasets to compare with the previous bi-clique-based methods for bipartite link prediction and demonstrate that our method is more efficient than the previous ones.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2503.07645

Country:

Asia > Japan (0.14)
Europe > Greece (0.14)
Europe > Russia (0.14)

Genre: Research Report > Promising Solution (0.34)

Industry: Information Technology (0.67)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

Implementing Derivations of Definite Logic Programs with Self-Attention Networks

Thuy, Phan Thi Thanh, Yamamoto, Akihiro

arXiv.org Artificial IntelligenceOct-15-2024

In this paper we propose that a restricted version of logical inference can be implemented with self-attention networks. We are aiming at showing that LLMs (Large Language Models) constructed with transformer networks can make logical inferences. We would reveal the potential of LLMs by analyzing self-attention networks, which are main components of transformer networks. Our approach is not based on semantics of natural languages but operations of logical inference. %point of view. We show that hierarchical constructions of self-attention networks with feed forward networks (FFNs) can implement top-down derivations for a class of logical formulae. We also show bottom-up derivations are also implemented for the same class. We believe that our results show that LLMs implicitly have the power of logical inference.

derivation, large language model, logic & formal reasoning, (18 more...)

arXiv.org Artificial Intelligence

2410.11396

Country: Asia (0.29)

Genre: Research Report > New Finding (0.55)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.97)

Add feedback

HTML-LSTM: Information Extraction from HTML Tables in Web Pages using Tree-Structured LSTM

Kawamura, Kazuki, Yamamoto, Akihiro

arXiv.org Artificial IntelligenceSep-28-2024

In this paper, we propose a novel method for extracting information from HTML tables with similar contents but with a different structure. We aim to integrate multiple HTML tables into a single table for retrieval of information containing in various Web pages. The method is designed by extending tree-structured LSTM, the neural network for tree-structured data, in order to extract information that is both linguistic and structural information of HTML data. We evaluate the proposed method through experiments using real data published on the WWW.

artificial intelligence, information, machine learning, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-030-88942-5_3

2409.19445

Genre: Research Report (1.00)

Industry: Education > Educational Setting > Higher Education (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

BERT4FCA: A Method for Bipartite Link Prediction using Formal Concept Analysis and BERT

Peng, Siqi, Yang, Hongyuan, Yamamoto, Akihiro

arXiv.org Artificial IntelligenceFeb-13-2024

We propose BERT4FCA, a novel method for link prediction in bipartite networks, using formal concept analysis (FCA) and BERT. Link prediction in bipartite networks is an important task that can solve various practical problems like friend recommendation in social networks and co-authorship prediction in author-paper networks. Recent research has found that in bipartite networks, maximal bi-cliques provide important information for link prediction, and they can be extracted by FCA. Some FCA-based bipartite link prediction methods have achieved good performance. However, we figured out that their performance could be further improved because these methods did not fully capture the rich information of the extracted maximal bi-cliques. To address this limitation, we propose an approach using BERT, which can learn more information from the maximal bi-cliques extracted by FCA and use them to make link prediction. We conduct experiments on three real-world bipartite networks and demonstrate that our method outperforms previous FCA-based methods, and some classic methods such as matrix-factorization and node2vec.

artificial intelligence, data mining, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2402.08236

Country:

Asia > Japan (0.14)
Europe > Greece (0.14)
Europe > Russia (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Differentiable Inductive Logic Programming for Structured Examples

Shindo, Hikaru, Nishino, Masaaki, Yamamoto, Akihiro

arXiv.org Artificial IntelligenceMar-2-2021

The differentiable implementation of logic yields a seamless combination of symbolic reasoning and deep neural networks. Recent research, which has developed a differentiable framework to learn logic programs from examples, can even acquire reasonable solutions from noisy datasets. However, this framework severely limits expressions for solutions, e.g., no function symbols are allowed, and the shapes of clauses are fixed. As a result, the framework cannot deal with structured examples. Therefore we propose a new framework to learn logic programs from noisy and structured examples, including the following contributions. First, we propose an adaptive clause search method by looking through structured space, which is defined by the generality of the clauses, to yield an efficient search space for differentiable solvers. Second, we propose for ground atoms an enumeration algorithm, which determines a necessary and sufficient set of ground atoms to perform differentiable inference functions. Finally, we propose a new method to compose logic programs softly, enabling the system to deal with complex programs consisting of several clauses. Our experiments show that our new framework can learn logic programs from noisy and structured examples, such as sequences or trees. Our framework can be scaled to deal with complex programs that consist of several clauses with function symbols.

atom, logic programming, survey article, (17 more...)

arXiv.org Artificial Intelligence

2103.01719

Country: Asia > Japan (0.14)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

Add feedback

Automatic Source Code Summarization with Extended Tree-LSTM

Shido, Yusuke, Kobayashi, Yasuaki, Yamamoto, Akihiro, Miyamoto, Atsushi, Matsumura, Tadayuki

arXiv.org Machine LearningJun-19-2019

Neural machine translation models are used to automatically generate a document from given source code since this can be regarded as a machine translation task. Source code summarization is one of the components for automatic document generation, which generates a summary in natural language from given source code. This suggests that techniques used in neural machine translation, such as Long Short-Term Memory (LSTM), can be used for source code summarization. However, there is a considerable difference between source code and natural language: Source code is essentially {\em structured}, having loops and conditional branching, etc. Therefore, there is some obstacle to apply known machine translation models to source code. Abstract syntax trees (ASTs) capture these structural properties and play an important role in recent machine learning studies on source code. Tree-LSTM is proposed as a generalization of LSTMs for tree-structured data. However, there is a critical issue when applying it to ASTs: It cannot handle a tree that contains nodes having an arbitrary number of children and their order simultaneously, which ASTs generally have such nodes. To address this issue, we propose an extension of Tree-LSTM, which we call \emph{Multi-way Tree-LSTM} and apply it for source code summarization. As a result of computational experiments, our proposal achieved better results when compared with several state-of-the-art techniques.

deep learning, neural network, source code, (20 more...)

arXiv.org Machine Learning

1906.08094

Country: Asia > Japan (0.14)

Genre: Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Inductive Logic Programming: Challenges

Inoue, Katsumi (National Institute of Informatics) | Ohwada, Hayato (Tokyo University of Science) | Yamamoto, Akihiro (Kyoto University)

AAAI ConferencesApr-19-2016

Stephen Muggleton gave the invited talk "Meta-Interpretive Inductive Logic Programming (ILP) is a research area Learning: achievements and challenges". Meta-Interpretive formed at the intersection of Machine Learning and logicbased Learning (MIL) is an ILP technique aimed at supporting knowledge representation. ILP has originally used learning of recursive definitions, by automatically introducing logic programming as a uniform representation language sub-definitions that allow decomposition into a hierarchy for examples, background knowledge and hypotheses for of reusable parts (Muggleton et al. 2014; 2015). ILP has also explored several connections (or abducing) first-order clauses whose heads unify with with statistical learning and other probabilistic approaches, a given goal. MIL additionally fetches higher-order metarules expanding research horizons significantly. A recent survey whose heads unify with the goal and saves the resulting of ILP can be seen in (Muggleton et al. 2012).

artificial intelligence, ilp 2015, logic programming, (16 more...)

AAAI Conferences

Thirtieth AAAI Conference on Artificial Intelligence

Country: Asia > Japan > Honshū (0.16)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

Add feedback

Causal Discovery in a Binary Exclusive-or Skew Acyclic Model: BExSAM

Inazumi, Takanori, Washio, Takashi, Shimizu, Shohei, Suzuki, Joe, Yamamoto, Akihiro, Kawahara, Yoshinobu

arXiv.org Machine LearningJan-22-2014

Discovering causal relations among observed variables in a given data set is a major objective in studies of statistics and artificial intelligence. Recently, some techniques to discover a unique causal model have been explored based on non-Gaussianity of the observed data distribution. However, most of these are limited to continuous data. In this paper, we present a novel causal model for binary data and propose an efficient new approach to deriving the unique causal model governing a given binary data set under skew distributions of external binary noises. Experimental evaluation shows excellent performance for both artificial and real world data sets.

algorithm, artificial intelligence, health & medicine, (18 more...)

arXiv.org Machine Learning

1401.5636

Country:

Asia > Japan > Honshū > Kansai (0.14)
North America > United States > New York (0.14)
North America > United States > Utah (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Genre: Research Report (0.82)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Discovering causal structures in binary exclusive-or skew acyclic models

Inazumi, Takanori, Washio, Takashi, Shimizu, Shohei, Suzuki, Joe, Yamamoto, Akihiro, Kawahara, Yoshinobu

arXiv.org Machine LearningFeb-14-2012

Discovering causal relations among observed variables in a given data set is a main topic in studies of statistics and artificial intelligence. Recently, some techniques to discover an identifiable causal structure have been explored based on non-Gaussianity of the observed data distribution. However, most of these are limited to continuous data. In this paper, we present a novel causal model for binary data and propose a new approach to derive an identifiable causal structure governing the data based on skew Bernoulli distributions of external noise. Experimental evaluation shows excellent performance for both artificial and real world data sets.

artificial intelligence, causal structure, health & medicine, (17 more...)

arXiv.org Machine Learning

1202.3736

Country:

Asia > Japan > Honshū > Kansai (0.14)
North America > United States > Utah (0.14)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.47)

Add feedback