Goto

Collaborating Authors

Yamaguchi, Shoichiro


Virtual Human Generative Model: Masked Modeling Approach for Learning Human Characteristics

arXiv.org Artificial Intelligence

Identifying the relationships among healthcare attributes, lifestyle, and personality is vital for understanding and improving physical and mental conditions. Machine learning approaches are promising for modeling these relationships and offering actionable suggestions. In this paper, we propose the Virtual Human Generative Model (VHGM), a machine learning model for estimating attributes related to healthcare, lifestyle, and personality. VHGM is a deep generative model trained with masked modeling to learn the joint distribution of attributes conditioned on known ones. Using heterogeneous tabular datasets, VHGM learns more than 1,800 attributes efficiently. We numerically evaluate the performance of VHGM and its training techniques. As a proof of concept, we present several applications demonstrating user scenarios, such as virtual measurement of healthcare attributes and hypothesis verification about lifestyles.
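The masked-modeling idea can be illustrated with a minimal sketch (hypothetical code, not the authors' VHGM architecture): attribute vectors are randomly masked, the model sees only the observed values plus the mask indicator, and the loss is computed on the masked entries alone.

```python
# Minimal sketch of masked tabular modeling (hypothetical, not VHGM itself).
import torch
import torch.nn as nn

n_attrs = 16            # small stand-in for the ~1,800 real attributes
model = nn.Sequential(  # takes [values ; mask] and predicts all attributes
    nn.Linear(2 * n_attrs, 128), nn.ReLU(), nn.Linear(128, n_attrs)
)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

for step in range(200):
    x = torch.randn(64, n_attrs)                 # synthetic continuous attributes
    mask = (torch.rand_like(x) < 0.5).float()    # 1 = observed, 0 = masked
    inp = torch.cat([x * mask, mask], dim=1)     # hide masked values, expose mask
    pred = model(inp)
    # Reconstruction loss only on the masked (unobserved) entries.
    loss = (((pred - x) ** 2) * (1 - mask)).sum() / (1 - mask).sum().clamp(min=1)
    opt.zero_grad(); loss.backward(); opt.step()
```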


Learning-based Collision-free Planning on Arbitrary Optimization Criteria in the Latent Space through cGANs

arXiv.org Artificial Intelligence

We propose a new method for collision-free planning that uses Conditional Generative Adversarial Networks (cGANs) to map between the robot's joint space and a latent space that captures only the collision-free regions of the joint space, conditioned on an obstacle map. Generating multiple plausible trajectories is convenient in applications such as robot-arm manipulation, since it enables the selection of trajectories that avoid collisions with the robot itself or the surrounding environment. In the proposed method, diverse obstacle-avoiding trajectories can be generated by connecting the start and goal states with arbitrary line segments in this learned latent space. Because our method provides a collision-free latent space, any planner, with any optimization criteria, can then generate the most suitable paths on the fly. We verified this method with a simulated and an actual UR5e 6-DoF robotic arm, and confirmed that different trajectories can be generated depending on the optimization criteria.
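A minimal sketch of the inference-time idea, assuming a toy generator (the architecture and the obstacle-map encoding below are illustrative placeholders, not the paper's network): interpolate linearly between two latent codes and decode each point, conditioned on the obstacle map, into a joint configuration.

```python
# Sketch of decoding a straight latent-space segment into joint-space waypoints.
# The generator below is untrained; its shape is a hypothetical stand-in.
import torch
import torch.nn as nn

latent_dim, obst_dim, dof = 8, 32, 6   # 6-DoF arm such as the UR5e

class Generator(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(latent_dim + obst_dim, 128), nn.ReLU(),
            nn.Linear(128, dof),
        )
    def forward(self, z, obstacle_map):
        return self.net(torch.cat([z, obstacle_map], dim=-1))

G = Generator()
obstacle = torch.randn(obst_dim)           # encoded obstacle map (placeholder)
z_start, z_goal = torch.randn(latent_dim), torch.randn(latent_dim)

# Any planner could choose the segment; here we take 20 evenly spaced points.
ts = torch.linspace(0.0, 1.0, steps=20).unsqueeze(1)
zs = (1 - ts) * z_start + ts * z_goal              # straight line in latent space
waypoints = G(zs, obstacle.expand(len(ts), -1))    # decoded joint trajectory
print(waypoints.shape)                             # torch.Size([20, 6])
```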


Out-of-Distribution Generalization with Maximal Invariant Predictor

arXiv.org Machine Learning

The Out-of-Distribution (OOD) generalization problem is the problem of seeking a predictor whose performance in the worst environment is optimal. This paper makes two contributions to the OOD problem. We first use basic results from probability theory to prove the Maximal Invariant Predictor (MIP) condition, a theoretical result that can be used to identify the OOD-optimal solution. We then use MIP to derive the Inter-environmental Gradient Alignment (IGA) algorithm, which helps seek the OOD-optimal predictor. Previous studies that investigated the theoretical aspects of the OOD problem rely on strong structural assumptions such as a causal DAG. However, in cases involving image datasets, for example, identifying the hidden structural relations is itself a difficult problem. Our theoretical results differ from those of many previous studies in that they can be applied to cases in which the underlying structure of the dataset is difficult to analyze. We present an extensive comparison of previous theoretical approaches to the OOD problem based on the assumptions they make. We also present an extension of Colored-MNIST that represents the pathological OOD situation more accurately than the original version, and demonstrate the superiority of IGA over previous methods on both the original and the extended version of Colored-MNIST.
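A minimal sketch of the gradient-alignment idea behind IGA: the loss is the mean per-environment risk plus a penalty on how far each environment's risk gradient deviates from the mean gradient. The penalty weight and the toy model below are assumptions, not the paper's exact setup.

```python
# Sketch of Inter-environmental Gradient Alignment: penalize the variance of
# per-environment risk gradients around their mean (toy model, synthetic data).
import torch
import torch.nn as nn

model = nn.Linear(10, 1)
params = list(model.parameters())
lam = 1.0  # penalty weight (an assumption)

def env_risk(xe, ye):
    return ((model(xe) - ye) ** 2).mean()

envs = [(torch.randn(32, 10), torch.randn(32, 1)) for _ in range(3)]
risks = [env_risk(x, y) for x, y in envs]
grads = [torch.autograd.grad(r, params, create_graph=True) for r in risks]

# Flatten each environment's gradient into one vector.
flat = [torch.cat([g.reshape(-1) for g in gs]) for gs in grads]
mean_grad = torch.stack(flat).mean(dim=0)
penalty = sum(((f - mean_grad) ** 2).sum() for f in flat)

loss = torch.stack(risks).mean() + lam * penalty
loss.backward()  # the penalty is differentiable thanks to create_graph=True
```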


Data Interpolating Prediction: Alternative Interpretation of Mixup

arXiv.org Machine Learning

Data augmentation by mixing samples, as in Mixup, has been widely used, typically for classification tasks. However, this strategy is not always effective because of the gap between the augmented samples used for training and the original samples used for testing. This gap may prevent a classifier from learning the optimal decision boundary and may increase the generalization error. To overcome this problem, we propose an alternative framework called Data Interpolating Prediction (DIP). Unlike common data augmentations, we encapsulate the sample-mixing process in the hypothesis class of the classifier so that training and test samples are treated equally. We derive a generalization bound and show that DIP helps to reduce the original Rademacher complexity. We also empirically demonstrate that DIP can outperform existing Mixup.
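A minimal sketch of the DIP idea as described, with the mixing encapsulated inside the hypothesis: the classifier's forward pass averages the base model's outputs over randomly mixed copies of the input, and the same procedure runs at both training and test time. The base network, Beta mixing distribution, and number of Monte Carlo draws below are illustrative assumptions.

```python
# Sketch of Data Interpolating Prediction: the mixing step lives inside the
# hypothesis, so the identical averaged predictor is used at train and test.
import torch
import torch.nn as nn

class DIPClassifier(nn.Module):
    def __init__(self, base, reference_x, n_draws=8, alpha=1.0):
        super().__init__()
        self.base = base
        self.register_buffer("ref", reference_x)  # samples to mix with
        self.n_draws, self.alpha = n_draws, alpha

    def forward(self, x):
        outs = []
        for _ in range(self.n_draws):
            lam = torch.distributions.Beta(self.alpha, self.alpha).sample()
            idx = torch.randint(len(self.ref), (len(x),))
            mixed = lam * x + (1 - lam) * self.ref[idx]
            outs.append(self.base(mixed))
        return torch.stack(outs).mean(dim=0)  # Monte Carlo average over mixes

base = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 3))
ref = torch.randn(100, 20)                    # training inputs as mix partners
clf = DIPClassifier(base, ref)
logits = clf(torch.randn(5, 20))              # same procedure at test time
print(logits.shape)                           # torch.Size([5, 3])
```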


Semi-flat minima and saddle points by embedding neural networks to overparameterization

arXiv.org Machine Learning

We theoretically study the landscape of the training error for neural networks in the overparameterized regime. We consider three basic methods for embedding a network into a wider one with more hidden units, and discuss whether a minimum of the narrower network gives a minimum or a saddle point of the wider one. Our results show that networks with smooth activations and networks with ReLU activations have different partially flat landscapes around the embedded point. We also relate these results to a difference in their generalization abilities under overparameterized realization.
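One natural embedding of this kind can be checked numerically: duplicate a hidden unit and split its outgoing weight by (a, 1-a), so every value of a yields the same function and hence a flat line through the embedded point. Whether this matches the paper's exact construction is an assumption; the sizes below are illustrative.

```python
# Sketch of unit duplication with a split outgoing weight: the wider net
# computes the same function for every split coefficient a.
import torch
import torch.nn as nn

torch.manual_seed(0)
narrow = nn.Sequential(nn.Linear(4, 3), nn.ReLU(), nn.Linear(3, 1))

def embed(narrow, unit=0, a=0.3):
    wide = nn.Sequential(nn.Linear(4, 4), nn.ReLU(), nn.Linear(4, 1))
    w1, b1 = narrow[0].weight.data, narrow[0].bias.data
    w2, b2 = narrow[2].weight.data, narrow[2].bias.data
    # Copy all units, then append a clone of `unit` with split output weight.
    wide[0].weight.data = torch.cat([w1, w1[unit:unit + 1]], dim=0)
    wide[0].bias.data = torch.cat([b1, b1[unit:unit + 1]], dim=0)
    out = torch.cat([w2, w2[:, unit:unit + 1]], dim=1)
    out[:, unit] *= a           # original copy gets weight a
    out[:, -1] *= (1 - a)       # duplicate gets weight 1 - a
    wide[2].weight.data = out
    wide[2].bias.data = b2.clone()
    return wide

x = torch.randn(8, 4)
for a in (0.0, 0.3, 0.9):       # every a gives the same function
    assert torch.allclose(narrow(x), embed(narrow, a=a)(x), atol=1e-6)
print("outputs identical along the embedded line")
```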


A Differentiable Gaussian-like Distribution on Hyperbolic Space for Gradient-Based Learning

arXiv.org Machine Learning

Hyperbolic space is a geometry known to be well suited for representation learning of data with an underlying hierarchical structure. In this paper, we present a novel hyperbolic distribution called the pseudo-hyperbolic Gaussian, a Gaussian-like distribution on hyperbolic space whose density can be evaluated analytically and differentiated with respect to its parameters. Our distribution enables gradient-based learning of probabilistic models on hyperbolic space that could never have been considered before. Moreover, we can sample from this hyperbolic probability distribution without resorting to auxiliary means such as rejection sampling. As applications of our distribution, we develop a hyperbolic analogue of the variational autoencoder and a method for probabilistic word embedding on hyperbolic space. We demonstrate the efficacy of our distribution on various datasets, including MNIST, Atari 2600 Breakout, and WordNet.
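A minimal sketch of the wrapped-normal construction as commonly described for this distribution: sample in the tangent space at the origin of the Lorentz model, parallel-transport the vector to mu, then push it onto the manifold with the exponential map. The helper names and the isotropic covariance below are assumptions.

```python
# Sketch of sampling a wrapped (pseudo-hyperbolic) Gaussian on the Lorentz
# model: tangent sample at the origin -> parallel transport -> exponential map.
import torch

def lorentz_inner(x, y):
    # Lorentzian inner product <x, y> = -x0*y0 + sum_i xi*yi
    return -x[..., 0] * y[..., 0] + (x[..., 1:] * y[..., 1:]).sum(-1)

def sample_wrapped_normal(mu, sigma, n_samples):
    n = mu.shape[-1] - 1
    origin = torch.zeros_like(mu); origin[0] = 1.0
    v = sigma * torch.randn(n_samples, n)                   # N(0, sigma^2 I)
    u = torch.cat([torch.zeros(n_samples, 1), v], dim=-1)   # tangent at origin
    # Parallel transport from the origin to mu (note <origin, u> = 0).
    alpha = -lorentz_inner(origin, mu)
    coef = lorentz_inner(mu, u) / (alpha + 1.0)
    u_mu = u + coef.unsqueeze(-1) * (origin + mu)
    # Exponential map at mu.
    r = torch.sqrt(lorentz_inner(u_mu, u_mu).clamp(min=1e-12)).unsqueeze(-1)
    return torch.cosh(r) * mu + torch.sinh(r) * u_mu / r

# A point on the hyperboloid: mu = (sqrt(1 + |p|^2), p).
p = torch.tensor([0.5, -0.2])
mu = torch.cat([torch.sqrt(1 + (p * p).sum()).reshape(1), p])
z = sample_wrapped_normal(mu, sigma=0.3, n_samples=4)
print(lorentz_inner(z, z))    # ~ -1 everywhere: samples stay on the manifold
```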