AITopics | Aramaki, Eiji

Aramaki, Eiji

Investigating Neurons and Heads in Transformer-based LLMs for Typographical Errors

Tsuji, Kohei, Hiraoka, Tatsuya, Cheng, Yuchang, Aramaki, Eiji, Iwakura, Tomoya

arXiv.org Artificial Intelligence, Feb-26-2025

This paper investigates how LLMs encode inputs with typos. We hypothesize that specific neurons and attention heads recognize typos and fix them internally using local and global contexts. We introduce a method to identify typo neurons and typo heads that are active when inputs contain typos. Our experimental results suggest the following: 1) LLMs can fix typos with local contexts when the typo neurons in either the early or late layers are activated, even if those in the other are not. 2) Typo neurons in the middle layers are responsible for the core of typo-fixing with global contexts. 3) Typo heads fix typos by considering the context broadly rather than focusing on specific tokens. 4) Typo neurons and typo heads work not only for typo-fixing but also for understanding general contexts.

large language model, machine learning, natural language, (21 more...)

2502.19669

Country:

North America > United States (0.15)
Europe > Spain (0.14)
Europe > Italy (0.14)
(3 more...)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

LLM-jp: Aizawa, Akiko, Aramaki, Eiji, Chen, Bowen, Cheng, Fei, Deguchi, Hiroyuki, Enomoto, Rintaro, Fujii, Kazuki, Fukumoto, Kensuke, Fukushima, Takuya, Han, Namgi, Harada, Yuto, Hashimoto, Chikara, Hiraoka, Tatsuya, Hisada, Shohei, Hosokawa, Sosuke, Jie, Lu, Kamata, Keisuke, Kanazawa, Teruhito, Kanezashi, Hiroki, Kataoka, Hiroshi, Katsumata, Satoru, Kawahara, Daisuke, Kawano, Seiya, Keyaki, Atsushi, Kiryu, Keisuke, Kiyomaru, Hirokazu, Kodama, Takashi, Kubo, Takahiro, Kuga, Yohei, Kumon, Ryoma, Kurita, Shuhei, Kurohashi, Sadao, Li, Conglong, Maekawa, Taiki, Matsuda, Hiroshi, Miyao, Yusuke, Mizuki, Kentaro, Mizuki, Sakae, Murawaki, Yugo, Nakamura, Ryo, Nakamura, Taishi, Nakayama, Kouta, Nakazato, Tomoka, Niitsuma, Takuro, Nishitoba, Jiro, Oda, Yusuke, Ogawa, Hayato, Okamoto, Takumi, Okazaki, Naoaki, Oseki, Yohei, Ozaki, Shintaro, Ryu, Koki, Rzepka, Rafal, Sakaguchi, Keisuke, Sasaki, Shota, Sekine, Satoshi, Suda, Kohei, Sugawara, Saku, Sugiura, Issa, Sugiyama, Hiroaki, Suzuki, Hisami, Suzuki, Jun, Suzumura, Toyotaro, Tachibana, Kensuke, Takagi, Yu, Takami, Kyosuke, Takeda, Koichi, Takeshita, Masashi, Tanaka, Masahiro, Taura, Kenjiro, Tolmachev, Arseny, Ueda, Nobuhiro, Wan, Zhen, Yada, Shuntaro, Yahata, Sakiko, Yamamoto, Yuya, Yamauchi, Yusuke, Yanaka, Hitomi, Yokota, Rio, Yoshino, Koichiro

arXiv.org Artificial Intelligence, Jul-4-2024

This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its activities, and technical reports on the LLMs developed by LLM-jp.

large language model, machine learning, natural language, (20 more...)

2407.03963

Country:

North America (1.00)
Europe (1.00)
Asia > Japan > Honshū > Kantō (0.14)

Genre:

Research Report (0.50)
Questionnaire & Opinion Survey (0.46)

Industry:

Education (0.93)
Health & Medicine > Therapeutic Area (0.68)
Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Enhancing In-Context Learning with Semantic Representations for Relation Extraction

Han, Peitao, Pereira, Lis Kanashiro, Cheng, Fei, She, Wan Jou, Aramaki, Eiji

arXiv.org Artificial Intelligence, Jun-14-2024

In this work, we employ two semantic representations enhanced with Abstract Meaning Representation (AMR) for in-context learning (ICL) on relation extraction (RE): one that explores the AMR structure generated for a sentence at the subgraph level (shortest AMR path), and another that explores the full AMR structure generated for a sentence. In both cases, we demonstrate that all settings benefit from AMR's fine-grained semantic structure. We evaluate our model on four RE datasets. Our results show that our model can outperform the GPT-based baselines, achieving SOTA performance on two of the datasets and competitive performance on the other two.

computational linguistic, large language model, machine learning, (19 more...)

2406.10432

Country:

Asia > Japan (0.15)
North America > Canada (0.15)
Asia > Middle East > UAE (0.15)
(4 more...)

Genre: Research Report > New Finding (0.54)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

A Dataset for Pharmacovigilance in German, French, and Japanese: Annotating Adverse Drug Reactions across Languages

Raithel, Lisa, Yeh, Hui-Syuan, Yada, Shuntaro, Grouin, Cyril, Lavergne, Thomas, Névéol, Aurélie, Paroubek, Patrick, Thomas, Philippe, Nishiyama, Tomohiro, Möller, Sebastian, Aramaki, Eiji, Matsumoto, Yuji, Roller, Roland, Zweigenbaum, Pierre

arXiv.org Artificial Intelligence, Mar-27-2024

User-generated data sources have gained significance in uncovering Adverse Drug Reactions (ADRs), with an increasing number of discussions occurring in the digital world. However, the existing clinical corpora predominantly revolve around scientific articles in English. This work presents a multilingual corpus of texts concerning ADRs gathered from diverse sources, including patient fora, social media, and clinical reports in German, French, and Japanese. Our corpus contains annotations covering 12 entity types, four attribute types, and 13 relation types. It contributes to the development of real-world multilingual language models for healthcare. We provide statistics to highlight certain challenges associated with the corpus and conduct preliminary experiments resulting in strong baselines for extracting entities and relations between these entities, both within and across languages.

large language model, machine learning, natural language, (21 more...)

2403.18336

Country:

Europe (1.00)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (0.50)

Industry:

Information Technology (0.93)
Health & Medicine > Pharmaceuticals & Biotechnology (0.68)
Health & Medicine > Consumer Health (0.67)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

JaMIE: A Pipeline Japanese Medical Information Extraction System

Cheng, Fei, Yada, Shuntaro, Tanaka, Ribeka, Aramaki, Eiji, Kurohashi, Sadao

arXiv.org Artificial Intelligence, Nov-7-2021

We present an open-access natural language processing toolkit for Japanese medical information extraction. We first propose a novel relation annotation schema for investigating the medical and temporal relations between medical entities in Japanese medical reports. We experiment with practical annotation scenarios by separately annotating two different types of reports. We design a pipeline system with three components for recognizing medical entities, classifying entity modalities, and extracting relations. The empirical results show accurate analysis performance and suggest satisfactory annotation quality, an effective annotation strategy for the targeted report types, and the superiority of the latest contextual embedding models.

machine learning, natural language, relation, (21 more...)

2111.04261

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report > New Finding (0.34)

Industry:

Health & Medicine > Therapeutic Area (0.68)
Health & Medicine > Health Care Technology > Medical Record (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Single Model for Influenza Forecasting of Multiple Countries by Multi-task Learning

Murayama, Taichi, Wakamiya, Shoko, Aramaki, Eiji

arXiv.org Artificial Intelligence, Jul-7-2021

The accurate forecasting of infectious epidemic diseases such as influenza is a crucial task undertaken by medical institutions. Although previous studies have proposed numerous flu forecasting methods and models based mainly on historical flu activity data and online user-generated content, no flu forecasting model at present targets multiple countries using both types of data. Our paper leverages multi-task learning to tackle the challenge of building one flu forecasting model targeting multiple countries, treating each country as a separate task. To develop a flu prediction model with higher performance, we also address two issues: finding suitable search queries, which are part of the user-generated content, and leveraging those search queries efficiently in model creation. For the first issue, we propose transfer approaches from English to other languages. For the second issue, we propose a novel flu forecasting model that takes advantage of search queries using an attention mechanism, and we extend the model to a multi-task model for multiple countries' flu forecasts. Experiments on forecasting flu epidemics in five countries demonstrate that our model significantly improves performance over the baselines by leveraging search queries and multi-task learning.

deep learning, immunology, search query, (20 more...)

2107.0176

Country:

Europe > United Kingdom (0.28)
North America > United States (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Smartphone-Based Self Management System for Type-2 Diabetes Patients

AAAI Conferences, Mar-25-2012

This paper proposes a novel telemedicine system for type 2 diabetes patients. The proposed system supports patient self-management via a set of telemedicine devices consisting of health sensors and a smartphone. The system covers not only sensor data but also diet (food) and exercise data. To capture food information, we also developed a voice recognition module focused on food names. The basic feasibility of the system is demonstrated in a preliminary experiment.

diabetes, health & medicine, smartphone-based self management system, (18 more...)

2012 AAAI Spring Symposium Series

Country: Asia > Japan (0.15)

Industry: Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (1.00)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Mobile (0.91)
Information Technology > Architecture > Autonomic Computing (0.74)

Influenza Patients Are Invisible in the Web: Traditional Model Still Improves the State of the Art Web Based Influenza Surveillance

Aramaki, Eiji (University of Tokyo) | Maskawa, Sachiko (University of Tokyo) | Morita, Mizuki

AAAI Conferences, Mar-25-2012

Although web-based information extraction systems draw much attention, most such systems assume that the web directly reflects the real world. For instance, Google Flu Trends, one of the state-of-the-art influenza surveillance systems, relies on the basic idea that the number of influenza-related search queries directly correlates with the number of influenza patients. However, real patients suffering from influenza symptoms are invisible on the web, because they do not use the Internet. Considering this gap, this paper employs an infectious model, assuming that a potential patient uses the Internet at the first sign of flu. The proposed model improves two types of state-of-the-art systems: a Google-based system (from 0.837 correlation to 0.928) and a Twitter-based system (from 0.898 correlation to 0.918). This study demonstrates that a simple model can easily improve web-based surveillance.

immunology, influenza patient, social media, (19 more...)

2012 AAAI Spring Symposium Series

Country: Asia > Japan > Honshū (0.15)

Genre: Research Report > New Finding (0.89)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Communications > Social Media (0.99)
Information Technology > Information Management > Search (0.92)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.69)
