
Collaborating Authors

Park, Sungjoon


FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning

arXiv.org Artificial Intelligence

Psychiatrists diagnose mental disorders via patients' linguistic use. Still, due to data privacy concerns, existing passive mental health monitoring systems rely on alternative features collected from mobile devices, such as activity, app usage, and location. We propose FedTherapist, a mobile mental health monitoring system that utilizes continuous speech and keyboard input in a privacy-preserving way via federated learning. To overcome the complexity of on-device language model training on smartphones, we explore multiple model designs for FedTherapist and compare their performance and overhead. We further propose a Context-Aware Language Learning (CALL) methodology to effectively utilize smartphones' large and noisy text for mental health signal sensing. Our IRB-approved evaluation on the prediction of self-reported depression, stress, anxiety, and mood from 46 participants shows that FedTherapist is more accurate than models trained on non-language features, achieving a 0.15 AUROC improvement and an 8.21% MAE reduction.
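To make the federated setup concrete, below is a minimal federated-averaging sketch: clients train on private data locally and share only model weights, which the server averages by sample count. It is an illustration of the general technique, not the FedTherapist implementation; all function names and the toy regression task are hypothetical.

```python
# Minimal federated-averaging sketch (hypothetical; not the FedTherapist code).
# Each client trains on its private data locally and only model weights are
# shared; the server averages them, weighted by each client's sample count.
import numpy as np

def local_update(weights, client_data, lr=0.01):
    """One pass of toy gradient descent on a client's private (x, y) pairs."""
    w = weights.copy()
    for x, y in client_data:
        grad = 2.0 * (w @ x - y) * x      # gradient of the squared error
        w -= lr * grad
    return w, len(client_data)

def federated_round(server_weights, clients):
    """One communication round: clients train locally, the server averages."""
    updates, sizes = [], []
    for data in clients:
        w, n = local_update(server_weights, data)
        updates.append(w)
        sizes.append(n)
    sizes = np.asarray(sizes, dtype=float)
    return np.average(updates, axis=0, weights=sizes / sizes.sum())

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    true_w = np.array([1.0, -2.0])
    clients = []
    for _ in range(5):                    # five simulated devices
        data = []
        for _ in range(20):
            x = rng.normal(size=2)
            data.append((x, float(x @ true_w + rng.normal(scale=0.1))))
        clients.append(data)
    w = np.zeros(2)
    for _ in range(30):
        w = federated_round(w, clients)
    print("learned weights:", w)
```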


Towards standardizing Korean Grammatical Error Correction: Datasets and Annotation

arXiv.org Artificial Intelligence

Research on Korean grammatical error correction (GEC) is limited compared to other major languages such as English. We attribute this problematic circumstance to the lack of a carefully designed evaluation benchmark for Korean GEC. In this work, we collect three datasets from different sources (Kor-Lang8, Kor-Native, and Kor-Learner) that cover a wide range of Korean grammatical errors. Considering the nature of Korean grammar, we then define 14 error types for Korean and provide KAGAS (Korean Automatic Grammatical error Annotation System), which can automatically annotate error types from parallel corpora. We use KAGAS on our datasets to build an evaluation benchmark for Korean and present baseline models trained on our datasets. We show that a model trained on our datasets significantly outperforms the currently used statistical Korean GEC system (Hanspell) on a wider range of error types, demonstrating the diversity and usefulness of the datasets. The implementations and datasets are open-sourced.
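For intuition on what automatic error annotation from parallel corpora involves, here is a small sketch that extracts word-level edits from an (erroneous, corrected) sentence pair with difflib. It is only an assumed illustration of the alignment step; KAGAS additionally applies Korean-specific rules to label each edit with one of the 14 error types.

```python
# Hypothetical sketch of edit extraction from a parallel (erroneous, corrected)
# sentence pair, the kind of alignment an annotator such as KAGAS builds on.
import difflib

def extract_edits(source: str, corrected: str):
    """Return (operation, source_span, corrected_span) word-level edits."""
    src, cor = source.split(), corrected.split()
    matcher = difflib.SequenceMatcher(a=src, b=cor)
    edits = []
    for op, i1, i2, j1, j2 in matcher.get_opcodes():
        if op != "equal":
            edits.append((op, " ".join(src[i1:i2]), " ".join(cor[j1:j2])))
    return edits

if __name__ == "__main__":
    # Toy English example for readability; the benchmark sentences are Korean.
    print(extract_edits("He go to school yesterday", "He went to school yesterday"))
    # -> [('replace', 'go', 'went')]
```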


Conversation Model Fine-Tuning for Classifying Client Utterances in Counseling Dialogues

arXiv.org Machine Learning

The recent surge of text-based online counseling applications enables us to collect and analyze interactions between counselors and clients. A dataset of those interactions can be used to learn to automatically classify client utterances into categories that help counselors diagnose client status and predict counseling outcomes. With proper anonymization, we collect counselor-client dialogues, define meaningful categories of client utterances with professional counselors, and develop a novel neural network model for classifying the client utterances. The central idea of our model, ConvMFiT, is a pre-trained conversation model, which consists of a general language model built from an out-of-domain corpus and two role-specific language models built from unlabeled in-domain dialogues. The classification results show that ConvMFiT outperforms state-of-the-art comparison models. Further, the attention weights of the learned model confirm that it finds the expected linguistic patterns for each category.
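As a rough illustration of the fine-tuning idea, the following is a simplified stand-in: an encoder initialized from language-model weights pre-trained on unlabeled dialogues, topped with a classification head and fine-tuned on labeled utterances. The class names and the single-LSTM encoder are assumptions for brevity; the actual ConvMFiT combines a general LM with two role-specific LMs.

```python
# Simplified stand-in for fine-tuning a pre-trained language-model encoder with
# a classification head on client utterances (not the actual ConvMFiT model).
import torch
import torch.nn as nn

class UtteranceClassifier(nn.Module):
    def __init__(self, vocab_size, num_classes, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, emb_dim)
        self.encoder = nn.LSTM(emb_dim, hid_dim, batch_first=True)
        self.head = nn.Linear(hid_dim, num_classes)

    def load_pretrained_lm(self, state_dict):
        # Initialize embedding and encoder from a language model trained on
        # unlabeled dialogues, then fine-tune everything on labeled utterances.
        self.load_state_dict(state_dict, strict=False)

    def forward(self, token_ids):
        emb = self.embedding(token_ids)           # (batch, seq, emb_dim)
        _, (h_n, _) = self.encoder(emb)           # final hidden state
        return self.head(h_n[-1])                 # (batch, num_classes)

if __name__ == "__main__":
    model = UtteranceClassifier(vocab_size=5000, num_classes=4)
    dummy = torch.randint(0, 5000, (2, 12))       # two utterances of 12 tokens
    logits = model(dummy)
    loss = nn.functional.cross_entropy(logits, torch.tensor([1, 3]))
    loss.backward()
    print(logits.shape, float(loss))
```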


Multilingual Wikipedia: Editors of Primary Language Contribute to More Complex Articles

AAAI Conferences

For many people who speak more than one language, their language proficiency for each of the languages varies. We can conjecture that people who use one language (primary language) more than another would show higher language proficiency in that primary language. It is, however, difficult to observe and quantify that problem because natural language use is difficult to collect in large amounts. We identify Wikipedia as a great resource for studying multilingualism, and we conduct a quantitative analysis of the language complexity of primary and non-primary users of English, German, and Spanish. Our preliminary results indicate that there are indeed consistent differences of language complexity in the Wikipedia articles chosen by primary and non-primary users, as well as differences in the edits by the two groups of users.
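As an example of the kind of surface-level complexity measures such an analysis can compute over article text, the sketch below derives average sentence length and type-token ratio. These two metrics are assumptions chosen for illustration; the study's exact complexity measures may differ.

```python
# Hypothetical sketch of simple text-complexity measures over article text
# (average sentence length and type-token ratio); illustrative only.
import re

def complexity_measures(text: str):
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    tokens = re.findall(r"\w+", text.lower())
    avg_sentence_len = len(tokens) / max(len(sentences), 1)
    type_token_ratio = len(set(tokens)) / max(len(tokens), 1)
    return {"avg_sentence_len": avg_sentence_len,
            "type_token_ratio": type_token_ratio}

if __name__ == "__main__":
    sample = ("Wikipedia is a free online encyclopedia. "
              "Its articles are written collaboratively by volunteers.")
    print(complexity_measures(sample))
```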