AITopics | Kim, Suin

Kim, Suin

Knowledge Tracing in Programming Education Integrating Students' Questions

Kim, Doyoun, Kim, Suin, Jo, Yohan

arXiv.org Artificial Intelligence | Jan-22-2025

Knowledge tracing (KT) in programming education presents unique challenges due to the complexity of coding tasks and the diverse methods students use to solve problems. Although students' questions often contain valuable signals about their understanding and misconceptions, traditional KT models often neglect to incorporate these questions as inputs to address these challenges. This paper introduces SQKT (Students' Question-based Knowledge Tracing), a knowledge tracing model that leverages students' questions and automatically extracted skill information to enhance the accuracy of predicting students' performance on subsequent problems in programming education. Our method creates semantically rich embeddings that capture not only the surface-level content of the questions but also the student's mastery level and conceptual understanding. Experimental results demonstrate SQKT's superior performance in predicting student completion across various Python programming courses of differing difficulty levels. In in-domain experiments, SQKT achieved a 33.1% absolute improvement in AUC compared to baseline models. The model also exhibited robust generalization capabilities in cross-domain settings, effectively addressing data scarcity issues in advanced programming courses. SQKT can be used to tailor educational content to individual learning needs and design adaptive learning systems in computer science education.
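
To make the metric in the abstract above concrete, here is a minimal sketch of AUC computed as the fraction of (positive, negative) pairs that a model ranks correctly. The function name `auc` and the toy labels and scores are illustrative assumptions, not code or data from the paper. An "absolute improvement" in AUC is conventionally read as the difference between two such values rather than their ratio.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise ranking.

    labels: iterable of 0/1 outcomes (1 = student completed the problem).
    scores: predicted probabilities or scores, same length as labels.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative")

    # A pair counts fully if the positive outranks the negative,
    # and counts half if the two scores are tied.
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Toy example (made-up numbers): three of the four pairs are ordered correctly.
toy_auc = auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

The pairwise form is quadratic in the number of examples, which is fine for illustration; a sorted-rank implementation would be the usual choice for large datasets.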

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2502.10408

Genre:

Research Report > New Finding (1.00)
Instructional Material (1.00)

Industry: Education > Curriculum > Subject-Specific Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.48)

Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction

Lee, Yooseop, Kim, Suin, Jo, Yohan

arXiv.org Artificial Intelligence | Jan-21-2025

In designing multiple-choice questions (MCQs) in education, creating plausible distractors is crucial for identifying students' misconceptions and gaps in knowledge and accurately assessing their understanding. However, prior studies on distractor generation have not paid sufficient attention to enhancing the difficulty of distractors, resulting in reduced effectiveness of MCQs. This study presents a pipeline for training a model to generate distractors that are more likely to be selected by students. First, we train a pairwise ranker to reason about students' misconceptions and assess the relative plausibility of two distractors. Using this model, we create a dataset of pairwise distractor ranks and then train a distractor generator via Direct Preference Optimization (DPO) to generate more plausible distractors. Experiments on computer science subjects (Python, DB, MLDL) demonstrate that our pairwise ranker effectively identifies students' potential misunderstandings and achieves ranking accuracy comparable to human experts. Furthermore, our distractor generator outperforms several baselines in generating plausible distractors and produces questions with a higher item discrimination index (DI).
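
As a rough illustration of the pipeline in the abstract above, the sketch below turns a pairwise ranker's judgment into a (chosen, rejected) pair and evaluates the standard DPO loss for that pair. The helper names, the ranker output `prob_a_more_plausible`, and the default `beta` are hypothetical; the paper's actual training code and hyperparameters are not shown in this listing.

```python
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def to_preference_pair(distractor_a, distractor_b, prob_a_more_plausible):
    """Order two distractors as (chosen, rejected).

    prob_a_more_plausible is a hypothetical ranker output in [0, 1]:
    the estimated probability that students pick A more often than B.
    """
    if prob_a_more_plausible >= 0.5:
        return distractor_a, distractor_b
    return distractor_b, distractor_a


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one preference pair.

    Each argument is the summed log-probability of a full distractor
    under the trainable policy or the frozen reference model.
    """
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(margin)) is the same as softplus(-margin).
    return softplus(-margin)
```

When the policy matches the reference model the loss equals log 2, and it falls as the policy assigns relatively more probability to the distractor the ranker judged more plausible.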

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2501.13125

Country:

Asia (1.00)
Europe (0.67)
North America > United States (0.28)
North America > Mexico > Mexico City (0.14)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > New Finding (0.46)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Multilingual Wikipedia: Editors of Primary Language Contribute to More Complex Articles

Park, Sungjoon (Korea Advanced Institute of Science and Technology (KAIST)) | Kim, Suin (Korea Advanced Institute of Science and Technology (KAIST)) | Hale, Scott (University of Oxford) | Kim, Sooyoung (Korea Advanced Institute of Science and Technology (KAIST)) | Byun, Jeongmin (Korea Advanced Institute of Science and Technology (KAIST)) | Oh, Alice (Korea Advanced Institute of Science and Technology (KAIST))

AAAI Conferences | Apr-4-2015

For many people who speak more than one language, their language proficiency for each of the languages varies. We can conjecture that people who use one language (primary language) more than another would show higher language proficiency in that primary language. It is, however, difficult to observe and quantify that problem because natural language use is difficult to collect in large amounts. We identify Wikipedia as a great resource for studying multilingualism, and we conduct a quantitative analysis of the language complexity of primary and non-primary users of English, German, and Spanish. Our preliminary results indicate that there are indeed consistent differences of language complexity in the Wikipedia articles chosen by primary and non-primary users, as well as differences in the edits by the two groups of users.

complex article, multilingualwikipedia, primary language contribute

AAAI Conferences

Ninth International AAAI Conference on Web and Social Media

Technology:

Information Technology > Communications > Social Media (0.73)
Information Technology > Artificial Intelligence > Natural Language (0.73)

A Hierarchical Aspect-Sentiment Model for Online Reviews

Kim, Suin (KAIST) | Zhang, Jianwen (Microsoft Research Asia) | Chen, Zheng (Microsoft Research Asia) | Oh, Alice (KAIST) | Liu, Shixia (Microsoft Research Asia)

AAAI Conferences | Jul-9-2013

To help users quickly understand the major opinions from massive online reviews, it is important to automatically reveal the latent structure of the aspects, sentiment polarities, and the association between them. However, there is little work available to do this effectively. In this paper, we propose a hierarchical aspect sentiment model (HASM) to discover a hierarchical structure of aspect-based sentiments from unlabeled online reviews. In HASM, the whole structure is a tree. Each node itself is a two-level tree, whose root represents an aspect and the children represent the sentiment polarities associated with it. Each aspect or sentiment polarity is modeled as a distribution of words. To automatically extract both the structure and parameters of the tree, we use a Bayesian nonparametric model, recursive Chinese Restaurant Process (rCRP), as the prior and jointly infer the aspect-sentiment tree from the review texts. Experiments on two real datasets show that our model is comparable to two other hierarchical topic models in terms of quantitative measures of topic trees. It is also shown that our model achieves better sentence-level classification accuracy than previously proposed aspect-sentiment joint models.
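
The abstract above uses a recursive Chinese Restaurant Process (rCRP) as its prior. The sketch below shows only the ordinary, non-recursive CRP seating rule that such priors build on, to illustrate how the number of clusters can grow with the data instead of being fixed in advance. The function name `crp_assign`, the concentration parameter `alpha`, and the seed are assumptions for illustration; the recursive tree-structured variant and the paper's inference procedure are not reproduced here.

```python
import random


def crp_assign(num_customers, alpha, seed=0):
    """Sample table assignments from a plain Chinese Restaurant Process.

    Customer n (0-based) joins existing table k with probability
    count_k / (n + alpha) and opens a new table with probability
    alpha / (n + alpha). alpha must be positive.
    """
    rng = random.Random(seed)
    table_counts = []   # number of customers seated at each table
    assignments = []    # table index chosen by each customer
    for _ in range(num_customers):
        # Unnormalized weights: one per existing table, plus a new table.
        weights = table_counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(table_counts):
            table_counts.append(1)   # open a new table
        else:
            table_counts[k] += 1     # join an existing table
        assignments.append(k)
    return assignments, table_counts


assignments, table_counts = crp_assign(num_customers=50, alpha=1.0, seed=1)
```

Larger values of `alpha` make new tables more likely, so the expected number of tables grows; in a topic-model setting each table would correspond to a topic or tree node.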

artificial intelligence, bayesian inference, sentiment, (20 more...)

AAAI Conferences

Twenty-Seventh AAAI Conference on Artificial Intelligence

Country: Asia > China (0.14)

Industry: Media (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.49)

Dirichlet Process with Mixed Random Measures: A Nonparametric Topic Model for Labeled Data

Kim, Dongwoo, Kim, Suin, Oh, Alice

arXiv.org Machine Learning | Jun-18-2012

We describe a nonparametric topic model for labeled data. The model uses a mixture of random measures (MRM) as a base distribution of the Dirichlet process (DP) of the HDP framework, so we call it the DP-MRM. To model labeled data, we define a DP distributed random measure for each label, and the resulting model generates an unbounded number of topics for each label. We apply DP-MRM on single-labeled and multi-labeled corpora of documents and compare the performance on label prediction with MedLDA, LDA-SVM, and Labeled-LDA. We further enhance the model by incorporating ddCRP and modeling multi-labeled images for image segmentation and object labeling, comparing the performance with nCuts and rddCRP.

artificial intelligence, dp-mrm, natural language, (16 more...)

arXiv.org Machine Learning

1206.4658

Country:

Asia (0.14)
Europe > United Kingdom > Scotland (0.14)

Genre: Research Report (0.82)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.73)

Do You Feel What I Feel? Social Aspects of Emotions in Twitter Conversations

Kim, Suin (KAIST) | Bak, JinYeong (KAIST) | Oh, Alice Haeyun (KAIST)

AAAI Conferences | Feb-22-2012

We present a computational framework for understanding the social aspects of emotions in Twitter conversations. Using unannotated data and semi-supervised machine learning, we look at emotional transitions, emotional influences among the conversation partners, and patterns in the overall emotional exchanges. We find that conversational partners usually express the same emotion, which we name emotion accommodation, but when they do not, one of the conversational partners tends to respond with a positive emotion. We also show that tweets containing sympathy, apology, and complaint are significant emotion influencers. We verify the emotion classification part of our framework against a human-annotated corpus.

artificial intelligence, emotion, social media, (18 more...)

AAAI Conferences

Sixth International AAAI Conference on Weblogs and Social Media

Country: North America > United States (0.29)

Industry: Information Technology > Services (0.48)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.47)
