AITopics | Fernandez, Nigel

Fernandez, Nigel

Automated Knowledge Component Generation and Knowledge Tracing for Coding Problems

Duan, Zhangqi, Fernandez, Nigel, Kanakadandi, Sri, Akram, Bita, Lan, Andrew

arXiv.org Artificial Intelligence, Feb-25-2025

Knowledge components (KCs) mapped to problems help model student learning by tracking students' mastery levels on fine-grained skills, thereby facilitating personalized learning and feedback in online learning platforms. However, crafting KCs and tagging them to problems, traditionally performed by human domain experts, is highly labor-intensive. We present a fully automated, LLM-based pipeline for KC generation and tagging for open-ended programming problems. We also develop an LLM-based knowledge tracing (KT) framework to leverage these LLM-generated KCs, which we refer to as KCGen-KT. We conduct extensive quantitative and qualitative evaluations validating the effectiveness of KCGen-KT. On a real-world dataset of student code submissions to open-ended programming problems, KCGen-KT outperforms existing KT methods. We investigate the learning curves of generated KCs and show that LLM-generated KCs have a comparable level-of-fit to human-written KCs under the performance factor analysis (PFA) model. We also conduct a human evaluation to show that the KC tagging of our pipeline is reasonably accurate when compared to that of human domain experts.
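
The performance factor analysis (PFA) model named in this abstract is commonly written as a logistic model over per-KC counts of a student's prior successes and failures. The following is a minimal sketch of that standard form only. The KC names, parameter values, and the `KCParams` and `pfa_probability` names are hypothetical and do not come from the paper.

```python
import math
from dataclasses import dataclass


@dataclass
class KCParams:
    beta: float   # KC easiness (intercept)
    gamma: float  # weight on the count of prior successes
    rho: float    # weight on the count of prior failures


def pfa_probability(kcs, params, successes, failures):
    """P(correct) under the standard PFA form:
    logit = sum over tagged KCs of (beta + gamma * successes + rho * failures)."""
    logit = 0.0
    for kc in kcs:
        p = params[kc]
        logit += p.beta + p.gamma * successes.get(kc, 0) + p.rho * failures.get(kc, 0)
    return 1.0 / (1.0 + math.exp(-logit))


# Hypothetical KCs and parameter values, for illustration only.
params = {
    "for-loop": KCParams(beta=-0.5, gamma=0.3, rho=0.1),
    "string-indexing": KCParams(beta=-1.0, gamma=0.4, rho=0.05),
}
tagged = ["for-loop", "string-indexing"]

# A student with no practice history versus one with some prior attempts.
p_new = pfa_probability(tagged, params, successes={}, failures={})
p_practiced = pfa_probability(
    tagged,
    params,
    successes={"for-loop": 3, "string-indexing": 2},
    failures={"for-loop": 1},
)
```

With these made-up values, the predicted probability rises with practice because both count weights are positive. Fitting such parameters to data and comparing fit across KC sets is the kind of learning-curve analysis the abstract refers to.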

large language model, machine learning, programming problem, (21 more...)

2502.18632

Country:

North America > United States > Massachusetts (0.14)
Asia > Middle East > UAE (0.14)

Genre: Research Report > New Finding (0.46)

Industry: Education > Educational Technology > Educational Software > Computer Based Training (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Test Case-Informed Knowledge Tracing for Open-ended Coding Tasks

Duan, Zhangqi, Fernandez, Nigel, Hicks, Alexander, Lan, Andrew

arXiv.org Artificial Intelligence, Dec-20-2024

Open-ended coding tasks, which ask students to construct programs according to certain specifications, are common in computer science education. Student modeling can be challenging since the open-ended nature of these tasks means that student code can be diverse. Traditional knowledge tracing (KT) models that only analyze response correctness may not fully capture nuances in student knowledge from student code. In this paper, we introduce Test case-Informed Knowledge Tracing for Open-ended Coding (TIKTOC), a framework to simultaneously analyze and predict both open-ended student code and whether the code passes each test case. We augment the existing CodeWorkout dataset with the test cases used for a subset of the open-ended coding questions, and propose a multi-task learning KT method to simultaneously analyze and predict 1) whether a student's code submission passes each test case and 2) the student's open-ended code, using a large language model as the backbone. We quantitatively show that these methods outperform existing KT methods for coding that only use the overall score a code submission receives. We also qualitatively demonstrate how test case information, combined with open-ended code, helps us gain fine-grained insights into student knowledge.

large language model, machine learning, natural language, (19 more...)

2410.10829

Country:

North America > United States > Massachusetts (0.14)
North America > United States > California (0.14)
North America > Mexico > Mexico City (0.14)
Asia > Middle East > UAE (0.14)

Genre: Research Report (1.00)

Industry: Education > Educational Technology > Educational Software (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions

Fernandez, Nigel, Scarlatos, Alexander, Woodhead, Simon, Lan, Andrew

arXiv.org Artificial Intelligence, Jun-27-2024

High-quality distractors are crucial to both the assessment and pedagogical value of multiple-choice questions (MCQs), but manually crafting distractors that anticipate knowledge deficiencies or misconceptions among real students is difficult. Meanwhile, automated distractor generation, even with the help of large language models (LLMs), remains challenging for subjects like math. It is crucial to not only identify plausible distractors but also understand the error behind them. In this paper, we introduce DiVERT (Distractor Generation with Variational Errors Represented as Text), a novel variational approach that learns an interpretable representation of errors behind distractors in math MCQs. Through experiments on a real-world math MCQ dataset with 1,434 questions used by hundreds of thousands of students, we show that DiVERT, despite using a base open-source LLM with 7B parameters, outperforms state-of-the-art approaches using GPT-4o on downstream distractor generation. We also conduct a human evaluation with math educators and find that DiVERT leads to error labels that are of comparable quality to human-authored ones.

large language model, machine learning, natural language, (21 more...)

2406.19356

Country:

Europe (0.67)
North America > United States > Massachusetts (0.14)

Genre:

Research Report > Experimental Study (0.46)
Research Report > New Finding (0.46)
Research Report > Promising Solution (0.34)

Industry:

Education > Educational Setting (0.93)
Education > Curriculum > Subject-Specific Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Interpreting Latent Student Knowledge Representations in Programming Assignments

Fernandez, Nigel, Lan, Andrew

arXiv.org Artificial Intelligence, May-13-2024

Recent advances in artificial intelligence for education leverage generative large language models, including using them to predict open-ended student responses rather than their correctness only. However, the black-box nature of these models limits the interpretability of the learned student knowledge representations. In this paper, we conduct a first exploration into interpreting latent student knowledge representations by presenting InfoOIRT, an Information regularized Open-ended Item Response Theory model, which encourages the latent student knowledge states to be interpretable while being able to generate student-written code for open-ended programming questions. InfoOIRT maximizes the mutual information between a fixed subset of latent knowledge states enforced with simple prior distributions and generated student code, which encourages the model to learn disentangled representations of salient syntactic and semantic code features including syntactic styles, mastery of programming skills, and code structures. Through experiments on a real-world programming education dataset, we show that InfoOIRT can both accurately generate student code and lead to interpretable student knowledge representations.

artificial intelligence, machine learning, natural language, (17 more...)

2405.08213

Country:

North America > United States > Texas (0.14)
North America > United States > Massachusetts (0.14)

Genre: Research Report (0.40)

Industry: Education > Educational Technology > Educational Software (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

SyllabusQA: A Course Logistics Question Answering Dataset

Fernandez, Nigel, Scarlatos, Alexander, Lan, Andrew

arXiv.org Artificial Intelligence, Mar-2-2024

In educational applications, artificial intelligence (AI) approaches have shown significant promise in improving learning outcomes (Aleven et al., 2016; VanLehn, 2011), by automatically providing feedback to students or engaging in tutoring dialogues with them. The key idea is to use AI to create an on-demand virtual teaching assistant to interact with many students simultaneously; see, e.g., Khanmigo from Khan Academy (Academy, 2022). These approaches can scale up the effort of expert human teachers and tutors, and relieve them from doing repetitive tasks so that they can focus on providing …

Moreover, text similarity metrics may not be suitable for some open-ended natural language generation tasks (Amidei et al., 2018). As an example, the answer "The final exam will be on Dec 15" has high surface-level textual similarity with the reference answer, "The final exam is on Dec 14", but contains a critical factual error that may lead to significant negative consequences to students. Meanwhile, human instructors and teaching assistants often answer student questions in a concise way, without giving any unnecessary information. Therefore, it is important for LLM-based approaches to generate answers that are both concise and precise.
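
The exam-date example in this excerpt can be made concrete with a simple token-overlap score. The sketch below uses a plain token-level F1 as a stand-in for surface similarity metrics in general, plus a naive number comparison as a stand-in for a factuality check. Both helpers (`token_f1`, `numbers`) are hypothetical illustrations, not the metrics used in the paper.

```python
import re
from collections import Counter


def tokens(text):
    """Lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def token_f1(candidate, reference):
    """Token-level F1 based on multiset overlap between the two answers."""
    cand, ref = tokens(candidate), tokens(reference)
    overlap = sum((Counter(cand) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def numbers(text):
    """Naive fact check: extract the digit sequences mentioned in an answer."""
    return re.findall(r"\d+", text)


candidate = "The final exam will be on Dec 15"
reference = "The final exam is on Dec 14"

f1 = token_f1(candidate, reference)
facts_match = numbers(candidate) == numbers(reference)
```

Five of the seven reference tokens are matched, so the overlap score is substantial, yet `facts_match` is false because the one token that matters to a student, the date, is wrong.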

large language model, machine learning, natural language, (22 more...)

2403.14666

Country:

Europe (0.67)
North America > United States > New Mexico (0.14)
North America > United States > Massachusetts (0.14)
North America > United States > Louisiana (0.14)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.75)

3HAN: A Deep Neural Network for Fake News Detection

Singhania, Sneha, Fernandez, Nigel, Rao, Shrisha

arXiv.org Artificial Intelligence, Jun-21-2023

The rapid spread of fake news is a serious problem calling for AI solutions. We employ a deep learning-based automated detector through a three-level hierarchical attention network (3HAN) for fast, accurate detection of fake news. 3HAN has three levels, one each for words, sentences, and the headline, and constructs a news vector, an effective representation of an input news article, by processing an article in a hierarchical bottom-up manner. The headline is known to be a distinguishing feature of fake news, and furthermore, relatively few words and sentences in an article are more important than the rest. 3HAN assigns differential importance to parts of an article, on account of its three layers of attention. By experiments on a large real-world data set, we observe the effectiveness of 3HAN with an accuracy of 96.77%. Unlike some other deep learning models, 3HAN provides an understandable output through the attention weights given to different parts of an article, which can be visualized through a heatmap to enable further manual fact checking.
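
The bottom-up, three-level pooling described in this abstract can be sketched with plain attention pooling. This is a minimal illustration, not the 3HAN implementation: it omits the learned encoders a real network would use, treats the context vectors as given rather than trained, and assumes the third level attends over the headline words together with the pooled body vector. All names and toy values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def attend(vectors, context):
    """Attention pooling: score each vector by its dot product with a
    context vector, softmax the scores, return (weighted sum, weights)."""
    scores = [sum(a * b for a, b in zip(v, context)) for v in vectors]
    weights = softmax(scores)
    dim = len(vectors[0])
    pooled = [sum(w * v[d] for w, v in zip(weights, vectors)) for d in range(dim)]
    return pooled, weights


def news_vector(body, headline, ctx_word, ctx_sent, ctx_head):
    """body: list of sentences, each a list of word vectors.
    headline: list of word vectors."""
    # Level 1: words -> one vector per sentence.
    sentence_vecs = [attend(words, ctx_word)[0] for words in body]
    # Level 2: sentences -> a single body vector.
    body_vec, sent_weights = attend(sentence_vecs, ctx_sent)
    # Level 3 (assumed form): headline words plus body vector -> news vector.
    vec, head_weights = attend(headline + [body_vec], ctx_head)
    return vec, sent_weights, head_weights


# Toy 2-dimensional word vectors, for illustration only.
body = [[[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5]]]
headline = [[1.0, 1.0]]
ctx = [1.0, 0.0]
vec, sent_weights, head_weights = news_vector(body, headline, ctx, ctx, ctx)
```

The returned weights are what a heatmap would visualize: `sent_weights` has one entry per sentence, and `head_weights` has one entry per headline word plus one for the body.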

artificial intelligence, machine learning, vector, (17 more...)

doi: 10.1007/978-3-319-70096-0_59

2306.12014

Country:

North America (0.14)
Asia > India (0.14)

Genre: Research Report (0.64)

Industry: Media > News (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Improving Reading Comprehension Question Generation with Data Augmentation and Overgenerate-and-rank

Kumar, Nischal Ashok, Fernandez, Nigel, Wang, Zichao, Lan, Andrew

arXiv.org Artificial Intelligence, Jun-15-2023

Reading comprehension is a crucial skill in many aspects of education, including language learning, cognitive development, and fostering early literacy skills in children. Automated answer-aware reading comprehension question generation has significant potential to scale up learner support in educational activities. One key technical challenge in this setting is that there can be multiple questions, sometimes very different from each other, with the same answer; a trained question generation method may not necessarily know which question human educators would prefer. To address this challenge, we propose 1) a data augmentation method that enriches the training dataset with diverse questions given the same context and answer and 2) an overgenerate-and-rank method to select the best question from a pool of candidates. We evaluate our method on the FairytaleQA dataset, showing a 5% absolute improvement in ROUGE-L over the best existing method. We also demonstrate the effectiveness of our method in generating harder, "implicit" questions, where the answers are not contained in the context as text spans.
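
The overgenerate-and-rank idea in this abstract follows a generic pattern: sample a pool of candidate questions, score each one, and keep the top-ranked candidate. The sketch below shows only that pattern. The generator and scorer are toy, hypothetical stand-ins (a real system would sample from a trained question generation model and use a learned or model-based ranker), and the paper's actual ranking criteria are not reproduced here.

```python
def overgenerate_and_rank(context, answer, generate, score, pool_size=10):
    """Generate a pool of candidate questions, deduplicate them,
    and return (best candidate, all candidates ranked best-first)."""
    pool = generate(context, answer, pool_size)
    candidates = list(dict.fromkeys(pool))  # drop duplicates, keep order
    if not candidates:
        raise ValueError("generator returned no candidates")
    ranked = sorted(candidates, key=lambda q: score(q, context, answer), reverse=True)
    return ranked[0], ranked


# Hypothetical stand-in for sampling from a question generation model.
def toy_generate(context, answer, n):
    pool = [
        "Who helped the fox?",
        "Who helped the fox?",
        "Did the crow help the fox",
        "What did the crow do for the fox?",
    ]
    return pool[:n]


# Hypothetical scorer: reward well-formed questions, penalize answer leakage.
def toy_score(question, context, answer):
    points = 0
    if question.endswith("?"):
        points += 1
    if answer.lower() in question.lower():
        points -= 1
    return points


context = "The crow helped the fox cross the river."
best, ranked = overgenerate_and_rank(context, "the crow", toy_generate, toy_score)
```

Separating generation from ranking means the ranker can encode preferences, such as which question a human educator would choose, that the generator alone may not capture.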

machine learning, question answering, question generation, (15 more...)

2306.08847

Country: North America > United States > Massachusetts (0.14)

Genre: Research Report (0.82)

Industry:

Education > Assessment & Standards > Student Performance (1.00)
Education > Educational Setting (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.96)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)

Automated Scoring for Reading Comprehension via In-context BERT Tuning

Fernandez, Nigel, Ghosh, Aritra, Liu, Naiming, Wang, Zichao, Choffin, Benoît, Baraniuk, Richard, Lan, Andrew

arXiv.org Artificial Intelligence, Jun-15-2023

Automated scoring of open-ended student responses has the potential to significantly reduce human grader effort. Recent advances in automated scoring often leverage textual representations based on pre-trained language models such as BERT and GPT as input to scoring models. Most existing approaches train a separate model for each item/question, which is suitable for scenarios such as essay scoring where items can be quite different from one another. However, these approaches have two limitations: 1) they fail to leverage item linkage for scenarios such as reading comprehension where multiple items may share a reading passage; 2) they are not scalable since storing one model per item becomes difficult when models have a large number of parameters. In this paper, we report our (grand prize-winning) solution to the National Assessment of Educational Progress (NAEP) automated scoring challenge for reading comprehension. Our approach, in-context BERT fine-tuning, produces a single shared scoring model for all items with a carefully designed input structure to provide contextual information on each item. We demonstrate the effectiveness of our approach via local evaluations using the training dataset provided by the challenge. We also discuss the biases, common error types, and limitations of our approach.

bert, machine learning, natural language, (13 more...)

2205.09864

Country: North America > United States > Massachusetts (0.14)

Genre: Research Report (0.82)

Industry:

Education > Educational Technology > Educational Software > Computer-Aided Assessment (1.00)
Education > Assessment & Standards > Student Performance (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Synthesizing Tasks for Block-based Programming

Ahmed, Umair Z., Christakis, Maria, Efremov, Aleksandr, Fernandez, Nigel, Ghosh, Ahana, Roychoudhury, Abhik, Singla, Adish

arXiv.org Artificial Intelligence, Jul-1-2020

Block-based visual programming environments play a critical role in introducing computing concepts to K-12 students. One of the key pedagogical challenges in these environments is in designing new practice tasks for a student that match a desired level of difficulty and exercise specific programming concepts. In this paper, we formalize the problem of synthesizing visual programming tasks. In particular, given a reference visual task $\mathrm{T}^{\mathrm{in}}$ and its solution code $\mathrm{C}^{\mathrm{in}}$, we propose a novel methodology to automatically generate a set $\{(\mathrm{T}^{\mathrm{out}}, \mathrm{C}^{\mathrm{out}})\}$ of new tasks along with solution codes such that tasks $\mathrm{T}^{\mathrm{in}}$ and $\mathrm{T}^{\mathrm{out}}$ are conceptually similar but visually dissimilar. Our methodology is based on the realization that the mapping from the space of visual tasks to their solution codes is highly discontinuous; hence, directly mutating reference task $\mathrm{T}^{\mathrm{in}}$ to generate new tasks is futile. Our task synthesis algorithm operates by first mutating code $\mathrm{C}^{\mathrm{in}}$ to obtain a set of codes $\{\mathrm{C}^{\mathrm{out}}\}$. Then, the algorithm performs symbolic execution over a code $\mathrm{C}^{\mathrm{out}}$ to obtain a visual task $\mathrm{T}^{\mathrm{out}}$; this step uses the Monte Carlo Tree Search (MCTS) procedure to guide the search in the symbolic tree. We demonstrate the effectiveness of our algorithm through an extensive empirical evaluation and user study on reference tasks taken from the Hour of the Code: Classic Maze challenge by Code.org and the Intro to Programming with Karel course by CodeHS.com.

constraint, planning & scheduling, survey article, (17 more...)

2006.16913

Genre: Research Report > Experimental Study (0.46)

Industry: Education > Curriculum > Subject-Specific Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.34)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.34)
