Zhou, Yun
A Systematic Survey of Automatic Prompt Optimization Techniques
Ramnath, Kiran, Zhou, Kang, Guan, Sheng, Mishra, Soumya Smruti, Qi, Xuan, Shen, Zhengyuan, Wang, Shuai, Woo, Sangmin, Jeoung, Sullam, Wang, Yawei, Wang, Haozhu, Ding, Han, Lu, Yuzhe, Xu, Zhichao, Zhou, Yun, Srinivasan, Balasubramaniam, Yan, Qiaojing, Chen, Yueyan, Ding, Haibo, Xu, Panpan, Cheong, Lin Lee
Since the advent of large language models (LLMs), prompt engineering has been a crucial step for eliciting desired responses for various Natural Language Processing (NLP) tasks. However, prompt engineering remains an impediment for end users due to rapid advances in models, tasks, and associated best practices. To mitigate this, Automatic Prompt Optimization (APO) techniques have recently emerged that use various automated techniques to help improve the performance of LLMs on various tasks. In this paper, we present a comprehensive survey summarizing the current progress and remaining challenges in this field. We provide a formal definition of APO, a 5-part unifying framework, and then proceed to rigorously categorize all relevant works based on their salient features therein. We hope to spur further research guided by our framework.
Machine Learning for Raman Spectroscopy-based Cyber-Marine Fish Biochemical Composition Analysis
Zhou, Yun, Chen, Gang, Xue, Bing, Zhang, Mengjie, Rooney, Jeremy S., Lagutin, Kirill, MacKenzie, Andrew, Gordon, Keith C., Killeen, Daniel P.
The rapid and accurate detection of biochemical compositions in fish is a crucial real-world task that facilitates optimal utilization and extraction of high-value products in the seafood industry. Raman spectroscopy provides a promising solution for quickly and non-destructively analyzing the biochemical composition of fish by associating Raman spectra with biochemical reference data using machine learning regression models. This paper investigates different regression models to address this task and proposes a new design of Convolutional Neural Networks (CNNs) for jointly predicting water, protein, and lipids yield. To the best of our knowledge, we are the first to conduct a successful study employing CNNs to analyze the biochemical composition of fish based on a very small Raman spectroscopic dataset. Our approach combines a tailored CNN architecture with the comprehensive data preparation procedure, effectively mitigating the challenges posed by extreme data scarcity. The results demonstrate that our CNN can significantly outperform two state-of-the-art CNN models and multiple traditional machine learning models, paving the way for accurate and automated analysis of fish biochemical composition.
Exercise Hierarchical Feature Enhanced Knowledge Tracing
Tong, Hanshuang, Zhou, Yun, Wang, Zhen
Knowledge tracing is a fundamental task in the computer-aid educational system. In this paper, we propose a hierarchical exercise feature enhanced knowledge tracing framework, which could enhance the ability of knowledge tracing by incorporating knowledge distribution, semantic features, and difficulty features from exercise text. Extensive experiments show the high performance of our framework.
Exploring Common and Individual Characteristics of Students via Matrix Recovering
Wang, Zhen, Teng, Ben, Zhou, Yun, Tong, Hanshuang, Liu, Guangtong
Balancing group teaching and individual mentoring is an important issue in education area. The nature behind this issue is to explore common characteristics shared by multiple students and individual characteristics for each student. Biclustering methods have been proved successful for detecting meaningful patterns with the goal of driving group instructions based on students' characteristics. However, these methods ignore the individual characteristics of students as they only focus on common characteristics of students. In this article, we propose a framework to detect both group characteristics and individual characteristics of students simultaneously. We assume that the characteristics matrix of students' is composed of two parts: one is a low-rank matrix representing the common characteristics of students; the other is a sparse matrix representing individual characteristics of students. Thus, we treat the balancing issue as a matrix recovering problem. The experiment results show the effectiveness of our method. Firstly, it can detect meaningful biclusters that are comparable with the state-of-the-art biclutering algorithms. Secondly, it can identify individual characteristics for each student simultaneously. Both the source code of our algorithm and the real datasets are available upon request.
HGKT : Introducing Problem Schema with Hierarchical Exercise Graph for Knowledge Tracing
Tong, Hanshuang, Zhou, Yun, Wang, Zhen
Knowledge tracing (KT) which aims at predicting learner's knowledge mastery plays an important role in the computer-aided educational system. Given learners' exercise records, a knowledge tracing model can trace their hidden knowledge state dynamically. In recent years, many deep learning models have been applied to tackle the KT task, which has shown promising results. However, they still have limitations. Most existing methods simplify the exercising records as knowledge sequence, which fails to explore rich information existed in exercise texts. Besides, the latent hierarchical graph nature of exercises and knowledge remain unexplored. Thus, in this paper, we propose a hierarchical graph knowledge tracing model framework (HGKT) which could leverage the advantages of hierarchical exercise graph and sequence model to enhance the ability of knowledge tracing. Besides, we introduce the concept of problem schema to better represent a group of similar exercises and propose a hierarchical graph neural network to learn representations of problem schemas. Moreover, in the sequence model, we employ two attention mechanisms to highlight important historical states of students. In the testing stage, we present a K&S diagnosis matrix that could trace the transition of mastery of knowledge and problem schema, which could more easily be applied to different applications. Finally, we conduct extensive experiments to evaluate the model on a large scale real-world dataset. The results prove the effectiveness of our model and the diversity of its application scenarios.
Adversarial Generation of Real-time Feedback with Neural Networks for Simulation-based Training
Ma, Xingjun, Wijewickrema, Sudanthi, Zhou, Shuo, Zhou, Yun, Mhammedi, Zakaria, O'Leary, Stephen, Bailey, James
Simulation-based training (SBT) is gaining popularity as a low-cost and convenient training technique in a vast range of applications. However, for a SBT platform to be fully utilized as an effective training tool, it is essential that feedback on performance is provided automatically in real-time during training. It is the aim of this paper to develop an efficient and effective feedback generation method for the provision of real-time feedback in SBT. Existing methods either have low effectiveness in improving novice skills or suffer from low efficiency, resulting in their inability to be used in real-time. In this paper, we propose a neural network based method to generate feedback using the adversarial technique. The proposed method utilizes a bounded adversarial update to minimize a L1 regularized loss via back-propagation. We empirically show that the proposed method can be used to generate simple, yet effective feedback. Also, it was observed to have high effectiveness and efficiency when compared to existing methods, thus making it a promising option for real-time feedback generation in SBT.
Tackling Mental Health by Integrating Unobtrusive Multimodal Sensing
Zhou, Dawei (University of Rochester) | Luo, Jiebo (University of Rochester) | Silenzio, Vincent M.B. (University of Rochester Medical Center) | Zhou, Yun (University of Rochester) | Hu, Jile (University of Rochester) | Currier, Glenn (University of Rochester Medical Center) | Kautz, Henry (University of Rochester)
Mental illness is becoming a major plague in modern societies and poses challenges to the capacity of current public health systems worldwide. With the widespread adoption of social media and mobile devices, and rapid advances in artificial intelligence, a unique opportunity arises for tackling mental health problems. In this study, we investigate how users’ online social activities and physiological signals detected through ubiquitous sensors can be utilized in realistic scenarios for monitoring their mental health states. First, we extract a suite of multimodal time-series signals using modern computer vision and signal processing techniques, from recruited participants while they are immersed in online social media that elicit emotions and emotion transitions. Next, we use machine learning techniques to build a model that establishes the connection between mental states and the extracted multimodal signals. Finally, we validate the effectiveness of our approach using two groups of recruited subjects.