Education
Question Difficulty Prediction for READING Problems in Standard Tests
Huang, Zhenya (University of Science and Technology of China) | Liu, Qi (University of Science and Technology of China) | Chen, Enhong (University of Science and Technology of China) | Zhao, Hongke (University of Science and Technology of China) | Gao, Mingyong ( iFLYTEK Co., Ltd. ) | Wei, Si ( iFLYTEK Co., Ltd. ) | Su, Yu (Anhui University) | Hu, Guoping ( iFLYTEK Co., Ltd. )
Standard tests aim to evaluate the performance of examinees using different tests with consistent difficulties. Thus, a critical demand is to predict the difficulty of each test question before the test is conducted. Existing studies are usually based on the judgments of education experts (e.g., teachers), which may be subjective and labor intensive. In this paper, we propose a novel Test-aware Attention-based Convolutional Neural Network (TACNN) framework to automatically solve this Question Difficulty Prediction (QDP) task for READING problems (a typical problem style in English tests) in standard tests. Specifically, given the abundant historical test logs and text materials of questions, we first design a CNN-based architecture to extract sentence representations for the questions. Then, we utilize an attention strategy to qualify the difficulty contribution of each sentence to questions. Considering the incomparability of question difficulties in different tests, we propose a test-dependent pairwise strategy for training TACNN and generating the difficulty prediction value. Extensive experiments on a real-world dataset not only show the effectiveness of TACNN, but also give interpretable insights to track the attention information for questions.
Psychologically Based Virtual-Suspect for Interrogative Interview Training
Bitan, Moshe (Bar-Ilan University, Israel) | Nahari, Galit (Bar-Ilan University, Israel) | Nisin, Zvi (Israeli Police Department) | Roth, Ariel (Bar-Ilan University, Israel) | Kraus, Sarit (Bar-Ilan University, Israel)
In this paper, we present a Virtual-Suspect system which can be used to train inexperienced law enforcement personnel in interrogation strategies. The system supports different scenario configurations based on historical data. The responses presented by the Virtual-Suspect are selected based on the psychological state of the suspect, which can be configured as well. Furthermore, each interrogator's statement affects the Virtual-Suspect's current psychological state, which may lead the interrogation in different directions. In addition, the model takes into account the context in which the statements are made. Experiments with 24 subjects demonstrate that the Virtual-Suspect's behavior is similar to that of a human who plays the role of the suspect.
JAG: A Crowdsourcing Framework for Joint Assessment and Peer Grading
Labutov, Igor (Carnegie Mellon University) | Studer, Christoph (Cornell University)
Generation and evaluation of crowdsourced content is commonly treated as two separate processes, performed at different times and by two distinct groups of people: content creators and content assessors. As a result, most crowdsourcing tasks follow this template: one group of workers generates content and another group of workers evaluates it. In an educational setting, for example, content creators are traditionally students that submit open-response answers to assignments (e.g., a short answer, a circuit diagram, or a formula) and content assessors are instructors that grade these submissions. Despite the considerable success of peer-grading in massive open online courses (MOOCs), the process of test-taking and grading are still treated as two distinct tasks which typically occur at different times, and require an additional overhead of grader training and incentivization. Inspired by this problem in the context of education, we propose a general crowdsourcing framework that fuses open-response test-taking (content generation) and assessment into a single, streamlined process that appears to students in the form of an explicit test, but where everyone also acts as an implicit grader. The advantages offered by our framework include: a common incentive mechanism for both the creation and evaluation of content, and a probabilistic model that jointly models the processes of contribution and evaluation, facilitating efficient estimation of the quality of the contributions and the competency of the contributors. We demonstrate the effectiveness and limits of our framework via simulations and a real-world user study.
Efficient Stochastic Optimization for Low-Rank Distance Metric Learning
Zhang, Jie (Nanjing University) | Zhang, Lijun (Nanjing University)
Although distance metric learning has been successfully applied to many real-world applications, learning a distance metric from large-scale and high-dimensional data remains a challenging problem. Due to the PSD constraint, the computational complexity of previous algorithms per iteration is at least O ( d 2 ) where d is the dimensionality of the data.In this paper, we develop an efficient stochastic algorithm for a class of distance metric learning problems with nuclear norm regularization, referred to as low-rank DML. By utilizing the low-rank structure of the intermediate solutions and stochastic gradients, the complexity of our algorithm has a linear dependence on the dimensionality d . The key idea is to maintain all the iterates in factorized representations and construct stochastic gradients that are low-rank. In this way, the projection onto the PSD cone can be implemented efficiently by incremental SVD. Experimental results on several data sets validate the effectiveness and efficiency of our method.
The Complexity of Stable Matchings under Substitutable Preferences
Deng, Yuan (Duke University) | Panigrahi, Debmalya (Duke University) | Waggoner, Bo (University of Pennsylvania)
In various matching market settings, such as hospital-doctor matching markets (Hatfield and Milgrom 2005), the existence of stable outcomes depends on substitutability of preferences. But can these stable matchings be computed efficiently, as in the one-to-one matching case? The algorithm of (Hatfield and Milgrom 2005) requires efficient implementation of a choice function over substitutable preferences. We show that even given efficient access to a value oracle or preference relation satisfying substitutability, exponentially many queries may be required in the worst case to implement a choice function. Indeed, this extends to examples where a stable matching requires exponential time to compute. We characterize the computational complexity of stable matchings by showing that efficient computation of a choice function is equivalent to efficient verification—determining whether or not, for a given set, the most preferred subset is the entire set itself. Clearly, verification is necessary for computation, but we show that it is also sufficient: specifically, given a verifier, we design a polynomial-time algorithm for computing a choice function, implying an efficient algorithm for stable matching. We then show that a verifier can be implemented efficiently for various classes of functions, such as submodular functions, implying efficient stable matching algorithms for a broad range of settings. We also investigate the effect of ties in the preference order, which causes complications both in defining substitutes and in computation. In this case, we tightly connect the computational complexity of the choice function to a measure on the number of ties.
Optimizing Positional Scoring Rules for Rank Aggregation
Caragiannis, Ioannis (University of Patras) | Chatzigeorgiou, Xenophon (University of Patras) | Krimpas, George A. (University of Patras) | Voudouris, Alexandros A. (University of Patras)
Nowadays, several crowdsourcing projects exploit social choice methods for computing an aggregate ranking of alternatives given individual rankings provided by workers. Motivated by such systems, we consider a setting where each worker is asked to rank a fixed (small) number of alternatives and, then, a positional scoring rule is used to compute the aggregate ranking. Among the apparently infinite such rules, what is the best one to use? To answer this question, we assume that we have partial access to an underlying true ranking. Then, the important optimization problem to be solved is to compute the positional scoring rule whose outcome, when applied to the profile of individual rankings, is as close as possible to the part of the underlying true ranking we know. We study this fundamental problem from a theoretical point of view and present positive and negative complexity results. Furthermore, we complement our theoretical findings with experiments on real-world and synthetic data.
Learning without Forgetting
When building a unified vision system or gradually adding new capabilities to a system, the usual assumption is that training data for all tasks is always available. However, as the number of tasks grows, storing and retraining on such data becomes infeasible. A new problem arises where we add new capabilities to a Convolutional Neural Network (CNN), but the training data for its existing capabilities are unavailable. We propose our Learning without Forgetting method, which uses only new task data to train the network while preserving the original capabilities. Our method performs favorably compared to commonly used feature extraction and fine-tuning adaption techniques and performs similarly to multitask learning that uses original task data we assume unavailable. A more surprising observation is that Learning without Forgetting may be able to replace fine-tuning with similar old and new task datasets for improved new task performance.
Researchers apply machine learning to condensed matter physics
A machine learning algorithm designed to teach computers how to recognize photos, speech patterns, and hand-written digits has now been applied to a vastly different set of data: identifying phase transitions between states of matter. This new research, published today in Nature Physics by two Perimeter Institute researchers, was built on a simple question: could industry-standard machine learning algorithms help fuel physics research? To find out, former Perimeter Institute postdoctoral fellow Juan Cassasquilla and Roger Melko, an Associate Faculty member at Perimeter and Associate Professor at the University of Waterloo, repurposed Google's TensorFlow, an open-source software library for machine learning, and applied it to a physical system. Melko says they didn't know what to expect. "I thought it was a long shot," he admits. Using gigabytes of data representing different state configurations created using simulation software on supercomputers, Carrasquilla and Melko created a large collection of "images" to introduce into the machine learning algorithm (also known as a neural network).
Forecasting The Future And Explaining Silicon Valley's New Religions
Yuval Noah Harari might be Silicon Valley's favorite historian. His last book, Sapiens: A Brief History of Humankind, which detailed the entirety of human history and how Homo Sapiens came to dominate the Earth, was blurbed by President Barack Obama and Bill Gates, and Mark Zuckerberg recommended it for his book club. And more than 100,000 students have taken Harari's online course. In his new book, Homo Deus: A Brief History of Tomorrow, Harari looks forward and hazards a few guesses on what comes next for humanity. These next chapters in our history range from the utopian to the horrific, he says.
Machine Learning: Is exploring learning rate manually still necessary with an exponential decaying learning rate?
If we have an initial learning rate high enough and a suitable decay factor for exponentially decaying the learning rate over a certain number of epoch, is it still need for us to manually explore the learning rate? Because if all goes well I believe the learning rate can automatically be sampled over a huge range of epoch. However, if we start off with a less than optimal learning rate, assuming the loss does not diverge to infinity, would the loss be less optimal than we have started with the optimal learning rate, even if we could reach the optimal learning rate through decaying the initial learning rate over time? Does the answer differ for a convex/non-convex loss? Specifically for deep learning problems, is an exponential decaying learning rate able to sample the learning rate better than done manually?