AITopics

Joulani, Pooria, György, András, Szepesvári, Csaba

Online Learning under Delayed Feedback

arXiv.org Artificial IntelligenceJun-4-2013

Online learning with delayed feedback has received increasing attention recently due to its several applications in distributed, web-based learning problems. In this paper we provide a systematic study of the topic, and analyze the effect of delay on the regret of online learning algorithms. Somewhat surprisingly, it turns out that delay increases the regret in a multiplicative way in adversarial problems, and in an additive way in stochastic problems. We give meta-algorithms that transform, in a black-box fashion, algorithms developed for the non-delayed case into ones that can handle the presence of delays in the feedback loop. Modifications of the well-known UCB algorithm are also developed for the bandit problem with delayed feedback, with the advantage over the meta-algorithms that they can be implemented with lower complexity.

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

1306.0686

Country:

North America > United States (1.00)
Europe (0.93)
North America > Canada > Alberta (0.28)

Genre: Research Report (0.40)

Industry: Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.83)

Cesa-Bianchi, Nicolo, Dekel, Ofer, Shamir, Ohad

Online Learning with Switching Costs and Other Adaptive Adversaries

arXiv.org Machine LearningJun-1-2013

We study the power of different types of adaptive (nonoblivious) adversaries in the setting of prediction with expert advice, under both full-information and bandit feedback. We measure the player's performance using a new notion of regret, also known as policy regret, which better captures the adversary's adaptiveness to the player's behavior. In a setting where losses are allowed to drift, we characterize ---in a nearly complete manner--- the power of adaptive adversaries with bounded memories and switching costs. In particular, we show that with switching costs, the attainable rate with bandit feedback is $\widetilde{\Theta}(T^{2/3})$. Interestingly, this rate is significantly worse than the $\Theta(\sqrt{T})$ rate attainable with switching costs in the full-information case. Via a novel reduction from experts to bandits, we also show that a bounded memory adversary can force $\widetilde{\Theta}(T^{2/3})$ regret even in the full information case, proving that switching costs are easier to control than bounded memory adversaries. Our lower bounds rely on a new stochastic adversary strategy that generates loss processes with strong dependencies.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Machine Learning

1302.4387

Country: Europe (0.28)

Genre: Research Report (1.00)

Industry: Education > Educational Setting > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.47)
Information Technology > Data Science > Data Mining > Big Data (0.46)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.40)

Ross, Stephane, Mineiro, Paul, Langford, John

Normalized Online Learning

arXiv.org Machine LearningMay-28-2013

We introduce online learning algorithms which are independent of feature scales, proving regret bounds dependent on the ratio of scales existent in the data rather than the absolute scale. This has several useful effects: there is no need to pre-normalize data, the test-time and test-space complexity are reduced, and the algorithms are more robust.

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

1305.6646

Country: North America > United States (0.28)

Genre: Research Report (0.40)

Industry: Education > Educational Setting > Online (0.61)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.61)

Franklin, D. Michael (University of Tennessee) | Parker, Lynne E. (University of Tennessee, Knoxville)

Overwatch: An Educational Testbed for Multi-Robot Experimentation

Educators who wish to engage their students in multi-agent experimentation and learning need an inexpensive multi-robot system that leverages existing equipment and open-source software. This paper proposes Overwatch as an inexpensive educational tool for teaching and experimenting in multi-robot systems. The interaction of multiple agents within a single environment is an important area of study. It is vital that agents within the environment perceive other agents as intelligent, acting within the environment as cooperative teammates or as competitive members of another team. To do so, the system must meet three goals: first, to allow multiple robots to communicate and coordinate; second, to localize within a shared global coordinate system; third, to recognize their teammates and other teams. The cost and scale of such experimental platforms places them outside the reach of many educational institutions or limits the number of agents that are interacting within the system \cite{Liu201160}. The goal of Overwatch is to create an experimental platform for multi-agent systems that is comprised of much smaller, albeit less capable, robots, many of which are prevalent in academic institutions already. Making use of available open-source libraries and utilizing lower cost robots, such as Scribblers, allows for experiments with many agents. This enables Overwatch to fit into the budget limitations of an academic setting. The Overwatch platform provides the Scribblers with global localization capabilities. This paper presents the system in detail and includes experiments to show its ability to localize, interact with other agents, and coordinate behaviors with these other agents. Additionally, the details to setup this system are also included.

educational testbed, multi-robot experimentation, overwatch

Industry: Education (0.53)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Does Size Matter? Investigating User Input at a Larger Bandwidth

Varner, Laura Kristen (Arizona State University) | Jackson, G. Tanner (Arizona State University) | Snow, Erica L. (Arizona State University) | McNamara, Danielle S. (Arizona State University)

This study expands upon an existing model of students’ reading comprehension ability within an intelligent tutoring system. The current system evaluates students’ natural language input using a local student model. We examine the potential to expand this model by assessing the linguistic features of self-explanations aggregated across entire passages. We assessed the relationship between 126 students’ reading comprehension ability and the cohesion of their aggregated self-explanations with three linguistic features. Results indicated that the three cohesion indices accounted for variance in reading ability over and above the features used in the current algorithm. These results demonstrate that broadening the window of NLP analyses can strengthen student models within ITSs.

larger bandwidth, size matter, user input

Industry: Education > Educational Technology > Educational Software > Computer Based Training (0.53)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.53)

Applying Clustering to the Problem of Predicting Retention within an ITS: Comparing Regularity Clustering with Traditional Methods

Song, Fei (Worcester Polytechnic Institute) | Trivedi, Shubhendu (TTI Chicago ) | Wang, Yutao (Worcester Polytechnic Institute) | Sarkozy, Gabor (Worcester Polytechnic Institute) | Heffernan, Neil (Worcester Polytechnic Institute)

In student modeling, the concept of "mastery learning" i.e. that a student continues to learn a skill till mastery is attained is important. Usually, mastery is defined in terms of most recent student performance. This is also the case with models such as Knowledge Tracing which estimate knowledge solely based on patterns of questions a student gets correct and the task usually is to predict immediate next action of the student. In retrospect however, it is not clear if this is a good definition of mastery since it is perhaps more useful to focus more on student retention over a longer period of time. This paper improves a recently introduced model by Wang and Beck that predicts long term student performance by clustering the students and generating multiple predictions by using a recently developed ensemble technique. Another contribution is that we introduce a novel clustering algorithm we call "Regularity Clustering" and show that it is superior in the task of predicting student retention over more popular techniques such as k-means and Spectral Clustering.

clustering, regularity clustering, traditional method, (1 more...)

Industry: Education (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.53)

Snow, Erica Linn (Learning Sciences Institute, Arizona State University) | Jackson, G. Tanner (Learning Sciences Institute, Arizona State University) | Varner, Laura K (Learning Sciences Institute, Arizona State University) | McNamara, Danielle S (Learning Sciences Institute, Arizona State University)

The Impact of Performance Orientation on Students’ Interactions and Achievements in an ITS

Research on individual differences indicates that students vary in how they interact with and perform while using intelligent tutoring systems (ITSs). However, less research has investigated how individual differences affect students’ interactions with game-based features. This study examines how learning outcomes and interactions with specific game-based features (off-task personalization vs. on-task mini games) within a game-based ITS, iSTART-ME, vary as a function of students’ performance orientation. The current study (n=40) is part of a larger study (n=126) conducted with high school students. The analyses in this study focus on those students assigned to iSTART-ME. Results indicate that students with higher levels of performance orientation perform better during training, progress further within the system, and interact less frequently with off-task game-based features. These results provide further evidence that individual differences play an important role in influencing students’ interactions and achievement within learning environments.

interaction and achievement, performance orientation

Genre: Research Report (0.53)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.53)
Education > Educational Setting > K-12 Education > Secondary School (0.53)

Technology: Information Technology > Artificial Intelligence (0.53)

Added Teacher-Created Motiational Video to an ITS

Many intelligent tutoring system (ITS) researchers are looking at ways to detect and to respond to student emotional states (for instance animated pedagogical agents that mirror student emotion). Such interventions are complicated to build, and do not take advantage of the potential for teachers to be part of the process. We present two studies that intervene when a student is having trouble by presenting the student with a YouTube video that is recorded by their own teacher and that delivers a motivational message to help them to persist with the learning session. We experimentally compared two different motivational interventions, which are both grounded in the literature on student affect and motivation. We also had a control condition that had no video. We found that when looking at students’ self-reports on the value of mathematics, we found a main effect of condition for the value-video. In Study 2 we examined whether these 60-second videos could impact homework completion rates and found that in fact homework completion rates were higher for students in the value-video condition. The present research is suggestive of a somewhat novel use of teacher-generated content that could easily be incorporated into other ITSs.

teacher-created motiational video

Industry: Education > Educational Technology > Educational Software > Computer Based Training (0.53)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.53)