AITopics | Instructional Material

Collaborating Authors

Instructional Material

STEP: A Scalable Testing and Evaluation Platform

Christoforaki, Maria (New York University) | Ipeirotis, Panagiotis (New York University)

AAAI ConferencesOct-31-2014

The emergence of online crowdsourcing sites, online work platforms, and evenMassive Open Online Courses (MOOCs), has created an increasing need for reliably evaluating the skills of the participating users in a scalable way.Many platforms already allow users to take online tests and verify their skills, but the existing approaches face many problems. First of all, cheating is very common in online testing without supervision, as the test questions often "leak" and become easily available online together with the answers.Second, technical skills, such as programming, require the tests to be frequently updated in order to reflect the current state-of-the-art. Third,there is very limited evaluation of the tests themselves, and how effectively they measure the skill that the users are tested for. In this paper, we present a Scalable Testing and Evaluation Platform (STEP),that allows continuous generation and evaluation of test questions. STEP leverages already available content, on Question Answering sites such as StackOverflow and re-purposes these questions to generate tests. The system utilizes a crowdsourcing component for the editing of the questions, while it uses automated techniques for identifying promising QA threads that can be successfully re-purposed for testing. This continuous question generation decreases the impact of cheating and also creates questions that are closer to the real problems that the skill holder is expected to solve in real life.STEP also leverages the use of Item Response Theory to evaluate the quality of the questions. We also use external signals about the quality of the workers.These identify the questions that have the strongest predictive ability in distinguishing workers that have the potential to succeed in the online job marketplaces. Existing approaches contrast in using only internal consistency metrics to evaluate the questions. Finally, our system employs an automatic "leakage detector" that queries the Internet to identify leaked versions of our questions. We then mark these questions as "practice only," effectively removing them from the pool of questions used for evaluation. Our experimental evaluation shows that our system generates questions of comparable or higher quality compared to existing tests, with a cost of approximately 3-5 dollars per question, which is lower than the cost of licensing questions from existing test banks.

natural language, question answering, test question, (18 more...)

AAAI Conferences

Second AAAI Conference on Human Computation and Crowdsourcing

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Instructional Material > Online (0.48)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.86)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.68)
Information Technology > Communications > Social Media > Crowdsourcing (0.55)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.54)

Add feedback

Preface

Bigham, Jeffrey P. (Carnegie Mellon University) | Parkes, David C. (Harvard University)

AAAI ConferencesOct-31-2014

Welcome to the Second AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2014) held November 2-4, 2014, in Pittsburgh, Pennsylvania. This conference is an opportunity to build on the success of the First AAAI Human Computation and Crowdsourcing conference, and to promote the best scholarship in this vibrant and fast emerging, multidisciplinary area. The conference also comes on the heels of four HCOMP workshops, including two workshops hosted at the annual AAAI conference. The HCOMP conference is designed to be a venue for exchanging ideas and developments on principles, experiments, and implementations of systems that rely on programmatic access to human intellect to perform some aspect of computation, or where human perception, knowledge, reasoning, or coordinated activity contributes to the operation of larger systems and applications. Topics relevant to the discipline of human computation and crowdsourcing include human-computer interaction (HCI), computer-supported collaborative work (CSCW), cognitive psychology, organizational behavior, economics, information retrieval, databases, computer systems and programming languages, and optimization.

artificial intelligence, human computer interaction, social media, (17 more...)

AAAI Conferences

Second AAAI Conference on Human Computation and Crowdsourcing

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.25)
North America > United States > Texas > Travis County > Austin (0.06)
North America > United States > Minnesota (0.05)
North America > United States > Massachusetts (0.05)

Genre: Instructional Material (0.51)

Industry:

Education > Educational Setting > Online (0.74)
Education > Educational Technology > Educational Software > Computer Based Training (0.31)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Communications > Social Media (0.87)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.32)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.31)

Add feedback

A Comparison of learning algorithms on the Arcade Learning Environment

Defazio, Aaron, Graepel, Thore

arXiv.org Artificial IntelligenceOct-30-2014

Reinforcement learning agents have traditionally been evaluated on small toy problems. With advances in computing power and the advent of the Arcade Learning Environment, it is now possible to evaluate algorithms on diverse and difficult problems within a consistent framework. We discuss some challenges posed by the arcade learning environment which do not manifest in simpler environments. We then provide a comparison of model-free, linear learning algorithms on this challenging problem set.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

1410.862

Genre:

Research Report (0.50)
Instructional Material (0.34)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Education (1.00)
Leisure & Entertainment > Sports (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Learning-Assisted Automated Reasoning with Flyspeck

Kaliszyk, Cezary, Urban, Josef

arXiv.org Artificial IntelligenceOct-26-2014

The considerable mathematical knowledge encoded by the Flyspeck project is combined with external automated theorem provers (ATPs) and machine-learning premise selection methods trained on the proofs, producing an AI system capable of answering a wide range of mathematical queries automatically. The performance of this architecture is evaluated in a bootstrapping scenario emulating the development of Flyspeck from axioms to the last theorem, each time using only the previous theorems and proofs. It is shown that 39% of the 14185 theorems could be proved in a push-button mode (without any high-level advice and user interaction) in 30 seconds of real time on a fourteen-CPU workstation. The necessary work involves: (i) an implementation of sound translations of the HOL Light logic to ATP formalisms: untyped first-order, polymorphic typed first-order, and typed higher-order, (ii) export of the dependency information from HOL Light and ATP proofs for the machine learners, and (iii) choice of suitable representations and methods for learning from previous proofs, and their integration as advisors with HOL Light. This work is described and discussed here, and an initial analysis of the body of proofs that were found fully automatically is provided.

dependency, experiment, theorem, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s10817-014-9303-3

1211.7012

Country:

Asia > Middle East > Jordan (0.04)
Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.04)
South America > Venezuela > Mérida State > Merida (0.04)
(13 more...)

Genre:

Research Report (0.50)
Instructional Material > Course Syllabus & Notes (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Add feedback

AAAI News

Hamilton, Carol (AAAI)

AI MagazineSep-29-2014

Participants Intelligence (AAAI-15) and the Twenty-Seventh Conference in the AAAI-15 Robotics Exhibition and the on Innovative Applications of Artificial Intelligence AAAI-15 Video Competition are encouraged to contribute (IAAI-15) will be held January 25-29 at the to the Demonstration Program with their systems, Hyatt Regency Austin in Austin, Texas, USA. AAAI is working October 8 (Papers Due) closely with the local AI community to create opportunities The Senior Member Track provides an opportunity for attendees to experience AI in Texas! Attendees for established researchers in the AI community to can also enjoy nearly 200 music venues that feature give a broad talk on a well-developed body of everything from rock and blues to country and research, an important new research area, or a promising jazz every night of the week. Austin cuisine has new topic. This year, new "Blue Sky Ideas" track expanded from barbecue and Tex-Mex to award-winning is seeking presentations aimed at presenting ideas and inventive international cuisine, and blossomed and visions that can stimulate the research community beyond brick-and-mortar restaurants to a to pursue new directions, such as new problems, vibrant, citywide food truck movement.

artificial intelligence, machine learning, natural language, (17 more...)

AI Magazine

Country:

North America > United States > California (1.00)
North America > United States > Texas > Travis County > Austin (0.34)

Genre:

Personal > Honors (1.00)
Instructional Material (0.68)

Industry:

Information Technology (0.94)
Education > Educational Setting > Online (0.93)
Leisure & Entertainment > Games (0.93)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
(3 more...)

Add feedback

Leveraging AI Teaching in the Cloud for AI Teaching on Campus

Fisher, Douglas H. (Vanderbilt University)

AI MagazineSep-29-2014

The Educational Advances in Artificial Intelligence column discusses and shares innovative educational approaches that teach or leverage AI and its many subfields at all levels of education (K-12, undergraduate, and graduate levels). I credit these positive changes to the active in-class learning and a new enthusiasm for teaching, as well as the first-rate lectures by Stanford professors Jennifer Wisdom and Andrew Ng. I was showed that students liked this SPOC format, although pleased when students, enrolled in Introduction to there were suggestions for better in-class and Artificial Intelligence Class MOOC CS188x at the MOOC-content coordination. Had I tweaked my University of California, Berkeley, came to my channel course and continued along this path, I might have for remediation, taking word back to the MOOC's achieved phenominal success, but sadly I left the discussion forum. I required students in my graduate SPOC format behind.

artificial intelligence, mooc, student, (16 more...)

AI Magazine

Country: North America > United States > California > Alameda County > Berkeley (0.25)

Genre: Instructional Material > Online (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Statistical Estimation: From Denoising to Sparse Regression and Hidden Cliques

Tramel, Eric W., Kumar, Santhosh, Giurgiu, Andrei, Montanari, Andrea

arXiv.org Machine LearningSep-19-2014

These notes review six lectures given by Prof. Andrea Montanari on the topic of statistical estimation for linear models. The first two lectures cover the principles of signal recovery from linear measurements in terms of minimax risk. Subsequent lectures demonstrate the application of these principles to several practical problems in science and engineering. Specifically, these topics include denoising of error-laden signals, recovery of compressively sensed signals, reconstruction of low-rank matrices, and also the discovery of hidden cliques within large networks. These are notes from the lecture of Andrea Montanari given at the autumn school "Statistical Physics, Optimization, Inference, and Message-Passing Algorithms", that took place in Les Houches, France from Monday September 30th, 2013, till Friday October 11th, 2013.

artificial intelligence, data quality, machine learning, (20 more...)

arXiv.org Machine Learning

1409.5557

Country:

Europe (0.88)
North America > United States (0.67)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Data Science > Data Quality > Data Transformation (0.46)

Add feedback

$OntoMath^{PRO}$ Ontology: A Linked Data Hub for Mathematics

Nevzorova, Olga, Zhiltsov, Nikita, Kirillovich, Alexander, Lipachev, Evgeny

arXiv.org Artificial IntelligenceAug-11-2014

In this paper, we present an ontology of mathematical knowledge concepts that covers a wide range of the fields of mathematics and introduces a balanced representation between comprehensive and sensible models. We demonstrate the applications of this representation in information extraction, semantic search, and education. We argue that the ontology can be a core of future integration of math-aware data sets in the Web of Data and, therefore, provide mappings onto relevant datasets, such as DBpedia and ScienceWISE.

artificial intelligence, information management, natural language, (17 more...)

arXiv.org Artificial Intelligence

1407.4833

Country: Europe > Russia (0.46)

Genre:

Instructional Material > Course Syllabus & Notes (0.47)
Research Report (0.40)

Industry: Education > Educational Setting > Higher Education (0.46)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Communications > Web (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Acquiring Commonsense Knowledge for Sentiment Analysis through Human Computation

Boia, Marina (École Polytechnique Fédérale de Lausanne) | Musat, Claudiu Cristian (École Polytechnique Fédérale de Lausanne) | Faltings, Boi (École Polytechnique Fédérale de Lausanne)

AAAI ConferencesJul-14-2014

Many Artificial Intelligence tasks need large amounts of commonsense knowledge. Because obtaining this knowledge through machine learning would require a huge amount of data, a better alternative is to elicit it from people through human computation. We consider the sentiment classification task, where knowledge about the contexts that impact word polarities is crucial, but hard to acquire from data. We describe a novel task design that allows us to crowdsource this knowledge through Amazon Mechanical Turk with high quality. We show that the commonsense knowledge acquired in this way dramatically improves the performance of established sentiment classification methods.

lexicon, polarity, proceedings, (14 more...)

AAAI Conferences

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country: Europe > Switzerland > Vaud > Lausanne (0.04)

Genre: Instructional Material (0.68)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Add feedback

Tree-Based On-Line Reinforcement Learning

Barreto, Andre M. S. (Brazilian National Laboratory for Scientific Computing (LNCC))

AAAI ConferencesJul-14-2014

Fitted Q-iteration (FQI) stands out among reinforcement learning algorithms for its flexibility and ease of use. FQI can be combined with any regression method, and this choice determines the algorithm's statistical and computational properties. The combination of FQI with an ensemble of regression trees gives rise to an algorithm, FQIT, that is computationally efficient, scalable to high dimensional spaces, and robust to noise. Despite its nice properties and good performance in practice, FQIT also has some limitations: the fact that an ensemble of trees must be constructed (or updated) at each iteration confines the algorithm to the batch scenario. This paper aims to address this specific issue. Based on a strategy recently proposed in the literature, called the stochastic-factorization trick, we propose a modification of FQIT that makes it fully incremental, and thus suitable for on-line learning. We call the resulting method tree-based stochastic factorization (TBSF). We derive upper bounds for the difference between the value functions computed by FQIT and TBSF, and also show in which circumstances the approximations coincide. A series of computational experiments is presented to illustrate the properties of TBSF and to show its usefulness in practice, including a medical problem involving the treatment of patients infected with HIV.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

AAAI Conferences

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country: South America > Brazil (0.04)

Genre: Instructional Material > Online (0.34)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback