Instructional Material
MOOCs Meet Measurement Theory: A Topic-Modelling Approach
He, Jiazhen (The University of Melbourne) | Rubinstein, Benjamin I. P. (The University of Melbourne) | Bailey, James (The University of Melbourne) | Zhang, Rui (The University of Melbourne) | Milligan, Sandra (The University of Melbourne) | Chan, Jeffrey (RMIT University)
This paper adapts topic models to the psychometric testing of MOOC students based on their online forum postings. Measurement theory from education and psychology provides statistical models for quantifying a person's attainment of intangible attributes such as attitudes, abilities or intelligence. Such models infer latent skill levels by relating them to individuals' observed responses on a series of items such as quiz questions. The set of items can be used to measure a latent skill if individuals' responses on them conform to a Guttman scale. Such well-scaled items differentiate between individuals and inferred levels span the entire range from most basic to the advanced. In practice, education researchers manually devise items (quiz questions) while optimising well-scaled conformance. Due to the costly nature and expert requirements of this process, psychometric testing has found limited use in everyday teaching. We aim to develop usable measurement models for highly-instrumented MOOC delivery platforms, by using participation in automatically-extracted online forum topics as items. The challenge is to formalise the Guttman scale educational constraint and incorporate it into topic models. To favour topics that automatically conform to a Guttman scale, we introduce a novel regularisation into non-negative matrix factorisation-based topic modelling. We demonstrate the suitability of our approach with both quantitative experiments on three Coursera MOOCs, and with a qualitative survey of topic interpretability on two MOOCs by domain expert interviews.
Strategyproof Peer Selection: Mechanisms, Analyses, and Experiments
Aziz, Haris (Data61 and University of New South Wales) | Lev, Omer (University of Toronto) | Mattei, Nicholas (Data61 and University of New South Wales) | Rosenschein, Jeffrey S. (The Hebrew University of Jerusalem) | Walsh, Toby (Data61 and University of New South Wales)
We study an important crowdsourcing setting where agents evaluate one another and, based on these evaluations, a subset of agents are selected. This setting is ubiquitous when peer review is used for distributing awards in a team, allocating funding to scientists, and selecting publications for conferences. The fundamental challenge when applying crowdsourcing in these settings is that agents may misreport their reviews of others to increase their chances of being selected. We propose a new strategyproof (impartial) mechanism called Dollar Partition that satisfies desirable axiomatic properties. We then show, using a detailed experiment with parameter values derived from target real world domains, that our mechanism performs better on average, and in the worst case, than other strategyproof mechanisms in the literature.
Python Machine Learning Blueprints
Machine learning is becoming increasingly pervasive in the modern data-driven world. It is used extensively in many fields such as search engines, robotics, self-driving cars, and more. Through this book, you will learn how to perform various machine learning tasks in a range of environments. If you want to develop machine learning applications or implement machine learning in existing systems, Python is an excellent language to do so. Machine learning with Python is currently the most used standard to perform machine learning and is a great alternative for developers because of its wide selection of libraries and developer-friendly ecosystem.
Singularity University: meet the people who are building our future
It's day one at the Singularity University: the opening address has just been delivered by a hologram. Craig Venter, who was one of the first scientists to sequence the human genome and created the first synthetic life form, is up next. And later, we will see two people, paralysed from the waist down, use robotic exoskeletons to rise up and walk. But first, the co-founder of the Singularity University, Peter Diamandis, gives us our instructions for the day. Your task, he says, is to pick one of the "grand challenges of humanity" โ the lack of clean drinking water, say. And then come up with an idea that "can positively impact the lives of a billion people". Some of us haven't even had coffee yet. There's about 50 of us present and the room has been divided up into tables, one for education, another for poverty, another for water, and I'm not sure where I should sit. Diane Murphy, the university's PR executive, hesitates for a moment and then directs me over to the table marked "food". "Tell you what," she says.
My Philosophy On Teaching Robotics
This semester, as a reflective practice and an opportunity to continue my hobby (filmmaking), I decided to create a weekly video log (or vlog) of what has been happening in my middle school robotics class. The following was episode 4 of the series aptly entitled'Middle School Robotics.' This video was a bit different than the others as I sort of gave an overview of the course, my philosophy behind it, and the practices that I've seen bear the most fruit in the class. Most of my project resources were taken from this website: http://ev3lessons.com/lessons.html
How to program the Best Fit Slope - Practical Machine Learning Tutorial with Python p.8
Welcome to the 8th part of our machine learning regression tutorial within our Machine Learning with Python tutorial series. Where we left off, we had just realized that we needed to replicate some non-trivial algorithms into Python code in an attempt to calculate a best-fit line for a given dataset. Before we embark on that, why are we going to bother with all of this? Linear Regression is basically the brick to the machine learning building. It is used in almost every single major machine learning algorithm, so an understanding of it will help you to get the foundation for most major machine learning algorithms. For the enthusiastic among us, understanding linear regression and general linear algebra is the first step towards writing your own custom machine learning algorithms and branching out into the bleeding edge of machine learning, using what ever the best processing is at the time.
IBM Plans Cognitive Computing Research Center with University of Illinois
In keeping with its vision of an era of cognitive computing enabled by acceleration technology, IBM Research (NYSE: IBM) today announced plans for a multi-year collaboration with the University of Illinois Urbana-Champaign to create the Center for Cognitive Computing Systems Research (C3SR) which will be housed within the College of Engineering on the Urbana campus. IBM has big ambitions for the center: "C3SR will build and optimize integrated systems such as state-of-the-art cognitive computing systems modeled on IBM's Watson technology that can master a subject area by learning from multimedia and multi-modal educational content. Such systems will efficiently ingest vast amounts of data including videos, lecture notes, homework, and textbooks, and reason through this knowledge effectively enough to be able to eventually pass a college level exam." Many details are yet to be worked out. The level of funding and size of installation will be announced this summer when the new center formally opens, said Hillery Hunter, a project driver and the director for systems acceleration and memory at IBM Research.
Weekend Reading List: Free eBooks and Other Online Resources
Time to get away from it all, enjoy our families, friends, and free time... and read up on the latest in data science, machine learning, and analytics. For those of us who can't completely disconnect, or are otherwise interested in reading up over the weekend, the following is a roundup of some of the best free recent ebooks and other online reading resources, as well as a classic throwback article worthy of the attention of newcomers to the field of machine learning. As reported earlier this week, the MIT Press Deep Learning book is finished, and the online version has been finalized. Written by deep learning heavyweights Ian Goodfellow, Yoshua Bengio, and Aaron Courville, the book is poised to become the deep learning book on the market. At over 700 pages, and being quite technical in content, this isn't a simple one-weekend read (at least, not for the majority of folks), but getting started this weekend means only a few more needed.
Regression How it Works - Practical Machine Learning Tutorial with Python p.7
Welcome to the seventh part of our machine learning regression tutorial within our Machine Learning with Python tutorial series. Up to this point, you have been shown the value of linear regression and how to apply it with Scikit Learn and Python, now we're going to dive into how it is calculated. While I do not believe it is necessary to dig into all of the math that goes into every machine learning algorithm (have you dug into the source code of your other favorite modules to see how they do every little thing?), linear algebra is essential to machine learning, and it is useful to understand the true building blocks that machine learning is built upon. The objective of linear algebra is to calculate relationships of points in vector space. This is used for a variety of things, but one day, someone got the wild idea to do this with features of a dataset.
Recommender Systems: New Comprehensive Textbook by Charu Aggarwal
This book covers the topic of recommender systems comprehensively, starting with the fundamentals and then exploring the advanced topics. Algorithms and evaluation: These chapters discuss the fundamental algorithms in recommender systems, including collaborative filtering methods, content-based methods, knowledge-based methods, ensemble-based methods, and evaluation. Recommendations in specific domains and contexts: The context of a recommendation can be viewed as important side information that affects the recommendation goals. Different types of context such as temporal data, spatial data, social data, tagging data, and trustworthiness are explored. Advanced topics and applications: Various robustness aspects of recommender systems, such as shilling systems, attack models, and their defenses are discussed.