Goto

Collaborating Authors

 Instructional Material



Continual egocentric object recognition

arXiv.org Machine Learning

We are interested in the problem of continual object recognition in a setting which resembles that under which humans see and learn. This problem is of high relevance in all those applications where an agent must work collaboratively with a human in the same setting (e.g., personal assistance). The main innovative aspects of this setting with respect to the state-of-the-art are: it assumes an egocentric point-of-view bound to a single person, which implies a relatively low diversity of data and a cold start with no data; it requires to operate in a open world, where new objects can be encountered at any time; supervision is scarce and has to be solicited to the user, and completely unsupervised recognition of new objects should be possible. Note that this setting differs from the one addressed in the open world recognition literature, where supervised feedback is always requested to be able to incorporate new objects. We propose an incremental approach which is based on four main features: the use of time and space persistency (i.e., the appearance of objects changes relatively slowly), the use of similarity as the main driving principle for object recognition and novelty detection, the progressive introduction of new objects in a developmental fashion and the selective elicitation of user feedback in an online active learning fashion. Experimental results show the feasibility of open world, generic object recognition, the ability to recognize, memorize and re-identify new objects even in complete absence of user supervision, and the utility of persistency and incrementality in boosting performance.


EdNet: A Large-Scale Hierarchical Dataset in Education

arXiv.org Artificial Intelligence

With advances in Artificial Intelligence in Education (AIEd) and the ever-growing scale of Interactive Educational Systems (IESs), data-driven approach has become a common recipe for various tasks such as knowledge tracing and learning path recommendation. Unfortunately, collecting real students' interaction data is often challenging, which results in the lack of public large-scale benchmark dataset reflecting a wide variety of student behaviors in modern IESs. Although several datasets, such as ASSISTments, Junyi Academy, Synthetic and STATICS, are publicly available and widely used, they are not large enough to leverage the full potential of state-of-the-art data-driven models and limits the recorded behaviors to question-solving activities. To this end, we introduce EdNet, a large-scale hierarchical dataset of diverse student activities collected by Santa, a multi-platform self-study solution equipped with artificial intelligence tutoring system. EdNet contains 131,441,538 interactions from 784,309 students collected over more than 2 years, which is the largest among the ITS datasets released to the public so far. Unlike existing datasets, EdNet provides a wide variety of student actions ranging from question-solving to lecture consumption and item purchasing. Also, EdNet has a hierarchical structure where the student actions are divided into 4 different levels of abstractions. The features of EdNet are domain-agnostic, allowing EdNet to be extended to different domains easily. The dataset is publicly released under Creative Commons Attribution-NonCommercial 4.0 International license for research purposes. We plan to host challenges in multiple AIEd tasks with EdNet to provide a common ground for the fair comparison between different state of the art models and encourage the development of practical and effective methods.


DSC Webinar Series: 20 Predictions for 2020 from AI to Data Management

#artificialintelligence

AI, machine learning, cloud, self-service, data governance, etcโ€ฆthere is no shortage of buzzwords in data today. Every organization is seeking to outpace their competition by leveraging data to drive differentiation for their business. To win this race, companies are building up data science teams, investing in faster/more scalable cloud data platforms and utilizing the growing variety of publicly available datasets and algorithms. How do you stay ahead of what's next and help drive the successful adoption of new technology and processes within your organization? This latest Data Science Central webinar will be interactive and will review where we think data management, analytics and ML/AI are headed next.


How to Use Out-of-Fold Predictions in Machine Learning

#artificialintelligence

Machine learning algorithms are typically evaluated using resampling techniques such as k-fold cross-validation. During the k-fold cross-validation process, predictions are made on test sets comprised of data not used to train the model. These predictions are referred to as out-of-fold predictions, a type of out-of-sample predictions. Out-of-fold predictions play an important role in machine learning in both estimating the performance of a model when making predictions on new data in the future, so-called the generalization performance of the model, and in the development of ensemble models. In this tutorial, you will discover a gentle introduction to out-of-fold predictions in machine learning. How to Use Out-of-Fold Predictions in Machine Learning Photos by Gael Varoquaux, some rights reserved.


Customer Churn Modeling using Machine Learning with parsnip

#artificialintelligence

This article comes from Diego Usai, a student in Business Science University. Diego has completed both 101 (Data Science Foundations) and 201 (Advanced Machine Learning & Business Consulting) courses. Diego shows off his progress in this Customer Churn Tutorial using Machine Learning with parsnip. Diego originally posted the article on his personal website, diegousai.io, Recently I have completed the online course Business Analysis With R focused on applied data and business science with R, which introduced me to a couple of new modelling concepts and approaches.


Beginning Machine Learning with TensorFlow.js

#artificialintelligence

When does the course start and finish? The course starts now and never ends! It is a completely self-paced online course - you decide when you start and when you finish. That said, most students will need about three weeks to complete the course materials. How long do I have access to the course?


Popular Deep Learning Courses of 2019 - KDnuggets

#artificialintelligence

Deep Learning is gaining more momentum and notoriety among the data science generation of this decade. A few years ago, it was not as mainstream as Machine Learning techniques, such as Logistic Regression and Random Forest for example. Nowadays, it is all about Neural Networks, Activation Functions, Multiple Layers, Drop-out, etc. There is good reason for this one, which is simply, Deep Learning has shown to perform better than Machine Learning algorithms at times. The following courses are famous among peers for knowledge on the new wave of Deep Learning and AI.



3 Machine Learning Secrets to Achieving Impossible Config Data Insights

#artificialintelligence

It's here right now and isn't it refreshing to learn how you can start using it to your advantage TODAY? This technical webinar is a practical view of how machine learning assists with the management of config data. Take home these practical tips that will allow you to gain insights into the speed and quality of your current software engineering practices - releasing you from the shackles of bad config data. Join our US Technical Director, Joe Offenberg, to hear real customer stories of how multinational banks and global telcos are using machine learning to gain a single pane of glass view of consolidated config data. You'll see first hand how machine learning has positively impacted more rigorous approaches to secrets management, RBAC and transparency of audits between Development and Operations.