Goto

Collaborating Authors

 Instructional Material


SCALE: Online Self-Supervised Lifelong Learning without Prior Knowledge

arXiv.org Artificial Intelligence

Unsupervised lifelong learning refers to the ability to learn over time while memorizing previous patterns without supervision. Although great progress has been made in this direction, existing work often assumes strong prior knowledge about the incoming data (e.g., knowing the class boundaries), which can be impossible to obtain in complex and unpredictable environments. In this paper, motivated by real-world scenarios, we propose a more practical problem setting called online self-supervised lifelong learning without prior knowledge. The proposed setting is challenging due to the non-iid and single-pass data, the absence of external supervision, and no prior knowledge. To address the challenges, we propose Self-Supervised ContrAstive Lifelong LEarning without Prior Knowledge (SCALE) which can extract and memorize representations on the fly purely from the data continuum. SCALE is designed around three major components: a pseudo-supervised contrastive loss, a self-supervised forgetting loss, and an online memory update for uniform subset selection. All three components are designed to work collaboratively to maximize learning performance. We perform comprehensive experiments of SCALE under iid and four non-iid data streams. The results show that SCALE outperforms the state-of-the-art algorithm in all settings with improvements up to 3.83%, 2.77% and 5.86% in terms of kNN accuracy on CIFAR-10, CIFAR-100, and TinyImageNet datasets.


PCR: Proxy-based Contrastive Replay for Online Class-Incremental Continual Learning

arXiv.org Artificial Intelligence

Online class-incremental continual learning is a specific task of continual learning. It aims to continuously learn new classes from data stream and the samples of data stream are seen only once, which suffers from the catastrophic forgetting issue, i.e., forgetting historical knowledge of old classes. Existing replay-based methods effectively alleviate this issue by saving and replaying part of old data in a proxy-based or contrastive-based replay manner. Although these two replay manners are effective, the former would incline to new classes due to class imbalance issues, and the latter is unstable and hard to converge because of the limited number of samples. In this paper, we conduct a comprehensive analysis of these two replay manners and find that they can be complementary. Inspired by this finding, we propose a novel replay-based method called proxy-based contrastive replay (PCR). The key operation is to replace the contrastive samples of anchors with corresponding proxies in the contrastive-based way. It alleviates the phenomenon of catastrophic forgetting by effectively addressing the imbalance issue, as well as keeps a faster convergence of the model. We conduct extensive experiments on three real-world benchmark datasets, and empirical results consistently demonstrate the superiority of PCR over various state-of-the-art methods.


Artificial Intelligence/Operations Research Workshop 2 Report Out

arXiv.org Artificial Intelligence

Artificial intelligence (AI) has received significant attention in recent years, primarily due to breakthroughs in game playing, computer vision, and natural language processing that captured the imagination of the scientific community and the public at large. Many businesses, industries, and academic disciplines are now contemplating the application of AI to their own challenges. The federal government in the US and other countries have also invested significantly in advancing AI research and created funding initiatives and programs to promote greater collaboration across multiple communities. Some of the investment examples in the US include the establishment of the National AI Initiative Office, the launch of the National AI Research Resource Task Force, and more recently, the establishment of the National AI Advisory Committee. In 2021 INFORMS and ACM SIGAI joined together with the Computing Community Consortium (CCC) to organize a series of three workshops. The objective for this workshop series is to explore ways to exploit the synergies of the AI and Operations Research (OR) communities to transform decision making.



A Comprehensive Survey on Knowledge Distillation of Diffusion Models

arXiv.org Artificial Intelligence

Diffusion Models (DMs), also referred to as score-based diffusion models, utilize neural networks to specify score functions. Unlike most other probabilistic models, DMs directly model the score functions, which makes them more flexible to parametrize and potentially highly expressive for probabilistic modeling. DMs can learn fine-grained knowledge, i.e., marginal score functions, of the underlying distribution. Therefore, a crucial research direction is to explore how to distill the knowledge of DMs and fully utilize their potential. Our objective is to provide a comprehensible overview of the modern approaches for distilling DMs, starting with an introduction to DMs and a discussion of the challenges involved in distilling them into neural vector fields. We also provide an overview of the existing works on distilling DMs into both stochastic and deterministic implicit generators. Finally, we review the accelerated diffusion sampling algorithms as a training-free method for distillation. Our tutorial is intended for individuals with a basic understanding of generative models who wish to apply DM's distillation or embark on a research project in this field.


Online Active Learning for Soft Sensor Development using Semi-Supervised Autoencoders

arXiv.org Artificial Intelligence

Data-driven soft sensors are extensively used in industrial and chemical processes to predict hard-to-measure process variables whose real value is difficult to track during routine operations. The regression models used by these sensors often require a large number of labeled examples, yet obtaining the label information can be very expensive given the high time and cost required by quality inspections. In this context, active learning methods can be highly beneficial as they can suggest the most informative labels to query. However, most of the active learning strategies proposed for regression focus on the offline setting. In this work, we adapt some of these approaches to the stream-based scenario and show how they can be used to select the most informative data points. We also demonstrate how to use a semi-supervised architecture based on orthogonal autoencoders to learn salient features in a lower dimensional space. The Tennessee Eastman Process is used to compare the predictive performance of the proposed approaches.



AI Applications in Marketing and Finance

#artificialintelligence

In this course, you will learn about AI-powered applications that can enhance the customer journey and extend the customer lifecycle. You will learn how this AI-powered data can enable you to analyze consumer habits and maximize their potential to target your marketing to the right people. You will also learn about fraud, credit risks, and how AI applications can also help you combat the ever-challenging landscape of protecting consumer data. You will also learn methods to utilize supervised and unsupervised machine learning to enhance your fraud detection methods. You will also hear from leading industry experts in the world of data analytics, marketing, and fraud prevention.


Mastering Time Management: A Step-by-Step Guide to Building a Virtual Assistant for Scheduling and Reminders with Machine Learning (Python + Google Calendar) - Code Armada, LLC

#artificialintelligence

Mastering Time Management: A Step-by-Step Guide to Building a Virtual Assistant for Scheduling and Reminders with Machine Learning (Python + Google Calendar) In today’s fast-paced world, managing time and staying organized is crucial. Virtual assistants have become increasingly popular for handling scheduling, reminders, and other day-to-day tasks. In this tutorial, we will walk you through the process of developing a virtual assistant for scheduling and reminders using machine learning. We will cover the necessary steps, including data preparation, model selection, implementation, and deployment. Prerequisites: Basic understanding of Python programming Familiarity with machine learning concepts Access to a Python development environment (e.g., Jupyter Notebook, PyCharm, or Visual Studio Code) Section 1: Overview of Virtual Assistant Functionality Before diving into the implementation, let’s discuss the core functionalities of our virtual assistant. Our virtual assistant will: Understand natural language input for scheduling tasks and setting reminders Interact with users through a text-based interface Integrate with calendar applications for scheduling Send notifications for reminders Section 2: Data Preparation and Preprocessing To create a machine learning model capable of understanding natural language input, we first need to gather and preprocess the data. We will need a dataset containing text data with user queries related to […]


Marc Vidal on LinkedIn: Definición de 'machine learning' ____ Machine learning (aprendizaje…

#artificialintelligence

Hello everyone, If you are looking to learn machine learning, there are several courses available online that can help you get started. Here are some of the best machine learning courses that you can take to learn the skills and knowledge required to succeed in this field: Machine Learning by Andrew Ng: This is one of the most popular machine learning courses available online. Taught by Andrew Ng, a renowned AI expert and founder of Google Brain, this course covers all the basics of machine learning, including supervised and unsupervised learning, linear regression, logistic regression, and more. Applied Data Science with Python by University of Michigan: This course is designed to teach you how to apply machine learning techniques to real-world problems using the Python programming language. You'll learn how to use popular machine learning libraries like scikit-learn and pandas to build predictive models and analyze data. Deep Learning by Yoshua Bengio: This course provides an in-depth understanding of deep learning techniques and architectures.