Goto

Collaborating Authors

 Instructional Material


40 Under 40 Data Scientists 2023 – Who are they?

#artificialintelligence

Following two action-packed days of workshops, conferences, paper presentations, and tech talks, Machine Learning Developers Summit 2023 concluded by awarding 40 dynamic data scientists with the 40 Under 40 Data Scientists award. Aakash is a seasoned analytics leader with 15 years experience and has been instrumental in driving data and insight-led transformations. Over his career, he has worked closely with biz functions to drive revenue and achieve aggressive market growth by leveraging more than 50 analytical approaches. He also has experience in launching AI and tech-based solutions like Omni Channel Attribution, Customer Segmentation, Customer-360, Supply Chain Efficiency, Workforce Management and more at telecom, media, FMCG, retail, and ecommerce industries. Abhilash Surendran is assistant vice president, analytics, and data science at Merkle, leading the analytics practise for their high-tech portfolio. He comes with 15 years of experience in advanced analytics, data science, data visualisation and consulting.


Develop Your First Neural Network with PyTorch, Step-by-Step - MachineLearningMastery.com Develop Your First Neural Network with PyTorch, Step-by-Step - MachineLearningMastery.com

#artificialintelligence

PyTorch is a powerful Python library for building deep learning models. It provides everything you need to define and train a neural network and use it for inference. You don't need to write a lot of code to get all these done. In this pose, you will discover how to create your first deep learning neural network model in Python using PyTorch. Develop Your First Neural Network with PyTorch, Step-by-Step Photo by drown_ in_city.


DBE-KT22: A Knowledge Tracing Dataset Based on Online Student Evaluation

arXiv.org Artificial Intelligence

The recent global pandemic further amplified the impact of online education as an effective alternative that could overcome physical distancing restrictions imposed on students and teaching staff in schools and university campuses. Nevertheless, one of the significant challenges that need to be addressed in online education systems is the ability to effectively trace a student's learning progress, similar to what a human teacher would do in the classroom. Human teachers rely on their intuition and experience to estimate a student's knowledge state and tailor the learning process accordingly. Acquiring such ability would enable online education systems to archive many vital education objectives, including customized curriculum generation, learning materials recommendation, exercise recommendation, automatic evaluation, or learning feedback generation. Achieving such objectives would facilitate automating the teaching process and pave the way for transforming the current online education systems into Intelligent Tutoring Systems (ITS). An ITS not only automates the teaching procedure using computer systems (e.g., web applications) but also handles supporting tasks such as customizing the learning experience and providing guidance and feedback to the students [1]. The Knowledge Tracing (KT) problem formulates the challenge of tracing a student's knowledge state based on their exercise answering history [2, 3]. In particular, the exercise answering history could be represented as a sequence of question-answer pairs, and the task of a solving computational model would be to predict the likelihood of correctly answering the following questions. Figure 1 depicts a probabilistic graphical model for a KT scenario.


KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in Low-Resource NLP

arXiv.org Artificial Intelligence

This paper focuses on the data augmentation for low-resource NLP tasks where the training set is limited. The existing solutions either leverage task-independent heuristic rules (e.g., Synonym Replacement) or fine-tune general-purpose pre-trained language models (e.g., GPT2) using the limited training instances to produce new synthetic data. Consequently, they have trivial task-specific knowledge and are limited to yielding low-quality synthetic data. To combat this issue, we propose Knowledge Mixture Data Augmentation Model (KnowDA) which is an Seq2Seq language model pre-trained on a mixture of diverse NLP tasks under a novel framework of Knowledge Mixture Training (KoMT). The goal of KoMT is to condense diverse NLP task-specific knowledge into the single KnowDA model (i.e., all-in-one) such that KnowDA could utilize these knowledge to quickly grasp the inherent synthesis law of the target task through limited training instances. Specifically, KoMT reformulates input examples from various heterogeneous NLP tasks into a unified text-to-text format, and employs denoising training objectives in different granularity to learn to reconstruct partial or complete samples. To the best of our knowledge, we are the first attempt to apply 100+ NLP multi-task training for data augmentation. Extensive experiments show that i) the synthetic data produced by KnowDA successfully improves performance of the strong pre-trained language models (i.e., Bert, ALBert and Deberta) by a large margin on the low-resource NLP benchmark FewGLUE, CoNLL'03 and WikiAnn; ii) KnowDA successfully transfers the task knowledge to NLP tasks whose types are seen and unseen in KoMT.


Streaming LifeLong Learning With Any-Time Inference

arXiv.org Artificial Intelligence

Despite rapid advancements in lifelong learning (LLL) research, a large body of research mainly focuses on improving the performance in the existing \textit{static} continual learning (CL) setups. These methods lack the ability to succeed in a rapidly changing \textit{dynamic} environment, where an AI agent needs to quickly learn new instances in a `single pass' from the non-i.i.d (also possibly temporally contiguous/coherent) data streams without suffering from catastrophic forgetting. For practical applicability, we propose a novel lifelong learning approach, which is streaming, i.e., a single input sample arrives in each time step, single pass, class-incremental, and subject to be evaluated at any moment. To address this challenging setup and various evaluation protocols, we propose a Bayesian framework, that enables fast parameter update, given a single training example, and enables any-time inference. We additionally propose an implicit regularizer in the form of snap-shot self-distillation, which effectively minimizes the forgetting further. We further propose an effective method that efficiently selects a subset of samples for online memory rehearsal and employs a new replay buffer management scheme that significantly boosts the overall performance. Our empirical evaluations and ablations demonstrate that the proposed method outperforms the prior works by large margins.


Neural Abstractions

arXiv.org Artificial Intelligence

We present a novel method for the safety verification of nonlinear dynamical models that uses neural networks to represent abstractions of their dynamics. Neural networks have extensively been used before as approximators; in this work, we make a step further and use them for the first time as abstractions. For a given dynamical model, our method synthesises a neural network that overapproximates its dynamics by ensuring an arbitrarily tight, formally certified bound on the approximation error. For this purpose, we employ a counterexample-guided inductive synthesis procedure. We show that this produces a neural ODE with non-deterministic disturbances that constitutes a formal abstraction of the concrete model under analysis. This guarantees a fundamental property: if the abstract model is safe, i.e., free from any initialised trajectory that reaches an undesirable state, then the concrete model is also safe. By using neural ODEs with ReLU activation functions as abstractions, we cast the safety verification problem for nonlinear dynamical models into that of hybrid automata with affine dynamics, which we verify using SpaceEx. We demonstrate that our approach performs comparably to the mature tool Flow* on existing benchmark nonlinear models. We additionally demonstrate and that it is effective on models that do not exhibit local Lipschitz continuity, which are out of reach to the existing technologies.


Existing EdTech That You Didn't Know You Needed - Pikmykid

#artificialintelligence

EdTech, or Educational Technology, refers to the use of technology to support and enhance teaching and learning. It can be a useful tool for educators by assisting to engage students, personalize learning, keep them safe and improve student outcomes. When it comes to the instructional methods and tools we use in our classrooms, there are several goals that educators may have in mind when selecting them. Therefore, knowing the types of technology available and the benefits of each is important. There are a variety of tools that have features promoting accessibility.


Multi-modal deep learning in less than 15 lines of code - KDnuggets

#artificialintelligence

For many machine learning use-cases, organizations rely solely on tabular data and tree-based models like XGBoost and LightGBM. This is because deep learning is simply too hard for most ML teams. As a result, teams miss out on valuable signals hidden within unstructured data like text and images. New declarative machine learning systems--like open-source Ludwig started at Uber--provide a low-code approach to automating ML that enables data teams to build and deploy state-of-the-art models faster with a simple configuration file. Specifically, Predibase--the leading low-code declarative ML platform--along Ludwig make it easy to build multi-modal deep learning models in 15 lines of code.


DCU to provide new machine learning module for undergrads

#artificialintelligence

The machine learning module was compiled by researchers at the Insight centre for data analytics and the DCU computer science faculty. Students at Dublin City University (DCU) will soon be able to avail of a new module that provides an introduction to machine learning. The module will be provided to undergraduate computer science students. They will be able to learn the basics, as well as get an insight into how different industries and professionals use machine learning. Machine learning is an AI application that enables systems to self-programme by recognising patterns in large data sets.


LAGAN: Deep Semi-Supervised Linguistic-Anthropology Classification with Conditional Generative Adversarial Neural Network

arXiv.org Artificial Intelligence

Education is a right of all, however, every individual is different than others. Teachers in post-communism era discover inherent individualism to equally train all towards job market of fourth industrial revolution. We can consider scenario of ethnic minority education in academic practices. Ethnic minority group has grown in their own culture and would prefer to be taught in their native way. We have formulated such linguistic anthropology(how people learn)based engagement as semi-supervised problem. Then, we have developed an conditional deep generative adversarial network algorithm namely LA-GAN to classify linguistic ethnographic features in student engagement. Theoretical justification proves the objective, regularization and loss function of our semi-supervised adversarial model. Survey questions are prepared to reach some form of assumptions about z-generation and ethnic minority group, whose learning style, learning approach and preference are our main area of interest.