Goto

Collaborating Authors

 Instructional Material


Recommender Systems and Deep Learning in Python - Udemy Free Coupons Discount - Couse Sites

#artificialintelligence

Free Coupon Discount - The most in-depth course on recommendation systems with deep learning, machine learning, data science, and AI techniques Created by Lazy Programmer Inc. Students also bought Artificial Intelligence: Reinforcement Learning in Python Data Science: Natural Language Processing (NLP) in Python Unsupervised Machine Learning Hidden Markov Models in Python Natural Language Processing with Deep Learning in Python Cluster Analysis and Unsupervised Machine Learning in Python Preview this Udemy Course GET COUPON CODE Description Believe it or not, almost all online businesses today make use of recommender systems in some way or another. What do I mean by "recommender systems", and why are they useful? Let's look at the top 3 websites on the Internet, according to Alexa: Google, YouTube, and Facebook. Recommender systems form the very foundation of these technologies. Google: Search results They are why Google is the most successful technology company today.


GitHub - ajaymache/machine-learning-yearning: Machine Learning Yearning book by ๐Ÿ…ฐ ๐“ท๐“ญ๐“ป๐“ฎ๐”€ ๐Ÿ†–

#artificialintelligence

The book has been divided into 13 parts originally by Prof. Andrew NG along with the complete book with all the parts consolidated. In this book you will learn how to align on ML strategies in a team setting, as well as how to set up development (dev) sets and test sets. Recommendations for how to set up dev/test sets have been changing as Machine Learning is moving toward bigger datasets, and this explains how you should do it for modern ML projects.


Step By Step Guide To Build Visual Inspection of Casting Productsโ€ฆ โ€“ Towards AI

#artificialintelligence

Originally published on Towards AI. Introduction Casting refers to the manufacturing process in which a molten material like metal is poured into a hole or mold with the...


Learn Machine Learning - [2022] Best Machine Learning Tutorials

#artificialintelligence

Learning Machine Learning? Check out these best online Machine Learning courses and tutorials recommended by the data science community. Pick the tutorial as per your learning style: video tutorials or a book. Free course or paid. Tutorials for beginners or advanced learners. Check Machine Learning community's reviews & comments.


10 Best Advanced Machine Learning Courses You Must Know in 2023

#artificialintelligence

Are you looking for the Best Advanced Machine Learning Courses?โ€ฆ If yes, then this article is for you. In this article, you will find the 10 Best Advanced Machine Learning Courses. To gain Machine Learning skills, there are numerous courses available. So, without wasting your time, let's start finding the Best Advanced Machine Learning Coursesโ€“ This is a Nanodegree Program offered by Udacity.


Adaptive Policy Learning for Offline-to-Online Reinforcement Learning

arXiv.org Artificial Intelligence

Conventional reinforcement learning (RL) needs an environment to collect fresh data, which is impractical when online interactions are costly. Offline RL provides an alternative solution by directly learning from the previously collected dataset. However, it will yield unsatisfactory performance if the quality of the offline datasets is poor. In this paper, we consider an offline-to-online setting where the agent is first learned from the offline dataset and then trained online, and propose a framework called Adaptive Policy Learning for effectively taking advantage of offline and online data. Specifically, we explicitly consider the difference between the online and offline data and apply an adaptive update scheme accordingly, that is, a pessimistic update strategy for the offline dataset and an optimistic/greedy update scheme for the online dataset. Such a simple and effective method provides a way to mix the offline and online RL and achieve the best of both worlds. We further provide two detailed algorithms for implementing the framework through embedding value or policy-based RL algorithms into it. Finally, we conduct extensive experiments on popular continuous control tasks, and results show that our algorithm can learn the expert policy with high sample efficiency even when the quality of offline dataset is poor, e.g., random dataset.


DualMix: Unleashing the Potential of Data Augmentation for Online Class-Incremental Learning

arXiv.org Artificial Intelligence

Online Class-Incremental (OCI) learning has sparked new approaches to expand the previously trained model knowledge from sequentially arriving data streams with new classes. Unfortunately, OCI learning can suffer from catastrophic forgetting (CF) as the decision boundaries for old classes can become inaccurate when perturbated by new ones. Existing literature have applied the data augmentation (DA) to alleviate the model forgetting, while the role of DA in OCI has not been well understood so far. In this paper, we theoretically show that augmented samples with lower correlation to the original data are more effective in preventing forgetting. However, aggressive augmentation may also reduce the consistency between data and corresponding labels, which motivates us to exploit proper DA to boost the OCI performance and prevent the CF problem. We propose the Enhanced Mixup (EnMix) method that mixes the augmented samples and their labels simultaneously, which is shown to enhance the sample diversity while maintaining strong consistency with corresponding labels. Further, to solve the class imbalance problem, we design an Adaptive Mixup (AdpMix) method to calibrate the decision boundaries by mixing samples from both old and new classes and dynamically adjusting the label mixing ratio. Our approach is demonstrated to be effective on several benchmark datasets through extensive experiments, and it is shown to be compatible with other replay-based techniques.


Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs

arXiv.org Artificial Intelligence

Training data attribution (TDA) methods offer to trace a model's prediction on any given example back to specific influential training examples. Existing approaches do so by assigning a scalar influence score to each training example, under a simplifying assumption that influence is additive. But in reality, we observe that training examples interact in highly non-additive ways due to factors such as inter-example redundancy, training order, and curriculum learning effects. To study such interactions, we propose Simfluence, a new paradigm for TDA where the goal is not to produce a single influence score per example, but instead a training run simulator: the user asks, ``If my model had trained on example $z_1$, then $z_2$, ..., then $z_n$, how would it behave on $z_{test}$?''; the simulator should then output a simulated training run, which is a time series predicting the loss on $z_{test}$ at every step of the simulated run. This enables users to answer counterfactual questions about what their model would have learned under different training curricula, and to directly see where in training that learning would occur. We present a simulator, Simfluence-Linear, that captures non-additive interactions and is often able to predict the spiky trajectory of individual example losses with surprising fidelity. Furthermore, we show that existing TDA methods such as TracIn and influence functions can be viewed as special cases of Simfluence-Linear. This enables us to directly compare methods in terms of their simulation accuracy, subsuming several prior TDA approaches to evaluation. In experiments on large language model (LLM) fine-tuning, we show that our method predicts loss trajectories with much higher accuracy than existing TDA methods (doubling Spearman's correlation and reducing mean-squared error by 75%) across several tasks, models, and training methods.


Semantics-enhanced Temporal Graph Networks for Content Popularity Prediction

arXiv.org Artificial Intelligence

The surging demand for high-definition video streaming services and large neural network models (e.g., Generative Pre-trained Transformer, GPT) implies a tremendous explosion of Internet traffic. To mitigate the traffic pressure, architectures with in-network storage have been proposed to cache popular contents at devices in closer proximity to users. Correspondingly, in order to maximize caching utilization, it becomes essential to devise an effective popularity prediction method. In that regard, predicting popularity with dynamic graph neural network (DGNN) models achieve remarkable performance. However, DGNN models still suffer from tackling sparse datasets where most users are inactive. Therefore, we propose a reformative temporal graph network, named semantics-enhanced temporal graph network (STGN), which attaches extra semantic information into the user-content bipartite graph and could better leverage implicit relationships behind the superficial topology structure. On top of that, we customize its temporal and structural learning modules to further boost the prediction performance. Specifically, in order to efficiently aggregate the diversified semantics that a content might possess, we design a user-specific attention (UsAttn) mechanism for temporal learning module. Unlike the attention mechanism that only analyzes the influence of genres on content, UsAttn also considers the attraction of semantic information to a specific user. Meanwhile, as for the structural learning, we introduce the concept of positional encoding into our attention-based graph learning and adopt a semantic positional encoding (SPE) function to facilitate the analysis of content-oriented user-association analysis. Finally, extensive simulations verify the superiority of our STGN models and demonstrate the effectiveness in content caching.


Hike in AI-Created YouTube Videos Loaded With Malware

#artificialintelligence

Artificial Intelligence is being used to generate videos pretending to be step-by-step tutorials on how to access programs like Photoshop, Premiere Pro, Autodesk 3ds Max, AutoCAD, and others without a license. Instead, the videos are loaded with infostealer malware that scrapes the viewer's sensitive personal data stored on the device. Researchers with CloudSEK measured a month-over-month increase of 200% to 300% since November 2022 of AI-created YouTube videos with links to infostealer malware, including Vidar, RedLine, and Raccoon. Making the video lures more compelling to its targets, the CloudSEK security team added, AI video tools such as Synthesia and D-ID are being used to generate personas intended to exude trustworthiness across multiple languages and social media platforms, supercharging threat actors' ability to deliver infostealer malware. "It is well known that videos featuring humans, especially those certain facial features, appear more familiar and trustworthy," the CloudSEK report explained.