Instructional Material
Financial Engineering and Artificial Intelligence in Python
Have you ever thought about what would happen if you combined the power of machine learning and artificial intelligence with financial engineering? Today, you can stop imagining, and start doing. This course will teach you the core fundamentals of financial engineering, with a machine learning twist. We will cover must-know topics in financial engineering, such as: Exploratory data analysis, significance testing, correlations, alpha and beta Time series analysis, simple moving average, exponentially-weighted moving average Holt-Winters exponential smoothing model Efficient Market Hypothesis Random Walk Hypothesis Time series forecasting ("stock price prediction") Modern portfolio theory Efficient frontier / Markowitz bullet Mean-variance optimization Maximizing the Sharpe ratio Convex optimization with Linear Programming and Quadratic Programming Capital Asset Pricing Model (CAPM) Algorithmic trading (VIP only) Statistical Factor Models (VIP only) Regime Detection with Hidden Markov Models (VIP only) In addition, we will look at various non-traditional techniques which stem purely from the field of machine learning and artificial intelligence, such as: Classification models Unsupervised learning Reinforcement learning and Q-learning We will learn about the greatest flub made in the past decade by marketers posing as "machine learning experts" who promise to teach unsuspecting students how to "predict stock prices with LSTMs". You will learn exactly why their methodology is fundamentally flawed and why their results are complete nonsense.
Advanced Reinforcement Learning in Python: cutting-edge DQNs
This course will introduce you to the state of the art in Reinforcement Learning techniques. It will also prepare you for the next courses in this series, where we will explore other advanced methods that excel in other types of task. The course is focused on developing practical skills. Therefore, after learning the most important concepts of each family of methods, we will implement one or more of their algorithms in jupyter notebooks, from scratch.
ezCoref: Towards Unifying Annotation Guidelines for Coreference Resolution
Gupta, Ankita, Karpinska, Marzena, Zhao, Wenlong, Krishna, Kalpesh, Merullo, Jack, Yeh, Luke, Iyyer, Mohit, O'Connor, Brendan
Large-scale, high-quality corpora are critical for advancing research in coreference resolution. However, existing datasets vary in their definition of coreferences and have been collected via complex and lengthy guidelines that are curated for linguistic experts. These concerns have sparked a growing interest among researchers to curate a unified set of guidelines suitable for annotators with various backgrounds. In this work, we develop a crowdsourcing-friendly coreference annotation methodology, ezCoref, consisting of an annotation tool and an interactive tutorial. We use ezCoref to re-annotate 240 passages from seven existing English coreference datasets (spanning fiction, news, and multiple other domains) while teaching annotators only cases that are treated similarly across these datasets. Surprisingly, we find that reasonable quality annotations were already achievable (>90% agreement between the crowd and expert annotations) even without extensive training. On carefully analyzing the remaining disagreements, we identify the presence of linguistic cases that our annotators unanimously agree upon but lack unified treatments (e.g., generic pronouns, appositives) in existing datasets. We propose the research community should revisit these phenomena when curating future unified annotation guidelines.
Scientific Impact of Graph-Based Approaches in Deep Learning Studies -- A Bibliometric Comparison
Turker, Ilker, Tan, Serhat Orkun
Applying graph-based approaches in deep learning receives more attention over time. This study presents statistical analysis on the use of graph-based approaches in deep learning and examines the scientific impact of the related articles. Processing the data obtained from the Web of Science database, metrics such as the type of the articles, funding availability, indexing type, annual average number of citations and the number of access were analyzed to quantitatively reveal the effects on the scientific audience. It's outlined that deep learning-based studies gained momentum after year 2013, and the rate of graph-based approaches in all deep learning studies increased linearly from 1% to 4% within the following 10 years. Conference publications scanned in the Conference Proceeding Citation Index (CPCI) on the graph-based approaches receive significantly more citations. The citation counts of the SCI-Expanded and Emerging SCI indexed publications of the two streams are close to each other. While the citation performances of the supported and unsupported publications of the two sides were similar, pure deep learning studies received more citations on the journal publication side and graph-based approaches received more citations on the conference side. Despite their similar performance in recent years, graph-based studies show twice more citation performance as they get older, compared to traditional approaches. Annual average citation performance per article for all deep learning studies is 11.051 in 2014, while it is 22.483 for graph-based studies. Also, despite receiving 16% more access, graph-based papers get almost the same overall citation over time with the pure counterpart. This is an indication that graph-based approaches need a greater bunch of attention to follow, while pure deep learning counterpart is relatively simpler to get inside.
Sustainable Online Reinforcement Learning for Auto-bidding
Mou, Zhiyu, Huo, Yusen, Bai, Rongquan, Xie, Mingzhou, Yu, Chuan, Xu, Jian, Zheng, Bo
Recently, auto-bidding technique has become an essential tool to increase the revenue of advertisers. Facing the complex and ever-changing bidding environments in the real-world advertising system (RAS), state-of-the-art auto-bidding policies usually leverage reinforcement learning (RL) algorithms to generate real-time bids on behalf of the advertisers. Due to safety concerns, it was believed that the RL training process can only be carried out in an offline virtual advertising system (VAS) that is built based on the historical data generated in the RAS. In this paper, we argue that there exists significant gaps between the VAS and RAS, making the RL training process suffer from the problem of inconsistency between online and offline (IBOO). Firstly, we formally define the IBOO and systematically analyze its causes and influences. Then, to avoid the IBOO, we propose a sustainable online RL (SORL) framework that trains the auto-bidding policy by directly interacting with the RAS, instead of learning in the VAS. Specifically, based on our proof of the Lipschitz smooth property of the Q function, we design a safe and efficient online exploration (SER) policy for continuously collecting data from the RAS. Meanwhile, we derive the theoretical lower bound on the safety of the SER policy. We also develop a variance-suppressed conservative Q-learning (V-CQL) method to effectively and stably learn the auto-bidding policy with the collected data.
Data augmentation on-the-fly and active learning in data stream classification
Malialis, Kleanthis, Papatheodoulou, Dimitris, Filippou, Stylianos, Panayiotou, Christos G., Polycarpou, Marios M.
There is an emerging need for predictive models to be trained on-the-fly, since in numerous machine learning applications data are arriving in an online fashion. A critical challenge encountered is that of limited availability of ground truth information (e.g., labels in classification tasks) as new data are observed one-by-one online, while another significant challenge is that of class imbalance. This work introduces the novel Augmented Queues method, which addresses the dual-problem by combining in a synergistic manner online active learning, data augmentation, and a multi-queue memory to maintain separate and balanced queues for each class. We perform an extensive experimental study using image and time-series augmentations, in which we examine the roles of the active learning budget, memory size, imbalance level, and neural network type. We demonstrate two major advantages of Augmented Queues. First, it does not reserve additional memory space as the generation of synthetic data occurs only at training times. Second, learning models have access to more labelled data without the need to increase the active learning budget and / or the original memory size. Learning on-the-fly poses major challenges which, typically, hinder the deployment of learning models. Augmented Queues significantly improves the performance in terms of learning quality and speed. Our code is made publicly available.
Seeking AI resources for students in your university classroom?
It's no secret that artificial intelligence (AI) is one of the hottest topics in the tech world today. Every day, it seems like there's a new story about how AI is being used to improve some aspect of our lives, from personal assistants to driverless cars. Given all the hype, it's no wonder that educators are eager to introduce AI concepts to their students. Now, thanks to resources inside Intel's 5-module teaching kit for AI inference teaching the Intel Distribution of OpenVINO toolkit, it is easier than ever to introduce the concepts of deep learning AI to students. Get your students hands-on coding experience with this teacher kit, which comes with a lesson plan, 5-modules of workbooks, videos, quizzes, and Jupyter* Notebook coding lab tutorials.
[100%OFF] Build, Host & Manage Super-Fast WordPress Websites On 10Web
Would you like to learn how to Create Super-Fast and Amazing WordPress Websites using Elementor and manage their backend and development using 10Web web hosting to manage the Website's Security and Performance? Here in this course, you will learn how you can design responsive and amazing business and personal websites using Elementor Page Builder and 10Web's AI builder without any coding or programming knowledge. Even you don't need any WordPress Technical knowledge or any specific design knowledge to design these awesome Websites. Not only design but you will also learn how you can develop and manage the backend of these websites using 10Web hosting with any networking or programming knowledge very easy to make your websites super fast and secure with almost everything from SEO to Performance and Security with AI and Automation. If you are a Freelance Web Designer or a Web Design Agency owner or if you are looking to start a Web Design Business or your Freelance WordPress Web Design carrier, this course would help you with Managing Multiple Websites at one dashboard with automated backups and security options and not only that but you will get to know about Whitelabeling and branding options to help you be more professional secure and hassle-free because AI will be taking care of everything.
Deep Learning With Keras and TensorFlow
Go from beginner to mastery in Neural Networks with OpenCV's new course offering The "Deep Learning with Keras and TensorFlow" course is an intermediate level course, curated exclusively for both beginners and professionals. The course covers the basics as well as the advanced level concepts. The course contains content based videos along with practical demonstrations, that performs and explains each step required to complete the task. If you're new to this technology, then don't worry - the course covers the topics from the basics. If you have done some programming before, you should pick it up quickly.
TinyML with Arduino Nano RP2040 Connect
Machine learning model development for tiny low power microcontroller such as Arduino nano RP2040 connect. As you know, the TinyML field is constantly growing and developing. So, keeping in mind more sections with theoretical explanations with hands-on project ideas will be included in the near future. Tiny machine learning, which targets battery-operated devices, is broadly defined as a rapidly expanding field of machine learning technologies and applications that includes hardware (dedicated integrated circuits), algorithms, and software that can perform on-device sensor data analytics at extremely low power, typically in the mW range and below. It eliminates the requirement to send data to the cloud for classification thus providing more security.