Goto

Collaborating Authors

 Instructional Material


Hierarchical Fallback Architecture for High Risk Online Machine Learning Inference

arXiv.org Artificial Intelligence

These systems can fail unexpectedly in a variety of different ways. Notably, applications Open Banking powered machine learning applications require novel that rely on online inference are subject to their inability robustness approaches to deal with challenging stress and failure to keep up with the expected operating procedures while, now scenarios. In this paper we propose an hierarchical fallback architecture additionally, having to make tedious computational tasks for these for improving robustness in high risk machine learning AI/ML applications, typically resulting in timeouts, infrastructure applications with a focus in the financial domain. We define generic outages and, often, failures in external dependencies such as third failure scenarios often found in online inference that depend on party data providers (external API calls) [7]. When the underlying external data providers and we describe in detail how to apply the machine learning applications are presented with strong robustness hierarchical fallback architecture to address them. Finally, we offer requirements, fallback or fall-over strategies are needed to keep a real world example of its applicability in the industry for near-real operations running, even in the event of unexpected failures. In time transactional fraud risk evaluation using Open Banking data finance, specifically applications that require real time risk mitigation and under extreme stress scenarios.


Using Code Generation to Solve Open Instances of Combinatorial Design Problems

arXiv.org Artificial Intelligence

The Handbook of Combinatorial Designs catalogs many types of combinatorial designs, together with lists of open instances for which existence has not yet been determined. We develop a constructive protocol CPro1, which uses Large Language Models (LLMs) to generate code that constructs combinatorial designs and resolves some of these open instances. The protocol starts from a definition of a particular type of design, and a verifier that reliably confirms whether a proposed design is valid. The LLM selects strategies and implements them in code, and scaffolding provides automated hyperparameter tuning and execution feedback using the verifier. Most generated code fails, but by generating many candidates, the protocol automates exploration of a variety of standard methods (e.g.


A spectral clustering-type algorithm for the consistent estimation of the Hurst distribution in moderately high dimensions

arXiv.org Machine Learning

Scale invariance (fractality) is a prominent feature of the large-scale behavior of many stochastic systems. In this work, we construct an algorithm for the statistical identification of the Hurst distribution (in particular, the scaling exponents) undergirding a high-dimensional fractal system. The algorithm is based on wavelet random matrices, modified spectral clustering and a model selection step for picking the value of the clustering precision hyperparameter. In a moderately high-dimensional regime where the dimension, the sample size and the scale go to infinity, we show that the algorithm consistently estimates the Hurst distribution. Monte Carlo simulations show that the proposed methodology is efficient for realistic sample sizes and outperforms another popular clustering method based on mixed-Gaussian modeling. We apply the algorithm in the analysis of real-world macroeconomic time series to unveil evidence for cointegration.


Scary moment China's controversial ChatGPT rival, DeepSeek, changes its answers in REAL-TIME when probed about President Xi or the Tiananmen Square Massacre

Daily Mail - Science & tech

If you ask a historian what happened on June 4, 1989, you will learn that this was the date that hundreds of protesters were killed in the Tiananmen Square Massacre. However, if you ask China's controversial ChatGPT rival, DeepSeek, you are unlikely to get any response at all. During testing, MailOnline watched in real-time as answers to sensitive questions were scrubbed and replaced with evasive replies. On topics censored by the Chinese Communist Party (CCP), the bot briefly attempts to respond before wiping away its own answers in front of the user's eyes. DeepSeek actively censors topics related to protests such as the White Paper Movement and Hong Kong's pro-democratic Umbrella Movement.


How to install the Copilot app for Windows 11

PCWorld

Instead of integrating Copilot as a function in Windows, Microsoft has presented its AI assistant as a new app. The app seems to automatically install itself onto your machine via the monthly updates, but only if you're in the US. However, if you're from the EU, the app is only available on the Microsoft Store. We'll show you how to manually install it (if you chose to embrace Copilot and its features that is). Here's how I fixed it To install the app, open the Microsoft Store via the Start menu, type Copilot into the search field, and click on "Microsoft Copilot."


MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs

arXiv.org Artificial Intelligence

We present MultiChallenge, a pioneering benchmark evaluating large language models (LLMs) on conducting multi-turn conversations with human users, a crucial yet underexamined capability for their applications. MultiChallenge identifies four categories of challenges in multi-turn conversations that are not only common and realistic among current human-LLM interactions, but are also challenging to all current frontier LLMs. All 4 challenges require accurate instruction-following, context allocation, and in-context reasoning at the same time. We also develop LLM as judge with instance-level rubrics to facilitate an automatic evaluation method with fair agreement with experienced human raters. Despite achieving near-perfect scores on existing multi-turn evaluation benchmarks, all frontier models have less than 50% accuracy on MultiChallenge, with the top-performing Claude 3.5 Sonnet (June 2024) achieving just a 41.4% average accuracy.


Online-BLS: An Accurate and Efficient Online Broad Learning System for Data Stream Classification

arXiv.org Artificial Intelligence

The state-of-the-art online learning models generally conduct a single online gradient descent when a new sample arrives and thus suffer from suboptimal model weights. To this end, we introduce an online broad learning system framework with closed-form solutions for each online update. Different from employing existing incremental broad learning algorithms for online learning tasks, which tend to incur degraded accuracy and expensive online update overhead, we design an effective weight estimation algorithm and an efficient online updating strategy to remedy the above two deficiencies, respectively. Specifically, an effective weight estimation algorithm is first developed by replacing notorious matrix inverse operations with Cholesky decomposition and forward-backward substitution to improve model accuracy. Second, we devise an efficient online updating strategy that dramatically reduces online update time. Theoretical analysis exhibits the splendid error bound and low time complexity of our model. The most popular test-then-training evaluation experiments on various real-world datasets prove its superiority and efficiency. Furthermore, our framework is naturally extended to data stream scenarios with concept drift and exceeds state-of-the-art baselines.


Implementation of a Generative AI Assistant in K-12 Education: The CGScholar AI Helper Initiative

arXiv.org Artificial Intelligence

This paper focuses on the piloting of the CGScholar AI Helper, a Generative AI (GenAI) assistant tool that aims to provide feedback on writing in high school contexts. The aim was to use GenAI to provide formative and summative feedback on students' texts in English Language Arts (ELA) and History. The trials discussed in this paper relate to Grade 11, a crucial learning phase when students are working towards college readiness. These trials took place in two very different schools in the Midwest of the United States, one in a low socio-economic background with low-performance outcomes and the other in a high socio-economic background with high-performance outcomes. The assistant tool used two main mechanisms "prompt engineering" based on participant teachers' assessment rubric and "fine-tuning" a Large Language Model (LLM) from a customized corpus of teaching materials using Retrieval Augmented Generation (RAG). This paper focuses on the CGScholar AI Helper's potential to enhance students' writing abilities and support teachers in ELA and other subject areas requiring written assignments.


Reviews: Gradient based sample selection for online continual learning

Neural Information Processing Systems

This paper proposes an approach to optimally select samples for a small replay buffer to perform continual learning (CL) without forgetting. GEM/A-GEM) the problem is formulated from the perspective of constrained optimisation (minimise loss on current sample subject to loss not increasing on previous ones). Unlike GEM, with clear separation and knowledge of tasks, this approach addresses the general non-stationary learning problem. The paper proposes a theoretical argument for using the variance of gradients to select samples for the buffer. One related work that could also be cited is "Adapting Auxiliary Losses using Gradient Similarity", by Du et al, 2018 (https://arxiv.org/abs/1812.02224)


I didn't know what the heck I was doing on ChatGPT until I took this course

Popular Science

I'm not going to lie--when ChatGPT first came out and blew everyone's minds, I was pretty hesitant about it. I'm not going to say I was anti-AI, but I just figured I'd do the work myself to ensure it was right, especially since I'd heard a few of my coworkers complain about how ChatGPT could never give them perfect results. But in recent months, I've started getting so much more scrambled with work, and it's not super sustainable to rely on myself for all the answers. So, I finally started branching out and using ChatGPT, but ran into similar frustrations my coworkers did. Thankfully, I found this ChatGPT beginner course for only 9.99, and it's seriously upgraded how I understand the chatbot and create prompts.