Instructional Material
A tutorial on learning from preferences and choices with Gaussian Processes
Benavoli, Alessio, Azzimonti, Dario
Preference modelling lies at the intersection of economics, decision theory, machine learning and statistics. By understanding individuals' preferences and how they make choices, we can build products that closely match their expectations, paving the way for more efficient and personalised applications across a wide range of domains. The objective of this tutorial is to present a cohesive and comprehensive framework for preference learning with Gaussian Processes (GPs), demonstrating how to seamlessly incorporate rationality principles (from economics and decision theory) into the learning process. By suitably tailoring the likelihood function, this framework enables the construction of preference learning models that encompass random utility models, limits of discernment, and scenarios with multiple conflicting utilities for both object- and label-preference. This tutorial builds upon established research while simultaneously introducing some novel GP-based models to address specific gaps in the existing literature.
scCDCG: Efficient Deep Structural Clustering for single-cell RNA-seq via Deep Cut-informed Graph Embedding
Xu, Ping, Ning, Zhiyuan, Xiao, Meng, Feng, Guihai, Li, Xin, Zhou, Yuanchun, Wang, Pengfei
Single-cell RNA sequencing (scRNA-seq) is essential for unraveling cellular heterogeneity and diversity, offering invaluable insights for bioinformatics advancements. Despite its potential, traditional clustering methods in scRNA-seq data analysis often neglect the structural information embedded in gene expression profiles, crucial for understanding cellular correlations and dependencies. Existing strategies, including graph neural networks, face challenges in handling the inefficiency due to scRNA-seq data's intrinsic high-dimension and high-sparsity. Addressing these limitations, we introduce scCDCG (single-cell RNA-seq Clustering via Deep Cut-informed Graph), a novel framework designed for efficient and accurate clustering of scRNA-seq data that simultaneously utilizes intercellular high-order structural information. scCDCG comprises three main components: (i) A graph embedding module utilizing deep cut-informed techniques, which effectively captures intercellular high-order structural information, overcoming the over-smoothing and inefficiency issues prevalent in prior graph neural network methods. (ii) A self-supervised learning module guided by optimal transport, tailored to accommodate the unique complexities of scRNA-seq data, specifically its high-dimension and high-sparsity. (iii) An autoencoder-based feature learning module that simplifies model complexity through effective dimension reduction and feature extraction. Our extensive experiments on 6 datasets demonstrate scCDCG's superior performance and efficiency compared to 7 established models, underscoring scCDCG's potential as a transformative tool in scRNA-seq data analysis. Our code is available at: https://github.com/XPgogogo/scCDCG.
Clustering in Dynamic Environments: A Framework for Benchmark Dataset Generation With Heterogeneous Changes
Yazdani, Danial, Branke, Juergen, Khorshidi, Mohammad Sadegh, Omidvar, Mohammad Nabi, Li, Xiaodong, Gandomi, Amir H., Yao, Xin
Clustering in dynamic environments is of increasing importance, with broad applications ranging from real-time data analysis and online unsupervised learning to dynamic facility location problems. While meta-heuristics have shown promising effectiveness in static clustering tasks, their application for tracking optimal clustering solutions or robust clustering over time in dynamic environments remains largely underexplored. This is partly due to a lack of dynamic datasets with diverse, controllable, and realistic dynamic characteristics, hindering systematic performance evaluations of clustering algorithms in various dynamic scenarios. This deficiency leads to a gap in our understanding and capability to effectively design algorithms for clustering in dynamic environments. To bridge this gap, this paper introduces the Dynamic Dataset Generator (DDG). DDG features multiple dynamic Gaussian components integrated with a range of heterogeneous, local, and global changes. These changes vary in spatial and temporal severity, patterns, and domain of influence, providing a comprehensive tool for simulating a wide range of dynamic scenarios.
Lecture notes on rough paths and applications to machine learning
Cass, Thomas, Salvi, Cristopher
These notes expound the recent use of the signature transform and rough path theory in data science and machine learning. We develop the core theory of the signature from first principles and then survey some recent popular applications of this approach, including signature-based kernel methods and neural rough differential equations. The notes are based on a course given by the two authors at Imperial College London.
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
Yue, Murong, Mifdal, Wijdane, Zhang, Yixuan, Suh, Jennifer, Yao, Ziyu
Mathematical modeling (MM) is considered a fundamental skill for students in STEM disciplines. Practicing the MM skill is often the most effective when students can engage in group discussion and collaborative problem-solving. However, due to unevenly distributed teachers and educational resources needed to monitor such group activities, students do not always receive equal opportunities for this practice. Excitingly, large language models (LLMs) have recently demonstrated strong capability in both modeling mathematical problems and simulating characters with different traits and properties. Drawing inspiration from the advancement of LLMs, in this work, we present MATHVC, the very first LLM-powered virtual classroom containing multiple LLM-simulated student characters, with whom a human student can practice their MM skill. To encourage each LLM character's behaviors to be aligned with their specified math-relevant properties (termed "characteristics alignment") and the overall conversational procedure to be close to an authentic student MM discussion (termed "conversational procedural alignment"), we proposed three innovations: integrating MM domain knowledge into the simulation, defining a symbolic schema as the ground for character simulation, and designing a meta planner at the platform level to drive the conversational procedure. Through experiments and ablation studies, we confirmed the effectiveness of our simulation approach and showed the promise for MATHVC to benefit real-life students in the future.
Learn AI and automation tools for just 60
AI has been taking the world by storm over the past few years. Everyone is interested in saving time and money through automation, and there are myriad tools to help you do so. With The Ultimate Artificial Intelligence & Automation Developer Bundle, not only will you learn how to leverage existing tools, but you'll also learn how to create your own. This 13-course bundle includes courses from leading instructors like Dr. Chris Mall (4.6/5-star instructor rating), Bryan Guerra (4.4/5-star rating), and Alex Genadinik (4.4/5-star rating). You'll cover tools like ChatGPT, Midjourney, and DALL-E to learn how to effectively leverage text and image generators to save time and level up your productivity.
Indexing Analytics to Instances: How Integrating a Dashboard can Support Design Education
Jain, Ajit, Kerne, Andruid, Lupfer, Nic, Britain, Gabriel, Perrine, Aaron, Choe, Yoonsuck, Keyser, John, Huang, Ruihong, Seo, Jinsil, Sungkajun, Annie, Lightfoot, Robert, McGuire, Timothy
We investigate how to use AI-based analytics to support design education. The analytics at hand measure multiscale design, that is, students' use of space and scale to visually and conceptually organize their design work. With the goal of making the analytics intelligible to instructors, we developed a research artifact integrating a design analytics dashboard with design instances, and the design environment that students use to create them. We theorize about how Suchman's notion of mutual intelligibility requires contextualized investigation of AI in order to develop findings about how analytics work for people. We studied the research artifact in 5 situated course contexts, in 3 departments. A total of 236 students used the multiscale design environment. The 9 instructors who taught those students experienced the analytics via the new research artifact. We derive findings from a qualitative analysis of interviews with instructors regarding their experiences. Instructors reflected on how the analytics and their presentation in the dashboard have the potential to affect design education. We develop research implications addressing: (1) how indexing design analytics in the dashboard to actual design work instances helps design instructors reflect on what they mean and, more broadly, is a technique for how AI-based design analytics can support instructors' assessment and feedback experiences in situated course contexts; and (2) how multiscale design analytics, in particular, have the potential to support design education. By indexing, we mean linking which provides context, here connecting the numbers of the analytics with visually annotated design work instances.
Teaching Higher-Order Logic Using Isabelle
Lund, Simon Tobias, Villadsen, Jรธrgen
Higher-order logic, also known as simple type theory [3], has been described as the combination of functional programming and logic [9], and has proved a very powerful tool for the formalization of mathematics and computer science. It is an expressive enough logic to cover a wide array of fields, while still being built on relatively simple principles, and a number of proof assistants based on higher-order logic are available. We consider formal reasoning in the generic proof assistant Isabelle [10, 11]. In the present paper we are taking advantage of the genericity of Isabelle, but we also find that Isabelle is at least as user-friendly and intuitive as other proof assistants of comparable power. Although Isabelle is generic and comes with a number of object logics like first-order logic (FOL) and axiomatic set theory (ZF), the default object logic is higher-order logic, called Isabelle/HOL.
Automatic Authorities: Power and AI
Forthcoming in Collaborative Intelligence: How Humans and AI are Transforming our World, Arathi Sethumadhavan and Mira Lane (eds.), Seth Lazar, Australian National University Man, a child in understanding of himself, has placed in his hands physical tools of incalculable power. He plays with them like a child, and whether they work harm or good is largely a matter of accident. The instrumentality becomes a master and works fatally as if possessed of a will of its own-- not because it has a will but because man has not. Introduction As rapid advances in Artificial Intelligence and the rise of some of history's most potent corporations meet the diminished neoliberal state, people are increasingly subject to power exercised by means of automated systems. Machine learning, big data, and related computational technologies now underpin vital government services from criminal justice to tax auditing, public health to social services, immigration to defence (Citron, 2008; Calo and Citron, 2020; Engstrom et al., 2020). Google and Amazon connect consumers and producers in new algorithmic markets (Nadler and Cicilline, 2020). Google's search algorithm--and possibly in the near future OpenAI's GPT-4 or another large language model--determines, for many, how they find out about everything from how to vote to where to get vaccinated. Meta, Twitter, TikTok, Google and others algorithmically decide whose speech is amplified, reduced, or restricted (Vaidhyanathan, 2011; Pasquale, 2015; Gillespie, 2018; Suzor, 2019). And a new wave of products based on rapid advances in Large Language Models (LLMs) have the potential to further transform our economic and political lives. Automatic Authorities are automated computational systems used to exercise power over us by substantially determining what we may know, what we may have, and what our options will be. This chapter is based on, and substantially revises, my'Power and AI: Nature and Justification', in the Oxford Handbook of AI Governance (Justin Bullock et al., eds). My thanks to the publisher for their permission to use this material. But what normative lessons should we draw from these analyses? Power is everywhere, and is not necessarily bad.
Explaining EDA synthesis errors with LLMs
Qiu, Siyu, Tan, Benjamin, Pearce, Hammond
Training new engineers in digital design is a challenge, particularly when it comes to teaching the complex electronic design automation (EDA) tooling used in this domain. Learners will typically deploy designs in the Verilog and VHDL hardware description languages to Field Programmable Gate Arrays (FPGAs) from Altera (Intel) and Xilinx (AMD) via proprietary closed-source toolchains (Quartus Prime and Vivado, respectively). These tools are complex and difficult to use -- yet, as they are the tools used in industry, they are an essential first step in this space. In this work, we examine how recent advances in artificial intelligence may be leveraged to address aspects of this challenge. Specifically, we investigate if Large Language Models (LLMs), which have demonstrated text comprehension and question-answering capabilities, can be used to generate novice-friendly explanations of compile-time synthesis error messages from Quartus Prime and Vivado. To perform this study we generate 936 error message explanations using three OpenAI LLMs over 21 different buggy code samples. These are then graded for relevance and correctness, and we find that in approximately 71% of cases the LLMs give correct & complete explanations suitable for novice learners.