AITopics | Instructional Material

Collaborating Authors

Instructional Material

An Information Theoretic Perspective on Conformal Prediction

Correia, Alvaro H. C., Massoli, Fabio Valerio, Louizos, Christos, Behboodi, Arash

arXiv.org Machine LearningJun-26-2024

Conformal Prediction (CP) is a distribution-free uncertainty estimation framework that constructs prediction sets guaranteed to contain the true answer with a user-specified probability. Intuitively, the size of the prediction set encodes a general notion of uncertainty, with larger sets associated with higher degrees of uncertainty. In this work, we leverage information theory to connect conformal prediction to other notions of uncertainty. More precisely, we prove three different ways to upper bound the intrinsic uncertainty, as described by the conditional entropy of the target variable given the inputs, by combining CP with information theoretical inequalities. Moreover, we demonstrate two direct and useful applications of such connection between conformal prediction and information theory: (i) more principled and effective conformal training objectives that generalize previous approaches and enable end-to-end training of machine learning models from scratch, and (ii) a natural mechanism to incorporate side information into conformal prediction. We empirically validate both applications in centralized and federated learning settings, showing our theoretical results translate to lower inefficiency (average prediction set size) for popular CP methods.

conformal prediction, inequality, prediction, (15 more...)

arXiv.org Machine Learning

2405.0214

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > Japan (0.04)
(7 more...)

Genre:

Overview (0.92)
Research Report > New Finding (0.46)
Instructional Material > Course Syllabus & Notes (0.46)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Facebook, Instagram are using your data to train AI: Learn how to protect it

FOX NewsJun-25-2024, 19:29:07 GMT

Kurt "Cyberguy" Knutsson talks about how to protect your social media posts; a toddler getting trapped in a Tesla after the battery died. Meta may have paused its plans to train artificial intelligence models for the lucky ones living in Europe, where laws protect people using Facebook and Instagram better than Americans. Here in the good ole USA, both Facebook and Instagram have already been combing through public posts from U.S. accounts to train and improve its AI capabilities, including its chatbot, since last year. The proposed privacy policy update for European Union and U.K. users, originally scheduled for June 26, would have allowed Meta to use publicly shared content for AI training. However, users and regulatory agencies opposed this plan, leading to its indefinite postponement in those regions.

cyberguy, meta, stop meta, (15 more...)

FOX News

Country:

Europe (0.56)
North America > United States (0.36)

Genre: Instructional Material > Course Syllabus & Notes (0.40)

Industry:

Information Technology > Security & Privacy (1.00)
Information Technology > Services (0.85)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.52)

Add feedback

Using Helium Balloon Flying Drones for Introductory CS Education

Cao, Stanley, Gregg, Christopher

arXiv.org Artificial IntelligenceJun-25-2024

In the rapidly evolving field of computer science education, novel approaches to teaching fundamental concepts are crucial for engaging a diverse student body. Given the growing demand for a computing-skilled workforce, it is essential to adapt educational methods to capture the interest of a broader audience than what current computing education typically targets. Engaging educational experiences have been shown to have a positive impact on learning outcomes and examination performance, especially within computing education. Moreover, physical computing devices have been shown to correlate with increased student motivation when students are studying computer science.

curriculum, drone, student, (13 more...)

arXiv.org Artificial Intelligence

2407.16909

Country:

North America > United States > New York > New York County > New York City (0.07)
North America > United States > District of Columbia > Washington (0.05)
South America > Suriname > Marowijne District > Albina (0.05)
(7 more...)

Genre:

Instructional Material > Course Syllabus & Notes (0.47)
Research Report > Promising Solution (0.34)

Industry:

Information Technology (1.00)
Education > Curriculum > Subject-Specific Education (1.00)
Education > Educational Setting (0.99)
Transportation (0.85)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.84)

Add feedback

Solving Hard Mizar Problems with Instantiation and Strategy Invention

Jakubův, Jan, Janota, Mikoláš, Urban, Josef

arXiv.org Artificial IntelligenceJun-25-2024

In this work, we prove over 3000 previously ATP-unproved Mizar/MPTP problems by using several ATP and AI methods, raising the number of ATP-solved Mizar problems from 75\% to above 80\%. First, we start to experiment with the cvc5 SMT solver which uses several instantiation-based heuristics that differ from the superposition-based systems, that were previously applied to Mizar,and add many new solutions. Then we use automated strategy invention to develop cvc5 strategies that largely improve cvc5's performance on the hard problems. In particular, the best invented strategy solves over 14\% more problems than the best previously available cvc5 strategy. We also show that different clausification methods have a high impact on such instantiation-based methods, again producing many new solutions. In total, the methods solve 3021 (21.3\%) of the 14163 previously unsolved hard Mizar problems. This is a new milestone over the Mizar large-theory benchmark and a large strengthening of the hammer methods for Mizar.

grackle, mizar problem, springer, (15 more...)

arXiv.org Artificial Intelligence

2406.17762

Country:

Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > District of Columbia > Washington (0.04)
(8 more...)

Genre:

Research Report (0.40)
Instructional Material (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.68)

Add feedback

Enhancing Explainability of Knowledge Learning Paths: Causal Knowledge Networks

Wei, Yuang, Zhou, Yizhou, Jiang, Yuan-Hao, Jiang, Bo

arXiv.org Artificial IntelligenceJun-25-2024

A reliable knowledge structure is a prerequisite for building effective adaptive learning systems and intelligent tutoring systems. Pursuing an explainable and trustworthy knowledge structure, we propose a method for constructing causal knowledge networks. This approach leverages Bayesian networks as a foundation and incorporates causal relationship analysis to derive a causal network. Additionally, we introduce a dependable knowledge-learning path recommendation technique built upon this framework, improving teaching and learning quality while maintaining transparency in the decision-making process.

causal relationship, experiment, knowledge component, (15 more...)

arXiv.org Artificial Intelligence

2406.17518

Country:

Asia > China > Shanghai > Shanghai (0.05)
Asia > Singapore (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(2 more...)

Genre:

Research Report (1.00)
Instructional Material (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification

Niemeijer, Joshua, Ehrhardt, Jan, Uzunova, Hristina, Handels, Heinz

arXiv.org Artificial IntelligenceJun-25-2024

The usage of medical image data for the training of large-scale machine learning approaches is particularly challenging due to its scarce availability and the costly generation of data annotations, typically requiring the engagement of medical professionals. The rapid development of generative models allows towards tackling this problem by leveraging large amounts of realistic synthetically generated data for the training process. However, randomly choosing synthetic samples, might not be an optimal strategy. In this work, we investigate the targeted generation of synthetic training data, in order to improve the accuracy and robustness of image classification. Therefore, our approach aims to guide the generative model to synthesize data with high epistemic uncertainty, since large measures of epistemic uncertainty indicate underrepresented data points in the training set. During the image generation we feed images reconstructed by an auto encoder into the classifier and compute the mutual information over the class-probability distribution as a measure for uncertainty.We alter the feature space of the autoencoder through an optimization process with the objective of maximizing the classifier uncertainty on the decoded image. By training on such data we improve the performance and robustness against test time data augmentations and adversarial attacks on several classifications tasks.

augmentation, classifier, epistemic uncertainty, (15 more...)

arXiv.org Artificial Intelligence

2406.17473

Country:

Europe > Germany (0.05)
Europe > Switzerland (0.04)

Genre:

Research Report (1.00)
Instructional Material > Online (0.41)
Instructional Material > Course Syllabus & Notes (0.41)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (0.62)
Information Technology > Security & Privacy (0.50)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

The Overcooked Generalisation Challenge

Ruhdorfer, Constantin, Bortoletto, Matteo, Penzkofer, Anna, Bulling, Andreas

arXiv.org Artificial IntelligenceJun-25-2024

We introduce the Overcooked Generalisation Challenge (OGC) - the first benchmark to study agents' zero-shot cooperation abilities when faced with novel partners and levels in the Overcooked-AI environment. This perspective starkly contrasts a large body of previous work that has trained and evaluated cooperating agents only on the same level, failing to capture generalisation abilities required for real-world human-AI cooperation. Our challenge interfaces with state-of-the-art dual curriculum design (DCD) methods to generate auto-curricula for training general agents in Overcooked. It is the first cooperative multi-agent environment specially designed for DCD methods and, consequently, the first benchmarked with state-of-the-art methods. It is fully GPU-accelerated, built on the DCD benchmark suite minimax, and freely available under an open-source license: https://git.hcics.simtech.uni-stuttgart.de/public-projects/OGC. We show that current DCD algorithms struggle to produce useful policies in this novel challenge, even if combined with recent network architectures that were designed for scalability and generalisability. The OGC pushes the boundaries of real-world human-AI cooperation by enabling the research community to study the impact of generalisation on cooperating agents.

agent, layout, proceedings, (9 more...)

arXiv.org Artificial Intelligence

2406.17949

Country:

Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.24)
North America > Canada > Quebec > Montreal (0.14)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
(11 more...)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (0.34)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.67)

Add feedback

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Penedo, Guilherme, Kydlíček, Hynek, allal, Loubna Ben, Lozhkov, Anton, Mitchell, Margaret, Raffel, Colin, Von Werra, Leandro, Wolf, Thomas

arXiv.org Artificial IntelligenceJun-25-2024

The performance of a large language model (LLM) depends heavily on the quality and size of its pretraining dataset. However, the pretraining datasets for state-of-the-art open LLMs like Llama 3 and Mixtral are not publicly available and very little is known about how they were created. In this work, we introduce FineWeb, a 15-trillion token dataset derived from 96 Common Crawl snapshots that produces better-performing LLMs than other open pretraining datasets. To advance the understanding of how best to curate high-quality pretraining datasets, we carefully document and ablate all of the design choices used in FineWeb, including in-depth investigations of deduplication and filtering strategies. In addition, we introduce FineWeb-Edu, a 1.3-trillion token collection of educational text filtered from FineWeb. LLMs pretrained on FineWeb-Edu exhibit dramatically better performance on knowledge- and reasoning-intensive benchmarks like MMLU and ARC. Along with our datasets, we publicly release our data curation codebase and all of the models trained during our ablation experiments.

dataset, fineweb, fineweb-edu, (14 more...)

arXiv.org Artificial Intelligence

2406.17557

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > Jordan (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(3 more...)

Genre:

Instructional Material (0.93)
Research Report (0.64)

Industry: Education > Educational Setting (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

The Effects of Embodiment and Personality Expression on Learning in LLM-based Educational Agents

Sonlu, Sinan, Bendiksen, Bennie, Durupinar, Funda, Güdükbay, Uğur

arXiv.org Artificial IntelligenceJun-24-2024

This work investigates how personality expression and embodiment affect personality perception and learning in educational conversational agents. We extend an existing personality-driven conversational agent framework by integrating LLM-based conversation support tailored to an educational application. We describe a user study built on this system to evaluate two distinct personality styles: high extroversion and agreeableness and low extroversion and agreeableness. For each personality style, we assess three models: (1) a dialogue-only model that conveys personality through dialogue, (2) an animated human model that expresses personality solely through dialogue, and (3) an animated human model that expresses personality through both dialogue and body and facial animations. The results indicate that all models are positively perceived regarding both personality and learning outcomes. Models with high personality traits are perceived as more engaging than those with low personality traits. We provide a comprehensive quantitative and qualitative analysis of perceived personality traits, learning parameters, and user experiences based on participant ratings of the model types and personality styles, as well as users' responses to open-ended questions.

agent, participant, personality, (14 more...)

arXiv.org Artificial Intelligence

2407.10993

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > New Jersey > Middlesex County > Piscataway (0.05)
(12 more...)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)
Instructional Material (1.00)
Research Report > Experimental Study (0.93)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)
Education > Curriculum (0.93)
Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

Data-driven Modeling in Metrology -- A Short Introduction, Current Developments and Future Perspectives

Schneider, Linda-Sophie, Krauss, Patrick, Schiering, Nadine, Syben, Christopher, Schielein, Richard, Maier, Andreas

arXiv.org Artificial IntelligenceJun-24-2024

Abstract: Mathematical models are vital to the field of metrology, playing a key role in the derivation of measurement results and the calculation of uncertainties from measurement data, informed by an understanding of the measurement process. These models generally represent the correlation between the quantity being measured and all other pertinent quantities. Such relationships are used to construct measurement systems that can interpret measurement data to generate conclusions and predictions about the measurement system itself. Classic models are typically analytical, built on fundamental physical principles. However, the rise of digital technology, expansive sensor networks, and high-performance computing hardware have led to a growing shift towards data-driven methodologies. This trend is especially prominent when dealing with large, intricate networked sensor systems in situations where there is limited expert understanding of the frequently changing real-world contexts. Here, we demonstrate the variety of opportunities that data-driven modeling presents, and how they have been already implemented in various real-world applications.

application, data-driven modeling, modeling, (13 more...)

arXiv.org Artificial Intelligence

2406.16659

Country:

Europe > Germany > Saarland > Saarbrücken (0.04)
North America > United States > New Jersey (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(5 more...)

Genre:

Research Report (1.00)
Overview (1.00)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Energy (1.00)
Education (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.93)
(2 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Modeling & Simulation (1.00)
Information Technology > Internet of Things (1.00)
(8 more...)

Add feedback