AITopics | Dietterich, Thomas

Collaborating Authors

Dietterich, Thomas

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

WithdrarXiv: A Large-Scale Dataset for Retraction Study

Rao, Delip, Young, Jonathan, Dietterich, Thomas, Callison-Burch, Chris

arXiv.org Artificial IntelligenceDec-4-2024

Retractions play a vital role in maintaining scientific integrity, yet systematic studies of retractions in computer science and other STEM fields remain scarce. We present WithdrarXiv, the first large-scale dataset of withdrawn papers from arXiv, containing over 14,000 papers and their associated retraction comments spanning the repository's entire history through September 2024. Through careful analysis of author comments, we develop a comprehensive taxonomy of retraction reasons, identifying 10 distinct categories ranging from critical errors to policy violations. We demonstrate a simple yet highly accurate zero-shot automatic categorization of retraction reasons, achieving a weighted average F1-score of 0.96. Additionally, we release WithdrarXiv-SciFy, an enriched version including scripts for parsed full-text PDFs, specifically designed to enable research in scientific feasibility studies, claim verification, and automated theorem proving. These findings provide valuable insights for improving scientific quality control and automated verification systems. Finally, and most importantly, we discuss ethical issues and take a number of steps to implement responsible data release while fostering open science in this area.

category, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2412.03775

Country: North America > United States (1.00)

Genre: Research Report (0.85)

Industry:

Health & Medicine (1.00)
Government > Regional Government > North America Government > United States Government (0.94)
Law (0.70)
Information Technology > Security & Privacy (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Add feedback

Benchmarking Neural Network Robustness to Common Corruptions and Perturbations

Hendrycks, Dan, Dietterich, Thomas

arXiv.org Machine LearningMar-28-2019

In this paper we establish rigorous benchmarks for image classifier robustness. MAGENET-C,standardizes and expands the corruption robustness topic, while showing which classifiers are preferable in safety-critical applications. MAGENET-Pwhich enables researchers to benchmark a classifier's robustness to common perturbations. Unlike recent robustness research, this benchmark evaluates performance on common corruptions and perturbations not worst-case adversarial perturbations. We find that there are negligible changes in relative corruption robustness from AlexNet classifiers to ResNet classifiers. Afterward we discover ways to enhance corruption and perturbation robustness. We even find that a bypassed adversarial defense provides substantial common perturbation robustness. Together our benchmarks may aid future work toward networks that robustly generalize. The human vision system is robust in ways that existing computer vision systems are not (Recht et al., 2018; Azulay & Weiss, 2018). Unlike current deep learning classifiers (Krizhevsky et al., 2012; He et al., 2015; Xie et al., 2016), the human vision system is not fooled by small changes in query images. Humans are also not confused by many forms of corruption such as snow, blur, pixelation, and novel combinations of these. Humans can even deal with abstract changes in structure and style. Achieving these kinds of robustness is an important goal for computer vision and machine learning.

deep learning, neural network, robustness, (21 more...)

arXiv.org Machine Learning

1903.12261

Country: North America > United States > California (0.14)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Sample-Based Tree Search with Fixed and Adaptive State Abstractions

Hostetler, Jesse, Fern, Alan, Dietterich, Thomas

Journal of Artificial Intelligence ResearchDec-14-2017

Sample-based tree search (SBTS) is an approach to solving Markov decision problems based on constructing a lookahead search tree using random samples from a generative model of the MDP. It encompasses Monte Carlo tree search (MCTS) algorithms like UCT as well as algorithms such as sparse sampling. SBTS is well-suited to solving MDPs with large state spaces due to the relative insensitivity of SBTS algorithms to the size of the state space. The limiting factor in the performance of SBTS tends to be the exponential dependence of sample complexity on the depth of the search tree. The number of samples required to build a search tree is O((|A|B)^d), where |A| is the number of available actions, B is the number of possible random outcomes of taking an action, and d is the depth of the tree. State abstraction can be used to reduce B by aggregating random outcomes together into abstract states. Recent work has shown that abstract tree search often performs substantially better than tree search conducted in the ground state space. This paper presents a theoretical and empirical evaluation of tree search with both fixed and adaptive state abstractions. We derive a bound on regret due to state abstraction in tree search that decomposes abstraction error into three components arising from properties of the abstraction and the search algorithm. We describe versions of popular SBTS algorithms that use fixed state abstractions, and we introduce the Progressive Abstraction Refinement in Sparse Sampling (PARSS) algorithm, which adapts its abstraction during search. We evaluate PARSS as well as sparse sampling with fixed abstractions on 12 experimental problems, and find that PARSS outperforms search with a fixed abstraction and that search with even highly inaccurate fixed abstractions outperforms search without abstraction. These results establish progressive abstraction refinement as a promising basis for new tree search algorithms, and we propose directions for future work within the progressive refinement framework.

abstraction, artificial intelligence, machine learning, (19 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.5483

AI Access Foundation

11096

Journal of Artificial Intelligence Research

Country: North America > United States > Oregon (0.14)

Genre: Research Report > New Finding (0.92)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

A Meta-Analysis of the Anomaly Detection Problem

Emmott, Andrew, Das, Shubhomoy, Dietterich, Thomas, Fern, Alan, Wong, Weng-Keen

arXiv.org Artificial IntelligenceAug-26-2016

This article provides a thorough meta-analysis of the anomaly detection problem. To accomplish this we first identify approaches to benchmarking anomaly detection algorithms across the literature and produce a large corpus of anomaly detection benchmarks that vary in their construction across several dimensions we deem important to real-world applications: (a) point difficulty, (b) relative frequency of anomalies, (c) clusteredness of anomalies, and (d) relevance of features. We apply a representative set of anomaly detection algorithms to this corpus, yielding a very large collection of experimental results. We analyze these results to understand many phenomena observed in previous work. First we observe the effects of experimental design on experimental results. Second, results are evaluated with two metrics, ROC Area Under the Curve and Average Precision. We employ statistical hypothesis testing to demonstrate the value (or lack thereof) of our benchmarks. We then offer several approaches to summarizing our experimental results, drawing several conclusions about the impact of our methodology as well as the strengths and weaknesses of some algorithms. Last, we compare results against a trivial solution as an alternate means of normalizing the reported performance of algorithms. The intended contributions of this article are many; in addition to providing a large publicly-available corpus of anomaly detection benchmarks, we provide an ontology for describing anomaly detection contexts, a methodology for controlling various aspects of benchmark creation, guidelines for future experimental design and a discussion of the many potential pitfalls of trying to measure success in this field.

algorithm, oncology, us government, (21 more...)

arXiv.org Artificial Intelligence

1503.01158

Country: North America > United States (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Oncology (0.92)
Government (0.68)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)

Add feedback

Automatic Discovery and Transfer of Task Hierarchies in Reinforcement Learning

Mehta, Neville (Oregon State University) | Ray, Soumya (Case Western Reserve University) | Tadepalli, Prasad (Oregon State University) | Dietterich, Thomas (Oregon State University)

AI MagazineApr-18-2011

A principal one among them is the existence of multiple domains that share the same underlying causal structure for actions. We describe an approach that exploits this shared causal structure to discover a hierarchical task structure in a source domain, which in turn speeds up learning of task execution knowledge in a new target domain. Our approach is theoretically justified and compares favorably to manually designed task hierarchies in learning efficiency in the target domain. We demonstrate that causally motivated task hierarchies transfer more robustly than other kinds of detailed knowledge that depend on the idiosyncrasies of the source domain and are hence less transferable.

artificial intelligence, reinforcement learning, task hierarchy, (6 more...)

AI Magazine

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Add feedback

Automatic Discovery and Transfer of Task Hierarchies in Reinforcement Learning

Mehta, Neville (Oregon State University) | Ray, Soumya (Case Western Reserve University) | Tadepalli, Prasad (Oregon State University) | Dietterich, Thomas (Oregon State University)

AI MagazineApr-18-2011

Sequential decision tasks present many opportunities for the study of transfer learning. A principal one among them is the existence of multiple domains that share the same underlying causal structure for actions. We describe an approach that exploits this shared causal structure to discover a hierarchical task structure in a source domain, which in turn speeds up learning of task execution knowledge in a new target domain. Our approach is theoretically justiﬁed and compares favorably to manually designed task hierarchies in learning efﬁciency in the target domain. We demonstrate that causally motivated task hierarchies transfer more robustly than other kinds of detailed knowledge that depend on the idiosyncrasies of the source domain and are hence less transferable.

hierarchy, planning & scheduling, us government, (21 more...)

AI Magazine

Country: North America > United States (1.00)

Industry:

Leisure & Entertainment (0.46)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Add feedback

An Ensemble Learning and Problem Solving Architecture for Airspace Management

Zhang, Xiaoqin (Shelly) (University of Massachusetts) | Yoon, Sungwook (Arizona State University) | DiBona, Phillip (Lockheed Martin Advanced Technology Laboratories) | Appling, Darren (Georgia Institute of Technology) | Ding, Li (Rensselaer Polytechnic Institute) | Doppa, Janardhan (Oregon State University) | Green, Derek (University of Wyoming) | Guo, Jinhong (Lockheed Martin Advanced Technology Laboratories) | Kuter, Ugur (University of Maryland) | Levine, Geoff (University of Illinois at Urbana) | MacTavish, Reid (Georgia Institute of Technology) | McFarlane, Daniel (Lockheed Martin Advanced Technology Laboratories) | Michaelis, James (Rensselaer Polytechnic Institute) | Mostafa, Hala (University of Massachusetts) | Ontanon, Santiago (Georgia Institute of Technology) | Parker, Charles (Georgia Institute of Technology) | Radhakrishnan, Jainarayan (University of Wyoming) | Rebguns, Anton (University of Massachusetts) | Shrestha, Bhavesh (Fujitsu Laboratories of America) | Song, Zhexuan (Georgia Institute of Technology) | Trewhitt, Ethan (University of Massachusetts) | Zafar, Huzaifa (University of Massachusetts) | Zhang, Chongjie (University of Massachusetts) | Corkill, Daniel (University of Illinois at Urbana-Champaign) | DeJong, Gerald (Oregon State University) | Dietterich, Thomas (Arizona State University) | Kambhampati, Subbarao (University of Massachusetts) | Lesser, Victor (Rensselaer Polytechnic Institute) | McGuinness, Deborah L. (Georgia Institute of Technology) | Ram, Ashwin (University of Wyoming) | Spears, Diana (Oregon State University) | Tadepalli, Prasad (Georgia Institute of Technology) | Whitaker, Elizabeth (Oregon State University) | Wong, Weng-Keen (Rensselaer Polytechnic Institute) | Hendler, James (Lockheed Martin Advanced Technology Laboratories) | Hofmann, Martin (Lockheed Martin Advanced Technology Laboratories) | Whitebread, Kenneth

AAAI ConferencesJul-14-2009

In this paper we describe the application of a novel learning and problem solving architecture to the domain of airspace management, where multiple requests for the use of airspace need to be reconciled and managed automatically. The key feature of our "Generalized Integrated Learning Architecture" (GILA) is a set of integrated learning and reasoning (ILR) systems coordinated by a central meta-reasoning executive (MRE). Each ILR learns independently from the same training example and contributes to problem-solving in concert with other ILRs as directed by the MRE. Formal evaluations show that our system performs as well as or better than humans after learning from the same training data. Further, GILA outperforms any individual ILR run in isolation, thus demonstrating the power of the ensemble architecture for learning and problem solving.

air transportation, conflict, neural network, (19 more...)

AAAI Conferences

Twenty-First IAAI Conference

Country: North America > United States > Massachusetts (0.14)

Genre:

Instructional Material > Course Syllabus & Notes (0.68)
Research Report (0.46)

Industry:

Transportation > Infrastructure & Services (0.72)
Transportation > Air (0.72)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)

Add feedback

Improving the Performance of Radial Basis Function Networks by Learning Center Locations

Wettschereck, Dietrich, Dietterich, Thomas

Neural Information Processing SystemsDec-31-1992

Three methods for improving the performance of (gaussian) radial basis function (RBF) networks were tested on the NETtaik task. In RBF, a new example is classified by computing its Euclidean distance to a set of centers chosen by unsupervised methods. The application of supervised learning to learn a non-Euclidean distance metric was found to reduce the error rate of RBF networks, while supervised learning of each center's variance resulted in inferior performance. The best improvement in accuracy was achieved by networks called generalized radial basis function (GRBF) networks. In GRBF, the center locations are determined by supervised learning. After training on 1000 words, RBF classifies 56.5% of letters correct, while GRBF scores 73.4% letters correct (on a separate test set). From these and other experiments, we conclude that supervised learning of center locations can be very important for radial basis function learning.

inductive learning, neural network, supervised learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Oregon (0.29)
North America > United States > California (0.28)

Genre: Research Report > New Finding (0.48)

Industry: Education > Educational Setting (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

Add feedback

Improving the Performance of Radial Basis Function Networks by Learning Center Locations

Wettschereck, Dietrich, Dietterich, Thomas

Neural Information Processing SystemsDec-31-1992

inductive learning, neural network, supervised learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Oregon (0.29)
North America > United States > California (0.28)

Genre: Research Report > New Finding (0.48)

Industry: Education > Educational Setting (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

Add feedback

Improving the Performance of Radial Basis Function Networks by Learning Center Locations

Wettschereck, Dietrich, Dietterich, Thomas

Neural Information Processing SystemsDec-31-1992

Three methods for improving the performance of (gaussian) radial basis function (RBF) networks were tested on the NETtaik task. In RBF, a new example is classified by computing its Euclidean distance to a set of centers chosen by unsupervised methods. The application of supervised learning to learn a non-Euclidean distance metric was found to reduce the error rate of RBF networks, while supervised learning of each center's variance resultedin inferior performance. The best improvement in accuracy was achieved by networks called generalized radial basis function (GRBF) networks. In GRBF, the center locations are determined by supervised learning. After training on 1000 words, RBF classifies 56.5% of letters correct, while GRBF scores 73.4% letters correct (on a separate test set). From these and other experiments, we conclude that supervised learning of center locations can be very important for radial basis function learning.

inductive learning, neural network, supervised learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Oregon (0.29)
North America > United States > California (0.28)

Genre: Research Report > New Finding (0.48)

Industry: Education > Educational Setting (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

Add feedback