AITopics | Supervised Learning

Supervised Learning

Supervised learning is the machine learning task of learning a function that maps an input to an output based on example input-output pairs. (Wikipedia)
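As a rough illustration of that definition, the sketch below learns a function from a handful of input-output pairs by ordinary least squares. The helper name `fit_line` and the example pairs are hypothetical, chosen only for illustration; any other hypothesis class could stand in for the straight line.

```python
from statistics import mean


def fit_line(pairs):
    """Learn f(x) = a*x + b from (input, output) example pairs by least squares."""
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("need at least two distinct inputs")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in pairs)
    a = sxy / sxx            # slope
    b = y_bar - a * x_bar    # intercept
    return lambda x: a * x + b


# Hypothetical training examples generated by y = 2x + 1
examples = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]
f = fit_line(examples)
prediction = f(4.0)  # the learned function applied to an unseen input
```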

Automated Machine Learning for Positive-Unlabelled Learning

Saunders, Jack D., Freitas, Alex A.

arXiv.org Artificial Intelligence, Jan-12-2024

Positive-Unlabelled (PU) learning is a growing field of machine learning that aims to learn classifiers from data consisting of labelled positive and unlabelled instances, which can be in reality positive or negative, but whose label is unknown. An extensive number of methods have been proposed to address PU learning over the last two decades, so many that selecting an optimal method for a given PU learning task presents a challenge. Our previous work has addressed this by proposing GA-Auto-PU, the first Automated Machine Learning (Auto-ML) system for PU learning. In this work, we propose two new Auto-ML systems for PU learning: BO-Auto-PU, based on a Bayesian Optimisation approach, and EBO-Auto-PU, based on a novel evolutionary/Bayesian optimisation approach. We also present an extensive evaluation of the three Auto-ML systems, comparing them to each other and to well-established PU learning methods across 60 datasets (20 real-world datasets, each with 3 versions in terms of PU learning characteristics).
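To make the PU setting concrete, here is a minimal sketch of one classic correction, the label-frequency adjustment usually attributed to Elkan and Noto. It is not one of the Auto-PU systems from the abstract. It assumes labelled positives are selected completely at random, and that a classifier has already been trained to separate labelled from unlabelled instances; the score lists below are hypothetical outputs of such a classifier.

```python
def estimate_label_frequency(scores_on_labelled_positives):
    """Estimate c = P(labelled | positive) as the mean score on held-out known positives."""
    if not scores_on_labelled_positives:
        raise ValueError("need at least one labelled positive")
    return sum(scores_on_labelled_positives) / len(scores_on_labelled_positives)


def positive_probability(score, c):
    """Turn P(labelled | x) into P(positive | x) = P(labelled | x) / c, clipped to [0, 1]."""
    return min(1.0, max(0.0, score / c))


# Hypothetical scores P(labelled | x) from a labelled-vs-unlabelled classifier
held_out_positive_scores = [0.5, 0.4, 0.6]
c = estimate_label_frequency(held_out_positive_scores)

unlabelled_scores = [0.05, 0.25, 0.45]
probs = [positive_probability(s, c) for s in unlabelled_scores]
```

With these assumed scores, roughly half of the true positives are labelled, so each raw score is about doubled to recover the probability of being positive.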

auto-pu system, classifier, search space, (15 more...)

arXiv.org Artificial Intelligence

2401.06452

Country:

South America > Paraguay > Asunción > Asunción (0.05)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Wisconsin (0.04)
(3 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)
Overview (0.92)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.67)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
(3 more...)

Feature Network Methods in Machine Learning and Applications

Mu, Xinying, Kon, Mark

arXiv.org Machine Learning, Jan-9-2024

A machine learning (ML) feature network is a graph that connects ML features in learning tasks based on their similarity. This network representation allows us to view feature vectors as functions on the network. By leveraging function operations from Fourier analysis and from functional analysis, one can easily generate novel features, making use of the graph structure imposed on the feature vectors. Such network structures have previously been studied implicitly in image processing and computational biology. We thus describe feature networks as graph structures imposed on feature vectors, and provide applications in machine learning. One application involves graph-based generalizations of convolutional neural networks, involving structured deep learning with hierarchical representations of features that have varying depth or complexity. This also extends to learning algorithms that are able to generate useful new multilevel features. Additionally, we discuss the use of feature networks to engineer new features, which can enhance the expressiveness of the model. We give a specific example of a deep tree-structured feature network, where hierarchical connections are formed through feature clustering and feed-forward learning. This results in low learning complexity and computational efficiency. Unlike "standard" neural features, which are limited to modulated (thresholded) linear combinations of adjacent ones, feature networks offer more general feedforward dependencies among features. For example, radial basis functions or graph structure-based dependencies between features can be utilized.

artificial intelligence, feature vector, machine learning, (18 more...)

arXiv.org Machine Learning

2401.04874

Country: North America > United States > Massachusetts > Suffolk County > Boston (0.04)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.81)

Grand Canyon record set by 92-year-old after months of training

FOX News, Jan-7-2024, 09:00:21 GMT

A 92-year-old man is making headlines and setting records after successfully completing a nearly 24-mile hike across the Grand Canyon in Arizona. Alfredo Aliaga Burdio, who currently resides in Berlin, completed his record-setting trek on Oct. 15, 2023. The journey earned Burdio the title of oldest person to cross the Grand Canyon rim-to-rim on foot (male), according to an announcement by Guinness World Records on New Year's Day. The journey lasted a total of 34 hours and 2 minutes, including 21 hours and 15 minutes of actual hiking time.

burdio, grand canyon, guinness world record, (14 more...)

FOX News

Country:

North America > United States > Arizona (0.25)
North America > United States > Texas (0.05)

Industry: Media > News (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.63)

GEqO: ML-Accelerated Semantic Equivalence Detection

Haynes, Brandon, Alotaibi, Rana, Pavlenko, Anna, Leeka, Jyoti, Jindal, Alekh, Tian, Yuanyuan

arXiv.org Artificial Intelligence, Jan-2-2024

Large-scale analytics engines have become a core dependency for modern data-driven enterprises to derive business insights and drive actions. These engines support a large number of analytic jobs processing huge volumes of data on a daily basis, and workloads are often inundated with overlapping computations across multiple jobs. Reusing common computation is crucial for efficient cluster resource utilization and reducing job execution time. Detecting common computation is the first and key step for reducing this computational redundancy. However, detecting equivalence on large-scale analytics engines requires efficient and scalable solutions that are fully automated. In addition, to maximize computation reuse, equivalence needs to be detected at the semantic level instead of just the syntactic level (i.e., the ability to detect semantic equivalence of seemingly different-looking queries). Unfortunately, existing solutions fall short of satisfying these requirements. In this paper, we take a major step towards filling this gap by proposing GEqO, a portable and lightweight machine-learning-based framework for efficiently identifying semantically equivalent computations at scale. GEqO introduces two machine-learning-based filters that quickly prune out nonequivalent subexpressions and employs a semi-supervised learning feedback loop to iteratively improve its model with an intelligent sampling mechanism. Further, with its novel database-agnostic featurization method, GEqO can transfer the learning from one workload and database to another. Our extensive empirical evaluation shows that, on TPC-DS-like queries, GEqO yields significant performance gains (up to 200x faster than automated verifiers) and finds up to 2x more equivalences than optimizer and signature-based equivalence detection approaches.

artificial intelligence, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3626710

2401.0128

Genre: Research Report (0.82)

Industry: Energy > Oil & Gas (0.34)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)
(4 more...)

Kernel Density Estimation for Multiclass Quantification

Moreo, Alejandro, González, Pablo, del Coz, Juan José

arXiv.org Machine Learning, Jan-2-2024

Quantification (variously called learning to quantify or class prevalence estimation) is the area of supervised machine learning concerned with estimating the percentages of instances from a population (hereafter, a bag of examples) belonging to each of the classes of interest [González et al., 2017, Esuli et al., 2023]. Quantification finds applications in many disciplines, like the social sciences, epidemiology, or market research, in which the interest lies at the aggregate level, i.e., in which inferring characteristics of the single individual (e.g., via classification, or via regression) is of little concern since knowing group-level information is all we need. Despite the fact that binary quantification (i.e., the setting in which the classes of interest are positive vs. negative) has been, by far, the most studied scenario in the quantification literature [Card and Smith, 2018, Forman, 2008, Bella et al., 2010, Esuli and Sebastiani, 2015, Hassan et al., 2020, Moreo and Sebastiani, 2021], the truth is that many of the applications of quantification naturally arise in the multiclass regime, i.e., in cases in which there are more than two mutually exclusive classes. Examples of multiclass settings are ubiquitous, and may include the allocation of human resources to different departments in a company [Forman, 2005], the analysis of different phytoplankton species that could exist in a water sample [González et al., 2019], or the analysis of the various causes of death studied in verbal autopsies [King and Lu, 2008], to name a few. A more concrete example could consist of providing answers to questions like: "What is the percentage of tweets conveying positive, neutral, and negative opinions concerning a specific hashtag?"
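The tweet question above can be sketched with the simplest quantification baseline, classify and count, which reports the fraction of instances assigned to each class. This is not the kernel density estimation method of the paper. The second helper shows the well-known binary adjusted-count correction based on a classifier's true and false positive rates. All function names, predictions, and rates here are hypothetical.

```python
from collections import Counter


def classify_and_count(predicted_labels, classes):
    """Estimate class prevalences as the share of predictions falling in each class."""
    n = len(predicted_labels)
    if n == 0:
        raise ValueError("the bag of examples is empty")
    counts = Counter(predicted_labels)
    return {c: counts[c] / n for c in classes}


def adjusted_count(cc_positive, tpr, fpr):
    """Binary correction: solve cc = tpr*p + fpr*(1-p) for the true prevalence p."""
    if tpr == fpr:
        raise ValueError("classifier is uninformative (tpr equals fpr)")
    return min(1.0, max(0.0, (cc_positive - fpr) / (tpr - fpr)))


# Hypothetical classifier predictions for a bag of ten tweets
predictions = ["positive"] * 5 + ["neutral"] * 3 + ["negative"] * 2
prevalence = classify_and_count(predictions, ["positive", "neutral", "negative"])

# Hypothetical binary case: 50% predicted positive, with assumed tpr = 0.8 and fpr = 0.2
corrected = adjusted_count(0.5, tpr=0.8, fpr=0.2)
```

Classify and count inherits the classifier's bias when class prevalences shift between training and deployment, which is why corrections and dedicated quantifiers exist.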

histogram, multiclass quantification, posterior probability, (15 more...)

arXiv.org Machine Learning

2401.0049

Country:

North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(8 more...)

Genre:

Research Report > Experimental Study (0.92)
Research Report > New Finding (0.67)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
(3 more...)

Universal consistency of the $k$-NN rule in metric spaces and Nagata dimension. II

Kumari, Sushma, Pestov, Vladimir G.

arXiv.org Artificial Intelligence, Dec-30-2023

We continue to investigate the $k$ nearest neighbour learning rule in separable metric spaces. Thanks to the results of Cérou and Guyader (2006) and Preiss (1983), this rule is known to be universally consistent in every metric space $X$ that is sigma-finite dimensional in the sense of Nagata. Here we show that the rule is strongly universally consistent in such spaces in the absence of ties. Under the tie-breaking strategy applied by Devroye, Györfi, Krzyżak, and Lugosi (1994) in the Euclidean setting, we manage to show the strong universal consistency in non-Archimedean metric spaces (that is, those of Nagata dimension zero). Combining the theorem of Cérou and Guyader with results of Assouad and Quentin de Gromard (2006), one deduces that the $k$-NN rule is universally consistent in metric spaces having finite dimension in the sense of de Groot. In particular, the $k$-NN rule is universally consistent in the Heisenberg group, which is not sigma-finite dimensional in the sense of Nagata, as follows from an example independently constructed by Korányi and Reimann (1995) and Sawyer and Wheeden (1992).
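A minimal sketch of the $k$-NN rule over an arbitrary metric may help show why ties matter. Distance ties are broken here by sample index, which is a simplifying assumption and not necessarily the strategy referenced in the abstract. The 2-adic distance on integers serves as a small example of a non-Archimedean metric, where many sample points sit at exactly the same distance from a query. All names and data are hypothetical.

```python
from collections import Counter


def knn_classify(query, sample, k, metric):
    """Majority vote among the k sample points nearest to query.

    sample is a list of (point, label) pairs. Equal distances are resolved
    in favour of the point that appears earlier in the sample.
    """
    if not 1 <= k <= len(sample):
        raise ValueError("k must be between 1 and the sample size")
    order = sorted(range(len(sample)),
                   key=lambda i: (metric(query, sample[i][0]), i))
    votes = Counter(sample[i][1] for i in order[:k])
    return votes.most_common(1)[0][0]


def two_adic_distance(a, b):
    """2-adic distance on integers: 2**(-v), where 2**v is the largest power of 2 dividing a - b."""
    d = abs(a - b)
    if d == 0:
        return 0.0
    v = (d & -d).bit_length() - 1  # number of trailing zero bits of d
    return 2.0 ** -v


# Hypothetical labelled sample of integers
sample = [(0, "even"), (4, "even"), (8, "even"), (1, "odd"), (3, "odd"), (5, "odd")]

label_12 = knn_classify(12, sample, k=3, metric=two_adic_distance)
label_7 = knn_classify(7, sample, k=3, metric=two_adic_distance)
```

For the query 12, the points 0 and 8 are both at distance 0.25, so the index-based rule decides their order among the neighbours; with an odd $k$ and two classes the vote itself cannot tie.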

classifier, dimension, metric space, (14 more...)

arXiv.org Artificial Intelligence

2305.17282

Country:

North America > United States > New York (0.04)
South America > Brazil > Santa Catarina > Florianópolis (0.04)
South America > Brazil > Paraíba > João Pessoa (0.04)
(5 more...)

Genre: Research Report (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (1.00)

Spectral Persistent Homology: Persistence Signals

Van Huffel, Michael Etienne, Palo, Matteo

arXiv.org Machine Learning, Dec-28-2023

In this paper, we present a novel family of descriptors for persistence diagrams, reconceptualizing them as signals in $\mathbb R^2_+$. This marks a significant advancement in Topological Data Analysis. Our methodology transforms persistence diagrams into a finite-dimensional vector space through functionals of the discrete measures induced by these diagrams. While our focus is primarily on frequency-based transformations, we do not restrict our approach exclusively to this type of technique. We term this family of transformations Persistence Signals and prove stability for some members of this family against the 1-Kantorovitch-Rubinstein metric, ensuring its responsiveness to subtle data variations. Extensive comparative analysis reveals that our descriptor performs competitively with the current state of the art from the topological data analysis literature, and often surpasses existing methods. This research not only introduces a groundbreaking perspective for data scientists but also establishes a foundation for future innovations in applying persistence diagrams in data analysis and machine learning.

dataset, persistence diagram, persistence signal, (14 more...)

arXiv.org Machine Learning

2312.17093

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States (0.04)
Europe > Spain > Andalusia > Granada Province > Granada (0.04)

Genre:

Overview (0.93)
Research Report > New Finding (0.68)
Research Report > Promising Solution (0.46)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Data Science > Data Quality > Data Transformation (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.34)

2024 will break the extreme temperature records set in 2023

New Scientist, Dec-27-2023, 18:00:00 GMT

The past year was the hottest on record, but 2023 is unlikely to hold that dubious honour for long. "We've never had a big El Niño like this on the background of global warming," says Adam Scaife at the Met Office, the UK's national…

extreme temperature record, warming

New Scientist

Country:

Europe > United Kingdom (0.34)
Pacific Ocean (0.14)
Europe > Spain (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.40)

Using Enriched Category Theory to Construct the Nearest Neighbour Classification Algorithm

Pugh, Matthew, Grundy, Jo, Cirstea, Corina, Harris, Nick

arXiv.org Artificial Intelligence, Dec-27-2023

This paper explores whether Enriched Category Theory could provide the foundation of an alternative approach to Machine Learning. It is the first to construct and motivate a Machine Learning algorithm solely with Enriched Category Theory. In order to supplement evidence that Category Theory can be used to motivate robust and explainable algorithms, it is shown that a series of reasonable assumptions about a dataset lead to the construction of the Nearest Neighbours Algorithm. In particular, the algorithm arises as an extension of the original dataset using profunctors in the category of Lawvere metric spaces. This leads to a definition of an Enriched Nearest Neighbours Algorithm, which consequently also produces an enriched form of the Voronoi diagram. This paper is intended to be accessible without any knowledge of Category Theory.

category, category theory, nearest neighbour algorithm, (12 more...)

arXiv.org Artificial Intelligence

2312.16529

Country: Europe > United Kingdom (0.15)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.39)

Observable Propagation: A Data-Efficient Approach to Uncover Feature Vectors in Transformers

Dunefsky, Jacob, Cohan, Arman

arXiv.org Artificial Intelligence, Dec-26-2023

A key goal of current mechanistic interpretability research in NLP is to find linear features (also called "feature vectors") for transformers: directions in activation space corresponding to concepts that are used by a given model in its computation. Present state-of-the-art methods for finding linear features require large amounts of labelled data, which is both laborious to acquire and computationally expensive to utilize. In this work, we introduce a novel method, called "observable propagation" (in short: ObsProp), for finding linear features used by transformer language models in computing a given task, using almost no data. Our paradigm centers on the concept of observables, linear functionals corresponding to given tasks. We then introduce a mathematical theory for the analysis of feature vectors: we provide theoretical motivation for why LayerNorm nonlinearities do not affect the direction of feature vectors; we also introduce a similarity metric between feature vectors called the coupling coefficient, which estimates the degree to which one feature's output correlates with another's. We use ObsProp to perform extensive qualitative investigations into several tasks, including gendered occupational bias, political party prediction, and programming language detection. Our results suggest that ObsProp surpasses traditional approaches for finding feature vectors in the low-data regime, and that ObsProp can be used to better understand the mechanisms responsible for bias in large language models. Code for experiments can be found at github.com/jacobdunefsky/ObservablePropagation.

excerpt, feature vector, highest-activating token, (12 more...)

arXiv.org Artificial Intelligence

2312.16291

Country:

North America > United States (0.14)
Europe > Denmark > Capital Region > Kongens Lyngby (0.14)
Europe > United Kingdom > England > Cornwall > Isles of Scilly (0.04)
(10 more...)

Genre: Research Report > New Finding (1.00)

Industry: Government (1.00)

Technology:

Information Technology > Data Science > Data Mining > Feature Extraction (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
