AITopics | Supervised Learning

Supervised Learning

Supervised learning is the machine learning task of learning a function that maps an input to an output based on example input-output pairs. (Wikipedia)

Smoothed Embeddings for Certified Few-Shot Learning

Pautov, Mikhail, Kuznetsova, Olesya, Tursynbek, Nurislam, Petiushko, Aleksandr, Oseledets, Ivan

arXiv.org Artificial Intelligence, Feb-2-2022

Randomized smoothing is considered to be the state-of-the-art provable defense against adversarial perturbations. However, it heavily exploits the fact that classifiers map input objects to class probabilities, and it does not address models that learn a metric space in which classification is performed by computing distances to embeddings of class prototypes. In this work, we extend randomized smoothing to few-shot learning models that map inputs to normalized embeddings. We provide an analysis of the Lipschitz continuity of such models and derive a robustness certificate against $\ell_2$-bounded perturbations that may be useful in few-shot learning scenarios. Our theoretical results are confirmed by experiments on different datasets.
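As a rough illustration of the idea in this abstract, the sketch below estimates a smoothed embedding by averaging a normalized embedding over Gaussian input noise, then classifies by distance to class prototypes. The base model `embed`, the weight matrix `W`, the noise level `sigma`, and the sample count are all hypothetical choices made for the example; the paper's actual model and its certificate computation are not reproduced here.

```python
import numpy as np


def normalize(v):
    """Scale vectors along the last axis to unit l2 norm."""
    return v / np.linalg.norm(v, axis=-1, keepdims=True)


def embed(x, W):
    """Hypothetical base model: a small nonlinear map followed by l2 normalization."""
    return normalize(np.tanh(x @ W))


def smoothed_embedding(x, W, sigma=0.5, n_samples=1000, seed=0):
    """Monte Carlo estimate of the embedding averaged over Gaussian input noise."""
    rng = np.random.default_rng(seed)
    noise = rng.normal(scale=sigma, size=(n_samples, x.shape[-1]))
    return embed(x[None, :] + noise, W).mean(axis=0)


def classify(x, prototypes, W, **kwargs):
    """Return the index of the prototype closest to the smoothed embedding."""
    g = smoothed_embedding(x, W, **kwargs)
    return int(np.argmin(np.linalg.norm(prototypes - g, axis=1)))


# Toy usage with made-up weights and two made-up class prototypes.
W = np.random.default_rng(1).normal(size=(4, 3))
x = np.ones(4)
prototypes = np.stack([embed(np.ones(4), W), embed(-np.ones(4), W)])
g = smoothed_embedding(x, W)
label = classify(x, prototypes, W)
```

Because the smoothed embedding is an average of unit vectors, its norm is at most 1; this sketch only estimates the embedding and does not compute a robustness radius.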

algorithm 1, prototype, smoothed embedding, (14 more...)

arXiv.org Artificial Intelligence

2202.01186

Country:

Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
Asia > Russia (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.34)

Probability estimation and structured output prediction for learning preferences in last mile delivery

Canoy, Rocsildes, Bucarey, Victor, Molenbruch, Yves, Mulamba, Maxime, Mandi, Jayanta, Guns, Tias

arXiv.org Artificial Intelligence, Jan-25-2022

We study the problem of learning the preferences of drivers and planners in the context of last mile delivery. Given a data set containing historical decisions and delivery locations, the goal is to capture the implicit preferences of the decision-makers. We consider two ways to use the historical data: one is through a probability estimation method that learns transition probabilities between stops (or zones). This is a fast and accurate method, recently studied in a VRP setting. Furthermore, we explore the use of machine learning to infer how to best balance multiple objectives such as distance, probability and penalties. Specifically, we cast the learning problem as a structured output prediction problem, where training is done by repeatedly calling the TSP solver. Another important aspect we consider is that for last-mile delivery, every address is a potential client and hence the data is very sparse. Hence, we propose a two-stage approach that first learns preferences at the zone level in order to compute a zone routing; after which a penalty-based TSP computes the stop routing. Results show that the zone transition probability estimation performs well, and that the structured output prediction learning can improve the results further. We hence showcase a successful combination of both probability estimation and machine learning, all the while using standard TSP solvers, both during learning and to compute the final solution; this means the methodology is applicable to other, real-life, TSP variants, or proprietary solvers.
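The probability estimation step described here can be pictured as counting observed zone-to-zone transitions in historical routes and normalizing the counts. The following is a minimal sketch under that assumption; the function name and the toy routes are invented for illustration, and the paper's actual estimator may differ (for example, in how it handles unseen transitions).

```python
from collections import Counter, defaultdict


def estimate_transition_probabilities(routes):
    """Estimate P(next zone | current zone) from historical zone sequences."""
    counts = defaultdict(Counter)
    for route in routes:
        # Count each consecutive pair (current, next) along the route.
        for current, nxt in zip(route, route[1:]):
            counts[current][nxt] += 1
    # Normalize each row of counts into a probability distribution.
    return {
        zone: {nxt: c / sum(followers.values()) for nxt, c in followers.items()}
        for zone, followers in counts.items()
    }


# Toy historical routes over hypothetical zones A-D.
routes = [["A", "B", "C"], ["A", "B", "D"], ["A", "C", "B"]]
probs = estimate_transition_probabilities(routes)
```

A routing cost could then be derived from these probabilities (for instance, a negative log-probability per arc) and handed to a standard TSP solver, which is the kind of combination the abstract describes.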

matrix, probability, transition probability, (12 more...)

arXiv.org Artificial Intelligence

2201.10269

Country:

South America > Chile > O'Higgins Region > Cachapoal Province > Rancagua (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(3 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Transportation (0.47)
Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.47)
(3 more...)

Theoretical analysis and computation of the sample Frechet mean for sets of large graphs based on spectral information

Ferguson, Daniel, Meyer, Francois G.

arXiv.org Machine Learning, Jan-15-2022

To characterize the location (mean, median) of a set of graphs, one needs a notion of centrality that is adapted to metric spaces, since graph sets are not Euclidean spaces. A standard approach is to consider the Frechet mean. In this work, we equip a set of graphs with the pseudometric defined by the norm between the eigenvalues of their respective adjacency matrix. Unlike the edit distance, this pseudometric reveals structural changes at multiple scales, and is well adapted to studying various statistical problems for graph-valued data. We describe an algorithm to compute an approximation to the sample Frechet mean of a set of undirected unweighted graphs with a fixed size using this pseudometric.
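To make the pseudometric concrete, the sketch below computes the distance between two graphs as the norm of the difference between their sorted adjacency eigenvalues. The choice of the $\ell_2$ norm and the use of the full spectrum are assumptions for the example; the abstract only says "the norm between the eigenvalues".

```python
import numpy as np


def adjacency_spectrum(A):
    """Eigenvalues of a symmetric adjacency matrix, sorted in decreasing order."""
    return np.sort(np.linalg.eigvalsh(A))[::-1]


def spectral_distance(A, B):
    """l2 norm between the sorted adjacency spectra (assumed choice of norm)."""
    return float(np.linalg.norm(adjacency_spectrum(A) - adjacency_spectrum(B)))


# Path graph on 3 nodes, the same graph with relabeled nodes, and a triangle.
path = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
relabeled_path = np.array([[0, 1, 1], [1, 0, 0], [1, 0, 0]], dtype=float)
triangle = np.array([[0, 1, 1], [1, 0, 1], [1, 1, 0]], dtype=float)

d_same = spectral_distance(path, relabeled_path)
d_diff = spectral_distance(path, triangle)
```

The relabeled path is at distance zero from the original even though the two matrices differ, which is why this is a pseudometric rather than a metric: distinct graphs with the same spectrum cannot be told apart.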

Frechet mean, eigenvalue, graph, (11 more...)

arXiv.org Machine Learning

2201.05923

Country:

Europe > Switzerland > Basel-City > Basel (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Communications (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.34)

Generalized Shape Metrics on Neural Representations

Williams, Alex H., Kunz, Erin, Kornblith, Simon, Linderman, Scott W.

arXiv.org Machine Learning, Jan-12-2022

Understanding the operation of biological and artificial networks remains a difficult and important challenge. To identify general principles, researchers are increasingly interested in surveying large collections of networks that are trained on, or biologically adapted to, similar tasks. A standardized set of analysis tools is now needed to identify how network-level covariates -- such as architecture, anatomical brain region, and model organism -- impact neural representations (hidden layer activations). Here, we provide a rigorous foundation for these analyses by defining a broad family of metric spaces that quantify representational dissimilarity. Using this framework we modify existing representational similarity measures based on canonical correlation analysis to satisfy the triangle inequality, formulate a novel metric that respects the inductive biases in convolutional layers, and identify approximate Euclidean embeddings that enable network representations to be incorporated into essentially any off-the-shelf machine learning method. We demonstrate these methods on large-scale datasets from biology (Allen Institute Brain Observatory) and deep learning (NAS-Bench-101). In doing so, we identify relationships between neural representations that are interpretable in terms of anatomical features and model performance.

matrix, neural representation, representation, (14 more...)

arXiv.org Machine Learning

2110.14739

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(6 more...)

Genre: Research Report > New Finding (0.67)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.35)

DeHIN: A Decentralized Framework for Embedding Large-scale Heterogeneous Information Networks

Imran, Mubashir, Yin, Hongzhi, Chen, Tong, Huang, Zi, Zheng, Kai

arXiv.org Artificial Intelligence, Jan-7-2022

Modeling heterogeneity by extracting and exploiting high-order information from heterogeneous information networks (HINs) has attracted immense research attention in recent times. Such heterogeneous network embedding (HNE) methods effectively harness the heterogeneity of small-scale HINs. However, in the real world, the size of HINs grows exponentially with the continuous introduction of new nodes and different types of links, turning them into billion-scale networks. Learning node embeddings on such HINs creates a performance bottleneck for existing HNE methods, which are commonly centralized, i.e., the complete data and the model both reside on a single machine. To address large-scale HNE tasks with strong efficiency and effectiveness guarantees, we present the Decentralized Embedding Framework for Heterogeneous Information Network (DeHIN) in this paper. In DeHIN, we generate a distributed parallel pipeline that utilizes hypergraphs to infuse parallelization into the HNE task. DeHIN presents a context-preserving partition mechanism that innovatively formulates a large HIN as a hypergraph, whose hyperedges connect semantically similar nodes. Our framework then adopts a decentralized strategy to efficiently partition HINs using a tree-like pipeline. Each resulting subnetwork is then assigned to a distributed worker, which employs the deep information maximization theorem to locally learn node embeddings from the partition it receives. We further devise a novel embedding alignment scheme to precisely project independently learned node embeddings from all subnetworks onto a common vector space, thus allowing for downstream tasks like link prediction and node classification.

dehin, node, subnetwork, (15 more...)

arXiv.org Artificial Intelligence

2201.02757

Country:

Oceania > Australia > Queensland > Brisbane (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Education (0.68)
Information Technology (0.67)
Health & Medicine (0.46)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(5 more...)

6 months after Biden touted 'independence' from COVID-19, cases set records

FOX News, Jan-5-2022, 17:31:46 GMT

Fox News White House correspondent Jacqui Heinrich discusses the Biden administration's failure to deliver at-home COVID tests on 'Special Report.' It's been six months since President Biden said the U.S. was close to declaring "independence from COVID-19," and yet the pandemic still shows no signs of slowing after the country set a global record for the number of cases Monday due to the spread of the highly transmissible omicron variant. The U.S. reported more than 1 million new coronavirus infections on Monday, setting a global record and almost doubling the previous record set last week. Hospitalizations have also skyrocketed across the country, but deaths have held relatively steady in recent weeks. Biden gave a speech Tuesday maintaining his position that "this continues to be a pandemic of the unvaccinated," even though breakthrough cases of COVID-19 among people who are fully vaccinated continue to rise across the country as new variants emerge.

biden, covid-19, independence, (15 more...)

FOX News

Country:

North America > United States > District of Columbia > Washington (0.27)
North America > United States > Wisconsin > Milwaukee County > Milwaukee (0.05)
North America > United States > Illinois > Cook County > Chicago (0.05)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.61)

#008 Shallow Neural Network - Master Data Science

#artificialintelligence, Dec-29-2021, 19:38:01 GMT

In this post we will see how to vectorize across multiple training examples. The outcome will be similar to what we saw in Logistic Regression. These equations tell us how, when given an input feature vector $x$, we can generate predictions. If we have $m$ training examples, we need to repeat this process $m$ times. The notation $a^{[2](i)}$ means that we are talking about the activation in the second layer that comes from the $i^{th}$ training example.
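The difference between repeating the forward pass $m$ times and vectorizing it can be sketched in NumPy as follows. The layer sizes, the tanh hidden activation, and the sigmoid output are assumptions made for this example (the excerpt does not specify them); training examples are stacked as the columns of `X`.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def forward_loop(X, W1, b1, W2, b2):
    """Compute a^{[2](i)} one training example at a time."""
    m = X.shape[1]
    A2 = np.zeros((W2.shape[0], m))
    for i in range(m):
        x = X[:, i:i + 1]                 # i-th example as a column vector
        a1 = np.tanh(W1 @ x + b1)         # a^{[1](i)}
        A2[:, i:i + 1] = sigmoid(W2 @ a1 + b2)
    return A2


def forward_vectorized(X, W1, b1, W2, b2):
    """Same computation for all m examples at once; column i is a^{[2](i)}."""
    A1 = np.tanh(W1 @ X + b1)             # b1 broadcasts across the m columns
    return sigmoid(W2 @ A1 + b2)


# Toy shapes: 3 input features, 4 hidden units, 1 output, m = 5 examples.
rng = np.random.default_rng(0)
X = rng.normal(size=(3, 5))
W1, b1 = rng.normal(size=(4, 3)), np.zeros((4, 1))
W2, b2 = rng.normal(size=(1, 4)), np.zeros((1, 1))

A2_loop = forward_loop(X, W1, b1, W2, b2)
A2_vec = forward_vectorized(X, W1, b1, W2, b2)
```

Both functions produce the same matrix of activations; the vectorized version replaces the explicit loop over examples with matrix products.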

bmatrix, textbf, training example, (14 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

CORE: A Knowledge Graph Entity Type Prediction Method via Complex Space Regression and Embedding

Ge, Xiou, Wang, Yun-Cheng, Wang, Bin, Kuo, C. -C. Jay

arXiv.org Artificial Intelligence, Dec-19-2021

Research on knowledge graph (KG) construction, completion, inference, and applications has grown rapidly in recent years, since KGs offer a powerful tool for modeling human knowledge in graph form. Nodes in KGs denote entities, and links represent relations between entities. The basic building blocks of a KG are entity-relation triples in the form of (subject, predicate, object), introduced by the Resource Description Framework (RDF). Learning representations for entities and relations in low-dimensional vector spaces is one of the most active research topics in the field. Entity type offers a valuable piece of information to KG learning tasks. Better results in KG-related tasks have been achieved with the help of entity type. For example, TKRL [1] uses a hierarchical type encoder for KG completion by incorporating entity type information. AutoETER [2] adopts a similar approach but encodes the type information with projection matrices. Based on DistMult [3] and ComplEx [4] embedding, [5] propose an improved factorization model without explicit type supervision.

entity type prediction, prediction, type prediction, (12 more...)

arXiv.org Artificial Intelligence

2112.10067

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > Singapore (0.04)
North America > Canada (0.04)
(3 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.35)

Improving scripts with a memory of natural feedback

Tandon, Niket, Madaan, Aman, Clark, Peter, Yang, Yiming

arXiv.org Artificial Intelligence, Dec-16-2021

How can an end-user provide feedback if a deployed structured prediction model generates incorrect output? Our goal is to allow users to correct errors directly through interaction, without retraining, by giving feedback on the model's output. We create a dynamic memory architecture with a growing memory of feedback about errors in the output. Given a new, unseen input, our model can use feedback from a similar, past erroneous state. On a script generation task, we show empirically that the model learns to apply feedback effectively (up to 30 points improvement), while avoiding similar past mistakes after deployment (up to 10 points improvement on an unseen set). This is a first step towards strengthening deployed models, potentially broadening their utility.

dataset, interaction, natural feedback, (15 more...)

arXiv.org Artificial Intelligence

2112.09737

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Washington > King County > Seattle (0.04)
(2 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.35)

Distance and Hop-wise Structures Encoding Enhanced Graph Attention Networks

Huang, Zhiguo, Chen, Xiaowei, Wang, Bojuan

arXiv.org Artificial Intelligence, Dec-6-2021

Many works have proven that existing neighbor-averaging Graph Neural Networks (GNNs) cannot efficiently capture structure information; in some cases, such GNNs cannot even capture degree features. The reason is intuitive: because neighbor-averaging GNNs can only combine the neighbors' feature vectors for every node, if those feature vectors contain no structure information, hop-wise neighbor-averaging GNNs can capture degree information at best ([1]; [2]; [3]). Intuitively, then, injecting structure information into feature vectors may improve the performance of GNNs. Numerous works have shown that injecting structure, distance, position, or spatial information can significantly improve the performance of neighbor-averaging GNNs ([4]; [5]; [6]; [7]; [8]; [9]; [10]). However, existing works have their problems. Some have very high computational complexity and cannot be applied to large-scale graphs (MotifNet [4]). Some simply concatenate structure information with the intrinsic feature vector (ID-GNN [6]; P-GNN [8]; DE-GNN [9]), which may confuse the signals of different features. For example, in the ogbn-arxiv dataset, the intrinsic feature is a semantic embedding of the headline or abstract, which provides a totally different signal from structure information. Some are oriented toward graph-level tasks and only deal with small graphs (Graphormer [7]; SubGNN [10]).

correct and smooth 0, information, structure information, (12 more...)

arXiv.org Artificial Intelligence

2112.02868

Country:

Asia > China > Tianjin Province > Tianjin (0.04)
Asia > China > Zhejiang Province > Hangzhou (0.04)

Genre: Research Report (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)
