AITopics | linear rule

Collaborating Authors

linear rule

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization

Qin, Tian, Saphra, Naomi, Alvarez-Melis, David

arXiv.org Artificial IntelligenceDec-19-2024

Language models (LMs), like other neural networks, often favor shortcut heuristics based on surface-level patterns. Although LMs behave like n-gram models early in training, they must eventually learn hierarchical syntactic representations to correctly apply grammatical rules out-of-distribution (OOD). In this work, we use case studies of English grammar to explore how complex, diverse training data drives models to generalize OOD. We construct a framework that unifies our understanding of random variation with training dynamics, rule selection with memorization, and data diversity with complexity. We show that these factors are nuanced, and that intermediate levels of diversity and complexity lead to inconsistent behavior across random seeds and to unstable training dynamics. Our findings emphasize the critical role of training data in shaping generalization patterns and illuminate how competing model strategies lead to inconsistent generalization outcomes across random seeds.

machine learning, natural language, training data, (19 more...)

arXiv.org Artificial Intelligence

2412.04619

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Latvia > Lubāna Municipality > Lubāna (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically

Ahuja, Kabir, Balachandran, Vidhisha, Panwar, Madhur, He, Tianxing, Smith, Noah A., Goyal, Navin, Tsvetkov, Yulia

arXiv.org Artificial IntelligenceMay-31-2024

Natural language is structured hierarchically: words are grouped into phrases or constituents, which can be further grouped to form higher-level phrases up to the full sentence. How well do the neural network models trained on language data learn this phrase structure of human language has been a subject of great interest. A flurry of past work have shown that syntax trees can be recovered from recurrent neural network (RNN) and transformer-based models trained on large-scale language corpora (Tenney et al., 2019, Peters et al., 2018, Lin et al., 2019, Wu et al., 2020). While these studies provide useful evidence of the aforementioned phenomenon, they do not shed light on the architectural choices, training paradigms or dataset characteristics that lead models to learn the phrase structure of language. A useful tool to understand these model and dataset specific properties is through the test for hierarchical generalization, i.e., evaluating the capability of a model to generalize to novel syntactic forms, which were unseen during training. A classic problem to test for hierarchical generalization is question formation, where given a declarative sentence, e.g., My walrus does move the dogs that do wait., the task is to transform it into a question: Does my walrus move the dogs that do wait? The task is accomplished by moving one auxiliary verb to the front. The correct choice to move does in this example (rather than do), is predicted both by a hierarchical rule based on the phrase-structure syntax of the sentence, and by a linear rule that says to move the first auxiliary. Hence, as a test for hierarchical generalization, we can ask, for neural networks trained from scratch on data that is consistent with both hierarchical and linear rules (i.e.,

generalization, grammar, hierarchical generalization, (17 more...)

arXiv.org Artificial Intelligence

2404.16367

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > Ontario > Toronto (0.04)
Europe > Italy > Tuscany > Florence (0.04)
(8 more...)

Genre: Research Report > New Finding (1.00)

Add feedback

Post-hoc Bias Scoring Is Optimal For Fair Classification

Chen, Wenlong, Klochkov, Yegor, Liu, Yang

arXiv.org Machine LearningOct-9-2023

We consider a binary classification problem under group fairness constraints, which can be one of Demographic Parity (DP), Equalized Opportunity (EOp), or Equalized Odds (EO). We propose an explicit characterization of Bayes optimal classifier under the fairness constraints, which turns out to be a simple modification rule of the unconstrained classifier. Namely, we introduce a novel instancelevel measure of bias, which we call bias score, and the modification rule is a simple linear rule on top of the finite amount of bias scores. Based on this characterization, we develop a post-hoc approach that allows us to adapt to fairness constraints while maintaining high accuracy. In the case of DP and EOp constraints, the modification rule is thresholding a single bias score, while in the case of EO constraints we are required to fit a linear modification rule with 2 parameters. The method can also be applied for composite group-fairness criteria, such as ones involving several sensitive attributes. We achieve competitive or better performance compared to both in-processing and post-processing methods across three datasets: Adult, COMPAS, and CelebA. Unlike most post-processing methods, we do not require access to sensitive attributes during the inference time. Significant improvements have been made in classification tasks using machine learning (ML) algorithms. With ML algorithms being deployed in more and more decision-making applications, it is crucial to ensure fairness in their predictions. Although the debate on what is fairness and how to measure it is ongoing (Caton & Haas, 2023), oftentimes group fairness measures are utilized in practice due to the simplicity of their verification (Chouldechova, 2017; Hardt et al., 2016a), which conform to the intuition that predictions should not be biased toward a specific group of the population.

artificial intelligence, constraint, machine learning, (17 more...)

arXiv.org Machine Learning

2310.05725

Country: North America > United States > California > Santa Cruz County > Santa Cruz (0.04)

Genre: Research Report (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Conditional Linear Regression

Calderon, Diego, Juba, Brendan, Li, Sirui, Li, Zongyi, Ruan, Lisa

arXiv.org Machine LearningJun-6-2018

Linear regression is the task of modeling the relationship between a result variable and some explanatory variables by a linear rule. Linear regression is a standard tool of statistical analysis, with widespread applications spanning essentially all of the sciences. While the standard linear regression task seeks to model the majority of the data, we consider problems where a regression fit could exist for some subset or portion of the data, that does not necessarily model the majority of the data. We will consider cases in which the subset with a linear model is described by some simple condition; in other words, we desire to perform linear regression on this conditional distribution. Note that neither the condition nor the model is known in advance.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Machine Learning

1806.02326

Country: North America > United States (1.00)

Genre: Research Report (0.50)

Industry: Health & Medicine (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Conditional Linear Regression

Calderon, Diego (University of Arkansas) | Juba, Brendan (Washington University in St. Louis) | Li, Zongyi (Washington University in St. Louis) | Ruan, Lisa (M.I.T.)

AAAI ConferencesFeb-8-2018

In this case, we would be interested used in biological and social sciences to predict events and to in identifying a segment of the population for which describe possible relationships between variables. When addressing a linear rule is highly predictive of the price of certain cars, the task of prediction, machine learning and statistics whereas this linear rule may not provide a good prediction commonly focus on capturing the vast majority of data, overall in the larger population. Let us imagine that for this occasionally ignoring a segment of the population as "outliers" data set, and for a target fraction of the population, we found or "noise," which could be helpful to better understand a simple rule that describes the subpopulation, along with the data. Previous work by Juba (2016) gave an algorithm its linear fit.

algorithm, artificial intelligence, machine learning, (14 more...)

AAAI Conferences

Thirty-Second AAAI Conference on Artificial Intelligence

Country:

Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.28)
Asia > Afghanistan > Parwan Province > Charikar (0.08)
North America > United States > New York > New York County > New York City (0.05)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Conditional Sparse Linear Regression

Juba, Brendan

arXiv.org Machine LearningAug-17-2016

Linear regression, the fitting of linear relationships among variables in a data set, is a standard tool in data analysis. In particular, for the sake of interpretability and utility in further analysis, we desire to find highly sparse linear relationships, i.e., involving only a few variables. Of course, such simple linear relationships often will not hold across an entire population. But, more frequently there will exist conditions - perhaps a range of parameters or a segment of a larger population - under which such sparse models fit the data quite well. For example, Rosenfeld et al. [16] used data mining heuristics to identify small segments of a population in which a few additional risk factors were highly predictive of certain kinds of cancer, whereas these same risk factors were not significant in the overall population. Simple rules for special cases may also hint at the more complex general rules. More generally, we need to develop new techniques to reason about populations in which most members are atypical in some way, which are colloquially (and somewhat abusively) referred to as long-tailed distributions. We are seeking principled alternatives to ad-hoc approaches such as trying a variety of methods for clustering the data and hoping that the identified clusters can be modeled well.

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

1608.05152

Country:

North America > United States (0.14)
Africa (0.14)

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.34)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.63)

Add feedback

Polynomial Rewritings for Linear Existential Rules

Gottlob, Georg (University of Oxford) | Manna, Marco (University of Calabria) | Pieris, Andreas (Vienna University of Technology)

AAAI ConferencesJul-15-2015

We consider the scenario of ontology-based query answering. It is generally accepted that true scalability in this setting can only be achieved via query rewriting, which in turn allows for the exploitation of standard RDBMSs. In this work, we close two open fundamental questions related to query rewriting. We establish that linear existential rules are polynomially combined rewritable, while full linear rules are polynomially (purely) rewritable; in both cases, the target query language consists of first-order or non-recursive Datalog queries. An immediate consequence of our results is that DLR-Lite_R, the extension of DL-Lite_R with n-ary roles, is polynomially combined rewritable.

linear rule, proof generator, query, (13 more...)

AAAI Conferences

Twenty-Fourth International Joint Conference on Artificial Intelligence

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > Italy > Calabria (0.04)
Europe > Austria > Vienna (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.56)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.49)

Add feedback