AITopics

2306.14087

Country:

Oceania > New Zealand (0.04)
North America > United States > California > Santa Clara County > Stanford (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory > Minimum Complexity Machines (0.30)

Duersch, Jed A., Catanach, Thomas A.

Parsimonious Inference

arXiv.org Machine Learning, Mar-2-2021

Bayesian inference provides a uniquely rigorous approach to obtain principled justification for uncertainty in predictions, yet it is difficult to articulate suitably general prior belief in the machine learning context, where computational architectures are pure abstractions subject to frequent modifications by practitioners attempting to improve results. Parsimonious inference is an information-theoretic formulation of inference over arbitrary architectures that formalizes Occam's Razor; we prefer simple and sufficient explanations. Our universal hyperprior assigns plausibility to prior descriptions, encoded as sequences of symbols, by expanding on the core relationships between program length, Kolmogorov complexity, and Solomonoff's algorithmic probability. We then cast learning as information minimization over our composite change in belief when an architecture is specified, training data are observed, and model parameters are inferred. By distinguishing model complexity from prediction information, our framework also quantifies the phenomenon of memorization. Although our theory is general, it is most critical when datasets are limited, e.g. small or skewed. We develop novel algorithms for polynomial regression and random forests that are suitable for such data, as demonstrated by our experiments. Our approaches combine efficient encodings with prudent sampling strategies to construct predictive ensembles without cross-validation, thus addressing a fundamental challenge in how to efficiently obtain predictions from data.
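The relationship between description length and plausibility mentioned in the abstract can be illustrated with a small sketch. This is not the authors' hyperprior; it only assumes the standard algorithmic-probability idea that a description of length n symbols (here, bits) receives weight proportional to 2^-n. The function name `length_prior` and the example bit strings are hypothetical.

```python
def length_prior(descriptions):
    """Assign each binary description a weight 2**(-length), then normalize.

    Illustrative assumption: descriptions are strings of '0'/'1' symbols and
    the candidate set is finite, so the weights can be normalized directly.
    """
    if not descriptions:
        raise ValueError("need at least one description")
    weights = {d: 2.0 ** -len(d) for d in descriptions}
    total = sum(weights.values())
    return {d: w / total for d, w in weights.items()}


# Shorter descriptions receive more prior mass than longer ones.
prior = length_prior(["0", "10", "110"])
for description, p in prior.items():
    print(description, round(p, 4))
```

Under this assumption, each extra bit of description halves the prior weight, which is one simple way to read "we prefer simple and sufficient explanations".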

complexity, information, prediction, (17 more...)

2103.02165

Country: North America > United States > California > Alameda County > Livermore (0.04)

Genre: Research Report > New Finding (0.67)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Energy (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

The Foundations of Deep Learning with a Path Towards General Intelligence

arXiv.org Artificial Intelligence, Jun-22-2018

Like any field of empirical science, AI may be approached axiomatically. We formulate requirements for a general-purpose, human-level AI system in terms of postulates. We review the methodology of deep learning, examining the explicit and tacit assumptions in deep learning research. Deep learning methodology seeks to overcome limitations in traditional machine learning research as it combines facets of model richness, generality, and practical applicability. The methodology so far has produced outstanding results due to a productive synergy of function approximation, under plausible assumptions of irreducibility, and the efficiency of the back-propagation family of algorithms. We examine these winning traits of deep learning, and also observe the various known failure modes of deep learning. We conclude by giving recommendations on how to extend deep learning methodology to cover the postulates of general-purpose AI, including modularity and cognitive architecture. We also relate deep learning to advances in theoretical neuroscience research.

artificial intelligence, deep learning, machine learning, (15 more...)

1806.08874

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)

Genre:

Overview (0.66)
Research Report (0.40)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Orseau, Laurent, McGill, Simon McGregor, Legg, Shane

Agents and Devices: A Relative Definition of Agency

arXiv.org Machine Learning, May-31-2018

According to Dennett, the same system may be described using a 'physical' (mechanical) explanatory stance, or using an 'intentional' (belief- and goal-based) explanatory stance. Humans tend to find the physical stance more helpful for certain systems, such as planets orbiting a star, and the intentional stance for others, such as living animals. We define a formal counterpart of physical and intentional stances within computational theory: a description of a system as either a device, or an agent, with the key difference being that 'devices' are directly described in terms of an input-output mapping, while 'agents' are described in terms of the function they optimise. Bayes' rule can then be applied to calculate the subjective probability of a system being a device or an agent, based only on its behaviour. We illustrate this using the trajectories of an object in a toy grid-world domain.
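The Bayes' rule step described in the abstract can be sketched for the two-hypothesis case. This is a minimal illustration, not the paper's model: the likelihood values, the equal default prior, and the function name `posterior_agent` are assumptions made here for the example.

```python
def posterior_agent(lik_agent, lik_device, prior_agent=0.5):
    """Posterior probability that a system is an 'agent' rather than a 'device'.

    lik_agent / lik_device: probability of the observed behaviour under each
    description (hypothetical inputs; the paper derives these from its own
    mixture models). prior_agent: prior probability of the agent hypothesis.
    """
    numerator = lik_agent * prior_agent
    evidence = numerator + lik_device * (1.0 - prior_agent)
    if evidence == 0.0:
        raise ValueError("observed behaviour has zero probability under both hypotheses")
    return numerator / evidence


# Behaviour four times as likely under the agent description, equal priors.
print(posterior_agent(lik_agent=0.2, lik_device=0.05))
```

With equal priors, the posterior depends only on the ratio of the two likelihoods, so a trajectory that is better explained by goal optimisation than by a direct input-output mapping shifts belief towards 'agent'.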

artificial intelligence, bayesian inference, machine learning, (17 more...)

1805.12387

Country: Europe (0.46)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

AITopics Original Links, Jan-18-2017, 10:13:30 GMT

Ray Solomonoff, Pioneer in Artificial Intelligence, Dies at 83

Ray Solomonoff, a physicist who was one of the founders of the field of artificial intelligence, died on Dec. 7 in Boston. He was 83 and had homes in New Ipswich, N.H., and Cambridge, Mass. The cause was a ruptured brain aneurysm, said his wife, Grace. As a child Mr. Solomonoff developed what would become a lifelong passion for mathematical theorems, and as a teenager he became captivated with the idea of creating machines that could learn and ultimately think. In 1952 he met Marvin Minsky, a cognitive scientist who was also exploring the idea of machine learning, and John McCarthy, a young mathematician.

artificial intelligence, machine learning, solomonoff, (6 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.26)
North America > United States > New Hampshire (0.06)
North America > United States > Illinois > Cook County > Chicago (0.06)
Europe > Switzerland (0.06)

Genre: Personal > Obituary (0.54)

Technology:

Information Technology > Artificial Intelligence > History (0.57)
Information Technology > Artificial Intelligence > Machine Learning (0.52)

Ultimate Intelligence Part II: Physical Measure and Complexity of Intelligence

arXiv.org Artificial Intelligence, May-11-2016

We continue our analysis of volume and energy measures that are appropriate for quantifying inductive inference systems. We extend logical depth and conceptual jump size measures in AIT to stochastic problems, and physical measures that involve volume and energy. We introduce a graphical model of computational complexity that we believe to be appropriate for intelligent machines. We show several asymptotic relations between energy, logical depth and volume of computation for inductive inference. In particular, we arrive at a "black-hole equation" of inductive inference, which relates energy, volume, space, and algorithmic information for an optimal inductive inference solution. We introduce energy-bounded algorithmic entropy. We briefly apply our ideas to the physical limits of intelligent computation in our universe.

artificial intelligence, computation, machine learning, (15 more...)

1504.03303

Country: Europe (0.68)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.47)

Ultimate Intelligence Part I: Physical Completeness and Objectivity of Induction

arXiv.org Artificial Intelligence, Apr-9-2015

We propose that Solomonoff induction is complete in the physical sense via several strong physical arguments. We also argue that Solomonoff induction is fully applicable to quantum mechanics. We show how to choose an objective reference machine for universal induction by defining a physical message complexity and physical message probability, and argue that this choice dissolves some well-known objections to universal induction. We also introduce many more variants of physical message complexity based on energy and action, and discuss the ramifications of our proposals. "If you wish to make an apple pie from scratch, you must first invent the universe."

artificial intelligence, machine learning, probability, (18 more...)

1501.00601

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

arXiv.org Machine Learning, Nov-23-2011

Falsification and future performance

Balduzzi, David

We show these capacity measures count the number of hypotheses about a dataset that a learning algorithm falsifies when it finds the classifier in its repertoire minimizing empirical risk. It then follows that the future performance of predictors on unseen data is controlled in part by how many hypotheses the learner falsifies. As a corollary we show that empirical VC-entropy quantifies the message length of the true hypothesis in the optimal code of a particular probability distribution, the so-called actual repertoire.

artificial intelligence, hypothesis, machine learning, (17 more...)

1111.5648

Country: Europe > Germany (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)

Diverse Consequences of Algorithmic Probability

arXiv.org Artificial Intelligence, Nov-7-2011

We reminisce and discuss applications of algorithmic probability to a wide range of problems in artificial intelligence, philosophy and technological society. We propose that Solomonoff has effectively axiomatized the field of artificial intelligence, therefore establishing it as a rigorous scientific discipline. We also relate to our own work in incremental machine learning and philosophy of complexity.

logic & formal reasoning, machine learning, solomonoff, (17 more...)

1107.2788

Country:

Europe (0.68)
North America > United States (0.46)
Asia > Middle East (0.46)

Genre: Research Report (0.40)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

arXiv.org Artificial Intelligence, Dec-1-2009

Is there an Elegant Universal Theory of Prediction?

Legg, Shane

Could there exist an elegant and universal theory of sequence prediction? Solomonoff's model of induction rapidly learns to make optimal predictions for any computable sequence, including probabilistic ones [13, 14]. Indeed the problem of sequence prediction could well be considered solved [9, 8], if it were not for the fact that Solomonoff's theoretical model is incomputable. Among computable theories there exist powerful general predictors, such as the Lempel-Ziv algorithm [5] and Context Tree Weighting [18], that can learn to predict some complex sequences, but not others. Some prediction methods, such as the Minimum Description Length principle [12] and the Minimum Message Length principle [17], can even be viewed as computable approximations to Solomonoff induction [10]. However in practice their power and generality are limited by the power of compression and coding methods employed, as well as having a significantly reduced data efficiency as compared to Solomonoff induction [11]. This work was supported by SNF grant 200020-107616.

artificial intelligence, machine learning, sequence, (17 more...)

cs/0606070

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory > Minimum Complexity Machines (0.35)