AITopics | Mathematical & Statistical Methods

Collaborating Authors

Mathematical & Statistical Methods

News Overviews Instructional Materials AI-Alerts Classics

A Causal Research Pipeline and Tutorial for Psychologists and Social Scientists

arXiv.org Machine LearningJun-24-2022

Causality is a fundamental part of the scientific endeavour to understand the world. Unfortunately, causality is still taboo in much of psychology and social science. Motivated by a growing number of recommendations for the importance of adopting causal approaches to research, we reformulate the typical approach to research in psychology to harmonize inevitably causal theories with the rest of the research pipeline. We present a new process which begins with the incorporation of techniques from the confluence of causal discovery and machine learning for the development, validation, and transparent formal specification of theories. We then present methods for reducing the complexity of the fully specified theoretical model into the fundamental submodel relevant to a given target hypothesis. From here, we establish whether or not the quantity of interest is estimable from the data, and if so, propose the use of semi-parametric machine learning methods for the estimation of causal effects. The overall goal is the presentation of a new research pipeline which can (a) facilitate scientific inquiry compatible with the desire to test causal theories (b) encourage transparent representation of our theories as unambiguous mathematical objects, (c) to tie our statistical models to specific attributes of the theory, thus reducing under-specification problems frequently resulting from the theory-to-model gap, and (d) to yield results and estimates which are causally meaningful and reproducible. The process is demonstrated through didactic examples with real-world data, and we conclude with a summary and discussion of limitations.

artificial intelligence, cit, machine learning, (18 more...)

arXiv.org Machine Learning

2206.05175

Country:

Europe > Ukraine > Kyiv Oblast > Kyiv (0.04)
North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(6 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.67)
Health & Medicine > Epidemiology (0.46)
Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.87)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

When Satisfiability Solving Meets Symbolic Computation

Communications of the ACMJun-23-2022, 08:35:35 GMT

Plotkin, M. Binary codes with specified minimum distance.

matrix, projective plane, solver, (16 more...)

Communications of the ACM

AI-Alerts: 2022 > 2022-06 > AAAI AI-Alert for Jun 29, 2022 (1.00)

Country:

North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)
North America > United States > Illinois (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.68)

Add feedback

Smart Biostatistics

#artificialintelligenceJun-19-2022, 09:05:07 GMT

The registration fee is 49 € per person. Don't hesitate to contact us directly using the form on this page.

smart biostatistic

#artificialintelligence

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

Add feedback

Hurwitz-Riemann Zeta And Other Special Probability Distributions - AI Summary

#artificialintelligenceJun-18-2022, 01:21:18 GMT

All the solutions were probability distributions, and in this article we introduce an even larger, generic class of problems (chaotic discrete dynamical systems) with known solution. Each dynamical system discussed here (or in my previous article) comes with two distributions: The name Hurwitz and Riemann-Zeta is just a reminder of their strong connection to number theory problems such as continued fractions, approximation of irrational numbers by rational ones, the construction and distribution of the digits of random numbers in various numeration systems, and the famous Riemann Hypothesis that has a one million dollar prize attached to it. The most well known probability distribution related to these functions is the discrete Zipf distribution. The author defines a family of distribution that generalizes the exponential power, normal, gamma, Weibull, Rayleigh, Maxwell-Boltzmann and chi-squared distributions, with applications in actuarial sciences. Our Hurwitz-Riemann Zeta distribution is yet another example arising this time from discrete dynamical systems, continuous on [0, 1].

discrete dynamical system, dynamical system, functional equation, (15 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.78)

Add feedback

Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning

Wiltzer, Harley, Meger, David, Bellemare, Marc G.

arXiv.org Machine LearningJun-17-2022

Continuous-time reinforcement learning offers an appealing formalism for describing control problems in which the passage of time is not naturally divided into discrete increments. Here we consider the problem of predicting the distribution of returns obtained by an agent interacting in a continuous-time, stochastic environment. Accurate return predictions have proven useful for determining optimal policies for risk-sensitive control, learning state representations, multiagent coordination, and more. We begin by establishing the distributional analogue of the Hamilton-Jacobi-Bellman (HJB) equation for It\^o diffusions and the broader class of Feller-Dynkin processes. We then specialize this equation to the setting in which the return distribution is approximated by $N$ uniformly-weighted particles, a common design choice in distributional algorithms. Our derivation highlights additional terms due to statistical diffusivity which arise from the proper handling of distributions in the continuous-time setting. Based on this, we propose a tractable algorithm for approximately solving the distributional HJB based on a JKO scheme, which can be implemented in an online control algorithm. We demonstrate the effectiveness of such an algorithm in a synthetic control problem.

artificial intelligence, distributional hamilton-jacobi-bellman equation, machine learning, (1 more...)

arXiv.org Machine Learning

2205.12184

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.60)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.53)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

Add feedback

Planning Courses for Student Success at the American College of Greece

Christou, Ioannis T., Vagianou, Evgenia, Vardoulias, George

arXiv.org Artificial IntelligenceJun-16-2022

We model the problem of optimizing the schedule of courses a student at the American College of Greece will need to take to complete their studies. We model all constraints set forth by the institution and the department, so that we guarantee the validity of all produced schedules. We formulate several different objectives to optimize in the resulting schedule, including fastest completion time, course difficulty balance, and so on, with a very important objective our model is capable of capturing being the maximization of the expected student GPA given their performance on passed courses using Machine Learning and Data Mining techniques. All resulting problems are Mixed Integer Linear Programming problems with a number of binary variables that is in the order of the maximum number of terms times the number of courses available for the student to take. The resulting Mathematical Programming problem is always solvable by the GUROBI solver in less than 10 seconds on a modern commercial off-the-self PC, whereas the manual process that was installed before used to take department heads that are designated as student advisors more than one hour of their time for every student and was resulting in sub-optimal schedules as measured by the objectives set forth.

constraint, objective, student, (15 more...)

arXiv.org Artificial Intelligence

2207.02659

Country:

South America > Chile (0.04)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
North America > United States > Kentucky (0.04)
(4 more...)

Genre:

Instructional Material > Course Syllabus & Notes (1.00)
Research Report (0.64)

Industry: Education > Educational Setting > Higher Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.46)

Add feedback

Classification of Stochastic Processes with Topological Data Analysis

#artificialintelligenceJun-10-2022, 06:13:23 GMT

We used the raw, statistical and the topological features to classify time series sampled from different stochastic processes. In our simulation experiments we sampled times series from Wiener and Cauchy processes in both balanced and unbalanced sampling schemes. We then compared machine learning classification models built on topological features and statistical features we engineered on the sampled time series. The results show that the engineered topological features perform consistently better than statistical or raw features in building machine learning classification models even when a given dataset is unbalanced. Our experimental result show that the topologically engineered features alone can distinguish between different stochastic processes, even when statistical or raw features do not.

classification, stochastic process, topological data analysis, (4 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.93)

Add feedback

Linear Algebra and Optimization for Machine Learning: A Textbook: Aggarwal, Charu C.: 9783030403461: Books - Amazon

#artificialintelligenceJun-9-2022, 07:06:13 GMT

PDF has better equation formatting than kindle. Charu Aggarwal is a Distinguished Research Staff Member (DRSM) at the IBM T. J. Watson Research Center in Yorktown Heights, New York. He has worked extensively in the field of data mining, with particular interests in data streams, privacy, uncertain data and social network analysis. He has published 19 (8 authored and 11 edited) books, over 400 papers in refereed venues, and has applied for or been granted over 80 patents. Because of the commercial value of the above-mentioned patents, he has received several invention achievement awards and has thrice been designated a Master Inventor at IBM.

associate editor, linear algebra and optimization, recipient, (13 more...)

#artificialintelligence

Country:

North America > United States > New York (0.26)
North America > United States > Massachusetts (0.06)

Genre: Personal > Honors (0.34)

Industry:

Information Technology (1.00)
Retail > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.91)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.42)

Add feedback

Classification of Stochastic Processes with Topological Data Analysis

Güzel, İsmail, Kaygun, Atabey

arXiv.org Artificial IntelligenceJun-8-2022

In this study, we examine if engineered topological features can distinguish time series sampled from different stochastic processes with different noise characteristics, in both balanced and unbalanced sampling schemes. We compare our classification results against the results of the same classification tasks built on statistical and raw features. We conclude that in classification tasks of time series, different machine learning models built on engineered topological features perform consistently better than those built on standard statistical and raw features.

artificial intelligence, machine learning, topological data analysis, (2 more...)

arXiv.org Artificial Intelligence

doi: 10.1002/cpe.7732

2206.03973

Genre: Research Report (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.60)

Add feedback

Computing the Variance of Shuffling Stochastic Gradient Algorithms via Power Spectral Density Analysis

Domingo-Enrich, Carles

arXiv.org Machine LearningJun-1-2022

When solving finite-sum minimization problems, two common alternatives to stochastic gradient descent (SGD) with theoretical benefits are random reshuffling (SGD-RR) and shuffle-once (SGD-SO), in which functions are sampled in cycles without replacement. Under a convenient stochastic noise approximation which holds experimentally, we study the stationary variances of the iterates of SGD, SGD-RR and SGD-SO, whose leading terms decrease in this order, and obtain simple approximations. To obtain our results, we study the power spectral density of the stochastic gradient noise sequences. Our analysis extends beyond SGD to SGD with momentum and to the stochastic Nesterov's accelerated gradient method. We perform experiments on quadratic objective functions to test the validity of our approximation and the correctness of our findings.

artificial intelligence, machine learning, shuffling stochastic gradient algorithm, (3 more...)

arXiv.org Machine Learning

2206.00632

Genre: Research Report > New Finding (0.53)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.80)

Add feedback