AITopics | Mathematical & Statistical Methods

Collaborating Authors

Mathematical & Statistical Methods

News Overviews Instructional Materials AI-Alerts Classics

Calculus. No Linear Algebra? Someone please clear this for me. • /r/MachineLearning

@machinelearnbotMay-19-2016, 07:50:45 GMT

Are you sure it was me? I don't recall ever going to CMU. Although when I was in high school I was working in Arby's and I saw a customer who looked just like me and he was wearing a CMU hoodie. I thought at the time that he might be from the future, although I'm not sure why I'd ever go to an Arby's again.

artificial intelligence, linear algebra, machinelearning, (3 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

Add feedback

New book: Doing Data Science - Straight Talk from the Frontline

@machinelearnbotMay-11-2016, 04:25:08 GMT

Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that's so clouded in hype? This insightful book, based on Columbia University's Introduction to Data Science class, tells you what you need to know. In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you're familiar with linear algebra, probability, and statistics, and have programming experience, this book is an ideal introduction to data science.

artificial intelligence, data science, straight talk, (11 more...)

@machinelearnbot

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.69)

Add feedback

Learning theory estimates with observations from general stationary stochastic processes

Hang, Hanyuan, Feng, Yunlong, Steinwart, Ingo, Suykens, Johan A. K.

arXiv.org Machine LearningMay-10-2016

This paper investigates the supervised learning problem with observations drawn from certain general stationary stochastic processes. Here by \emph{general}, we mean that many stationary stochastic processes can be included. We show that when the stochastic processes satisfy a generalized Bernstein-type inequality, a unified treatment on analyzing the learning schemes with various mixing processes can be conducted and a sharp oracle inequality for generic regularized empirical risk minimization schemes can be established. The obtained oracle inequality is then applied to derive convergence rates for several learning schemes such as empirical risk minimization (ERM), least squares support vector machines (LS-SVMs) using given generic kernels, and SVMs using Gaussian kernels for both least squares and quantile regression. It turns out that for i.i.d.~processes, our learning rates for ERM recover the optimal rates. On the other hand, for non-i.i.d.~processes including geometrically $\alpha$-mixing Markov processes, geometrically $\alpha$-mixing processes with restricted decay, $\phi$-mixing processes, and (time-reversed) geometrically $\mathcal{C}$-mixing processes, our learning rates for SVMs with Gaussian kernels match, up to some arbitrarily small extra term in the exponent, the optimal rates. For the remaining cases, our rates are at least close to the optimal rates. As a by-product, the assumed generalized Bernstein-type inequality also provides an interpretation of the so-called "effective number of observations" for various mixing processes.

artificial intelligence, inequality, machine learning, (15 more...)

arXiv.org Machine Learning

1605.02887

Country:

Europe (0.93)
North America > United States (0.47)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Add feedback

Linear Algebra Formulas for Econometrics

@machinelearnbotMay-4-2016, 04:11:58 GMT

Econometrics is fundamental to many of the problems that data scientists care about, and it requires many skills. There's philosophical skill, for thinking about whether fixed effects or random effects models are more appropriate, for example, or what the direction of causality in a particular problem is. There's some coding, including knowing the right commands to interact with statistical programs like Stata or R, and how to interpret their output. There's the intuition to know which policy issues are worth researching, the political skill to obtain data or grant money, even the writing skill to communicate ideas. And "beneath" it all there is linear algebra: matrix formulas for the estimators that are reported, interpreted, and acted on.

artificial intelligence, econometrics, estimator, (5 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.64)

Add feedback

Fast nonlinear embeddings via structured matrices

Choromanski, Krzysztof, Fagan, Francois

arXiv.org Machine LearningApr-25-2016

We present a new paradigm for speeding up randomized computations of several frequently used functions in machine learning. In particular, our paradigm can be applied for improving computations of kernels based on random embeddings. Above that, the presented framework covers multivariate randomized functions. As a byproduct, we propose an algorithmic approach that also leads to a significant reduction of space complexity. Our method is based on careful recycling of Gaussian vectors into structured matrices that share properties of fully random matrices. The quality of the proposed structured approach follows from combinatorial properties of the graphs encoding correlations between rows of these structured matrices. Our framework covers as special cases already known structured approaches such as the Fast Johnson-Lindenstrauss Transform, but is much more general since it can be applied also to highly nonlinear embeddings. We provide strong concentration results showing the quality of the presented paradigm.

artificial intelligence, machine learning, matrix, (17 more...)

arXiv.org Machine Learning

1604.07356

Country: North America > Canada > British Columbia (0.14)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.67)

Add feedback

Newton's Laws of Marriage

The New YorkerApr-20-2016, 13:00:13 GMT

I wish we could derive the rest of the phænomena of nature by the same kind of reasoning from mechanical principles. LAW I: A body in motion will be kept in motion. A body at rest will be asked what its plans for the day are. The First Law deals primarily with inertia--which is often mistakenly identified as "relaxing"--and the different ways one body can affect another inert (and perfectly content) body. Conversely, it states that a body in motion will be kept in motion with a list of errands, written on the back of an envelope, before that body "becomes one with the couch for the rest of the day," which seems like an unnecessary characterization.

artificial intelligence, marriage, reaction, (5 more...)

The New Yorker

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

Add feedback

Constructive Preference Elicitation by Setwise Max-margin Learning

Teso, Stefano, Passerini, Andrea, Viappiani, Paolo

arXiv.org Machine LearningApr-20-2016

In this paper we propose an approach to preference elicitation that is suitable to large configuration spaces beyond the reach of existing state-of-the-art approaches. Our setwise max-margin method can be viewed as a generalization of max-margin learning to sets, and can produce a set of "diverse" items that can be used to ask informative queries to the user. Moreover, the approach can encourage sparsity in the parameter space, in order to favor the assessment of utility towards combinations of weights that concentrate on just few features. We present a mixed integer linear programming formulation and show how our approach compares favourably with Bayesian preference elicitation alternatives and easily scales to realistic datasets.

artificial intelligence, boutilier, optimization problem, (15 more...)

arXiv.org Machine Learning

1604.0602

Country:

Europe > Italy (0.14)
Europe > France (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.85)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.68)

Add feedback

Computing Optimal Monitoring Strategy for Detecting Terrorist Plots

Wang, Zhen (Nanyang Technological University) | Yin, Yue (University of Chinese Academy of Sciences) | An, Bo (Nanyang Technological University)

AAAI ConferencesApr-19-2016

In recent years, terrorist organizations (e.g., ISIS or al-Qaeda) are increasingly directing terrorists to launch coordinated attacks in their home countries. One example is the Paris shootings on January 7, 2015.By monitoring potential terrorists, security agencies are able to detect and stop terrorist plots at their planning stage.Although security agencies may have knowledge about potential terrorists (e.g., who they are, how they interact), they usually have limited resources and cannot monitor all terrorists.Moreover, a terrorist planner may strategically choose to arouse terrorists considering the security agency's monitoring strategy. This paper makes five key contributions toward the challenging problem of computing optimal monitoring strategies: 1) A new Stackelberg game model for terrorist plot detection;2) A modified double oracle framework for computing the optimal strategy effectively;3) Complexity results for both defender and attacker oracle problems;4) Novel mixed-integer linear programming (MILP) formulations for best response problems of both players;and 5) Effective approximation algorithms for generating suboptimal responses for both players.Experimental evaluation shows that our approach can obtain a robust enough solution outperforming widely-used centrality based heuristics significantly and scale up to realistic-sized problems.

game theory, law enforcement, vertex, (21 more...)

AAAI Conferences

Thirtieth AAAI Conference on Artificial Intelligence

Country: Asia > Middle East (0.28)

Industry: Law Enforcement & Public Safety > Terrorism (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.86)

Add feedback

The Complexity Landscape of Decompositional Parameters for ILP

Ganian, Robert (Technische Universität Wien) | Ordyniak, Sebastian (Technische Universität Wien)

AAAI ConferencesApr-19-2016

Integer Linear Programming (ILP) can be seen as the archetypical problem for NP-complete optimization problems, and a wide range of problems in artificial intelligence are solved in practice via a translation to ILP. Despite its huge range of applications, only few tractable fragments of ILP are known, probably the most prominent of which is based on the notion of total unimodularity. Using entirely different techniques, we identify new tractable fragments of ILP by studying structural parameterizations of the constraint matrix within the framework of parameterized complexity. In particular, we show that ILP is fixed-parameter tractable when parameterized by the treedepth of the constraint matrix and the maximum absolute value of any coefficient occurring in the ILP instance. Together with matching hardness results for the more general parameter treewidth, we draw a detailed complexity landscape of ILP w.r.t. decompositional parameters defined on the constraint matrix.

artificial intelligence, coefficient, optimization problem, (17 more...)

AAAI Conferences

Thirtieth AAAI Conference on Artificial Intelligence

Country:

North America > United States > California (0.14)
Europe > Austria > Vienna (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.66)

Add feedback

Linear Algebra in Julia

@machinelearnbotApr-13-2016, 11:55:28 GMT

Most people (including myself) are drawn to Julia by its lofty goals. Speed of C, statistical packages of R, and ease of Python?--it sounds two good to be true. However, I haven't seen anyone who has looked into it say the developers behind the language aren't on track to accomplish these goals.

artificial intelligence, linear algebra

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

Add feedback