AITopics

2001.02798

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(6 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.87)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.66)

Shi, Xiaofei, Woodruff, David P.

Sublinear Time Numerical Linear Algebra for Structured Matrices

arXiv.org Machine LearningDec-12-2019

We show how to solve a number of problems in numerical linear algebra, such as least squares regression, $\ell_p$-regression for any $p \geq 1$, low rank approximation, and kernel regression, in time $T(A) \poly(\log(nd))$, where for a given input matrix $A \in \mathbb{R}^{n \times d}$, $T(A)$ is the time needed to compute $A\cdot y$ for an arbitrary vector $y \in \mathbb{R}^d$. Since $T(A) \leq O(\nnz(A))$, where $\nnz(A)$ denotes the number of non-zero entries of $A$, the time is no worse, up to polylogarithmic factors, as all of the recent advances for such problems that run in input-sparsity time. However, for many applications, $T(A)$ can be much smaller than $\nnz(A)$, yielding significantly sublinear time algorithms. For example, in the overconstrained $(1+\epsilon)$-approximate polynomial interpolation problem, $A$ is a Vandermonde matrix and $T(A) = O(n \log n)$; in this case our running time is $n \cdot \poly(\log n) + \poly(d/\epsilon)$ and we recover the results of \cite{avron2013sketching} as a special case. For overconstrained autoregression, which is a common problem arising in dynamical systems, $T(A) = O(n \log n)$, and we immediately obtain $n \cdot \poly(\log n) + \poly(d/\epsilon)$ time. For kernel autoregression, we significantly improve the running time of prior algorithms for general kernels. For the important case of autoregression with the polynomial kernel and arbitrary target vector $b\in\mathbb{R}^n$, we obtain even faster algorithms. Our algorithms show that, perhaps surprisingly, most of these optimization problems do not require much more time than that of a polylogarithmic number of matrix-vector multiplications.

algorithm, matrix, woodruff, (17 more...)

1912.0606

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(3 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

#artificialintelligenceDec-10-2019, 05:29:30 GMT

New Books and Resources for DSC Members

We are in the process of writing and adding new material (compact eBooks) exclusively available to our members, and written in simple English, by world leading experts in AI, data science, and machine learning. We invite you to sign up here to not miss these free books. This book is intended for busy professionals working with data of any kind: engineers, BI analysts, statisticians, operations research, AI and machine learning professionals, economists, data scientists, biologists, and quants, ranging from beginners to executives. In about 300 pages and 28 chapters it covers many new topics, offering a fresh perspective on the subject, including rules of thumb and recipes that are easy to automate or integrate in black-box systems, as well as new model-free, data-driven foundations to statistical science and predictive analytics. The approach focuses on robust techniques; it is bottom-up (from applications to theory), in contrast to the traditional top-down approach. The material is accessible to practitioners with a one-year college-level exposure to statistics and probability.

ajit jaokar, application, member only, (12 more...)

Genre: Summary/Review (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.81)
Information Technology > Data Science > Data Mining (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.31)

#artificialintelligenceDec-6-2019, 07:18:21 GMT

Machine Learning for Signal Processing: Data Science, Algorithms, and Computational Statistics: Max A. Little: 9780198714934: Amazon.com: Books

This book provides an excellent pathway for gaining first-class expertise in machine learning. It provides both the technical background that explains why certain approaches, but not others, are best practice in real world problems, and a framework for how to think about and approach new problems. I highly recommend it for people with a signal processing background who are seeking to become an expert in machine learning.

computational statistic, machine learning, signal processing, (4 more...)

Country: North America > United States > Massachusetts (0.16)

Industry: Retail > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

#artificialintelligenceDec-5-2019, 04:10:45 GMT

A Quasi-Newton Method Based Vertical Federated Learning Framework for Logistic Regression

Data privacy and security becomes a major concern in building machine learning models from different data providers. Federated learning shows promise by leaving data at providers locally and exchanging encrypted information. This paper studies the vertical federated learning structure for logistic regression where the data sets at two parties have the same sample IDs but own disjoint subsets of features. Existing frameworks adopt the first-order stochastic gradient descent algorithm, which requires large number of communication rounds. To address the communication challenge, we propose a quasi-Newton method based vertical federated learning framework for logistic regression under the additively homomorphic encryption scheme.

logistic regression, quasi-newton method, vertical federated learning framework, (1 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.94)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.67)

Kovalev, Dmitry, Mishchenko, Konstantin, Richtárik, Peter

Stochastic Newton and Cubic Newton Methods with Simple Local Linear-Quadratic Rates

arXiv.org Machine LearningDec-3-2019

We present two new remarkably simple stochastic second-order methods for minimizing the average of a very large number of sufficiently smooth and strongly convex functions. The first is a stochastic variant of Newton's method (SN), and the second is a stochastic variant of cubically regularized Newton's method (SCN). We establish local linear-quadratic convergence results. Unlike existing stochastic variants of second order methods, which require the evaluation of a large number of gradients and/or Hessians in each iteration to guarantee convergence, our methods do not have this shortcoming. For instance, the simplest variants of our methods in each iteration need to compute the gradient and Hessian of a {\em single} randomly selected function only. In contrast to most existing stochastic Newton and quasi-Newton methods, our approach guarantees local convergence faster than with first-order oracle and adapts to the problem's curvature. Interestingly, our method is not unbiased, so our theory provides new intuition for designing new stochastic methods.

arxiv preprint arxiv, hessian, international conference, (12 more...)

1912.01597

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > Canada (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

#artificialintelligenceDec-2-2019, 10:24:30 GMT

Introduction to Applied Linear Algebra – Vectors, Matrices, and Least Squares

This book is used as the textbook for the course EE103 (Stanford) and EE133A (UCLA), where you will find additional related material. If you find an error not listed in our errata list, please do let us know about it. You're welcome to use the lecture slides posted below, but we'd appreciate it if you acknowledge the source.

matrix, vector

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.40)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.13)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

#artificialintelligenceNov-30-2019, 22:48:37 GMT

Linear Algebra and Learning from Data

Also included is an essay from SIAM News'The Functions of Deep Learning' (December 2018) A second distributor for SIAM members is siam.org We will confirm orders for this new book by email.

distributor, linear algebra and learning

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.40)
North America > United States > Massachusetts > Norfolk County > Wellesley (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Genre: Summary/Review (0.74)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

#artificialintelligenceNov-25-2019, 14:37:31 GMT

Gilbert Strang: Linear Algebra, Deep Learning, Teaching, and MIT OpenCourseWare AI Podcast

Gilbert Strang is a professor of mathematics at MIT and perhaps one of the most famous and impactful teachers of math in the world. His MIT OpenCourseWare lectures on linear algebra have been viewed millions of times. This conversation is part of the Artificial Intelligence podcast.

gilbert strang, lexfridman, mit opencourseware ai podcast, (5 more...)

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.08)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.76)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.44)

Bhadra, Somnath, Chakraborty, Kaustav, Sengupta, Srijan, Lahiri, Soumendra

A Bootstrap-based Inference Framework for Testing Similarity of Paired Networks

arXiv.org Machine LearningNov-24-2019

We live in an interconnected world where network valued data arises in many domains, and, fittingly, statistical network analysis has emerged as an active area in the literature. However, the topic of inference in networks has received relatively less attention. In this work, we consider the paired network inference problem where one is given two networks on the same set of nodes, and the goal is to test whether the given networks are stochastically similar in terms of some notion of similarity. We develop a general inferential framework based on parametric bootstrap to address this problem. Under this setting, we address two specific and important problems: the equality problem, i.e., whether the two networks are generated from the same random graph model, and the scaling problem, i.e., whether the underlying probability matrices of the two random graph models are scaled versions of each other.

bootstrap iteration, hypothesis, test statistic, (11 more...)

1911.06869

Country:

North America > United States > Illinois (0.04)
North America > United States > Virginia (0.04)
Europe > Hungary > Hajdú-Bihar County > Debrecen (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)
Health & Medicine > Pharmaceuticals & Biotechnology (0.45)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications > Social Media (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.69)