AITopics

2002.04267

Country:

Europe > Poland > Masovia Province > Warsaw (0.04)
North America > United States > Virginia > Alexandria County > Alexandria (0.04)

Genre:

Research Report > Experimental Study (0.51)
Research Report > New Finding (0.32)

Industry:

Health & Medicine (0.68)
Banking & Finance > Credit (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.51)

#artificialintelligenceFeb-10-2020, 07:09:35 GMT

Machine Learning, Business analytics & Data Science with R

I am avoiding repeating same models with Python but included linear regression & logistic regression for continuation purpose. Going forward, I will cover other techniques with Python like image recognition, sentiment analysis etc. Image recognition is in progress & course will be updated soon with it. Unlike most machine learning courses out there, the Complete Machine Learning & Data Science with R-2019 is comprehensive. We are not only covering popular machine learning techniques but also additional techniques like ANOVA & CART techniques. Course is structured into various parts like R programming, data selection & manipulation, applied statistics & data visualization. This will help you with the structure of data science and machine learning.

analytic & data science, business analytic & data science, machine learning, (8 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.56)

Industry: Education > Educational Setting > Online (0.92)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.73)

#artificialintelligenceFeb-9-2020, 23:42:34 GMT

60 Interview Questions On Machine Learning

We frequently come out with resources for aspirants and job seekers in data science to help them make a career in this vibrant field. Cracking interviews especially where understating of machine learning is needed can be tricky. Here are 60 most commonly asked interview questions for data scientists, broken into linear regression, logistic regression and clustering.

artificial intelligence, interview question, machine learning, (1 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.75)

He, Jingyu, Hahn, P. Richard

Stochastic tree ensembles for regularized nonlinear regression

arXiv.org Machine LearningFeb-9-2020

Tree-based algorithms for supervised learning, such as Classification and Regression Trees (CART) (Breiman et al., 1984), random forests (Breiman, 1996, 2001), adaBoost (Freund and Schapire, 1997), and gradient boosting (Breiman, 1997; Friedman, 2001, 2002), are widely used for applied supervised learning. As a whole, these methods are popular in applied settings due to their speed and accuracy in mean estimation and out-of-sample prediction tasks. One limitation of such methods is their well-known sensitivity to tuning parameters, which require costly cross-validation to optimize. Bayesian additive regression trees (BART) (Chipman et al., 2007, 2010) is a popular model-based alternative that is often more accurate than other treebased methods; specifically, BART boasts valuable robustness to the choice of tuning-parameters. However, relative to random forests and boosting, BART's wider adoption has been slowed by its more severe computational demands, owing to its reliance on a random walk Metropolis-Hastings Markov chain Monte Carlo (MCMC) algorithm. Despite this limitation, BART has inspired a considerable body of research in recent years.

algorithm, node, split criterion, (14 more...)

2002.03375

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > California (0.04)
North America > United States > Arizona (0.04)

Genre: Research Report (0.50)

Industry: Health & Medicine (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
(2 more...)

Paskov, Hristo, Paskov, Alex, West, Robert

Learning High Order Feature Interactions with Fine Control Kernels

arXiv.org Machine LearningFeb-9-2020

We provide a methodology for learning sparse statistical models that use as features all possible multiplicative interactions among an underlying atomic set of features. While the resulting optimization problems are exponentially sized, our methodology leads to algorithms that can often solve these problems exactly or provide approximate solutions based on combining highly correlated features. We also introduce an algorithmic paradigm, the Fine Control Kernel framework, so named because it is based on Fenchel Duality and is reminiscent of kernel methods. Its theory is tailored to large sparse learning problems, and it leads to efficient feature screening rules for interactions. These rules are inspired by the Apriori algorithm for market basket analysis -- which also falls under the purview of Fine Control Kernels, and can be applied to a plurality of learning problems including the Lasso and sparse matrix estimation. Experiments on biomedical datasets demonstrate the efficacy of our methodology in deriving algorithms that efficiently produce interactions models which achieve state-of-the-art accuracy and are interpretable.

algorithm, equation, interaction, (14 more...)

2002.03298

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.64)

Industry:

Education > Focused Education > Special Education (0.45)
Health & Medicine > Therapeutic Area (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)

#artificialintelligenceFeb-8-2020, 04:02:57 GMT

Introduction to Bayesian Logistic Regression

Let's review the concepts underlying Bayesian statistical analysis by walking through a simple classification model. The data come from the 1988 Bangladesh Fertility Survey, where 1934 observations were taken from women in urban and rural areas. The authors of the dataset, Mn and Cleland aimed to determine trends and causes of fertility as well as differences in fertility and child mortality. We will use the data in order to train a Bayesian logistic regression model that can predict if a given woman uses contraception. The dataset is well suited to Bayesian logistic regression because being able to quantify uncertainty when analyzing fertility is the major component of population dynamics that decide the size, structure, and composition of populations (source 1, source 2).

posterior distribution, probability, use contraceptive, (10 more...)

Country: Asia > Bangladesh (0.25)

Genre:

Research Report > New Finding (0.94)
Research Report > Experimental Study (0.94)

Industry: Health & Medicine (0.80)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.32)

Fan, Yiying, Sun, Jiayang

Subsampling Winner Algorithm for Feature Selection in Large Regression Data

arXiv.org Machine LearningFeb-7-2020

Feature selection from a large number of covariates (aka features) in a regression analysis remains a challenge in data science, especially in terms of its potential of scaling to ever-enlarging data and finding a group of scientifically meaningful features. For example, to develop new, responsive drug targets for ovarian cancer, the actual false discovery rate (FDR) of a practical feature selection procedure must also match the target FDR. The popular approach to feature selection, when true features are sparse, is to use a penalized likelihood or a shrinkage estimation, such as a LASSO, SCAD, Elastic Net, or MCP procedure (call them benchmark procedures). We present a different approach using a new subsampling method, called the Subsampling Winner algorithm (SWA). The central idea of SWA is analogous to that used for the selection of US national merit scholars. SWA uses a "base procedure" to analyze each of the subsamples, computes the scores of all features according to the performance of each feature from all subsample analyses, obtains the "semifinalist" based on the resulting scores, and then determines the "finalists," i.e., the most important features. Due to its subsampling nature, SWA can scale to data of any dimension in principle. The SWA also has the best-controlled actual FDR in comparison with the benchmark procedures and the randomForest, while having a competitive true-feature discovery rate. We also suggest practical add-on strategies to SWA with or without a penalized benchmark procedure to further assure the chance of "true" discovery. Our application of SWA to the ovarian serous cystadenocarcinoma specimens from the Broad Institute revealed functionally important genes and pathways, which we verified by additional genomics tools. This second-stage investigation is essential in the current discussion of the proper use of P-values.

procedure, swa, true feature, (15 more...)

2002.02903

Country:

North America > United States > New York (0.04)
North America > United States > Virginia > Fairfax County > Fairfax (0.04)
North America > United States > Virginia > Alexandria County > Alexandria (0.04)
North America > United States > Ohio > Cuyahoga County > Cleveland (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.86)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

#artificialintelligenceFeb-6-2020, 04:37:59 GMT

#006A Fast Logistic Regression Master Data Science

When we are programming Logistic Regression or Neural Networks we should avoid explicit $for $ loops. It's not always possible, but when we can, we should use built-in functions or find some other ways to compute it. Vectorizing the implementation of Logistic Regression makes the code highly efficient. In this post we will see how we can use this technique to compute gradient descent without using even a single $for $ loop. This code was non-vectorized and highly inefficent so we need to transform it.

enspace, logistic regression master data science, textbf, (7 more...)

Genre:

Research Report > New Finding (0.89)
Research Report > Experimental Study (0.89)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.89)

Somacal, Agustín, Boechi, Leonardo, Jonckheere, Matthieu, Lefieux, Vincent, Picard, Dominique, Smucler, Ezequiel

Uncovering differential equations from data with hidden variables

arXiv.org Machine LearningFeb-6-2020

Examples include meteorology, biology, and physics. The usual way to model deterministic dynamical systems is by using (partial) differential equations. Typically, differential equations models for a given dynamical system are derived using apriori insights into the problem at hand; then the model is validated using empirical observations. In an era in which massive data-sets pertaining to different fields of science are widely available, an interesting problem is whether it is possible for a useful differential equations model to be learned directly from data, without any major modeling effort required by the researcher. Our goal in this paper is to develop a general methodology for building such differential equations models in contexts in which not all relevant variables are observed, that is, in cases in which the main variable of interest depends on other variables of which no measurements are available. As a concrete example, consider the following problem. RTE, the electricity transmission system operator of France, uses high-level simulations of hourly temperature series to study the impact different climate scenarios have on electricity consumption, and hence on the French electrical power grid.

artificial intelligence, dynamical system, machine learning, (18 more...)

2002.0225

Country:

Europe > France (0.25)
South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.05)

Genre: Research Report (0.40)

Industry: Energy > Power Industry (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Bunea, Florentina, Strimas-Mackey, Seth, Wegkamp, Marten

Interpolation under latent factor regression models

arXiv.org Machine LearningFeb-6-2020

This work studies finite-sample properties of the risk of the minimum-norm interpolating predictor in high-dimensional regression models. If the effective rank of the covariance matrix $\Sigma$ of the $p$ regression features is much larger than the sample size $n$, we show that the min-norm interpolating predictor is not desirable, as its risk approaches the risk of predicting the response by $0$. However, our detailed finite sample analysis reveals, surprisingly, that this behavior is not present when the regression response and the features are jointly low-dimensional, and follow a widely used factor regression model. Within this popular model class, and when the effective rank of $\Sigma$ is smaller than $n$, while still allowing for $p \gg n$, both the bias and the variance terms of the excess risk can be controlled, and the risk of the minimum-norm interpolating predictor approaches optimal benchmarks. Moreover, through a detailed analysis of the bias term, we exhibit model classes under which our upper bound on the excess risk approaches zero, while the corresponding upper bound in the recent work arXiv:1906.11300v3 diverges. Furthermore, we show that minimum-norm interpolating predictors analyzed under factor regression models, despite being model-agnostic, can have similar risk to model-assisted predictors based on principal components regression, in the high-dimensional regime.

factor regression model, probability, theorem 7, (14 more...)

2002.02525

Country:

North America > United States > New York > Tompkins County > Ithaca (0.04)
North America > United States > California (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.81)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)