AITopics

2011.14572

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > New York (0.04)
Europe > Switzerland > Geneva > Geneva (0.04)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

#artificialintelligenceNov-30-2020, 09:42:17 GMT

All-in-One:Machine Learning,DL,NLP,AWS Deply [Hindi][Python]

Online Courses Udemy - All-in-One:Machine Learning,DL,NLP,AWS Deply [Hindi][Python], Complete hands-on Machine Learning Course with Data Science, NLP, Deep Learning and Artificial Intelligence Created by Rishi Bansal English Students also bought Java from Zero to First Job: Part 1 - Java Basics and OOP C Programming for Beginners - Master the C Fundamentals Full-Stack Web Development For Beginners The Complete Java Programmer: From Scratch to Advanced Python and Django Full-Stack Web Development for beginners Learn To Create AI Assistant (JARVIS) With Python Preview this course GET COUPON CODE Description This course is designed to cover maximum Concept of Machine Learning. Anyone can opt for this course. No prior understanding of Machine Learning is required. As a Bonus Introduction Natural Language Processing and Deep Learning is included. Below Topics are covered Chapter - Introduction to Machine Learning - Machine Learning?

learning, machine learning, regression, (14 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.59)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

arXiv.org Machine LearningNov-30-2020

A Hypergradient Approach to Robust Regression without Correspondence

Xie, Yujia, Mao, Yixiu, Zuo, Simiao, Xu, Hongteng, Ye, Xiaojing, Zhao, Tuo, Zha, Hongyuan

We consider a regression problem, where the correspondence between input and output data is not available. Such shuffled data is commonly observed in many real world problems. Taking flow cytometry as an example, the measuring instruments are unable to preserve the correspondence between the samples and the measurements. Due to the combinatorial nature, most of existing methods are only applicable when the sample size is small, and limited to linear regression models. To overcome such bottlenecks, we propose a new computational framework - ROBOT- for the shuffled regression problem, which is applicable to large data and complex models. Specifically, we propose to formulate the regression without correspondence as a continuous optimization problem. Then by exploiting the interaction between the regression model and the data correspondence, we propose to develop a hypergradient approach based on differentiable programming techniques. Such a hypergradient approach essentially views the data correspondence as an operator of the regression, and therefore allows us to find a better descent direction for the model parameter by differentiating through the data correspondence. ROBOT is quite general, and can be further extended to the inexact correspondence setting, where the input and output data are not necessarily exactly aligned. Thorough numerical experiments show that ROBOT achieves better performance than existing methods in both linear and nonlinear regression tasks, including real-world applications such as flow cytometry and multi-object tracking.

artificial intelligence, correspondence, machine learning, (16 more...)

2012.00123

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report (0.50)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Neal, Brady, Huang, Chin-Wei, Raghupathi, Sunand

RealCause: Realistic Causal Inference Benchmarking

arXiv.org Machine LearningNov-30-2020

There are many different causal effect estimators in causal inference. However, it is unclear how to choose between these estimators because there is no ground-truth for causal effects. A commonly used option is to simulate synthetic data, where the ground-truth is known. However, the best causal estimators on synthetic data are unlikely to be the best causal estimators on realistic data. An ideal benchmark for causal estimators would both (a) yield ground-truth values of the causal effects and (b) be representative of real data. Using flexible generative models, we provide a benchmark that both yields ground-truth and is realistic. Using this benchmark, we evaluate 66 different causal estimators.

dataset, estimator, visualization, (11 more...)

2011.15007

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Oceania > Australia (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(2 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

#artificialintelligenceNov-29-2020, 19:50:58 GMT

Blending Ensemble Machine Learning With Python

Blending is an ensemble machine learning algorithm. It is a colloquial name for stacked generalization or stacking ensemble where instead of fitting the meta-model on out-of-fold predictions made by the base model, it is fit on predictions made on a holdout dataset. Blending was used to describe stacking models that combined many hundreds of predictive models by competitors in the $1M Netflix machine learning competition, and as such, remains a popular technique and name for stacking in competitive machine learning circles, such as the Kaggle community. In this tutorial, you will discover how to develop and evaluate a blending ensemble in python. Blending Ensemble Machine Learning With Python Photo by Nathalie, some rights reserved. Blending is an ensemble machine learning technique that uses a machine learning model to learn how to best combine the predictions from multiple contributing ensemble member models.

dataset, ensemble, prediction, (14 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

#artificialintelligenceNov-29-2020, 19:50:28 GMT

Linear Regression: Zero to Hero

In this blog, we are going to discuss the most important algorithm in machine learning and deep learning Linear Regression. "In Linear Regression Our Main Task is to find the best fitted line" As we see on the plot above that the best-fitted line on the data points is L0. There can be more best-fitted lines on the data points like l1, and l2, etc. then the question is, how do we find the best-fitted line above all of them?? We calculate the distance of the line from each point in the graph then find the MSE. After that, whichever line gives us the minimum error, we choose that line as our best-fitted line. In the plot below, we are measuring the distance of L0 From all the points and then just finding the error and comparing it with other lines.

best-fitted line, linear regression, zero

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.93)

#artificialintelligenceNov-29-2020, 18:06:29 GMT

MARS: Multivariate Adaptive Regression Splines -- How to Improve on Linear Regression?

Machine Learning is making huge leaps forward, with an increasing number of algorithms enabling us to solve complex real-world problems. This story is part of a deep dive series explaining the mechanics of Machine Learning algorithms. In addition to giving you an understanding of how ML algorithms work, it also provides you with Python examples to build your own ML models. Before we dive into the specifics of MARS, I assume that you are already familiar with Linear Regression. Looking at the algorithm's full name -- Multivariate Adaptive Regression Splines -- you would be correct to guess that MARS belongs to the group of regression algorithms used to predict continuous (numerical) target variables.

algorithm, hinge function, multivariate adaptive regression spline, (7 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

#artificialintelligenceNov-29-2020, 08:50:34 GMT

Predicting best quality of wine using Linear Regression and PyTorch

In this notebook we will predict the best quality of the wine using PyTorch and linear regression. If you haven't checked out my previous blog on Linear Regression check this out . First of all lets import required libraries.. Now lets analyse our dataset.. its important to analyse to see what we are dealing with.. Training Dataset: The sample of data used to fit the model. The actual dataset that we use to train the model (weights and biases in the case of a Neural Network). The model sees and learns from this data.

best quality, dataset, linear regression and pytorch, (7 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Zhang, Amy X., Bao, Le, Daniels, Michael J.

Approximate Cross-validated Mean Estimates for Bayesian Hierarchical Regression Models

arXiv.org Machine LearningNov-28-2020

We introduce a novel procedure for obtaining cross-validated predictive estimates for Bayesian hierarchical regression models (BHRMs). Bayesian hierarchical models are popular for their ability to model complex dependence structures and provide probabilistic uncertainty estimates, but can be computationally expensive to run. Cross-validation (CV) is therefore not a common practice to evaluate the predictive performance of BHRMs. Our method circumvents the need to re-run computationally costly estimation methods for each cross-validation fold and makes CV more feasible for large BHRMs. By conditioning on the variance-covariance parameters, we shift the CV problem from probability-based sampling to a simple and familiar optimization problem. In many cases, this produces estimates which are equivalent to full CV. We provide theoretical results and demonstrate its efficacy on publicly available data and in simulations.

approximation, cv fold, equation, (14 more...)

2011.14238

Country:

North America > United States > Florida > Alachua County > Gainesville (0.14)
North America > United States > Pennsylvania > Centre County > State College (0.04)
North America > United States > New York > New York County > New York City (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.93)
Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

arXiv.org Machine LearningNov-28-2020

Optimal Semi-supervised Estimation and Inference for High-dimensional Linear Regression

Deng, Siyi, Ning, Yang, Zhao, Jiwei, Zhang, Heping

There are many scenarios such as the electronic health records where the outcome is much more difficult to collect than the covariates. In this paper, we consider the linear regression problem with such a data structure under the high dimensionality. Our goal is to investigate when and how the unlabeled data can be exploited to improve the estimation and inference of the regression parameters in linear models, especially in light of the fact that such linear models may be misspecified in data analysis. In particular, we address the following two important questions. (1) Can we use the labeled data as well as the unlabeled data to construct a semi-supervised estimator such that its convergence rate is faster than the supervised estimators? (2) Can we construct confidence intervals or hypothesis tests that are guaranteed to be more efficient or powerful than the supervised estimators? To address the first question, we establish the minimax lower bound for parameter estimation in the semi-supervised setting. We show that the upper bound from the supervised estimators that only use the labeled data cannot attain this lower bound. We close this gap by proposing a new semi-supervised estimator which attains the lower bound. To address the second question, based on our proposed semi-supervised estimator, we propose two additional estimators for semi-supervised inference, the efficient estimator and the safe estimator. The former is fully efficient if the unknown conditional mean function is estimated consistently, but may not be more efficient than the supervised approach otherwise. The latter usually does not aim to provide fully efficient inference, but is guaranteed to be no worse than the supervised approach, no matter whether the linear model is correctly specified or the conditional mean function is consistently estimated.

estimator, inference, unlabeled data, (14 more...)

2011.14185

Country:

North America > United States > Wisconsin > Dane County > Madison (0.14)
North America > United States > New York > Tompkins County > Ithaca (0.04)
North America > United States > Texas (0.04)
(2 more...)

Genre: Research Report > New Finding (0.92)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)