AITopics

2006.12724

Country:

Europe > Netherlands (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Canada > Quebec (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry:

Banking & Finance > Economy (1.00)
Government > Regional Government > North America Government > United States Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

#artificialintelligence Nov-7-2020, 09:25:10 GMT

A Perfect Intro To Supervised Learning

We're taught to ride a bicycle over a few laps around the house. We're taught to solve a few example math problems. The next similar problem you encounter is one you solve by having trained yourself on the previous ones. I believe nothing is complex. I've often been intimidated by these buzzwords.

algorithm, learning, supervised learning, (13 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.36)

#artificialintelligence Nov-7-2020, 09:25:07 GMT

Linear Regression

Regression is a supervised learning algorithm. When the output (the dependent feature) is continuous and labeled, we apply a regression algorithm. Regression is used to find the relation, or equation, between the independent variables and the output variable. For example, given variables x₁, x₂, …, xₙ that contribute to the output variable y, we have to find a relation between the x variables and the dependent variable y.
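As a rough illustration of finding such a relation, the sketch below fits y ≈ b₀ + b₁x₁ + … + bₙxₙ by ordinary least squares using the normal equations. It is a minimal sketch and not code from the article: the function names `solve_linear_system` and `fit_linear_regression` and the toy data are assumptions made for this example.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the caller's lists are left untouched.
    m = [list(row) + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        # Pick the row with the largest entry in this column as the pivot.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("features are linearly dependent")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_linear_regression(rows, ys):
    """Return [b0, b1, ..., bn] minimising the squared error of
    y ~ b0 + b1*x1 + ... + bn*xn."""
    # Prepend a constant 1.0 to every row so b0 acts as the intercept.
    design = [[1.0] + [float(v) for v in row] for row in rows]
    p = len(design[0])
    # Normal equations: (X^T X) b = X^T y.
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(p)] for i in range(p)]
    xty = [sum(d[i] * y for d, y in zip(design, ys)) for i in range(p)]
    return solve_linear_system(xtx, xty)


# Hypothetical toy data generated from y = 1 + 2*x1 + 3*x2 (no noise).
rows = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 5)]
ys = [1 + 2 * x1 + 3 * x2 for x1, x2 in rows]
coefficients = fit_linear_regression(rows, ys)
print([round(c, 6) for c in coefficients])
```

Because the toy targets are generated without noise, the fitted coefficients should recover the intercept and the two weights up to floating-point error. With real data, a numerical library would normally be preferred over hand-written elimination.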

equation, principal minor, regression, (14 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Cai, T. Tony, Wang, Yichen, Zhang, Linjun

The Cost of Privacy in Generalized Linear Models: Algorithms and Minimax Lower Bounds

arXiv.org Machine Learning Nov-7-2020

The trade-off between differential privacy and statistical accuracy in generalized linear models (GLMs) is studied. We propose differentially private algorithms for parameter estimation in both low-dimensional and high-dimensional sparse GLMs and characterize their statistical performance. We establish privacy-constrained minimax lower bounds for GLMs, which imply that the proposed algorithms are rate-optimal up to logarithmic factors in sample size. The lower bounds are obtained via a novel technique, which is based on Stein's Lemma and generalizes the tracing attack technique for privacy-constrained lower bounds. This lower bound argument can be of independent interest as it is applicable to general parametric models. Simulated and real data experiments are conducted to demonstrate the numerical performance of our algorithms.

algorithm, assumption, privacy, (14 more...)

2011.039

Country:

Oceania > Australia > New South Wales (0.04)
North America > United States > Pennsylvania (0.04)
North America > United States > California (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

#artificialintelligence Nov-6-2020, 17:55:42 GMT

The Solution to a Data Science Problem is not Unique

Data science projects vary in scope and complexity. Sometimes, the project could be as simple as producing summary statistics, charts, and visualizations. It could also involve building a regression model, classification model, or forecasting using a time-dependent dataset. The project could also be very complex and difficult, with no clear guidance as to the specific type of model to use. In this case, it is the task of the data science aspirant or professional to come up with the model that is best suited to the project's goals and objectives.

dataset, jupyter notebook, regression, (14 more...)

Industry: Transportation (0.35)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.71)

#artificialintelligence Nov-6-2020, 09:20:06 GMT

Timeline for Data Science Competence

For anyone interested in jumping into the field of data science, one of the most important questions to ask is: How long does it take to gain competency in data science? This article will discuss the typical timeline for data science competency. The time required to gain competency in data science depends on the level of competency. In Section II, we will discuss the three levels of data science. In Section III, we discuss the time required for gaining data science competency based on the level of interest.

competency, data science, data science competency, (10 more...)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.36)

#artificialintelligence Nov-6-2020, 01:30:27 GMT

Machine Learning Algorithms: Everything You Need to Know - Business Module Hub

If you're an AI professional or aspire to be one, one thing you must be aware of is that machine learning algorithms are your closest aid and ally. These algorithms can also be annoying, given that there is a multitude of them. Knowledge of algorithms is essential to being an effective AI engineer, data scientist, or machine learning engineer. To give you a gist of how these algorithms work, let's get to know them.

algorithm, learning algorithm, machine learning algorithm, (14 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.75)

Nakıp, Mert, Çopur, Onur, Güzeliş, Cüneyt

Curse of Small Sample Size in Forecasting of the Active Cases in COVID-19 Outbreak

arXiv.org Machine Learning Nov-6-2020

During the COVID-19 pandemic, a massive number of attempts have been made to predict the number of cases and other future trends of the pandemic. However, they fail to predict, reliably and within acceptable accuracy, the medium- and long-term evolution of fundamental features of the COVID-19 outbreak. This paper gives an explanation for the failure of machine learning models in this particular forecasting problem. The paper shows that simple linear regression models reliably provide high prediction accuracy, but only for a 2-week period, and that relatively complex machine learning models, which have the potential to learn long-term predictions with low errors, cannot obtain good predictions with high generalization ability. The paper suggests that the lack of a sufficient number of samples is the source of the forecasting models' low prediction performance. The reliability of the forecasting results for the active cases is measured in terms of the cross-validation prediction errors, which are used as expectations for the generalization errors of the forecasters. To exploit the information most relevant to the active cases, we perform feature selection over a variety of variables. We apply different feature selection methods, namely Pairwise Correlation, Recursive Feature Selection, and feature selection using Lasso regression, and compare them with each other and with models that do not employ any feature selection. Furthermore, we compare Linear Regression, Multi-Layer Perceptron, and Long-Short Term Memory models, each of which is used to predict active cases together with the mentioned feature selection methods. Our results show that accurate forecasting of the active cases with high generalization ability is possible only up to 3 days because of the small sample size of COVID-19 data.
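To make the idea of feature selection with Lasso regression concrete, here is a minimal sketch of Lasso fitted by coordinate descent, where features whose coefficients are shrunk to exactly zero are treated as not selected. This is not the paper's code or experimental setup: the function names, the penalty value `lam`, and the toy data are all assumptions chosen for illustration, and the sketch assumes centred features with no all-zero column.

```python
def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold; return 0.0 inside the band."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def lasso_coordinate_descent(rows, ys, lam, sweeps=100):
    """Minimise (1/2n)*sum((y - x.b)^2) + lam*sum(|b_j|) without an intercept.

    Assumes the features and the target are already centred and that no
    feature column is identically zero.
    """
    n = len(rows)
    p = len(rows[0])
    coefs = [0.0] * p
    for _ in range(sweeps):
        for j in range(p):
            # Correlation of feature j with the residual that ignores feature j.
            rho = 0.0
            for row, y in zip(rows, ys):
                prediction = sum(c * v for c, v in zip(coefs, row))
                rho += row[j] * (y - prediction + coefs[j] * row[j])
            rho /= n
            scale = sum(row[j] ** 2 for row in rows) / n
            coefs[j] = soft_threshold(rho, lam) / scale
    return coefs


# Hypothetical toy data: the target depends only on the first feature,
# and the second feature is orthogonal to it.
rows = [(-2.0, 1.0), (-1.0, -1.0), (0.0, 0.0), (1.0, -1.0), (2.0, 1.0)]
ys = [3.0 * x0 for x0, _ in rows]
coefs = lasso_coordinate_descent(rows, ys, lam=0.5)
selected = [j for j, c in enumerate(coefs) if c != 0.0]
print(coefs, selected)
```

On this toy data the irrelevant second feature receives a coefficient of exactly zero and only the first feature is selected. The first coefficient is also shrunk below its true value of 3, which is the usual bias introduced by the L1 penalty.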

active case, feature selection method, forecasting, (11 more...)

2011.03628

Country:

Europe > Austria > Vienna (0.14)
Asia > Middle East > Republic of Türkiye > İzmir Province > İzmir (0.05)
North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.05)
(10 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

arXiv.org Machine Learning Nov-6-2020

Estimation, Confidence Intervals, and Large-Scale Hypotheses Testing for High-Dimensional Mixed Linear Regression

Zhang, Linjun, Ma, Rong, Cai, T. Tony, Li, Hongzhe

This paper studies the high-dimensional mixed linear regression (MLR) where the output variable comes from one of the two linear regression models with an unknown mixing proportion and an unknown covariance structure of the random covariates. Building upon a high-dimensional EM algorithm, we propose an iterative procedure for estimating the two regression vectors and establish their rates of convergence. Based on the iterative estimators, we further construct debiased estimators and establish their asymptotic normality. For individual coordinates, confidence intervals centered at the debiased estimators are constructed. Furthermore, a large-scale multiple testing procedure is proposed for testing the regression coefficients and is shown to control the false discovery rate (FDR) asymptotically. Simulation studies are carried out to examine the numerical performance of the proposed methods and their superiority over existing methods. The proposed methods are further illustrated through an analysis of a dataset of multiplex image cytometry, which investigates the interaction networks among the cellular phenotypes that include the expression levels of 20 epitopes or combinations of markers.
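For readers unfamiliar with mixed linear regression, the following is a minimal textbook-style EM sketch for the simplest case: one covariate, two slopes, no intercept, and a known noise variance. It is not the high-dimensional procedure proposed in the paper. The function name `em_mixed_regression`, the starting values, and the toy data are assumptions made for this illustration.

```python
import math


def em_mixed_regression(xs, ys, slope_a, slope_b, noise_var=1.0, iterations=50):
    """EM for y = slope_z * x + noise, where z picks one of two slopes."""
    weight_a = 0.5  # current estimate of the mixing proportion for model A
    for _ in range(iterations):
        # E-step: posterior probability that each point came from model A.
        resp = []
        for x, y in zip(xs, ys):
            log_a = math.log(weight_a) - (y - slope_a * x) ** 2 / (2 * noise_var)
            log_b = math.log(1 - weight_a) - (y - slope_b * x) ** 2 / (2 * noise_var)
            top = max(log_a, log_b)  # subtract the max for numerical stability
            pa = math.exp(log_a - top)
            pb = math.exp(log_b - top)
            resp.append(pa / (pa + pb))
        # M-step: weighted least squares for each slope, then the proportion.
        slope_a = (sum(r * x * y for r, x, y in zip(resp, xs, ys))
                   / sum(r * x * x for r, x in zip(resp, xs)))
        slope_b = (sum((1 - r) * x * y for r, x, y in zip(resp, xs, ys))
                   / sum((1 - r) * x * x for r, x in zip(resp, xs)))
        # Keep the proportion strictly inside (0, 1) so the logs stay finite.
        weight_a = min(max(sum(resp) / len(resp), 1e-9), 1 - 1e-9)
    return slope_a, slope_b, weight_a


# Hypothetical toy data: points alternate between slope 2 and slope -1.
xs = [0.5 * i for i in range(1, 11)]
ys = [2.0 * x if i % 2 == 0 else -1.0 * x for i, x in enumerate(xs)]
slope_a, slope_b, weight_a = em_mixed_regression(xs, ys, slope_a=1.0, slope_b=-0.5)
print(round(slope_a, 3), round(slope_b, 3), round(weight_a, 3))
```

EM only finds a local optimum, so the result depends on the starting slopes. The initial values here were picked so that each starts closer to one of the two true slopes.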

algorithm, estimator, regression, (14 more...)

2011.03598

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
North America > United States > New Jersey > Middlesex County > Piscataway (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

#artificialintelligence Nov-5-2020, 23:05:12 GMT

Calculating Linear Regression and Linear Best Fit an In-depth Tutorial with Math and Python

This tutorial's code is available on GitHub, and its full implementation is also available on Google Colab. Simple linear regression is a statistical approach that allows us to study and summarize the relationship between two continuous quantitative variables. Simple linear regression is used in machine learning models, mathematics, statistical modeling, forecasting epidemics, and other quantitative fields. Of the two variables, one is called the dependent variable, and the other is called the independent variable. Our goal is to predict the dependent variable's value based on the value of the independent variable.
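The closed-form least-squares fit for simple linear regression can be sketched in a few lines of Python. This is an illustrative sketch rather than the tutorial's own code: the function name `fit_simple_linear_regression` and the sample values are assumptions made for this example.

```python
def fit_simple_linear_regression(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope*x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope is the covariance of x and y divided by the variance of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    # The best-fit line always passes through the point (mean_x, mean_y).
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical sample: xs is the independent variable, ys the dependent one.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
slope, intercept = fit_simple_linear_regression(xs, ys)
prediction = slope * 6 + intercept  # predicted y for a new x value of 6
print(slope, intercept, prediction)
```

For this sample the slope is about 0.6 and the intercept about 2.2, so the predicted value at x = 6 is about 5.8.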

dataset, equation, squidward, (13 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.93)