AITopics

1711.04817

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

#artificialintelligence Jan-15-2018, 20:03:39 GMT

How to Generate Test Datasets in Python with scikit-learn - Machine Learning Mastery

Test datasets are small contrived datasets that let you test a machine learning algorithm or test harness. The data from test datasets have well-defined properties, such as linearity or non-linearity, that allow you to explore specific algorithm behavior. The scikit-learn Python library provides a suite of functions for generating samples from configurable test problems for regression and classification. In this tutorial, you will discover test problems and how to use them in Python with scikit-learn. A problem when developing and implementing machine learning algorithms is knowing whether you have implemented them correctly.
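As a rough illustration of the idea, the sketch below builds a contrived regression dataset with a known slope and intercept using only the Python standard library, then checks that a hand-written least-squares fit recovers them. The function names and parameter values are assumptions made for this example and are not taken from the tutorial; scikit-learn's own generators, such as `make_regression` and `make_blobs`, offer configurable versions of the same thing.

```python
import random


def make_linear_dataset(n_samples=200, slope=3.0, intercept=-1.0, noise=0.1, seed=0):
    """Generate (x, y) pairs from y = slope * x + intercept + Gaussian noise."""
    rng = random.Random(seed)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n_samples)]
    ys = [slope * x + intercept + rng.gauss(0.0, noise) for x in xs]
    return xs, ys


def fit_line(xs, ys):
    """Ordinary least squares for a single predictor; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Because the generating parameters are known, the fit can be checked directly.
xs, ys = make_linear_dataset()
slope_hat, intercept_hat = fit_line(xs, ys)
print(f"estimated slope={slope_hat:.3f}, intercept={intercept_hat:.3f}")
```

If the estimates land far from the generating values, the implementation under test is suspect, which is the point of a contrived dataset.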

algorithm, dataset, test problem, (12 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.32)

@machinelearnbot Jan-15-2018, 16:53:44 GMT

Linear Regression, GLMs and GAMs with R - Udemy

Linear Regression, GLMs and GAMs with R demonstrates how to use R to extend the basic assumptions and constraints of linear regression to specify, model, and interpret the results of generalized linear models (GLMs) and generalized additive models (GAMs). The course demonstrates the estimation of GLMs and GAMs by working through a series of practical examples from the book Generalized Additive Models: An Introduction with R by Simon N. Wood (Chapman & Hall/CRC Texts in Statistical Science, 2006). Linear statistical models have a univariate response modeled as a linear function of predictor variables and a zero-mean random error term. The assumption of linearity is a critical (and limiting) characteristic. Generalized linear models (GLMs) relax this assumption of linearity.

artificial intelligence, glm and gam, machine learning, (7 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.40)
Education > Educational Setting > Online (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

arXiv.org Machine Learning Jan-15-2018

Improving Orbit Prediction Accuracy through Supervised Machine Learning

Peng, Hao, Bai, Xiaoli

Due to the lack of information such as the space environment condition and resident space objects' (RSOs') body characteristics, current orbit predictions that are solely grounded on physics-based models may fail to achieve the required accuracy for collision avoidance and have already led to satellite collisions. This paper presents a methodology to predict RSOs' trajectories with higher accuracy than that of the current methods. Inspired by machine learning (ML) theory, through which models are learned from large amounts of observed data and prediction is conducted without explicitly modeling space objects and the space environment, the proposed ML approach integrates physics-based orbit prediction algorithms with a learning-based process that focuses on reducing the prediction errors. Using a simulation-based space catalog environment as the test bed, the paper demonstrates three types of generalization capability for the proposed ML approach: 1) the ML model can be used to improve the same RSO's orbit information that is not available during the learning process but shares the same time interval as the training data; 2) the ML model can be used to improve predictions of the same RSO at future epochs; and 3) the ML model based on an RSO can be applied to other RSOs that share some common features.
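The general pattern of pairing a physics-based predictor with a learned error correction can be sketched on a toy problem. Everything below is an assumption made for illustration: the one-dimensional "physics model", the hidden linear drift, and the least-squares residual fit are hypothetical stand-ins and are not the orbit dynamics or the learning method used in the paper.

```python
import math
import random

rng = random.Random(1)


def physics_model(t):
    """Hypothetical simplified model that omits part of the true dynamics."""
    return 2.0 * t


def truth(t):
    """Hypothetical ground truth: the modeled term plus an unmodeled drift."""
    return 2.0 * t + 0.3 * t


# Historical epochs: noisy observations are available for training.
train_t = [0.1 * i for i in range(1, 51)]
train_obs = [truth(t) + rng.gauss(0.0, 0.05) for t in train_t]

# Learn the physics model's error as residual ~= k * t (least squares through the origin).
residuals = [obs - physics_model(t) for t, obs in zip(train_t, train_obs)]
k = sum(t * r for t, r in zip(train_t, residuals)) / sum(t * t for t in train_t)


def corrected_model(t):
    """Physics prediction plus the learned error correction."""
    return physics_model(t) + k * t


def rmse(model, ts):
    return math.sqrt(sum((model(t) - truth(t)) ** 2 for t in ts) / len(ts))


# Future epochs, outside the training interval.
test_t = [5.0 + 0.1 * i for i in range(1, 21)]
physics_rmse = rmse(physics_model, test_t)
corrected_rmse = rmse(corrected_model, test_t)
print(f"physics only: {physics_rmse:.3f}, with learned correction: {corrected_rmse:.3f}")
```

The toy works only because the unmodeled effect has the same form in the training and test intervals; whether that holds for real RSOs is what the paper's generalization experiments examine.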

artificial intelligence, machine learning, svm model, (16 more...)

doi: 10.1016/j.asr.2018.03.001

1801.04856

Country:

Europe (0.93)
North America > United States > Massachusetts (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Government (0.93)
Energy (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

@machinelearnbot Jan-13-2018, 08:47:19 GMT

Introduction about Logistic Regression Model

Hello guys, we learned about the Linear Regression model in my previous article. Today, in this article, we will learn the basics of Logistic Regression and some tricks for finding the relation between the variables. Do you know what type of variable is used in logistic regression… Don't worry if you don't; let me explain the variables: in simple linear regression there is one dependent and one independent variable, while in multiple linear regression there is more than one independent variable. Understand one thing: if your data is in continuous form, then use only the linear regression model, while on the other hand, if your data is in categorical form (e.g. ...). In this model the data has been coded in binary form.
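To make the binary coding concrete, here is a minimal sketch of logistic regression with one independent variable and a 0/1 dependent variable, fitted by plain gradient descent on the log-loss. The synthetic data, learning rate, and function names are assumptions for this example and do not come from the article.

```python
import math
import random


def sigmoid(z):
    """Map a real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.5, epochs=1000):
    """Fit P(y = 1 | x) = sigmoid(w * x + b) by batch gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in zip(xs, ys):
            err = sigmoid(w * x + b) - y  # gradient of the log-loss w.r.t. the score
            grad_w += err * x
            grad_b += err
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b


# Synthetic data: the dependent variable is coded as 0 or 1.
rng = random.Random(0)
xs = [rng.uniform(-3.0, 3.0) for _ in range(200)]
ys = [1 if x + rng.gauss(0.0, 1.0) > 0.5 else 0 for x in xs]

w, b = fit_logistic(xs, ys)
predicted = [1 if sigmoid(w * x + b) >= 0.5 else 0 for x in xs]
accuracy = sum(p == y for p, y in zip(predicted, ys)) / len(ys)
print(f"w={w:.2f}, b={b:.2f}, training accuracy={accuracy:.2f}")
```

The model outputs a probability, and the 0.5 cut-off used to turn it into a class label is itself a choice that can be tuned.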

artificial intelligence, machine learning, regression, (12 more...)

Genre:

Research Report > New Finding (0.99)
Research Report > Experimental Study (0.99)

Industry: Health & Medicine (0.58)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Fischer, Aurélie, Montuelle, Lucie, Mougeot, Mathilde, Picard, Dominique

Statistical learning for wind power : a modeling and stability study towards forecasting

arXiv.org Machine Learning Jan-12-2018

We focus on wind power modeling using machine learning techniques. We show on real data provided by the wind energy company Maïa Eolis that parametric models, even those closely following the physical equation relating wind production to wind speed, are outperformed by intelligent learning algorithms. In particular, the CART-Bagging algorithm gives very stable and promising results. Besides, as a step towards forecasting, we quantify the impact of using deteriorated wind measures on performance. We also show on this application that the default methodology for selecting a subset of predictors provided in the standard random forest package can be refined, especially when one of the predictors has a major impact.
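For readers unfamiliar with bagging, the following is a minimal sketch of the technique applied to depth-1 regression trees (stumps) on a made-up wind-speed-to-power curve. The cubic toy curve, noise level, and all function names are assumptions for illustration; the paper's CART-Bagging uses full regression trees on real turbine data.

```python
import random


def fit_stump(xs, ys):
    """Fit a depth-1 regression tree: returns (threshold, left_mean, right_mean)."""
    pairs = sorted(zip(xs, ys))
    best = None
    for i in range(1, len(pairs)):
        if pairs[i][0] == pairs[i - 1][0]:
            continue  # cannot split between identical x values
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        lm = sum(left) / len(left)
        rm = sum(right) / len(right)
        sse = sum((y - lm) ** 2 for y in left) + sum((y - rm) ** 2 for y in right)
        if best is None or sse < best[0]:
            best = (sse, (pairs[i - 1][0] + pairs[i][0]) / 2.0, lm, rm)
    if best is None:  # degenerate sample: every x is identical
        mean = sum(ys) / len(ys)
        return (float("inf"), mean, mean)
    return best[1:]


def predict_stump(stump, x):
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean


def fit_bagging(xs, ys, n_estimators=25, seed=0):
    """Train each stump on a bootstrap resample (drawn with replacement)."""
    rng = random.Random(seed)
    n = len(xs)
    stumps = []
    for _ in range(n_estimators):
        idx = [rng.randrange(n) for _ in range(n)]
        stumps.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    return stumps


def predict_bagging(stumps, x):
    """Average the predictions of all stumps."""
    return sum(predict_stump(s, x) for s in stumps) / len(stumps)


# Hypothetical normalized power curve: cubic in wind speed, capped at rated power.
rng = random.Random(42)
speeds = [rng.uniform(0.0, 20.0) for _ in range(100)]
power = [min(1.0, (v / 12.0) ** 3) + rng.gauss(0.0, 0.05) for v in speeds]

stumps = fit_bagging(speeds, power)
print(predict_bagging(stumps, 2.0), predict_bagging(stumps, 18.0))
```

Averaging over bootstrap resamples smooths out the instability of any single tree, which is consistent with the stability the abstract reports for CART-Bagging.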

artificial intelligence, machine learning, turbine, (17 more...)

doi: 10.1002/we.2139

1610.01

Country: Europe > France (0.16)

Genre: Research Report > New Finding (0.48)

Industry: Energy > Renewable > Wind (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.72)

@machinelearnbot Jan-11-2018, 05:34:29 GMT

The 10 Statistical Techniques Data Scientists Need to Master

Regardless of where you stand on the matter of Data Science sexiness, it's simply impossible to ignore the continuing importance of data, and our ability to analyze, organize, and contextualize it. Drawing on their vast stores of employment data and employee feedback, Glassdoor ranked Data Scientist #1 in their 25 Best Jobs in America list. So the role is here to stay, but unquestionably, the specifics of what a Data Scientist does will evolve. With technologies like Machine Learning becoming ever more commonplace, and emerging fields like Deep Learning gaining significant traction amongst researchers and engineers -- and the companies that hire them -- Data Scientists continue to ride the crest of an incredible wave of innovation and technological progress. While having a strong coding ability is important, data science isn't all about software engineering (in fact, have a good familiarity with Python and you're good to go). Data scientists live at the intersection of coding, statistics, and critical thinking.

artificial intelligence, independent variable, machine learning, (11 more...)

Genre: Research Report (0.39)

Industry: Education (0.48)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.76)

@machinelearnbot Jan-10-2018, 21:28:34 GMT

Logistic Regression, Decision Tree and Neural Network in R

In this course, we cover two analytics techniques: descriptive statistics and predictive analytics. For predictive analytics, our main focus is the implementation of a logistic regression model, a decision tree, and a neural network. We will also see how to interpret our results, compute the prediction accuracy rate, and then construct a confusion matrix. By the end of this course, you will be able to effectively summarize your data, visualize your data, detect and eliminate missing values, predict future outcomes using the analytical techniques described above, construct a confusion matrix, and import and export data.
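The accuracy rate and confusion matrix mentioned above can be computed in a few lines. The course itself works in R; this sketch uses Python, and the label vectors and function name are made up for illustration.

```python
def confusion_matrix(actual, predicted):
    """Count true/false positives and negatives for binary labels coded 0/1."""
    counts = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for a, p in zip(actual, predicted):
        if a == 1 and p == 1:
            counts["tp"] += 1
        elif a == 0 and p == 1:
            counts["fp"] += 1
        elif a == 1 and p == 0:
            counts["fn"] += 1
        else:
            counts["tn"] += 1
    return counts


# Hypothetical true labels and classifier outputs.
actual = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0]

cm = confusion_matrix(actual, predicted)
accuracy = (cm["tp"] + cm["tn"]) / len(actual)
print(cm)        # {'tp': 3, 'fp': 1, 'fn': 1, 'tn': 3}
print(accuracy)  # 0.75
```

Accuracy alone hides which kind of error dominates; the four counts show whether a classifier misses positives or raises false alarms.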

artificial intelligence, decision tree learning, machine learning, (3 more...)

Genre:

Research Report > New Finding (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.40)
Education > Educational Setting > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.71)

@machinelearnbot Jan-10-2018, 21:28:18 GMT

Regularization in Machine Learning

One of the major aspects of training your machine learning model is avoiding overfitting. The model will have low accuracy on unseen data if it is overfitting. This happens because your model is trying too hard to capture the noise in your training dataset. By noise we mean the data points that don't really represent the true properties of your data, but random chance. Learning such data points makes your model more flexible, at the risk of overfitting.
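One common way to curb this flexibility is to penalize large coefficients. The sketch below uses an L2 (ridge) penalty on a single-coefficient model with no intercept, where the solution has a simple closed form. The choice of ridge, the data, and the function name are assumptions for this example; the summary above does not name a specific technique.

```python
def ridge_slope(xs, ys, lam):
    """Minimize sum((y - w*x)^2) + lam * w^2; closed form w = sum(x*y) / (sum(x^2) + lam)."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)


# Hypothetical data lying roughly on y = 2x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.1, 3.9, 6.2, 7.8]

# Larger penalties shrink the coefficient toward zero.
slopes = {lam: ridge_slope(xs, ys, lam) for lam in (0.0, 10.0, 100.0)}
for lam, w in slopes.items():
    print(f"lambda={lam:6.1f}  w={w:.4f}")
```

With `lam = 0` this reduces to ordinary least squares; increasing `lam` trades a little bias for lower variance, which is the mechanism by which regularization limits overfitting.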

artificial intelligence, coefficient, machine learning, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.33)

Gorodetsky, Alex A., Jakeman, John D.

Gradient-based Optimization for Regression in the Functional Tensor-Train Format

arXiv.org Machine Learning Jan-10-2018

We consider the task of low-multilinear-rank functional regression, i.e., learning a low-rank parametric representation of functions from scattered real-valued data. Our first contribution is the development and analysis of an efficient gradient computation that enables gradient-based optimization procedures, including stochastic gradient descent and quasi-Newton methods, for learning the parameters of a functional tensor-train (FT). The functional tensor-train uses the tensor-train (TT) representation of low-rank arrays as an ansatz for a class of low-multilinear-rank functions. The FT is represented by a set of matrix-valued functions that contain a set of univariate functions, and the regression task is to learn the parameters of these univariate functions. Our second contribution demonstrates that using nonlinearly parameterized univariate functions, e.g., symmetric kernels with moving centers, within each core can outperform the standard approach of using a linear expansion of basis functions. Our final contributions are new rank adaptation and group-sparsity regularization procedures to minimize overfitting. We use several benchmark problems to demonstrate at least an order of magnitude lower error with gradient-based optimization methods than with standard alternating least squares procedures in the low-sample-number regime. We also demonstrate an order of magnitude reduction in error on a test problem resulting from using nonlinear parameterizations over linear parameterizations. Finally, we compare regression performance with 22 other nonparametric and parametric regression methods on 10 real-world data sets. We achieve top-five accuracy for seven of the data sets and best accuracy for two of the data sets. These rankings are the best amongst parametric models and competitive with the best non-parametric methods.

artificial intelligence, machine learning, univariate function, (19 more...)

1801.00885

Country:

North America > United States > New York (0.28)
North America > United States > Michigan (0.28)

Genre: Research Report > New Finding (0.93)

Industry:

Energy (1.00)
Government > Regional Government > North America Government > United States Government (0.93)
Aerospace & Defense (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)