AITopics

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.87)

#artificialintelligenceJul-18-2018, 07:06:22 GMT

The Random Forest Algorithm – Towards Data Science

Random Forest is a flexible, easy to use machine learning algorithm that produces, even without hyper-parameter tuning, a great result most of the time. It is also one of the most used algorithms, because it's simplicity and the fact that it can be used for both classification and regression tasks. In this post, you are going to learn, how the random forest algorithm works and several other important things about it. Random Forest is a supervised learning algorithm. Like you can already see from it's name, it creates a forest and makes it somehow random.

artificial intelligence, machine learning, random forest, (16 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

#artificialintelligenceJul-14-2018, 13:12:05 GMT

Random Forest Tutorials - The Bagging Algorithm - Tutorial 2 statinfer

Bagging Bootstrapping The Bagging Algorithm Why Bagging Works LAB: Bagging Models Data scientist is called as the sexiest job of the 21st century. They take an enormous mass of messy data points (unstructured and structured) and use their formidable skills in math, statistics, and programming to clean, massage and organize. But worry not we are here to the rescue and teach you how to be a data scientist, more importantly, upgrade your analytic skills to tackle any problem in the field of data science. Join us on "statinfer.com" for becoming a "scientist in data science" Our "Machine Learning" course is now available on Udemy https://www.udemy.com/machine-learnin... Facebook link:- (Visit our facebook page we are sharing data science videos) https://www.facebook.com/aboutanalytics/ Visit our official website to go deeper into data science topics.

artificial intelligence, bagging algorithm, machine learning, (7 more...)

Genre:

Instructional Material > Online (0.31)
Instructional Material > Course Syllabus & Notes (0.31)

Industry: Education (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.40)

Oprescu, Miruna, Syrgkanis, Vasilis, Wu, Zhiwei Steven

Orthogonal Random Forest for Heterogeneous Treatment Effect Estimation

arXiv.org Machine LearningJul-12-2018

We study the problem of estimating heterogeneous treatment effects from observational data, where the treatment policy on the collected data was determined by potentially many confounding observable variables. We propose orthogonal random forest, an algorithm that combines orthogonalization, a technique that effectively removes the confounding effect in two-stage estimation, with generalized random forests [Athey et al., 2017], a flexible method for estimating treatment effect heterogeneity. We prove a consistency rate result of our estimator in the partially linear regression model, and en route we provide a consistency analysis for a general framework of performing generalized method of moments (GMM) estimation. We also provide a comprehensive empirical evaluation of our algorithms, and show that they consistently outperform baseline approaches.

artificial intelligence, experiment, machine learning, (15 more...)

1806.03467

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.50)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

arXiv.org Machine LearningJul-11-2018

Two-Layer Mixture Network Ensemble for Apparel Attributes Classification

Han, Tianqi, Fu, Zhihui, Li, Hongyu

Recognizing apparel attributes has recently drawn great interest in the computer vision community. Methods based on various deep neural networks have been proposed for image classification, which could be applied to apparel attributes recognition. An interesting problem raised is how to ensemble these methods to further improve the accuracy. In this paper, we propose a two-layer mixture framework for ensemble different networks. In the first layer of this framework, two types of ensemble learning methods, bagging and boosting, are separately applied. Different from traditional methods, our bagging process makes use of the whole training set, not random subsets, to train each model in the ensemble, where several differentiated deep networks are used to promote model variance. To avoid the bias of small-scale samples, the second layer only adopts bagging to mix the results obtained with bagging and boosting in the first layer. Experimental results demonstrate that the proposed mixture framework outperforms any individual network model or either independent ensemble method in apparel attributes classification.

artificial intelligence, machine learning, predictor, (16 more...)

1807.10572

Country: Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report > New Finding (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Thomas, Janek, Coors, Stefan, Bischl, Bernd

Automatic Gradient Boosting

arXiv.org Machine LearningJul-10-2018

Automatic machine learning performs predictive modeling with high performing machine learning tools without human interference. This is achieved by making machine learning applications parameter-free, i.e. only a dataset is provided while the complete model selection and model building process is handled internally through (often meta) optimization. Projects like Auto-WEKA and auto-sklearn aim to solve the Combined Algorithm Selection and Hyperparameter optimization (CASH) problem resulting in huge configuration spaces. However, for most real-world applications, the optimization over only a few different key learning algorithms can not only be sufficient, but also potentially beneficial. The latter becomes apparent when one considers that models have to be validated, explained, deployed and maintained. Here, less complex model are often preferred, for validation or efficiency reasons, or even a strict requirement. Automatic gradient boosting simplifies this idea one step further, using only gradient boosting as a single learning algorithm in combination with model-based hyperparameter tuning, threshold optimization and encoding of categorical features. We introduce this general framework as well as a concrete implementation called autoxgboost. It is compared to current AutoML projects on 16 datasets and despite its simplicity is able to achieve comparable results on about half of the datasets as well as performing best on two.

artificial intelligence, machine learning, optimization, (18 more...)

1807.03873

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

#artificialintelligenceJul-7-2018, 00:06:36 GMT

How to explain gradient boosting

As Ben Gorman points out in A Kaggle Master Explains Gradient Boosting, "This is the part that gets butchered by a lot of gradient boosting explanations." His blog post does a good job of explaining it, but we give our own perspective here.

artificial intelligence, gradient, machine learning

Technology: Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

#artificialintelligenceJul-4-2018, 15:56:18 GMT

Predict Customer Churn with Gradient Boosting

Customer churn is a key predictor of the long term success or failure of a business. But when it comes to all this data, what's the best model to use? This post shows that gradient boosting is the most accurate way of predicting customer attrition. I'll show you how you can create your own data analysis using gradient boosting to identify and save those at risk customers! Customer retention should be a top priority of any business as acquiring new customers is often far more expensive that keeping existing ones.

artificial intelligence, customer, machine learning, (11 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

arXiv.org Machine LearningJun-28-2018

XGBoost: Scalable GPU Accelerated Learning

Mitchell, Rory, Adinets, Andrey, Rao, Thejaswi, Frank, Eibe

We describe the multi-GPU gradient boosting algorithm implemented in the XGBoost library (https://github.com/dmlc/xgboost). Our algorithm allows fast, scalable training on multi-GPU systems with all of the features of the XGBoost library. We employ data compression techniques to minimise the usage of scarce GPU memory while still allowing highly efficient implementation. Using our algorithm we show that it is possible to process 115 million training instances in under three minutes on a publicly available cloud computing instance. The algorithm is implemented using end-to-end GPU parallelism, with prediction, gradient calculation, feature quantisation, decision tree construction and evaluation phases all computed on device.

artificial intelligence, inductive learning, machine learning, (16 more...)

1806.11248

Country:

Oceania > New Zealand > North Island > Waikato (0.05)
North America > United States > Colorado > Larimer County > Fort Collins (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.95)

arXiv.org Machine LearningJun-28-2018

Quit When You Can: Efficient Evaluation of Ensembles with Ordering Optimization

Wang, Serena, Gupta, Maya, You, Seungil

Given a classifier ensemble and a set of examples to be classified, many examples may be confidently and accurately classified after only a subset of the base models in the ensemble are evaluated. This can reduce both mean latency and CPU while maintaining the high accuracy of the original ensemble. To achieve such gains, we propose jointly optimizing a fixed evaluation order of the base models and early-stopping thresholds. Our proposed objective is a combinatorial optimization problem, but we provide a greedy algorithm that achieves a 4-approximation of the optimal solution for certain cases. For those cases, this is also the best achievable polynomial time approximation bound unless $P = NP$. Experiments on benchmark and real-world problems show that the proposed Quit When You Can (QWYC) algorithm can speed-up average evaluation time by $2$x--$4$x, and is around $1.5$x faster than prior work. QWYC's joint optimization of ordering and thresholds also performed better in experiments than various fixed orderings, including gradient boosted trees' ordering.

artificial intelligence, base model, machine learning, (15 more...)

1806.11202

Country:

North America > United States > New York (0.04)
Africa > Senegal > Kolda Region > Kolda (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.54)