Goto

Collaborating Authors

 Regression


Statistical inference in massive datasets by empirical likelihood

arXiv.org Machine Learning

With the rapid development of science and technologies, massive data can be collected at a large speed, especially in internet and financial fields. It is generally recognized that two major challenges in large-scale learning are estimation and inference due to large amount of computation. For statistical inference on massive data sets, Kleiner et al. (2014) proposed the bag of little bootstrap (BLB) to assess the quality of estimators. However, they used only a small number of random subsets, and partial observations from each subset. This implies less efficiency in application.


Automatic Grading of Knee Osteoarthritis on the Kellgren-Lawrence Scale from Radiographs Using Convolutional Neural Networks

arXiv.org Machine Learning

The severity of knee osteoarthritis is graded using the 5-point Kellgren-Lawrence (KL) scale where healthy knees are assigned grade 0, and the subsequent grades 1-4 represent increasing severity of the affliction. Although several methods have been proposed in recent years to develop models that can automatically predict the KL grade from a given radiograph, most models have been developed and evaluated on datasets not sourced from India. These models fail to perform well on the radiographs of Indian patients. In this paper, we propose a novel method using convolutional neural networks to automatically grade knee radiographs on the KL scale. Our method works in two connected stages: in the first stage, an object detection model segments individual knees from the rest of the image; in the second stage, a regression model automatically grades each knee separately on the KL scale. We train our model using the publicly available Osteoarthritis Initiative (OAI) dataset and demonstrate that fine-tuning the model before evaluating it on a dataset from a private hospital significantly improves the mean absolute error from 1.09 (95% CI: 1.03-1.15) to 0.28 (95% CI: 0.25-0.32). Additionally, we compare classification and regression models built for the same task and demonstrate that regression outperforms classification.


Machine learning algorithms in Python Algorithmia Blog

#artificialintelligence

Python is one of the most commonly used programming languages by data scientists and machine learning engineers. Although there has been no universal study on the prevalence of machine learning algorithms in Python in machine learning, a 2019 GitHub analysis of public repositories tagged as "machine-learning" not surprisingly found that Python was the most common language used. Python outranked other languages commonly used in the data science community including R, Scala, and Julia. This is all to say that if you're interested in being a data scientist or a machine learning engineer, then understanding Python should be on your to-do list. It's important to remember though that employing machine learning techniques involves more than just coding for coding's sake.


Machine learning algorithms in Python Algorithmia Blog

#artificialintelligence

Python is one of the most commonly used programming languages by data scientists and machine learning engineers. Although there has been no universal study on the prevalence of machine learning algorithms in Python in machine learning, a 2019 GitHub analysis of public repositories tagged as "machine-learning" not surprisingly found that Python was the most common language used. Python outranked other languages commonly used in the data science community including R, Scala, and Julia. This is all to say that if you're interested in being a data scientist or a machine learning engineer, then understanding Python should be on your to-do list. It's important to remember though that employing machine learning techniques involves more than just coding for coding's sake.


What is Logistic Regression? An introduction for everyone 23

#artificialintelligence

Artificial Intelligence terms explained in a minute for everyone! This week's term is Logistic Regression. Ask any questions or remarks you have in the comments, I will gladly answer to everything! Subscribe to not miss any AI news and terms explained! Facebook: https://www.facebook.com/whats.artifi... Share this to someone who needs to learn more about Artificial Intelligence!


Contrastive Examples for Addressing the Tyranny of the Majority

arXiv.org Machine Learning

Computer vision algorithms, e.g. for face recognition, favour groups of individuals that are better represented in the training data. This happens because of the generalization that classifiers have to make. It is simpler to fit the majority groups as this fit is more important to overall error. We propose to create a balanced training dataset, consisting of the original dataset plus new data points in which the group memberships are intervened, minorities become majorities and vice versa. We show that current generative adversarial networks are a powerful tool for learning these data points, called contrastive examples. We experiment with the equalized odds bias measure on tabular data as well as image data (CelebA and Diversity in Faces datasets). Contrastive examples allow us to expose correlations between group membership and other seemingly neutral features. Whenever a causal graph is available, we can put those contrastive examples in the perspective of counterfactuals.


Measurement Error in Nutritional Epidemiology: A Survey

arXiv.org Machine Learning

This article reviews bias-correction models for measurement error of exposure variables in the field of nutritional epidemiology. Measurement error usually attenuates estimated slope towards zero. Due to the influence of measurement error, inference of parameter estimate is conservative and confidence interval of the slope parameter is too narrow. Bias-correction in estimators and confidence intervals are of primary interest. We review the following bias-correction models: regression calibration methods, likelihood based models, missing data models, simulation based methods, nonparametric models and sampling based procedures.


How to Use One-vs-Rest and One-vs-One for Multi-Class Classification

#artificialintelligence

Not all classification predictive models support multi-class classification. Algorithms such as the Perceptron, Logistic Regression, and Support Vector Machines were designed for binary classification and do not natively support classification tasks with more than two classes. One approach for using binary classification algorithms for multi-classification problems is to split the multi-class classification dataset into multiple binary classification datasets and fit a binary classification model on each. Two different examples of this approach are the One-vs-Rest and One-vs-One strategies. In this tutorial, you will discover One-vs-Rest and One-vs-One strategies for multi-class classification.


Local Model Feature Transformations

arXiv.org Machine Learning

Local learning methods are a popular class of machine learning algorithms. The basic idea for the entire cadre is to choose some non-local model family, to train many of them on small sections of neighboring data, and then to `stitch' the resulting models together in some way. Due to the limits of constraining a training dataset to a small neighborhood, research on locally-learned models has largely been restricted to simple model families. Also, since simple model families have no complex structure by design, this has limited use of the individual local models to predictive tasks. We hypothesize that, using a sufficiently complex local model family, various properties of the individual local models, such as their learned parameters, can be used as features for further learning. This dissertation improves upon the current state of research and works toward establishing this hypothesis by investigating algorithms for localization of more complex model families and by studying their applications beyond predictions as a feature extraction mechanism. We summarize this generic technique of using local models as a feature extraction step with the term ``local model feature transformations.'' In this document, we extend the local modeling paradigm to Gaussian processes, orthogonal quadric models and word embedding models, and extend the existing theory for localized linear classifiers. We then demonstrate applications of local model feature transformations to epileptic event classification from EEG readings, activity monitoring via chest accelerometry, 3D surface reconstruction, 3D point cloud segmentation, handwritten digit classification and event detection from Twitter feeds.


Data Analytics Learning Path

#artificialintelligence

Free Course - Data Analytics Learning Path 2020 A comprehensive way to master Data Analytics now at the tip of your fingers. This path includes hand-picked courses which will help you master Data Analytics with ease. It include courses on Microsoft Excel, SQL for Data Analytics, building regression models in Python, and classification models in Python. Instructor driven project Enroll Now - Data Analytics Learning Path Microsoft Excel Masterclass: Complete and Concise Course Instructor: Start Tech Academy This online tutorial teaches you complete MS Excel from the scratch covering all the essential topics such as Pivots, Macros and Analytics. Enroll now! 3 Hours 17 Lectures 2 SQL Masterclass: SQL for Data Analytics Course Instructor: Start Tech Academy Learning SQL for Data Analytics is now easy with this online tutorial.