Regression
Transaction Confirmation Time Prediction in Ethereum Blockchain Using Machine Learning
Singh, Harsh Jot, Hafid, Abdelhakim Senhaji
Blockchain offers a decentralized, immutable, transparent system of records. It offers a peer-to-peer network of nodes with no centralised governing entity making it unhackable and therefore, more secure than the traditional paper-based or centralised system of records like banks etc. While there are certain advantages to the paper-based recording approach, it does not work well with digital relationships where the data is in constant flux. Unlike traditional channels, governed by centralized entities, blockchain offers its users a certain level of anonymity by providing capabilities to interact without disclosing their personal identities and allows them to build trust without a third-party governing entity. Due to the aforementioned characteristics of blockchain, more and more users around the globe are inclined towards making a digital transaction via blockchain than via rudimentary channels. Therefore, there is a dire need for us to gain insight on how these transactions are processed by the blockchain and how much time it may take for a peer to confirm a transaction and add it to the blockchain network. This paper presents a novel approach that would allow one to estimate the time, in block time or otherwise, it would take for a mining node to accept and confirm a transaction to a block using machine learning. The paper also aims to compare the predictive accuracy of two machine learning regression models- Random Forest Regressor and Multilayer Perceptron against previously proposed statistical regression model under a set evaluation criterion. The objective is to determine whether machine learning offers a more accurate predictive model than conventional statistical models. The proposed model results in improved accuracy in prediction.
What is Logistic Regression? Types of Logistic Regression
In Previous topic we came across the first most machine learning algorithm which is Linear Regression. Now it's learn about one of the linear algorithm in this section. Logistic Regression is used to solve the classification problems, so it's called as Classification Algorithm that models the probability of output class. Consider a scenario where we need to classify whether a patient has diabetes or not. If we use linear regression for this problem, there is a need for setting up a threshold based on which classification can be done.
Lung Cancer Detection and Classification based on Image Processing and Statistical Learning
Hasan, Md Rashidul, Kabir, Muntasir Al
Lung cancer is one of the death threatening diseases among human beings. Early and accurate detection of lung cancer can increase the survival rate from lung cancer. Computed Tomography (CT) images are commonly used for detecting the lung cancer.Using a data set of thousands of high-resolution lung scans collected from Kaggle competition [1], we will develop algorithms that accurately determine in the lungs are cancerous or not. The proposed system promises better result than the existing systems, which would be beneficial for the radiologist for the accurate and early detection of cancer. The method has been tested on 198 slices of CT images of various stages of cancer obtained from Kaggle dataset[1] and is found satisfactory results. The accuracy of the proposed method in this dataset is 72.2%
Differentially Private Federated Variational Inference
Sharma, Mrinank, Hutchinson, Michael, Swaroop, Siddharth, Honkela, Antti, Turner, Richard E.
In many real-world applications of machine learning, data are distributed across many clients and cannot leave the devices they are stored on. Furthermore, each client's data, computational resources and communication constraints may be very different. This setting is known as federated learning, in which privacy is a key concern. Differential privacy is commonly used to provide mathematical privacy guarantees. This work, to the best of our knowledge, is the first to consider federated, differentially private, Bayesian learning. We build on Partitioned Variational Inference (PVI) which was recently developed to support approximate Bayesian inference in the federated setting. We modify the client-side optimisation of PVI to provide an (${\epsilon}$, ${\delta}$)-DP guarantee. We show that it is possible to learn moderately private logistic regression models in the federated setting that achieve similar performance to models trained non-privately on centralised data.
Reading The Markets -- Machine Learning Versus The Financial News
Suffice it to say that they are a form of non-linear regression tool whose underlying design found inspiration in a simplification of the basic architecture of the human brain. Many of the great advances that we have experienced in Machine Learning over the last few years make use of neural networks. The basic algorithm has been around for decades -- but it has come into its own as processing power and data availability have steadily increased. For this project we implemented our neural network in Python using the popular TensorFlow library from Google. The characteristics of our neural network, and in particular its complexity, were chosen to balance precision and generalization.
Data Science and Machine Learning
What is Confusion Matrix and Advanced Classification Metrics? After data preparation and model training, there is model evaluat... Root Mean Square Error (RMSE) How to calculate Root-Mean-Square Error? This post will cover most common ways to evaluate the regression model. The idea of reg... Machine Learning Process Simple Picture of Machine Learning Modelling Process Learning machine is computer algorithm to search patterns in massive data.... Regression Techniques Regression Techniques By Their Machine Learning Families Several Machine Learning (ML) algorithms and families are out t... Research Paper on Machine Learning Research Papers on Classifiers and Regression Models In this article, I am going to write on two most important research papers... It's Easy to Learn MapReduce process In this article, I have tried to cover MapReduce process by explaining Map and Reduce cycl... Machine Learning Overview Machine Learning Overview For easy understanding of ML overview, this post shows the cheat sheet of types of ML with some algorith... What is Confusion Matrix and Advanced Classification Metrics? After data preparation and model training, there is model evaluat... What is Confusion Matrix and Advanced Classification Metrics?
Linear Regression Analysis – Introduction to Machine Learning using Python (Part 1)
Learn how to run Multiple Linear Regression Analysis using Python from scratch! Learn the process of Machine Learning and all the tasks / steps you must undertake. Then, learn how to apply them using some dummy data and a Multiple Linear Regression Algorithm in Python using scikit (SKLearn) library. How to use our Linear Regression model How to download and install Python through Anaconda: https://youtu.be/__8BK62j-bw
Supervised vs Unsupervised Learning
In machine learning, most tasks can be easily categorized into one of two different classes: supervised learning problems or unsupervised learning problems. In supervised learning, data has labels or classes appended to it, while in the case of unsupervised learning the data is unlabeled. Let's take a close look at why this distinction is important and look at some of the algorithms associated with each type of learning. Most machine learning tasks are in the domain of supervised learning. In supervised learning algorithms, the individual instances/data points in the dataset have a class or label assigned to them.
An easy guide to choose the right Machine Learning algorithm for your task
Well, there is no straightforward and sure-shot answer to this question. The answer depends on many factors like the problem statement and the kind of output you want, type and size of the data, the available computational time, number of features and observations in the data, to name a few. It is usually recommended to gather a good amount of data to get reliable predictions. However, many a time the availability of data is a constraint. So, if the training data is smaller or if the dataset has a fewer number of observations and a higher number of features like genetics or textual data, choose algorithms with high bias/low variance like Linear regression, Naïve Bayes, Linear SVM.
Noise Induces Loss Discrepancy Across Groups for Linear Regression
This loss discrepancy across groups is especially problematic in critical applications that impact people's lives (Berk, 2012; Chouldechova, 2017). Despite the vast literature on removing loss discrepancy (Hardt et al., 2016; Khani et al., 2019; Agarwal et al., 2018; Zafar et al., 2017), the direct removal of loss discrepancy might introduce other problems such as intragroup loss discrepancy (Lipton et al., 2018) and adverse long-term impacts (Liu et al., 2018). Therefore, it is important to understand the source of loss discrepancy. Why do such loss discrepancies exist? The literature generally studies sources of loss discrepancy due to an "information deficiency" of one group--that is, one group has, for example, more noise (Corbett-Davies et al., 2017), lessPreliminary work, under review.