Goto

Collaborating Authors

 python and scikit-learn


Integrating Scikit-learn Machine Learning models into the Microsoft .NET

#artificialintelligence

While being part of a team working on designing and developing a lead scoring system prototype, I faced the challenge of integrating machine learning models into the target environment built around the Microsoft .NET ecosystem. Technically, I implemented the lead scoring predictive model using the Scikit-learn machine learning built-in algorithm for regression, more precisely Logistic Regression. Considering the phases of initial data analysis, data preprocessing, exploratory data analysis (EDA), and the data preparation for the model building itself, I used the Jupyter Notebook environment powered by Anaconda distribution for Python scientific computing. Previously, I have investigated and touched Python within Flask as a micro web framework written in this programming language. However, I aimed to integrate or deploy the machine learning model written in Python into the .NET ecosystem, using the C# programming language and Visual Studio IDE.


Multiple Linear Regression Using Python and Scikit-learn

#artificialintelligence

This article was published as a part of the Data Science Blogathon. If you are on the path of learning data science, then you definitely have an understanding of what machine learning is. In today's digital world everyone knows what Machine Learning is because it was a trending digital technology across the world. Every step towards adaptation of the future world leads by this current technology, and this current technology is led by data scientists like you and me . Here we only discuss machine learning, If you don't know what it is, then we take a brief introduction to it: Machine learning is the study of the algorithms of computers, that improve automatically through experience and by the use of data. This is the simple definition of machine learning, and when we go into deep then we find that there are huge numbers of algorithms that are used in model building.


Naive Bayes Classifier Tutorial in Python and Scikit-Learn

#artificialintelligence

Naive Bayes Classifier is a simple model that's usually used in classification problems. Despite being simple, it has shown very good results, outperforming by far other, more complicated models. This is the second article in a series of two about the Naive Bayes Classifier and it will deal with the implementation of the model in Scikit-Learn with Python. For a detailed overview of the math and the principles behind the model, please check the other article: Naive Bayes Classifier Explained. In the previous article linked above, I introduced a table of some data that we can train our classifier on.


Automate Machine Learning Workflows with Pipelines in Python and scikit-learn

#artificialintelligence

There are standard workflows in a machine learning project that can be automated. In Python scikit-learn, Pipelines help to to clearly define and automate these workflows. In this post you will discover Pipelines in scikit-learn and how you can automate common machine learning workflows. Automate Machine Learning Workflows with Pipelines in Python and scikit-learn Photo by Brian Cantoni, some rights reserved. There are standard workflows in applied machine learning.


How to perform Affinity Propagation with Python in Scikit? โ€“ MachineCurve

#artificialintelligence

Say you've got a dataset where there exist relationships between individual samples, and your goal is to identify groups of related samples within the dataset. Clustering, which is part of the class of unsupervised machine learning algorithms, is then the way to go. But what clustering algorithm to apply when you do not really know the number of clusters? Enter Affinity Propagation, a gossip-style algorithm which derives the number of clusters by mimicing social group formation by passing messages about the popularity of individual samples as to whether they're part of a certain group, or even if they are the leader of one. This algorithm, which can estimate the number of clusters/groups in your dataset itself, is the topic of today's blog post.


An Introduction to Random Forest with Python and scikit-learn

#artificialintelligence

NOTE: This post assumes basic understanding of decision trees. If you need to refresh how Decision Trees work, I recommend you to first read An Introduction to Decision Trees with Python and scikit-learn. The good thing about Random Forest is that if we understand Decision Trees very well, it should be very easy to understand Random Forest as well. The name Random Forest actually describes pretty well the extra features added. Firstly, we now have something that is random, which I'll explain more in depth.


Learn classification algorithms using Python and scikit-learn

#artificialintelligence

This tutorial is part of the Machine learning for developers learning path. In this tutorial, we describe the basics of solving a classification-based machine learning problem, and give you a comparative study of some of the current most popular algorithms. In the open Notebook, click Run to run the cells one at a time. The rest of the tutorial follows the order of the Notebook. Classification is when the feature to be predicted contains categories of values.


Machine Learning with Python and scikit-Learn: 3-in-1

@machinelearnbot

As the amount of data continues to grow at an almost incomprehensible rate, being able to understand and process data is becoming a key differentiator for IT professionals and data-scientists. The scikit-learn library is one of the most popular platforms for everyday Machine Learning and data science because it is built upon Python, a fully featured programming language. This comprehensive 3-in-1 course is your one-stop solution to everything that matters in mastering machine learning algorithms and their implementation. Develop pipelines and process data through manipulation, extraction, and data-cleansing techniques. Learn clean coding techniques which are applicable to any scalable Machine Learning projects.


Natural Language Identification Machine Learning Pipeline with Python and Scikit-Learn

#artificialintelligence

Graduate Project for Harvard's Python for Data Science (CSCI E - 29) In this project, I pulled text data from European Parliament Proceedings in 21 languages. Using Scikit-Learn, I transformed the raw text into a numerical feature matrix, and trained a Multinomial naive bayes probability model to classify input language with greater than 99% accuracy.


Implementing Machine Learning Using Python and Scikit-learn

#artificialintelligence

For machine learning, you can also use these libraries to build learning models. However, doing so requires that you have a strong appreciation of the mathematical foundation for the various machine learning algorithms.