Class Imbalance in Credit Card Fraud Detection - Part 3 : Undersampling in Spark

Jan-8-2018, 06:58:49 GMT–#artificialintelligence

Whenever we do classification in ML, we often assume that target label is evenly distributed in our dataset. This helps the training algorithm to learn the features as we have enough examples for all the different cases. For example, in learning a spam filter, we should have good amount of data which corresponds to emails which are spam and non spam. This even distribution is not always possible. Let's take an example of fraud detection.

artificial intelligence, credit card fraud detection, machine learning, (8 more...)

#artificialintelligence

Jan-8-2018, 06:58:49 GMT

News Web Page

Add feedback

Industry:
- Law Enforcement & Public Safety > Fraud (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)