Regression in Python using Sklearn, XGBoost and PySpark

Dec-7-2021, 01:55:28 GMT–#artificialintelligence

In the above story, we have used a Fitbit dataset. Based on the EDA, it was found that steps taken and calories are somewhat linearly correlated and together they may be indicative of a lower risk for all-cause mortality. More interestingly, among our data there is one dataset which has not been used yet which is a weight and BMI log. These data have a distinct nature since they are not necessarily machine generated, thereafter they serve the purpose of being'labels'. In simple words, users are collecting data regarding their activity using their Fitbit, and once in a while, they log some body information such as weight, fat and BMI. This creates an optimal scenario for a supervised learning problem, where, for example, we could use the Fitbit activity data to predict the BMI of a user.

regression, supervised learning problem, xgboost and pyspark, (8 more...)

#artificialintelligence

Dec-7-2021, 01:55:28 GMT

News Web Page

Add feedback

Industry:
- Education > Focused Education > Special Education (0.31)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (0.53)
  - Ensemble Learning (0.53)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found