Prepare data faster with PySpark and Altair code snippets in Amazon SageMaker Data Wrangler
Amazon SageMaker Data Wrangler is a purpose-built data aggregation and preparation tool for machine learning (ML). It provides a visual interface for accessing data, performing exploratory data analysis (EDA), and engineering features. The EDA feature includes built-in chart analyses (such as scatter plots and histograms) and time-saving model analyses such as feature importance, target leakage, and model explainability. The feature engineering capability offers over 300 built-in transforms and can also run custom transformations written for the Python, PySpark, or Spark SQL runtime. For custom visualizations and transforms, Data Wrangler now provides example code snippets covering common types of each.
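To give a rough idea of what a custom transform and a custom visualization might look like, here is a minimal sketch using pandas (the Python runtime) and Altair. It is not one of the snippets that ship with Data Wrangler. The function names, the column names (`fare`, `siblings`, `parents`), and the sample rows are all made up for illustration. Inside Data Wrangler the code would operate on the DataFrame the tool supplies rather than on a hand-built one, so check the product documentation for the exact variable names it expects.

```python
import pandas as pd


def add_fare_per_person(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical custom transform: derive a per-person fare column."""
    out = df.copy()  # leave the input frame untouched
    # family_size counts the passenger plus any relatives travelling with them.
    out["family_size"] = out["siblings"] + out["parents"] + 1
    out["fare_per_person"] = out["fare"] / out["family_size"]
    return out


def fare_histogram(df: pd.DataFrame):
    """Hypothetical custom visualization: binned histogram of the new column."""
    # Imported here so the transform above still runs without Altair installed.
    import altair as alt

    return (
        alt.Chart(df)
        .mark_bar()
        .encode(x=alt.X("fare_per_person:Q", bin=True), y="count()")
    )


# Tiny made-up sample standing in for a dataset loaded in a flow.
sample = pd.DataFrame(
    {"fare": [30.0, 12.0, 80.0], "siblings": [1, 0, 2], "parents": [1, 0, 1]}
)
df = add_fare_per_person(sample)
print(df[["fare", "family_size", "fare_per_person"]])
```

Calling `fare_histogram(df)` would return an Altair chart object for the derived column. The same per-person calculation could be written against a Spark DataFrame instead if the PySpark runtime is preferred.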
Jun-15-2022, 21:27:45 GMT