Hopsworks 3.0: The Python-Centric Feature Store

Aug-11-2022, 10:06:22 GMT–#artificialintelligence

Feature stores began in the world of Big Data, with Spark being the feature engineering platform for Michelangelo (the first feature store) and Hopsworks (the first open-source feature store). Nowadays, the modern data stack has assumed the role of Spark for feature stores - feature engineering code can be written that seamlessly scales to large data volumes in Snowflake, BigQuery, or Redshift. However, Python developers know that feature engineering is so much more than the aggregations and data validation you can do in SQL and DBT. Dimensionality reduction, whether using PCA or Embeddings, and transformations are fundamental steps in feature engineering that are not available in SQL, even with UDFs (user-defined functions), today. Over the last few years, we have had an increasing number of customers who prefer working with Python for feature engineering.

feature group, hopswork, pipeline, (13 more...)

#artificialintelligence

Aug-11-2022, 10:06:22 GMT

News Web Page

Add feedback

Technology:
- Information Technology
  - Data Science (1.00)
  - Artificial Intelligence > Machine Learning (0.96)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found