Goto

Collaborating Authors

 airbyte


How to Forecast Purchase Orders for Shopify Stores Using Open-Source

#artificialintelligence

Use the open-source integrated machine learning in MindsDB and the open-source data integration platform Airbyte to forecast Shopify store metrics. With the volume of data increasing exponentially, it's critical for businesses focused on e-commerce to leverage that data as quickly and efficiently as possible. Machine learning represents a disruption to increase predictive capabilities and augment human decision making for use cases like price, assortment and supply chain optimization, inventory management, delivery management, and customer support. In this'how-to' guide, we'll provide step-by-step instructions showing you how to simply and inexpensively integrate machine learning into an existing Shopify account using Airbyte, an open-source data integration platform, and MindsDB, an open-source AutoML framework that runs on top of any database. We will assume you already have Airbyte set up via Docker.


ETL Pipelines with Airflow: the Good, the Bad and the Ugly

#artificialintelligence

Airflow is a popular open-source workflow management platform. Many data teams also use Airflow for their ETL pipelines. For example, I've previously used Airflow transfer operators to replicate data between databases, data lakes and data warehouses. I've also used Airflow transformation operators to preprocess data for machine learning algorithms. But is using Airflow for your ETL pipelines a good practice today?


Bootstrap a Modern Data Stack in 5 minutes with Terraform - KDnuggets

#artificialintelligence

Modern Data Stack (MDS) is a stack of technologies that makes a modern data warehouse perform 10–10,000x better than a legacy data warehouse. Ultimately, an MDS saves time, money, and effort. The four pillars of an MDS are a data connector, a cloud data warehouse, a data transformer, and a BI & data exploration tool. Easy integration is made possible with managed and open-source tools that pre-build hundreds of ready-to-use connectors. What used to take a team of data engineers to build and maintain regularly can now be replaced with a tool for simple use cases.


Daily Crunch: Before the pandemic, Expensify made remote work cool and profitable – TechCrunch

#artificialintelligence

To get a roundup of TechCrunch's biggest and most important stories delivered to your inbox every day at 3 p.m. PDT, subscribe here. Welcome to Daily Crunch for May 25, 2021. Whether you are a developer, a startup fanatic or merely someone with wanderlust, we have something for everyone today. Well, except for disappointed investors in Lordstown Motors. They are stuck holding the bag today after the American electric vehicle company announced a pretty awful set of earnings.


Why the Future of ETL Is Not ELT, But EL(T) - KDnuggets

#artificialintelligence

How we store and manage data has completely changed over the last decade. We moved from an ETL world to an ELT world, with companies like Fivetran pushing the trend. However, we don't think it is going to stop there; ELT is a transition in our mind towards EL(T) (with EL decoupled from T). And to understand this, we need to discern the underlying reasons for this trend, as they might show what's in store for the future. This is what we will be doing in this article. Historically, the data pipeline process consisted of extracting, transforming, and loading data into a warehouse or a data lake.