Apache Beam - Create Data Processing Pipelines

May-20-2016, 18:10:51 GMT–@machinelearnbot

At the Data Science Association our members often complain about the major data engineering problem of finding the right tools and programming models to build both robust data processing pipelines and efficient ETL processes for data transformation and integration. Beam (incubating) attempts to solve this problem by providing a unified programming model to create data processing pipelines. The Apache Beam open source project is currently in incubation mode and we invite you to join the community and pitch in to help build. You start by building a program that defines the pipeline using one of the open source Beam SDKs. The pipeline is then executed by one of Beam's supported distributed processing back-ends, which include Apache Flink, Apache Spark, and Google Cloud Dataflow.

artificial intelligence, create data processing pipeline, information fusion, (6 more...)

@machinelearnbot

May-20-2016, 18:10:51 GMT

News Web Page

Add feedback

Industry:
- Information Technology > Software (0.94)

Technology:
- Information Technology
  - Data Science > Data Integration (1.00)
  - Artificial Intelligence > Representation & Reasoning
    - Information Fusion (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found