Pentaho adds native Python integration
Aiming to better support machine learning and analytical environments, Pentaho Labs yesterday announced that it has developed a native integration for the Python language through Pentaho Data Integration (PDI). PDI is essentially a portable "data machine" for ETL, which you can deploy as a stand-alone Pentaho cluster or inside a Hadoop cluster through MapReduce or YARN. Will Gorman, vice president of Pentaho Labs at Hitachi subsidiary Pentaho, says the integration means data scientists can now use of the most popular and flexible open-source languages to increase productivity and data governance while supporting predictive analytics and machine learning. He says the integration will also make data science and predictive modeling more accessible to the developer community. "Python is the environment that is growing the fastest from a community perspective," Gorman says.
Apr-25-2016, 14:16:24 GMT
- Technology: